CN1764722A

CN1764722A - Cyclic AMP response element activator proteins and uses related thereto

Info

Publication number: CN1764722A
Application number: CN 200480008108
Authority: CN
Inventors: V·尤格恩科; M·A·拉博; 宋川峥; 张文军; 朱健
Original assignee: Novartis AG
Current assignee: Novartis AG
Priority date: 2003-03-26
Filing date: 2004-03-25
Publication date: 2006-04-26

Abstract

The invention discloses newly identified cyclic AMP response element activator proteins (CREAP proteins). It is contemplated herein that said proteins are suitable drug targets for the development of new therapeutics to prevent, treat or ameliorate pathological conditions related to abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines. The invention relates to methods to prevent, treat or ameliorate said pathological conditions and pharmaceutical compositions therefor comprising modulators with inhibitory effect on CREAP protein activity and/or CREAP gene expression. The invention also relates to methods to identify compounds with therapeutic usefulness to treat said pathological conditions, comprising identifying compounds that can inhibit CREAP protein activity and/or CREAP gene expression.

Description

Cyclic AMP response element activator protein and related uses thereof

发明背景Background of the Invention

环-AMP效应元件结合蛋白质(CREB)、激活转录因子1(ATF1)和cAMP效应元件调节子(CREM)是一亚群近相关蛋白质，属于碱性区域亮氨酸拉链(bZIP)转录因子超家族。它们是由多种细胞外刺激物，诸如激素、生长因子、神经肽和神经递质、钙、低氧和氧化应激，所产生的转录控制的中心调节子。主要通过CREB研究已经充分证明，这些蛋白质的激酶-可诱导结构域(KID)内保守丝氨酸残基的磷酸化导致多种靶基因的转录激活，这些靶基因参与细胞生长调节和分化、代谢、生殖和发育、神经元活性调节和免疫调节。所有这些靶基因都具有保守的顺式作用环-AMP效应元件(CRE)，该元件具有TGACGTCA回文序列或者不对称变异，其包括具有核心序列TGAC的CRE半位点(见Mayr B，Montminy M.，Nat RevMol Cell Biol 2001 Aug；2(8)：599-609)。Cyclic-AMP response element binding protein (CREB), activating transcription factor 1 (ATF1), and cAMP response element regulator (CREM) are a subgroup of closely related proteins belonging to the basic domain leucine zipper (bZIP) transcription factor superfamily . They are central regulators of transcriptional control produced by a variety of extracellular stimuli, such as hormones, growth factors, neuropeptides and neurotransmitters, calcium, hypoxia and oxidative stress. It has been well documented, primarily through CREB studies, that phosphorylation of conserved serine residues within the kinase-inducible domain (KID) of these proteins leads to the transcriptional activation of a variety of target genes involved in the regulation of cell growth and differentiation, metabolism, reproduction and development, regulation of neuronal activity, and immune regulation. All of these target genes have a conserved cis-acting loop-AMP response element (CRE) with the TGACGTCA palindromic sequence or an asymmetric variant that includes a CRE half-site with the core sequence TGAC (see Mayr B, Montminy M ., Nat RevMol Cell Biol 2001 Aug;2(8):599-609).

当磷酸化的CREB/CREM/ATF1同-和/或异二聚体通过bZIP结构域结合到CRE位点时发生转录调节，而KID结构域将效应分子诸如265kDCREB结合蛋白CBP或p300和相关Pol II基础转录机器募集至转录起始位点附近。Transcriptional regulation occurs when phosphorylated CREB/CREM/ATF1 homo- and/or heterodimers bind to CRE sites via the bZIP domain, while the KID domain binds effector molecules such as the 265kDCREB-binding protein CBP or p300 and related Pol II The basal transcription machinery is recruited near the transcription start site.

已有大量研究致力于鉴定将细胞刺激与CREB/CREM/ATF1激活作用相连接的分子。这些激活子的复杂性以对用于CREB磷酸化的激酶的研究为例证进行说明。最初，CREB被当作增加cAMP的细胞外刺激物的唯一转录调节子，cAMP又激活用于CREB磷酸化的蛋白激酶A(PKA)。然而，随后的研究揭示CREB蛋白质也由pp90RSK磷酸化以应答生长因子、由MSK-1磷酸化以应答促分裂原和应激、CAMK II/IV磷酸化以应答Ca⁺⁺上升以及AKT磷酸化以应答低氧和存活信号。从这些研究来看很显然，在从多种细胞外刺激物产生适当的细胞输出过程中，在细胞中CREB/CRE/ATF1家族蛋白质的活性调节是非常复杂的以保证特异性和敏感性。Extensive research has been devoted to identifying molecules linking cellular stimulation to CREB/CREM/ATF1 activation. The complexity of these activators is exemplified by the study of kinases for CREB phosphorylation. Originally, CREB was regarded as the sole transcriptional regulator of extracellular stimuli that increased cAMP, which in turn activated protein kinase A (PKA) for CREB phosphorylation. However, subsequent studies revealed that CREB proteins are also phosphorylated by pp90RSK in response to growth factors, MSK-1 in response to mitogens and stress, CAMK II/IV in response to Ca ⁺⁺ rise, and AKT in response to growth factors. Responses to hypoxia and survival signals. From these studies it is clear that the regulation of the activity of CREB/CRE/ATF1 family proteins in cells is highly complex to ensure specificity and sensitivity in generating appropriate cellular output from various extracellular stimuli.

此处我们描述了对大量全长人cDNA克隆(代表11,000到15,000种基因转录物)进行基于基因组-水平细胞功能筛选以获得激活CRE-依赖性基因表达的蛋白质。这些数据指出了几种迄今为止仍未鉴定的CRE激活子，包括KIAA0616，一种先前未知功能的基因并且此处已经重命名为CREAP1。申请人还发现2种更加不同的人类蛋白质在结构和活性方面与CREAP1相似，这两种人类蛋白质在此处命名为CREAP2和CREAP3，以及小鼠和果蝇同系物，全部这些蛋白质都是目前为止未知的调节CRE-依赖性基因表达的基因家族的成员。Here we describe the genome-level cellular function screening of a large number of full-length human cDNA clones (representing 11,000 to 15,000 gene transcripts) for proteins that activate CRE-dependent gene expression. These data point to several hitherto unidentified CRE activators, including KIAA0616, a gene of previously unknown function and here renamed CREAP1. Applicants have also discovered that 2 more distinct human proteins are similar in structure and activity to CREAP1, these two human proteins are named here as CREAP2 and CREAP3, and the mouse and Drosophila homologues, all of which are so far Member of an unknown gene family that regulates CRE-dependent gene expression.

申请人此处还报导了一个令人惊奇的发现，即CREAP1是其它蛋白质，包括烯醇丙酮酸磷酸羧激酶(PEPCK)、双调蛋白和诸如IL-8和Exodus-1/MIPα的趋化因子的有效诱导物。同样，此处可以认为本发明的CREAP蛋白质能够用作新的药物靶标以治疗与启动子区域内含有CRE位点的基因相关的病理状况，以及治疗与PEPCK、双调蛋白和趋化因子特别是IL-8和Exodus-1/MIPα的异常激活相关的疾病。这些疾病包括但不限于骨关节炎、银屑病、气喘、慢性阻塞性肺病(COPD)、类风湿性关节炎、癌症、病理性血管发生、糖尿病、高血压、慢性痛和其它炎症与自身免疫病以及神经变性疾病诸如阿尔茨海默氏病、帕金森氏病和亨廷顿病。Applicants also report here the surprising discovery that CREAP1 is a member of other proteins, including enolpyruvate phosphate carboxykinase (PEPCK), amphiregulin, and chemokines such as IL-8 and Exodus-1/MIPα effective inducer. Likewise, it can be considered herein that the CREAP proteins of the present invention can be used as new drug targets for the treatment of pathological conditions associated with genes containing CRE sites within their promoter regions, as well as for the treatment of diseases associated with PEPCK, amphiregulin and chemokines, especially Diseases associated with abnormal activation of IL-8 and Exodus-1/MIPα. These diseases include but are not limited to osteoarthritis, psoriasis, asthma, chronic obstructive pulmonary disease (COPD), rheumatoid arthritis, cancer, pathological angiogenesis, diabetes, hypertension, chronic pain and other inflammatory and autoimmune and neurodegenerative diseases such as Alzheimer's, Parkinson's and Huntington's diseases.

本发明还提供了鉴定抑制或增强CREAP活性和/或抑制或增强CREAP基因表达的调节子的方法以及此类调节子用于治疗人类和兽类患者中这些疾病的用途。本发明还提供了包含所述调节子的药物组合物。The present invention also provides methods of identifying modulators that inhibit or enhance CREAP activity and/or inhibit or enhance CREAP gene expression and the use of such modulators for the treatment of these diseases in human and veterinary patients. The present invention also provides a pharmaceutical composition comprising the modulator.

发明简述Brief description of the invention

本申请涉及此处称为CREAP的蛋白质新家族的发现，这些蛋白质是CRE-依赖性转录的激活子和趋化因子的诱导物。同样，此处可以认为该蛋白质家族的成员是研发新治疗剂的适当靶标以预防、治疗或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活有关的病理状况，这些状况包括但不限于骨关节炎、银屑病、气喘、COPD、类风湿性关节炎、癌症、病理性血管发生、糖尿病、高血压、慢性痛以及其它炎症和自身免疫病。此外，由于CREB功能损失已经表明与学习和神经变性方面的缺陷有关，CREAP蛋白质的激动剂可以用于预防、治疗或缓解神经变性疾病如阿尔茨海默氏病、帕金森氏病和亨廷顿病。因此，一方面本发明涉及鉴定用于预防、治疗或缓解所述疾病的调节子的方法，其包括：a)体外、先体外后体内或体内测定候选调节子抑制或增强CREAP蛋白质的活性和/或抑制或增强CREAP蛋白质表达的能力，并且可以进一步包括b)测定所鉴定的CREAP调节子逆转在所述病理状况的体内、先体外后体内或体外模型中和/或对处于所述病理状况的患者的临床研究中所观察到的病理效应的能力。The present application relates to the discovery of a new family of proteins referred to herein as CREAPs, which are activators of CRE-dependent transcription and inducers of chemokines. Likewise, members of this protein family are considered herein to be appropriate targets for the development of new therapeutic agents to prevent, treat or alleviate pathological conditions associated with aberrant activation of CRE-dependent gene expression or aberrant activation of chemokines, including but not Limited to osteoarthritis, psoriasis, asthma, COPD, rheumatoid arthritis, cancer, pathological angiogenesis, diabetes, hypertension, chronic pain, and other inflammatory and autoimmune diseases. Furthermore, since loss of CREB function has been shown to be associated with deficits in learning and neurodegeneration, agonists of the CREAP protein could be used to prevent, treat or alleviate neurodegenerative diseases such as Alzheimer's, Parkinson's and Huntington's diseases. Therefore, in one aspect the present invention relates to a method for identifying a modulator for preventing, treating or alleviating said disease, comprising: a) determining in vitro, ex vivo or in vivo that a candidate modulator inhibits or enhances the activity of a CREAP protein and/or or inhibit or enhance the ability of CREAP protein expression, and may further comprise b) determination of identified CREAP modulator reversal in vivo, ex vivo or in vitro models of said pathological condition and/or on the expression of said pathological condition The ability to observe pathological effects in clinical studies of patients.

另一方面，本发明涉及预防、治疗或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的方法，其包括在患者需要时对其施用有效量的CREAP调节子，其中所述调节子例如可抑制或增强任意一种或多种所述CREAP蛋白质的活性或抑制或增强任意一种或多种CREAP蛋白质的表达，其中所述CREAP蛋白质选自CREAP1、CREAP2和CREAP3。In another aspect, the present invention relates to a method for preventing, treating or alleviating pathological conditions associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, which comprises administering an effective amount of a CREAP regulator to a patient when needed, Wherein the regulator, for example, can inhibit or enhance the activity of any one or more of the CREAP proteins or inhibit or enhance the expression of any one or more of the CREAP proteins, wherein the CREAP proteins are selected from CREAP1, CREAP2 and CREAP3.

在一个实施方案中，调节子包括任意一种或多种选自反义寡核苷酸、三股螺旋DNA、核酶、RNA适体、siRNA和双链或单链RNA的物质，其中所述物质设计为可抑制CREAP蛋白质的表达。在另一个实施方案中，调节子包括CREAP蛋白质的抗体或其片段，其中所述抗体或其片段能够抑制所述CREAP蛋白质的活性。在本发明的另一个实施方案中，调节子包括CREAP蛋白质的肽模拟物，其中所述肽模拟物能够抑制所述CREAP蛋白质的活性。这里认为此处所述的一种或多种调节子可以同时进行施用。In one embodiment, the modulator comprises any one or more substances selected from antisense oligonucleotides, triple-helix DNA, ribozymes, RNA aptamers, siRNA and double-stranded or single-stranded RNA, wherein the substances Designed to inhibit the expression of CREAP protein. In another embodiment, the modulator comprises an antibody to a CREAP protein or a fragment thereof, wherein said antibody or fragment thereof is capable of inhibiting the activity of said CREAP protein. In another embodiment of the invention, the modulator comprises a peptidomimetic of a CREAP protein, wherein said peptidomimetic is capable of inhibiting the activity of said CREAP protein. It is contemplated that one or more modulators described herein may be administered simultaneously.

另一方面，本发明涉及治疗、预防或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的方法，其包括对需要其的患者施用包含有效量CREAP调节子的药物组合物。在一个实施方案中，所述调节子在患者体内抑制或增强CREAP蛋白质的活性或者抑制或增强编码所述蛋白质的基因的表达，其中所述CREAP蛋白质选自CREAP1、CREAP2或CREAP3。在一个实施方案中，调节子包括任意一种或多种选自反义寡核苷酸、三股螺旋DNA、核酶、RNA适体、siRNA和双链或单链RNA的物质，其中所述物质设计为可抑制CREAP蛋白质的表达。在另一个实施方案中，调节子包括CREAP蛋白质或其片段的抗体或肽模拟物，其中所述抗体或肽模拟物能够例如抑制所述CREAP蛋白质的酶活性或其它活性。此处认为一种或多种所述蛋白质的一种或多种调节子可以同时施用。In another aspect, the present invention relates to a method for treating, preventing or alleviating pathological conditions associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, comprising administering to a patient in need thereof a medicament comprising an effective amount of a CREAP modulator combination. In one embodiment, the modulator inhibits or enhances the activity of a CREAP protein or inhibits or enhances the expression of a gene encoding said protein in a patient, wherein said CREAP protein is selected from CREAP1, CREAP2 or CREAP3. In one embodiment, the modulator comprises any one or more substances selected from antisense oligonucleotides, triple-helix DNA, ribozymes, RNA aptamers, siRNA and double-stranded or single-stranded RNA, wherein the substances Designed to inhibit the expression of CREAP protein. In another embodiment, a modulator comprises an antibody or peptidomimetic of a CREAP protein or fragment thereof, wherein said antibody or peptidomimetic is capable of, for example, inhibiting the enzymatic or other activity of said CREAP protein. It is contemplated herein that one or more modulators of one or more of said proteins may be administered simultaneously.

另一方面，本发明涉及包含一种或多种一定量CREAP调节子以在需要其的患者体内有效治疗、预防或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活相关病理状况的药物组合物，其中所述调节子能够抑制或增强CREAP蛋白质的活性和/或抑制或增强编码CREAP蛋白质的表达，其中所述CREAP蛋白质选自CREAP1、CREAP2或CREAP3。在另一个实施方案中，调节子包括任意一种或多种选自反义寡核苷酸、三股螺旋DNA、核酶、RNA适体、siRNA和双链或单链RNA的物质，其中所述物质设计为可抑制CREAP的表达。在另一个实施方案中，调节子包括CREAP蛋白质或其片段的抗体或肽模拟物，其中所述抗体或肽模拟物能够例如抑制所述CREAP蛋白质的酶活性或其它活性。In another aspect, the present invention relates to a method comprising one or more CREAP modulators in an amount effective for treating, preventing or alleviating pathological conditions associated with aberrant activation of CRE-dependent gene expression or aberrant activation of chemokines in a patient in need thereof A pharmaceutical composition, wherein the modulator is capable of inhibiting or enhancing the activity of a CREAP protein and/or inhibiting or enhancing the expression of a protein encoding CREAP, wherein the CREAP protein is selected from CREAP1, CREAP2 or CREAP3. In another embodiment, the modulator includes any one or more substances selected from antisense oligonucleotides, triple-helix DNA, ribozymes, RNA aptamers, siRNA, and double-stranded or single-stranded RNA, wherein the The substance is designed to inhibit the expression of CREAP. In another embodiment, a modulator comprises an antibody or peptidomimetic of a CREAP protein or fragment thereof, wherein said antibody or peptidomimetic is capable of, for example, inhibiting the enzymatic or other activity of said CREAP protein.

另一方面，本发明涉及包含CREAP蛋白质的药物组合物。In another aspect, the present invention relates to a pharmaceutical composition comprising a CREAP protein.

另一方面，本发明涉及治疗、预防或缓解与CRE-依赖性基因表达的异常激活或趋化因子异常激活相关的病理状况的方法，该方法包括对需要其的患者施用包含CREAP蛋白质的药物组合物。In another aspect, the present invention relates to a method of treating, preventing or alleviating pathological conditions associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, the method comprising administering to a patient in need thereof a pharmaceutical combination comprising a CREAP protein things.

另一方面，本发明涉及诊断患有与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的受试者的方法，这些受试者可能是使用CREAP调节子或外源CREAP蛋白质进行治疗的适当的候选者，诊断方法包括检测来自所述受试者的生物学样品中CREAP蛋白质的水平，其中与对照相比具有异常水平的受试者将是用于治疗的适当候选者。In another aspect, the present invention relates to a method of diagnosing a subject suffering from a pathological condition associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, possibly using a CREAP modulator or exogenous CREAP protein for treatment, the diagnostic method comprising detecting the level of CREAP protein in a biological sample from said subject, wherein a subject with abnormal levels compared to a control would be a suitable candidate for treatment .

另一方面，本发明涉及本发明涉及诊断患有与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的受试者的方法，该受试者可能是使用一种或多种CREAP调节子或外源CREAP蛋白质进行治疗的适宜候选者，诊断方法包括测定来自所述受试者的生物学样品中CREAP蛋白质mRNA的水平，其中与对照相比具有异常mRNA水平的受试者将是用于治疗的适当候选者。In another aspect, the present invention relates to a method of diagnosing a subject suffering from a pathological condition associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, the subject may be treated with one or more A suitable candidate for treatment with a CREAP modulator or an exogenous CREAP protein, the diagnostic method comprising determining the level of CREAP protein mRNA in a biological sample from said subject, wherein the subject has an abnormal mRNA level compared to a control would be suitable candidates for treatment.

另一方面，这里还提供了治疗、预防或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的方法，其包括：(a)测定受试者中CREAP mRNA和/或CREAP蛋白质水平和(b)对与对照相比具有异常水平mRNA和/或CREAP蛋白质的受试者施用一定量CREAP调节子或外源CREAP蛋白质以足以治疗、预防或缓解所述病理状况。In another aspect, there is also provided a method for treating, preventing or alleviating pathological conditions related to abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, comprising: (a) measuring CREAP mRNA and/or or CREAP protein levels and (b) administering an amount of CREAP modulator or exogenous CREAP protein to a subject with abnormal levels of mRNA and/or CREAP protein compared to controls sufficient to treat, prevent or alleviate said pathological condition.

本发明另一方面还提供了测定方法和试剂盒，其包括对检测来自患者的生物学样品中编码CREAP蛋白质的多核苷酸的表达或CREAP蛋白质或其片段的水平必要的成分，所述试剂盒包括例如结合CREAP蛋白质或其片段的抗体或肽模拟物，或者与CREAP多核苷酸杂交的多核苷酸探针。在优选的实施方案中，此类试剂盒还包括详细说明了操作过程的使用说明书，通过该操作过程使用试剂盒成分。Another aspect of the present invention also provides assay methods and kits, which include the necessary components for detecting the expression of polynucleotides encoding CREAP proteins or the levels of CREAP proteins or fragments thereof in biological samples from patients, said kits Included are, for example, antibodies or peptidomimetics that bind CREAP protein or fragments thereof, or polynucleotide probes that hybridize to CREAP polynucleotides. In preferred embodiments, such kits also include instructions for use detailing the procedures by which the kit components are used.

本发明还涉及CREAP调节子或外源CREAP蛋白质用于生产治疗、预防或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的药物的用途。优选地是，所述病理状况是自身免疫病或神经变性疾病。在一个实施方案中，所述调节子包括任意一种或多种选自反义寡核苷酸、三股螺旋DNA、核酶、RNA适体、siRNA和双链或单链RNA的物质，其中所述物质设计为可抑制CREAP基因的表达。在另一个实施方案中，所述调节子包括CREAP蛋白质的一种或多种抗体或其片段，其中所述抗体或其片段能够例如抑制CREAP酶活性或其它活性。在另一个实施方案中，所述调节子包括CREAP蛋白质的一种或多种肽模拟物，其中所述模拟物能够例如抑制CREAP酶活性或其它活性。The present invention also relates to the use of CREAP regulator or exogenous CREAP protein for the production of medicines for treating, preventing or alleviating pathological conditions associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines. Preferably, said pathological condition is an autoimmune disease or a neurodegenerative disease. In one embodiment, the regulator includes any one or more substances selected from antisense oligonucleotides, triple-helix DNA, ribozymes, RNA aptamers, siRNA, and double-stranded or single-stranded RNA, wherein The above substances are designed to inhibit the expression of the CREAP gene. In another embodiment, the modulator comprises one or more antibodies or fragments thereof to a CREAP protein, wherein said antibodies or fragments thereof are capable of, for example, inhibiting CREAP enzyme activity or other activity. In another embodiment, the modulator comprises one or more peptidomimetics of a CREAP protein, wherein the mimetic is capable of, for example, inhibiting CREAP enzyme activity or other activity.

本发明还涉及用作药物的外源CREAP蛋白质或CREAP蛋白质的调节子。在一个实施方案中，所述调节子包括任意一种或多种选自反义寡核苷酸、三股螺旋DNA、核酶、RNA适体、siRNA和双链或单链RNA的物质，其中所述物质设计为可抑制CREAP的表达。在另一个实施方案中，所述调节子包括一种或多种针对CREAP的抗体或CREAP的肽模拟物或其片段，其中所述抗体、肽模拟物或其片段能够例如抑制CREAP的酶活性或其它活性。在另一个实施方案中，所述调节子包括CREAP蛋白质的一种或多种肽模拟物，其中所述模拟物能够例如抑制CREAP的酶活性或其它活性。The present invention also relates to exogenous CREAP proteins or modulators of CREAP proteins for use as medicaments. In one embodiment, the regulator includes any one or more substances selected from antisense oligonucleotides, triple-helix DNA, ribozymes, RNA aptamers, siRNA, and double-stranded or single-stranded RNA, wherein Said substances are designed to inhibit the expression of CREAP. In another embodiment, the modulator comprises one or more antibodies to CREAP or peptide mimetics or fragments thereof, wherein the antibodies, peptide mimetics or fragments thereof are capable of, for example, inhibiting the enzymatic activity of CREAP or other activities. In another embodiment, the modulator comprises one or more peptidomimetics of a CREAP protein, wherein the mimetic is capable of, for example, inhibiting the enzymatic or other activity of CREAP.

由于迄今为止还没有公开CREAP2和CREAP3的正确多核苷酸序列，此处可以认为本发明还提供了包含分别显示于SEQ ID NO：16和SEQID NO：25的氨基酸序列的分离的多肽。此外，本发明提供了包括显示于SEQ ID NO：16和SEQ ID NO：25的氨基酸序列及其片段的分离多肽。根据本发明该方面，这里还提供了人类来源的新多肽以及其生物学、诊断学或治疗学上有用的片段、变体、同系物及衍生物、片段的变体和衍生物和上述物质的类似物。Since the correct polynucleotide sequences of CREAP2 and CREAP3 have not been disclosed so far, it is considered herein that the present invention also provides isolated polypeptides comprising the amino acid sequences shown in SEQ ID NO: 16 and SEQ ID NO: 25, respectively. In addition, the present invention provides isolated polypeptides comprising the amino acid sequences shown in SEQ ID NO: 16 and SEQ ID NO: 25 and fragments thereof. According to this aspect of the present invention, there are also provided novel polypeptides of human origin and their biologically, diagnostically or therapeutically useful fragments, variants, homologues and derivatives, variants and derivatives of fragments and combinations of the above-mentioned substances. analog.

本发明还可以得到分离的核酸，这些核酸包含编码此处所公开的CREAP蛋白质，具体而言是CREAP2和CREAP3和其同系物和片段和/或等价物的核苷酸序列，或者基本上与具有如SEQ ID NO：15和SEQ IDNO：24中所显示的核苷酸序列的核酸相似的核酸。在优选的实施方案中，所分离的DNA采用载体分子的形式，其包含至少一部分本发明的DNA，具体而言包含由显示于SEQ ID NO：1、SEQ ID NO.15或SEQ ID NO：24中的核苷酸序列所组成的DNA。The present invention can also obtain isolated nucleic acids comprising the nucleotide sequences encoding the CREAP proteins disclosed herein, specifically CREAP2 and CREAP3 and homologues and fragments and/or equivalents thereof, or substantially identical to those having the sequence of SEQ ID NO: A nucleic acid similar to the nucleic acid of the nucleotide sequence shown in ID NO: 15 and SEQ ID NO: 24. In a preferred embodiment, the isolated DNA is in the form of a vector molecule comprising at least a portion of the DNA of the invention, in particular comprising the DNA shown in SEQ ID NO: 1, SEQ ID NO. DNA composed of nucleotide sequences.

本发明的另一方面提供了产生上述多肽、多肽片段、变体和衍生物、变体和衍生物的片段，以及上述物质的类似物的方法。在本发明该方面优选的实施方案中提供了产生上述CREAP蛋白质的方法，其包括在足以在宿主细胞中表达所述多肽的条件下培养表达其中整合载体的宿主细胞，该表达载体含有编码此种多核苷酸的外源来源核苷酸序列，从而引起所述多肽的表达，并且任选地回收所表达的多肽。Another aspect of the invention provides methods of producing the above-mentioned polypeptides, polypeptide fragments, variants and derivatives, fragments of the variants and derivatives, and analogs of the above-mentioned substances. In a preferred embodiment of this aspect of the present invention, a method for producing the above-mentioned CREAP protein is provided, which includes culturing a host cell expressing a vector integrated therein under conditions sufficient to express the polypeptide in the host cell, the expression vector containing An exogenous source of nucleotide sequences for a polynucleotide, thereby causing expression of said polypeptide, and optionally recovering the expressed polypeptide.

在本发明该方面优选的实施方案中，提供了产生包含或由显示于SEQID NO：2、SEQ ID NO：16或SEQ ID NO：25的氨基酸序列组成的多肽的方法，该方法包括在足以在宿主细胞中表达所述多肽的条件下培养其中整合表达载体的宿主细胞，该表达载体含有编码包含或由显示于SEQ ID NO：2、SEQ ID NO：16或SEQ ID NO：25的氨基酸序列组成的多肽的外源来源核苷酸序列，从而引起产生所表达的多肽，并且任选地回收所表达的多肽。优选地，在任何此类方法中，外源来源核苷酸序列包含或为显示于SEQ IDNO：1中的核苷酸序列、显示于SEQ ID NO：15中的核苷酸序列或显示于SEQ ID NO：24中的核苷酸序列。根据本发明的另一方面，这里提供了利用上述多肽和多核苷酸的产品、组合物或方法以尤其用于研究、生物学、临床和治疗目的。In a preferred embodiment of this aspect of the invention, there is provided a method of producing a polypeptide comprising or consisting of the amino acid sequence shown in SEQ ID NO: 2, SEQ ID NO: 16 or SEQ ID NO: 25, the method comprising sufficient steps in Cultivating the host cell in which the expression vector comprising or consisting of the amino acid sequence shown in SEQ ID NO: 2, SEQ ID NO: 16 or SEQ ID NO: 25 is cultured under conditions in which the polypeptide is expressed in the host cell. An exogenously sourced nucleotide sequence of the polypeptide, thereby causing the production of the expressed polypeptide, and optionally recovering the expressed polypeptide. Preferably, in any such method, the exogenous source nucleotide sequence comprises or is the nucleotide sequence shown in SEQ ID NO: 1, the nucleotide sequence shown in SEQ ID NO: 15 or the nucleotide sequence shown in SEQ ID NO: Nucleotide sequence in ID NO:24. According to another aspect of the present invention, there are provided products, compositions or methods utilizing the polypeptides and polynucleotides described above for, inter alia, research, biological, clinical and therapeutic purposes.

另一方面，本发明提供了能够在体外进行繁殖的宿主细胞，优选地为脊椎动物细胞，尤其是哺乳动物细胞，或细菌细胞，这些细胞一旦在培养物中生长则能够产生包括显示于SEQ ID NO：2、SEQ ID NO：16、SEQ IDNO：25中的氨基酸序列或其片段的多肽，其中细胞含有转录控制DNA序列，优选地为除人类CREAP转录控制序列之外的序列，其中转录控制序列控制编码多肽的DNA的转录，该多肽具有根据SEQ ID NO：2、SEQ IDNO：16、SEQ ID NO：25或其片段的氨基酸序列，包括但不限于包含CREAP蛋白质活性部分或片段的氨基酸序列。In another aspect, the invention provides host cells, preferably vertebrate cells, especially mammalian cells, or bacterial cells capable of propagating in vitro, which cells, once grown in culture, are capable of producing NO: 2, SEQ ID NO: 16, the polypeptide of the amino acid sequence in SEQ ID NO: 25 or a fragment thereof, wherein the cell contains a transcriptional control DNA sequence, preferably a sequence other than the human CREAP transcriptional control sequence, wherein the transcriptional control sequence Controlling the transcription of DNA encoding a polypeptide having an amino acid sequence according to SEQ ID NO: 2, SEQ ID NO: 16, SEQ ID NO: 25 or a fragment thereof, including but not limited to an amino acid sequence comprising an active portion or fragment of a CREAP protein.

另一方面，本发明涉及用于将本发明的核酸导入需要治疗受试者的一种或多种组织的方法，结果为在组织内细胞表达或分泌所述核酸所编码的一种或多种蛋白质。In another aspect, the invention relates to methods for introducing a nucleic acid of the invention into one or more tissues of a subject in need of treatment, with the result that cells within the tissue express or secrete one or more of the nucleic acids encoded by said nucleic acid. protein.

附图说明Description of drawings

图1说明CREAP1是高度保守的蛋白质并含有有效转录激活结构域。显示了人CREAP1和预测的小鼠、河豚和果蝇CREAP1相关基因的氨基酸序列。阴影表示相同和高度保守的氨基酸。框内是保守的潜在PKA磷酸化位点。第一条序列代表人、第二条代表小鼠、第三条代表河豚并且第四条代表果蝇。Figure 1 illustrates that CREAP1 is a highly conserved protein and contains a potent transcriptional activation domain. Amino acid sequences of human CREAP1 and predicted mouse, fugu and Drosophila CREAP1-related genes are shown. Shading indicates identical and highly conserved amino acids. Conserved potential PKA phosphorylation sites are boxed. The first sequence represents human, the second mouse, the third fugu and the fourth Drosophila.

图2表示对应人和果蝇CREAP蛋白质的全长cDNA的氨基酸序列。使用ClustalW比对氨基酸并且阴影表示保守氨基酸。Figure 2 shows the amino acid sequences of the full-length cDNAs corresponding to human and Drosophila CREAP proteins. Amino acids were aligned using ClustalW and shading indicates conserved amino acids.

发明详述Detailed description of the invention

认为此处描述的本发明不限于特定的方法、方案和试剂，因为它们可以改变。也可以理解此处所用的术语是仅为了描述特定实施方案，并且无意以任意方式限制本发明的范围。The invention described herein is not considered limited to particular methodology, protocols and reagents as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the invention in any way.

除非另外定义，此处使用的全部技术与科学术语与本发明所属领域的普通技术人员所通常理解的意义相同。尽管与此处所述的那些方法和材料相似或相当的任意方法和材料能够用于本发明的实践或测试，现在又描述了优选的方法、设备和材料。为了描述和公开报道于可以与本发明结合使用的出版物中的材料和方法，引用此处所提及的全部出版物作为参考。Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods, devices, and materials are now described. All publications mentioned herein are incorporated by reference for the purpose of describing and disclosing the materials and methodologies reported in the publications which may be used in connection with the present invention.

实践本发明过程中，使用了许多分子生物学的常规技术。这些技术是众所周知的并说明于例如Current Protocols in Molecular Biology，I、II和III卷，1997(F.M.Ausubel编著)；Sambrook等人，1989，MolecularCloning：A Laboratory Manual，第二版，Cold Spring Harbor LaboratoryPress，Cold Spring Harbor，N.Y.；DNA Cloning：A Practical Approach，I和II卷，1985(D.N.Glover编著)；Oligonucleotide Synthesis，1984(M.L.Gait编著)；Nucleic Acid Hybridization，1985，(Hames和Higgins)；Transcription and Translation，1984(Hames和Higgins编著)；Animal CellCulture，1986(R.I.Freshney编著)；Immobilized Cells and Enzymes，1986(IRL Press)；Perbal，1984，A Practical Guide to Molecular Cloning；系列，Methods in Enzymology(Academic Press，Inc.)；Gene Transfer Vectors forMammalian Cells，1987(J.H.Miller和M.P.Calos编著，Cold SpringHarbor Laboratory)；以及Methods in Enzymology 154卷和155卷(Wu和Grossman，和Wu，分别编著)。In practicing the present invention, many conventional techniques of molecular biology are employed. These techniques are well known and described in, for example, Current Protocols in Molecular Biology, Volumes I, II and III, 1997 (eds. F.M. Ausubel); Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; DNA Cloning: A Practical Approach, Volumes I and II, 1985 (ed. D.N. Glover); Oligonucleotide Synthesis, 1984 (ed. M.L. Gait); Nucleic Acid Hybridization, 1985, (Hames and Higgins); Transcription and Translation , 1984 (edited by Hames and Higgins); Animal Cell Culture, 1986 (edited by R.I. Freshney); Immobilized Cells and Enzymes, 1986 (IRL Press); Perbal, 1984, A Practical Guide to Molecular res Cloning; Series, Methods in P Enzymology (Academic Inc.); Gene Transfer Vectors for Mammalian Cells, 1987 (eds. J.H. Miller and M.P. Calos, Cold Spring Harbor Laboratory); and Methods in Enzymology volumes 154 and 155 (Wu and Grossman, and Wu, eds., respectively).

缩略语：Abbreviations:

ABIN-2 NF-κB激活-2的A20-结合抑制因子ABIN-2 A20-binding inhibitor of NF-κB activation-2

ACT1 NFkB-激活蛋白1ACT1 NFkB-activator protein 1

ANKRD3 锚蛋白重复序列结构域蛋白3ANKRD3 Ankyrin repeat domain protein 3

AP-1 激活蛋白1AP-1 Activator protein 1

ARHGEF1 Rho鸟嘌呤核苷酸交换因子(GEF)1ARHGEF1 Rho guanine nucleotide exchange factor (GEF) 1

ATCC 美国典型培养物保藏中心ATCC American Type Culture Collection

ATF 激活转录因子ATF activates transcription factor

BZIP 碱性区亮氨酸拉链BZIP Basic zone leucine zipper

C/EBP CCAAT/增强子结合蛋白C/EBP CCAAT/enhancer-binding protein

CAD 组成型活化结构域CAD constitutive activation domain

CAMK Ca⁺⁺/钙调蛋白依赖蛋白质激酶CAMK Ca ⁺⁺ /calmodulin-dependent protein kinase

cAMP 环AMPcAMP Cyclic AMP

CBP CREB结合蛋白质CBP CREB binding protein

CNS 中枢神经系统CNS central nervous system

COPD 慢性阻塞性肺病COPD chronic obstructive pulmonary disease

CR53 假定的转录因子CR53CR53 putative transcription factor CR53

CRE 环AMP效应元件CRE Cyclic AMP Effect Element

CREB 环AMP效应元件结合蛋白质CREB cyclic AMP response element binding protein

CREB1 cAMP应答元件结合蛋白质1CREB1 cAMP response element binding protein 1

CRE-BPa cAMP效应元件-结合蛋白质CRE-BPa cAMP response element-binding protein

CREM cAMP效应元件调节子CREM cAMP response element regulator

ERK 细胞外信号调节激酶ERK Extracellular signal-regulated kinase

EST 表达序列标签EST Expressed Sequence Tag

HPH2 果蝇蛋白Polyhomeotic(Ph)的人同系物Human homologue of HPH2 Drosophila protein Polyhomeotic (Ph)

HPH2 人Polycomb同系物2HPH2 Human Polycomb homolog 2

HTS 高通量筛选HTS High throughput screening

IBMX 3-异丁基-1-甲基黄嘌呤IBMX 3-isobutyl-1-methylxanthine

ICER 可诱导cAMP早期阻遏物ICER Inducible cAMP early repressor

IkBα 核因子κ-B激酶α亚基抑制剂IkBα Inhibitor of nuclear factor κ-B kinase α subunit

IKK IkBα激酶IKK IkBα kinase

IKKγ IkBα激酶γIKKγ IkBα Kinaseγ

IL-1 白细胞介素-1IL-1 Interleukin-1

IL-8 白细胞介素-8IL-8 Interleukin-8

IL-8P-Luc IL-8启动子-报告子驱动萤光素酶表达IL-8P-Luc IL-8 promoter-reporter drives luciferase expression

IL-24 白细胞介素-24IL-24 Interleukin-24

KIAA0616 由cDNA克隆KIAA0616预测的假想蛋白KIAA0616 Hypothetical protein predicted from cDNA clone KIAA0616

KID 激酶可诱导结构域KID Kinase Inducible Domain

MAP3K11 促分裂原-活化蛋白激酶激酶激酶11MAP3K11 Mitogen-activated protein kinase kinase kinase 11

MAP3K12 促分裂原-活化蛋白激酶激酶激酶12MAP3K12 Mitogen-activated protein kinase kinase kinase 12

MEK 促分裂原-活化蛋白激酶/ERK激酶MEK Mitogen-activated protein kinase/ERK kinase

MEKK 促分裂原-活化蛋白激酶/ERK激酶激酶-1MEKK Mitogen-activated protein kinase/ERK kinase kinase-1

MSK 促分裂原和应激活化蛋白激酶MSK Mitogen and stress-activated protein kinase

NFAT 激活T细胞核因子NFAT activation of T cell nuclear factor

NF-IL-6 核因子-白细胞介素-6转录因子NF-IL-6 Nuclear factor-interleukin-6 transcription factor

NF-κB B-细胞中κ轻多肽基因增强子的核因子NF-κB Nuclear Factor of κ Light Polypeptide Gene Enhancer in B-Cells

NPY 神经肽YNPY Neuropeptide Y

NR2F2 细胞核受体亚家族2，组F，成员2NR2F2 Nuclear receptor subfamily 2, group F, member 2

Oct-1 八聚体结合转录因子1Oct-1 octamer-binding transcription factor 1

Oct-1/C/EBP八聚体结合转录因子1/CCAAT/增强子结合蛋白质Oct-1/C/EBP octamer-binding transcription factor 1/CCAAT/enhancer-binding protein

PCK1 烯醇丙酮酸磷酸羧激酶IPCK1 Enolpyruvate phosphate carboxykinase I

PKA 环AMP-依赖蛋白激酶PKA Cyclic AMP-dependent protein kinase

POL II RNA聚合酶IIPOL II RNA polymerase II

relA 网状内皮组织增殖病毒癌基因同系物A，别名NF-κB亚基3、relA reticuloendothelial proliferation virus oncogene homologue A, aliased as NF-κB subunit 3,

p65p65

Rho-GEF- p114Rho-特异鸟嘌呤核苷酸交换因子p114Rho-GEF-p114Rho-specific guanine nucleotide exchange factor p114

RIPK2 受体-相互作用丝氨酸-苏氨酸激酶2RIPK2 Receptor-interacting serine-threonine kinase 2

RLU 相对发光单位RLU relative luminescence unit

RSK 核醣体S6激酶RSK ribosomal S6 kinase

TBP TATA-结合蛋白质TBP TATA-binding protein

TEF1 促甲状腺胚胎因子1TEF1 Thyroid-stimulating embryonic factor 1

TF 转录因子TF transcription factor

TNFα 肿瘤坏死因子-αTNFα Tumor necrosis factor-α

TRAF6 TNF受体相关因子6TRAF6 TNF receptor-associated factor 6

TSHα 促甲状腺激素αTSHα Thyroid-stimulating hormone alpha

VCAM1 血管细胞粘附分子-1VCAM1 Vascular cell adhesion molecule-1

XboxP X-盒结合蛋白质1XboxP X-box binding protein 1

如此处和所附权利要求书中所使用，单数形式″一个″、″一种″和″该″包括复数参照，除非上下文另外清楚地指明。因此，例如，“抗体”是本领域技术人员已知的一种或多种抗体及其等价物。另外，CREAP蛋白质或“CREAP”，除非另外指明，包括此处所公开的任意一种或多种CREAP蛋白质，具体而言是此处鉴定的属于CREAP蛋白质家族的任意一种或多种人CREAP1-3多肽。As used herein and in the appended claims, the singular forms "a", "an" and "the" include plural references unless the context clearly dictates otherwise. Thus, for example, an "antibody" is one or more antibodies and equivalents thereof known to those skilled in the art. Additionally, a CREAP protein or "CREAP", unless otherwise indicated, includes any one or more of the CREAP proteins disclosed herein, specifically any one or more of the human CREAP1-3 identified herein as belonging to the CREAP protein family peptide.

物质“调节”CREAP蛋白质(例如“CREAP调节子”)的能力包括但不限于物质抑制或增强CREAP蛋白质的活性和/或抑制或增强任意一种或多种所述蛋白质表达的能力。此类调节子包括CREAP活性的激动剂和拮抗剂。此类调节还可能参与实现其它蛋白质与CREAP蛋白质相互作用的能力，所述其它蛋白质为例如相关调节蛋白质或由CREAP所修饰的蛋白质。The ability of a substance to "modulate" a CREAP protein (eg, a "CREAP modulator") includes, but is not limited to, the ability of a substance to inhibit or enhance the activity of a CREAP protein and/or to inhibit or enhance expression of any one or more of such proteins. Such modulators include agonists and antagonists of CREAP activity. Such regulation may also be involved in enabling the ability of other proteins to interact with the CREAP protein, such as associated regulatory proteins or proteins modified by CREAP.

如此处所使用，术语“激动剂”，是指可以直接或间接调节多肽(例如CREAP多肽)并增加所述多肽生物学活性的分子(即调节子)。激动剂可以包括蛋白质、核酸、碳水化合物或其他分子。增强蛋白质基因转录或生物化学功能的调节子是分别增加所述蛋白质转录或刺激其生物化学特性或活性的物质。As used herein, the term "agonist" refers to a molecule (ie, modulator) that can directly or indirectly regulate a polypeptide (eg, CREAP polypeptide) and increase the biological activity of the polypeptide. Agonists can include proteins, nucleic acids, carbohydrates or other molecules. Modulators that enhance the gene transcription or biochemical function of a protein are substances that increase the transcription of said protein or stimulate its biochemical properties or activity, respectively.

如此处所所用术语“拮抗剂”或“抑制剂”是指可以直接或间接调节多肽(例如CREAP多肽)并阻断或抑制所述多肽生物学活性的分子(即调节子)。拮抗剂和抑制剂可以包括蛋白质、核酸、碳水化合物或其他分子。抑制蛋白质表达或生物化学功能的调节子是分别降低所述蛋白质基因表达或生物学活性的物质。The term "antagonist" or "inhibitor" as used herein refers to a molecule (ie, a modulator) that can directly or indirectly modulate a polypeptide (eg, a CREAP polypeptide) and block or inhibit the biological activity of the polypeptide. Antagonists and inhibitors may include proteins, nucleic acids, carbohydrates or other molecules. Modulators that inhibit the expression or biochemical function of a protein are substances that reduce the gene expression or biological activity of said protein, respectively.

如此处所使用“核酸序列”是指寡核苷酸、核苷酸或多核苷酸及其片段或部分，并且也指基因组或合成来源的DNA或RNA，它们可以是单链的或双链的，并代表有义链或反义链。"Nucleic acid sequence" as used herein refers to oligonucleotides, nucleotides or polynucleotides and fragments or portions thereof, and also to DNA or RNA of genomic or synthetic origin, which may be single-stranded or double-stranded, and represent the sense or antisense strand.

如此出所使用术语“反义”是指与特定DNA或RNA序列互补的核苷酸序列。术语“反义链”此处用于指与“有义”链互补的核酸链。反义分子可以通过任意方法产生，包括通过将目的基因以反方向连接至允许互补链合成的病毒启动子上进行合成。一旦导入细胞，该转录链结合细胞产生的天然序列形成双链体。然后这些双链体阻断进一步的转录或翻译。命名“负”有时用于指反义链，而“正”有时用于指有义链。The term "antisense" as used herein refers to a nucleotide sequence that is complementary to a specific DNA or RNA sequence. The term "antisense strand" is used herein to refer to the nucleic acid strand that is complementary to the "sense" strand. Antisense molecules can be produced by any method, including synthesis by linking the gene of interest in reverse orientation to a viral promoter that allows synthesis of the complementary strand. Once introduced into a cell, the transcribed strand combines with the native sequence produced by the cell to form a duplex. These duplexes then block further transcription or translation. The nomenclature "negative" is sometimes used to refer to the antisense strand and "positive" is sometimes used to refer to the sense strand.

如此处所认为，反义寡核苷酸、三股螺旋DNA、RNA适体、siRNA、核酶和双链或单链RNA设计用于抑制CREAP表达从而根据蛋白质的所选核酸序列设计的抑制分子能够引起内源性CREAP产生特异抑制作用。例如，利用CREAP1核苷酸序列的知识而无需进行过多实验就可以设计反义分子，该反义分子与CREAP mRNA产生最强的杂交作用。同样，能够合成识别目的蛋白质的特定核苷酸序列并将其切割的核酶(Cech.J.Amer.Med Assn.260：3030(1988))。设计此类分子用于基因表达靶定抑制的技术是本领域的技术人员众所周知的。As considered herein, antisense oligonucleotides, triple-helix DNA, RNA aptamers, siRNA, ribozymes, and double- or single-stranded RNA are designed to inhibit CREAP expression so that inhibitory molecules designed according to the selected nucleic acid sequence of the protein can elicit Endogenous CREAP produces specific inhibitory effect. For example, knowledge of the CREAP1 nucleotide sequence can be used without undue experimentation to design antisense molecules that hybridize most strongly to CREAP mRNA. Also, a ribozyme that recognizes a specific nucleotide sequence of a protein of interest and cleaves it can be synthesized (Cech. J. Amer. Med Assn. 260: 3030 (1988)). Techniques for designing such molecules for targeted inhibition of gene expression are well known to those skilled in the art.

此处公开的CREAP蛋白质包括但不限于人CREAP1、CREAP2和CREAP3多肽，这些多肽的任意及全部形式包括但不限于，来自人类或任意其它物种的部分形式、同系物、同种型、前体形式、全长多肽、包含任意上述蛋白质序列或部分的融合蛋白。目的片段包括但不限于那些包含对于正常CREAP功能特别重要的氨基酸的片段，例如包括氨基酸356-580。CREAP1的序列及其变体可以发现于Genbank登录号NM_025021和AB014516。据申请人的认识，以前还没有公开CREAP2和CREAP3的完整、正确序列；部分序列可以发现于Genbank(CREAP 2登录号XM_117201(DNA)和XP_117201(蛋白质)和CREAP3登录号AK090443(DNA)和BAC03424(蛋白质))。CREAP同系物包括文中所公开的那些，和对于本领域的技术人员显而易见的那些同系物，并且它们包括于本发明的范围之内。也可以认为CREAP蛋白质包括那些分离于天然存在的任意物种来源例如基因组DNA文库和包含表达系统的基因工程宿主细胞的CREAP蛋白质，或者通过化学合成，使用例如自动化肽合成仪或此类方法组合产生的CREAP蛋白质。分离和制备此类多肽的方法在本领域是众所周知的。The CREAP proteins disclosed herein include, but are not limited to, human CREAP1, CREAP2, and CREAP3 polypeptides, and any and all forms of these polypeptides include, but are not limited to, partial forms, homologues, isoforms, precursor forms from humans or any other species , a full-length polypeptide, a fusion protein comprising any of the above protein sequences or parts. Fragments of interest include, but are not limited to, those comprising amino acids that are particularly important for normal CREAP function, for example including amino acids 356-580. The sequence of CREAP1 and variants thereof can be found in Genbank accession numbers NM_025021 and AB014516. To the applicant's knowledge, the complete and correct sequences of CREAP2 and CREAP3 have not been published before; partial sequences can be found in Genbank (CREAP 2 accession numbers XM_117201 (DNA) and XP_117201 (protein) and CREAP3 accession numbers AK090443 (DNA) and BAC03424 ( protein)). CREAP homologues include those disclosed herein, and those homologues apparent to those skilled in the art, and are included within the scope of the present invention. CREAP proteins may also be considered to include those isolated from naturally occurring sources of any species such as genomic DNA libraries and genetically engineered host cells containing expression systems, or produced by chemical synthesis, using, for example, an automated peptide synthesizer, or combinations of such methods CREAP protein. Methods for isolating and preparing such polypeptides are well known in the art.

如文中所使用的术语“样本”或“生物样本”以其最广泛含义使用。来自受试者的生物样本可以包括血、尿或其他生物材料，使用这些样本可以测定CREAP蛋白质的活性或基因表达。The term "sample" or "biological sample" as used herein is used in its broadest sense. Biological samples from a subject can include blood, urine, or other biological material, using which samples CREAP protein activity or gene expression can be determined.

如文中所使用，术语“抗体”指完整分子及其片段，例如Fa、F(ab′)x₂和Fv，它们能够结合表位决定簇。使用完整多肽或者含有作为免疫抗原的目的小肽的片段能够制备结合文中所公开的CREAP多肽的抗体。用于免疫动物的多肽或肽可以来自于RNA的翻译或化学合成，并且如果期望，能够将所述变肽或肽连接于载体蛋白质上。通常使用的与肽化学偶联的载体包括牛血清白蛋白和甲状腺球蛋白。然后偶联的肽用于免疫动物(例如小鼠、大鼠或兔)。As used herein, the term "antibody" refers to whole molecules and fragments thereof, such as Fa, F(ab') _x2 and Fv, which are capable of binding an epitopic determinant. Antibodies that bind the CREAP polypeptides disclosed herein can be prepared using the entire polypeptide or a fragment containing a small peptide of interest as an immunizing antigen. Polypeptides or peptides used to immunize animals can be derived from translation of RNA or chemical synthesis, and if desired, the altered peptides or peptides can be linked to a carrier protein. Commonly used carriers for chemically coupling peptides include bovine serum albumin and thyroglobulin. The coupled peptides are then used to immunize animals (eg mice, rats or rabbits).

如文中所使用术语“人源化抗体”指其中非抗原结合区域内的氨基酸已经被替换以与人抗体更相似，但仍保留最初结合能力的抗体分子。The term "humanized antibody" as used herein refers to an antibody molecule in which amino acids in non-antigen-binding regions have been replaced to more closely resemble human antibodies, yet retain the original binding ability.

肽模拟物是基于能够模拟正常多肽功能的主题多肽的关键残基的知识产生的合成的衍生肽或非肽试剂。肽模拟物能够破坏多肽结合其受体或其他蛋白质并且干扰多肽的正常功能。例如，CREAP模拟物将干扰正常CREAP功能。A peptidomimetic is a synthetically derived peptide or non-peptidic agent based on the knowledge of key residues of a subject polypeptide capable of mimicking normal polypeptide function. Peptidomimetics are capable of disrupting the binding of a polypeptide to its receptor or other proteins and interfering with the normal function of the polypeptide. For example, a CREAP mimetic will interfere with normal CREAP function.

“治疗有效量”是足以治疗、预防或缓解与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的药物量。A "therapeutically effective amount" is an amount of drug sufficient to treat, prevent or alleviate pathological conditions associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines.

如文中所使用“相关调节蛋白质”和“相关调节多肽”是指参与CREAP蛋白质调节的多肽，通过本领域技术人员使用诸如文中所述的常规方法可以对所述多肽进行鉴定。As used herein, "associated regulatory proteins" and "associated regulatory polypeptides" refer to polypeptides involved in the regulation of CREAP proteins, which can be identified by those skilled in the art using conventional methods such as described herein.

“与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况”包括但不限于这些疾病例如：骨关节炎、COPD、银屑病、气喘、类风湿性关节炎、癌症、病理性血管发生、糖尿病、高血压、慢性痛和其它炎症与自身免疫病以及神经变性疾病诸如阿尔茨海默氏病、帕金森氏病和亨廷顿病。异常激活可以包括过度激活，例如其中编码CREAP蛋白质的mRNA上调或者这些基因的蛋白质产物在细胞中通过增加绝对量或比活而增强了活性的情况，以及其中CRE-依赖性基因表达下调或异常低的趋化因子激活的情况。"Pathological conditions associated with aberrant activation of CRE-dependent gene expression or aberrant activation of chemokines" includes, but is not limited to, diseases such as: osteoarthritis, COPD, psoriasis, asthma, rheumatoid arthritis, cancer, pathology angiogenesis, diabetes, hypertension, chronic pain and other inflammatory and autoimmune diseases, and neurodegenerative diseases such as Alzheimer's, Parkinson's and Huntington's diseases. Aberrant activation can include overactivation, such as those in which mRNAs encoding CREAP proteins are upregulated or the protein products of these genes have enhanced activity in cells by increasing absolute amounts or specific activities, and in which CRE-dependent gene expression is downregulated or abnormally low activation of chemokines.

如文中所认为，本发明包括使用文中所公开的CREAP基因和基因产物发现分别诱导或抑制CRE-依赖基因的激动剂和拮抗剂的方法。如文中所使用，“CRE-依赖”基因包括那些依赖于通过CRE-结合蛋白质如CREB1、CREB2、CRE-BPa起作用的环amp效应元件的基因(综述，见Lonze，B.，和Ginty，D.(2002)Neuron 35，605；Muller FU，Neumann J，Schmitz W.，Mol Cell Biochem 2000 Sep；212(1-2)：11-7和Mayr B，Montminy M.Nat Rev Mol Cell Biol 2001 Aug；2(8)：599-609)。这些基因包括但不限于对代谢控制至关重要的基因，例如PEPCK、解偶联蛋白质-1、神经调节分子，例如甘丙肽和酪氨酸羟化酶，以及包括胰岛素和双调蛋白在内的生长因子。CREAP激活的趋化因子包括IL-8和Exodus1/MIP3α和CRE激活的趋化因子包括MIP-1β(Proffitt等人，1995，Gene 152：173-179；和Zhang等人，2002；J.Biol Chem，277：19042-19048)。As recognized herein, the present invention encompasses methods of using the CREAP genes and gene products disclosed herein to discover agonists and antagonists, respectively, that induce or inhibit CRE-dependent genes. As used herein, "CRE-dependent" genes include those genes that depend on the loop amp response element acting through CRE-binding proteins such as CREB1, CREB2, CRE-BPa (for review, see Lonze, B., and Ginty, D. .(2002) Neuron 35, 605; Muller FU, Neumann J, Schmitz W., Mol Cell Biochem 2000 Sep; 212(1-2): 11-7 and Mayr B, Montminy M.Nat Rev Mol Cell Biol 2001 Aug; 2(8):599-609). These genes include, but are not limited to, genes critical for metabolic control such as PEPCK, uncoupling protein-1, neuroregulatory molecules such as galanin and tyrosine hydroxylase, and genes including insulin and amphiregulin growth factor. Chemokines activated by CREAP include IL-8 and Exodus1/MIP3α and by CRE include MIP-1β (Proffitt et al., 1995, Gene 152:173-179; and Zhang et al., 2002; J. Biol Chem , 277:19042-19048).

“受试者”指任意人或非人生物体。"Subject" refers to any human or non-human organism.

以其最广泛含义，术语“基本上相似”或“相当”，当文中用于指核苷酸序列时，意思是指对应参照核苷酸序列的核苷酸序列，其中相应序列编码与参照核苷酸所编码的多肽具有基本上相同的结构和功能的多肽，例如仅改变不影响多肽功能的氨基酸。所期望的是基本上相似的核苷酸序列编码参照核苷酸序列所编码的多肽。基本上相似核苷酸序列和参照核苷酸序列之间的同一性百分数期望为至少80％，更加期望的为至少85％，优选地为至少90％，更加优选地为至少95％，更加优选地为至少99％。In its broadest sense, the term "substantially similar" or "equivalent", when used herein in reference to a nucleotide sequence, means a nucleotide sequence corresponding to a reference nucleotide sequence, wherein the corresponding sequence encodes a Polypeptides encoded by nucleotides have substantially the same structure and function as polypeptides, for example, only changes in amino acids that do not affect the function of the polypeptide. It is expected that substantially similar nucleotide sequences encode polypeptides encoded by the reference nucleotide sequence. The percent identity between a substantially similar nucleotide sequence and a reference nucleotide sequence is desirably at least 80%, more desirably at least 85%, preferably at least 90%, even more preferably at least 95%, even more preferably Ground is at least 99%.

“基本上相似”于参照核苷酸序列的核苷酸序列与参照核苷酸序列在50℃下在7％十二烷基硫酸钠(SDS)、0.5M NaPO₄、1mM EDTA中杂交，50℃下在2×SSC，0.1％SDS中洗涤，更加期望地是在50℃下在7％十二烷基硫酸钠(SDS)、0.5M NaPO4、1mM EDTA中杂交，50℃时在1×SSC，0.1％SDS中洗涤，更加期望地是50℃下在7％十二烷基硫酸钠(SDS)、0.5M NaPO₄、1mM EDTA中杂交，50℃下在0.5×SSC，0.1％SDS中洗涤，优选地是50℃下在7％十二烷基硫酸钠(SDS)、0.5M NaPO₄、1mMEDTA中杂交，50℃下在0.1×SSC，0.1％SDS中洗涤，更加优选地是50℃下在7％十二烷基硫酸钠(SDS)、0.5M NaPO₄、1mM EDTA中杂交，65℃下在0.1×SSC，0.1％SDS中洗涤，所述核苷酸序列仍然编码功能性等价基因产物。一般而言，杂交条件可以是高度严格条件或较低高度严格条件。在其中核酸分子是脱氧寡核苷酸(“oligo”)的情况下，高度严格条件可以指例如在37℃(对于14-碱基oligo)、48℃(对于17-碱基oligo)、55℃(对于20-碱基oligo)和60℃(对于23-碱基oligo)下在6×SSC/0.05％焦磷酸钠中洗涤。对于不同组成的核酸此类严格条件的适当范围描述于Krause和Aaronson(1991)，Methods in Enzymology，200：546-556，此外还有Maniatis等人，上面引用。A nucleotide sequence "substantially similar" to a reference nucleotide sequence hybridizes to the reference nucleotide sequence at 50°C in 7% sodium dodecyl sulfate (SDS), 0.5M NaPO ₄ , 1 mM EDTA, 50 Wash in 2×SSC, 0.1% SDS at 50° C., more desirably hybridize in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50° C., in 1×SSC at 50° C. , wash in 0.1% SDS, more desirably hybridize in 7% sodium dodecyl sulfate (SDS), 0.5M NaPO ₄ , 1 mM EDTA at 50°C, wash in 0.5×SSC, 0.1% SDS at 50°C , preferably hybridized in 7% sodium dodecyl sulfate (SDS), 0.5M NaPO ₄ , 1 mM EDTA at 50°C, washed in 0.1×SSC, 0.1% SDS at 50°C, more preferably at 50°C Hybridized in 7% sodium dodecyl sulfate (SDS), 0.5M NaPO ₄ , 1 mM EDTA, washed at 65°C in 0.1×SSC, 0.1% SDS, the nucleotide sequence still encodes a functionally equivalent gene product. Generally, hybridization conditions can be highly stringent conditions or less highly stringent conditions. In the case where the nucleic acid molecule is a deoxyoligonucleotide ("oligo"), highly stringent conditions can refer to, for example, at 37°C (for 14-base oligos), 48°C (for 17-base oligos), 55°C Wash in 6xSSC/0.05% sodium pyrophosphate (for 20-base oligos) and 60°C (for 23-base oligos). Appropriate ranges of such stringent conditions for nucleic acids of varying composition are described by Krause and Aaronson (1991), Methods in Enzymology, 200:546-556, and also by Maniatis et al., cited above.

“提高的mRNA转录水平”是指在患有与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的个体的适当组织或细胞中从编码本发明的CREAP多肽的天然内源人基因转录的信使RNA与对照水平相比更大的量，具体而言是在未患有此种疾病的受试者相应组织内所发现mRNA的量的至少约2倍，优选地为至少约5倍，更加优选地为至少约10倍，最优选地为至少约100倍。此种上升的mRNA水平最终可以导致与健康个体相比在患有所述疾病的个体内从此种mRNA转录的蛋白质水平增加。"Increased mRNA transcript level" refers to the expression of CREAP polypeptide of the present invention from the natural endogenous mRNA in appropriate tissues or cells of an individual suffering from abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines The amount of mRNA transcribed from the human gene is greater than the control level, specifically at least about 2 times the amount of mRNA found in the corresponding tissue of a subject not suffering from the disease, preferably at least about 5 times, more preferably at least about 10 times, most preferably at least about 100 times. Such elevated mRNA levels can ultimately lead to increased levels of protein transcribed from such mRNA in individuals with the disease compared to healthy individuals.

如文中所使用，“宿主细胞”是指含有异源DNA的原核或真核细胞，该异源DNA已经通过任意方法，例如电穿孔、磷酸钙沉淀、显微注射、转化、病毒感染等等导入细胞。As used herein, "host cell" refers to a prokaryotic or eukaryotic cell containing heterologous DNA that has been introduced by any method, such as electroporation, calcium phosphate precipitation, microinjection, transformation, viral infection, etc. cell.

如文中所使用“异源”意思是指“不同天然来源”或代表非天然状态。例如，如果使用来自另一种生物体，特别是来自另一物种的DNA或基因转化宿主细胞，该基因对于宿主细胞是异源的并且对于携带该基因的宿主细胞的子代细胞也是异源的。同样，异源的是指来自并插入到相同天然最初细胞类型的核苷酸序列，但是它以非天然状态存在，例如不同拷贝数，或者在不同调节元件的控制下。"Heterologous" as used herein means a "different natural source" or represents a non-natural state. For example, if a host cell is transformed with DNA or a gene from another organism, particularly from another species, that gene is heterologous to the host cell and also heterologous to progeny cells of the host cell carrying the gene . Likewise, heterologous refers to a nucleotide sequence that is derived from and inserted into the same native cell type, but which is present in a non-native state, eg, at a different copy number, or under the control of different regulatory elements.

“载体”分子是核酸分子，其中可以插入异源核酸然后能够将该核酸分子导入适当宿主细胞。载体优选地具有一个或多个复制起点，和重组DNA能够插入其中的一个或多个位点。载体通常具有便利方法，通过这些方法能够从那些不含该载体的细胞中将含有该载体的细胞选择出来，例如这些载体编码药物抗性基因。常见载体包括质粒、病毒基因组和(主要在酵母和细菌内)“人工染色体”。A "vector" molecule is a nucleic acid molecule into which a heterologous nucleic acid can be inserted which can then be introduced into a suitable host cell. A vector preferably has one or more origins of replication, and one or more sites into which recombinant DNA can be inserted. Vectors generally have convenient means by which cells containing the vector can be selected from those that do not, eg, these vectors encode drug resistance genes. Common vectors include plasmids, viral genomes and (mainly in yeast and bacteria) "artificial chromosomes".

“质粒”文中通常以小写字母p开始，大写字母和/或数字随后进行命名，与本领域技术人员所熟悉的标准命名惯例相一致。文中所公开的起始质粒或者是商业可得的、在无限制基础上公开可得的，或者能够通过众所周知的常规方法、已公开的操作从可获得质粒进行构建。根据本发明能够使用的许多质粒和其他克隆和表达载体是众所周知的并且本领域的那些技术人员很容易获得。而且，技术人员可以容易地构建适用于本发明的任意多种其他质粒。本发明中此类质粒及其他载体的特性、构建和用途对于本公开的技术人员是显而易见的。"Plasmid" is used herein to generally begin with a lowercase p, followed by an uppercase letter and/or number, consistent with standard naming conventions familiar to those skilled in the art. The starting plasmids disclosed herein are either commercially available, publicly available on an unrestricted basis, or can be constructed from available plasmids by well-known routine methods, published procedures. Many plasmids and other cloning and expression vectors that can be used in accordance with the present invention are well known and readily available to those skilled in the art. Moreover, a skilled artisan can easily construct any number of other plasmids suitable for use in the present invention. The nature, construction and use of such plasmids and other vectors in the present invention will be apparent to those skilled in the disclosure.

术语“分离的”意思是指将物质从其最初的环境(例如如果该物质是天然存在的则指天然环境)除去。例如，存在于活的动物中天然存在多核苷酸或多肽不是分离的，但是与天然系统中一些或全部共同存在的物质分离的相同多核苷酸或多肽是分离的，即使它们随后导入该天然系统中。此类多核苷酸可以是载体的部分和/或此类多核苷酸或多肽可以是组合物的部分，并且仍然是分离的，因为此种载体或组合物不是其天然环境中的部分。The term "isolated" means that a material is removed from its original environment (eg, the natural environment if the material is naturally occurring). For example, a naturally occurring polynucleotide or polypeptide present in a living animal is not isolated, but the same polynucleotide or polypeptide separated from some or all co-occurring materials in the natural system is isolated even if they are subsequently introduced into the natural system middle. Such polynucleotides may be part of a vector and/or such polynucleotides or polypeptides may be part of a composition and still be isolated in that such vector or composition is not part of its natural environment.

如文中所使用，术语“转录控制序列”是指DNA序列，例如起始密码子序列、增强子序列和启动子序列，转录控制序列诱导、抑制或者以其他方式控制它们有效连接的编码蛋白质的核酸序列的转录。As used herein, the term "transcriptional control sequence" refers to a DNA sequence, such as an initiation codon sequence, an enhancer sequence, and a promoter sequence, which induces, represses, or otherwise controls the protein-encoding nucleic acid to which they are operably linked. transcription of the sequence.

如文中所使用，“人转录控制序列”是任意那些发现于各自人染色体并通常发现与编码本发明多种CREAP蛋白质的任意一种人基因相关的任意一种转录控制序列。As used herein, a "human transcriptional control sequence" is any of those transcriptional control sequences that are found in the respective human chromosome and are generally found in association with any of the human genes encoding the various CREAP proteins of the present invention.

如文中所使用，“非人转录控制序列”是未发现于人基因组中的任意转录控制序列。As used herein, a "non-human transcriptional control sequence" is any transcriptional control sequence not found in the human genome.

如文中所使用，本发明多肽的“化学衍生物”是本发明的一种多肽，其含有额外的通常不是所述分子部分的化学部分。此化学部分可以改善该分子的可溶性、吸收、生物半寿期等等。该部分可以备选降低分子的毒性，去除或减弱该分子的任何不期望的副作用等。已经例如在Remington′sPharmaceutical Sciences，第16版，Mack Publishing Co.，Easton，Pa.(1980)中公开能够调节此类效应的部分。As used herein, a "chemical derivative" of a polypeptide of the invention is a polypeptide of the invention that contains additional chemical moieties that are not normally part of the molecule. This chemical moiety can improve the solubility, absorption, biological half-life, etc. of the molecule. The moiety may optionally reduce the toxicity of the molecule, remove or attenuate any undesired side effects of the molecule, and the like. Moieties capable of modulating such effects have been disclosed, for example, in Remington's Pharmaceutical Sciences, 16th ed., Mack Publishing Co., Easton, Pa. (1980).

本发明基于一个令人惊奇的发现，即先前在公开序列数据库中称为″KIAA0616”且至今未知功能的蛋白质是CRE-激活蛋白质。文中称为CREAP1，除了通常激活CRE-依赖转录之外，该多肽还能够诱导多种疾病相关基因例如趋化因子，酶例如PEPCK和生长因子例如双调蛋白。The present invention is based on the surprising discovery that a protein previously designated "KIAA0616" in the public sequence database and of as yet unknown function is a CRE-activating protein. Referred to herein as CREAP1, in addition to normally activating CRE-dependent transcription, this polypeptide is also capable of inducing a variety of disease-associated genes such as chemokines, enzymes such as PEPCK and growth factors such as amphiregulin.

另外，对公共数据库的检索表明先前收录而未指出功能的2种cDNA和蛋白质(虽然存在错误且/或仅部分序列)XP 117201和FLJ00364编码具有CREAP1相似活性的蛋白质。如此，本发明包括迄今为止未公开的准确核苷酸序列，如文中详细概括，这些序列编码文中命名为CREAP2和CREAP3的多肽并且这些多肽属于新CREAP蛋白质家族。In addition, a search of public databases revealed that 2 cDNAs and proteins (albeit with errors and/or only partial sequences) that were previously included without indicating function, XP 117201 and FLJ00364, encode proteins with CREAP1-like activity. Thus, the present invention includes the heretofore unpublished exact nucleotide sequences encoding polypeptides designated herein as CREAP2 and CREAP3 and which belong to the novel CREAP protein family, as outlined in detail herein.

因此，本发明提供了包含显示于SEQ ID NO：16和SEQ ID NO：25中的氨基酸序列的分离的多肽。此外，本发明提供了由显示于SEQ ID NO：16和SEQ ID NO：25中的氨基酸序列所组成的分离的多肽。例如，此类多肽可以是包含CREAP2或CREAP3的氨基酸序列的融合蛋白质。文中还涉及到包含CREAP1的融合蛋白质。Accordingly, the invention provides isolated polypeptides comprising the amino acid sequences shown in SEQ ID NO: 16 and SEQ ID NO: 25. Furthermore, the present invention provides isolated polypeptides consisting of the amino acid sequences shown in SEQ ID NO: 16 and SEQ ID NO: 25. For example, such a polypeptide may be a fusion protein comprising the amino acid sequence of CREAP2 or CREAP3. The text also relates to fusion proteins comprising CREAP1.

本发明还包括分离的核酸或核苷酸分子，优选地为DNA分子，具体而言是编码CREAP蛋白质，特别是CREAP2或CREAP3的分子。优选地，本发明分离的核酸分子，优选地DNA分子，编码包含显示于SEQ IDNO：16或SEQ ID NO：25中的氨基酸序列的多肽。同样优选的是分离的核酸分子，优选地为DNA分子，所述核酸分子编码由显示于SEQ ID NO：16或SEQ ID NO：25中的氨基酸序列组成的多肽。The invention also includes isolated nucleic acid or nucleotide molecules, preferably DNA molecules, in particular molecules encoding a CREAP protein, especially CREAP2 or CREAP3. Preferably, the isolated nucleic acid molecule of the invention, preferably a DNA molecule, encodes a polypeptide comprising the amino acid sequence shown in SEQ ID NO: 16 or SEQ ID NO: 25. Also preferred is an isolated nucleic acid molecule, preferably a DNA molecule, encoding a polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 16 or SEQ ID NO: 25.

本发明还包括：(a)载体，其包含CREAP蛋白质，尤其人CREAP1、CREAP2或CREAP3或其部分的核苷酸序列和/或其互补序列(即反义序列)；(b)载体分子，优选地为包含转录控制序列的载体分子，尤其是表达载体，其包含任意上述CREAP蛋白质的编码序列，其与指导编码序列表达的调节元件有效连接；和(c)基因工程化宿主细胞，其包含如文中所述载体分子或至少任意上述核苷酸序列的片段，该序列与指导宿主细胞中编码序列表达的调节元件有效连接。如文中所使用，调节元件包括但不限于可诱导和不可诱导型启动子、增强子、操纵基因以及本领域技术人员熟知的驱动和调控表达的其他元件。优选地为，宿主细胞可以是脊椎动物宿主细胞，优选地为哺乳动物宿主细胞，例如人细胞或啮齿类动物的细胞，例如CHO或BHK细胞。同样优选地为宿主细胞可以是细菌宿主细胞，尤其是大肠杆菌(E.coli)细胞The present invention also includes: (a) carrier, it comprises CREAP protein, especially the nucleotide sequence of human CREAP1, CREAP2 or CREAP3 or its part and/or its complementary sequence (i.e. antisense sequence); (b) carrier molecule, preferably It is a carrier molecule comprising a transcriptional control sequence, especially an expression vector, which comprises the coding sequence of any of the above-mentioned CREAP proteins, which is operatively linked with a regulatory element directing the expression of the coding sequence; and (c) a genetically engineered host cell, which comprises such as The carrier molecule described herein or at least a fragment of any of the above-mentioned nucleotide sequences is operably linked to regulatory elements that direct the expression of the coding sequence in a host cell. As used herein, regulatory elements include, but are not limited to, inducible and non-inducible promoters, enhancers, operators, and other elements that drive and regulate expression well known to those skilled in the art. Preferably, the host cell may be a vertebrate host cell, preferably a mammalian host cell, such as a human cell or a rodent cell, such as a CHO or BHK cell. Also preferably the host cell may be a bacterial host cell, especially an Escherichia coli (E.coli) cell

特别优选宿主细胞，具体而言是上述类型的细胞，它们能够在体外进行繁殖并且能够培养生长时产生CREAP多肽，具体而言包含或由显示于SEQ ID NOs：2、16或25中的氨基酸序列所组成的多肽，其中所述细胞包含不是编码所述多肽的天然内源人基因转录控制序列的至少一种转录控制序列，其中所述一种或多种转录控制序列控制编码所述多肽的DNA的转录。Particularly preferred host cells, specifically the above-mentioned types of cells, they are capable of propagating in vitro and producing CREAP polypeptides when grown in culture, specifically comprising or consisting of the amino acid sequence shown in SEQ ID NOs: 2, 16 or 25 A polypeptide consisting of, wherein said cell comprises at least one transcriptional control sequence that is not the transcriptional control sequence of a native endogenous human gene encoding said polypeptide, wherein said one or more transcriptional control sequences control the DNA encoding said polypeptide transcription.

本发明还包括文中所公开的任意核酸序列的片段。编码CREAP多肽的核酸序列片段可以用作cDNA文库的杂交探针以分离全长基因和分离与CREAP具有相似生物学活性的与CREAP基因具有高度序列相似性的其他基因。该类型探针优选地具有至少约30个碱基并且包含，如约30至约50个碱基、约50至约100个碱基、约100至约200个碱基、或者多于200个碱基。探针还可以用于鉴定对应于全长转录物的cDNA克隆和基因组单个或多个克隆，其中所述的基因组克隆包含包括调节区域和启动子区域、外显子和内含子的完整CREAP基因。筛选的实例包括通过使用已知DNA序列合成寡核苷酸探针以分离CREAP基因编码区。具有与本发明基因序列互补的序列的标记寡核苷酸用于筛选人cDNA、基因组DNA或mRNA文库以确定该探针能够与文库的哪些成员杂交。The invention also includes fragments of any of the nucleic acid sequences disclosed herein. Nucleic acid sequence fragments encoding CREAP polypeptides can be used as hybridization probes for cDNA libraries to isolate full-length genes and to isolate other genes with high sequence similarity to CREAP genes that have similar biological activity as CREAP. Probes of this type preferably have at least about 30 bases and comprise, such as about 30 to about 50 bases, about 50 to about 100 bases, about 100 to about 200 bases, or more than 200 bases . Probes can also be used to identify cDNA clones corresponding to full-length transcripts and genomic single or multiple clones comprising the complete CREAP gene including regulatory and promoter regions, exons and introns . An example of a screen includes isolating the coding region of the CREAP gene by synthesizing oligonucleotide probes using known DNA sequences. Labeled oligonucleotides having a sequence complementary to the gene sequence of the invention are used to screen human cDNA, genomic DNA or mRNA libraries to determine which members of the library the probe hybridizes to.

除了上面所述的基因序列外，文中公开了此种序列的同系物，优选地鉴定出了果蝇、小鼠和红鳍东方豚(Fugu rubripres)的CREAP蛋白质(见下文实施例)。通过本领域公知的分子生物学技术不用过多地进行实验可以鉴定并容易地分离额外的同系物。此外，在基因组内的其他遗传基因座上可能存在编码与此种基因产物的一个或多个结构域有着广泛同源性的蛋白质的基因。通过相似的技术还可以鉴定出这些基因。In addition to the gene sequences described above, homologues of such sequences are disclosed, preferably the Drosophila, mouse and Fugu rubripres CREAP proteins are identified (see Examples below). Additional homologues can be identified and readily isolated by molecular biology techniques well known in the art without undue experimentation. In addition, there may be genes encoding proteins with extensive homology to one or more domains of such gene products at other genetic loci within the genome. These genes can also be identified by similar techniques.

例如，本发明编码CREAP多肽的分离的核苷酸序列可以进行标记并用于筛选从由目的生物体获得的mRNA构建的cDNA文库。当cDNA文库来自于与标记序列所来源的生物体类型不同的生物体时，杂交条件将是较低严格的。备选地，标记的片段可以用于再次使用适当严格条件筛选来自目的生物体的基因组文库。对于本领域技术人员此类低严格条件将是众所周知的，并且将依赖于文库和标记序列所来源的特定生物体进行可预测的改变。关于此类条件的指导参见例如上面引用的Sambrook等人的文献。For example, an isolated nucleotide sequence encoding a CREAP polypeptide of the invention can be labeled and used to screen a cDNA library constructed from mRNA obtained from an organism of interest. When the cDNA library is from a different type of organism than that from which the marker sequences are derived, the hybridization conditions will be less stringent. Alternatively, the labeled fragments can be used again to screen a genomic library from the organism of interest using appropriate stringency conditions. Such low stringency conditions will be well known to those skilled in the art and will vary predictably depending on the particular organism from which the library and marker sequences are derived. Guidance on such conditions is found, eg, in Sambrook et al., cited above.

另外，通过使用基于目的基因内的氨基酸序列设计的2种简并寡核苷酸引物群进行PCR可以分离先前未知的差别表达的基因型序列。反应的模板可以是通过mRNA反转录获得的cDNA，所述mRNA是从已知或怀疑表达差别表达基因等位基因的人或非人细胞系或组织制备得到的。In addition, previously unknown differentially expressed genotypic sequences can be isolated by PCR using 2 degenerate oligonucleotide primer sets designed based on the amino acid sequence within the gene of interest. The template for the reaction may be cDNA obtained by reverse transcription of mRNA prepared from human or non-human cell lines or tissues known or suspected of expressing differentially expressed gene alleles.

PCR产物可以进行亚克隆并测序以保证所扩增的序列代表差别表达基因样核酸序列的序列。然后PCR片段可以用于通过多种方法分离全长cDNA克隆。例如，所扩增的片段可以进行标记并用于筛选噬菌体cDNA文库。备选地，标记的片段可以用于筛选基因组文库。The PCR products can be subcloned and sequenced to ensure that the amplified sequences represent those of the differentially expressed gene-like nucleic acid sequences. The PCR fragments can then be used to isolate full-length cDNA clones by a variety of methods. For example, the amplified fragments can be labeled and used to screen phage cDNA libraries. Alternatively, labeled fragments can be used to screen genomic libraries.

PCR技术还可以用于分离全长cDNA序列。例如，按照标准程序，可以从适当细胞或组织来源分离RNA。使用对所扩增片段的最5′末端特异的寡核苷酸引物对RNA进行逆转录反应以启动第一链合成。然后使用标准末端转移酶反应用鸟嘌呤对所得到的RNA/DNA杂合体进行“加尾”，可以用RNA酶H消化杂合体，并随后使用多聚-C引物启动第二链的合成。因此，所扩增片段的上游cDNA序列可以容易地进行分离。对于可以使用的克隆策略的综述，参见例如Sambrook等人，1989，如前。PCR techniques can also be used to isolate full-length cDNA sequences. For example, RNA can be isolated from appropriate cell or tissue sources following standard procedures. The RNA is subjected to a reverse transcription reaction using oligonucleotide primers specific for the most 5' ends of the amplified fragments to initiate first strand synthesis. The resulting RNA/DNA hybrid is then "tailed" with guanine using a standard terminal transferase reaction, and the hybrid can be digested with RNase H and subsequently primed with poly-C primers to initiate second-strand synthesis. Therefore, the upstream cDNA sequence of the amplified fragment can be easily isolated. For a review of cloning strategies that can be used, see eg Sambrook et al., 1989, supra.

在所鉴定的基因是正常或野生型基因的情况下，该基因可用于分离该基因的突变体等位基因。在已知或怀疑具有遗传基础的过程和失调中此种分离是优选的。突变体等位基因可以从已知或怀疑具有促进与炎症或免疫反应相关疾病症状的基因型的个体分离。然后突变体等位基因和突变体等位基因产物可以用于下面所描述的诊断性测定系统。Where the identified gene is a normal or wild-type gene, the gene can be used to isolate mutant alleles of the gene. Such separation is preferred in processes and disorders known or suspected to have a genetic basis. Mutant alleles can be isolated from individuals known or suspected of having a genotype that promotes disease symptoms associated with an inflammatory or immune response. The mutant alleles and mutant allele products can then be used in the diagnostic assay systems described below.

例如，通过使用PCR-本领域技术人员众所周知的技术，可以分离突变体基因的cDNA。在这种情况下，通过将oligo-dT寡核苷酸与从已知或怀疑在推测携带突变体等位基因的个体中表达的组织中分离的mRNA杂交，并且通过使用逆转录酶延伸新链可以合成第一条cDNA链。然后使用与正常基因5′末端特异杂交的寡核苷酸合成cDNA的第二链。然后使用这2种引物，通过PCR扩增产物，将产物克隆入适当的载体，并通过本领域技术人员众所周知的方法对其进行DNA序列分析。通过将突变体基因和正常基因的DNA序列进行比较，能够确定负责突变体基因产物功能缺失或改变的突变。For example, cDNA of mutant genes can be isolated by using PCR - a technique well known to those skilled in the art. In this case, by hybridizing the oligo-dT oligonucleotide to mRNA isolated from tissues known or suspected to be expressed in individuals putatively carrying the mutant allele, and by extending the new strand using reverse transcriptase The first cDNA strand can be synthesized. The second strand of cDNA is then synthesized using an oligonucleotide that specifically hybridizes to the 5' end of the normal gene. The product was then amplified by PCR using these two primers, cloned into an appropriate vector, and subjected to DNA sequence analysis by methods well known to those skilled in the art. By comparing the DNA sequence of the mutant gene to the normal gene, the mutation responsible for the loss or altered function of the mutant gene product can be identified.

备选地，分别使用DNA或者RNA从怀疑或已知带有突变体等位基因的个体的已知或怀疑表达目的基因的组织中构建并筛选基因组文库或cDNA文库。正常基因或其任何适宜的片段可以被标记并用作探针以鉴定文库中的相应突变体等位基因。然后，通过本领域常规实践的方法将包含此基因的克隆纯化出来并进行如上所述的序列分析。Alternatively, genomic or cDNA libraries are constructed and screened using DNA or RNA, respectively, from tissues known or suspected to express the gene of interest in individuals suspected or known to carry the mutant allele. The normal gene, or any suitable fragment thereof, can be labeled and used as a probe to identify the corresponding mutant allele in the library. Clones containing this gene are then purified by methods routinely practiced in the art and subjected to sequence analysis as described above.

另外，利用从怀疑或已知带有突变体等位基因的个体的已知或怀疑表达目的基因的组织中分离的DNA或者合成的cDNA构建了表达文库。以该种方式，使用如下所述的标准抗体筛选技术结合针对正常基因产物产生的抗体，可以表达和筛选由推定的突变体组织产生的基因产物。(对于筛选技术见，例如Harlow，E.和Lane著，1988，“Antibodies：A LaboratoryManual”，Cold Spring Harbor Press，Cold Spring Harbor。)在突变导致具有改变功能(例如作为错义突变的结果)的表达基因产物的情况下，一组多克隆抗体可能与突变基因产物交叉反应。通过它们与此类标记抗体的反应所检测到的文库克隆能够进行纯化并如上所述进行序列分析。In addition, expression libraries are constructed using DNA isolated or cDNA synthesized from tissues known or suspected to express the gene of interest in individuals suspected or known to carry the mutant allele. In this manner, gene products produced by putative mutant tissues can be expressed and screened using standard antibody screening techniques as described below in conjunction with antibodies raised against the normal gene product. (For screening techniques see, e.g., Harlow, E., and Lane, 1988, "Antibodies: A Laboratory Manual," Cold Spring Harbor Press, Cold Spring Harbor.) In mutations that result in altered function (e.g., as a result of missense mutations), In cases where the gene product is expressed, a panel of polyclonal antibodies may cross-react with the mutated gene product. Library clones detected by their reaction with such labeled antibodies can be purified and sequenced as described above.

本发明包括由显示于任意SEQ ID NO：1、15、24、26、28、31序列中的核苷酸序列所编码的那些蛋白质或其片段。The present invention includes those proteins or fragments thereof encoded by the nucleotide sequences shown in any of the sequences of SEQ ID NO: 1, 15, 24, 26, 28, 31.

此外，本发明包括代表功能等价基因产物的蛋白质。此种等价差别表达基因产物可以包含在由上述差别表达基因序列所编码的氨基酸序列内的氨基酸残基缺失、添加或替代，但是这导致了沉默变化，从而产生功能等价差别表达基因产物。基于所涉及残基的极性、电荷、可溶性、疏水性、亲水性和/或两亲性的相似性，可以进行氨基酸替代。Furthermore, the present invention includes proteins representing functionally equivalent gene products. Such equivalent differentially expressed gene products may comprise deletions, additions or substitutions of amino acid residues within the amino acid sequences encoded by the differentially expressed gene sequences described above, but which result in silent changes resulting in functionally equivalent differentially expressed gene products. Amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or amphipathicity of the residues involved.

例如，非极性(疏水)氨基酸包括丙氨酸、亮氨酸、异亮氨酸、缬氨酸、脯氨酸、苯丙氨酸、色氨酸和蛋氨酸；极性中性氨基酸包括甘氨酸、丝氨酸、苏氨酸、半胱氨酸、酪氨酸、天冬酰胺和谷氨酰胺；正电荷(碱性)氨基酸包括精氨酸、赖氨酸和组氨酸；负电荷(酸性)氨基酸包括天冬氨酸和谷氨酸。如文中所使用，“功能等价”可以指能够与上述差别表达基因序列所编码的内源性差别表达基因产物呈现基本上相似的体内或体外活性的蛋白质或多肽。“功能等价”还可以指能够以与内源性差别表达基因产物将采用的方式基本上相似的方式与其他细胞或细胞外分子相互作用的蛋白质或多肽。例如，“功能等价”肽在免疫测定中将能够降低抗体与内源性蛋白质相应肽(即其氨基酸序列进行修饰以获得“功能等价”肽的肽)或内源性蛋白质自身的结合，其中产生了针对内源性蛋白质的相应肽的抗体。等摩尔浓度功能等价肽将减少上述相应肽的结合的至少约5％，优选地在约5％和10％之间，更加优选地在约10％和25％之间，甚至更加优选地在约25％和50％之间，并且最优选地在约40％和50％之间。For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, Serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; negatively charged (acidic) amino acids include Aspartic Acid and Glutamic Acid. As used herein, "functional equivalent" may refer to a protein or polypeptide capable of exhibiting substantially similar in vivo or in vitro activities to the endogenous differentially expressed gene product encoded by the above differentially expressed gene sequence. "Functionally equivalent" can also refer to proteins or polypeptides that are capable of interacting with other cells or extracellular molecules in a manner substantially similar to that an endogenous differentially expressed gene product would. For example, a "functionally equivalent" peptide will be able to reduce antibody binding in an immunoassay to the corresponding peptide of the endogenous protein (i.e., a peptide whose amino acid sequence has been modified to obtain a "functionally equivalent" peptide) or to the endogenous protein itself, Antibodies against the corresponding peptides of endogenous proteins are raised therein. An equimolar concentration of a functionally equivalent peptide will reduce the binding of the corresponding peptide above by at least about 5%, preferably between about 5% and 10%, more preferably between about 10% and 25%, even more preferably between Between about 25% and 50%, and most preferably between about 40% and 50%.

此处公开的数据表明特定多肽片段对于CREAP家族蛋白质的活性是关键的。对于CREAP1-3，这些区域特别是保守的氨基末端200个氨基酸和羧基末端100个氨基酸，每个区域作为几个保守的结构域。本发明特别优选的多肽是那些包含对应或包含于进化保守区域内的氨基酸序列，例如每种蛋白质的末端75个氨基酸；例如从氨基酸1-75的区域，更加具体而言是，CREAP1的氨基酸片段1-68、CREAP2的氨基酸片段1-74和CREAP3的氨基酸片段1-66。The data disclosed herein indicate that specific polypeptide fragments are critical for the activity of CREAP family proteins. For CREAP1-3, these regions are in particular the conserved amino-terminal 200 amino acids and carboxy-terminal 100 amino acids, each serving as several conserved domains. Particularly preferred polypeptides of the invention are those comprising an amino acid sequence corresponding to or contained within an evolutionarily conserved region, such as the terminal 75 amino acids of each protein; such as the region from amino acids 1-75, more specifically, an amino acid fragment of CREAP1 1-68, amino acid fragment 1-74 of CREAP2 and amino acid fragment 1-66 of CREAP3.

因此，这些CREAP肽片段和编码文中所公开的CREAP多肽活性部分的核酸片段，以及包含所述片段的载体也属于本发明的范围之内。如文中所使用，编码CREAP多肽活性部分的核酸片段是指具有比编码CREAP多肽全部氨基酸序列的核苷酸序列更少核苷酸的核苷酸序列，并且该序列编码具有文中所述CREAP蛋白质活性的肽(即具有CREAP蛋白质至少一种生物学活性的肽)。一般而言，编码具有CREAP蛋白质活性的肽的核酸将选自编码成熟蛋白质的碱基。然而，在一些情况下，可以期望从CREAP蛋白质的核酸的前导序列部分选择全部或部分肽。这些核酸还可以包括接头序列、修饰的限制性内切酶位点和用于分子克隆、表达或纯化或具有CREAP蛋白质至少一种生物学活性的重组肽的其他序列。按照常规方法可以获得CREAP肽片段和编码具有CREAP蛋白质活性的肽片段的核酸。Therefore, these CREAP peptide fragments and nucleic acid fragments encoding the active part of the CREAP polypeptide disclosed herein, as well as vectors comprising the fragments also fall within the scope of the present invention. As used herein, a nucleic acid fragment encoding an active part of a CREAP polypeptide refers to a nucleotide sequence having fewer nucleotides than the nucleotide sequence encoding the entire amino acid sequence of a CREAP polypeptide, and the sequence encodes a protein having the activity of the CREAP protein described herein. A peptide (ie, a peptide having at least one biological activity of a CREAP protein). Generally, the nucleic acid encoding a peptide having CREAP protein activity will be selected from the bases encoding the mature protein. However, in some cases it may be desirable to select all or part of the peptides from the leader sequence portion of the nucleic acid of the CREAP protein. These nucleic acids may also include linker sequences, modified restriction enzyme sites, and other sequences useful for molecular cloning, expression, or purification, or for recombinant peptides having at least one biological activity of the CREAP protein. CREAP peptide fragments and nucleic acids encoding peptide fragments having CREAP protein activity can be obtained according to conventional methods.

此外，针对这些肽片段的抗体可以如上文所描述进行制备。还可以使用可以增加这些肽片段的肽免疫原性的修饰(例如氨基酸替换)。同样，使用本领域技术人员所熟悉的方法，可以修饰CREAP蛋白质所述多肽以含有信号或前导序列或者偶联至接头或其他序列以容易进行分子操作。In addition, antibodies against these peptide fragments can be prepared as described above. Modifications (such as amino acid substitutions) that may increase the peptide immunogenicity of these peptide fragments may also be used. Likewise, using methods familiar to those skilled in the art, the polypeptides of the CREAP proteins can be modified to contain signal or leader sequences or to be coupled to linkers or other sequences for ease of molecular manipulation.

使用本领域众所周知的技术，通过重组DNA技术可以产生本发明的多肽。因此，这里提供了产生本发明多肽的方法，该方法包括在足以在宿主细胞中表达所述多肽的条件下培养其中整合入表达载体的宿主细胞，所述表达载体含有编码包含显示于SEQ ID NO：2、16、25、27、29和30，优选地为SEQ ID NO 2、16和25的氨基酸序列的外源来源多核苷酸，从而引起所表达多肽的产生。任选地，所述方法进一步包括回收由所述细胞产生的多肽。在此种方法优选的实施方案中，所述外源来源多核苷酸编码由显示于SEQ ID NO：2、16、25、27、29和30的氨基酸序列组成的多肽。优选地，所述外源来源多核苷酸包含显示于SEQ ID NO：1、15、24、26、28和31中任意一个的核苷酸序列。Polypeptides of the invention can be produced by recombinant DNA technology using techniques well known in the art. Accordingly, there is provided a method of producing a polypeptide of the invention comprising culturing a host cell into which is incorporated an expression vector comprising a protein encoding the polypeptide comprising the polypeptide shown in SEQ ID NO under conditions sufficient to express the polypeptide in the host cell. : 2, 16, 25, 27, 29 and 30, preferably an exogenous source polynucleotide of the amino acid sequence of SEQ ID NO 2, 16 and 25, thereby causing the production of the expressed polypeptide. Optionally, the method further comprises recovering the polypeptide produced by the cells. In preferred embodiments of such methods, said polynucleotide of exogenous origin encodes a polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 2, 16, 25, 27, 29 and 30. Preferably, the exogenous source polynucleotide comprises any one of the nucleotide sequences shown in SEQ ID NO: 1, 15, 24, 26, 28 and 31.

因此，文中描述了通过表达编码各个多肽序列的核酸制备本发明多肽和肽的方法。本领域技术人员众所周知的方法能够用于构建包含蛋白质编码序列和适当转录/翻译控制信号的表达载体。这些方法包括例如体外重组DNA技术、合成技术和体内重组/遗传重组。参见，例如描述于Sambrook等人，1989，上述和Ausubel等人，1989，上述中的技术。备选地，使用例如合成仪可以化学合成能够编码差别表达基因蛋白质序列的RNA。参见例如描述于″Oligonucleotide Synthesis″，1984，Gait，M.J.著，IRL Press，Oxford的技术，该文献在文中整体引用作为参考。Thus, described herein are methods of making polypeptides and peptides of the invention by expressing nucleic acids encoding the respective polypeptide sequences. Methods well known to those skilled in the art can be used to construct expression vectors containing protein coding sequences and appropriate transcriptional/translational control signals. These methods include, for example, in vitro recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic recombination. See, eg, the techniques described in Sambrook et al., 1989, supra and Ausubel et al., 1989, supra. Alternatively, RNAs capable of encoding protein sequences of differentially expressed genes can be chemically synthesized using, for example, a synthesizer. See, eg, the technique described in "Oligonucleotide Synthesis", 1984, by Gait, M.J., IRL Press, Oxford, which is hereby incorporated by reference in its entirety.

多种宿主-表达载体系统可以用于表达编码本发明序列的差别表达基因。此类宿主-表达系统不仅代表可以产生和随后纯化目的编码序列的运载体，还代表当用适当核苷酸编码序列转化或转染时可以原位呈现本发明差别表达基因蛋白质的细胞。这些表达系统包括但不限于诸如用含有差别表达基因蛋白质编码序列的重组噬菌体DNA、质粒DNA或粘粒DNA表达载体转化的微生物如细菌(例如大肠杆菌、枯草芽孢杆菌(B.subtilis))；用含有差别表达基因蛋白质编码序列的重组酵母表达载体转化的酵母(例如酵母属(Saccharomyces)、毕赤酵母属(Pichia))；用含有差别表达基因蛋白质编码序列的重组病毒表达载体(例如杆状病毒)感染或转染的昆虫细胞系统；用含有蛋白质编码序列的重组病毒表达载体(例如花椰菜花叶病毒，CaMV；烟草花叶病毒，TMV)感染或用重组载体，包括含有蛋白质编码序列的质粒(例如Ti质粒)转化的植物细胞系统；或者包含含有来自哺乳动物细胞基因组启动子(例如金属硫蛋白启动子)或哺乳动物病毒启动子(例如腺病毒晚期启动子；痘苗病毒7.5K启动子或CMV启动子)的重组表达构建体的哺乳动物细胞系统(例如COS、CHO、BHK、293、3T3)。A variety of host-expression vector systems can be used to express differentially expressed genes encoding sequences of the invention. Such host-expression systems represent not only vehicles that can produce and subsequently purify the coding sequence of interest, but also cells that can present differentially expressed gene proteins of the invention in situ when transformed or transfected with the appropriate nucleotide coding sequence. These expression systems include but are not limited to microorganisms such as bacteria (such as Escherichia coli, Bacillus subtilis (B.subtilis)) transformed with recombinant phage DNA, plasmid DNA or cosmid DNA expression vectors that contain differentially expressed gene protein coding sequences; Yeast transformed with recombinant yeast expression vectors containing differentially expressed gene protein coding sequences (such as Saccharomyces, Pichia); recombinant virus expression vectors containing differentially expressed gene protein coding sequences (such as baculovirus ) infection or transfection of insect cell systems; infection with recombinant viral expression vectors containing protein coding sequences (e.g. cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with recombinant vectors, including plasmids containing protein coding sequences ( For example, a plant cell system transformed with a Ti plasmid); or a plant cell system containing a promoter from a mammalian cell genome (such as a metallothionein promoter) or a mammalian viral promoter (such as an adenovirus late promoter; a vaccinia virus 7.5K promoter or a CMV promoter) for recombinant expression constructs in mammalian cell systems (eg COS, CHO, BHK, 293, 3T3).

由细胞还能够从细胞天然CREAP编码基因进行本发明CREAP蛋白质的表达。此类表达的方法详述于例如美国专利5,641,670；5,733,761；5,968,502和5,994,127，所有专利作为整体在文中特别引用作为参考。已经通过任意一个美国专利5,641,670；5,733,761；5,968,502和5,994,127的方法诱导表达CREAP的细胞能够植入活动物体预期组织内以增加该组织中CREAP的局部浓度。对于例如发生CREB功能缺失的神经变性疾病，此类方法具有治疗意义，并且同样激动剂和/或外源CREAP蛋白质可以用于预防、治疗或缓解所述疾病。Expression of the CREAP protein of the invention can also be performed by the cell from the cell's native CREAP-encoding gene. Methods of such expression are described in detail in, for example, US Patent Nos. 5,641,670; 5,733,761; 5,968,502 and 5,994,127, all of which are expressly incorporated by reference herein in their entirety. Cells expressing CREAP have been induced by the methods of any one of US Pat. Nos. 5,641,670; 5,733,761; 5,968,502 and 5,994,127 to be implanted into desired tissue of a living subject to increase the local concentration of CREAP in that tissue. Such methods have therapeutic implications for eg neurodegenerative diseases in which loss of CREB function occurs, and likewise agonists and/or exogenous CREAP proteins can be used to prevent, treat or ameliorate said diseases.

在细菌系统中，取决于进行表达的蛋白质的用途可以对许多表达载体进行有利地选择。例如，当产生大量此种蛋白质时，对于抗体的产生或者筛选肽库，可以期望指导易于纯化的高水平融合蛋白质产物表达的载体。在这方面，可以使用如许多出售商(例如Qiagen、Valencia、CA)所提供的包含六组氨酸标签的融合蛋白质(Sisk et alk，1994：J.Virol 68：766-775)。此类载体包括但不限于大肠杆菌表达载体pUR278(Ruther等人，1983，EMBO J.2：1791)，其中蛋白质编码序列可以单独连接入载体与lac Z编码区处于读码框内，从而产生融合蛋白质；pIN载体(Inouye和Inouye，1985，Nucleic Acids Res.13：3101-3109；Van Heeke和Schuster，1989，J.Biol.Chem.264：5503-5509)等等。pGEX载体还可以用于表达作为与谷胱甘肽S-转移酶(GST)的融合蛋白质的外源多肽。一般而言，此类融合蛋白质是可溶的并且通过吸附至谷胱甘肽琼脂糖珠子并随后存在游离谷胱甘肽时进行洗脱能够容易地从裂解细胞进行纯化。pGEX载体设计为包含凝血酶或因子Xa蛋白酶切割位点以至于能够从GST部分释放所克隆的靶基因蛋白质。In bacterial systems, a number of expression vectors can be advantageously selected depending on the use of the protein to be expressed. For example, when large quantities of such proteins are produced, for antibody production or for screening peptide libraries, vectors directing the expression of high-level fusion protein products that are readily purified may be desirable. In this regard, fusion proteins comprising a hexahistidine tag (Sisk et alk, 1994: J. Virol 68: 766-775) can be used as supplied by a number of vendors (eg Qiagen, Valencia, CA). Such vectors include, but are not limited to, the Escherichia coli expression vector pUR278 (Ruther et al., 1983, EMBO J.2: 1791), in which the protein coding sequence can be individually ligated into the vector and the lac Z coding region in reading frame, thereby generating a fusion protein; pIN vector (Inouye and Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke and Schuster, 1989, J. Biol. Chem. 264:5503-5509) and the like. The pGEX vector can also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins are soluble and can be easily purified from lysed cells by adsorption to glutathione sepharose beads followed by elution in the presence of free glutathione. The pGEX vector is designed to contain a thrombin or Factor Xa protease cleavage site to enable release of the cloned target gene protein from the GST moiety.

使用包含缺乏启动子区域的报道基因转录单位的载体能够从任意期望的基因选择启动子区域，所述报道基因转录单位为例如氯霉素乙酰转移酶(“CAT”)、或萤光素酶转录单位，位于用于导入候选启动子片段，即可以包含启动子的片段的单个或多个限制位点的下游。例如，在CAT基因上游限制位点将包含启动子的片段导入载体引起产生CAT活性，通过标准CAT测定法能够检测该活性。适于该目的的载体是众所周知的并且容易获得。2种此类载体是pKK232-8和pCM7。因此，用于本发明多核苷酸表达的启动子不但包括众所周知且易于获得的启动子，还包括使用报道基因通过上述技术可以容易获得的启动子。A promoter region can be selected from any desired gene using a vector comprising a reporter gene transcription unit lacking a promoter region, such as chloramphenicol acetyltransferase ("CAT"), or luciferase transcription unit, located downstream of single or multiple restriction sites for the introduction of candidate promoter fragments, ie, fragments that may contain a promoter. For example, introduction of a fragment comprising a promoter into a vector at a restriction site upstream of the CAT gene results in CAT activity, which can be detected by standard CAT assays. Vectors suitable for this purpose are well known and readily available. Two such vectors are pKK232-8 and pCM7. Therefore, the promoters used for expression of the polynucleotide of the present invention include not only well-known and readily available promoters, but also promoters that can be easily obtained by the above-mentioned techniques using a reporter gene.

适于表达根据本发明的多核苷酸和多肽的已知细菌启动子包括大肠杆菌lacI和lacZ启动子、T3和T7启动子、T5tac启动子、λPR、PL启动子和trp启动子。已知适于该方面的真核启动子包括CMV立即早期启动子、HSV胸苷激酶启动子、早期和晚期SV40启动子、逆转录病毒LTR启动子，例如劳斯肉瘤病毒(Rous sarcoma virus)(“RSV”)的那些启动子、和金属硫蛋白启动子，例如小鼠金属硫蛋白-I启动子。Known bacterial promoters suitable for expressing polynucleotides and polypeptides according to the invention include the E. coli lacI and lacZ promoters, T3 and T7 promoters, T5tac promoter, lambda PR, PL promoter and trp promoter. Eukaryotic promoters known to be suitable in this regard include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, retroviral LTR promoters, such as Rous sarcoma virus (Rous sarcoma virus) ( "RSV"), and metallothionein promoters, such as the mouse metallothionein-I promoter.

在昆虫系统中，苜蓿银纹夜蛾核型多角体病毒(Autographa californicanuclear polyhedrosis virus)(AcNPV)是能够用作载体以表达外源基因的几种昆虫系统之一。该病毒生长于草地夜蛾(Spodoptera frugiperda)细胞。编码序列可以单独克隆入病毒非必需区(例如多角体蛋白基因)并置于AcNPV启动子(例如多角体蛋白启动子)控制之下。编码序列的成功插入将导致多角体蛋白基因失活和非封闭重组病毒(即缺乏多角体蛋白基因所编码的蛋白质外壳的病毒)的产生。然后这些重组病毒用于感染草地夜蛾细胞，在其中表达所插入的基因(例如，参见Smith等人，1983，J.Virol.46：584；Smith，美国专利号4,215,051)。Among insect systems, Autographa californica nuclear polyhedrosis virus (AcNPV) is one of several insect systems that can be used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. The coding sequence can be cloned separately into non-essential regions of the virus (eg, the polyhedrin gene) and placed under the control of an AcNPV promoter (eg, the polyhedrin promoter). Successful insertion of the coding sequence will result in inactivation of the polyhedrin gene and production of non-occluded recombinant virus (ie, a virus lacking the protein coat encoded by the polyhedrin gene). These recombinant viruses are then used to infect Spodoptera cells in which the inserted gene is expressed (eg, see Smith et al., 1983, J. Virol. 46:584; Smith, US Patent No. 4,215,051).

在哺乳动物宿主细胞中，可以利用许多基于病毒的表达系统。在腺病毒用作表达载体的情况下，将目的编码序列连接到腺病毒转录/翻译控制复合体中，例如晚期启动子和三联前导序列。然后通过体外或体内重组将这种嵌合基因插入到腺病毒的基因组中。插入到病毒基因组的非必需区(如E1区或E3区)会导致产生有活力的并且能够在感染的宿主细胞内表达目的蛋白质的重组病毒(例如，见Logan和Shenk，1984，Proc.Natl.Acad.Sci.USA 81：3655-3659)。为了有效翻译所插入的基因编码序列还需要特异的起始信号。这些信号包括ATG起始密码子和邻近的序列。在包括其自身的起始密码子和邻近序列的完整的基因插入到适当的表达载体时的情况下，则可以不需要额外的翻译控制信号。然而，在只有部分基因编码序列被插入的情况下，则必须提供外源的翻译控制信号，包括ATG起始密码子。而且，起始密码子必须与目的编码序列阅读框同相以保证完整插入片段的翻译。这些外源翻译控制信号和起始密码子可以有多种来源，可以是天然的和合成的。通过包含适当的转录增强子元件、转录终止子等可以增强表达的效率(见Bittner等，1987，Methods in Enzymol.153：516-544)。其它的常用系统是基于SV40、逆转录病毒或腺伴随病毒的。对适当载体和用于在宿主细胞内表达的启动子的选择是众所周知的过程并且用于载体构建、载体导入宿主和在宿主本身内进行表达的必需技术是本领域的常规技术。通常，重组表达载体会包括复制起点、源于高表达基因的用于指导下游结构序列转录的启动子、和用于在暴露于载体后将包含载体的细胞分离出来的选择标记。In mammalian host cells, a number of viral-based expression systems are available. Where adenovirus is used as the expression vector, the coding sequence of interest is ligated into the adenoviral transcriptional/translational control complex, such as the late promoter and tripartite leader sequence. This chimeric gene is then inserted into the genome of the adenovirus by in vitro or in vivo recombination. Insertion into non-essential regions of the viral genome (such as the El or E3 regions) will result in the production of recombinant viruses that are viable and capable of expressing the protein of interest in infected host cells (see, for example, Logan and Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:3655-3659). Specific initiation signals are also required for efficient translation of the inserted gene coding sequence. These signals include the ATG initiation codon and adjacent sequences. In cases where the complete gene, including its own initiation codon and adjacent sequences, is inserted into an appropriate expression vector, no additional translational control signals may be required. However, in cases where only part of the gene coding sequence is inserted, exogenous translational control signals must be provided, including the ATG initiation codon. Furthermore, the initiation codon must be in phase with the reading frame of the coding sequence of interest to ensure translation of the complete insert. These exogenous translational control signals and initiation codons can come from a variety of sources, both natural and synthetic. The efficiency of expression can be enhanced by inclusion of appropriate transcriptional enhancer elements, transcriptional terminators, etc. (see Bittner et al., 1987, Methods in Enzymol. 153:516-544). Other commonly used systems are based on SV40, retrovirus or adeno-associated virus. Selection of appropriate vectors and promoters for expression in host cells is a well-known procedure and the necessary techniques for vector construction, introduction of the vector into the host and expression in the host itself are routine in the art. Typically, a recombinant expression vector will include an origin of replication, a promoter derived from a highly expressed gene to direct transcription of downstream structural sequences, and a selectable marker to isolate cells containing the vector after exposure to the vector.

此外，可以选择能够调节所插入序列表达的或者以特异的目的方式修饰和加工基因产物的宿主细胞株。蛋白质产物的此类修饰(例如糖基化)和加工(例如切割)对于蛋白质的功能是重要的。不同的宿主细胞具有用于蛋白质翻译后加工和修饰的特征性的和特异的机制。可以选择适当的细胞系或宿主系统以保证对所表达的外来蛋白质进行正确的修饰和加工。为了实现该目的，可以使用具有对初级转录物进行正确加工、对基因产物进行糖基化和磷酸化的细胞机器的真核宿主细胞。此类哺乳动物宿主细胞包括但不限于CHO、VERO、BHK、HeLa、COS、MDCK、293、3T3、WI38等。In addition, it is possible to select host cell strains which are capable of modulating the expression of the inserted sequences or of modifying and processing the gene products in a specific, purposeful manner. Such modification (eg, glycosylation) and processing (eg, cleavage) of protein products are important for protein function. Different host cells have characteristic and specific mechanisms for post-translational processing and modification of proteins. Appropriate cell lines or host systems can be chosen to ensure correct modification and processing of the foreign protein expressed. For this purpose, eukaryotic host cells with the cellular machinery for proper processing of primary transcripts, glycosylation and phosphorylation of gene products can be used. Such mammalian host cells include, but are not limited to, CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and the like.

本发明还包括具有CREAP蛋白质活性的重组CREAP肽和肽片段。术语“重组蛋白质”指通过重组技术产生的本发明的蛋白质，其中通常将编码CREAP活性片段的DNA插入到适宜表达载体中，所述表达载体用于转化宿主细胞以产生异源蛋白质。具体而言，具有CREAP蛋白质活性的重组肽片段包括包含CREAP1、2或3的保守的氨基末端200个氨基酸或者羧基末端100个氨基酸的CREAP蛋白质片段。所述的片段包括CREAP1的氨基酸片段1-267和575-650，CREAP2的氨基酸片段1-280和615-693和CREAP3的氨基酸片段1-279和545-619以及上面所讨论的包含人CREAP1-3中氨基酸1-75的区域的片段。The present invention also includes recombinant CREAP peptides and peptide fragments having CREAP protein activity. The term "recombinant protein" refers to the protein of the present invention produced by recombinant technology, wherein usually the DNA encoding the active fragment of CREAP is inserted into a suitable expression vector, which is used to transform the host cell to produce the heterologous protein. Specifically, the recombinant peptide fragments having CREAP protein activity include CREAP protein fragments comprising the conserved amino-terminal 200 amino acids or carboxy-terminal 100 amino acids of CREAP1, 2 or 3. Said fragments include amino acid fragments 1-267 and 575-650 of CREAP1, amino acid fragments 1-280 and 615-693 of CREAP2 and amino acid fragments 1-279 and 545-619 of CREAP3 and the human CREAP1-3 as discussed above. A fragment of the region of amino acids 1-75.

本发明的重组蛋白质还可以包括根据本领域技术人员熟悉的技术可以得到的CREAP和不同多肽的嵌合蛋白或融合蛋白质(见，例如CurrentProtocols in Molecular Biology；Ausubel等著，John Wiley & Sons；1992；PNAS 85：4879(1988))。The recombinant proteins of the present invention may also include chimeric or fusion proteins of CREAP and different polypeptides available according to techniques familiar to those skilled in the art (see, for example, Current Protocols in Molecular Biology; Ausubel et al., John Wiley &Sons;1992; PNAS 85: 4879 (1988)).

为了长期高产量的产生重组蛋白质，优选稳定表达。例如，可以对稳定表达差别表达的基因蛋白质的细胞系进行工程改造。不是使用包含病毒复制起点的表达载体，而是用适当表达控制元件(例如启动子、增强子、序列、转录终止子、多聚腺苷酸位点等)控制的DNA和选择标记转化宿主细胞。在外来DNA导入之后，将工程改造的细胞在富集培养基中生长1-2天然后转入至选择培养基中。重组质粒中的选择标记赋予抗选择的抗性并且允许细胞稳定整合质粒到其染色体中并且允许细胞生长形成转化灶，将所述的细胞转化灶进行克隆并扩大培养成细胞系。该方法可以有利地用于改造表达差别表达基因蛋白质的细胞系。此类工程改造细胞系对于筛选和评价能够影响所表达蛋白质的内源活性的化合物是特别有用的。For long-term high-yield production of recombinant proteins, stable expression is preferred. For example, cell lines can be engineered to stably express proteins of differentially expressed genes. Instead of using an expression vector containing a viral origin of replication, host cells are transformed with DNA and selectable markers controlled by appropriate expression control elements (eg, promoters, enhancers, sequences, transcription terminators, polyadenylation sites, etc.). Following foreign DNA introduction, engineered cells are grown in enriched media for 1-2 days and then transferred to selective media. The selectable marker in the recombinant plasmid confers resistance against selection and allows the cell to stably integrate the plasmid into its chromosome and allow the cell to grow to form foci, which are cloned and expanded into cell lines. This method can be advantageously used to engineer cell lines expressing proteins of differentially expressed genes. Such engineered cell lines are particularly useful for screening and evaluating compounds capable of affecting the endogenous activity of expressed proteins.

可以使用许多选择系统，包括但不限于单纯疱疹病毒胸苷激酶(Wigler等，1977，Cell 11：223)、次黄嘌呤-鸟嘌呤磷酸核糖转移酶(Szybalska和Szybalski，1962，Proc.Natl.Acad.Sci.USA 48：2026)和腺嘌呤磷酸核糖转移酶(Lowy等，1980，Cell 22：817)基因可以分别在tk^-、hgprt^-或aprt^-细胞中采用。还可以使用抗代谢物抗性作为用于赋予氨甲喋呤抗性的dhfr(Wigler等，1980，Natl.Acad.Sci.USA 77：3567；O′Hare等，1981，Proc.Natl.Acad.Sci.USA 78：1527)、赋予对霉酚酸抗性的gpt(Mulligan和Berg，1981，Proc.Natl.Acad.Sci.USA 78：2072)、赋予对氨基糖苷G-418抗性的neo(Colberre-Garapin等，1981，J.Mol.Biol.150：1)和赋予对潮霉素抗性的hygro(Santerre等，1984，Gene 30：147)选择的基础。A number of selection systems can be used, including but not limited to herpes simplex virus thymidine kinase (Wigler et al., 1977, Cell 11:223), hypoxanthine-guanine phosphoribosyltransferase (Szybalska and Szybalski, 1962, Proc. Natl. Acad. .Sci.USA 48:2026) and adenine phosphoribosyltransferase (Lowy et al., 1980, Cell 22:817) genes can be employed in tk ^- , hgprt ^- or aprt ^- cells, respectively. Antimetabolite resistance can also be used as dhfr for conferring resistance to methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 77:3567; O'Hare et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527), gpt (Mulligan and Berg, 1981, Proc.Natl.Acad.Sci.USA 78:2072) conferring resistance to mycophenolic acid, neo (Colberre-Garapin et al., 1981, J. Mol. Biol. 150:1) and hygro (Santerre et al., 1984, Gene 30:147) selection basis for conferring resistance to hygromycin.

一种备选的融合蛋白质表达系统允许对人细胞系中表达的非变性的融合蛋白质容易地进行纯化(Janknecht等，1991，Proc.Natl.Acad.Sci.USA88：8972-8976)。在该系统中，目的基因被亚克隆进入痘苗重组质粒中以致于基因的可读框在翻译水平上融合到由6个组氨酸残基组成的氨基末端标签上。将重组痘苗病毒感染的细胞的提取物加到Ni²⁺氮川乙酸-琼脂糖柱子上并且具有组氨酸标签的蛋白质可以用含咪唑的缓冲液选择性洗脱下来。An alternative fusion protein expression system allows facile purification of native fusion proteins expressed in human cell lines (Janknecht et al., 1991, Proc. Natl. Acad. Sci. USA 88:8972-8976). In this system, the gene of interest is subcloned into a vaccinia recombinant plasmid such that the open reading frame of the gene is translationally fused to an amino-terminal tag consisting of six histidine residues. Extracts from recombinant vaccinia virus-infected cells were applied to Ni ²⁺ nitriloacetic acid-agarose columns and proteins with histidine tags could be selectively eluted with imidazole-containing buffer.

当将本发明的蛋白质用作分析系统，如下面所述的那些分析系统中的组分时，本发明的蛋白质可以直接或间接标记以利于检测蛋白质和测试物质之间所形成的复合体。可以使用的多种适宜标记系统的任何一种包括但不限于放射性同位素如125I、当暴露于底物时可以产生可检测量热信号或光的酶标记系统、和荧光标记。When the protein of the present invention is used as a component in an analytical system such as those described below, the protein of the present invention may be directly or indirectly labeled to facilitate detection of the complex formed between the protein and the test substance. Any of a variety of suitable labeling systems that may be used include, but are not limited to, radioisotopes such as125I, enzymatic labeling systems that produce a detectable calorimetric signal or light when exposed to a substrate, and fluorescent labels.

当重组DNA技术用来产生用于此类分析系统的本发明的蛋白质时，可以有利地改造融合蛋白以利于标记、固定化、检测和/或分离。When recombinant DNA techniques are used to generate proteins of the invention for use in such assay systems, fusion proteins may advantageously be engineered to facilitate labelling, immobilization, detection and/or isolation.

间接标记包括使用能够特异结合本发明多肽的蛋白质，如标记抗体。此类抗体包括但不限于多克隆抗体、单克隆抗体、嵌合抗体、单链抗体、Fab片段和由Fab表达文库产生的片段。Indirect labeling involves the use of proteins capable of specifically binding the polypeptide of the invention, such as labeled antibodies. Such antibodies include, but are not limited to, polyclonal antibodies, monoclonal antibodies, chimeric antibodies, single chain antibodies, Fab fragments and fragments produced by a Fab expression library.

文中还预期文中公开的CREAP蛋白质是有用的药物靶标，其中所述的药物靶标可用于开发用于治疗与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况的治疗剂。此类状况包括但不限于骨关节炎、牛皮癣、气喘、COPD、银屑病、气喘、类风湿关节炎、癌症、病理性血管发生、糖尿病、高血压、慢性疼痛和其它炎症性和自身免疫病以及神经变性疾病，例如阿尔茨海默氏病、帕金森氏病和亨廷顿病。It is also contemplated that the CREAP proteins disclosed herein are useful drug targets for the development of therapeutic agents for the treatment of pathological conditions associated with aberrant activation of CRE-dependent gene expression or aberrant activation of chemokines. Such conditions include, but are not limited to, osteoarthritis, psoriasis, asthma, COPD, psoriasis, asthma, rheumatoid arthritis, cancer, pathological angiogenesis, diabetes, hypertension, chronic pain, and other inflammatory and autoimmune diseases and neurodegenerative diseases such as Alzheimer's, Parkinson's and Huntington's diseases.

除了趋化因子外，资料还表明CREAP蛋白质能够诱导其它基因例如PEPCK和双调蛋白。双调蛋白是一种与癌症相关的EGF样生长因子。PEPCK是葡萄糖的合成中一种限制性因子并且同样是糖原异生所需的，通常认为对PEPCK的阻断是治疗糖尿病的治疗方法。同样，文中还预期能够通过本发明的调节子治疗的病理性状况包括与这些蛋白质的异常活性或表达相关的状况。In addition to chemokines, data also indicate that CREAP proteins can induce other genes such as PEPCK and amphiregulin. Amphiregulin is an EGF-like growth factor associated with cancer. PEPCK is a limiting factor in the synthesis of glucose and is also required for gluconeogenesis, and blockade of PEPCK is generally considered a therapeutic approach for the treatment of diabetes. Likewise, pathological conditions that can be treated by the modulators of the invention are also contemplated herein to include conditions associated with aberrant activity or expression of these proteins.

另一方面，本发明涉及鉴定对于治疗、预防或改善上面讨论的病理状况有用的调节子的方法，所述方法包括：a)测定候选调节子体外、先体外后体内(ex vivo)或体内抑制或增强CREAP活性和/或抑制或增强CREAP表达的能力和进一步包括b)测定所鉴定出来的CREAP调节子逆转在所述病理状况的体外、先体外后体内或体内模型中和/或在对具有所述病理状况的患者的临床研究中所观察到的病理影响的能力。In another aspect, the present invention relates to a method of identifying modulators useful for the treatment, prevention or amelioration of the above-discussed pathological conditions, said method comprising: a) determining the inhibition of a candidate modulator in vitro, ex vivo or in vivo Or the ability to enhance CREAP activity and/or inhibit or enhance CREAP expression and further comprising b) assaying the identified CREAP regulator reversal in an in vitro, ex vivo or in vivo model of said pathological condition and/or in the presence of The ability to observe pathological effects in clinical studies of patients with the pathological condition.

可以用常规的筛选测定法(例如体外、先体外后体内和体内)鉴定能够抑制或增强CREAP蛋白质活性和/或抑制或增强CREAP表达的调节子。通过常规测定方法使用源于受试者的生物样本可以测定受试者体内的CREAP活性和CREAP水平。使用本领域技术人员熟悉的方法包括例如常规Northern分析或商业可得的微阵列也可以确定CREAP的基因表达(例如mRNA水平)。此外，使用ELISA基于抗体的测定法或荧光标记反应测定法可以检测测试化合物对CREAP水平和/或相关调节蛋白质水平的影响。这些技术容易地用于高通量筛选并且是本领域技术人员所熟悉的。Modulators capable of inhibiting or enhancing CREAP protein activity and/or inhibiting or enhancing CREAP expression can be identified using routine screening assays (eg, in vitro, ex vivo, and in vivo). CREAP activity and CREAP levels in a subject can be determined by routine assay methods using a biological sample derived from the subject. Gene expression (eg, mRNA levels) of CREAP can also be determined using methods familiar to those skilled in the art, including, for example, conventional Northern analysis or commercially available microarrays. In addition, the effect of test compounds on CREAP levels and/or levels of related regulatory proteins can be detected using ELISA antibody-based assays or fluorescent labeling reaction assays. These techniques are readily used for high-throughput screening and are familiar to those skilled in the art.

从这些研究中收集到的资料可以用于鉴定对治疗上面所讨论的病理状况具有治疗有效性的那些调节子，例如根据常规的方法在所述病理状况的常规体外或体内模型中和/或在对具有所述病理状况的人的临床试验中对抑制性物质进行进一步测定以便评估所述化合物体内治疗、预防或改善所述病理状况的能力。The data gathered from these studies can be used to identify those modulators that are therapeutically useful for the treatment of the pathological conditions discussed above, e.g., according to conventional methods in conventional in vitro or in vivo models of said pathological conditions and/or in Inhibitory substances are further assayed in clinical trials in humans with the pathological condition in order to assess the ability of the compound to treat, prevent or ameliorate the pathological condition in vivo.

通过应用关于CREAP多肽的活性部分的关键信息，本发明使得能够通过采用本领域技术人员熟悉的合理药物设计来研发具有CREAP功能的调节子，例如小分子激动剂或拮抗剂。By applying key information about the active portion of a CREAP polypeptide, the present invention enables the development of modulators of CREAP function, such as small molecule agonists or antagonists, by employing rational drug design familiar to those skilled in the art.

另一方面，本发明涉及预防、治疗或改善文中所述病理状况的方法，所述的方法包括对需要其的受试者施用包含有效量CREAP调节子的药物组合物。此类调节子包括针对CREAP多肽或其片段的抗体。在一些特别优选的实施方案中，药物组合物包含对人CREAP多肽或人CREAP多肽的部分具有高度选择性的抗体。针对CREAP蛋白质的抗体能够引起该蛋白质在受试者体内聚集并且因此抑制或降低该蛋白质的活性。此类抗体还可以抑制或降低CREAP活性，例如通过与活性位点直接相互作用或阻止物质接近活性位点实现所述抑制或降低。CREAP抗体还可以通过阻止蛋白质-蛋白质的相互作用而用于抑制CREAP活性，所述的蛋白质-蛋白质相互作用可能参与CREAP蛋白质调节并且对于蛋白质的活性是必需的。根据本领域技术人员熟悉的标准测定方法可以产生和鉴定如文中所述具有抑制活性的抗体。In another aspect, the present invention relates to a method of preventing, treating or ameliorating the pathological conditions described herein, said method comprising administering to a subject in need thereof a pharmaceutical composition comprising an effective amount of a CREAP modulator. Such modulators include antibodies directed against CREAP polypeptides or fragments thereof. In some particularly preferred embodiments, the pharmaceutical composition comprises an antibody that is highly selective for a human CREAP polypeptide or a portion of a human CREAP polypeptide. Antibodies against a CREAP protein are capable of causing aggregation of the protein in a subject and thereby inhibiting or reducing the activity of the protein. Such antibodies may also inhibit or reduce CREAP activity, eg, by directly interacting with the active site or preventing access of substances to the active site. CREAP antibodies can also be used to inhibit CREAP activity by preventing protein-protein interactions that may be involved in CREAP protein regulation and are necessary for the protein's activity. Antibodies having inhibitory activity as described herein can be produced and identified according to standard assays familiar to those skilled in the art.

CREAP抗体还可以用于诊断。例如可以根据常规的方法使用这些抗体以便定量受试者体内的CREAP蛋白质水平；例如提高的水平表明CRE-依赖性基因表达的过度激活(例如在其启动子区域具有CRE的基因的激活)并且可能表明过度激活的程度和相关病理状况的相应的严重程度。因而，不同的CREAP水平可以指示与异常的CRE-依赖性基因表达或趋化因子异常活化相关的病理状况的多种临床形式或严重程度。此种信息对于鉴定患有病理状况的患者的亚群是有用的，其中所述的病理状况可能应答或不应答用CREAP调节子进行的治疗。CREAP antibodies can also be used for diagnosis. For example, these antibodies can be used according to conventional methods in order to quantify CREAP protein levels in a subject; for example, increased levels indicate overactivation of CRE-dependent gene expression (eg, activation of genes with CRE in their promoter regions) and possibly Indicates the degree of hyperactivation and the corresponding severity of the associated pathological condition. Thus, different CREAP levels can be indicative of various clinical forms or severity of pathological conditions associated with aberrant CRE-dependent gene expression or aberrant activation of chemokines. Such information is useful for identifying subpopulations of patients with pathological conditions that may or may not be responsive to treatment with a CREAP modulator.

文中还预期监测CREAP水平或活性和/或检测CREAP表达(mRNA水平)可以用作临床试验过程的一部分，例如以确定给定治疗方案的功效。例如，对已经施用药物的患者进行评估并且临床医生可以鉴定出CREAP水平、活性和/或表达水平高于期望值的那些患者(即水平高于或低于没有经历相关疾病状态的对照患者或者通过临床干预疾病状态已充分缓和的患者的水平)。基于这些资料，医生于是调整剂量、施用方案或所开药物的类型。It is also contemplated herein that monitoring CREAP levels or activity and/or detecting CREAP expression (mRNA levels) can be used as part of a clinical trial process, eg, to determine the efficacy of a given treatment regimen. For example, patients who have been administered the drug are evaluated and the clinician can identify those patients with CREAP levels, activity and/or expression levels higher than expected (i.e. levels higher or lower than control patients not experiencing the relevant disease state or by clinical level of patients whose disease state has been sufficiently remitted from the intervention). Based on this information, the physician then adjusts the dosage, administration regimen, or type of drug prescribed.

对于优化患者治疗所考虑的因素包括所治疗的具体状况、所治疗的具体哺乳动物、各个患者的临床状况、活性化合物的递送部位、活性化合物的具体类型、施用方法、施用的时间安排和开业医生已知的其它因素。欲施用的活性化合物的治疗有效量可以通过此种考虑进行控制并且是治疗特定病理状况所需要的最小量。Factors to be considered in optimizing patient treatment include the particular condition being treated, the particular mammal being treated, the clinical condition of the individual patient, the site of delivery of the active compound, the particular type of active compound, the method of administration, the timing of administration, and the practitioner. other known factors. The therapeutically effective amount of active compound to be administered can be governed by such considerations and will be the minimum amount required to treat the particular pathological condition.

由于CREAP基因家族包含高度保守的关键区，所以可以预计CREAP蛋白质的肽模拟物可以用作CREAP调节子。所以，本发明的一个实施方案是从能够阻断CREAP功能的CREAP家族蛋白质衍生的或者设计的肽。可以预计这些肽模拟物能够阻断所有高度相关的CREAP蛋白质的功能。根据常规方法基于对多肽中CREAP蛋白质活性所需的区域的理解可以制造出适宜的肽模拟物。简言之，通过常规的结构功能研究如对野生型蛋白质的缺失或突变分析可以鉴定出蛋白质中的短氨基酸序列。一旦鉴定出关键的区域，可以预期如果它们对应着蛋白质的高度保守部分则该区域会负责关键的功能(例如蛋白质-蛋白质相互作用)。设计成看上去像所述关键区的小的合成肽模拟物预计能够与完整的蛋白质竞争并且因此而干扰该蛋白质的功能。合成的氨基酸序列可以由与该区域全部或部分匹配的氨基酸组成。此类氨基酸可以用类似原先的氨基酸的但是能赋予更好的药理学特性如更高的抑制活性、稳定性、半衰期或生物利用率的其它化学结构代替。Since the CREAP gene family contains highly conserved key regions, it is expected that peptidomimetics of CREAP proteins could serve as CREAP regulators. Therefore, one embodiment of the present invention are peptides derived or designed from CREAP family proteins capable of blocking CREAP function. These peptidomimetics are expected to block the function of all highly related CREAP proteins. Suitable peptidomimetics can be produced according to conventional methods based on the understanding of regions of the polypeptide required for CREAP protein activity. Briefly, short amino acid sequences in proteins can be identified by routine structure-function studies such as deletion or mutation analysis of wild-type proteins. Once critical regions are identified, it can be expected that the regions are responsible for critical functions (eg protein-protein interactions) if they correspond to highly conserved parts of the protein. Small synthetic peptidomimetics designed to look like the critical region are expected to compete with the intact protein and thus interfere with its function. Synthetic amino acid sequences may consist of amino acids matching all or part of this region. Such amino acids can be replaced by other chemical structures that are similar to the original amino acids but confer better pharmacological properties such as higher inhibitory activity, stability, half-life or bioavailability.

针对CREAP蛋白质或相关调节蛋白质的适宜抗体可以从商业来源得到或者根据常规的方法产生。例如文中所述的是用于产生能够特异识别一种或多种差别表达基因表位的抗体的方法。此类抗体可以包括但不限于多克隆抗体、单克隆抗体(mAb)、人源化或嵌合抗体、单链抗体、Fab片段、F(ab′)₂片段、由Fab表达文库产生的片段、抗独特型(anti-Id)抗体、和上面抗体任意一种的表位结合片段。Suitable antibodies against CREAP proteins or related regulatory proteins can be obtained from commercial sources or raised according to routine methods. For example, described herein are methods for generating antibodies capable of specifically recognizing epitopes of one or more differentially expressed genes. Such antibodies may include, but are not limited to, polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab') ₂ fragments, fragments produced by a Fab expression library, Anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above antibodies.

为了产生针对文中所讨论的CREAP多肽的抗体，通过注射多肽或其部分对多种宿主动物进行免疫。此类宿主动物包括但不限于兔、小鼠和大鼠。可以使用多种佐剂以增强免疫反应，这取决于宿主物种，佐剂包括但不限于弗氏佐剂(完全佐剂和不完全佐剂)、矿物凝胶例如氢氧化铝、表面活性物质例如溶血卵磷脂、多聚醇、聚阴离子、肽、油乳化液、匙孔血蓝蛋白、二硝基苯酚和潜在有用的人类佐剂例如BCG(卡介苗)和短小棒状杆菌(Corynebacterium parvum)。To raise antibodies against the CREAP polypeptides discussed herein, various host animals are immunized by injection of the polypeptides or portions thereof. Such host animals include, but are not limited to, rabbits, mice and rats. A variety of adjuvants can be used to enhance the immune response, depending on the host species, including but not limited to Freund's adjuvant (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as Lysolecithin, polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol and potentially useful human adjuvants such as BCG (BCG) and Corynebacterium parvum.

多克隆抗体是来源于用抗原如靶基因产物或抗原的抗原功能衍生物免疫的动物血清的抗体分子的异质群体。为了产生多克隆抗体，通过注射添加有上述佐剂的多肽或其部分对如上述的宿主动物进行免疫。Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the serum of animals immunized with an antigen, such as a target gene product or an antigenic functional derivative of the antigen. For the production of polyclonal antibodies, host animals as described above are immunized by injection of the polypeptide or a part thereof added with the above-mentioned adjuvant.

单克隆抗体是针对特定抗原的同质抗体群体，它们可以通过任何用于由培养的连续细胞系产生抗体分子的技术得到。所述的技术包括但不限于Kohler和Milstein的杂交瘤技术(1975，Nature 256：495-497；和美国专利号4,376,110)、人B-细胞杂交瘤技术(Kosbor等，1983，ImmunologyToday 4：72；Cole等，1983，Proc.Natl.Acad.Sci.USA 80：2026-2030)和EBV-杂交瘤技术(Cole等，1985，Monoclonal Antibodies And CancerTherapy，Alan R.Liss，Inc.，第77-96页)。此类抗体可以是包括IgG、IgM、IgE、IgA、IgD在内的任何免疫球蛋白类型和其任何亚类。对产生本发明mAb的杂交瘤可以进行体外或体内培养。高效价mAb的体内生产使得其成为目前优选的生产方法。Monoclonal antibodies are homogeneous populations of antibodies directed against a specific antigen, which can be obtained by any of the techniques used to produce antibody molecules from cultured continuous cell lines. Such techniques include, but are not limited to, the hybridoma technique of Kohler and Milstein (1975, Nature 256:495-497; and U.S. Pat. No. 4,376,110), the human B-cell hybridoma technique (Kosbor et al., 1983, Immunology Today 4:72; Cole et al., 1983, Proc.Natl.Acad.Sci.USA 80:2026-2030) and EBV-hybridoma technology (Cole et al., 1985, Monoclonal Antibodies And Cancer Therapy, Alan R.Liss, Inc., pages 77-96 ). Such antibodies may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. Hybridomas producing mAbs of the present invention can be cultured in vitro or in vivo. The in vivo production of high titer mAbs makes it the presently preferred method of production.

此外，可以使用为了通过将具有适当抗原特异性的小鼠抗体分子的基因和具有适当生物学活性的人抗体分子的基因拼接起来产生“嵌合抗体”而开发的技术(Morrison等，1984，Proc.Natl.Acad.Sci.，81：6851-6855；Neuberger等，1984，Nature，312：604-608；Takeda等，1985，Nature，314：452-454)。嵌合抗体是这样一种分子，其中不同的部分来源于不同的动物物种，例如具有来源于小鼠的mAb的可变区或高变区和人免疫球蛋白恒定区的嵌合抗体。In addition, a technique developed for the production of "chimeric antibodies" by splicing genes of mouse antibody molecules with appropriate antigen specificity and human antibody molecules with appropriate biological activity (Morrison et al., 1984, Proc. Sci., 81:6851-6855; Neuberger et al., 1984, Nature, 312:604-608; Takeda et al., 1985, Nature, 314:452-454). A chimeric antibody is a molecule in which different portions are derived from different animal species, eg, a chimeric antibody having variable or hypervariable regions of a mouse-derived mAb and a human immunoglobulin constant region.

备选地，可以用产生单链抗体的技术(美国专利号4,946,778；Bird，1988，Science 242：423-426；Huston等，1988，Proc.Natl.Acad.Sci.USA85：5879-5883；和Ward等，1989，Nature 334：544-546)以便产生差别表达基因-单链抗体。通过将Fv区的重链和轻链片段借助于氨基酸桥进行连接导致单链多肽而形成单链抗体。Alternatively, techniques for producing single-chain antibodies (US Patent No. 4,946,778; Bird, 1988, Science 242:423-426; Huston et al., 1988, Proc. Natl. Acad. Sci. USA85: 5879-5883; and Ward et al., 1989, Nature 334:544-546) in order to generate differentially expressed genes-single-chain antibodies. Single-chain antibodies are formed by linking the heavy and light chain fragments of the Fv region via an amino acid bridge resulting in a single-chain polypeptide.

最优选地，可以用产生“人源化抗体”的技术产生针对文中公开的多肽、片段、衍生物和功能等价物的抗体。此类技术公开于美国专利号5,932,448；5,693,762；5,693,761；5,585,089；5,530,101；5,910,771；5,569,825；5,625,126；5,633,425；5,789,650；5,545,580；5,661,016和5,770,429，其全部公开全文引用作为参考。Most preferably, techniques for producing "humanized antibodies" can be used to generate antibodies against the polypeptides, fragments, derivatives and functional equivalents disclosed herein. Such techniques are disclosed in U.S. Patent Nos. 5,932,448; 5,693,762; 5,693,761; 5,585,089; 5,530,101; 5,910,771; 5,569,825; 5,625,126;

能够识别特异表位的抗体片段可以通过已知的技术来产生。例如此类片段包括但不限于：通过胃蛋白酶对抗体分子进行消化而产生的F(ab′)₂片段和通过F(ab′)₂片段二硫键的还原而产生的Fab片段。备选地，可以构建Fab表达文库(Huse等，1989，Science，246：1275-1281)以便允许快速并容易地鉴定具有目的特异性的单克隆Fab片段。Antibody fragments capable of recognizing specific epitopes can be produced by known techniques. Examples of such fragments include, but are not limited to, F(ab') ₂ fragments produced by pepsin digestion of antibody molecules and Fab fragments produced by reduction of disulfide bonds of F(ab') ₂ fragments. Alternatively, Fab expression libraries can be constructed (Huse et al., 1989, Science, 246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with specificity of interest.

文中所述的抗体检测可以通过使用标准ELISA、FACS分析和体外或体内标准的成像技术来实现。通过将抗体偶联(即物理性连接)到可检测物质有利于检测。可检测物质的实例包括多种酶、辅基、荧光物质、发光物质、生物发光物质和放射性物质。适宜酶的实例包括辣根过氧化物酶、碱性磷酸酶、β-半乳糖苷酶或者乙酰胆碱酯酶；适宜辅基复合体的实例包括链霉亲和素/生物素和抗生物素蛋白/生物素；适宜荧光物质的实例包括伞形酮、萤光素、异硫氰酸荧光素、罗丹明、二氯三嗪基胺荧光素、丹磺酰氯、或藻红蛋白；发光物质的实例包括鲁米诺；生物发光物质的实例包括萤光素酶、萤光素和水母发光蛋白，适宜放射性物质的实例包括¹²⁵I、¹³¹I、³⁵S或³H。Antibody detection as described herein can be achieved using standard ELISA, FACS analysis and standard imaging techniques in vitro or in vivo. Detection is facilitated by conjugating (ie, physically linking) the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent substances, luminescent substances, bioluminescent substances, and radioactive substances. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, beta-galactosidase, or acetylcholinesterase; examples of suitable prosthetic complexes include streptavidin/biotin and avidin/ Biotin; examples of suitable fluorescent substances include umbelliferone, luciferin, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride, or phycoerythrin; examples of luminescent substances include Luminol; examples of bioluminescent substances include luciferase, luciferin and aequorin, examples of suitable radioactive substances ^include125I , ^131I , ^35S ^or3H .

为了便于检测，特别优选的是三明治测定法，其中存在多种变化形式，全部的测定法都包括在本发明的之内。例如，在典型的正向测定法中，未标记的抗体固定在固体基质上并且将待测试的样本与结合的分子接触。在足以允许抗体-抗原二元复合体形成的适当孵育阶段后，加入用能够诱导检测信号的报道分子标记的二级抗体并进行孵育一定时间使得足以形成抗体-抗原-标记抗体三元复合体。然后将任何未反应的物质洗掉并且抗原的存在可以通过观察信号进行确定或者通过与包含已知量抗原的对照样本进行比较来定量。正向测定法的变化形式包括同时测定法，其中样本和抗体同时加入到结合的抗体的，或者反向测定法，其中标记的抗体和欲进行测试样本先混合、孵育并加入到未标记的表面结合抗体。这些技术是本领域技术人员众所周知的并且技术的微小变化也是显而易见的。如文中所使用，“三明治测定法”旨在包括基本双位点(two-site)技术的所有变化形式。对于本发明的免疫测定法，唯一的限制因素是标记的抗体是CREAP多肽或相关调节蛋白质或其片段的特异抗体。Particularly preferred for ease of detection are sandwich assays, of which there are many variations, all of which are included within the scope of the invention. For example, in a typical forward assay, unlabeled antibodies are immobilized on a solid substrate and the sample to be tested is contacted with the bound molecules. After an appropriate incubation period sufficient to allow formation of antibody-antigen binary complexes, a secondary antibody labeled with a reporter capable of inducing a detection signal is added and incubated for a time sufficient to form antibody-antigen-labeled antibody ternary complexes. Any unreacted material is then washed away and the presence of antigen can be determined by observing the signal or quantified by comparison to a control sample containing known amounts of antigen. Variations of forward assays include simultaneous assays, where sample and antibody are added simultaneously to the bound antibody, or reverse assays, where labeled antibody and sample to be tested are first mixed, incubated, and added to an unlabeled surface binding antibody. These techniques are well known to those skilled in the art and minor variations of the techniques will be apparent. As used herein, "sandwich assay" is intended to include all variations of the basic two-site technique. For the immunoassays of the present invention, the only limiting factor is that the labeled antibody is specific for the CREAP polypeptide or related regulatory protein or fragment thereof.

最常用的报道分子可以是酶、包含荧光团或放射性核素的分子。对于酶免疫测定法，通常通过戊二醛或高碘酸盐将酶连接到二级抗体上。然而，正如能够容易地认识到的，存在本领域技术人员众所周知的多种不同连接技术。通常使用的酶包括辣根过氧化物酶、葡萄糖氧化酶、β-半乳糖苷酶和碱性磷酸酶。通常选择与特定酶使用的底物用来在被相应的酶水解时产生可检测的颜色变化。例如，对硝基苯基磷酸酯适宜用于碱性磷酸酶缀合物；对于过氧化物酶缀合物通常使用1，2-苯二胺或甲苯胺。采用荧光底物也是可能的，其中得到荧光产物而不是上面提到的显色底物。然后将含有适当底物的溶液加入到三元复合体中。底物与连接到二级抗体上的酶进行反应，给出了定量的可见信号，所述的可见信号可以进一步定量(通常通过分光光度计测量)以评估血清样本中存在的目的多肽或多肽片段的量。The most commonly used reporter molecules can be enzymes, molecules containing fluorophores or radionuclides. For enzyme immunoassays, the enzyme is usually attached to the secondary antibody via glutaraldehyde or periodate. However, as can be readily appreciated, there are many different connection techniques well known to those skilled in the art. Commonly used enzymes include horseradish peroxidase, glucose oxidase, beta-galactosidase and alkaline phosphatase. Substrates for use with specific enzymes are usually chosen to produce a detectable color change when hydrolyzed by the corresponding enzyme. For example, p-nitrophenyl phosphate is suitable for alkaline phosphatase conjugates; for peroxidase conjugates 1,2-phenylenediamine or toluidine are typically used. It is also possible to use fluorogenic substrates, in which a fluorescent product is obtained instead of the chromogenic substrates mentioned above. A solution containing the appropriate substrate is then added to the ternary complex. The substrate reacts with an enzyme linked to the secondary antibody, giving a quantitative visual signal that can be further quantified (usually by spectrophotometer) to assess the presence of the polypeptide or polypeptide fragment of interest in the serum sample amount.

备选地，可以将荧光化合物如荧光素和罗丹明通过化学方法偶联到抗体上而不改变其结合能力。当通过使用特定波长的光照射激活时，荧光染料-标记的抗体吸收光能，诱导分子内的可激发状态，随后发出特征性较长波长的光。发射光呈现使用光学显微镜通过视觉可检测的特征性颜色。免疫荧光和EIA技术都是本领域内完善建立的并且对于本发明的方法是特别优选的。然而，还可以使用其他报道分子例如放射性同位素、化学发光分子或生物发光分子。如何改变操作以适应所要求的用途对于技术人员将是显而易见的。Alternatively, fluorescent compounds such as fluorescein and rhodamine can be chemically coupled to antibodies without altering their binding capacity. When activated by irradiation with light of a specific wavelength, the fluorochrome-labeled antibody absorbs the light energy, induces an excitable state within the molecule, and subsequently emits characteristic longer wavelength light. The emitted light exhibits a characteristic color detectable visually using an optical microscope. Both immunofluorescence and EIA techniques are well established in the art and are particularly preferred for the methods of the invention. However, other reporter molecules such as radioisotopes, chemiluminescent molecules or bioluminescent molecules may also be used. It will be apparent to the skilled person how to alter the operation to suit the required use.

在另一个实施方案中，可通过基因治疗将包含编码CREAP蛋白质或其功能性衍生物的序列的核酸施用以进行治疗。基因治疗是指通过将核酸施用于受试者进行治疗。在本发明的这个实施方案中，核酸产生其编码蛋白质，该蛋白质通过促进正常CRE-依赖性基因表达或趋化因子正常激活介导治疗效果。In another embodiment, a nucleic acid comprising a sequence encoding a CREAP protein or a functional derivative thereof may be administered for treatment by gene therapy. Gene therapy refers to treatment by administering a nucleic acid to a subject. In this embodiment of the invention, the nucleic acid yields its encoded protein that mediates the therapeutic effect by promoting normal CRE-dependent gene expression or normal activation of chemokines.

根据本发明能够使用用于本领域可得到的基因治疗的任意方法。下面描述了代表性方法。Any method for gene therapy available in the art can be used according to the present invention. Representative methods are described below.

在优选的方面，治疗包含CREAP核酸，该核酸是在适当宿主细胞中表达CREAP蛋白质或其片段或嵌合蛋白质的表达载体的一部分。具体而言，此种核酸具有可能连接至CREAP编码区的启动子，所述启动子是可诱导的或组成型的，并且，任选地是组织特异性的。在另一个具体实施方案中，使用核酸分子，其中CREAP编码序列和任意其它预期序列的侧翼为促进基因组内预期位点上同源重组的区域，从而提供CREAP核酸的染色体内表达(Koller和Smithies，1989，Proc.Natl.Acad.Sci.USA86：8932-8935；Zijlstra等，1989，Nature 342：435-438)。In a preferred aspect, the treatment comprises a CREAP nucleic acid that is part of an expression vector that expresses the CREAP protein, or a fragment or chimeric protein thereof, in an appropriate host cell. In particular, such a nucleic acid has a promoter, possibly linked to the CREAP coding region, said promoter being inducible or constitutive and, optionally, tissue-specific. In another specific embodiment, nucleic acid molecules are used wherein the CREAP coding sequence and any other desired sequences are flanked by regions that promote homologous recombination at desired sites within the genome, thereby providing for intrachromosomal expression of the CREAP nucleic acid (Koller and Smithies, 1989, Proc. Natl. Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438).

将核酸递送入患者可以是直接的，在这种情况下患者直接暴露于核酸或核酸运送载体，或者是间接的，在这种情况下首先在体外用核酸转化细胞，然后将细胞移植入患者。这两种方法分别是已知的体内或先体外后体内基因疗法。Delivery of nucleic acids into a patient can be direct, where the patient is directly exposed to the nucleic acid or nucleic acid delivery vehicle, or indirect, where cells are first transformed in vitro with the nucleic acid and then transplanted into the patient. These two approaches are known as in vivo or ex vivo gene therapy, respectively.

在特定实施方案中，核酸直接在体内进行施用，其中该核酸表达产生所编码的产物。这能够通过本领域已知的众多方法中的任意一种实现，例如通过将其构建为适当核酸表达载体的一部分并将其施用使其成为细胞内部分，例如通过使用有缺陷或减毒的逆转录病毒或其他病毒载体感染(参见，例如美国专利号4,980,286和其它下文所提及的方法)，或者通过直接注射裸露DNA，或者通过使用微粒轰击(例如基因枪；Biolistic，Dupont)，或者用类脂或细胞表面受体或转染试剂包被，包囊于脂质体、微粒或微囊，或者通过将其与已知进入细胞核的肽连接施用，通过将其与受到受体介导的内吞的配体连接施用(参见例如美国专利5,166,320；5,728,399；5,874,297和6,030,954，所有这些专利在文中全文引用作为参考)(这能够用于靶向特异表达受体的细胞类型)等等。在另一个实施方案中，形成了核酸-配体复合体，其中配体包含病毒融合肽以破坏内体，允许核酸避免溶酶体的降解。在另一个实施方案中，通过靶向特异的受体可以将核酸在体内进行靶向用于细胞的特异摄取和表达(见，例如PCT公开WO92/06180；WO92/22635；WO92/20316；WO93/14188和WO93/20221)。备选地，通过同源重组可以将核酸进行细胞内导入并整合进入宿主细胞DNA以便表达(见，例如美国专利5,413,923；5,416,260和5,574,205；和Zijlstra等，1989，Nature342：435-438)。In certain embodiments, the nucleic acid is administered directly in vivo, wherein expression of the nucleic acid produces the encoded product. This can be achieved by any of numerous methods known in the art, for example by constructing it as part of an appropriate nucleic acid expression vector and administering it as an intracellular part, for example by using a defective or attenuated retro infection with a recording virus or other viral vector (see, e.g., U.S. Pat. Lipid or cell surface receptors or transfection reagents, encapsulated in liposomes, microparticles or microcapsules, or administered by linking them to peptides known to enter the nucleus, by linking them to receptor-mediated internalization Ligand-linked administration of endocytosis (see eg US Patents 5,166,320; 5,728,399; 5,874,297 and 6,030,954, all of which are incorporated by reference in their entirety) (this can be used to target cell types that specifically express the receptor), and the like. In another embodiment, a nucleic acid-ligand complex is formed wherein the ligand comprises a viral fusion peptide to disrupt endosomes, allowing the nucleic acid to avoid lysosomal degradation. In another embodiment, nucleic acids can be targeted in vivo for specific uptake and expression by cells by targeting specific receptors (see, e.g., PCT publications WO92/06180; WO92/22635; WO92/20316; WO93/ 14188 and WO93/20221). Alternatively, the nucleic acid can be introduced intracellularly by homologous recombination and integrated into the host cell DNA for expression (see, eg, US Pat.

在特定实施方案中，使用了包含CREAP核酸的病毒载体。例如可以使用逆转录病毒载体(见，例如美国专利5,219,740；5,604,090；和5,834,182)。这些逆转录病毒经修饰缺失了对于病毒基因组包装和整合入宿主细胞DNA不需要的逆转录病毒序列。将欲在基因治疗中使用的CREAP核酸克隆到载体中，其利于基因递送入患者。In a specific embodiment, a viral vector comprising a CREAP nucleic acid is used. For example, retroviral vectors can be used (see, eg, US Patents 5,219,740; 5,604,090; and 5,834,182). These retroviruses have been modified to delete retroviral sequences not required for viral genome packaging and integration into host cell DNA. The CREAP nucleic acid to be used in gene therapy is cloned into a vector that facilitates gene delivery into patients.

腺病毒是能够在基因治疗中使用的其它病毒载体。腺病毒对于递送基因到呼吸上皮是特别有吸引力的运载工具。腺病毒天然感染呼吸上皮导致产生轻微疾病。基于腺病毒的递送系统的其它靶标是肝脏、中枢神经系统、内皮细胞和肌肉。腺病毒具有能够感染非分裂细胞的优点。进行基于腺病毒的基因治疗的方法描述于例如美国专利5,824,544；5,868,040；5,871,722；5,880,102；5,882,877；5,885,808；5,932,210；5,981,225；5,994,106；5,994,132；5,994,134；6,001,557和6,033,8843，所有这些在文中全文引用作为参考。Adenoviruses are other viral vectors that can be used in gene therapy. Adenoviruses are particularly attractive vehicles for delivering genes to the respiratory epithelium. Adenoviruses naturally infect the respiratory epithelium resulting in mild disease. Other targets for adenovirus-based delivery systems are the liver, central nervous system, endothelial cells, and muscle. Adenoviruses have the advantage of being able to infect non-dividing cells.进行基于腺病毒的基因治疗的方法描述于例如美国专利5,824,544；5,868,040；5,871,722；5,880,102；5,882,877；5,885,808；5,932,210；5,981,225；5,994,106；5,994,132；5,994,134；6,001,557和6,033,8843，所有这些在文中全文引用作为参考.

也已经提出在基因治疗中使用腺伴随病毒(AAV)。产生和利用AAV的方法描述于例如美国专利5,173,414；5,252,479；5,552,311；5,658,785；5,763,416；5,773,289；5,843,742；5,869,040；5,942,496和5,948,675，全部这些在文中全文引用作为参考。The use of adeno-associated virus (AAV) in gene therapy has also been proposed. Methods of producing and utilizing AAV are described, for example, in US Patent Nos. 5,173,414; 5,252,479; 5,552,311; 5,658,785;

基因治疗的另一种方法涉及通过如电穿孔、脂转染、磷酸钙介导的转染或病毒感染的方法将基因转移进入组织培养中的细胞。通常，转移方法包括将选择标记转移入细胞。然后将细胞置于选择的条件下以便分离那些已经摄取并且正在表达所转移基因的那些细胞。然后将那些细胞递送给患者。在该实施方案中，在将所得的重组细胞进行体内使用之前已经将核酸导入了细胞。此种导入可以通过本领域已知的任意方法进行，所述的方法包括但不限于转染、电穿孔、微注射、用包含核酸序列的病毒或噬菌体载体进行感染、细胞融合、染色体介导的基因转移、微细胞介导的基因转移、原生质球融合等等。多种技术是本领域已知的用于将外来基因导入细胞并且可以按照本发明使用，其前提条件是受体细胞必需的发育和生理功能不被破坏。所述技术应该提供将核酸稳定转移入细胞，以致于核酸可由细胞表达并且优选地可由其细胞子代遗传和表达。Another approach to gene therapy involves the transfer of genes into cells in tissue culture by methods such as electroporation, lipofection, calcium phosphate-mediated transfection, or viral infection. Typically, methods of transfer involve transferring a selectable marker into cells. The cells are then subjected to selective conditions to isolate those cells that have taken up and are expressing the transferred gene. Those cells are then delivered to the patient. In this embodiment, the nucleic acid has been introduced into the resulting recombinant cell prior to its use in vivo. Such introduction can be performed by any method known in the art, including but not limited to transfection, electroporation, microinjection, infection with viral or phage vectors containing nucleic acid sequences, cell fusion, chromosome-mediated Gene transfer, minicell-mediated gene transfer, spheroplast fusion, and more. A variety of techniques are known in the art for introducing foreign genes into cells and can be used in accordance with the present invention, provided that essential developmental and physiological functions of the recipient cells are not disrupted. The technique should provide for the stable transfer of the nucleic acid into the cell so that the nucleic acid can be expressed by the cell and preferably inherited and expressed by its progeny of cells.

通过本领域已知的方法可以将所得重组细胞递送给患者。在优选的实施方案中用上皮细胞进行注射例如皮下注射。在另一个实施方案中，重组皮肤细胞作为皮肤移植物应用于患者。重组的血细胞(例如造血干细胞或祖细胞)优选地静脉内施用。预想的所用细胞量取决于预期的效果、患者状况等等并且可以由本领域技术人员确定。The resulting recombinant cells can be delivered to a patient by methods known in the art. In a preferred embodiment epithelial cells are used for injection, eg subcutaneous injection. In another embodiment, recombinant skin cells are applied to a patient as a skin graft. Recombinant blood cells (eg, hematopoietic stem or progenitor cells) are preferably administered intravenously. The envisioned amount of cells used depends on the desired effect, patient condition, etc. and can be determined by one skilled in the art.

为了基因治疗的目的能够将核酸导入的细胞包括任何希望的可得的细胞类型，并且包括但不限于上皮细胞、内皮细胞、角质形成细胞、成纤维细胞、肌细胞、肝细胞；血细胞如T淋巴细胞、B淋巴细胞、单核细胞、巨噬细胞、嗜中性粒细胞、嗜酸性粒细胞、巨核细胞、粒细胞；多种干细胞或祖细胞，尤其造血干细胞或祖细胞，例如从骨髓、脐带血、外周血、胎肝中得到的那些干细胞或祖细胞。Cells into which nucleic acids can be introduced for the purpose of gene therapy include any desired cell type available, and include, but are not limited to, epithelial cells, endothelial cells, keratinocytes, fibroblasts, myocytes, hepatocytes; blood cells such as T lymphocytes; cells, B lymphocytes, monocytes, macrophages, neutrophils, eosinophils, megakaryocytes, granulocytes; various stem or progenitor cells, especially hematopoietic stem or progenitor cells, e.g. from bone marrow, umbilical cord Those stem cells or progenitor cells obtained from blood, peripheral blood, fetal liver.

在优选的实施方案中，用于基因治疗的细胞是患者自体的。In preferred embodiments, the cells used in gene therapy are autologous to the patient.

在重组细胞用于基因治疗的实施方案中，将CREAP核酸导入细胞以致于CREAP核酸可由细胞或其后代表达，然后为了产生治疗效果将重组细胞体内施用。在一个特定实施方案中，使用了干细胞或祖细胞。依照本发明的该实施方案，可以潜在地使用能够在体外分离和维持的任何干细胞和/或祖细胞。此类干细胞包括但不限于造血干细胞(HSC)、上皮组织例如皮肤和消化道的被覆上皮的干细胞、胚胎心肌细胞、肝脏干细胞(见，例如WO94/08598)和神经干细胞(Stemple和Anderson，1992，Cell 71：973-985)。In embodiments where recombinant cells are used for gene therapy, the CREAP nucleic acid is introduced into the cell such that the CREAP nucleic acid can be expressed by the cell or its progeny, and then the recombinant cell is administered in vivo for therapeutic effect. In a particular embodiment, stem or progenitor cells are used. According to this embodiment of the invention, any stem and/or progenitor cells capable of being isolated and maintained in vitro can potentially be used. Such stem cells include, but are not limited to, hematopoietic stem cells (HSCs), epithelial-lined stem cells of epithelial tissues such as the skin and digestive tract, embryonic cardiomyocytes, liver stem cells (see, e.g., WO94/08598) and neural stem cells (Stemple and Anderson, 1992, Cell 71:973-985).

通过已知的方法可以从诸如皮肤和消化道的被覆上皮的组织得到上皮干细胞(ESC)或角质形成细胞(Rheinwald，1980，Meth.Cell Bio.21A：229)。在分层上皮组织例如皮肤中，通过最靠近基膜的生发层内的干细胞的有丝分裂进行更新。消化道被覆上皮内的干细胞提供了该组织的快速更新速率。从患者或者供体的皮肤或者消化道被覆上皮中得到的ESC或者角质形成细胞可以在组织培养中生长(Pittelkow和Scott，1986，Mayo ClinicProc.61：771)。如果ESC是由供体提供的，还可以使用抑制宿主抗移植物反应性的方法(例如辐射、药物或抗体施用以促进适当的免疫抑制)。Epithelial stem cells (ESCs) or keratinocytes can be obtained from epithelial-lined tissues such as skin and digestive tract by known methods (Rheinwald, 1980, Meth. Cell Bio. 21A: 229). In stratified epithelial tissues such as skin, renewal occurs by mitosis of stem cells in the germinal layer closest to the basement membrane. Stem cells within the epithelium lining the digestive tract provide a rapid turnover rate for this tissue. ESCs or keratinocytes obtained from the skin or lining epithelium of the gut of a patient or donor can be grown in tissue culture (Pittelkow and Scott, 1986, Mayo Clinic Proc. 61:771). If the ESC is provided by a donor, methods of suppressing host-versus-graft reactivity (eg, radiation, drug or antibody administration to promote appropriate immunosuppression) can also be used.

至于造血干细胞(HSC)，能够提供HSC体外分离、繁殖和维持的任何技术均可在本发明的该实施方案中使用。可以实现其目的的技术包括(a)对从将来的宿主或供体分离而来的骨髓中分离和建立HSC培养物，或者(b)使用先前建立的长期HSC培养物，其可以是同种异体的或者异种的。非自体的HSC优选地与抑制将来宿主/患者移植免疫反应的方法联合使用。在本发明的特定实施方案中，通过针吸出从后髂嵴得到人骨髓细胞(见，例如Kodo等，1984，J.Clin.Invest.73：1377-1384)。在本发明优选的实施方案中，可以得到高度富集的或者基本上纯的HSC。这种富集可以在长期培养之前、长期培养过程中、或长期培养之后完成，并且可以通过本领域已知的任何技术来实现。通过使用例如改良的Dexter细胞培养技术(Dexter等，1977，J.Cell Physiol.91：335)或者Witlock-Witte培养技术(Witlock和Witte，1982，Proc.Natl.Acad.Sci.USA 79：3608-3612)可以建立和维持骨髓细胞的长期培养物。With regard to hematopoietic stem cells (HSCs), any technique that provides for the isolation, propagation and maintenance of HSCs in vitro may be used in this embodiment of the invention. Techniques that can accomplish this include (a) isolation and establishment of HSC cultures from bone marrow isolated from future hosts or donors, or (b) use of previously established long-term HSC cultures, which can be allogeneic or heterogeneous. Non-autologous HSCs are preferably used in combination with methods of suppressing immune responses to future host/patient transplants. In a specific embodiment of the invention, human bone marrow cells are obtained from the posterior iliac crest by needle aspiration (see, eg, Kodo et al., 1984, J. Clin. Invest. 73:1377-1384). In a preferred embodiment of the invention, highly enriched or substantially pure HSCs can be obtained. This enrichment can be accomplished before, during, or after long-term culture, and can be accomplished by any technique known in the art. By using, for example, the improved Dexter cell culture technique (Dexter et al., 1977, J.Cell Physiol.91:335) or the Witlock-Witte culture technique (Witlock and Witte, 1982, Proc.Natl.Acad.Sci.USA 79:3608- 3612) can establish and maintain long-term cultures of bone marrow cells.

在一个特定实施方案中，为了基因治疗的目的欲被导入的核酸包含有效连接到编码区的可诱导启动子，以致于通过控制转录的适当诱导物的存在或缺乏可以控制核酸的表达。In a specific embodiment, a nucleic acid to be introduced for the purpose of gene therapy comprises an inducible promoter operably linked to the coding region such that expression of the nucleic acid can be controlled by the presence or absence of an appropriate inducer controlling transcription.

本发明还涉及本发明多核苷酸作为诊断试剂的用途。具体而言，本发明涉及对与CRE-依赖性基因表达异常激活或趋化因子异常激活相关的病理状况进行诊断的方法，其中所述的方法包括：检测信使RNA在来源于人的适当的组织和细胞内异常的转录，例如提高的转录，所述的信使RNA是从编码包含SEQ ID NO：2、16、25所示氨基酸序列的天然内源人类基因转录的，其中所述异常转录可以诊断所述人患有上述疾病。具体而言，所述天然内源人基因包含SEQ ID NO：1、15、24中所示的核苷酸序列。在优选实施方案中，此种方法包括将所述适当组织或细胞的样本或源于所述组织或细胞的分离的RNA或者DNA分子与长度至少约20个核苷酸的分离核苷酸序列相接触，其中所述的核苷酸序列在高严格条件下能够与编码由SEQ ID NO：2、16、25所示氨基酸序列组成的多肽分离的核苷酸序列杂交。检测到升高的转录表明受试者是用一种或多种CREAP调节子治疗的适宜候选者。The invention also relates to the use of the polynucleotides of the invention as diagnostic reagents. Specifically, the present invention relates to a method for diagnosing pathological conditions associated with abnormal activation of CRE-dependent gene expression or abnormal activation of chemokines, wherein said method comprises: detecting messenger RNA in a suitable human-derived tissue and abnormal transcription in the cell, such as increased transcription, the messenger RNA is transcribed from a natural endogenous human gene encoding an amino acid sequence comprising SEQ ID NO: 2, 16, 25, wherein the abnormal transcription can be diagnosed The person suffers from the above-mentioned diseases. Specifically, the natural endogenous human gene comprises the nucleotide sequences shown in SEQ ID NO: 1, 15, 24. In a preferred embodiment, such a method comprises comparing a sample of said suitable tissue or cell, or an isolated RNA or DNA molecule derived from said tissue or cell, with an isolated nucleotide sequence of at least about 20 nucleotides in length. Contact, wherein said nucleotide sequence can hybridize with the isolated nucleotide sequence encoding the polypeptide consisting of the amino acid sequences shown in SEQ ID NO: 2, 16, and 25 under high stringency conditions. Detection of elevated transcripts indicates that the subject is a suitable candidate for treatment with one or more CREAP modulators.

对与功能障碍相关的CREAP蛋白质突变形式的检测提供了诊断工具，该诊断工具可以增加或定义疾病或疾病易感性的诊断，所述疾病源于CREAP基因低表达、过表达或改变的时间或空间表达。通过多种技术可以在DNA水平检测携带基因中突变的个体。Detection of mutated forms of the CREAP protein associated with dysfunction provides a diagnostic tool that can augment or define the diagnosis of disease or susceptibility to disease arising from temporally or spatially underexpressed, overexpressed, or altered CREAP genes Express. Individuals carrying mutations in genes can be detected at the DNA level by a variety of techniques.

用于诊断的核酸，尤其mRNA可以从受试者的细胞，如从血、尿、唾液、活组织检查或尸检材料得到。基因组DNA可以直接用于检测或者在分析前使用PCR或其他扩增技术酶促扩增。还可以以相似的方式使用RNA或cDNA。通过扩增产物与正常基因型相比大小的变化可以检测缺失和插入。将扩增的DNA与编码本发明CREAP多肽的标记核苷酸序列进行杂交可以鉴定点突变。通过RNA酶消化或者通过解链温度的差异可以区别完全匹配序列和错配双链体。通过DNA片段在含有或不含有变性剂的凝胶中电泳迁移率的改变或者通过直接DNA测序也可以检测DNA序列差异(例如Myers等，Science(1985)230：1242)。在特定位置的序列改变还可以通过核酸酶保护分析，例如RNA酶和S1保护或者化学断裂方法揭示(见Cotton等，Proc Natl Acad Sci USA(1985)85：4397-4401)。在另一个实施方案中，可以构建包含编码本发明CREAP多核苷酸序列或这一核苷酸序列的片段的寡核苷酸探针阵列以便进行例如遗传突变的有效筛选。阵列技术方法是众所周知的并且具有广泛的应用性并且可以用来解决分子遗传学包括基因表达、遗传连锁和遗传变异中的多种问题(见例如M.Chee等，Science，第274卷，第610-613页(1996))。Nucleic acids for diagnosis, especially mRNA, can be obtained from cells of a subject, such as from blood, urine, saliva, biopsy or autopsy material. Genomic DNA can be used directly for detection or enzymatically amplified using PCR or other amplification techniques prior to analysis. RNA or cDNA can also be used in a similar manner. Deletions and insertions can be detected by changes in the size of the amplified product compared to the normal genotype. Point mutations can be identified by hybridizing amplified DNA to a labeled nucleotide sequence encoding a CREAP polypeptide of the invention. Perfectly matched sequences can be distinguished from mismatched duplexes by RNase digestion or by differences in melting temperature. DNA sequence differences can also be detected by changes in the electrophoretic mobility of DNA fragments in gels with or without denaturing agents or by direct DNA sequencing (eg, Myers et al., Science (1985) 230:1242). Sequence changes at specific positions can also be revealed by nuclease protection assays such as RNase and S1 protection or chemical fragmentation methods (see Cotton et al., Proc Natl Acad Sci USA (1985) 85:4397-4401). In another embodiment, arrays of oligonucleotide probes comprising a polynucleotide sequence encoding a CREAP of the invention or fragments of such a nucleotide sequence can be constructed for efficient screening of, for example, genetic mutations. Array technology methods are well known and widely applicable and can be used to solve a variety of problems in molecular genetics including gene expression, genetic linkage and genetic variation (see e.g. M. Chee et al., Science, Vol. 274, No. 610 -613 pages (1996)).

诊断测定法提供了通过使用所描述的方法检测CREAP基因内突变来诊断或确定对疾病易感性的方法。此外，此类疾病可以通过这样的方法诊断，其中所述方法包括检测源于受试者的样本中的多肽或mRNA的不正常降低或升高的水平。使用本领域众所周知的用于定量多核苷酸的任何方法可以在RNA水平上测量降低或增加的表达，其中所述方法为例如核酸扩增，如PCR、RT-PCR、RNA酶保护、RNA印记和其它杂交方法。用于测定源于宿主的样本内蛋白质，如本发明多肽水平的测定技术是本领域技术人员众所周知的。此种测定方法包括放射免疫测定法、竞争结合测定法、蛋白质印迹分析和ELISA测定法。Diagnostic assays provide methods for diagnosing or determining susceptibility to disease by detecting mutations in the CREAP gene using the methods described. Furthermore, such diseases can be diagnosed by a method comprising detecting abnormally reduced or elevated levels of a polypeptide or mRNA in a sample derived from a subject. Decreased or increased expression can be measured at the RNA level using any method well known in the art for quantifying polynucleotides, such as nucleic acid amplification such as PCR, RT-PCR, RNase protection, Northern blot and other hybridization methods. Assay techniques for determining levels of proteins, such as polypeptides of the invention, in a sample derived from a host are well known to those skilled in the art. Such assays include radioimmunoassays, competitive binding assays, Western blot analysis and ELISA assays.

因此在另一方面本发明涉及诊断试剂盒，其包含：(a)本发明的多核苷酸，优选SEQ ID NO：1、15或24的核苷酸序列或其片段；Therefore in another aspect the present invention relates to a diagnostic kit comprising: (a) a polynucleotide of the present invention, preferably a nucleotide sequence of SEQ ID NO: 1, 15 or 24 or a fragment thereof;

(b)与(a)中核苷酸序列互补的核苷酸序列；(b) a nucleotide sequence complementary to the nucleotide sequence in (a);

(c)本发明的多肽，优选SEQ ID NO：2、16、25的多肽或其片段；(c) a polypeptide of the present invention, preferably a polypeptide of SEQ ID NO: 2, 16, 25 or a fragment thereof;

(d)针对本发明多肽，优选针对SEQ ID NO：2、16、25的多肽的抗体；或(d) against polypeptide of the present invention, preferably against the antibody of the polypeptide of SEQ ID NO:2,16,25; Or

(e)针对CREAP蛋白质，优选SEQ ID NO：2、16、25的肽模拟物。(e) for CREAP protein, preferably the peptide mimetic of SEQ ID NO:2,16,25.

应当理解在任何此类试剂盒中，(a)、(b)、(c)、(d)或(e)包含基本成分。此类试剂盒可用于诊断疾病或者诊断对疾病，特别是与CRE-依赖性基因表达，异常激活或趋化因子异常激活相关的病理状况的易感性。还预期到所述试剂盒包含设计用于检测如文中所讨论的CREAP相关调节蛋白质或被CREAP修饰的蛋白质水平的成分(a)-(e)。It will be understood that in any such kit, (a), (b), (c), (d) or (e) comprise the essential components. Such kits are useful in the diagnosis of disease or susceptibility to disease, especially pathological conditions associated with CRE-dependent gene expression, abnormal activation or abnormal activation of chemokines. It is also contemplated that the kit comprises components (a)-(e) designed to detect levels of CREAP-associated regulatory proteins or proteins modified by CREAP as discussed herein.

本发明的核苷酸序列对于染色体定位也是有价值的。序列特异靶向个体的人染色体上的特定位置并与该位置杂交。根据本发明将相关序列在染色体上定位是将那些序列与基因相关疾病联系起来的重要的第一步。一旦序列被定位到精确的染色体位置，则该序列在染色体上的物理位置可以与遗传图数据联系起来。此种数据可以在例如V.McKusick，MendelianInheritance in Man(通过Johns Hopkins University Welch MedicalLibrary在线得到)中找到。然后通过连锁分析(物理上相邻基因的共同遗传)鉴定已经定位于同一染色体区域的基因和疾病之间的关系。The nucleotide sequences of the invention are also valuable for chromosomal mapping. The sequence is specifically targeted to and hybridizes to a particular location on an individual's human chromosome. Chromosomal mapping of related sequences according to the present invention is an important first step in linking those sequences to gene-associated diseases. Once a sequence has been mapped to a precise chromosomal location, the sequence's physical location on the chromosome can be linked to genetic map data. Such data can be found, for example, in V. McKusick, Mendelian Inheritance in Man (available online through the Johns Hopkins University Welch Medical Library). Relationships between genes and diseases that have been mapped to the same chromosomal region are then identified by linkage analysis (co-inheritance of physically adjacent genes).

还可以确定受侵袭个体和未受侵袭个体之间cDNA或者基因组序列的差异。如果在一些或全部受侵袭个体中观察到突变但在任何正常个体中部没有观察到突变，则该突变可能是该疾病的病因。Differences in cDNA or genomic sequence between affected and non-affected individuals can also be determined. If a mutation is observed in some or all affected individuals but not in any normal individuals, the mutation may be the cause of the disease.

本发明的药物组合物还可以包含在核酸水平抑制CREAP蛋白质表达的物质。此类分子包括核酶、反义寡核苷酸、三股螺旋DNA、RNA适体、siRNA和针对CREAP核酸的适当核苷酸序列的双链或单链RNA。这些抑制分子可以由本领域的技术人员使用常规的技术无需过多负担和实验即可产生。例如，通过设计针对编码文中所讨论的CREAP多肽的基因的控制区，即启动子、增强子和内含子的反义分子、DNA或者RNA可以实现对基因表达的修饰(例如抑制)。例如可以使用源于转录起始位点，例如起始位点的-10到+10位的寡核苷酸。尽管如此，基因的所有区域均可用来设计反义分子以便产生与mRNA有着最强杂交的反义分子并且此类适宜的反义寡核苷酸可以通过本领域技术人员熟悉的标准测定方法产生和鉴定。The pharmaceutical composition of the present invention may also contain a substance that inhibits the expression of CREAP protein at the nucleic acid level. Such molecules include ribozymes, antisense oligonucleotides, triple-helix DNA, RNA aptamers, siRNA, and double- or single-stranded RNA directed to the appropriate nucleotide sequence of the CREAP nucleic acid. These inhibitory molecules can be generated by those skilled in the art using routine techniques without undue burden and experimentation. Modification (eg, suppression) of gene expression can be achieved, for example, by designing antisense molecules, DNA or RNA directed against the control regions of the gene encoding the CREAP polypeptides discussed herein, ie, the promoter, enhancer and intron. For example oligonucleotides derived from the transcription start site, eg positions -10 to +10 of the start site, may be used. Nonetheless, all regions of the gene can be used to design antisense molecules in order to generate antisense molecules that hybridize most strongly to mRNA and such suitable antisense oligonucleotides can be generated and analyzed by standard assays familiar to those skilled in the art. Identification.

类似地，使用“三股螺旋”碱基配对方法可以实现基因表达的表达抑制。三股螺旋配对是有用的，因为它抑制双螺旋足够打开以结合聚合酶、转录因子或调节分子的能力。使用三股DNA的治疗最近进展在文献中有所描述(Gee，J.E.等(1994)Huber，B.E.和B.I.Carr，Molecular andImmunologic Approaches，Futura Publishing Co.，Mt.Kisco，N.Y.)。这些分子还可以设计成通过防止转录物结合核糖体而阻断mRNA翻译。Similarly, expression inhibition of gene expression can be achieved using the "triple helix" base pairing method. Triple helix pairing is useful because it inhibits the ability of the double helix to open sufficiently to bind polymerases, transcription factors, or regulatory molecules. Recent advances in therapy using triple-stranded DNA are described in the literature (Gee, J.E. et al. (1994) Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, N.Y.). These molecules can also be designed to block mRNA translation by preventing the transcript from binding to the ribosome.

核酶，即酶活性RNA分子，可以通过催化RNA的特异切割用来抑制基因表达。核酶作用的机制涉及核酶分子与互补的靶RNA的序列特异的杂交和随后的内切核酸性切割。可以使用的实例包括工程化的“锤头”(“hammerhead”)或“发夹”(“hairpin”)基序核酶分子，所述的核酶分子可以设计成特异地和有效地催化基因序列例如CREAP1、CREAP2或CREAP3基因的内切核酸切割。Ribozymes, enzymatically active RNA molecules, can be used to inhibit gene expression by catalyzing the specific cleavage of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme molecule to a complementary target RNA and subsequent endonucleic cleavage. Examples that can be used include engineered "hammerhead" or "hairpin" motif ribozyme molecules that can be designed to specifically and efficiently catalyze gene sequences For example endonucleic cleavage of the CREAP1, CREAP2 or CREAP3 genes.

通过扫描靶分子中的包括下列序列GUA、GUU和GUC在内的核酶切割位点初步鉴定任何潜在RNA靶标内的特异核酶切割位点。一旦鉴定出核酶切割位点，则对对应于包含切割位点的靶基因区域的15-20个核糖核苷酸之间的短RNA序列评估可能使该寡核苷酸不起作用的二级结构特征。通过使用核糖核酸酶保护分析测试与互补寡核苷酸杂交的可接近性也可以评估候选靶标的适宜性。Specific ribozyme cleavage sites within any potential RNA target were initially identified by scanning the target molecule for ribozyme cleavage sites including the following sequences GUA, GUU and GUC. Once the ribozyme cleavage site has been identified, evaluate the short RNA sequence between 15-20 ribonucleotides corresponding to the region of the target gene containing the cleavage site for secondary factors that may render the oligo ineffective Structure. Candidate target suitability can also be assessed by testing accessibility to hybridization to complementary oligonucleotides using a ribonuclease protection assay.

核酶方法包括将细胞暴露于核酶中或者在细胞中诱导表达此种小RNA核酶分子(Grassi和Marini，1996，Annals of Medicine 28：499-510；Gibson，1996，Cancer and Metastasis Reviews 15：287-299)。靶向于对应于至少一种文中讨论的基因的mRNA的锤头和发夹型核酶在细胞内的表达可以用来抑制所述基因编码的蛋白质。Ribozyme methods involve exposing cells to ribozymes or inducing expression of such small RNA ribozyme molecules in cells (Grassi and Marini, 1996, Annals of Medicine 28:499-510; Gibson, 1996, Cancer and Metastasis Reviews 15: 287-299). Intracellular expression of hammerhead and hairpin ribozymes targeting mRNA corresponding to at least one of the genes discussed herein can be used to inhibit the protein encoded by said gene.

核酶可以以掺入核酶序列的RNA寡核苷酸的形式直接递送入细胞或者作为编码目的核酶RNA的表达载体导入细胞。核酶通常在体内以足够数目表达以有效催化切割mRNA并由此改变细胞内mRNA丰度(Cotten等，1989，EMBO J.8：3861-3866)。具体而言，可以将能够编码DNA序列的、根据常规的众所周知的规则设计的并通过例如标准亚磷酸胺(phosphoramidite)化学合成的核酶编码DNA序列连接进入编码tRNA的基因的反密码子茎和环的限制酶位点中，然后通过本领域常规的方法将其转化进入目的细胞并在目的细胞中进行表达。优选地，在该构建体中还引入可诱导的启动子(例如糖皮质激素或者四环素效应元件)以致于核酶的表达可以选择性控制。为了进行饱和性使用，可以使用高活性和组成型活性的启动子。在该应用中tDNA基因(即编码tRNA的基因)是有用的，因为它们尺寸小、转录速度高并且在不同类型组织中普遍表达。The ribozyme can be delivered directly into the cell in the form of an RNA oligonucleotide incorporating the ribozyme sequence or introduced into the cell as an expression vector encoding the ribozyme RNA of interest. Ribozymes are usually expressed in sufficient numbers in vivo to efficiently catalytically cleave mRNA and thereby alter intracellular mRNA abundance (Cotten et al., 1989, EMBO J. 8:3861-3866). Specifically, a ribozyme-encoding DNA sequence capable of encoding a DNA sequence designed according to conventional well-known rules and synthesized by, for example, standard phosphoramidite chemistry can be ligated into the anticodon stem of the gene encoding the tRNA and The restriction enzyme site of the loop, and then transformed into the target cell and expressed in the target cell by conventional methods in the art. Preferably, an inducible promoter (such as a glucocorticoid or tetracycline response element) is also incorporated into the construct so that expression of the ribozyme can be selectively controlled. For saturating use, highly active and constitutively active promoters can be used. tDNA genes (ie, genes encoding tRNAs) are useful in this application because of their small size, high transcription rate and ubiquitous expression in different types of tissues.

因此，核酶通常设计成能够切割几乎任何mRNA序列并且细胞通常用编码此种核酶序列的DNA转化，以致于表达可控的和催化有效量的核酶。因此，细胞内的几乎任何RNA种类的丰度均可以改变或干扰。Thus, ribozymes are usually designed to cleave virtually any mRNA sequence and cells are usually transformed with DNA encoding such ribozyme sequences so that controlled and catalytically effective amounts of the ribozymes are expressed. Thus, the abundance of almost any RNA species within a cell can be altered or disturbed.

以与描述用于反义核苷酸基本上相同的方式可以改变核酶序列，例如核酶序列可以包含修饰的碱基部分。The ribozyme sequence may be altered in substantially the same manner as described for antisense nucleotides, eg, the ribozyme sequence may comprise modified base moieties.

也可以将RNA适体导入细胞或者在细胞内表达以改变RNA的丰度或活性。RNA适体是蛋白质的特异RNA配体，所述配体能够特异抑制所述蛋白质的翻译，例如Tat和Rev RNA(Good等，1997，Gene Therapy 4：45-54)。RNA aptamers can also be introduced into cells or expressed within cells to alter the abundance or activity of RNA. RNA aptamers are specific RNA ligands for proteins that are capable of specifically inhibiting the translation of said proteins, such as Tat and Rev RNA (Good et al., 1997, Gene Therapy 4:45-54).

还可以使用常规的双链RNA技术实现基因表达的基因特异抑制。对此种技术的描述可以在WO 99/32619中找到，其公开全文引用作为参考。此外，已经证实siRNA技术可用作为抑制基因表达的方法(Cullen，BR Nat.Immunol.2002，7月；3(7)：597-9)。Gene-specific inhibition of gene expression can also be achieved using conventional double-stranded RNA technology. A description of this technique can be found in WO 99/32619, the disclosure of which is incorporated by reference in its entirety. Furthermore, siRNA technology has been demonstrated as a method to inhibit gene expression (Cullen, BR Nat. Immunol. 2002, Jul;3(7):597-9).

本发明的反义分子、三股螺旋DNA、RNA适体和核酶可以通过本领域已知的用于核酸分子合成的任何方法进行制备。这些技术包括用于化学合成寡核苷酸的技术如固相亚磷酸胺化学合成。备选地，可以通过编码文中所讨论多肽基因的DNA序列的体外和体内转录产生RNA分子。此类DNA序列可以整合到具有适当RNA聚合酶启动子例如T7或SP6的多种载体中。备选地，可以将组成型或诱导性合成反义RNA的cDNA构建体导入到细胞系、细胞或者组织中。Antisense molecules, triple-helix DNA, RNA aptamers and ribozymes of the present invention can be prepared by any method known in the art for the synthesis of nucleic acid molecules. These techniques include those used in the chemical synthesis of oligonucleotides such as solid phase phosphite chemical synthesis. Alternatively, RNA molecules can be produced by in vitro and in vivo transcription of DNA sequences encoding the polypeptide genes discussed herein. Such DNA sequences can be integrated into a variety of vectors with appropriate RNA polymerase promoters such as T7 or SP6. Alternatively, cDNA constructs for constitutive or inducible synthesis of antisense RNA can be introduced into cell lines, cells or tissues.

除了上述的用于抑制CREAP表达的方法外，文中还设想人们可以鉴定和采用小分子或其它天然产物以抑制文中所讨论多肽的体内转录。例如，本领域技术人员使用常规的方法可以建立CREAP1、CREAP2或者CREAP3的测定法，所述测定法可应用于从细胞系的培养得到的样本。使用该测定法，可以对细胞系进行筛选以找出表达目的CREAP蛋白质的细胞系。这些细胞系可以培养于如96孔板中。对基因表达的一些已知调节物例如地塞米松、佛波酯、热休克对原代组织培养物和细胞系影响的比较可以选择出最适合使用的细胞系。筛选将仅仅为在添加到每个孔中的不同化合物存在的条件下将细胞培养一定长度的时间和然后测定CREAP活性/mRNA水平。In addition to the methods described above for inhibiting CREAP expression, it is contemplated that one can identify and employ small molecules or other natural products to inhibit in vivo transcription of the polypeptides discussed herein. For example, one skilled in the art can establish assays for CREAP1 , CREAP2 or CREAP3 using routine methods, which assays can be applied to samples obtained from the culture of cell lines. Using this assay, cell lines can be screened for those expressing the CREAP protein of interest. These cell lines can be cultured, for example, in 96-well plates. Comparison of the effects of known regulators of gene expression such as dexamethasone, phorbol esters, heat shock on primary tissue cultures and cell lines allows selection of the most suitable cell line for use. Screening will simply be culturing cells for a certain length of time in the presence of different compounds added to each well and then measuring CREAP activity/mRNA levels.

为了在上述的测定法中便于检测CREAP，可以将萤光素酶或其它商业可得的荧光蛋白质作为适宜的标志蛋白质基因融合到CREAP1、CREAP2或者CREAP3的启动子。通过使用GenBank登录号NM_025021的序列对NCBI基因组序列(当前CREAP1的基因组重叠群序列的GenBank登录号是NT_011295)进行BLAST分析，可以从基因组序列数据鉴定出例如CREAP1启动子的ATG上游的序列。结果给出了CREAP1的ATG上游的不包含任何未知碱基的至少5kb的序列。可以容易地设计并测试能够从人类基因组DNA扩增2kb或更长片段的两对嵌套式PCR引物。可以将启动子片段容易地插入到经设计用于在人细胞中表达的任意无启动子的报告基因载体中(例如Clontech无启动子增强荧光蛋白载体pECFP-1、pEGFP-1或pEYFP，Clontech，Palo Alto，CA)。筛选将仅仅为在添加到每个孔中的不同化合物存在下将细胞培养适当长度时间，然后测定报道基因活性。使用先前所描述的病理状况的体内模型测定有希望的化合物对CREAP1活性和/或体内mRNA水平的影响。另外的方法详细描述了诸如适当培养时间、培养条件、报告子分析并且其它能够用于鉴定可用于抑制体内CREAP蛋白质转录的小分子或其它天然产物的方法学将是本领域技术人员所熟悉的。To facilitate the detection of CREAP in the assays described above, luciferase or other commercially available fluorescent proteins can be genetically fused to the promoters of CREAP1, CREAP2 or CREAP3 as appropriate marker proteins. Sequences such as those upstream of the ATG of the CREAP1 promoter can be identified from the genomic sequence data by BLAST analysis of the NCBI genome sequence (current GenBank accession number of the genomic contig sequence of CREAP1 is NT_011295) using the sequence of GenBank accession number NM_025021. The results give a sequence of at least 5 kb upstream of the ATG of CREAP1 that does not contain any unknown bases. Two pairs of nested PCR primers capable of amplifying fragments of 2 kb or longer from human genomic DNA can be easily designed and tested. The promoter fragment can be easily inserted into any promoter-less reporter gene vector designed for expression in human cells (e.g. Clontech Promoter-less Enhanced Fluorescent Protein Vectors pECFP-1, pEGFP-1 or pEYFP, Clontech, Palo Alto, CA). Screening would simply be incubating the cells for the appropriate length of time in the presence of the different compounds added to each well and then assaying for reporter gene activity. The effect of promising compounds on CREAP1 activity and/or mRNA levels in vivo is determined using previously described in vivo models of pathological conditions. Additional methodologies such as appropriate culture times, culture conditions, reporter assays and other methodologies that can be used to identify small molecules or other natural products that can be used to inhibit CREAP protein transcription in vivo will be familiar to those skilled in the art.

此外，编码CREAP蛋白质的cDNA和/或CREAP蛋白质自身能够用于鉴定其它蛋白质，例如激酶、蛋白酶或转录因子，在级联反应中所述其它蛋白质由CREAP蛋白质修饰或间接激活的。如此所鉴定的蛋白质能够用于例如药物筛选以治疗文中所述的病理状况。为了鉴定CREAP蛋白质下游的这些基因，设想例如技术人员能够使用常规方法用特异CREAP抑制剂治疗疾病状态模型动物，处死这些动物，回收相关组织并从这些细胞分离总RNA并应用标准微阵列分析技术鉴定相对对照动物(未施用药物的动物)改变的信号水平。Furthermore, the cDNA encoding the CREAP protein and/or the CREAP protein itself can be used to identify other proteins, such as kinases, proteases or transcription factors, that are modified or indirectly activated by the CREAP protein in a cascade of reactions. The proteins thus identified can be used, for example, in drug screening to treat the pathological conditions described herein. In order to identify these genes downstream of the CREAP protein, it is envisioned, for example, that a skilled artisan could use routine methods to treat disease state model animals with specific CREAP inhibitors, sacrifice the animals, recover relevant tissues and isolate total RNA from these cells and identify them using standard microarray analysis techniques. Altered signal levels relative to control animals (animals not administered drug).

另外，常规体外或体内测定可以用于鉴定导致CREAP蛋白质过表达的可能基因。如此鉴定的基因所编码的这些相关调节蛋白质能够用于筛选可能是文中所述病理状况治疗的有效治疗剂的药物。例如，可以使用常规报告基因分析，其中CREAP蛋白质启动子区域置于报告基因的上游，构建体转染进适宜细胞(例如来自ATCC，Manassas，VA)并且使用常规技术，通过检测报告基因的表达分析引起CREAP启动子激活的上游基因。Additionally, conventional in vitro or in vivo assays can be used to identify possible genes responsible for overexpression of CREAP protein. These associated regulatory proteins encoded by the genes so identified can be used to screen for drugs that may be effective therapeutic agents for the treatment of the pathological conditions described herein. For example, conventional reporter gene assays can be used, wherein the CREAP protein promoter region is placed upstream of the reporter gene, the construct is transfected into suitable cells (eg, from ATCC, Manassas, VA) and assayed by detecting expression of the reporter gene using conventional techniques. Upstream genes that cause activation of the CREAP promoter.

文中还设想技术人员可以按照常规方法通过设计例如CREAP蛋白质的抗体或肽模拟物和/或设计靶向此类蛋白质的基因的抑制性反义寡核苷酸、三股螺旋DNA、核酶、siRNA、双链或单链RNA和RNA适体，抑制相关调节蛋白质或CREAP蛋白质所修饰的蛋白质的基因的功能和/或表达来作为治疗文中所述病理状况的途径。还设想用于治疗所述病理状况的包含此类抑制性物质的药物组合物。It is also conceived herein that the skilled person can follow conventional methods by designing, for example, antibodies or peptidomimetics of CREAP proteins and/or designing inhibitory antisense oligonucleotides, triple-helix DNA, ribozymes, siRNA, Double-stranded or single-stranded RNA and RNA aptamers, inhibit the function and/or expression of genes of related regulatory proteins or proteins modified by CREAP proteins as a way to treat the pathological conditions described herein. Pharmaceutical compositions comprising such inhibitory substances for use in the treatment of said pathological conditions are also contemplated.

本发明另外的实施方案涉及将药物组合物与可药用载体、赋形剂或稀释剂联合施用以治疗文中所述的任意病理状况。此类药物组合物可以包含CREAP蛋白质或其片段针对CREAP多肽或肽段的抗体、肽模拟物、和/或CREAP调节子(例如激动剂、拮抗剂或CREAP表达和/或功能抑制剂)。这些组合物可以单独施用或与至少一种其它试剂例如稳定化合物联合施用，该组合物可以在包括但不限于盐水、缓冲盐水、右旋糖和水的任意无菌生物相容性药物载体中施用。组合物可以单独或与其它试剂、药物或激素联合施用于患者。Additional embodiments of the present invention relate to the administration of pharmaceutical compositions in combination with a pharmaceutically acceptable carrier, excipient or diluent for the treatment of any of the pathological conditions described herein. Such pharmaceutical compositions may comprise antibodies, peptidomimetics, and/or CREAP modulators (eg, agonists, antagonists, or inhibitors of CREAP expression and/or function) of the CREAP protein or fragment thereof directed against a CREAP polypeptide or peptide fragment. These compositions can be administered alone or in combination with at least one other agent, such as a stabilizing compound, and the compositions can be administered in any sterile biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water . Compositions may be administered to a patient alone or in combination with other agents, drugs or hormones.

当本领域技术人员认为在医学上有利时，例如其中CREAP功能激动剂对例如神经变性疾病如阿尔茨海默氏病、帕金森氏病和亨廷顿病具有治疗作用的情况下可以施用包含CREAP蛋白质或其片段的药物组合物。根据本发明使用的此类药物组合物可以以常规方式使用一种或多种生理可接受载体或赋形剂形成制剂。When those skilled in the art deem it to be medically advantageous, for example, where a CREAP functional agonist has a therapeutic effect on, for example, neurodegenerative diseases such as Alzheimer's disease, Parkinson's disease and Huntington's disease, it may be administered comprising a CREAP protein or Pharmaceutical composition of its fragments. Such pharmaceutical compositions for use in accordance with the present invention may be formulated in conventional manner using one or more physiologically acceptable carriers or excipients.

文中所公开的用于预防、治疗或缓解与CRE-依赖基因异常表达或趋化因子异常激活相关病理状况的药物组合物可以以治疗有效量施用于患者。治疗有效量是指足以预防、治疗或缓解所述状况的化合物的量。The pharmaceutical compositions disclosed herein for preventing, treating or alleviating pathological conditions associated with abnormal expression of CRE-dependent genes or abnormal activation of chemokines can be administered to patients in a therapeutically effective amount. A therapeutically effective amount refers to that amount of the compound sufficient to prevent, treat or ameliorate the condition.

化合物及其生理可用盐和溶剂化物可以配制以通过吸入或吹入(通过口或鼻)或经局部、口、口含、肠胃外或直肠施用。The compounds and their physiologically acceptable salts and solvates may be formulated for administration by inhalation or insufflation (through the mouth or nose) or topically, orally, buccally, parenterally or rectally.

对于经口施用，药物组合物可以采取例如片剂或胶囊的形式，这些片剂或胶囊是通过常规方法由可药用赋形剂如粘合剂(例如预胶化的玉米淀粉、聚乙烯吡咯烷酮、或羟丙基甲基纤维素)；填充剂(例如乳糖、微晶纤维素或磷酸氢钙)；润滑剂(例如硬脂酸镁、滑石粉或二氧化硅)；崩解剂(例如马铃薯淀粉或淀粉羟乙酸钠)或者湿润剂(例如十二烷基硫酸钠)制备的。这些片剂可以通过本领域众所周知的方法进行包衣。用于经口施用的液体制剂可以采用例如溶液剂、糖浆剂或悬浮剂的形式，或者它们可以呈现为在使用前用水或其它适当载体构成的干燥产物。此类液体制剂可以通过常规方法使用可药用添加剂如悬浮剂(例如山梨醇糖浆、纤维素衍生物或氢化食用脂)；乳化剂(例如卵磷脂或阿拉伯胶)；非水性载体(例如杏仁油、油酯、乙醇或分馏植物油)和防腐剂(例如对羟基苯甲酸甲酯、对羟基苯甲酸丙酯或者山梨酸)进行制备。这些制剂还可以包含适当的缓冲盐、增香剂、着色剂和甜味剂。For oral administration, pharmaceutical compositions may take the form of, for example, tablets or capsules, which are conventionally prepared with pharmaceutically acceptable excipients such as binders (for example, pregelatinized cornstarch, polyvinylpyrrolidone, , or hydroxypropylmethylcellulose); fillers (such as lactose, microcrystalline cellulose, or dibasic calcium phosphate); lubricants (such as magnesium stearate, talc, or silicon dioxide); disintegrants (such as potato starch or sodium starch glycolate) or a humectant (such as sodium lauryl sulfate). These tablets may be coated by methods well known in the art. Liquid preparations for oral administration may take the form of, for example, solutions, syrups or suspensions, or they may be presented as a dry product for constitution with water or other suitable vehicle before use. Such liquid preparations can be prepared by conventional methods using pharmaceutically acceptable additives such as suspending agents (such as sorbitol syrup, cellulose derivatives or hydrogenated edible fats); emulsifying agents (such as lecithin or acacia); non-aqueous vehicles (such as almond oil). , oily esters, ethanol or fractionated vegetable oils) and preservatives (such as methyl paraben, propyl paraben or sorbic acid). These preparations may also contain suitable buffer salts, flavouring, coloring and sweetening agents.

经口施用的制剂可以适当配制以控释活性化合物。Preparations for oral administration may be suitably formulated so as to provide controlled release of the active compound.

对于口含施用，组合物可以采用常规方法配制的片剂或锭剂的形式。For buccal administration, the compositions may take the form of tablets or lozenges formulated in conventional manner.

对于通过吸入施用，根据本发明使用的化合物以包装于增压袋或喷雾器中的气溶胶喷剂的形式，使用适当的推进剂例如二氯二氟甲烷、三氯氟甲烷、二氯四氟乙烷、二氧化碳或其它适当气体进行常规递送。对于增压气溶胶，剂量单位可以通过阀门递送计量量进行确定。在吸入器或吹入器中使用的例如明胶的胶囊和药筒可以配制成包含所述化合物和适当粉末基质如乳糖或淀粉的粉末混合物。For administration by inhalation, the compounds used according to the invention are in the form of aerosol sprays packed in pressurized bags or nebulisers, using suitable propellants such as dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethylene Alkanes, carbon dioxide, or other appropriate gases are routinely delivered. For pressurized aerosols, the dosage unit may be determined by a valve delivering a metered amount. Capsules and cartridges of eg gelatin for use in an inhaler or insufflator may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch.

化合物可以配制成通过注射，例如通过快速浓注注射或连续输注进行胃肠外施用。用于注射的制剂可以以加入防腐剂的单位剂量形式，例如在安瓿瓶内或在多剂量容器内的单位剂量形式存在。组合物可以采用诸如油或水性载体的悬浮剂、溶液剂或乳剂形式，并且可以包含配制剂例如悬浮剂、稳定剂和/或分散剂。备选地，活性成分可以是粉剂形式，其使用前与适当载体如无菌、无致热原水构成。The compounds may be formulated for parenteral administration by injection, eg, by bolus injection or continuous infusion. Formulations for injection may be presented in unit dosage form, eg, in ampoules or in multidose containers, with an added preservative. The compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the active ingredient may be in powder form for constitution with a suitable vehicle, eg sterile, pyrogen-free water, before use.

化合物还可以配制成直肠组合物如栓剂或滞留型灌肠剂，例如包含常规栓剂基质如可可脂或其它甘油酯。The compounds may also be formulated in rectal compositions such as suppositories or retention enemas, eg, containing conventional suppository bases such as cocoa butter or other glycerides.

除了先前所述的制剂，化合物还可以配制成长效制剂。此类长效制剂可以通过植入(例如皮下或肌内)或肌内注射施用。因此，例如，化合物可以使用适当聚合的或疏水物质(例如可用油中的乳剂)或离子交换树脂配制，或配制成微溶衍生物，例如微溶盐。In addition to the formulations described previously, the compounds may also be formulated as depot formulations. Such long-acting formulations may be administered by implantation (eg, subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the compounds may be formulated using suitable polymeric or hydrophobic materials (eg, as emulsions in oils) or ion exchange resins, or as sparingly soluble derivatives, eg, sparingly soluble salts.

如果期望，组合物可以存在于包装或分配装置内，它们可以包含含有活性成分的一种或多种剂型。例如包装可以包含金属或塑料箔，如水泡眼包装。包装或分配装置可以带有施用说明书。The compositions may, if desired, be presented in a pack or dispenser device which may contain one or more dosage forms containing the active ingredient. For example packaging may contain metal or plastic foil, such as blister packs. The pack or dispenser device may be accompanied by instructions for administration.

适于用于本发明的药物组合物包括组合物，其中包含有效量活性成分以实现预期目的。有效量的确定是在本领域技术人员能力范围之内。Pharmaceutical compositions suitable for use in the present invention include compositions containing the active ingredient in an effective amount to achieve the intended purpose. Determination of an effective amount is within the ability of those skilled in the art.

对于任意化合物，治疗有效剂量可以最初在如赘生细胞的细胞培养测定中或者动物模型中，通常为小鼠、兔、狗或猪的动物模型中进行估计。动物模型还可以用于确定适当浓度范围和施用途径。剂量在动物模型中确定以获得包括IC50(即实现症状半最大抑制的测试化合物浓度)的循环血浆浓度范围。然后此类信息可以用于确定在人类中的有用剂量和施用途径。For any compound, the therapeutically effective dose can be estimated initially in cell culture assays such as neoplastic cells or in animal models, typically mice, rabbits, dogs or pigs. Animal models can also be used to determine appropriate concentration ranges and routes of administration. A dose is determined in animal models to obtain a circulating plasma concentration range that includes the IC50 (ie, the concentration of the test compound which achieves a half-maximal inhibition of symptoms). Such information can then be used to determine useful doses and routes of administration in humans.

治疗有效量是指用于预防、治疗或缓解特定目的病理状况的活性成分的量。通过细胞培养物或实验动物中的标准药学方法，例如ED50(50％群体的治疗有效剂量)和LD50(50％群体的致死剂量)，可以确定疗效和毒性。毒性和疗效之间的剂量比率是治疗指数，并且该指数能够表示为比率LD50/ED50。呈现大治疗指数的药物组合物是优选的。从细胞培养物测定和动物研究获得的数据用于确定人类使用的剂量范围。包含于此类组合物中的剂量优选地是在包括ED50的循环浓度范围之内并具有很少或无毒性的剂量。在该范围内剂量的改变依赖于所用剂型、患者敏感性和施用途径。A therapeutically effective amount refers to the amount of active ingredient used to prevent, treat or alleviate the specified pathological condition of interest. Efficacy and toxicity can be determined by standard pharmaceutical methods in cell culture or experimental animals, eg, ED50 (the dose therapeutically effective in 50% of the population) and LD50 (the dose lethal to 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and this index can be expressed as the ratio LD50/ED50. Pharmaceutical compositions exhibiting large therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are used to determine a range of dosage for human use. The dosage contained in such compositions lies preferably within a range of circulating concentrations that include the ED50 with little or no toxicity. Dosage variations within this range are dependent upon the dosage form employed, patient sensitivity and the route of administration.

医生将按照与需要治疗的患者相关的因素确定确切剂量。调节剂量和施用以提供足够水平的活性部分或维持预期效果。可以考虑的因素包括疾病状况的严重性、受试者的一般健康、年龄、体重、和受试的性别、饮食、施用时间和频率、药物联合、反应敏感性和对治疗的耐受性/应答。依赖于特定制剂的半寿期和清除速度，长效药物组合物可以每3或4天、每周或两周一次进行施用。The exact dosage will be determined by the physician according to factors associated with the patient in need of treatment. Dosage and administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. Factors that may be considered include severity of the disease condition, general health of the subject, age, weight, and sex of the subject, diet, time and frequency of administration, drug combination, reaction sensitivity, and tolerance/response to treatment . Long-acting pharmaceutical compositions may be administered every 3 or 4 days, weekly or biweekly, depending on the half-life and clearance rate of the particular formulation.

依赖于施用途径，正常剂量可以为0.1到100,000微克，最多总剂量约1g。在文献中提供了关于特定剂量和递送方法的指导并且本领域的从业者通常可获得。本领域技术人员采用与蛋白质或它们的抑制剂不同的核苷酸制剂。同样，多核苷酸或多肽的递送对于特定细胞、状况、位置等等将是特异的。适于蛋白质经口施用的药物制剂描述于例如美国专利5,008,114；5,505,962；5,641,515；5,681,811；5,700,486；5,766,633；5,792,451；5,853,748；5,972,387；5,976,569和6,051,561。Depending on the route of administration, normal doses may range from 0.1 to 100,000 micrograms, up to a total dose of about 1 g. Guidance as to particular dosages and methods of delivery is provided in the literature and generally available to practitioners in the art. Those skilled in the art employ different formulations of nucleotides than proteins or their inhibitors. Likewise, delivery of polynucleotides or polypeptides will be specific to a particular cell, condition, location, and the like. Pharmaceutical formulations suitable for oral administration of proteins are described, for example, in US Patents 5,008,114; 5,505,962; 5,641,515; 5,681,811;

下面的实施例进一步阐明本发明并且无意限制本发明。The following examples further illustrate the invention and are not intended to limit the invention.

下面的材料和方法用于进行下面实施例1-5：The following materials and methods were used to carry out Examples 1-5 below:

一组人全长cDNA克隆的集合A collection of human full-length cDNA clones

我们已经存档并对来自由33种人组织类型mRNA制备的多个高质量全长cDNA文库的约170,000个克隆在5’末端进行了测序。使用专有的生物信息学方法，我们已经鉴定了具有ORF起始ATG密码子的所有cDNA克隆，其中所述的ORF是通过实验确定的或者概念上预测的，并且这些cDNA克隆因此可能代表全长转录物。使用Q-bot(Genetix Limited，Hampshire，英国)将来自存档克隆系列的pCMVSport6载体(Invitrogen，Carlsbad，CA)中的总共20,702个克隆重新排列到含有60μl LB培养基(LB)的384-孔Genetix板中。基于对这20,702个克隆5’序列的生物信息学分析，它们来源于约11,000个基因，这些基因的结构与存在具有强有力证据，尽管它们中的大部分没有功能，并且6,000个潜在的新序列还不存在于公共cDNA数据库中。We have archived and sequenced approximately 170,000 clones at the 5' end from multiple high-quality full-length cDNA libraries prepared from mRNA from 33 human tissue types. Using proprietary bioinformatics methods, we have identified all cDNA clones with ORF initiation ATG codons that were either experimentally determined or conceptually predicted and thus likely represent full-length transcript. A total of 20,702 clones in the pCMVSport6 vector (Invitrogen, Carlsbad, CA) from archived cloning series were rearranged into 384-well Genetix plates containing 60 μl LB medium (LB) using Q-bot (Genetix Limited, Hampshire, UK) middle. Based on bioinformatic analysis of the 5' sequences of these 20,702 clones, they were derived from approximately 11,000 genes for which there was strong evidence for structure and existence, although most of them were not functional, and 6,000 potential novel sequences Not yet present in public cDNA databases.

将阵列化的克隆复制以产生用于存档的多拷贝。一个拷贝用于使用QIAGEN BioRobot 8000(Qiagen，Valencia，CA)产生小量制备DNA。将DNA样品洗脱到96孔UV板(Corning，Acton，MA)中并且通过在SPECTRAmax 190(Molecular Devices，Sunnyvale，CA)上测定OD260值确定DNA样品浓度和产量。然后将所得到的20,702个DNA样品在384孔板中等分试样以产生多拷贝用于存档(在TE缓冲液中80pg/孔)和用于基于细胞的测定(在OPTI-MEM细胞培养基(Invitrogen)中50ng/孔)。Arrayed clones are replicated to create multiple copies for archiving. One copy was used to generate miniprep DNA using a QIAGEN BioRobot 8000 (Qiagen, Valencia, CA). DNA samples were eluted into 96-well UV plates (Corning, Acton, MA) and DNA sample concentrations and yields were determined by measuring OD260 values on a SPECTRAmax 190 (Molecular Devices, Sunnyvale, CA). The resulting 20,702 DNA samples were then aliquoted in 384-well plates to generate multiple copies for archiving (80 pg/well in TE buffer) and for cell-based assays (in OPTI-MEM cell culture medium ( 50ng/well in Invitrogen).

将板密封并保存于-20℃。Plates were sealed and stored at -20°C.

对环AMP效应元件的激活子进行基因组范围的筛选A genome-wide screen for activators of cyclic AMP response elements

将生长于225ml组织培养瓶中的Hela细胞(ATCC，Manassas，VA)用胰酶消化并在DMEM培养基(Invitrogen)中稀释到10⁵个细胞/ml。然后将细胞悬浮液用Multi-drop 384(Thermo Labsystems，Beverly，MA)以30μl/孔分配到384孔组织培养板。孵育过夜后，将由0.25μl Fugene 6转染试剂(Roche Applied Biosciences)、6μl含有50ng pCRE-Luc质粒构建体(Stratagene)和50ng来自克隆集合的各个cDNA质粒的OPTI-MEM培养基的混合物用Biomek FX液体处理机器人(Beckman Coulter)加入到384孔板的每个孔中。转染后40小时，使用BrightGlo Luciferase Assay System(Promega，Madison，WI)在LUMINOSKAN Ascent发光计(ThermoLabsystems)按照制造商的方法测定每个孔中的萤光素酶活性。使用专门设计用于处理高通量基因功能化项目的室内数据处理和分析系统对原始萤光素酶数据进行处理。整个分析一式两份进行以产生41,404个数据点，每个数据点对应在单个孔中使用单个cDNA克隆的微型化转染实验。Hela cells (ATCC, Manassas, VA) grown in 225 ml tissue culture flasks were trypsinized and diluted to ¹⁰⁵ cells/ml in DMEM medium (Invitrogen). The cell suspension was then dispensed into 384-well tissue culture plates at 30 μl/well using a Multi-drop 384 (Thermo Labsystems, Beverly, MA). After overnight incubation, a mixture consisting of 0.25 μl of Fugene 6 transfection reagent (Roche Applied Biosciences), 6 μl of OPTI-MEM medium containing 50 ng of pCRE-Luc plasmid construct (Stratagene) and 50 ng of each cDNA plasmid from the clone pool was incubated with Biomek FX A liquid handling robot (Beckman Coulter) was added to each well of a 384-well plate. Forty hours after transfection, luciferase activity in each well was assayed using the BrightGlo Luciferase Assay System (Promega, Madison, WI) in a LUMINOSKAN Ascent Luminometer (ThermoLabsystems) following the manufacturer's protocol. Raw luciferase data were processed using an in-house data processing and analysis system specifically designed to handle high-throughput gene functionalization projects. The entire analysis was performed in duplicate to generate 41,404 data points, each corresponding to a miniaturized transfection experiment using a single cDNA clone in a single well.

HTS采样点证实和确认HTS Sampling Point Confirmation and Validation

对于每组一式两份的20,702个数据点，计算了Z得分(计算为激活倍数除以群体标准差)和针对群体中位数的激活倍数并且储存于带注释的可检索数据库中。基于下面2条标准选择了潜在的激活剂：(1)在任一测定中Z得分大于3.0和(2)在两个测定中萤光素酶/中位数的增加倍数大于8.0。基于上述标准鉴定出总共85个克隆(总克隆的0.4％)。从克隆档案对这些采样点的DNA样品进行检索并且将DNA样品重新转化入细菌菌株XL-10Gold(Stratagene)。挑取每个样品的各个克隆并且进行DNA小量制备。对部分小量制备的DNA样品从5’末端测序以进行克隆验证。剩余的样品用于采样点确认，其中将它们与pCRE-Luc报告子构建体和编码处于SV40早期启动子控制下的海肾(kenilla)萤光素酶的pRL-SV40质粒(Promega)一起手工转染入Hela细胞，随后按照制造商的建议进行双萤光素酶测定(Promega)。For each group of 20,702 data points in duplicate, Z-scores (computed as fold activation divided by population standard deviation) and fold activation against population median were calculated and stored in an annotated searchable database. Potential activators were selected based on the following 2 criteria: (1) Z-score greater than 3.0 in either assay and (2) fold increase in luciferase/median greater than 8.0 in both assays. A total of 85 clones (0.4% of total clones) were identified based on the above criteria. DNA samples from these sampling points were retrieved from the clone archive and retransformed into bacterial strain XL-10Gold (Stratagene). Individual clones for each sample were picked and DNA minipreps were performed. A portion of the miniprep DNA sample was sequenced from the 5' end for clonal verification. The remaining samples were used for sampling site confirmation where they were hand-transformed with the pCRE-Luc reporter construct and the pRL-SV40 plasmid (Promega) encoding renilla (kenilla) luciferase under the control of the SV40 early promoter. Hela cells were transfected, followed by a dual luciferase assay (Promega) following the manufacturer's recommendations.

RNA印迹分析和体外转录和翻译分析Northern blot analysis and in vitro transcription and translation analysis

使用EcoRI和NotI消化含有CREAP1 cDNA的pCMVSport6质粒，使用Qiagen DNA凝胶提取试剂盒将插入片段进行凝胶纯化并使用Enzo随机引物标记系统按照出售商的手册(Bio-11-dCTP脱氧核苷酸包装，目录号42723，Enzo Biochem，Farmingdale，NY)进行标记。简而言之，将200ng CREAP1片段或100ngβ肌动蛋白cDNA(Clontech)在100℃变性10分钟，冰上冷却3-5分钟，并且然后与5μl 10x六聚体随机引物、5μldCTP-11-Bio混合物和1μl Klenow片段混合并在37℃孵育4小时。根据所建议的方案，将探针与多组织mRNA RNA印迹膜(Clontech)杂交。利用生物素检测试剂盒(Ambion，Austin，TX)实现信号检测。将膜曝光于X-光片10-30秒。在首次曝光后，将膜去除探针并用β肌动蛋白探针(Clontech)进行重杂交以标准化表达水平。The pCMVSport6 plasmid containing CREAP1 cDNA was digested with EcoRI and NotI, the insert was gel-purified using the Qiagen DNA Gel Extraction Kit and Enzo random primer labeling system was used according to the vendor's manual (Bio-11-dCTP deoxynucleotide package , Cat. No. 42723, Enzo Biochem, Farmingdale, NY) for labeling. Briefly, 200 ng of CREAP1 fragment or 100 ng of β-actin cDNA (Clontech) were denatured at 100 °C for 10 min, cooled on ice for 3-5 min, and then mixed with 5 μl of 10x hexamer random primers, 5 μl of CTP-11-Bio mix Mix with 1 μl Klenow fragments and incubate at 37°C for 4 hours. Probes were hybridized to multi-tissue mRNA Northern blot membranes (Clontech) according to the suggested protocol. Signal detection was achieved using a biotin detection kit (Ambion, Austin, TX). Expose the film to X-ray film for 10-30 seconds. After the first exposure, membranes were deprobed and rehybridized with a β-actin probe (Clontech) to normalize expression levels.

使用TNT SP6 Quick Coupled Transcription and Translation System(Promega)根据销售商的手册进行CREAP1蛋白质的体外转录和翻译。将翻译产物在Nupage预制凝胶(4-20％)(Invitrogen)中分离，转移至硝酸纤维素膜并用Transcend非放射性检测系统(Promega)根据制造商的说明书进行检测。In vitro transcription and translation of CREAP1 protein was performed using TNT SP6 Quick Coupled Transcription and Translation System (Promega) according to the vendor's manual. Translation products were separated on Nupage precast gels (4-20%) (Invitrogen), transferred to nitrocellulose membranes and detected with the Transcend non-radioactive detection system (Promega) according to the manufacturer's instructions.

CREAP1-CREB信号途径分析Analysis of CREAP1-CREB signaling pathway

对于体内激酶测定，使用与酵母GAL4 DNA结合结构域(氨基酸1-147)构建体融合的CREB或ATF2转录因子的激活结构域(Stratagene，PathDetect体内信号转导途径反式报告系统)。按照制造商的方法(Stratagene)使用了含有5×GAL4 DNA结合元件和TATA盒驱动萤光素酶报道基因的HLR细胞系。将10⁴个HLR细胞分散于96孔组织培养板的每个孔中。16小时后，分别用100ng Creb-GAL4或ATF2-GAL4融合构建体、30ng海肾萤光素酶对照质粒与100ng pCMVSPORT6、pCMVSPORT-CREAP1、pFC-PKA或pFC-MEKK(Stratagene)激活子质粒转染细胞。使用Fugene6试剂(Roche Molecular Biochemicals，Basel，瑞士)按照制造商的手册进行转染。转染40小时后，使用制造商的方法进行Dual-Glo萤光素酶测定(Promega)。For in vivo kinase assays, activation domains of CREB or ATF2 transcription factors (Stratagene, PathDetect In Vivo Signal Transduction Pathway Trans Reporter System) fused to yeast GAL4 DNA binding domain (amino acids 1-147) constructs were used. An HLR cell line containing a 5 x GAL4 DNA binding element and a TATA box driven luciferase reporter gene was used according to the manufacturer's protocol (Stratagene). Disperse ¹⁰⁴ HLR cells in each well of a 96-well tissue culture plate. After 16 hours, transfection with 100 ng of Creb-GAL4 or ATF2-GAL4 fusion construct, 30 ng of Renilla luciferase control plasmid, and 100 ng of pCMVSPORT6, pCMVSPORT-CREAP1, pFC-PKA or pFC-MEKK (Stratagene) activator plasmid, respectively cell. Transfection was performed using Fugene6 reagent (Roche Molecular Biochemicals, Basel, Switzerland) according to the manufacturer's manual. Forty hours after transfection, a Dual-Glo luciferase assay (Promega) was performed using the manufacturer's protocol.

对于显性失活CREB测定，使用了CREB显性失活构建体(非磷酸化的S133A突变体或DNA结合结构域K287L突变体K-Creb)(Clontech，目录号K6014-1)。根据制造商对上面的转染和萤光素酶测定方法进行了一些改良。Hela细胞、pCMVSPORT6、pCMV-CREAP1、pS133A-Creb或pK-Creb构建体用于转染。For the dominant negative CREB assay, CREB dominant negative constructs (non-phosphorylated S133A mutant or DNA binding domain K287L mutant K-Creb) (Clontech, cat. no. K6014-1 ) were used. The transfection and luciferase assay methods above were modified with some modifications according to the manufacturer. HeLa cells, pCMVSPORT6, pCMV-CREAP1, pS133A-Creb or pK-Creb constructs were used for transfection.

CREAP1蛋白质缺失的功能分析Functional analysis of CREAP1 protein deletion

利用本领域技术人员熟悉的PCR策略将CREAP1蛋白质氨基酸1-170、1-356、1-494、1-580和170-650插入到pFlag-CMV4表达载体(Sigma，St.Louis，MO)。Amino acids 1-170, 1-356, 1-494, 1-580 and 170-650 of the CREAP1 protein were inserted into the pFlag-CMV4 expression vector (Sigma, St. Louis, MO) using a PCR strategy familiar to those skilled in the art.

将10⁴个Hela细胞分散于96孔组织培养板的每个孔中。16小时后，分别用100ng pCRE-Luc报告子构建体、30ng海肾萤光素酶对照质粒与100ng pCMVSPORT6、pCMVSPORT-CREAP1和不同的Flag-CREAP1缺失融合构建体转染细胞。使用Fugene6试剂(Roche AppliedBiosciences)按照制造商的用法说明进行转染。转染40小时后进行Dual-Glo萤光素酶测定(Promega)。萤火虫萤光素酶计数对海肾萤光素酶进行标准化并作图。Disperse 10 ⁴ Hela cells in each well of a 96-well tissue culture plate. After 16 hours, cells were transfected with 100 ng pCRE-Luc reporter construct, 30 ng Renilla luciferase control plasmid, and 100 ng pCMVSPORT6, pCMVSPORT-CREAP1 and different Flag-CREAP1 deletion fusion constructs, respectively. Transfection was performed using Fugene6 reagent (Roche Applied Biosciences) following the manufacturer's instructions. A Dual-Glo luciferase assay (Promega) was performed 40 hours after transfection. Firefly luciferase counts were normalized and plotted against Renilla luciferase.

实施例1Example 1

对环AMP效应元件激活基因进行基因组范围的筛选 A genome-wide screen for cyclic AMP response element-activated genes

为了鉴定编码能够导致CRE激活的蛋白质的cDNA，我们筛选了带注释的和有索引的20,702个人cDNA克隆的集合，预测这些克隆能够代表微型化CRE-萤光素酶报告基因系统内的11,000-16,000个单个基因的全长转录物。一式两份地进行实验以产生41,404个数据点，每个数据点对应来自瞬时蛋白质过表达测定的萤光素酶活性，其中使用目的cDNA克隆和含有萤火虫萤光素酶基因的质粒对约3,000个Hela细胞进行瞬时转染。对两组数据的统计分析已经产生了一列85个克隆，这些克隆导致在2次一式两份的初步筛选实验中萤光素酶活性与群体中位数相比至少增加8倍。在随后第二次验证实验中，当回收这些克隆各个菌落并进行相似测定但使用SV40启动子控制下的海肾萤光素酶进行数据标准化时，确认了14个克隆(数据未显示)。所获得的采样点包括一种迄今功能未知的蛋白质，由Kazusa DNA研究所命名为KIAA0616(登录号：NM 025021)。基于我们对该蛋白质的功能分析，我们基于文中所描述的瞬时过表达萤光素酶报道基因分析系统中该蛋白质激活CRE的能力重新命名该蛋白质为CRE激活蛋白质1或“CREAP1”。To identify cDNAs encoding proteins that lead to CRE activation, we screened a collection of annotated and indexed 20,702 human cDNA clones predicted to represent 11,000-16,000 clones within the miniaturized CRE-luciferase reporter gene system. full-length transcripts of a single gene. Experiments were performed in duplicate to generate 41,404 data points, each corresponding to luciferase activity from transient protein overexpression assays using approximately 3,000 pairs of cDNA clones of interest and plasmids containing the firefly luciferase gene HeLa cells were transiently transfected. Statistical analysis of the two sets of data has yielded a list of 85 clones resulting in at least an 8-fold increase in luciferase activity compared to the population median in 2 primary screening experiments in duplicate. In a subsequent second validation experiment, when individual colonies of these clones were recovered and assayed similarly but using Renilla luciferase under the control of the SV40 promoter for data normalization, 14 clones were identified (data not shown). The obtained sampling sites included a protein of so far unknown function, named KIAA0616 by the Kazusa DNA Institute (accession number: NM 025021). Based on our functional analysis of this protein, we renamed this protein as CRE-activating protein 1 or "CREAP1" based on its ability to activate CRE in the transiently overexpressed luciferase reporter gene assay system described herein.

为了进一步定义CREAP1的途径或启动子特异性，在Hela细胞中在相似的分析系统中测试了一组多种启动子-萤光素酶构建体。这些构建体能够测试CREAP1激活CREB、NFAT和NFkB转录因子结合元件和IL-8、VCAM、IL-24和NPY的真正启动子的能力。此外，包括用于背景测试和特异性对照的3种萤光素酶载体。结果表明CREAP1是CRE特异激活子(数据未显示)。To further define the pathway or promoter specificity of CREAP1, a panel of various promoter-luciferase constructs was tested in a similar assay system in Hela cells. These constructs enable testing of the ability of CREAP1 to activate CREB, NFAT and NFkB transcription factor binding elements and the authentic promoters of IL-8, VCAM, IL-24 and NPY. Additionally, 3 luciferase vectors are included for background testing and specificity controls. The results indicated that CREAP1 is a CRE-specific activator (data not shown).

实施例2Example 2

CREAP1基因的DNA序列和氨其酸序列 DNA sequence and amino acid sequence of CREAP1 gene

按照常规的方法对活性CREAP1克隆的2.4kb cDNA插入片段的两条链进行测序。结果表明该基因编码区是1950个核苷酸并且氨基酸序列预测为650个氨基酸。生物信息学分析表明CREAP1除了在分子中间的氨基酸379-448的富含脯氨酸的结构域外不包含保守蛋白质功能结构域(例如激酶ATP结合结构域或者转录因子DNA结合结构域)。下面显示了DNA序列和氨基酸序列。Both strands of the 2.4 kb cDNA insert of the active CREAP1 clone were sequenced according to conventional methods. The results showed that the coding region of the gene was 1950 nucleotides and the amino acid sequence was predicted to be 650 amino acids. Bioinformatic analysis indicated that CREAP1 does not contain conserved protein functional domains (such as kinase ATP-binding domains or transcription factor DNA-binding domains) except for a proline-rich domain at amino acids 379-448 in the middle of the molecule. The DNA and amino acid sequences are shown below.

CREAP1的经证实的全长DNA序列：Confirmed full-length DNA sequence of CREAP1:

CCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTG

AACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATC

CAGCCTCCGGACTCTAGCCTAGGCCGCGGGACGGATAACAATTTCACACAGGAAACAGCTATGACCATCAGCCTCCGGACTCTAGCCTAGGCCGCGGGACGGATAACAATTTCACACAGGAAACAGCTATGACCAT

TAGGCCTATTTAGGTGACACTATAGAACAAGTTTGTACAAAAAAGCAGGCTGGTACCGGTCCGGAATTTAGGCCTATTTAGGTGACACTATAGAACAAGTTTGTACAAAAAAAGCAGGCTGGTACCGGTCCGGAATT

CCCGGGAGGAGGAGGAGGTGGCGGCGAGAAGATGGCGACTTCGAACAATCCGCGGAAATTCAGCGAGACCCGGGAGGAGGAGGAGGTGGCGGCGAGAAGATGGCGACTTCGAACAATCCGCGGAAATTCAGCGAGA

AGATCGCGCTGCACAATCAGAAGCAGGCGGAGGAGACGGCGGCCTTCGAGGAGGTCATGAAGGACCTGAGATCGCGCTGCACAATCAGAAGCAGGCGGAGGAGACGGCGGCCTTCGAGGAGGTCATGAAGGACCTG

AGCCTGACGCGGGCCGCGCGGCTCCAGCTCCAGAAATCCCAGTACCTGCAACTGGGCCCCAGCCGAGGAGCCTGACGCGGGCCGCGCGGCTCCAGCTCCAGAAATCCCAGTACCTGCAACTGGGCCCCAGCCGAGG

CCAGTACTATGGCGGGTCCCTGCCCAACGTGAACCAGATCGGGAGTGGCACCATGGACCTGCCCTTCCCCAGTACTATGGCGGGTCCCTGCCCAACGTGAACCAGATCGGGAGTGGCACCATGGACCTGCCCTTCC

AGCCCAGCGGATTTCTGGGGGAGGCCCTGGCAGCGGCTCCTGTCTCTCTGACCCCCTTCCAATCCTCGAGCCCAGCGGATTTCTGGGGGAGGCCCTGGCAGCGGCTCCTGTCTCCTGACCCCCTTCCAATCCTCG

GGCCTGGACACCAGCCGGACCACCCGGCACCATGGGCTGGTGGACAGGGTGTACCGGGAGCGTGGCCGGGCCTGGACACCAGCCGGACCACCCGGCACCATGGGCTGGTGGACAGGGTGTACCGGGAGCGTGGCCG

GCTCGGCTCCCCACACCGCCGGCCCCTGTCAGTGGACAAACACGGACGGCAGGCCGACAGCTGCCCCTGCTCGGCTCCCCACACCGCCGGCCCCTGTCAGTGGACAAACACGGACGGCAGGCCGACAGCTGCCCCT

ATGGCACCATGTACCTCTCACCACCCGCGGACACCAGCTGGAGAAGGACCAATTCTGACTCCGCCCTGATGGCACCATGTACCTCTCACCACCCGCGGACACCAGCTGGAGAAGGACCAATTCTGACTCCGCCCTG

CACCAGAGCACAATGACGCCCACGCAGCCAGAATCCTTTAGCAGTGGGTCCCAGGACGTGCACCAGAACACCAGAGCACAATGACGCCCACGCAGCCAGAATCTTTAGCAGTGGGTCCCAGGACGTGCACCAGAA

AAGAGTCTTACTGTTAACAGTCCCAGGAATGGAAGAGACCACATCAGAGGCAGACAAAAACCTTTCCAAAGAGTCTTACTGTTAACAGTCCCAGGAATGGAAGAGACCACATCAGAGGCAGACAAAAACCTTTCCA

AGCAAGCATGGGACACCAAGAAGACGGGGTCCAGGCCCAAGTCCTGTGAGGTCCCCGGAATCAACATCAGCAAGCATGGGACACCAAGAAGACGGGGTCCAGGCCCAAGTCCTGTGAGGTCCCCGGAATCAACATC

TTCCCGTCTGCCGACCAGGAAAACACTACAGCCCTGATCCCCGCCACCCACAACACAGGGGGGTCCCTTTCCCGTCTGCCGACCAGGAAAACACTACAGCCCTGATCCCCGCCACCCACAACACAGGGGGGTCCCT

GCCCGACCTGACCAACATCCACTTCCCCTCCCCGCTCCCGACCCCGCTGGACCCCGAGGAGCCCACCTGCCCGACCTGACCAACATCCACTTCCCCTCCCCGCTCCCGACCCCGCTGGACCCCGAGGAGCCCACCT

TCCCTGCACTGAGCAGCTCCAGCAGCACCGGCAACCTCGCGGCCAACCTGACGCACCTGGGCATCGGTTCCCTGCACTGAGCAGCTCCAGCAGCACCGGCAACCTCGCGGCCAACCTGACGCACCTGGGCATCGGT

GGCGCCGGCCAGGGAATGAGCACACCTGGCTCCTCTCCACAGCACCGCCCAGCTGGCGTCAGCCCCCTGGCGCCGGCCAGGGAATGAGCACACCTGGCTCCTCTCCACAGCACCGCCCAGCTGGCGTCAGCCCCCT

GTCCCTGAGCACAGAGGCAAGGCGTCAGCAGGCATCGCCCACCCTGTCCCCGCTGTCACCCATCACTCGTCCCTGAGCACAGAGGCAAGGCGTCACGCAGGCATCGCCCACCCTGTCCCGCTGTCACCCCATCACTC

AGGCTGTAGCCATGGACGCCCTGTCTCTGGAGCAGCAGCTGCCCTACGCCTTCTTCACCCAGGCGGGCAGGCTGTAGCCATGGACGCCCTGTCTCTGGAGCAGCAGCTGCCCTACGCCTTCTTCACCCAGGCGGGC

TCCCAGCAGCCACCGCCGCAGCCCCAGCCCCCGCCGCCTCCTCCACCCGCGTCCCAGCAGCCACCACCTCCCAGCAGCCACCGCCGCAGCCCCCAGCCCCCGCCGCCTCCTCCACCCGCGTCCCAGCAGCACCACC

CCCGCCACCCCCACAGGCGCCCGTCCGCCTGCCCCCTGGTGGCCCCCTGTTGCCCAGCGCCAGCCTGACCCGCCACCCCCACAGGCGCCCGTCCGCCTGCCCCCTGGTGGCCCCCTGTTGCCCAGCGCCAGCCTGA

CTCGTGGGCCACAGCCGCCCCCGCTTGCAGTCACGGTACCGTCCTCTCTCCCCCAGTCCCCCCCAGAGCTCGTGGGCCACAGCCGCCCCCGCTTGCAGTCACGGTACCGTCCTCTCTCCCCCAGTCCCCCCCAGAG

AACCCTGGCCAGCCATCGATGGGGATCGACATCGCCTCGGCGCCGGCTCTGCAGCAGTACCGCACTAGAACCCTGGCCAGCCATCGATGGGGATCGACATCGCCTCGGCGCCGGCTCTGCAGCAGTACCGCACTAG

CGCCGGCTCCCCGGCCAACCAGTCTCCCACCTCGCCAGTCTCCAATCAAGGCTTCTCCCCAGGGAGCTCGCCGGCTCCCCGGCCAACCAGTCTCCCCACCTCGCCAGTCTCCAATCAAGGCTTCTCCCCAGGGAGCT

CCCCGCAACACACTTCCACCCTGGGCAGCGTGTTTGGGGACGCGTACTATGAGCAGCAGATGGCGGCCCCCCGCAACACACTTCCACCCTGGGCAGCGTGTTTGGGGACGCGTACTATGAGCAGCAGATGGCGGCC

AGGCAGGCCAATGCTCTGTCCCACCAGCTGGAGCAGTTCAACATGATGGAGAACGCCATCAGCTCCAGAGGCAGGCCAATGCTCTGTCCCACCAGCTGGAGCAGTTCAACATGATGGAGAACGCCATCAGCTCCAG

CAGCCTGTACAGCCCGGGCTCCACACTCAACTACTCGCAGGCGGCCATGATGGGCCTCACGGGCAGCCCAGCCTGTACAGCCCGGGCTCCACACTCAACTACTCGCAGGCGGCCATGATGGGCCTCACGGGCAGCC

ACGGGAGCCTGCCGGACTCGCAGCAACTGGGATACGCCAGCCACAGTGGCATCCCCAACATCATCCTCACGGGAGCCTGCCGGACTCGCAGCAACTGGGATACGCCAGCCACAGTGGCATCCCCAACATCATCCTC

ACAGTGACAGGAGAGTCCCCCCCCAGCCTCTCTAAAGAACTGACCAGCTCTCTGGCCGGGGTCGGCGAACAGTGACAGGAGAGTCCCCCCCCCAGCCTCTCTAAAGAACTGACCAGCTCTCTGGCCGGGGTCGGCGA

CGTCAGCTTCGACTCCGACAGCCAGTTTCCCCTGGACGAACTCAAGATCGACCCCCTGACCCTCGACGCGTCAGCTTCGACTCCGACAGCCAGTTTCCCCTGGACGAACTCAAGATCGACCCCCTGACCCTCGACG

GACTGCACATGCTCAACGACCCCGACATGGTTCTGGCCGACCCAGCCACCGAGGACACCTTCCGGATGGACTGCACATGCTCAACGACCCCGACATGGTTCTGGCCGACCCAGCCACCGAGGACACCTTCCGGATG

GACCGCCTGTGAGCGGGCACGCCGGCACCCTGCCGCTCAGCCGTCCCGACGGCGCCTCCCCAGCCCGGGACCGCCTGTGAGCGGGCACGCCGGCACCCTGCCGCTCAGCCGTCCCGACGGCGCCTCCCCAGCCCGG

GGACGGCCGTGCTCCGTCCCTCGCCAACGGCCGAGCTTGTGATTCTGAGCTTGCAATGCCGCCAAGCGGGACGGCCGTGCTCCGTCCCTCGCCAACGGCCGAGCTTGTGATTCTGAGCTTGCAATGCCGCCAAGCG

CCCCCCGCCAGCCCGCCCCCGGTTGTCCACCTCCCGCGAAGCCCAATCGCGAGGCCGCGAGCCGGGCCCCCCCCGCCAGCCCGCCCCCGGTTGTCCACCTCCCGCGAAGCCCAATCGCGAGGCCGCGAGCCGGGCC

GTCCACCCACCCGCCCGCCCAGGGCTGGGCTGGGATCGGAGGCCGTGAGCCTCCCGCCCCTGCAGACCGTCCACCCACCCGCCCGCCCAGGGCTGGGCTGGGATCGGAGGCCGTGAGCCTCCCGCCCCTGCAGACC

CTCCCTGCACTGGCTCCCTCGCCCCCAGCCCCGGGGCCTGAGCCGTCCCCTGTAAGATGCGGGAAGTGCTCCCTGCACTGGCTCCCTCGCCCCCAGCCCCGGGGCCTGAGCCGTCCCCTGTAAGATGCGGGAAGTG

TCAGCTCCCGGCGTGGCGGGCAGGCTCAGGGGAGGGGCGCGCATGGTCCGCCAGGGCTGTGGGCCGTGTCAGCTCCCGGCGTGGCGGGCAGGCTCAGGGGAGGGGCGCGCATGGTCCGCCAGGGCTGTGGGCCGTG

GCGCATTTTCCGACTGTTTGTCCAGCTCTCACTGCCTTCCTTGGTTCCCGGTCCCCCAGCCCATCCGCGCGCATTTTCCGACTGTTTGTCCAGCTCTCACTGCCTTCCTTGGTTCCCGGTCCCCCCAGCCCATCCGCGC

CATCCCCAGCCCGTGGTCAGGTAGAGAGTGAGCCCCACGCCGCCCCAGGGAGGAGGCGCCAGAGCGCGCATCCCCAGCCCGTGGTCAGGTAGAGAGTGAGCCCCACGCCGCCCCAGGGAGGAGGCGCCAGAGCGCG

GGGCAGACGCAAAGTGAAATAAACACTATTTTGACGGCAAAAAAAAAAAAAAAGGGCGGCCGCTCTAGGGGCAGACGCAAAGTGAAATAAACACTATTTTGACGGCAAAAAAAAAAAAAAAGGGCGGCCGCTCTAG

AGTATCCCTCGAGGGGCCCAAG (SEQ ID NO 1)AGTATCCCTCGAGGGGCCCAAG (SEQ ID NO 1)

预测的CREAP1氨基酸序列(650个氨基酸)：Predicted amino acid sequence of CREAP1 (650 amino acids):

MATSNNPRKFSEKIALHNQKQAEETAAFEEVMKDLSLTRAARLQLQKSQYLQLGPSRGQYYGGSLPNVMATSNNPRKFSEKIALHNQKQAEETAAFEEVMKDLSLTRAARLQLQKSQYLQLGPSRGQYYGGSLPNV

NQIGSGTMDLPFQPSGFLGEALAAAPVSLTPFQSSGLDTSRTTRHHGLVDRVYRERGRLGSPHRRPLSNQIGSGTMDLPFQPSGGLGEALAAAPVSLTPFQSSGLDTSRTTRHHGLVDRVYRERGRLGSPHRRPLS

VDKHGRQADSCPYGTMYLSPPADTSWRRTNSDSALHQSTMTPTQPESFSSGSQDVHQKRVLLLTVPGMVDKHGRQADSCPYGTMYLSPPADTSWRRTNSDSALHQSTMTPTQPESFSSGSQDVHQKRVLLLTVPGM

EETTSEADKNLSKQAWDTKKTGSRPKSCEVPGINIFPSADQENTTALIPATHNTGGSLPDLTNIHFPSEETTSEADKNLSKQAWDTKKTGSRPKSCEVPGINIFPSADQENTTALIPATHNTGGSLPDLTNIHFPS

PLPTPLDPEEPTFPALSSSSSTGNLAANLTHLGIGGAGQGMSTPGSSPQHRPAGVSPLSLSTEARRQQPLPTPLDPEEPTFPALSSSSSTGNLAANLTHLGIGGAGQGMSTPGSSPQHRPAGVSPLSLSTEARRQQ

ASPTLSPLSPITQAVAMDALSLEQQLPYAFFTQAGSQQPPPQPQPPPPPPPASQQPPPPPPPQAPVRLASPTLSPLSPITQAVAMDALSLEQQLPYAFFTQAGSQQPPPQPQPPPPPPPASQQPPPPPPPPQAPVRL

PPGGPLLPSASLTRGPQPPPLAVTVPSSLPQSPPENPGQPSMGIDIASAPALQQYRTSAGSPANQSPTPPGGPLLPSASLTRGPQPPPLAVTVPSSLPQSPPENPGQPSMGIDIASAPALQQYRTSAGSPANQSPT

SPVSNQGFSPGSSPQHTSTLGSVFGDAYYEQQMAARQANALSHQLEQFNMMENAISSSSLYSPGSTLNSPVSNQGFSPGSSPQHTSTLGSVFGDAYYEQQMAARQANALSHQLEQFNMMENAISSSSLYSPGSTLN

YSQAAMMGLTGSHGSLPDSQQLGYASHSGIPNIILTVTGESPPSLSKELTSSLAGVGDVSFDSDSQFPYSQAAMMGLTGSHGSLPDSQQLGYASHSGIPNIILTVTGESPPSLSKELTSSLAGVGDVSFDSDSQFP

LDELKIDPLTLDGLHMLNDPDMVLADPATEDTFRMDRL (SEQ ID NO 2)LDELKIDPLTLDGLHMLNDPDMVLADPATEDTFRMDRL (SEQ ID NO 2)

实施例3Example 3

RNA印迹和CREAP1蛋白质的体外翻译. Northern blot and in vitro translation of CREAP1 protein.

为了研究CREAP1基因在不同人类组织中的表达，我们使用随机标记的CREAP1探针进行了RNA印迹分析。根据RNA印迹法分析，观察到了两种mRNA，2.4Kb和7Kb。2.4Kb条带与编码区大小一致。7.0Kb条带反映了mRNA的备选剪接形式。虽然在大多数人组织中表达，CREAP1 mRNA在脑、心脏、骨骼肌和肾中是丰富的(数据未显示)。To investigate the expression of the CREAP1 gene in different human tissues, we performed northern blot analysis using randomly labeled CREAP1 probes. According to Northern blot analysis, two mRNAs, 2.4Kb and 7Kb, were observed. The 2.4Kb band is consistent with the size of the coding region. The 7.0 Kb band reflects an alternative spliced form of the mRNA. Although expressed in most human tissues, CREAP1 mRNA is abundant in brain, heart, skeletal muscle and kidney (data not shown).

为了测试CREAP1的预测氨基酸序列的精确性，我们使用pCMVSPORT-CREAP1作为模板并且进行了体外转录和翻译反应。之后将体外翻译产物在SDS-PAGE中分辨，可观察到约80Kd的单一CREAP1蛋白质条带，这与它包含650个氨基酸的观点一致(数据未显示)。To test the accuracy of the predicted amino acid sequence of CREAP1, we used pCMVSPORT-CREAP1 as a template and performed in vitro transcription and translation reactions. Afterwards, the in vitro translation products were resolved in SDS-PAGE, and a single CREAP1 protein band of about 80Kd was observed, which is consistent with the view that it contains 650 amino acids (data not shown).

实施例4Example 4

CREAP1通过CREB起作用 CREAP1 acts through CREB

由于CREAP1强烈激活CRE启动子转录，我们接下来研究了CREAP1是否通过CREB途径起作用。为了解决这个问题，使用由CREB或ATF2转录因子的反式激活结构域和GAL4DNA结合结构域(氨基酸1-147)组成的融合构建体和用稳定整合PathDetect反式-报告子质粒(Stratagene)的HLR细胞系进行了体内激酶测定。在该系统中，只有那些激活CREB或ATF2的反式激活结构域的上游调节子(可能是激酶)能够驱动萤光素酶报道基因的表达。结果表明CREAP1强烈刺激GAL4启动子上的CREB-GAL4融合分子的反式激活并且其活性甚至强于PKA催化亚单位，即磷酸化CREB的规范激酶的活性。有趣地是CREAP1不能够激活ATF2-GAL4融合分子而MEKK(ATF2途径的上游激酶，Stratagene试剂盒手册)能够刺激ATF2融合100倍以上。该结果证明CREAP1是CREB途径特异上游激活剂。Since CREAP1 strongly activates CRE promoter transcription, we next investigated whether CREAP1 acts through the CREB pathway. To address this issue, fusion constructs consisting of the transactivation domain of CREB or ATF2 transcription factors and the GAL4 DNA-binding domain (amino acids 1–147) were used with HLRs stably integrated with the PathDetect trans-reporter plasmid (Stratagene). Cell lines were subjected to in vivo kinase assays. In this system, only those upstream regulators (presumably kinases) that activate the transactivation domain of CREB or ATF2 are able to drive the expression of the luciferase reporter gene. The results indicate that CREAP1 strongly stimulates the transactivation of the CREB-GAL4 fusion molecule on the GAL4 promoter and its activity is even stronger than that of the PKA catalytic subunit, the canonical kinase that phosphorylates CREB. Interestingly CREAP1 was unable to activate ATF2-GAL4 fusion molecules whereas MEKK (upstream kinase of the ATF2 pathway, Stratagene kit manual) was able to stimulate ATF2 fusion more than 100-fold. This result demonstrates that CREAP1 is a specific upstream activator of the CREB pathway.

为了进一步证实该观察，两种CREB显性失活构建体(非可磷酸化的S133A突变体或者DNA结合结构域突变体K287L(Clontech))用于共转染测定。实验数据表明或者CREB S133A突变体或者K287L突变体能够完全除去CREAP1对CRE启动子的激活，表明CREAP1特异地作用于CREB信号转导途径的上游并且CREB的磷酸化和DNA结合活性都是CREAP1发信号所需的。To further confirm this observation, two CREB dominant negative constructs (non-phosphorylatable S133A mutant or DNA binding domain mutant K287L (Clontech)) were used in co-transfection assays. Experimental data showed that either the CREB S133A mutant or the K287L mutant could completely abolish the activation of the CRE promoter by CREAP1, indicating that CREAP1 specifically acts upstream of the CREB signal transduction pathway and that the phosphorylation and DNA binding activity of CREB are both signaled by CREAP1. needed.

实施例5Example 5

CREAP1蛋白质分子内结构域的功能分析 Functional Analysis of the Intramolecular Domain of CREAP1 Protein

为了研究CREAP1分子内的功能结构域，通过使用基于PCR的策略将CREAP1蛋白质氨基酸片段1-170、1-356、1-494、1-580和170-650亚克隆入pFlag-CMV4载体并且如上所述在Dual Glo萤光素酶测定中测试了功能。结果表明包含CREAP1的氨基酸1-170的氨基末端片段对于其功能是重要的，因为缺乏该氨基末端的K5(aa 170-650)缺乏几乎全部刺激活性。然而，单独的1-170片段(K1)不足以实现其功能。另一方面，CREAP1的C末端对于其功能不是必要的，因为K4缺失体(缺失氨基酸581-650)保留了几乎全部的野生型活性。K2和K4活性的比较表明氨基酸356-580(具有富含脯氨酸的结构域)对于CREAP1功能是非常重要的，因为从K4除去该部分(这导致产生K2)使CREAP1功能活性降低到1/10(见下表1)。片段 CREAP1的氨基酸活性 SEQ ID NO# K1K2K3K4K5 1-170，1-356，1-494，1-580170-650 无活性无活性部分活性全部活性无活性 3233343536 To study the functional domains within the CREAP1 molecule, the CREAP1 protein amino acid fragments 1-170, 1-356, 1-494, 1-580, and 170-650 were subcloned into the pFlag-CMV4 vector by using a PCR-based strategy and as described above. Functionality was tested in the Dual Glo luciferase assay as described. The results suggest that an amino-terminal fragment comprising amino acids 1-170 of CREAP1 is important for its function, as K5 (aa 170-650) lacking this amino-terminus lacks almost all stimulatory activity. However, the 1-170 fragment (K1) alone is not sufficient for its function. On the other hand, the C-terminus of CREAP1 is not essential for its function, since the K4 deletion (deletion of amino acids 581-650) retained almost all wild-type activity. A comparison of the activities of K2 and K4 showed that amino acids 356-580 (with a proline-rich domain) are very important for CREAP1 function, since removal of this portion from K4 (which results in the production of K2) reduced CREAP1 functional activity to 1/ 10 (see Table 1 below). fragment Amino acids of CREAP1 active SEQ ID NO# K1K2K3K4K5 1-170, 1-356, 1-494, 1-580170-650 Inactive Inactive Partially active All active Inactive 3233343536

表1.CREAP片段的功能。Table 1. Functions of CREAP fragments.

以下材料和方法用于进行下面实施例6-9中列出的实验：The following materials and methods were used to carry out the experiments outlined in Examples 6-9 below:

DNA构建体DNA construct

使用常规方法(Roebuck，J.Interferon and Cytokine Res.19：429-438(1999))室内构建的pGL-2-IL-8_P-Luc包含有含有IL-8启动子的1.5kb序列所驱动的萤火虫萤光素酶基因。通过连接用Hind III/XhoI消化pGL-2-IL-8_P-Luc切割的1.5kb人IL-8启动子DNA并插入到Hind III/XhoI消化的pGL3Basic(Promega)中构建了pGL3B-IL-8_P-Luc。The pGL-2-IL-8 _P -Luc constructed in house using conventional methods (Roebuck, J. Interferon and Cytokine Res. 19: 429-438 (1999)) contains the 1.5 kb sequence containing the IL-8 promoter driven Firefly luciferase gene. pGL3B-IL-8 was constructed by ligation of 1.5 kb human IL-8 promoter DNA digested with Hind III/XhoI pGL-2-IL-8 _P -Luc and inserted into Hind III/XhoI digested pGL3Basic (Promega) _P -Luc.

通过将人IL-8基因-1491到+43区域插入pGL3Basic载体(Promega，Inc)构建了pIL-8Luc报告子。PCR产生野生型最小IL-8启动子和点突变。Wu等(Wu等，J.Biol.Chem.，272：2396-2403(1997))描述了AP-1、C/EBP、NF-κB中的突变。假定的CRE-样位点的序列TGACATAA突变为TCGATCAA。通过将PCR扩增序列连接入pTAL-Luc(BD Biosciences)制备了携带有CRE样效应元件(pCREL-Luc)的6联体拷贝或发现于人PEPCK和CAPL启动子(pCREL2-Luc)的CRE-样元件TGACACAA 5份拷贝的启动子构建体。所有的技术使用常规方法进行。The pIL-8Luc reporter was constructed by inserting the human IL-8 gene -1491 to +43 region into the pGL3Basic vector (Promega, Inc). PCR generated wild-type minimal IL-8 promoter and point mutations. Wu et al. (Wu et al., J. Biol. Chem., 272:2396-2403 (1997)) describe mutations in AP-1, C/EBP, NF-κΒ. The sequence TGACATAA of the putative CRE-like site was mutated to TCGATCAA. Hexaplex copies carrying a CRE-like response element (pCREL-Luc) or the CRE- A promoter construct with 5 copies of the TGACACAA-like element. All techniques were performed using conventional methods.

IL-8启动子缺失和点突变变体的构建Construction of IL-8 promoter deletion and point mutation variants

聚合酶链式反应(PCR)用于产生IL-8启动子变体。PCR扩增循环为：94℃2分钟，5×[94℃15秒，55℃30秒和72℃15秒]和20×[94℃15秒，65℃30秒和72℃15秒]。Advantage 2 DNA聚合酶(BD Biosciences)用于全部扩增步骤。使用共同反义引物P2.1扩增所有变体，该引物具有核苷酸序列(5’-GCCC AAGCTTTGTGCTCTGCTGTCTCTGAAAG-3’)(SEQID NO 3)，其对应人IL-8基因序列+13-+43(Roebuck’J.Interferon andCytokine Res.19：429-438(1999)。BamHI限制性酶切位点(下划线)包括于全部有义引物中。将PCR产物进行凝胶纯化并使用Zero Blunt TOPOPCR克隆试剂盒(Invitrogen)连接。用来自pCR-Blunt II-TOPO的HindIII/BamH I切割下序列证实的克隆并连接入Hind III/BamH I消化的pGL3Basic。携带缺乏AP-1位点的截短的最小IL-8启动子的pIL-8p[δAP-1]-Luc是通过用P2.1和S3(5’-GCCCTGAGGGGATGGGCCATCAG-3’)(SEQ ID NO 4)扩增产生的，P2.1和S3是用于产生对应人IL-8基因序列-114-+43的157nt产物的引物。Polymerase chain reaction (PCR) was used to generate IL-8 promoter variants. PCR amplification cycles were: 94°C for 2 minutes, 5×[94°C for 15 seconds, 55°C for 30 seconds and 72°C for 15 seconds] and 20×[94°C for 15 seconds, 65°C for 30 seconds and 72°C for 15 seconds]. Advantage 2 DNA polymerase (BD Biosciences) was used for all amplification steps. All variants were amplified using a common antisense primer P2.1 having the nucleotide sequence (5'-GCCC AAGCTT TGTGCTCTGCTGTCTCTGAAAG-3') (SEQ ID NO 3), which corresponds to the human IL-8 gene sequence +13-+ 43 (Roebuck' J. Interferon and Cytokine Res. 19:429-438 (1999). The BamHI restriction site (underlined) was included in all sense primers. The PCR product was gel purified and cloned using Zero Blunt TOPOPCR Kit (Invitrogen) ligation. Sequence confirmed clones were cut with HindIII/BamH I from pCR-Blunt II-TOPO and ligated into Hind III/BamH I digested pGL3Basic. Carrying a truncated minimal pIL-8p[δAP-1]-Luc of the IL-8 promoter was generated by amplification with P2.1 and S3 (5'-GCCCTGAGGGGATGGGCCATCAG-3') (SEQ ID NO 4), which were Primers used to generate a 157nt product corresponding to human IL-8 gene sequence -114-+43.

分别使用wtAP-1(5’-CGC GGATCCGAAGTGTGAT GACTCAGGTTTGCCCTG-3’)(SEQID NO 5)和mAP-1(5’-CGC GGATCCGAAGTGTGAT ATCTCAGGTTTGCCCTG-3’)(SEQID NO 6)和P2.1引物扩增携带野生型或突变AP-1位点的最小IL-8启动子。加下划线的是AP-1位点内突变的核苷酸。两种187nt产物都对应人IL-8基因序列-144-+43。野生型和AP-1突变体分别命名为pIL-8p[wtAP-1]-Luc和pIL-8p[mutAP-1]-Luc。Use wtAP-1 (5'-CGC GGATCC GAAGTGTGAT GA CTCAGGTTTGCCCTG-3') (SEQ ID NO 5) and mAP-1 (5'-CGC GGATCC GAAGTGTGAT AT CTCAGGTTTGCCCTG-3') (SEQ ID NO 6) and P2.1 primers respectively Amplification of the minimal IL-8 promoter carrying wild-type or mutant AP-1 sites. Underlined are mutated nucleotides within the AP-1 site. Both 187nt products correspond to human IL-8 gene sequence -144-+43. The wild-type and AP-1 mutants were named pIL-8p[wtAP-1]-Luc and pIL-8p[mutAP-1]-Luc, respectively.

在2步PCR步骤中制备携带有突变Oct-1/C/EBP和NF-κB位点的IL-8最小启动子变体。在第一步PCR中使用SP3_NF-κBmut(5’-GCCCTGAGGGGATGGGCCATCAGTTGCAAATCGT TAACTTTCCTCTGACATAAT-3’)(SEQ ID NO 7)或SP3_Oct-1mut(5’-GCCCTGAGGGGATGGGCCATCAG CT AC GA GTCGTGGAAT-3’)(SEQ ID NO 8)、有义引物和P2.1反义引物，扩增得到分别携带有突变的NF-κB和Oct-1/C/EBP结合位点的157nt产物。加下划线的是NF-κB和Oct-1位点内突变的核苷酸。在第二步PCR中AP-1_Bam有义引物(5’-CGC GGATCCGAAGTGTGATGACTCAGGTTTGCCCTGAGGGGATGGGC-3’)(SEQ ID NO 9)和P2.1反义引物用于再扩增第一步PCR反应的两种产物(每个反应物100fmol)以产生对应人IL-8基因序列-144-+43的187nt cDNA。NF-κB和Oct-1/C/EBP结合位点突变体分别命名为pIL-8p[mutNF-κB]-Luc和pIL-8p[mutOct-1]-Luc。IL-8 minimal promoter variants carrying mutated Oct-1/C/EBP and NF-κB sites were prepared in a 2-step PCR step. In the first step PCR, SP3_NF-κBmut (5'-GCCCTGAGGGGATGGGCCATCAGTTGCAAATCGT TAAC TTTCCTCTGACATAAT-3') (SEQ ID NO 7) or SP3_Oct-1mut (5'-GCCCTGAGGGGATGGGCCATCAG C T A C G A G TCGTGGAAT-3') (SEQ ID NO 8), sense primers and P2.1 antisense primers were amplified to obtain 157nt products carrying mutated NF-κB and Oct-1/C/EBP binding sites, respectively. Underlined are the mutated nucleotides within the NF-κB and Oct-1 sites. In the second step PCR, AP-1_Bam sense primer (5'-CGC GGATCC GAAGTGTGATGACTCAGGTTTGCCCTGAGGGGATGGGC-3') (SEQ ID NO 9) and P2.1 antisense primer are used to reamplify the two products of the first step PCR reaction (100 fmol per reaction) to generate a 187nt cDNA corresponding to human IL-8 gene sequence -144-+43. NF-κB and Oct-1/C/EBP binding site mutants were named pIL-8p[mutNF-κB]-Luc and pIL-8p[mutOct-1]-Luc, respectively.

在三步PCR步骤中制备携带有突变CRE-样效应元件的IL-8最小启动子变体。第一步PCR中，CREmut有义引物(5’-CAGTTGCAAATCGTGGAATTTCCTCT CGATCAATGAAAAGATG-3’)(SEQ ID NO 10)和P2.1反义引物用于产生137nt产物。加下划线的是CRE-样位点内突变的核苷酸。第二步PCR中，SP3_Oct-1wt有义引物(5’-GCCCTGAGGGGATGGGCCATCAGTTGCAAATCGTGGAAT-3’)(SEQ ID NO 11)和P2.1反义引物用于再扩增第一步PCR反应的产物(每个反应物100fmol)产生对应人IL-8基因序列-114-+43的157nt产物。最后，在第三步PCR中，使用AP-1_Bam有义引物和P2.1反义引物并且第二步PCR反应产物作为模板(每个反应物100fmol)，使IL-8最小启动子变体5’末端延伸至人IL-8基因-144核苷酸位置处。所得到的用于本研究的构建体命名为pIL-8p[mutCRE_like]-Luc。通过PCR使用CRElike_S(5’-CGCCTGGTACCGAGCTCTG-3’)(SEQ ID NO 12)有义引物和CRElike_AS(5’-ACCCAAGATCTCGAGCCCG-3’)(SEQ ID NO 13)反义引物和模板寡核苷酸(5’-CGCCTGGTACCGAGCTC TGACATAATGACATAATGACATAAT GACATAATGACATAATGACATAATTACGCGTGCTAGCCCGGGCTCGAGATCTTGGGT-3’(SEQ ID NO 14)，(每个反应物100fmol)扩增制备携带IL-8启动子多联体CRE-样效应元件的启动子构建体。加下划线的是CRE-样效应元件(TGACATAA)的六联体拷贝。PCR扩增参数如上所述。将99个核苷酸的PCR产物用KpnI和BglII切割，凝胶纯化并连接入KpnI/BglII消化的pTAL载体(BD Biosciences)产生pTAL-6×[CRE_like]报告子。IL-8 minimal promoter variants carrying mutated CRE-like response elements were generated in a three-step PCR step. In the first step of PCR, CREmut sense primer (5'-CAGTTGCAAATCGTGGAATTTCCCTCT CGATC AATGAAAAGATG-3') (SEQ ID NO 10) and P2.1 antisense primer were used to generate a 137nt product. Underlined are mutated nucleotides within the CRE-like site. In the second step PCR, SP3_Oct-1wt sense primer (5'-GCCCTGAGGGGATGGGCCATCAGTTGCAAATCGTGGAAT-3') (SEQ ID NO 11) and P2.1 antisense primer were used to reamplify the product of the first step PCR reaction (each reaction 100 fmol) produced a 157nt product corresponding to the human IL-8 gene sequence -114-+43. Finally, in the third step PCR, the IL-8 minimal promoter variant 5 The ' end extends to the -144 nucleotide position of the human IL-8 gene. The resulting construct used in this study was named pIL-8p[mutCRE_like]-Luc. By PCR using CRElike_S (5'-CGCCTGGTACCGAGCTCTG-3') (SEQ ID NO 12) sense primer and CRelike_AS (5'-ACCCAAGATCTCGAGCCCG-3') (SEQ ID NO 13) antisense primer and template oligonucleotide (5 '-CGCCTGGTACCGAGCTC TGACATAATGACATAATGACATAAT GACATAATGACATAATGACATAA TTACGCGTGCTAGCCCGGGCTCGAGATCTTGGGT-3' (SEQ ID NO 14), (each reaction 100fmol) was amplified to prepare the promoter construct carrying the IL-8 promoter concatemer CRE-like response element. Underlined is Hexamer copy of the CRE-like response element (TGACATAA). PCR amplification parameters were as described above. The 99 nucleotide PCR product was cut with KpnI and BglII, gel purified and ligated into KpnI/BglII digested pTAL vector (BD Biosciences) pTAL-6 x [CRE_like] reporter was generated.

用于高通量筛选的DNA制备DNA preparation for high-throughput screening

将上述阵列化克隆进行复制产生用于存档的多拷贝。一个拷贝用于使用QIAGEN BioRobot 8000(Qiagen，Valencia，CA)产生小量制备的DNA。简而言之，对于每个384-孔板，2μl甘油原种用于接种含有100μl LB液体培养基(Gibco BRL)-8％甘油的Greiner 384-深孔板。然后用带气孔的纸(Qiagen)覆盖Greiner板，用莎伦包装膜包裹并在37℃无摇动孵育～22小时。随后，将5μl培养物从384-孔Greiner板转移至每孔中含有1ml Terrific液体培养基(KD Medical)(+100μg/ml氨苄青霉素)的Qiagen 96-孔深板中。用带气孔的纸覆盖4个Qiagen板并在37℃培养箱中以250转/分钟振荡培养～22小时。以4000转/分钟离心15分钟沉淀细菌细胞，倒出上清液，并使用Qiagen BioRobot 8000处理板以产生DNA制备物。所使用的方案基于制造商的方案‘QIAprep Turbo96 PB(1-4板)’，仅有的修改为使用96-孔UV-透明板(Corning)作为洗脱板。通过在SPECTRAmax 190(Molecular Devices)上测量OD260值确定DNA样品的浓度和产量。然后将所得到的20,702个DNA样品等分试样以产生存档的多拷贝(TE缓冲液中80pg/孔)。对于使用2,368cDNA克隆集合的测定，以每孔6μl在96-孔PCR板(ABGene，Rochester，NY)中产生DNA等分试样，浓度为OPTI-MEM I细胞培养基(Gibco BRL，Carlsbad，CA)中20ng DNA/μl。在384-孔PCR板中以每孔4μl，以OPTI-MEM中7.5ng质粒/μl的浓度产生用于筛选20,702个cDNA收集样品的DNA等分试样。用铝箔密封板并保存于-20℃。The arrayed clones described above were replicated to generate multiple copies for archiving. One copy was used to generate miniprep DNA using a QIAGEN BioRobot 8000 (Qiagen, Valencia, CA). Briefly, for each 384-well plate, a 2 μl glycerol stock was used to inoculate Greiner 384-deep well plates containing 100 μl LB broth (Gibco BRL)-8% glycerol. Greiner plates were then covered with air vented paper (Qiagen), wrapped in saran wrap and incubated at 37°C without shaking for -22 hours. Subsequently, 5 μl of the culture was transferred from a 384-well Greiner plate into a Qiagen 96-well deep plate containing 1 ml Terrific broth (KD Medical) (+ 100 μg/ml ampicillin) per well. Four Qiagen plates were covered with air-perforated paper and incubated for -22 hours in a 37°C incubator with shaking at 250 rpm. Bacterial cells were pelleted by centrifugation at 4000 rpm for 15 min, the supernatant was decanted, and the plate was processed using a Qiagen BioRobot 8000 to generate a DNA preparation. The protocol used was based on the manufacturer's protocol 'QIAprep Turbo96 PB (1-4 plates)' with the only modification to use 96-well UV-transparent plates (Corning) as elution plates. Concentration and yield of DNA samples were determined by measuring OD260 values on a SPECTRAmax 190 (Molecular Devices). The resulting 20,702 DNA samples were then aliquoted to generate archived multiple copies (80 pg/well in TE buffer). For assays using pools of 2,368 cDNA clones, DNA aliquots were generated in 96-well PCR plates (ABGene, Rochester, NY) at 6 μl per well at a concentration of OPTI-MEM I cell culture medium (Gibco BRL, Carlsbad, CA 20ng DNA/μl in ). DNA aliquots for screening of 20,702 cDNA pooled samples were generated at a concentration of 7.5 ng plasmid/μl in OPTI-MEM at 4 μl per well in a 384-well PCR plate. Seal the plate with aluminum foil and store at -20°C.

细胞培养cell culture

将胰酶消化的HeLa细胞(ATCC，Manassas，VA)以10⁵细胞/ml重悬于完全生长培养基(DMEM，Invitrogen)，其包含10％胎牛血清(GIBCOBRL Carlsbad，CA Cat#10082-147)和Dulbecco’s改良Eagle培养基中(D-MEM)(GIBCO BRL Carlsbad，CA Cat.#10317-022)的1×抗菌素-抗霉菌素试剂(GIBCO BRL Carlsbad，CA Cat#15240-062)，并使用Multidrop384(ThermoLabsystems)以每孔75μl分布于24个白色96-孔板(Corning，Acton，MA)中用于2,368个cDNA克隆收集样品筛选，或者以每孔30分布于51个白色384-孔板(Costar)用于20,702个cDNA克隆集合筛选。在37℃并含5％CO₂的组织培养箱中将细胞培养过夜。Trypsinized HeLa cells (ATCC, Manassas, VA) were resuspended at ¹⁰⁵ cells/ml in complete growth medium (DMEM, Invitrogen) containing 10% fetal bovine serum (GIBCOBRL Carlsbad, CA Cat# 10082-147 ) and 1× antibiotic-antimycotic reagent (GIBCO BRL Carlsbad, CA Cat# 15240-062) in Dulbecco's Modified Eagle Medium (D-MEM) (GIBCO BRL Carlsbad, CA Cat. #10317-022), and use Multidrop384 (ThermoLabsystems) was distributed in 24 white 96-well plates (Corning, Acton, MA) at 75 μl per well for 2,368 cDNA clone collection sample screening, or at 30 per well in 51 white 384-well plates ( Costar) was used for pool screening of 20,702 cDNA clones. Cells were grown overnight in a tissue culture incubator at 37 °C with 5% _CO2 .

高通量转染步骤High Throughput Transfection Procedure

对于2,368个cDNA克隆收集样品筛选，将330μg pGL3B-IL-8_P-Luc报告质粒重悬于50ml锥形管中的33ml OptiMEM I低血清培养基中，报告质粒最终量为每次转染100ng。将管振荡并分成4×8ml等分试样。转染之前，将0.8ml Fugene 6转染试剂(Roche Applied Bioscience)加入每个8ml等分试样中(4μl Fugene 6/μg转染DNA)。通过吸量管上下吹吸几次混合内容物并以75μl/孔分配于96-孔清洁PCR板(ABGene)中。使用BiomekFX吸量装置(pipetting station)(Beckman Coulter，Fullerton，CA)在包含6μl预稀释cDNA的每个子板的每个孔中加入10μl[OptiMEM-报告子-Fugene 6]混合物。板#24的最后一排用于pCMV-Sport6空载体等分试样作为阴性对照或者用于编码对应人MEKK1 AA360-672的序列的pFC-MEKK表达构建体(Stratagene)作为阳性对照。两种质粒都预稀释为20ng/μl并且等分为每孔6μl。室温孵育15分钟后，将13μl最终混合物转移至96-孔HeLa培养板中。将细胞在含5％CO₂的空气中37℃孵育48小时。For 2,368 cDNA clone collection sample screening, resuspend 330 μg pGL3B-IL-8 _P -Luc reporter plasmid in 33 ml OptiMEM I low serum medium in a 50 ml conical tube, the final amount of reporter plasmid is 100 ng per transfection. The tube was shaken and divided into 4 x 8 ml aliquots. Prior to transfection, 0.8 ml Fugene 6 transfection reagent (Roche Applied Bioscience) was added to each 8 ml aliquot (4 μl Fugene 6/μg transfection DNA). The mixed contents were pipetted up and down several times and dispensed in 96-well clean PCR plates (ABGene) at 75 μl/well. 10 μl of the [OptiMEM-reporter-Fugene 6] mixture was added to each well of each subplate containing 6 μl of pre-diluted cDNA using a BiomekFX pipetting station (Beckman Coulter, Fullerton, CA). The last row of plate #24 was used for pCMV-Sport6 empty vector aliquots as a negative control or for the pFC-MEKK expression construct (Stratagene) encoding the sequence corresponding to human MEKK1 AA360-672 as a positive control. Both plasmids were pre-diluted to 20 ng/μl and aliquoted to 6 μl per well. After incubation for 15 minutes at room temperature, 13 μl of the final mixture was transferred to a 96-well HeLa plate. Cells were incubated for 48 h at 37 °C in an atmosphere containing 5% _CO2 .

对于20,702个cDNA克隆集合的筛选，将1.65mg pGL3B-IL-8_P-Luc报告质粒重悬于250ml锥形烧瓶(Corning)中的100ml OptiMEM I中，报告质粒的最终量为每次转染50ng。将烧瓶振荡并分为8ml的等份试样。转染之前，将0.65ml Fugene 6转染试剂加入每个8ml等份试样中(3μlFugene/μg转染DNA)。通过吸量管上下吹吸几次混合内容物并以75μl/孔分配于96-孔清洁PCR板(ABGene)中。使用BiomekFX(Beckman)在包含4μl预稀释cDNA的每个384-孔子板的每个孔中加入3μl[OptiMEM-报告子-Fugene 6]混合物。室温孵育15分钟后，将7μl混合物从每个孔转移至384-孔组织培养板中。将细胞在含5％CO₂的空气中37℃孵育48小时。For screening of a pool of 20,702 cDNA clones, resuspend 1.65mg of pGL3B-IL-8P- _Luc reporter plasmid in 100ml OptiMEM I in a 250ml Erlenmeyer flask (Corning), the final amount of reporter plasmid is 50ng per transfection . The flask was shaken and divided into 8ml aliquots. Prior to transfection, 0.65 ml Fugene 6 transfection reagent was added to each 8 ml aliquot (3 μl Fugene/μg transfected DNA). The mixed contents were pipetted up and down several times and dispensed in 96-well clean PCR plates (ABGene) at 75 μl/well. 3 μl of the [OptiMEM-reporter-Fugene 6] mixture was added to each well of each 384-well plate containing 4 μl of pre-diluted cDNA using a BiomekFX (Beckman). After 15 minutes of incubation at room temperature, 7 μl of the mixture was transferred from each well to a 384-well tissue culture plate. Cells were incubated for 48 h at 37 °C in an atmosphere containing 5% _CO2 .

萤光素酶测定Luciferase assay

按照制造商所提供的方案使用BrightGlo萤光素酶测定系统(Promega，Madison，WI)测定转染48小时后萤火虫萤光素酶活性。简而言之，使用Multidrop 384(Thermo Labystems，Beverly，MA)分别将90μl或40μl新鲜重构的萤光素酶试剂加入到96-孔或384-孔组织培养板的每孔中。孵育2分钟后，按照制造商说明书使用400毫秒积分时间在LUMINOSKANAscent发光计(Thermo Labsystems)上读取发光值。Firefly luciferase activity was measured 48 hours after transfection using the BrightGlo Luciferase Assay System (Promega, Madison, WI) according to the protocol provided by the manufacturer. Briefly, 90 μl or 40 μl of freshly reconstituted luciferase reagent was added to each well of a 96-well or 384-well tissue culture plate, respectively, using a Multidrop 384 (Thermo Labystems, Beverly, MA). After 2 minutes of incubation, luminescence was read on a LUMINOSKANA Ascent luminometer (Thermo Labsystems) using a 400 msec integration time according to the manufacturer's instructions.

克隆检索用于采样点证实Clone retrieval for sample point verification

对于每次初级测定，按照常规方法计算针对群体中位数的Z得分和激活倍数并保存于带注释的可检索数据库中。基于两条标准选择了潜在采样点：(1)Z得分大于3.0和(2)激活倍数在2,368和20,702个cDNA克隆集合筛选中分别大于10和5。从甘油原种(重排阵列板的拷贝1)检索2,368个克隆亚阵列的初级测定中作为采样点的克隆得分。通过再转化库中的DNA等份试样回收全部20,702个克隆集合的初级测定的采样点。在XL-10 Gold细菌(Stratagene)中进行转化。将每个克隆在LB培养基琼脂糖平板+抗生素(100μg/ml氨苄青霉素)(KD Medical，Columbia，MD)上划线培养，37℃生长过夜并从每个板挑取3个菌落，生长于深孔96孔板中，每个孔含有995μl Terrific液体培养基(KD Medical)+100μg/ml氨苄青霉素。使用带气孔的胶带覆盖这些深孔板，并在37℃孵育过夜，以300转/分钟转速摇动。如上所述进行DNA小量制备。然后将全部DNA制备物稀释至125ng/μl(在浓度大于125ng/μl的孔中)并取8μl用于DNA序列确证。将剩余DNA稀释至25ng/μl并将6μl DNA转移至子96-孔PCR板(ABGene)中并用于使用上述转染步骤进行验证实验。为了标准化转染效率，每次转染中包括了20ng编码SV40早期启动子控制下的海肾萤光素酶基因的pRL-SV40(Promega)。使用DualGlo萤光素酶测定系统(Promega)按照制造商所提供的方法测定萤火虫和海肾萤光素酶的活性。简而言之，使用Multidrop 384将90μl新鲜重构的萤光素酶试剂加入到96-孔组织培养板每孔中并且，在孵育15分钟后，在LUMINOSKAN Ascent发光计上使用400毫秒积分时间读取发光值。随后将90μl Stop-and-Glo试剂加入到96-孔组织培养板每个孔中并且，在孵育15分钟后，LUMINOSKAN Ascent发光计上使用200毫秒积分时间读取发光值。使用基于不同萤光素酶的启动子构建体：pCRE-Luc、p MCS-Luc(Stratagene)、pTAL-Luc(BD Biosciences)、pNF-kB-Luc(BD Biosciences)、pIL-8_P-Luc、pRhoBP-Luc(按照常规方法室内制备/BD Biosciences)和pVCAM _P-Lue(如Iademarcom，M.F.，J.J.McQuillan，G.D.Rosen和D.C.Dean.1992.J Biol Chem 267：16323-9中所述制备)测试所选克隆的特异性。For each primary assay, Z-scores and fold activations against the population median were routinely calculated and stored in an annotated searchable database. Potential sampling sites were selected based on two criteria: (1) Z-score greater than 3.0 and (2) fold activation greater than 10 and 5 in pooled screens of 2,368 and 20,702 cDNA clones, respectively. Clonal scores as sampling points in a primary assay of 2,368 clonal subarrays retrieved from glycerol stocks (copy 1 of the rearranged array plate). Sampling points for the primary assay of the entire 20,702 clonal pool were recovered by retransforming DNA aliquots from the pool. Transformation was performed in XL-10 Gold bacteria (Stratagene). Streak culture each clone on LB medium agarose plate + antibiotic (100 μ g/ml ampicillin) (KD Medical, Columbia, MD), grow overnight at 37 ° C and pick 3 colonies from each plate, grow in In a deep-well 96-well plate, each well contained 995 μl Terrific liquid medium (KD Medical) + 100 μg/ml ampicillin. Cover these deep-well plates with air-perforated tape and incubate overnight at 37 °C, shaking at 300 rpm. DNA minipreps were performed as described above. The total DNA preparation was then diluted to 125 ng/μl (in wells with a concentration greater than 125 ng/μl) and 8 μl was taken for DNA sequence confirmation. The remaining DNA was diluted to 25 ng/μl and 6 μl of the DNA was transferred to a sub 96-well PCR plate (ABGene) and used for a validation experiment using the transfection procedure described above. To normalize transfection efficiency, 20 ng of pRL-SV40 (Promega) encoding the Renilla luciferase gene under the control of the SV40 early promoter was included in each transfection. Firefly and Renilla luciferase activities were assayed using the DualGlo Luciferase Assay System (Promega) following the manufacturer's protocol. Briefly, 90 μl of freshly reconstituted luciferase reagent was added to each well of a 96-well tissue culture plate using a Multidrop 384 and, after 15 minutes of incubation, read on a LUMINOSKAN Ascent luminometer using a 400 ms integration time. Get the glow value. Then 90 μl of Stop-and-Glo reagent was added to each well of the 96-well tissue culture plate and, after 15 minutes of incubation, luminescence was read on a LUMINOSKAN Ascent luminometer using a 200 millisecond integration time. Different luciferase-based promoter constructs were used: pCRE-Luc, pMCS-Luc (Stratagene), pTAL-Luc (BD Biosciences), pNF-kB-Luc (BD Biosciences), pIL- _8P -Luc, pRhoBP-Luc (prepared in-house according to conventional methods/BD Biosciences) and pVCAM _P -Lue (prepared as described in Iademarcom, MF, JJMcQuillan, GDRosen and DCDean. 1992. J Biol Chem 267: 16323-9) tested the selected clones for specificity.

HeLa细胞中IL-8酶联免疫吸附测定IL-8 ELISA in HeLa cells

使用上述方案用选自验证并确证采样点的序列的DNA样本以每孔100ng在96孔板(Costar)中转染HeLa细胞。命名为共激活子的DNA样品以25ng每孔进行共转染。空载体pCMV-Sport6用作阴性对照。转染72小时后，使用IL-8酶联免疫吸附测定试剂盒(Sigma)按照所提供的方案测定对应1-5μl条件生长培养基的预稀释等份细胞生长培养基中IL-8含量。作为阳性对照，生长培养基选自使用空载体转染的细胞并且在收集用于IL-8测定的生长培养基之前分别用5ng/mlIL-1和50ng/ml TNFα(R&DSystems)处理16小时。HeLa cells were transfected at 100 ng per well in 96-well plates (Costar) using the protocol described above with DNA samples selected from sequences from validated and confirmed sampling sites. DNA samples designated as coactivators were co-transfected at 25 ng per well. The empty vector pCMV-Sport6 was used as a negative control. Seventy-two hours after transfection, IL-8 content in pre-diluted aliquots of cell growth medium corresponding to 1-5 μl of conditioned growth medium was assayed using an IL-8 ELISA kit (Sigma) following the protocol provided. As a positive control, growth medium was selected from cells transfected with empty vector and treated with 5 ng/ml IL-1 and 50 ng/ml TNFα (R&D Systems) respectively for 16 hours before harvesting the growth medium for IL-8 assay.

使用Affymetrix DNA微阵列芯片的基因表达谱Gene Expression Profiling Using Affymetrix DNA Microarrays

使用Targefect F1转染试剂(Targeting Systems，Santee，CA)按照随产品提供的方案用文中所述CREAP1或含有relA(Ruben SM等，Science1991 Mar 22；251(5000)：1490-3)、MAP3K11(Hartkamp，J.等，(1999).Cancer Res.59，2195-2202)或ANKRD3(Muto，A.等，(2002)J.Biol.Chem.277，31871-31876)的表达构建体转染HeLa细胞。简而言之，70-80％汇合于T75组织培养瓶中(Falcon)的HeLa细胞用于转染。转染混合物如下制备：将20μg所选质粒DNA加入含有8ml Opti-MEM I的50ml锥形管(Falcon)并轻弹该管混合。用pCMV-Sport6空载体提供两种转染。以全速涡旋Targefect F-1贮存液20秒并取40μl加入每个管，再次通过轻弹管混合并在室温孵育30分钟以允许形成转染复合体。用20ml Opti-MEM I培养基洗涤HeLa细胞两次并在每个1T75培养瓶中加入12ml每种转染复合体。37℃孵育4小时后，将8ml含血清的生长培养基加入每个培养瓶。第二天更换培养基。转染56后小时再次更换培养基并且向一个用pCMV-Sport6质粒转染的培养瓶加入50ng/ml的TNFα(R&D Systems)并在37℃继续孵育16小时。转染后72小时将细胞收集于10ml TRIzol试剂(Gibco BRL)中并冻存于-80℃。按照随TRIzol试剂提供的方法分离总RNA。按照常规方法进行双链cDNA探针的合成标记、Affymetrix基因芯片(Gene-Chip)杂交和数据分析(也见Eberwine，J.等，J.Neurosci.21，8310-8314和Hakak，Y.等，(2001)Proc.Natl.Acad.Sci.U.S.A 98，4746-4751)。Use Targeting F1 transfection reagent (Targeting Systems, Santee, CA) according to the protocol provided with the product with CREAP1 as described in the text or containing relA (Ruben SM et al., Science1991 Mar 22; 251 (5000): 1490-3), MAP3K11 (Hartkamp , J. et al, (1999).Cancer Res.59,2195-2202) or ANKRD3 (Muto, A. et al, (2002) J.Biol.Chem.277,31871-31876) expression construct transfection HeLa cell . Briefly, HeLa cells that were 70-80% confluent in T75 tissue culture flasks (Falcon) were used for transfection. The transfection mix was prepared by adding 20 μg of selected plasmid DNA to a 50 ml conical tube (Falcon) containing 8 ml Opti-MEM I and flicking the tube to mix. Both transfections were provided with pCMV-Sport6 empty vector. Vortex the Targetfect F-1 stock solution at full speed for 20 seconds and add 40 μl to each tube, mix again by flicking the tube and incubate at room temperature for 30 minutes to allow transfection complexes to form. Wash HeLa cells twice with 20 ml Opti-MEM I medium and add 12 ml of each transfection complex to each 1T75 flask. After 4 hours of incubation at 37°C, 8 ml of serum-containing growth medium was added to each flask. The medium was changed the next day. The medium was changed again 56 hours after transfection and 50 ng/ml of TNFα (R&D Systems) was added to a flask transfected with the pCMV-Sport6 plasmid and incubation was continued for 16 hours at 37°C. Cells were harvested 72 hours after transfection in 10 ml TRIzol reagent (Gibco BRL) and frozen at -80°C. Total RNA was isolated according to the method provided with TRIzol reagent. Synthetic labeling of double-stranded cDNA probes, Affymetrix gene chip (Gene-Chip) hybridization and data analysis were carried out according to conventional methods (see also Eberwine, J. et al., J.Neurosci.21, 8310-8314 and Hakak, Y. et al., (2001) Proc. Natl. Acad. Sci. U.S.A 98, 4746-4751).

实施例6Example 6

IL8P萤光素酶载体的表征 Characterization of the IL8P luciferase vector

通过细胞因子介导的基因表达的已知调节子测试了由包含人IL-8启动子片段的1.5kB IL-8启动子控制的萤光素酶报告基因的可诱导性。将pNF-κB-Luc(BD Biosciences)和pGL2-IL-8_P-Luc报告基因共转染入含有表达构建体的HEK 293细胞，该表达构建体编码按照常规方法使用专有的克隆集合制备的NF-κB通路激活子——截短的MEKK(AA 360-672)(Stratagene)和全长TRAF6 cDNA。用空pCMV-Sport6载体共转染的细胞或者不处理或者用TNFα(50ng/ml，处理16小时)处理。转染后48小时测定萤光素酶活性。pNF-κB-Luc报告子用作阳性对照。The inducibility of a luciferase reporter gene controlled by a 1.5 kB IL-8 promoter containing a human IL-8 promoter fragment was tested by known regulators of cytokine-mediated gene expression. pNF-κB-Luc (BD Biosciences) and pGL2-IL-8P- _Luc reporter genes were co-transfected into HEK 293 cells containing an expression construct encoding β-Luc prepared according to routine methods using a proprietary clone pool. NF-κB pathway activator - truncated MEKK (AA 360-672) (Stratagene) and full-length TRAF6 cDNA. Cells co-transfected with empty pCMV-Sport6 vector were either left untreated or treated with TNFα (50 ng/ml, treated for 16 hours). Luciferase activity was measured 48 hours after transfection. The pNF-κB-Luc reporter was used as a positive control.

数据表明MEKK、TRAF6和TNFα显著激活IL-8启动子-报告子，分别增加报告子基因活性16、4.9和4.7倍。对于我们专有的cDNA克隆集合的高通量功能筛选，将IL-8启动子序列亚克隆入pGL3Basic载体(Promega)，该载体是最初pGL2载体的衍生物，其具有提高的特异性和效率。The data indicated that MEKK, TRAF6 and TNFα significantly activated the IL-8 promoter-reporter, increasing the activity of the reporter gene by 16, 4.9 and 4.7 times, respectively. For high-throughput functional screening of our proprietary collection of cDNA clones, the IL-8 promoter sequence was subcloned into the pGL3Basic vector (Promega), a derivative of the original pGL2 vector with improved specificity and efficiency.

实施例7Example 7

对20,000 cDNA集合进行基于IL-8启动子的功能筛选和采样点活性的验20,000 cDNA collections were screened based on IL-8 promoter function and assayed for sampling site activity. 证certificate

在384-孔板中将pGL3B-IL-8_P-Luc与20,702个单独全长cDNA克隆共转染入HeLa细胞并且如上所述转染后48小时进行了单一报告子测定。将pCMV-Sport6与报告子共转染作为阴性对照。使用BrightGlo报告子测定系统(Promega)测定了萤光素酶活性。确定了IL-8启动子报告子活性的绝对值并且鉴定出分值比pCMV-Sport6质粒对照高出5-倍的克隆(数据未显示)。为了验证这些采样点的身份和活性，如上所述回收克隆，并分离出3个独立的菌落。小量制备DNA用于序列验证和使用IL-8_P-Luc报告子进行二次测定(数据未显示)。pGL3B-IL-8β- _Luc was co-transfected with 20,702 individual full-length cDNA clones into HeLa cells in 384-well plates and a single reporter assay was performed 48 hours after transfection as described above. Co-transfection of pCMV-Sport6 with the reporter served as a negative control. Luciferase activity was measured using the BrightGlo reporter assay system (Promega). The absolute value of IL-8 promoter reporter activity was determined and clones with a score 5-fold higher than the pCMV-Sport6 plasmid control were identified (data not shown). To verify the identity and activity of these sampling points, clones were recovered as described above and 3 independent colonies were isolated. Minipreps of DNA were used for sequence verification and secondary assays using the IL- _8β -Luc reporter (data not shown).

对在二次测定中对IL-8_P-Luc报告子产生显著激活的各个克隆分离物测试了其激活7种启动子萤光素酶报告子构建体的能力：pTAL、NF-kB-Luc、IL-8_P-Luc、RhoB_P-Luc、VCAM _P-Luc(BD Biosciences)(与加入4个CRE效应元件的pTAL相同)。Individual clonal isolates that produced significant activation of the IL-8β- _Luc reporter in a secondary assay were tested for their ability to activate seven promoter luciferase reporter constructs: pTAL, NF-kB-Luc, IL- _8P -Luc, _RhoBP -Luc, _VCAMP -Luc (BD Biosciences) (same as pTAL with addition of 4 CRE response elements).

基于预测或表征基因起始密码子的存在从cDNA 12,905个克隆的单一5’末端序列选择了cDNA克隆，其中cDNA 12,905个克隆是与RefSeq基因相匹配的，其中5,463个基因是有功能注释的。将20,704个cDNA与IL-8启动子控制的萤火虫萤光素酶报告基因(pIL-8-Luc)共转染。64个cDNA诱导报告子大于5倍。所验证的活性cDNA包括28个独特基因的1-3个拷贝。选择了22种非冗余cDNA用于进一步研究。在测定中还对全部集合筛选环AMP效应元件(CRE)或血清效应元件(SRE)驱动的报告子的激活。使用层级聚类(hierarchical clustering)(Eisen)将在初级筛选中使用22种cDNA获得的结果进行归类以确定是否有基因在三个测定中都表现出相关活性。许多基因对于IL-8报告子相对特异。这些基因包括NF-κB的已知的诱导子并且由NF-κB转录因子的亚单位relA(p65)、TNF受体超家族成员1A、TNF相关分子TWEAK/TNFSF12、RIPK2和TRAF6、最新鉴定的NF-κB激活子ACT1和激酶PKK代表。第二组代表AP-1转录因子位点激活子，包括JunD和JNK-诱导的MAP激酶MAP3K12和MAP3K11的多个克隆。还鉴定出-C/EBPβ，已知其直接结合至IL-8启动子NF-IL6位点。因此，初级筛选鉴定出预测通过许多不同途径激活IL-8基因的多种诱导物。cDNA clones were selected based on the presence of predicted or characterized gene start codons from the single 5′ end sequences of 12,905 clones that were matched to RefSeq genes, 5,463 of which were functionally annotated. 20,704 cDNAs were co-transfected with a firefly luciferase reporter gene (pIL-8-Luc) controlled by the IL-8 promoter. 64 cDNA induced reporters greater than 5-fold. Validated active cDNA included 1-3 copies of 28 unique genes. Twenty-two non-redundant cDNAs were selected for further study. The entire pool was also screened in the assay for activation of cyclic AMP response element (CRE) or serum response element (SRE) driven reporters. The results obtained using the 22 cDNAs in the primary screen were clustered using hierarchical clustering (Eisen) to determine if any genes exhibited relevant activity in all three assays. Many genes are relatively specific for the IL-8 reporter. These genes include known inducers of NF-κB and are composed of subunit relA (p65) of NF-κB transcription factor, TNF receptor superfamily member 1A, TNF-related molecules TWEAK/TNFSF12, RIPK2 and TRAF6, newly identified NF-κB - kappa B activator ACT1 and kinase PKK representative. The second group represents AP-1 transcription factor site activators, including multiple clones of the JunD and JNK-induced MAP kinases MAP3K12 and MAP3K11. Also identified was -C/EBPβ, which is known to bind directly to the IL-8 promoter NF-IL6 site. Thus, the primary screen identified multiple inducers predicted to activate the IL-8 gene through many different pathways.

CREAP1属于所获得的采样点。因此数据表明CREAP1是CRE-Luc和IL-8_P-Luc构建体的强激活子。事实上，该种迄今未知功能的蛋白质不仅显示为是CRE的最强的激活子(甚至强于两种CRE结合反式激活蛋白CRE-BRa和CREB1(数据未显示)并且证实结果公开于上面提供的实施例中)而且还是这些次级测定中发现的IL-8基因的最强激活子。CREAP1 belongs to the obtained sampling points. The data thus indicate that CREAP1 is a strong activator of the CRE-Luc and IL- _8β -Luc constructs. In fact, this protein of hitherto unknown function was not only shown to be the strongest activator of CRE (even stronger than the two CRE-binding transactivators CRE-BRa and CREB1 (data not shown) and confirming the results disclosed in the above provided in the Examples) and also the strongest activator of the IL-8 gene found in these secondary assays.

实施例8Example 8

CREAP1强烈激活携带串联IL-8启动子特异CRE-样元件的报道子CREAP1 strongly activates a reporter carrying a tandem IL-8 promoter-specific CRE-like element

为了确定是否所述强激活子也诱导内源IL-8基因，用作为NF-κB和AP-1激活子的实例的relA和MAP3K11构建体转染后测定了从HeLa细胞所分泌的IL-8蛋白质的积累。MAP3K11和relA诱导小的增加，但是两者的组合诱导的分泌IL-8的水平比得上用IL-1β——一种已知的IL-8最有效的诱导物——所观察到的结果。该数据表明内源性IL-8基因的调节需要多种信号转导途径的相互作用。To determine whether the strong activator also induces the endogenous IL-8 gene, IL-8 secretion from HeLa cells was measured after transfection with the relA and MAP3K11 constructs as examples of NF-κB and AP-1 activators. accumulation of protein. MAP3K11 and relA induced small increases, but the combination of both induced levels of secreted IL-8 comparable to that observed with IL-1β, the most potent inducer of IL-8 known . This data suggests that the regulation of the endogenous IL-8 gene requires the interaction of multiple signal transduction pathways.

鉴定了几种作用机制仍然不清楚的cDNA。这些包括两种Rho-依赖GTP-GDP增强因子(Rho-GEFs)、p114和ARHGEF1、C16orf15、促甲状腺素细胞胚胎因子1(TEF1)、纤连蛋白(FN1)和核受体家族成员NR2F2。C16orf15编码未知功能的富含脯氨酸蛋白质，其在脑中高水平表达。TEF1是直接通过TEF效应元件起作用的碱性亮氨酸拉链转录因子的成员。FN1是在损伤组织中高水平表达的基质糖蛋白并且它还能够通过AP-1-依赖机制诱导IL-1β。NR2F2在全部测定中都是非常强的激活子并且因此其活性表现为非特异的。Several cDNAs were identified whose mechanism of action remains unclear. These include two Rho-dependent GTP-GDP enhancers (Rho-GEFs), p114 and ARHGEF1, C16orf15, thyrotropin embryonic factor 1 (TEF1), fibronectin (FN1), and the nuclear receptor family member NR2F2. C16orf15 encodes a proline-rich protein of unknown function that is expressed at high levels in the brain. TEF1 is a member of the basic leucine zipper transcription factors that act directly through TEF response elements. FN1 is a matrix glycoprotein expressed at high levels in injured tissues and it is also capable of inducing IL-1β through an AP-1-dependent mechanism. NR2F2 was a very strong activator in all assays and therefore its activity appeared to be non-specific.

几种最强的IL-8激活子是与CRE-依赖性基因表达相关的。C/EBPβ、JunD、c-jun、CRE结合蛋白质CREB1、CRE-BPa和XBP1发现是CRE-驱动报告子的有效诱导物。还将与收录为KIAA0616和MECT1的序列重叠的cDNA鉴定为上述的CREAP1基因。有意思地是，除了在粘液表皮样癌中编码MECT1前44个氨基酸的序列易位至Mastermind-样基因MAML2(Tonon等，Nat.Genet.，33：208-213(2003))外，关于该蛋白质一无所知。Several of the strongest IL-8 activators were associated with CRE-dependent gene expression. C/EBPβ, JunD, c-jun, CRE-binding proteins CREB1, CRE-BPa and XBP1 were found to be potent inducers of CRE-driven reporters. A cDNA overlapping with the sequences recorded as KIAA0616 and MECT1 was also identified as the above-mentioned CREAP1 gene. Interestingly, except that the sequence encoding the first 44 amino acids of MECT1 is translocated to the Mastermind-like gene MAML2 in mucoepidermoid carcinoma (Tonon et al., Nat. Genet., 33:208-213 (2003)), regarding this protein Know nothing.

许多最强的IL-8激活子也是CRE激活子或结合蛋白质的这一观察暗示IL-8启动子可能含有未识别的CRE。首先通过使用植物双萜forskolin(Sigma)即腺苷酸环化酶的一种非特异激活剂检测提高的cAMP水平对IL-8启动子的效应对所述暗示进行了测试。简而言之，使用Fugene6转染试剂(Roche)如上所述用pCRE-Luc或pIL-8-Luc与空载体或CRE-BPa表达构建体共转染HEK 293细胞。转染16小时后，将含有500μM IBMX的等体积生长培养基加入孔中。8小时后，向将来自在生长培养基上制备的50μM贮存液的用IBMX预处理的细胞加入forskolin使终浓度达到5μM。在37℃用forskolin处理细胞16小时。使用Dual-Glo测定试剂盒(Promega)测定萤光素酶活性并如上所述进行标准化。数据表示为与用空载体转染的未处理细胞相比的诱导倍数。结果表明forskolin微弱诱导IL-8报告子。共转染在筛选中发现的CRE结合蛋白质CRE-BPa当forskolin处理时协同地激活IL-8启动子。The observation that many of the strongest IL-8 activators are also CRE activators or bind proteins suggests that the IL-8 promoter may contain an unrecognized CRE. The implications were first tested by detecting the effect of increased cAMP levels on the IL-8 promoter using the plant diterpene forskolin (Sigma), a non-specific activator of adenylyl cyclase. Briefly, HEK 293 cells were co-transfected with pCRE-Luc or pIL-8-Luc with empty vector or CRE-BPa expression constructs using Fugene6 transfection reagent (Roche) as described above. Sixteen hours after transfection, an equal volume of growth medium containing 500 μM IBMX was added to the wells. Eight hours later, forskolin was added to a final concentration of 5 μM to cells pretreated with IBMX from a 50 μM stock prepared in growth medium. Cells were treated with forskolin for 16 hours at 37°C. Luciferase activity was determined using the Dual-Glo assay kit (Promega) and normalized as described above. Data are expressed as fold induction compared to untreated cells transfected with empty vector. The results indicated that forskolin weakly induced the IL-8 reporter. Co-transfection of the CRE-binding protein found in the screen, CRE-BPa, synergistically activates the IL-8 promoter upon forskolin treatment.

然后使用标准技术，检测IL-8启动子序列中潜在CRE序列的存在。在IL-8启动子的-69和-62之间发现了具有序列5’-TGACATAA-3’的潜在不对称变体CRE，以前已经将其作为AP-1结合序列提到但其功能还没有报道(Roebuck，J.Interferon and Cytokine Res.19：429-438(1999))。我们将该位点命名为“CRE-样效应元件”。带有相同DNA序列的寡核苷酸显示被CREB2很好地结合并且被CREB1很差地结合(Benbrook和Jones，Nucleic Acids Res.，22：1463-1469(1994))。有趣地是，已提出CREB2作为转录激活子/抑制子起着双重作用。认为结合到“CRE-样效应元件”的CREB2会损害激活蛋白质如CREB的结合并因此抑制CRE-依赖的转录(Karpinski等，Proc.Natl.Acad.Sci.U.S.A.89：4820-4824(1992))。另一方面，CREB2与其它转录因子如c-Rel、ATF-1或者病毒蛋白质Tax结合能够激活这些情况中的几种基因的转录(Schoch等，Neurochem.Int.38：601-608(2001))。The IL-8 promoter sequence is then tested for the presence of a potential CRE sequence using standard techniques. A potential asymmetric variant CRE with the sequence 5'-TGACATAA-3' was found between -69 and -62 of the IL-8 promoter, which has been previously mentioned as an AP-1 binding sequence but its function has not been Reported (Roebuck, J. Interferon and Cytokine Res. 19:429-438 (1999)). We named this locus "CRE-like response element". Oligonucleotides with identical DNA sequences were shown to be bound well by CREB2 and poorly by CREB1 (Benbrook and Jones, Nucleic Acids Res., 22:1463-1469 (1994)). Interestingly, CREB2 has been proposed to play a dual role as a transcriptional activator/repressor. CREB2 binding to "CRE-like response elements" is thought to impair binding of activating proteins such as CREB and thus inhibit CRE-dependent transcription (Karpinski et al., Proc. Natl. Acad. Sci. U.S.A. 89:4820-4824 (1992)) . On the other hand, the binding of CREB2 to other transcription factors such as c-Rel, ATF-1 or the viral protein Tax can activate the transcription of several genes in these cases (Schoch et al., Neurochem. Int. 38:601-608 (2001)) .

研究了通过MAP3K11和CREAP1诱导IL-8启动子的机制。为了确定是否启动子元件需要由这些基因激活，建立了一系列在IL-8CRE-样调节位点和其它调控位点中带有突变的启动子变体，并测试了MAP3K11、CREAP1或relA的诱导作用。结果表明C/EBP结合位点的突变对任一蛋白质的激活作用没有影响。NF-κB位点突变对MAP3K11或CREAP1的诱导作用具有很小的影响但消除了relA的诱导作用。AP-1位点的突变不显著改变relA的作用但严重降低了MAP3K11的诱导作用。这与MAP3K11激活JNK/SAPK途径和AP-1的能力相一致。令人惊奇地是，该突变还显著降低了CREAP1的激活作用。CRE-样位点的突变急剧降低了或消除了CREAP1和MAP3K11的诱导作用(数据未显示)。The mechanism of IL-8 promoter induction by MAP3K11 and CREAP1 was investigated. To determine whether promoter elements are required to be activated by these genes, a series of promoter variants with mutations in IL-8CRE-like regulatory sites and other regulatory sites were generated and tested for induction of MAP3K11, CREAP1 or relA effect. The results indicated that mutations in the C/EBP binding site had no effect on the activation of either protein. NF-κB site mutations had little effect on the induction of MAP3K11 or CREAP1 but abolished the induction of relA. Mutations at the AP-1 site did not significantly alter the effect of relA but severely reduced the induction of MAP3K11. This is consistent with the ability of MAP3K11 to activate the JNK/SAPK pathway and AP-1. Surprisingly, the mutation also significantly reduced the activation of CREAP1. Mutation of CRE-like sites drastically reduced or abolished induction of CREAP1 and MAP3K11 (data not shown).

为了确定是否“CRE-样元件”直接对CREAP1或MAP3K11做出反应，检测了这两种基因激活携带多联体CRE-样位点的最小启动子(pCREL-Luc)的能力。另外，我们研究了已知的AP-1诱导物PMA的作用。与CRE报告子(pCRE-Luc)相似，pCREL-Luc被CREAP1强烈激活但不被MAP3K11或PMA处理激活(数据未显示)。该数据表明尽管CREAP1和MAP3K11对于它们的活性都需要完整CRE-样和AP-1位点，它们分别使用作为其主要效应元件的CRE-样或AP-1位点通过不同机制诱导IL-8启动子。To determine whether "CRE-like elements" respond directly to CREAP1 or MAP3K11, the ability of these two genes to activate a minimal promoter carrying a concatemer CRE-like site (pCREL-Luc) was tested. Additionally, we investigated the effect of PMA, a known inducer of AP-1. Similar to the CRE reporter (pCRE-Luc), pCREL-Luc was strongly activated by CREAP1 but not by MAP3K11 or PMA treatment (data not shown). This data suggests that although CREAP1 and MAP3K11 both require intact CRE-like and AP-1 sites for their activity, they induce IL-8 priming by different mechanisms using the CRE-like or AP-1 sites, respectively, as their major effector elements son.

我们进一步测定了是否CREAP1-诱导的pIL-8-Luc报告子活性依赖于CREB。CREAP1和CREB的显性失活形式KCREB(BD Biosciences)的共表达导致CREAP1-诱导的IL-8启动子活性显著降低(数据未显示)。相反，用I-ΚBα组成型活性形式-NF-ΚB途径的有效抑制剂-共转染未影响CREAP1的活性。We further determined whether CREAP1 -induced pIL-8-Luc reporter activity is dependent on CREB. Co-expression of CREAP1 and the dominant negative form of CREB, KCREB (BD Biosciences), resulted in a significant reduction in CREAP1-induced IL-8 promoter activity (data not shown). In contrast, co-transfection with the constitutively active form of I-ΚBα - a potent inhibitor of the NF-ΚB pathway - did not affect the activity of CREAP1.

为了确定CREAP1与CRE和AP-1结合位点的相互作用是否与相同或不同结构域相关，我们使用常规方法构建了携带从N-和C-末端缺失的CREAP1的几种变体并测试了这些变体影响CREAP1或MAP3K1激活plL-8-Luc报告子的能力。包含59个N-末端氨基酸缺失的突变体(δ59)降低了野生型CREAP1并极大地抑制了MAP3K11诱导IL-8报告子表达的能力(数据未显示)。由于δ59对relA的激活作用没有影响，所以抑制是特异的。δ59还阻断了PMA或者MAP3K11对含有重复AP-1位点的AP-1特异报告子pAP1(PMA)-Luc的激活(数据未显示)。同时δ59不能阻断forskolin刺激的pCRE-Luc报告子(数据未显示)。此数据表明当CREAP1以CREB-依赖方式通过CRE激活表达时，该蛋白质可能与AP-1激活的必需成分直接或间接相互作用。To determine whether the interaction of CREAP1 with the binding sites of CRE and AP-1 is associated with the same or different domains, we constructed several variants of CREAP1 carrying deletions from the N- and C-termini using conventional methods and tested these Variants affect the ability of CREAP1 or MAP3K1 to activate the pL-8-Luc reporter. A mutant (δ59) comprising a deletion of 59 N-terminal amino acids reduced wild-type CREAP1 and greatly inhibited the ability of MAP3K11 to induce IL-8 reporter expression (data not shown). Inhibition is specific since δ59 has no effect on relA activation. δ59 also blocked the activation of the AP-1-specific reporter pAP1(PMA)-Luc containing repeated AP-1 sites by PMA or MAP3K11 (data not shown). Meanwhile, δ59 could not block forskolin-stimulated pCRE-Luc reporter (data not shown). This data suggests that when CREAP1 is expressed through CRE activation in a CREB-dependent manner, this protein may interact directly or indirectly with an essential component of AP-1 activation.

实施例9Example 9

用CREAP1瞬时转染的HeLa细胞中的基因表达谱 Gene expression profiles in HeLa cells transiently transfected with CREAP1

为了确定CREAP1是否调节真正CREB靶的表达，过表达CREAP1后使用DNA微阵列测定了细胞基因的表达。简而言之，使用Targefect F1试剂(Targeting Systems)用pCMV-Sport6、CREAP1瞬时转染HeLa细胞。将一半pCMV-Sport6转染细胞不处理并且用作阴性对照。如上所述进行总RNA分离、标记探针制备和DNA微芯片杂交方案。To determine whether CREAP1 regulates the expression of bona fide CREB targets, cellular gene expression was measured using DNA microarrays after overexpression of CREAP1. Briefly, HeLa cells were transiently transfected with pCMV-Sport6, CREAP1 using Targetfect F1 reagent (Targeting Systems). Half of the pCMV-Sport6 transfected cells were left untreated and used as a negative control. Total RNA isolation, labeled probe preparation, and DNA microchip hybridization protocols were performed as described above.

有趣地是，结果表明CREAP1转染时HeLa细胞内的基因表达谱明显不同于其它激活子，不同之处在于已知依赖于cAMP/CREB途径的基因特别丰富。具体而言，CREAP1转染诱导7种基因增加10倍(见表2)。其他基因包括CREB和cAMP的众所周知的靶标，包括TSHα、烯醇丙酮酸磷酸羧激酶(PEPCK)、晶体蛋白α-B和EGF-样分子双调蛋白。CREAP1还以较低程度激活CREM(已知cAMP水平上升时被诱导的另一种基因)。诱导c-Jun和AP-1的已知的靶标PAI-2的MAP3K11不影响该组基因(Arts等，1996，Eur.J.Biochem 241：393-402)。因此，CREAP1是真正的CREB靶基因的诱导剂。Interestingly, the results showed that the gene expression profile in HeLa cells upon CREAP1 transfection was distinct from that of other activators in that genes known to be dependent on the cAMP/CREB pathway were particularly enriched. Specifically, CREAP1 transfection induced a 10-fold increase in seven genes (see Table 2). Other genes include well-known targets of CREB and cAMP, including TSHα, enolpyruvate phosphate carboxykinase (PEPCK), crystallin α-B, and the EGF-like molecule amphiregulin. CREAP1 also activates CREM, another gene known to be induced when cAMP levels rise, to a lesser extent. Induction of c-Jun and MAP3K11 of PAI-2, a known target of AP-1, did not affect this group of genes (Arts et al., 1996, Eur. J. Biochem 241:393-402). Thus, CREAP1 is a bona fide inducer of CREB target genes.

筛选中所鉴定的每种激活子还能相对较小程度激活(2-5倍)内源性IL-8基因。与使用人工报告基因构建体所观察到的强激活作用相比内源性IL-8基因的弱激活作用可能是由于需要通过上述多种途径激活。我们还使用层级聚类算法分析了HEK293细胞中由CREAP1或蛋白激酶A(PKA)催化亚基过表达时差别调节的几套基因。我们还发现尽管两种蛋白质都通过CREB起作用，但是上调和下调的基因群不完全重叠。数据表明CREAP1可以提供激活转录的众所周知的磷酸化依赖机制的备选机制。基因 Affymetrix ID 激活倍数 IL-8 1369_s_at 2.5 KIAA0467 41458_at 12 Exodus-1 40385_at 15 CAPL蛋白质 38088_r_at 19 双调蛋白 34898_at 19 DKFZp566K192 32242_at 32 PEPCK 33702_f_at 32 TSHa 39352_at 57 Each of the activators identified in the screen also activated the endogenous IL-8 gene to a relatively small extent (2-5 fold). The weak activation of the endogenous IL-8 gene compared to the strong activation observed with the artificial reporter construct may be due to the need for activation through multiple pathways as described above. We also analyzed several sets of genes differentially regulated by CREAP1 or protein kinase A (PKA) catalytic subunit overexpression in HEK293 cells using a hierarchical clustering algorithm. We also found that although both proteins act through CREB, the groups of up- and down-regulated genes do not completely overlap. The data suggest that CREAP1 may provide an alternative mechanism to the well-known phosphorylation-dependent mechanism for activating transcription. Gene Affymetrix ID Activation multiple IL-8 1369_s_at 2.5 KIAA0467 41458_at 12 Exodus-1 40385_at 15 CAPL protein 38088_r_at 19 Amphiregulin 34898_at 19 DKFZp566K192 32242_at 32 PEPCK 33702_f_at 32 TS Ha 39352_at 57

表2：通过CREAP1诱导cAMP反应基因。显示了通过Affymetrix基因芯片所检测的由CREAP1最强烈诱导的基因mRNA水平增加的倍数。还显示了IL-8转录物诱导水平的比较。这些基因仅是由CREAP1诱导＞10倍的基因并且在重复实验中都发现了。计算了与用对照pCMV-Sport6载体转染后所观察到的表达水平相比较的增加倍数。Table 2: Induction of cAMP-responsive genes by CREAP1. The fold increase in mRNA levels of the genes most strongly induced by CREAP1 as detected by Affymetrix gene chips is shown. A comparison of IL-8 transcript induction levels is also shown. These genes were the only ones that were >10-fold induced by CREAP1 and were all found in replicate experiments. The fold increase compared to the expression level observed after transfection with the control pCMV-Sport6 vector was calculated.

由CREAP1最强烈诱导的两种基因是cAMP的已知靶标，这两种基因是烯醇丙酮酸磷酸羧激酶(PEPCK或者PCK1)(Roesler，W.J.Mol.CellEndocrinol.162：1-7(2000))和促甲状腺激素α(TSHα)的基因(Kim，D.S等Mol Endocrinol.8：528-36(1994))。据报道第三种高度调节的基因即双调蛋白在一些癌细胞系中的表达依赖于PKA(Bianco，C.G.等，Clin.Cancerres.3：439-48(1997))并且我们已经鉴定出了在小鼠和人基因中完全保守的近端双调蛋白启动子共有CRE位点(数据未显示)。The two genes most strongly induced by CREAP1 are known targets of cAMP, enolpyruvate phosphate carboxykinase (PEPCK or PCK1) (Roesler, W. J. Mol. Cell Endocrinol. 162: 1-7 (2000)) and thyroid-stimulating hormone alpha (TSHα) genes (Kim, D.S et al. Mol Endocrinol. 8:528-36 (1994)). Expression of a third highly regulated gene, amphiregulin, has been reported to be dependent on PKA in some cancer cell lines (Bianco, C.G. et al., Clin. Cancerres. 3:439-48 (1997)) and we have identified The fully conserved proximal amphiregulin promoter in the mouse and human genes shared the CRE site (data not shown).

两种由CREAP1最高度诱导的内源基因不是cAMP或CREB蛋白质的已知靶标。第一种是CAPL；第二种是趋化因子Exodus-1(也已知为CCL27、MIP-3α或LARC)。有趣地是，Exodus-1基因也是趋化因子并且以与IL-8基因非常相似的方式受到调节，因为据报导近端启动子包含NF-kB、AP-1和NF-IL6/C/EBP位点。CREAP1还以比诱导内源性IL-8基因更大程度诱导Exodus-1基因。还注意到CREAP1是比TNF-α或NF-κB更强的Exodus-1诱导物(数据未显示)。还不知道Exodus-1启动子是否包含未认识的CRE或者CREAP1是否通过所讨论的变体AP-1位点起作用。然而，CREAP1对Exodus-1表达的激活作用表明Exodus-1基因将通过cAMP或其它CREB诱导途径受到调节。The two endogenous genes most highly induced by CREAP1 were not known targets of cAMP or CREB proteins. The first is CAPL; the second is the chemokine Exodus-1 (also known as CCL27, MIP-3α or LARC). Interestingly, the Exodus-1 gene is also a chemokine and is regulated in a very similar manner to the IL-8 gene, as the proximal promoter is reported to contain NF-kB, AP-1 and NF-IL6/C/EBP loci point. CREAP1 also induces the Exodus-1 gene to a greater extent than the endogenous IL-8 gene. It was also noted that CREAP1 is a stronger inducer of Exodus-1 than TNF-α or NF-κB (data not shown). It is not known whether the Exodus-1 promoter contains an unrecognized CRE or whether CREAP1 functions through the variant AP-1 site in question. However, activation of Exodus-1 expression by CREAP1 suggests that the Exodus-1 gene will be regulated through cAMP or other CREB-induced pathways.

还没有描述CAPL、KIAA0467和DKFZp566K192基因的启动子。我们检查了CAPL启动子的潜在CREAP1-效应元件，对于该启动子没有报导明显的CRE。在PEPCK启动子(核苷酸-249到-256)和CAPL启动子(核苷酸-385到-392)中发现了一种命名为CRE-样2的具有序列5’-TGACACAA-3’的序列。将CRE-样2元件置于最小启动子的上游并测试了CREAP1的诱导作用。该元件足以介导CREAP1的诱导作用。与IL-8启动子相似，IL-8CRE-样和CRE-样2序列都受到升高的cAMP适度激活并且由cAMP和CRE-BPa协同激活。因此，能够通过cAMP途径而不通过CREB1激活CREAP1反应元件，因为IL-8启动子中发现的CRE-样元件和CAPL和PEPCK启动子中发现的CRE-样2元件不可能被CREB1识别。The promoters of the CAPL, KIAA0467 and DKFZp566K192 genes have not been described. We examined potential CREAP1-responsive elements of the CAPL promoter, for which no apparent CRE was reported. A gene named CRE-like 2 with the sequence 5'-TGACACAA-3' was found in the PEPCK promoter (nucleotides -249 to -256) and the CAPL promoter (nucleotides -385 to -392). sequence. The CRE-like 2 element was placed upstream of the minimal promoter and tested for CREAP1 induction. This element is sufficient to mediate the induction of CREAP1. Similar to the IL-8 promoter, both IL-8 CRE-like and CRE-like 2 sequences are moderately activated by elevated cAMP and are synergistically activated by cAMP and CRE-BPa. Thus, CREAP1 response elements could be activated by the cAMP pathway but not by CREB1, since the CRE-like element found in the IL-8 promoter and the CRE-like 2 elements found in the CAPL and PEPCK promoters are unlikely to be recognized by CREB1.

CREAP是吸引人的药物发现靶标。如果CREAP的功能是通过CREB与其它转录因子的相互作用调节CREB-调节基因特定亚组，那么CREAP是药物发现的吸引人的靶标是特别正确的。由于存在大量CREB反应基因，所以直接影响CREB的任意拮抗剂或激动剂将可能具有多种效应。另一方面，CREAP功能的调节子可能具有阻断基因特定亚组的能力，例如阻断趋化因子例如IL-8和Exodus-1以治疗自身免疫病和炎性疾病、阻断双调蛋白以治疗增殖疾病和阻断PEPCK用于治疗糖尿病，因为全部这些基因都是由CREAP1高度诱导的。CREAPs are attractive targets for drug discovery. It is especially true that CREAP is an attractive target for drug discovery if the function of CREAP is to regulate a specific subset of CREB-regulated genes through the interaction of CREB with other transcription factors. Due to the large number of CREB responsive genes, any antagonist or agonist that directly affects CREB will likely have multiple effects. On the other hand, modulators of CREAP function may have the ability to block specific subsets of genes, such as blocking chemokines such as IL-8 and Exodus-1 for the treatment of autoimmune and inflammatory diseases, blocking amphiregulin for Treatment of proliferative diseases and blockade of PEPCK for the treatment of diabetes since all of these genes are highly induced by CREAP1.

实施例10Example 10

CREAP2的鉴定 Identification of CREAP2

将CREAP1的完整氨基酸序列用于公共NCBI数据库的BLASTP检索。最初鉴定了与CREAP1编码区具有显著同源性的两个公共结构域cDNA(XM_117201和FLJ00364)。将XM_117201的核苷酸序列用于专有的cDNA文库EST数据库的BLASTN检索(Altschul S.F.等，NucleicAcids Res.25：3389-3402(1997))并且鉴定了代表XM_117201公共序列的4种克隆。使用与上述那些公开的方法相似的方法用最初发现由CREAP诱导的CRE-Luc和IL-8p-Luc报告子共转染后对4种克隆进行了功能测试。简而言之，将胰酶消化的HeLa细胞以6×10⁴细胞/ml重悬于完全生长培养基并且以每孔100μl的体积分布于白色96孔板(Costar)。将细胞在37℃并含5％CO₂的组织培养箱中培养过夜。将pGL3B-IL-8_P-Luc报告子质粒或CRE-Luc报告子(BD Biosciences)以及所测试的cDNA以25ng/ml重悬于OptiMEM I低血清培养基(GIBCO BRL)。然后分别以4μl/孔和3μl/孔将报告子质粒和cDNA分布于96孔清洁PCR板(ABGene)。以每孔10μl体积将包含每次转染1.5μl的Fugene 6试剂(Roche AppliedBioscience)和每次转染20ng的pRL-SV40(Promega)质粒的混合物加入含有预稀释cDNA的96孔PCR板。用移液器混合每孔的内容物并在室温放置10分钟。从每孔中取15μl转染混合物转移至96孔组织培养板。将细胞孵育48小时。The complete amino acid sequence of CREAP1 was used in a BLASTP search of the public NCBI database. Two common domain cDNAs (XM_117201 and FLJ00364) with significant homology to the CREAP1 coding region were initially identified. The nucleotide sequence of XM_117201 was used in a BLASTN search of the proprietary cDNA library EST database (Altschul SF et al., Nucleic Acids Res. 25:3389-3402 (1997)) and four clones representing the public sequence of XM_117201 were identified. Four clones were functionally tested after co-transfection with the CRE-Luc and IL-8p-Luc reporters originally found to be induced by CREAP using methods similar to those published above. Briefly, trypsinized HeLa cells were resuspended in complete growth medium at 6×10 ⁴ cells/ml and distributed in a volume of 100 μl per well in white 96-well plates (Costar). Cells were grown overnight in a tissue culture incubator at 37 °C with 5% _CO2 . The pGL3B-IL-8β- _Luc reporter plasmid or CRE-Luc reporter (BD Biosciences) and the tested cDNA were resuspended at 25 ng/ml in OptiMEM I low serum medium (GIBCO BRL). The reporter plasmid and cDNA were then distributed in 96-well clean PCR plates (ABGene) at 4 μl/well and 3 μl/well, respectively. A mixture containing 1.5 μl of Fugene 6 reagent (Roche Applied Bioscience) per transfection and 20 ng of pRL-SV40 (Promega) plasmid per transfection was added in a volume of 10 μl per well to a 96-well PCR plate containing pre-diluted cDNA. The contents of each well were mixed by pipette and left at room temperature for 10 minutes. Transfer 15 μl of the transfection mixture from each well to a 96-well tissue culture plate. Cells were incubated for 48 hours.

使用DualGlo萤光素酶测定系统(Promega)按照制造商所提供的方案测定萤火虫和海肾萤光素酶的活性。简而言之，用Multidrop 384将115μl新鲜配制的萤光素酶试剂加入到96孔组织培养板的每一个孔中并且，孵育15分钟后在LUMINOSKAN Ascent发光计(Thermo Labsystems)上以400毫秒积分时间读取发光值。随后，将115μl Stop-and-Glo试剂加入到96孔组织培养板的每一个孔中并且，孵育15分钟后在LUMINOSKAN Ascent发光计(Thermo Labsystems)上以200毫秒积分时间读取发光值。以相应萤火虫和海肾萤光素酶活性的比率测定每种所测试的cDNA的活性。Firefly and Renilla luciferase activities were assayed using the DualGlo Luciferase Assay System (Promega) following the protocol provided by the manufacturer. Briefly, 115 μl of freshly prepared luciferase reagent was added to each well of a 96-well tissue culture plate using a Multidrop 384 and, after 15 minutes of incubation, integrated at 400 milliseconds on a LUMINOSKAN Ascent luminometer (Thermo Labsystems) Time to read the luminescence value. Subsequently, 115 μl of Stop-and-Glo reagent was added to each well of a 96-well tissue culture plate and, after 15 minutes of incubation, luminescence was read on a LUMINOSKAN Ascent luminometer (Thermo Labsystems) with a 200 millisecond integration time. The activity of each cDNA tested was determined as the ratio of the corresponding firefly and Renilla luciferase activities.

4种克隆中一种克隆显示是有活性的。该克隆的插入片段在一个方向上进行全部测序并显示编码586个氨基酸的ORF，其与XM_117201 cDNA所预测的公共结构域蛋白质XP_117201完全重叠。该克隆注释为CREAP2并编码693个氨基酸的预测蛋白质，其起始密码子在核苷酸177处且TGA编码的终止密码子在2256处。尽管对文献检索表明存在编码部分CREAP2的cDNA，但没有提供包含人CREAP2的全部序列的CDMA并且也未提供该蛋白质的功能。One of the 4 clones was shown to be active. The insert of this clone was fully sequenced in one orientation and revealed to encode an ORF of 586 amino acids that completely overlapped with the predicted common domain protein XP_117201 from the XM_117201 cDNA. This clone was annotated as CREAP2 and encoded a predicted protein of 693 amino acids with a start codon at nucleotide 177 and a TGA-encoded stop codon at 2256. Although a search of the literature indicated the existence of a cDNA encoding part of CREAP2, no CDMA containing the complete sequence of human CREAP2 was provided nor the function of the protein.

下面显示了人CREAP2的核苷酸序列。起始密码子定位于核苷酸177并且TGA编码的终止密码子定位在2256，以斜体表示：The nucleotide sequence of human CREAP2 is shown below. The start codon is located at nucleotide 177 and the TGA-encoded stop codon is located at 2256, in italics:

ANTTTTTGTACANAAAAGCAGGCTGTTACCGGTCCGGATTCCCGGGATCTAGGCTGGGGC 60ANTTTTTGTACANAAAAAGCAGGCTGTTACCGGTCCGGATTCCCGGGATCTAGGCTGGGGC 60

CGGGTTCGCGGTGCTCGCTGAGGCGGCGGTGGCTACGGCTGGAGGAGCCGGGCCGAGGCC 120CGGGTTCGCGGTGCTCGCTGAGGCGGCGGTGGCTACGGCTGGAGGAGCCGGGCCGAGGCC 120

GCGGCGGAGGCCGCGGCTGGTACTGGGAGGGTGGCAGGGAGGGACGGGGAAGGAAGATGG 180GCGGCGGAGGCCGCGGCTGGTACTGGGAGGGTGGCAGGGAGGGACGGGGAAGGAAGATGG 180

CGACGTCGGGGGCGAACGGGCCTGGTTCGGCCACGGCCTCGGCTTCCAATCCGCGCAAAT 240CGACGTCGGGGGCGAACGGGCCTGGTTCGGCCACGGCCTCGGCTTCCAATCCGCGCAAAT 240

TTAGTGAGAAGATTGCGCTGCAGAAGCAGCGTCAGGCCGAGGAGACGGCGGCCTTCGAGG 300TTAGTGAGAAGATTGCGCTGCAGAAGCAGCGTCAGGCCGAGGAGACGGCGGCCTTCGAGG 300

AGGTGATGATGGACATCGGCTCCACCCGGTTACAGGCCCAAAAACTGCGACTGGCATACA 360AGGTGATGATGGACATCGGCTCCACCCGGTTACAGGCCCAAAAACTGCGACTGGCATACA 360

CAAGGAGCTCTCATTATGGTGGGTCTCTGCCCAATGTTAACCAGATTGGCTCTGGCCTGG 420CAAGGAGCTCTCATTATGGTGGGTCTCTGCCCAATGTTAACCAGATTGGCTCTGGCCTGG 420

CCGAGTTCCAGAGCCCCCTCCACTCACCTTTGGATTCATCTCGGAGCACTCGGCACCATG 480CCGAGTTCCAGAGCCCCCTCCACTCACCTTTGGATTCATCTCGGAGCACTCGGCACCATG 480

GGCTGGTGGAACGGGTGCAGCGAGATCCTCGAAGAATGGTGTCCCCACTTCGCCGATACA 540GGCTGGTGGAACGGGTGCAGCGAGATCCTCGAAGAATGGTGTCCCCACTTCGCCGATACA 540

CCCGCCACATTGACAGCTCTCCCTATAGTCCTGCCTACTTATCTCCTCCCCCAGAGTCTA 600CCCGCCACATTGACAGCTCTCCCCTATAGTCCTGCCTACTTATCTCTCCCCCAGAGTCTA 600

GCTGGCGAAGGACGATGGCCTGGGGCAATTTCCCTGCAGAGAAGGGGCAGTTGTTTCGAC 660GCTGGCGAAGGACGATGGCCTGGGGCAATTTCCCTGCAGAGAAGGGGCAGTTGTTTCGAC 660

TACCATCTGCACTTAACAGGACAAGCTCTGACTCTGCCCTTCATACAAGTGTGATGAACC 720TACCATCTGCACTTAACAGGACAAGCTCTGACTCTGCCCTTCATACAAGTGTGATGAACC 720

CCAGTCCCCAGGATACCTACCCAGGCCCCACACCTCCCAGCATCCTGCCCAGCCGACGTG 780CCAGTCCCCAGGATACCTACCCAGGCCCCACACCTCCCAGCATCCTGCCCAGCCGACGTG 780

GGGGTATTCTGGATGGTGAAATGGACCCCAAAGTACCTGCTATTGAGGAGAACTTGCTAG 840GGGGTATTCTGGATGGTGAAATGGACCCCAAAGTACCTGCTATTGAGGAGAACTTGCTAG 840

ATGACAAGCATTTGCTGAAGCCATGGGATGCTAAGAAGCTATCCTCATCCTCTTCCCGAC 900ATGACAAGCATTTGCTGAAGCCATGGGATGCTAAGAAGCTATCCTCATCCTCTTCCCGAC 900

CTCGGTCCTGTGAAGTCCCTGGAATTAACATCTTTCCATCTCCTGACCAGCCTGCCAATG 960CTCGGTCCTGTGAAGTCCCTGGAATTAACATCTTTTCCATCTCCTGACCAGCCTGCCAATG 960

TGCCTGTCCTCCCACCTGCCATGAACACGGGGGGCTCCCTACCTGACCTCACCAACCTGC 1020TGCCTGTCCTCCCACCTGCCATGAACACGGGGGGCTCCCTACCTGACCTCACCAACCTGC 1020

ACTTTCCCCCACCACTGCCCACCCCCCTGGACCCTGAAGAGACAGCCTACCCTAGCCTGA 1080ACTTTCCCCCACCACTGCCCACCCCCCTGGACCCTGAAGAGACAGCCTACCCTAGCCTGA 1080

GTGGGGGCAACAGTACCTCCAATTTGACCCACACCATGACTCACCTGGGCATCAGCAGGG 1140GTGGGGGCAACAGTACCTCCAATTTGACCCACACCATGACTCACCTGGGCATCAGCAGGG 1140

GGCATGGGCCTGGGCCCGGCTATGATGCACCAGGACTTCATTCACCTCTCAGCCACCCAT 1200GGCATGGGCCTGGGCCCGGCTATGATGCACCAGGACTTCATTCACCTCTCAGCCACCCAT 1200

CCCTGCAGTCCTCCCTAAGCAATCCCAACCTCCAGGCTTCCCTGAGCAGTCCTCAGCCCC 1260CCCTGCAGTCCTCCCTAAGCAATCCCAACCTCCAGGCTTCCCTGAGCAGTCCTCAGCCCC 1260

AGCTTCAGGGCTCCCACAGCCACCCCTCTCTGCCTGCCTCCTCCTTGGCCTGCCATGTAC 1320AGCTTCAGGGCTCCCACAGCCACCCCTCTCTGCCTGCCTCCTCCTTGGCCTGCCATGTAC 1320

TGCCCACCACCTCCCTGGGCCACCCCTCACTCAGTGCTCCGGCTCTCTCCTCCTCCTCTT 1380TGCCCACCCACCTCCCTGGGCCACCCCACTCAGTGCTCCGGCTCTCTCCCTCCTCCTCTT 1380

CCTCCTCCTCCACTTCATCTCCTGTTTTGGGCGCCCCCTCTTACCCTGCTTCTACCCCTG 1440CCTCCTCCTCCACTTCATCTCCTGTTTTGGGCGCCCCCCTCTTACCCTGCTTCTACCCCTG 1440

GGGCCTCCCCCCACCACCGCCGTGTGCCCCTCAGCCCCCTGAGTTTGCTCGCGGGCCCAG 1500GGGCCTCCCCCCACCACCGCCGTGTGCCCCTCAGCCCCCTGAGTTTGCTCGCGGGCCCAG 1500

CCGACGCCAGAAGGTCCCAACAGCAGCTGCCCAAACAGTTTTCGCCAACAATGTCACCCA 1560CCGACGCCAGAAGGTCCCAACAGCAGCTGCCCAAACAGTTTTCGCCAACAATGTCACCCA 1560

CCTTGTCTTCCATCACTCAGGGCGTCCCCCTGGATACCAGTAAACTGTCCACTGACCAGC 1620CCTTGTCTTCCATCACTCAGGGCGTCCCCCTGGATACCAGTAAACTGTCCACTGACCAGC 1620

GGTTACCCCCCTACCCATACAGCTCCCCAAGTCTGGTTCTGCCTACCCAGCCCCACACCC 1680GGTTACCCCCCTACCCATACAGCTCCCCCAAGTCTGGTTCTGCCTACCCAGCCCCACACCC 1680

CAAAGTCTCTACAGCAGCCAGGGCTGCCCTCTCAGTCTTGTTCAGTGCAGTCCTCAGGTG 1740CAAAGTCTCTACAGCAGCCAGGGCTGCCCTCTCAGTCTTGTTCAGTGCAGTCCTCAGGTG 1740

GGCAGCCCCCAGGCAGGCAGTCTCATTATGGGACACCGTACCCACCTGGGCCCAGTGGGC 1800GGCAGCCCCCAGGCAGGCAGTCTCATTATGGGACACCGTACCCACCTGGGCCCAGTGGGC 1800

ATGGGCAACAGTCTTACCACCGGCCAATGAGTGACTTCAACCTGGGGAATCTGGAGCAGT 1860ATGGGCAACAGTCTTACCACCGGCCAATGAGTGACTTCAACCTGGGGAATCTGGAGCAGT 1860

TCAGCATGGAGAGCCCATCAGCCAGCCTGGTGCTGGATCCCCCTGGCTTTTCTGAAGGGC 1920TCAGCATGGAGAGCCCATCAGCCAGCCTGGTGCTGGATCCCCCTGGCTTTTCTGAAGGGC 1920

CTGGATTTTTAGGGGGTGAGGGGCCAATGGGTGGCCCCCAGGATCCCCACACCTTCAACC 1980CTGGATTTTTAGGGGGTGAGGGGCCAATGGGTGGCCCCCAGGATCCCCACACCTTCAACC 1980

ACCAGAACTTGACCCACTGTTCCCGCCATGGCTCAGGGCCTAACATCATCCTCACAGGGG 2040ACCAGAACTTGACCCACTGTTCCCGCCATGGCTCAGGGCCTAACATCATCCTCACAGGGG 2040

ACTCCTCTCCAGGTTTCTCTAAGGAGATTGCAGCAGCCCTGGCCGGAGTGCCTGGCTTTG 2100ACTCCTCTCTCAGGTTTCTCTAAGGAGATTGCAGCAGCCCTGGCCGGAGTGCCTGGCTTTG 2100

AGGTGTCAGCAGCTGGATTGGAGCTAGGGCTTGGGCTAGAAGATGAGCTGCGCATGGAGC 2160AGGTGTCAGCAGCTGGATTGGAGCTAGGGCTTGGGCTAGAAGATGAGCTGCGCATGGAGC 2160

CACTGGGCCTGGAAGGGCTAAACATGCTGAGTGACCCCTGTGCCCTGCTGCCTGATCCTG 2220CACTGGGCCTGGAAGGGCTAAACATGCTGAGTGACCCCCTGTGCCCTGCTGCCTGATCCTG 2220

CTGTGGAGGAGTCATTCCGCAGTGACCGGCTCCAATGAGGGCACCTCATCACCATCCCTC 2280CTGTGGAGGAGTCATTCCGCAGTGACCGGCTCCAATGAGGGCACCTCATCACCATCCCTC 2280

TTCTTGGCCCCATCCCCCACCACCATTCCTTTCCTCCCTTCCCCCTGGCAGGTAGAGACT 2340TTCTTGGCCCCATCCCCCACCACCATTCCTTTCCTCCCTTCCCCCTGGCAGGTAGAGACT 2340

CTACTCTCTGTCCCCAGATCCTCTTTCTAGCATGAATGAAGGATGCCAAGAATGAGAAAA 2400CTACTCTCTGTCCCCAGATCCCTCTTTCTAGCATGAATGAAGGATGCCAAGAATGAGAAAA 2400

AGCAAGGGGTTTGTCCAGGTGGCCCCTGAANTCTGCGCAAGGGATGGGCCTGNGGGGAAC 2460AGCAAGGGGTTTGTCCAGGTGGCCCCTGAANTCTGCGCAAGGGATGGGCCTGNGGGGAAC 2460

CTCANGGNNAGGGCCCAANGGCCACTTNNAANCTTTGAACCGTCNGTCTGGNANGGTCNN 2520CTCANGGNNAGGGCCCAANGGCCACTTNNAANCTTTGAACCGTCNGTCTGGNANGGTCNN 2520

(SEQ ID NO 15)(SEQ ID NO 15)

下面显示了人CREAP2的预测氨基酸序列：The predicted amino acid sequence of human CREAP2 is shown below:

MATSGANGPGSATASASNPRKFSEKIALQKQRQAEETAAFEEVMMDIGSTRLQAQKLRLMATSGANGPGSATASASNPRKFSEKIALQKQRQAEETAAFEEVMMDIGSTRLQAQKLRL

AYTRSSHYGGSLPNVNQIGSGLAEFQSPLHSPLDSSRSTRHHGLVERVQRDPRRMVSPLAYTRSSHYGGSLPNVNQIGSGLAEFQSPLHSPLDSSRSTRHHGLVERVQRDPRRMVSPL

RRYTRHIDSSPYSPAYLSPPPESSWRRTMAWGNFPAEKGQLFRLPSALNRTSSDSALHTRRYTRHIDSSPYSPAYLSPPPESSWRRTMAWGNFPAEKGQLFRLPSALNRTSSSDSALHT

SVMNPSPQDTYPGPTPPSILPSRRGGILDGEMDPKVPAIEENLLDDKHLLKPWDAKKLSSVMNPSPQDTYPGPTPPSILPSRRGGILDGEMDPKVPAIEENLLDDKHLLKPWDAKKLS

SSSSRPRSCEVPGINIFPSPDQPANVPVLPPAMNTGGSLPDLTNLHFPPPLPTPLDPEESSSSRPRSCEVPGINIFPSPDQPANVPVLPPAMNTGGSLPDLTNLHFPPLPTPLDPEE

TAYPSLSGGNSTSNLTHTMTHLGISRGHGPGPGYDAPGLHSPLSHPSLQSSLSNPNLQATAYPSLSGGNSTSNLTTHTMTHLGISRGHGPGPGYDAPGLHSPLSHPSLQSSLSNPNLQA

SLSSPQPQLQGSHSHPSLPASSLACHVLPTTSLGHPSLSAPALSSSSSSSSTSSPVLGASLSSPQPQLQGSHHSHPSLPASSLACHVLPTTSLGHPSLSAPALSSSSSSSSTSSPVLGA

PSYPASTPGASPHHRRVPLSPLSLLAGPADARRSQQQLPKQFSPTMSPTLSSITQGVPLPSYPASTPGASPHHRRVPLSPLSLLAGPADARRSQQQLPKQFSPTMSPTLSSITQGVPL

DTSKLSTDQRLPPYPYSSPSLVLPTQPHTPKSLQQPGLPSQSCSVQSSGGQPPGRQSHYDTSKLSTDQRLPPYPYSSPSLVLPTQPHTPKSLQQPGLPSQSCSVQSSGGQPPGRQSHY

GTPYPPGPSGHGQQSYHRPMSDFNLGNLEQFSMESPSASLVLDPPGFSEGPGFLGGEGPGTPYPPGPSGHGQQSYHRPMSDFNLGNLEQFSMESPSASLVLDPPGFSEGPGFLGGEGP

MGGPQDPHTFNHQNLTHCSRHGSGPNIILTGDSSPGFSKEIAAALAGVPGFEVSAAGLEMGGPQDPHTFNHQNLTHCSRHGSGPNIILTGDSSPGFSKEIAAALAGVPGFEVSAAGLE

LGLGLEDELRMEPLGLEGLNMLSDPCALLPDPAVEESFRSDRLQ (SEQ ID NO 16)LGLGLEDELRMEPLGLEGLNMLSDPCALLPDPAVEESFRSDRLQ (SEQ ID NO 16)

实施例11Example 11

CREAP3的鉴定Identification of CREAP3

使用与上述公开方法相似的方法，通过与公共结构域克隆cDNAFLJ00364的序列相比较在我们专有的cDNA文库EST数据库中发现了一个克隆。由FLJ00364所编码的预测蛋白质缺乏起始密码子ATG并且具有与CREAP1非同源的N-末端序列。公共结构域克隆序列与我们数据库中的相似克隆相比较的结果显示我们专有克隆序列在序列CCGTCATTTCAC CAAGC(SEQ ID NO 17)中含有额外的C，其中下划线所标出的为该额外的C。通过与基因组序列进行比较证实了该额外的C。该变化导致由FLJ00364 cDNA所预测的前63个氨基酸的去除，并且高度保守的氨基酸序列EETRAFE(SEQ ID NO 18)起始处的框内另外81个氨基酸替换为CREAP1预测蛋白质序列E²³ETAAFE(SEQ ID NO 19)。Using methods similar to those published above, a clone was found in our proprietary cDNA library EST database by comparison to the sequence of the public domain clone cDNAFLJ00364. The predicted protein encoded by FLJ00364 lacks the start codon ATG and has an N-terminal sequence non-homologous to CREAP1. Comparison of public domain clone sequences with similar clones in our database shows that our proprietary clone sequence contains an extra C in the sequence CCGTCATTTCAC C AAGC (SEQ ID NO 17), where the extra C is underlined . This additional C was confirmed by comparison with the genome sequence. This change resulted in the deletion of the first 63 amino acids predicted by FLJ00364 cDNA, and the replacement of another 81 amino acids in frame at the beginning of the highly conserved amino acid sequence EETRAFE (SEQ ID NO 18) with the CREAP1 predicted protein sequence E ²³ ETAAFE (SEQ ID NO 18) ID NO 19).

进行三轮连续的聚合酶链式反应(PCR)以获得专有克隆的完整ORF。PCR扩增循环为：94℃2分钟，23×[94℃15秒，68℃30秒和72℃15秒]和72℃2分钟。使用Advantage 2 DNA聚合酶(BD Biosciences)进行全部扩增步骤。使用具有对应ORF起始部分的核苷酸序列(5’-CCGGAATTCGCCATGGCCGCCTCGCCGGGCTCGGG-3’)(SEQID NO 20)的共同有义引物KIAAhS3_R1扩增全部三种PCR产物。使用常规方法将EcoRI限制性酶切位点包括于引物的5’末端序列。对于最初的PCR，将人基因组DNA(BD Biosciences)用作模板(每个反应2mg)，并且反义引物为KIAAhAS2(5’-CCGCGACAGGGTGAGGTCGGTCATGAGCTGCTCGAAGGCCCGCG-3’)(SEQ ID NO 21)。使用苯酚-氯仿混合物抽提142ntPCR产物并用异丙醇沉淀。用冰预冷的70％乙醇洗涤沉淀并重悬于TE缓冲液。取5ng产物用作第二轮PCR反应的模板，使用KIAAhS3_R1有义引物和 KIAAhAS3(5’-GAAGCTTCTGAAATTGAACCCGCGACAGGGTGAGGTCGGTCATG-3’)(SEQ ID NO 22)反义引物。将161nt PCR产物进行与最初PCR产物相似的处理并取5ng重悬的DNA作为最后一轮PCR的模板，使用KIAAhS3_R1有义和 KIAAhAS4(5’-TGGTAAGGATCCTCCATGGTACTGTGTAAGGCGCAGTTGCTGAAGCTTCTGAAATTGAACCCG-3’)(SEQ ID NO 23)反义引物。全部引物从SIGMA-Aldrich公司(Saint Louis，MO，USA)获得或者按照常规方法制备。Three consecutive rounds of polymerase chain reaction (PCR) were performed to obtain the complete ORF of the proprietary clone. The PCR amplification cycles were: 94°C for 2 minutes, 23x [94°C for 15 seconds, 68°C for 30 seconds and 72°C for 15 seconds] and 72°C for 2 minutes. All amplification steps were performed using Advantage 2 DNA polymerase (BD Biosciences). All three PCR products were amplified using a common sense primer KIAAhS3_R1 having a nucleotide sequence corresponding to the start of the ORF (5'-CCGGAATTCGCCATGGCCGCCTCGCCGGGCTCGGG-3') (SEQ ID NO 20). An EcoRI restriction site was included in the 5' end sequence of the primers using conventional methods. For the initial PCR, human genomic DNA (BD Biosciences) was used as template (2 mg per reaction) and the antisense primer was KIAAhAS2 (5'-CCGCGACAGGGTGAGGTCGGTCATGAGCTGCTCGAAGGCCCGCG-3') (SEQ ID NO 21). The 142 nt PCR product was extracted using a phenol-chloroform mixture and precipitated with isopropanol. The pellet was washed with ice-cold 70% ethanol and resuspended in TE buffer. Get 5ng product as the template of the second round of PCR reaction, use KIAAhS3_R1 sense primer and KIAAhAS3 (5'-GAAGCTTCTGAAATTGAACCCGCGACAGGGTGAGGTCGGTCATG-3') (SEQ ID NO 22) antisense primer. The 161nt PCR product was treated similarly to the original PCR product and 5ng of resuspended DNA was used as the template for the final round of PCR, using KIAAhS3_R1 sense and KIAAhAS4(5'-TGGTAAGGATCCTCCATGGTACTGTGTAAGGCGCAGTTGCTGAAGCTTCTGAAATTGAACCCG-3')(SEQ3) ID NO 2 primers. All primers were obtained from SIGMA-Aldrich Company (Saint Louis, MO, USA) or prepared according to conventional methods.

将202nt产物进行凝胶纯化并用EcoRI和BamHI酶切并插入专有克隆的EcoRI/BamHI消化的质粒中。将重建的全长FLJ00364 cDNA的16个不同的克隆进行序列验证并使用CRE-Luc和IL-8p-Luc报告子如上所述进行功能性测试。将不合PCR-导入的错配并强烈激活两种报告子的#5克隆用于DNA和蛋白质比对并且已经注释为CREAP3。The 202nt product was gel purified and digested with EcoRI and BamHI and inserted into the EcoRI/BamHI digested plasmid of the proprietary clone. Sixteen different clones of the reconstituted full-length FLJ00364 cDNA were sequence verified and functionally tested using the CRE-Luc and IL-8p-Luc reporters as described above. Clone #5, free of PCR-introduced mismatches and strongly activating both reporters, was used for DNA and protein alignment and has been annotated as CREAP3.

下面提供了CREAP3的核苷酸序列。斜体显示在核苷酸46的起始密码子和在1905处的TGA终止密码子。值得注意由于基因组序列和专有的克隆序列的比较，已经加入粗体下划线显示的残基288处C。加下划线的CGAGG序列表明专有克隆的5’-末端。以基因组DNA为模板通过PCR扩增该序列上游的核苷酸序列并如上所述重新插入专有克隆。The nucleotide sequence of CREAP3 is provided below. Italics show the start codon at nucleotide 46 and the TGA stop codon at 1905. Note that the bold underlined residue 288 C has been added due to comparison of the genomic sequence and the proprietary cloned sequence. The underlined CGAGG sequence indicates the 5'-end of the proprietary clone. Nucleotide sequences upstream of this sequence were amplified by PCR using genomic DNA as a template and reinserted into proprietary clones as described above.

CREAP3的核苷酸序列：Nucleotide sequence of CREAP3:

NTTTTTTGTACANAAAAGCAGGCTGTTACCGGTCCGGAATTCGCCATGGCCGCCTCGCCG 60NTTTTTTGTACANAAAAAGCAGGCTGTTACCGGTCCGGAATTCGCCATGGCCGCCTCGCCG 60

GGCTCGGGCAGCGCCAACCCGCGGAAGTTCAGTGAGAAGATCGCGCTGCACACGCAGAGA 120GGCTCGGGCAGCGCCAACCCGCGGAAGTTCAGTGAGAAGATCGCGCTGCACACGCAGAGA 120

CAGGC CGAGGAGACGCGGGCCTTCGAGCAGCTCATGACCGACCTCACCCTGTCGCGGGTT 180CAGGC CGAGG AGACGCGGGCCTTCGAGCAGCTCATGACCGACCTCACCCCTGTCGCGGGTT 180

CAATTTCAGAAGCTTCAGCAACTGCGCCTTACACAGTACCATGGAGGATCCTTACCAAAT 240CAATTTCAGAAGCTTCAGCAACTGCGCCTTACACAGTACCATGGAGGATCCTTACCAAAT 240

GTGAGCCAGCTGCGGAGCAATGCGTCAGAGTTTCAGCCGTCATTTCA CCAAGCTGATAAT 300GTGAGCCAGCTGCGGAGCAATGCGTCAGAGTTTCAGCCGTCATTTCA C CAAGCTGATAAT 300

GTTCGGGGAACCCGCCATCACGGGCTGGTGGAGAGGCCATCCAGGAACCGCTTCCACCCC 360GTTCGGGGAACCCGCCATCACGGGCTGGTGGAGAGGCCATCCAGGAACCGCTTCCACCCC 360

CTCCACCGAAGGTCTGGGGACAAGCCAGGGCGACAATTTGATGGTAGTGCTTTTGGAGCC 420CTCCACCGAAGGTCTGGGGACAAGCCAGGGCGACAATTTGATGGTAGTGCTTTTGGAGCC 420

AATTATTCCTCACAGCCTCTGGATGAGAGTTGGCCAAGGCAGCAGCCTCCTTGGAAAGAC 480AATTATTCCTCCACAGCCTCTGGATGAGAGTTGGCCAAGGCAGCAGCCTCCTTGGAAAGAC 480

GAAAAGCATCCTGGGTTCAGGCTGACATCTGCACTTAACAGGACCAATTCTGATTCTGCT 540GAAAAGCATCCTGGGTTCAGGCTGACATCTGCACTTAACAGGACCAATTCTGATTCTGCT 540

CTTCACACGAGTGCTCTGAGTACCAAGCCCCAGGACCCCTATGGAGGAGGGGGCCAGTCG 600CTTCACACGAGTGCTCTGAGTACCAAGCCCCAGGACCCCTATGGAGGAGGGGGCCAGTCG 600

GCCTGGCCTGCCCCATACATGGGGTTTTGTGATGGTGAGAATAATGGACATGGGGAAGTA 660GCCTGGCCTGCCCCATACATGGGGTTTTGTGATGGTGAGAATAATGGACATGGGGAAGTA 660

GCATCTTTCCCTGGCCCATTGAAAGAAGAGAATCTGTTAAATGTTCCTAAGCCACTGCCA 720GCATCTTTCCCTGGCCCATTGAAAGAAGAGAATCTGTTAAATGTTCCTAAGCCACTGCCA 720

AAACAACTGTGGGAGACCAAGGAGATTCAGTCCCTGTCAGGACGCCCTCGATCCTGTGAT 780AAACAACTGTGGGAGACCAAGGAGATTCAGTCCCTGTCAGGACGCCCTCGATCCTGTGAT 780

GTTGGAGGTGGCAATGCTTTTCCACATAATGGTCAAAACCTAGGCCTCTCACCCTTCTTG 840GTTGGAGGTGGCAATGCTTTTCCACATAATGGTCAAAAACCTAGGCCTCTCACCCTTCTTG 840

GGGACTTTGAACACTGGAGGGTCATTGCCAGATCTAACCAACCTCCACTACTCGACACCC 900GGGACTTTGAACACTGGAGGGTCATTGCCAGATCTAACCAACCTCCACTACTCGACACCC 900

CTGCCAGCCTCCCTGGACACCACCGACCACCACTTTGGCAGTATGAGTGTGGGGAATAGT 960CTGCCAGCCTCCCTGGACACCACCGACCACCACTTTGGCAGTATGAGTGTGGGGAATAGT 960

GTGAACAACATCCCAGCTGCTATGACCCACCTGGGTATAAGAAGCTCCTCTGGTCTCCAG 1020GTGAACAACATCCCAGCTGCTATGACCCACCTGGGTATAAGAAGCTCCTCTGGTCTCCAG 1020

AGTTCTCGGAGTAACCCCTCCATCCAAGCCACGCTCAATAAGACTGTGCTTTCCTCTTCC 1080AGTTCTCGGAGTAACCCTCTCATCCAAGCCACGCTCAATAAGACTGTGCTTTCCTCTTCC 1080

TTAAATAACCACCCACAGACATCTGTTCCCAACGCATCTGCTCTTCACCCTTCGCTCCGT 1140TTAAATAACCACCCACAGACATCTGTTCCCAACGCATCTGCTCTTCACCCTTCGCTCCGT 1140

CTGTTTTCCCTTAGCAACCCATCTCTTTCCACCACAAACCTGAGCGGCCCGTCTCGCCGT 1200CTGTTTTTCCCTTAGCAACCCATCTCTTTCCACCACAAAACCTGAGCGGCCCGTCTCGCCGT 1200

CGGCAGCCTCCCGTCAGCCCTCTCACGCTTTCTCCTGGCCCTGAAGCACATCAAGGTTTC 1260CGGCAGCCTCCCGTCAGCCTCTCACGCTTTCTCCTGGCCCTGAAGCACATCAAGGTTTC 1260

AGCAGACAGCTGTCTTCAACCAGCCCACTGGCCCCATATCCTACCTCCCAGATGGTGTCC 1320AGCAGACAGCTGTCTTCAACCAGCCCACTGGCCCCATATCCTACCCTCCCAGATGGTGTCC 1320

TCAGACCGAAGCCAACTTTCCTTTCTGCCCACAGAAGCTCAAGCCCAGGTGTCGCCGCCA 1380TCAGACCGAAGCCAACTTTTCCTTTCTGCCCACAGAAGCTCAAGCCCAGGTGTCGCCGCCA 1380

CCCCCTTACCCTGCACCCCAGGAGCTCACCCAGCCCCTCCTGCAGCAGCCCCGCGCCCCT 1440CCCCCTTACCCTGCACCCCAGGAGCTCACCCAGCCCCTCCTGCAGCAGCCCCGCGCCCCT 1440

GAGGCCCCTGCCCAGCAGCCCCAGGCAGCCTCCTCACTGCCACAGTCAGACTTTCAGCTT 1500GAGGCCCCTGCCCAGCAGCCCCAGGCAGCCTCTCACTGCCACAGTCAGACTTTCAGCTT 1500

CTCCCGGCCCAGGGCTCATCTTTGACCAACTTCTTCCCAGATGTGGGTTTTGACCAGCAG 1560CTCCCGGCCCAGGGCTCATCTTTGACCAACTTCTTCCCAGATGTGGGTTTTGACCAGCAG 1560

TCCATGAGGCCAGGCCCTGCCTTTCCTCAACAGGTGCCTCTGGTGCAACAAGGTTCCCGA 1620TCCATGAGGCCAGGCCCTGCCTTTCCTCAACAGGTGCCTCTGGTGCAACAAGGTTCCCGA 1620

GAACTGCAGGACTCTTTTCATTTGAGACCAAGCCCGTATTCCAACTGCGGGAGTCTCCCG 1680GAACTGCAGGACTCTTTTCATTTGAGACCAAGCCCGTATTCCAACTGCGGGAGTCTCCCG 1680

AACACCATCCTGCCAGAAGACTCCAGCACCAGCCTGTTCAAAGACCTCAACAGTGCGCTG 1740AACACCATCCTGCCAGAAGACTCCAGGCACCAGCCTGTTCAAAGACCTCAACAGTGCGCTG 1740

GCAGGCCTGCCTGAGGTCAGCCTGAACGTGGACACTCCATTTCCACTGGAAGAGGAGCTG 1800GCAGGCCTGCCTGAGGTCAGCCTGAACGTGGACACTCCATTTCCACTGGAAGAGGAGCTG 1800

CAGATTGAACCCCTGAGCCTGGATGGACTCAACATGTTAAGTGACTCCAGCATGGGCCTG 1860CAGATTGAACCCCTGAGCCTGGATGGACTCAACATGTTAAGTGACTCCAGCATGGGCCTG 1860

CTGGACCCCTCTGTTGAAGAGACGTTTCGAGCTGACAGACTGTGAACAGAAGGCAGTGGA 1920CTGGACCCCCTCTGTTGAAGAGACGTTTCGAGCTGACAGACTGTGAACAGAAGGCAGTGGA 1920

ACAGAAGAATGTTTTTCTGCAACAGCCAAAATAGAATGGAATAGAATGAAGCCAGCTGAT 1980ACAGAAGAATGTTTTTCTGCAACAGCCAAATAGAATGGAATAGAATGAAGCCAGCTGAT 1980

ACCACGGGCTTTCGTTATCTTGACATAGAAGGAAGCAGTGCCACGGCTCCAGGGTTTCAG 2040ACCACGGGCTTTCGTTATTCTTGACATAGAAGGAAGCAGTGCCACGGCTCCAGGGTTTCAG 2040

ATGAGATCCCATCTCAGACACTGTGGCTTCCTCCAGATCACACAGCTTTGTACTGCCTCT 2100ATGAGATCCCATCTCAGACACTGTGGCTTCCTCCAGATCACACAGCTTTGTACTGCCTCT 2100

CCCGCCTGTGGCCAAAGTCGTGTTGCAGCAGGCAGGCTGCTTGGAGCTTCCCATGAACTG 2160CCCGCCTGTGGCCAAAGTCGTGTTGCAGCAGGCAGGCTGCTTGGAGCTTCCCATGAACTG 2160

GAAAGCTCACCTCCACTGCATCTTTTTACTGGCCATCCAGTCAGCCGATGTGTAAGAGTA 2220GAAAGCTCACCTCCACTGCATCTTTTTACTGGCCATCCAGTCAGCCGATGTGTAAGAGTA 2220

GGAAATACTGTGTCACTGGAGGCCCTCCGTAGCATTGGG 2259GGAAATACTGTGTCACTGGAGGCCCTCCGTAGCATTGGG 2259

(SEQ ID NO 24)(SEQ ID NO 24)

CREAP3 cDNA编码如下显示的619个氨基酸的预测蛋白质，其起始密码子在核苷酸46处而TGA编码的终止密码子在1905处。由CREAP3所编码的不同于公共克隆FLJ00364所预测序列的备选正确的氨基酸序列以下划线标出。粗体显示了在氨基酸551和616位的谷氨酸和丙氨酸。The CREAP3 cDNA encodes a predicted protein of 619 amino acids shown below with a start codon at nucleotide 46 and a TGA-encoded stop codon at 1905. The alternative correct amino acid sequence encoded by CREAP3 that differs from the predicted sequence of public clone FLJ00364 is underlined. Glutamate and alanine at amino acid positions 551 and 616 are shown in bold.

MAASPGSGSANPRKFSEKIALHTQRQAEETRAFEQLMTDLTLSRVQFQKLQMAASPGSGSANPRKFSEKIALHTQRQAEETRAFEQLMTDLTLSRVQFQKLQ

QLRLTQYHGGSLPNVSQLRSNASEFQPSFHQADNVRGTRHHGLVERPSRNR QLRLTQYHGGSLPNVSQLRSNASEFQPSFH QADNVRGTRHHGLVERPSRNR

FHPLHRRSGDKPGRQFDGSAFGANYSSQPLDESWPRQQPPWKDEKHPGFRLFHPLHRRSGDKPGRQFDGSAFGANYSSQPLDESWPRQQPPWKDEKHPGFRL

TSALNRTNSDSALHTSALSTKPQDPYGGGGQSAWPAPYMGFCDGENNGHGETSALNRTNSDSALHTSALSTKPQDPYGGGGQSAWPAPYMGFCDGENNGHGE

VASFPGPLKEENLLNVPKPLPKQLWETKEIQSLSGRPRSCDVGGGNAFPHNVASFPGPLKEENLLNVPKPLPKQLWETKEIQSLSGRPRSCDVGGGNAFPHN

GQNLGLSPFLGTLNTGGSLPDLTNLHYSTPLPASLDTTDHHFGSMSVGNSVGQNLGLSPFLGTLNTGGSLPDLTNLHYSTPLPASLDTTDHHFGSMSVGNSV

NNIPAAMTHLGIRSSSGLQSSRSNPSIQATLNKTVLSSSLNNHPQTSVPNANNIPAAMTHLGIRSSSGLQSSRSNPSIQATLNKTVLSSSLNNHPQTSVPNA

SALHPSLRLFSLSNPSLSTTNLSGPSRRRQPPVSPLTLSPGPEAHQGFSRQSALHPSLRLFSLSNPSLSTTNLSGPSRRRQPPVSPLTLSPGPEAHQGFSRQ

LSSTSPLAPYPTSQMVSSDRSQLSFLPTEAQAQVSPPPPYPAPQELTQPLLLSSTSPLAPYPTSQMVSSDRSQLSFLPTEAQAQVSPPPPYPAPQELTQPLL

QQPRAPEAPAQQPQAASSLPQSDFQLLPAQGSSLTNFFPDVGFDQQSMRPGQQPRAPEAPAQQPQAASSLPQSDFQLLPAQGSSLTNFFPDVGFDQQSMRPG

PAFPQQVPLVQQGSRELQDSFHLRPSPYSNCGSLPNTILPEDSSTSLFKDLPAFPQQVPLVQQGSRELQDSFHLRPSPYSNCGSLPNTILPEDSSTSLFKDL

NSALAGLPEVSLNVDTPFPLEEELQIEPLSLDGLNMLSDSSMGLLDPSVEENSALAGLPEVSLNVDTPFPLEEELQIEPLSLDGLNMLSDSSMGLLDPSVEE

TFRADRLTFRADRL

(SEQ ID NO 25)(SEQ ID NO 25)

由于在288位存在上述额外的C，前81个氨基酸在FLJ00364和正确的专有克隆所预测的多肽之间是不同的。Due to the presence of the aforementioned extra C at position 288, the first 81 amino acids differ between FLJ00364 and the predicted polypeptide from the correct proprietary clone.

我们认为所显示的由CREAP3所编码的氨基酸序列是正确的，因为其显示与CREAP1和CREAP2广泛同源。简而言之，使用ClustalW将CREAP基因家族序列进行比较。使用Align，2.0版本(Myers E.W.和Miller W.，Bull.Math Biol 51：5-37(1989))和Blosum 50评分矩阵(CITE)确定氨基酸身份。使用BlastN和Celera CHD数据库(Celera Genomics，Rockville，MD)进行基因组序列比对并且使用掩蔽的共有人序列进行检索。We believe that the shown amino acid sequence encoded by CREAP3 is correct as it shows extensive homology to CREAP1 and CREAP2. Briefly, CREAP gene family sequences were compared using ClustalW. Amino acid identity was determined using Align, version 2.0 (Myers E.W. and Miller W., Bull. Math Biol 51:5-37 (1989)) and the Blosum 50 scoring matrix (CITE). Genomic sequence alignments were performed using BlastN and the Celera CHD database (Celera Genomics, Rockville, MD) and searches were performed using masked consensus sequences.

(文件：CHGD_masked_assembly_500k-i)(File: CHGD_masked_assembly_500k-i)

由专有克隆和FLJ00364 cDNA预测的氨基酸序列在两个其它区域不同。如上显示专有克隆包含额外的GAA三联体，这导致在氨基酸551位处加入一个谷氨酸。最后，CREAP3 cDNA中单个核苷酸A/G改变，导致如上所显示的在氨基酸616位苏氨酸/丙氨酸改变。The amino acid sequences predicted by the proprietary clone and the FLJ00364 cDNA differ in two other regions. The proprietary clone contained an additional GAA triplet as shown above, which resulted in the addition of a glutamic acid at amino acid 551. Finally, a single nucleotide A/G change in the CREAP3 cDNA resulted in a threonine/alanine change at amino acid 616 as shown above.

实施例12Example 12

从其它物种鉴定CREAP基因 Identification of CREAP genes from other species

果蝇CREAP基因dCREAP的鉴定：Identification of the Drosophila CREAP gene dCREAP:

使用CREAP1和CREAP3编码区按照常规方法进行Genebank蛋白质和DNA序列数据库的BLASTP检索鉴定了单一预测的果蝇基因CG6064。该基因已经命名为dCREAP并且其氨基酸序列显示如下。作为未知功能的预测基因从黑腹果蝇(D.melanogaster)基因组的测序发现了该序列：GenBank条目|7293954|gb|AAF49313.1|CG6064-PA[Drosophilamelanogaster](Adams等，Science 287(5461)：2185-2195(2000))。dCREAP基因CG6064不含插入片段并预测的797个氨基酸的蛋白质，其比人CREAP略大。BLASTP searches of the Genebank protein and DNA sequence databases using the CREAP1 and CREAP3 coding regions followed conventional methods and identified a single predicted Drosophila gene, CG6064. This gene has been named dCREAP and its amino acid sequence is shown below. This sequence was discovered from sequencing of the Drosophila melanogaster genome as a predicted gene of unknown function: GenBank entry|7293954|gb|AAF49313.1|CG6064-PA[Drosophilamelanogaster] (Adams et al. : 2185-2195 (2000)). The dCREAP gene CG6064 contains no insert and is a predicted 797 amino acid protein, which is slightly larger than human CREAP.

dCREAP DNA序列dCREAP DNA sequence

ATGGCCAATCCGCGCAAGTTCAGCGAGAAGATCGCTCTGCAGAAGCAGAAGCAGGCGGAGGGCACAGCGGATGGCCAATCCGCGCAAGTTCAGCGAGAAGATCGCTCTGCAGAAGCAGAAGCAGGCGGAGGGCACAGCGG

AATTCGAGCGGATCATGAAGGAGGTGTATGCCACGAAGAGGGATGAGCCGCCTGCGAATCAGAAGATCCTAATTCGAGCGGATCATGAAGGAGGTGTATGCCACGAAGAGGGATGAGCCGCCTGCGAATCAGAAGATCCT

AGACGGCCTTGTCGGCGGTCAGGAGGTAAGCCAATCCTCGCCAGGCGCAGGCAATGGGACGGGCGGAGGTAGACGGCCTTGTCGGCGGTCAGGAGGTAAGCCAATCCTCGCCAGGCGCAGGCAATGGGACGGGCGGAGGT

GGCAGTGGTTCCGGCAGTGGAGCCAGCGGCGGAGGAGCCTCACCAGATGGCCTGGGAGGCGGCGGTGGTTGGCAGTGGTTCCGGCAGTGGAGCCAGCGGCGGAGGAGCCTCACCAGATGGCCTGGGAGGCGGCGGTGGTT

CTCCGACGGCTTATCGAGAATCCCGAGGGCGCAGCGTAGGTGTGGGTCCCATGCGAAGACCGTCGGAGCGCTCCGACGGCTTATCGAGAATCCCGAGGGCGCAGCGTAGGTGTGGGTCCCATGCGAAGACCGTCGGAGCG

CAAGCAGGATCGTTCGCCCTACGGCAGCAGCAGTACGCAACAAACCTTAGACAACGGCCAGCTAAATCCGCAAGCAGGATCGTTCGCCCTACGGCAGCAGCAGTACGCAACAAACCTTAGACAACGGCCAGCTAAATCCG

CATCTTCTTGGTCCACCTACGGCGGAGAGTTTGTGGCGGCGGTCCAGCTCCGATTCGGCGCTGCACCAAACATCTTCTTGGTCCACCTACGGCGGAGAGTTTGTGGCGGCGGTCCAGCTCCGATTCGGCGCTGCACCAAA

GTGCGCTGGTGGCGGGCTTCAATAGCGACGTGAACTCGATGGGCGCCAACTATCAGCAGCAGCAACATCAGTGCGCTGGTGGCGGGCTTCAATAGCGACGTGAACTCGATGGGCGCCAACTATCAGCAGCAGCAACATCA

GCAACAACAGCAACCGGGCCAGCCAAGATCTCACTCGCCGCACCATGGTATAAACAGGACCATGAGTCCGGCAACAACAGCAACCGGGCCAGCCAAGATCTCACTCGCCGCACCATGGTATAAACAGGACCATGAGTCCG

CAGGCGCAACGGAGGAAGTCGCCGCTACTGCAGCCCCATCAGCTGCAGTTGCAGCAACTGCAACAGCAGCCAGGCGCAACGGAGGAAGTCGCCGCTACTGCAGCCCCATCAGCTGCAGTTGCAGCAACTGCAACAGCAGC

AGCAACAGATGCAACATCAGCATCAGCTGCACCAGCAGCTCCAAATGCAGCAGCTGCAACAGCACCAGCAAGCAACAGATGCAACATCAGCATCAGCTGCACCAGCAGCTCCAAATGCAGCAGCTGCAACAGCACCAGCA

GCAACACCAGCAGCAGCAGCAACAACAGAACACGCCATACAACAACGCCAAATTCACGAATCCTGTGTTCGCAACACCAGCAGCAGCAGCAACAACAGAACACGCCATACAACAACGCCAAATTCACGAATCCTGTGTTC

CGGCCGCTGCAGGATCAGGTCAACTTTGCCAACACCGGCTCCCTGCCCGATCTCACGGCCCTTCAAAACTCGGCCGCTGCAGGATCAGGTCAACTTTGCCAACACCGGCTCCCTGCCCGATCTCACGGCCCTTCAAAACT

ATGGACCCCAGCAGCAGCAGCAGCAATCCCAGCAACAGCCGTCGCAGCAACAACAGCAGTTGCAGCAAACATGGACCCCAGCAGCAGCAGCAGCAATCCCAGCAACAGCCGTCGCAGCAACAACAGCAGTTGCAGCAAAC

CCTGTCGCCAGTCATGTCTCCGCACAATCACCGCCGCGAACGGGATCAGTCGCCCAGTCCGTTTAGTCCGCCTGTCGCCAGTCATGTCTCCGCACAATCACCGCCGCGAACGGGATCAGTCGCCCAGTCCGTTTAGTCCG

GCGGGTGGAGGAGGGGGAGCAGGTCCCGGGTCGCCCTATCAGCAGCAACAGCACTCGCCCACCGGAAACAGCGGGTGGAGGAGGGGGAGCAGGTCCCGGGTCGCCCTATCAGCAGCAACAGCACTCGCCCACCGGAAACA

CGCAACAGCAGCAGCAGCAGCACCAACAGCCCAGCAACTCGCCGCACCTGTCCTTTACCAATCTGGCCACCGCAACAGCAGCAGCAGCAGCACCAACAGCCCAGCAACTCGCCGCACCTGTCCTTTACCAATCTGGCCAC

CACGCAGGCAGCTGTTACCACATTTAACCCGCTCCCCACGCTGGGTCCGCACAATGCCACCGACTACCGCCACGCAGGCAGCTGTTACCACATTTAACCCGCTCCCCACGCTGGGTCCGCACAATGCCACCGACTACCGC

CAGCCACCGAATCCTCCTAGTCCACGCTCTTCGCCCGGCTTGCTGAGCAGCGTATCGGCCACGGATCTGCCAGCCACCGAATCCTCCTAGTCCACGCTCTTCGCCCGGCTTGCTGAGCAGCGTATCGGCCACGGATCTGC

ACTCCAGTGCACCGGCCAGTCCCATACGCCAGCAGCAACAGGCCCATCAGCAGCAACAGCAGCAGCAACAACTCCAGTGCACCGGCCAGTCCCATACGCCAGCAGCAACAGGCCCATCAGCAGCAACAGCAGCAGCAACA

GGCGCAGCAACAACAGCAACAGTTTGATAACTCCTACAACAGTCTGAATACCTCGTTTCACAATCAGTTTGGCGCAGCAACAACAGCAACAGTTTGATAACTCCTACAACAGTCTGAATACCTCGTTTCACAAATCAGTTT

GAGATTTTCTCGCTGGGCGACAGCAATTCCTCGCCGGAACAGCAGGGCTTTGCAAATAATTTCGTGGCCCGAGATTTTCTCGCTGGGCGACAGCAATTCCTCGCCGGAACAGCAGGGCTTTGCAAATAATTTCGTGGCCC

TCGACTTTGACGACCTGAGTGGCGGCGGAGGTGGTGGCCCAAGCGGGGGCGGCGGCAGCAATGGAGGAGGTCGACTTTGACGACCTGAGTGGCGGCGGAGGTGGTGGCCCAAGCGGGGGCGGCGGCAGCAATGGAGGAGG

TCTGACCAACGGTTACAACAAGCCGGAGATGTTGGACTTCAGCGAGCTGAGCGGCAGCCCGGAGGCGAGTTCTGACCAACGGTTACAACAAGCCGGAGATGTTGGACTTCAGCGAGCTGAGCGGCAGCCCGGAGGCGAGT

GGGAACAACAACCACATGCGGCGAGGAGTGAGCAACCTGAACAACAACGGGTTGAGCAATGGTGTGGTGGGGGAACAACAACCACATGCGGCGAGGAGTGAGCAACCTGAACAACAACGGGTTGAGCAATGGTGTGGTGG

GATCCACGCACAACGGCAGCACAAATCTAAATGGAGCGGGAAACAACAATAGCAGTAGTGGAGGTGGCACGATCCACGCACAACGGCAGCACAAATCTAAATGGAGCGGGAAACAACAATAGCAGTAGTGGAGGTGGCAC

GGCGCAGGATCCTTTGGGAATAACCACTTCGCCTGTGCCCTCACCCTTGGGCTGCCCCAGTTCACCGCTGGGCGCAGGATCCTTTGGGAATAACCACTTCGCCTGTGCCCTCACCCTTGGGCTGCCCCAGTTCACCGCTG

CCGATACCGATTCCGATGTCGGCGCAAAGCTCGCCACAGCAGCAGCACCACCATCATCAGCAGCAGCAACCCGATACCGATTCCGATGTCGGCGCAAAGCTCGCCACAGCAGCAGCACCACCATCATCAGCAGCAGCAAC

AACAGCATCATCAGCAGCAACACCATCAGCAGCAGCAATTATCATTATCTCTGCACCATTCGCCGCATCAAACAGCATCATCAGCAGCAACACCATCAGCAGCAGCAATTATCATTTATCTCTGCACCATTCGCCGCATCA

TTCGCCAATGCATTCGCCGCACCATGGGAATTCACCGCTTTCAAGCAGCTCGCCAGTGAGTCACAATGCCTTCGCCAATGCATTCGCCGCACCATGGGAATTCACCGCTTTCAAGCAGCTCGCCAGTGAGTCACAATGCC

TGCTCCAACTCCAACGTGGTGATGAACCACCAGCAGCAGCAGCAACAACATCACCACCAGCAACACCATCTGCTCCAACTCCAACGTGGTGATGAACCACCAGCAGCAGCAGCAACAACATCACCACCAGCAACACCATC

ATCAGGGCTCCTCGCAAAGTCACACGCCGACCACAGCGAATATACCCTCTATTATCTTTAGTGATTACTCATCAGGGCTCCTCGCAAAGTCACACGCCGACCACAGCGAATATACCCTCTATTATCTTTAGTGATTACTC

CTCCAACGCGGATTATACCAGGGAGATCTTCGACTCCCTCGATCTGGATCTGGGACAGATGGACGTAGCCCTCCAACGCGGATTATACCAGGGAGATCTTCGACTCCCTCGATCTGGATCTGGGACAGATGGACGTAGCC

GGTTTGCAGATGCTGTCCGACCAGAACCCCATCATGATCGCCGATCCCAACATCGAGGATAGTTTTCGACGGTTTGCAGATGCTGTCCGACCAGAACCCCATCATGATCGCCGATCCCAACATCGAGGATAGTTTTCGAC

GCGACCTCAACTGATACTATGAGGAGGCTGTTGCGGCCATTGAGAGCGGAGTGCTGCTGGAGGAGGACTAGCGACCTCAACTGATACTATGAGGAGGCTGTTGCGGCCATTGAGAGCGGAGTGCTGCTGGAGGAGGACTA

CCAGGCGCTGCTCGGATCAGAGGCGCTGGCGGATGAACAGGTGGTCACAGTCGAGGCCGCCGGAGCCGCACCAGGCGCTGCTCGGATCAGAGGCGCTGGCGGATGAACAGGTGGTCACAGTCGAGGCCGCCGGAGCCGCA

GCAGCAGTAGTAACAGTTGAAGAGGCAGCCACAGTTAGCGAGAAGGACAAAAAAGATTTGGAAGTTGTGGGCAGCAGTAGTAACAGTTGAAGAGGCAGCCACAGTTAGCGAGAAGGACAAAAAAGATTTGGAAGTTGTGG

AACTTCTGGTGTCCGGTGTTATGGATGACCTGGTGGACTCCAGTGACCTGGACGAGGAAGTGCGCAATTTAACTTCTGGTGTCCGGTGTTATGGATGACCTGGTGGACTCCAGTGACCTGGACGAGGAAGTGCGCAATTT

CTTTTTTTAGGCAGCCAGCAAGTCATTTTTGTCGTTAACACAACTGATGGAATTTTCGTTTTTAACACAGCTTTTTTTAGGCAGCCAGCAAGTCATTTTTGTCGTTAACACAACTGATGGAATTTTCGTTTTTAACACAG

ATGAGGAAGTGAATTACGTTTTTTAAACGCATTCACTTGCCATTTCTCGATTAAATGCCATATTACTTAAATGAGGAAGTGAATTACGTTTTTTAAACGCATTCACTTGCCATTTCTCGATTAAATGCCATATTACTTAA

GCTCAGGATTTACAAGCTTAATGCGAATTAAGTTAATTTCGGAAATGCTGACGAGAGTGATTGCAAAGTTGCTCAGGATTTACAAGCTTAATGCGAATTAAGTTAATTTCGGAAATGCTGACGAGAGTGATTGCAAAGTT

CAAAATTGATACAAATTCACTTCCGCAAATTCATGCTGAAACTGAAAGTTTTCTAACAGTCCTCAATATTCAAAATTGATACAAATTCACTTCCGCAAATTCATGCTGAAACTGAAAGTTTTTCTAACAGTCCTCAATATT

GTTATCTCGTTATCGTCCGTGCTTTCGTAGCTAGCTCCTACAACAAAAATACGTTATCTCGTTATCGTCCGTGCTTTCGTAGCTAGCTCTACAACAAAAATAC

(SEQ ID NO 26)(SEQ ID NO 26)

下面显示了dCREAP预测的氨基酸序列：The predicted amino acid sequence of dCREAP is shown below:

MANPRKFSEKIALQKQKQAEGTAEFERIMKEVYATKRDEPPANQKILDGLVGGQEVSQSSPGAGNGTGMANPRKFSEKIALQKQKQAEGTAEFERIMKEVYATKRDEPPANQKILDGLVGGQEVSQSSPGAGNGTG

GGGSGSGSGASGGGASPDGLGGGGGSPTAYRESRGRSVGVGPMRRPSERKQDRSPYGSSSTQQTLDNGGGGSGSGSGASGGGASPDGLGGGGGSPTAYRESRGRSVGVGPMRRPSERKQDRSPYGSSSTQQTLDNG

QLNPHLLGPPTAESLWRRSSSDSALHQSALVAGFNSDVNSMGANYQQQQHQQQQQPGQPRSHSPHHGIQLNPHLLGPPTAESLWRRSSSSDSALHQSALVAGFNSDVNSMGANYQQQQHQQQQQPGQPRSHSPHHGI

NRTMSPQAQRRKSPLLQPHQLQLQQLQQQQQQMQHQHQLHQQLQMQQLQQHQQQHQQQQQQQNTPYNNNRTMSPQAQRRKSPLLQPHQLQLQQLQQQQQQMQHQHQLHQQLQMQQLQQHQQQHQQQQQQQNTPYNN

AKFTNPVFRPLQDQVNFANTGSLPDLTALQNYGPQQQQQQSQQQPSQQQQQLQQTLSPVMSPHNHRREAKFTNPVFRPLQDQVNFANTGSLPDLTALQNYGPQQQQQQSQQQPSQQQQQLQQTLSPVMSPHNHRRE

RDQSPSPFSPAGGGGGAGPGSPYQQQQHSPTGNTQQQQQQHQQPSNSPHLSFTNLATTQAAVTTFNPLRDQSPSPFSPAGGGGGAGPGSPYQQQQHSPTGNTQQQQQQHQQPSNSPHLSFTNLATTQAAVTTFNPL

PTLGPHNATDYRQPPNPPSPRSSPGLLSSVSATDLHSSAPASPIRQQQQAHQQQQQQQQAQQQQQQFDPTLGPHNATDYRQPPNPPSPRSSPGLLSSVSATDLHSSAPASPIRQQQQAHQQQQQQQQAQQQQQQFD

NSYNSLNTSFHNQFEIFSLGDSNSSPEQQGFANNFVALDFDDLSGGGGGGPSGGGGSNGGGLTNGYNKNSYNSLNTSFHNQFEIFSLGDSNSSPEQQGFANNFVALDFDDLSGGGGGGPSGGGGSNGGGLTNGYNK

PEMLDFSELSGSPEASGNNNHMRRGVSNLNNNGLSNGVVGSTHNGSTNLNGAGNNNSSSGGGTAQDPLPEMLDFSELSGSPEASGNNNHMRRGVSNLNNNGLSNGVVGSTHNGSTNLNGAGNNNSSSGGGTAQDPL

GITTSPVPSPLGCPSSPLPIPIPMSAQSSPQQQHHHHQQQQQQHHQQQHHQQQQLSLSLHHSPHHSPMGITTSPVPSPLGCPSSPLPIPIPMSAQSSPQQQHHHHQQQQQQHHQQQHHQQQQLSLSLHHSPHHSPM

HSPHHGNSPLSSSSPVSHNACSNSNVVMNHQQQQQQHHHQQHHHQGSSQSHTPTTANIPSIIFSDYSSHSPHHGNSPLSSSSPVSHNACSNSNVVMNHQQQQQQHHHQQQHHHQGSSQSHTPTTANIPSIIFSDYSS

NADYTREIFDSLDLDLGQMDVAGLQMLSDQNPIMIADPNIEDSFRRDLNNADYTREIFDSLDLDLDLGQMDVAGLQMLSDQNPIMIADPNIEDSFRRDLN

(SEQ ID NO 27)(SEQ ID NO 27)

按照以下方法分析了dCREAP的活性：The activity of dCREAP was analyzed as follows:

使用有义(SEQ ID 37)和反义(SEQ ID 38)引物通过PCR扩增编码dCREAP可读框的2.3kb cDNA。A 2.3 kb cDNA encoding the dCREAP open reading frame was amplified by PCR using sense (SEQ ID 37) and antisense (SEQ ID 38) primers.

用于扩增dCREAP ORF cDNA的有义引物：(加下划线部分是果蝇Kozak序列CAAC)CAACATGGCCAATCCGCGCAAGTTCAGCGAG(SEQ ID 37)用于扩增dCREAP ORF cDNA的反义引物：TCAGTTGAGGTCGCGTCGAAAACTATCCTC(SEQ ID 38)将扩增产物插入果蝇P-元件转化载体pUAST(Brand和Perrimon，Development 118：401-415(1993))。最终构建体pUAS-dCREAP用于黑腹果蝇(Drosophila melanogaster)Schneider细胞(S2)中的转染实验。建立萤火虫萤光素酶报告基因，其包含果蝇CRE增强子元件(SEQ ID 39)的4个拷贝(Eresh，S.等人EMBO J.16：2014-2022(1997))且该报告基因后是hsp70最小启动子。寡核苷酸序列包含果蝇CRE的4个拷贝。加下划线部分是CRE元件的序列：GGAGCC TGGCGTCAGAGAGCC TGGCGTCAGAGAGCC TGGCGTCAGAGAGCC TGGCGTCAGAG (SEQ ID 39)The sense primer that is used to amplify dCREAP ORF cDNA: (the underlined part is fruit fly Kozak sequence CAAC) CAAC ATGGCCAATCCGCGCAAGTTCAGCGAG (SEQ ID 37) The antisense primer that is used to amplify dCREAP ORF cDNA: TCAGTTGAGGTCGCGTCGAAAACTATCCTC (SEQ ID 38) will amplify The amplification product was inserted into the Drosophila P-element transformation vector pUAST (Brand and Perrimon, Development 118:401-415 (1993)). The final construct pUAS-dCREAP was used for transfection experiments in Drosophila melanogaster Schneider cells (S2). A firefly luciferase reporter gene was established comprising 4 copies of the Drosophila CRE enhancer element (SEQ ID 39) (Eresh, S. et al. EMBO J.16:2014-2022 (1997)) and after the reporter gene Is the hsp70 minimal promoter. The oligonucleotide sequence contains 4 copies of the Drosophila Cre. The underlined part is the sequence of the CRE element: GGAGCC TGGCGTCA GAGAGCC TGGCGTCA GAGAGCC TGGCGTCA GAGAGCC TGGCGTCA GAG (SEQ ID 39)

在6孔板(Costar)中通过CaPO₄方法(Bunch，T.和Goldstein，L.NucleicAcids Res.17：9761-9782(1989))转染S2细胞。将总共25μg DNA转染入含有4ml细胞(～1×10⁶细胞/ml)的6孔板。18小时后去除转染混合物并且在48小时后进行萤光素酶测定。使用Norbert Perrimon博士所提供的肌动蛋白启动子-Gal4质粒通过共转染激活UAS-转基因。通过与最小热休克启动子(按照常规方法制备)所驱动的hsp^min海肾萤光素酶共转染对转染效率进行标准化。使用双-萤光素酶测定试剂盒(Promega)测定萤光素酶活性。作为阴性对照，用CRE-hsp-Luc报告子和空pUAST载体共转染S2细胞。数据计算为与在指定为阴性对照的细胞中所测定的报告基因活性相比的诱导倍数。 _S2 cells were transfected by the CaPO4 method (Bunch, T. and Goldstein, L. Nucleic Acids Res. 17:9761-9782 (1989)) in 6-well plates (Costar). A total of 25 μg of DNA was transfected into 6-well plates containing 4 ml of cells (˜1×10 ⁶ cells/ml). The transfection mixture was removed after 18 hours and the luciferase assay was performed after 48 hours. The UAS-transgene was activated by co-transfection using the actin promoter-Gal4 plasmid provided by Dr. Norbert Perrimon. Transfection efficiency was normalized by co-transfection with hsp ^min Renilla luciferase driven by a minimal heat shock promoter (prepared according to conventional methods). Luciferase activity was determined using a dual-luciferase assay kit (Promega). As a negative control, S2 cells were co-transfected with the CRE-hsp-Luc reporter and an empty pUAST vector. Data are calculated as fold induction compared to reporter gene activity determined in cells designated as negative controls.

结果表明(见表3)，与人CREAPS一样，dCREAP也能够调节果蝇中的CRE，因为当存在CRE元件时它能有效地诱导CRE-hsp-Luc报告子的活性。The results showed (see Table 3) that, like human CREAPS, dCREAP is also able to regulate CRE in Drosophila because it efficiently induces the activity of the CRE-hsp-Luc reporter when the CRE element is present.

激活Activate

倍数 STDEVMultiple STDEV

pUAST/CRE-hsp-Luc 1.02 0.28pUAST/CRE-hsp-Luc 1.02 0.28

dCREAP/hsp-Luc 0.96 0.15dCREAP/hsp-Luc 0.96 0.15

dCREAP/CRE-hsp-Luc 136.04 37.13dCREAP/CRE-hsp-Luc 136.04 37.13

表3：dCREAP有效地诱导CRE-hsp-Luc报告子的活性。使用空pUAST载体或pUAST-dCREAP构建体(dCREAP)和hsp-Luc报告子或携带果蝇CRE4个拷贝的hsp-Luc报告子(CRE-hsp-Luc)共转染S2细胞。转染48小时后测定了萤光素酶活性。Table 3: dCREAP efficiently induces the activity of the CRE-hsp-Luc reporter. S2 cells were co-transfected with empty pUAST vector or pUAST-dCREAP construct (dCREAP) and hsp-Luc reporter or hsp-Luc reporter carrying 4 copies of Drosophila CRE (CRE-hsp-Luc). Luciferase activity was measured 48 hours after transfection.

小鼠CREAP1(mCREAP1)基因的鉴定：Identification of the mouse CREAP1 (mCREAP1) gene:

还使用常规方法鉴定了小鼠CREAP1蛋白质。简而言之，按以下顺序装配mCREAP1cDNA：The mouse CREAP1 protein was also identified using conventional methods. Briefly, mCREAP1 cDNA was assembled in the following order:

核苷酸1-483来自小鼠EST BY752080(这里和下面GenBank登录号)；Nucleotides 1-483 are from mouse EST BY752080 (GenBank accession numbers here and below);

核苷酸484-891来自小鼠EST BM950955；Nucleotides 484-891 are from mouse EST BM950955;

核苷酸892-909来自小鼠基因组DNA序列Celera克隆Nucleotides 892-909 from mouse genomic DNA sequence Celera clone

核苷酸910-981来自小鼠EST CA326891。Nucleotides 910-981 are from mouse EST CA326891.

核苷酸982-1610来自小鼠EST BM935820。Nucleotides 982-1610 are from mouse EST BM935820.

核苷酸1611-2416来自小鼠EST BI453510。Nucleotides 1611-2416 are from mouse EST BI453510.

所得到的mCREAP1核苷酸序列：The resulting mCREAP1 nucleotide sequence:

GGGACGAAGAGTAGGAGTAGGAGGAGGCGGCGAGAAGATGGCGACTTCGAACAATCCGCGGAAATTTAGGGACGAAGAGTAGGAGTAGGAGGAGGCGGCGAGAAGATGGCGACTTCGAACAATCCGCGGAAATTTA

GCGAGAAGATCGCACTGCACAACCAGAAGCAGGCGGAGGAGACGGCGGCCTTCGAGGAGGTCATGAAGGCGAGAAGATCGCACTGCACAACCAGAAGCAGGCGGAGGAGACGGCGGCCTTCGAGGAGGTCATGAAG

GACCTGAGCCTGACGCGGGCCGCGCGGCTTCAGCTGCAGAAGTCCCAGTACCTGCAGCTGGGCCCCAGGACCTGAGCCTGACGCGGGCCGCGCGGCTTCAGCTGCAGAAGTCCCAGTACCTGCAGCTGGGCCCCAG

CCGTGGCCAGTACTACGGTGGGTCCCTGCCCAACGTGAACCAGATTGGAAGCAGCAGCGTGGACCTGGCCGTGGCCAGTACTACGGTGGGTCCCTGCCCAACGTGAACCAGATTGGAAGCAGCAGCGTGGACCTGG

CCTTCCAGACCCCATTTCAGTCCTCAGGCCTGGACACGAGTCGGACCACACGACATCATGGGCTTGTGCCTTCCAGACCCCATTTCAGTCCTCAGGCCTGGACACGAGTCGGACCACACGACATCATGGGCTTGTG

GACAGAGTATATCGTGAGCGTGGCAGACTTGGCTCCCCGCACCGTCGACCCCTGTCAGTAGACAAGCAGACAGAGTATATCGTGAGCGTGGCAGACTTGGCTCCCCGCACCGTCGACCCCTGTCAGTAGACAAGCA

TGGGCGACAGGCTGACAGCTGCCCCTATGGCACCGTGTACCTCTCGCCTCCTGCGGACACCAGCTGGATGGGCGACAGGCTGACAGCTGCCCCTATGGCACCGTGTACCTCTCGCCTCCTGCGGACACCAGCTGGA

GGAGGACCAACTCTGACTCTGCCCTGCACCAGAGCACAATGACACCCAGCCAGGCAGAGTCCTTCACAGGAGGACCAACTCTGACTCTGCCCTGCACCAGAGCACAATGACACCCAGCCAGGCAGAGTCTCTTCACA

GGCGGGTCCCAGGATGCGCACCAGAAGAGAGTCTTACTGCTAACTGTCCCAGGAATGGAGGACACCGGGGCGGGTCCCAGGATGCGCACCAGAAGAGAGTCTTACTGCTAACTGTCCCAGGAATGGAGGACACCGG

GGCTGAGACAGACAAGACCCTTTCTAAGCAGTCATGGGACTCAAAGAAGGCGGGTTCCAGGCCCAAGTGGCTGAGACAGACAAGACCCTTTCTAAGCAGTCATGGGACTCAAAGAAGGCGGGTTCCAGGCCCAAGT

CCTGTGAGGTCCCCGGAATCAACATCTTTCCGTCTGCAGACCAGGAGAACACAACAGCCCTGATCCCTCCTGTGAGGTCCCCGGAATCAACATCTTTCCGTCTGCAGACCAGGAGAACACAACAGCCCTGATCCCT

GCCACCCACAACACAGGGGGCTCCCTTCCTGACCTCACCAACATCCACTTCGCCTCCCCACTCCCGACGCCACCCACAACACAGGGGGCTCCCTTCCTGACCTCACCAACATCCACTTCGCCTCCCCACTCCCGAC

ACCACTGGACCCTGAGGAGCCTCCGTTCCCTGCTCTCACCAGCTCCAGCAGCACCGGCAGCCTTGCACACCACTGGACCCTGAGGAGCCTCCGTTCCCTGCTCTCACCAGCTCCAGCAGCACCGGCAGCCTTGCAC

ATCTGGGCGTTGGCGGCGCAGGCGGTATGAACACCCCCAGCTCTTCTCCACAGCACCGGCCAGCAGTCATCTGGGCGTTGGCGGCGCAGGCGGTATGAACACCCCCAGCTCTTCTCCAGCACCGGCCAGCAGTC

GTCAGCCCCCTGTCCCTGAGCACAGAGGCCAGGCGGCAGCAGGCCCAGCAGGTGTCACCCACCCTGTCGTCAGCCCCCTGTCCCTGAGCACAGAGGCCAGGCGGCAGCAGGCCCAGCAGGTGTCACCCCACCCTGTC

TCCGTTGTCACCCATCACTCAGGCCGTGGCTATGGATGCCCTGTCCTTGGAGCAGCAGCTGCCCTATGTCCGTTGTCACCCCATCACTCAGGCCGTGGCTATGGATGCCCTGTCCTTGGAGCAGCAGCTGCCCTATG

CCTTCTTCACCCAGACTGGCTCCCAGCAGCCTCCCCCACAGCCCCAGCCACCGCCTCCACCTCCACCGCCTTTCTTCACCCAGACTGGCTCCCAGCAGCCTCCCCCACAGCCCCAGCCACCGCCTCCACCTCCACCG

GTATCCCAGCAGCAGCCACCACCTCCACAGGTGTCTGTGGGCCTCCCCCAGGGTGGTCCACTGCTGCCGTATCCCAGCAGCAGCCACCACCTCCACAGGTGTCTGTGGGCCTCCCCCAGGGTGGTCCACTGCTGCC

CAGTGCCAGCCTGACTCGGGGGCCCCAGCTGCCACCACTCTCAGTTACTGTACCATCCACTCTTCCCCCAGTGCCAGCCTGACTCGGGGGCCCCAGCTGCCACCACTCTCAGTTACTGTACCATCCACTTCTTCCCC

AGTCCCCTACAGAGAACCCAGGCCAGTCACCAATGGGGATCGATGCCACTTCGGCACCAGCTCTGCAGAGTCCCTACAGAGAACCCAGGCCAGTCACCAATGGGGATCGATGCCACTTCGGCACCAGCTCTGCAG

TACCGCACGAGTGCAGGGTCACCTGCCACCCAGTCTCCCACCTCTCCGGTCTCCAACCAAGGCTTCTCTACCGCACGAGTGCAGGGTCACCTGCCACCCAGTCTCCCACCTCTCCGGTCTCCAACCAAGGCTTCTC

CCCTGGAAGCTCCCCACAGCACACGTCCACCCTGGGCAGCGTGTTTGGGGATGCGTACTATGAGCAGCCCCTGGAAGCTCCCCACAGCACACGTCCACCCTGGGCAGCGTGTTTGGGGATGCGTACTATGAGCAGC

AGATGACAGCCAGGCAGGCCAATGCTCTGTCNCGCCAGCTGGAGCAGTTCAACATGATGGAGAACGCCAGATGACAGCCAGGCAGGCCAATGCTCTGTCNCGCCAGCTGGAGCAGTTCAACATGATGGAGAACGCC

ATCAGCTCCAGCAGCCTATACAACCCGGGCTCCACACTCAACTATTCCCAGGCTGCCATGATGGGTCTATCAGCTCCAGCAGCCTATACAACCCGGGCTCCACACTCAACTATTCCCAGGCTGCCATGATGGGTCT

GAGCGGGAGCCACGGGGGCCTACAGGACCCGCAGCAGCTCGGCTACACAGGCCACGGTGGAATCCCCAGAGCGGGAGCCACGGGGGCCTACAGGACCCGCAGCAGCTCGGCTACACAGGCCACGGTGGAATCCCCA

ACATCATCCTCACGGTGACAGGAGAGTCACCACCGAGCCTCTCTAAGGAACTGAGCAGCACACTGGCAACATCATCCTCACGGTGACAGGAGAGTCACCACCGAGCCTCTCTAAGGAACTGAGCAGCACACTGGCA

GGAGTCAGTGATGTCAGCTTTGATTCGGACCATCAGTTTCCACTGGACGAGCTGAAGATTGACCCTCTGGAGTCAGTGATGTCAGCTTTGATTCGGACCATCAGTTCCACTGGACGAGCTGAAGATTGACCCTCT

GACCCTGGACGGACTCCATATGTTGAATGACCCAGACATGGTTTTAGCCGACCCAGCCACCGAGGACAGACCCTGGACGGACTCCATATGTTGAATGACCCAGACATGGTTTTAGCCGACCCAGCCACCGAGGACA

CCTTCCGAATGGACCGCCTGTGAGTGGCTGTGCCCACCAGCCGCCGCTGGTCAGTCTCCAACGGCGCTCCTTCCGAATGGACCGCCTGTGAGTGGCTGTGCCCACCAGCCGCCGCTGGTCAGTCTCCAACGGCGCT

GCCCCAAACCTGGGGACGGCAATGGCGTCCCCCTTTGCCAACGGCCAAGCTTGTGGTTCTGAGCTTGCGCCCCAAACCTGGGGACGGCAATGGCGTCCCCCCTTTGCCAACGGCCAAGCTTGTGGTTCTGAGCTTGC

AATGCTGCCCAGTGCCCCTGCCAGCCCCCCGCCACCCCGGTCGTTCACCTCCCATGATGCCTGGCGTGAATGCTGCCCAGTGCCCCTGCCAGCCCCCCGCCACCCCGGTCGTTCACCTCCCATGATGCCTGGCGTG

CGTGAGGCCGCTGTGTACTAGGCTGGCTATCTGTCTGTCCATCCATCTACCTGGGGTCAGGCTGATGGCGTGAGGCCGCTGTGTACTAGGCTGGCTATCTGTCTGTCCATCCATCTACCTGGGGTCAGGCTGATGG

CCGAGGCTGTGAGTGCCTGGCCCCCATGGATGTTCCCCGTGCTCGCTCCCTCACCCCTCACTGGGGATCCGAGGCTGTGAGTGCCTGGCCCCCATGGATGTTCCCCGTGCTCGCTCCCTCACCCTCACTGGGGAT

GTGAGAGCCCTCATCAGATACCCAAAGTGTCACTCACTTCCAGCATGTGCTGTGCAACGGAGGGCCGGGTGAGAGCCCTCATCAGATACCCAAAGTGTCACTCACTTCCAGCATGTGCTGTGCAACGGAGGGCCGG

GGCGTGGGTGTGGAGCGCCCAGAGGCTTAGGTGCGCCATCCATTCGACTGTTGTCAGCTGTCACTGCCGGCGTGGGTGTGGAGCGCCCAGAGGCTTAGGTGCGCCATCCATTCGACTGTTGTCAGCTGTCACTGCC

TTCCTCCATCCTGTCCCCCGTCCCACCGCCATCCCTTTCCCTCCATCCTGTCCCCCGTCCCCACCGCCATCCCT

(SEQ ID NO.28)(SEQ ID NO.28)

编码mCREAP1蛋白质序列的可读框是由核苷酸25-1914编码。The open reading frame encoding the mCREAP1 protein sequence is encoded by nucleotides 25-1914.

mCREAP1的蛋白质序列：Protein sequence of mCREAP1:

NQIGSSSVDLAFQTPFQSSGLDTSRTTRHHGLVDRVYRERGRLGSPHRRPLSVDKHGRQADSCPYGTVNQIGSSSVDLAFQTPFQSSGLDTSRTTRHHGLVDRVYRERGRLGSPHRRRPLSVDKHGRQADSCPYGTV

YLSPPADTSWRRTNSDSALHQSTMTPSQAESFTGGSQDAHQKRVLLLTVPGMEDTGAETDKTLSKQSWYLSPPADTSWRRTNSDSALHQSTMTPSQAESFTGGSQDAHQKRVLLLTPGMEDTGAETDKTLSKQSW

DSKKAGSRPKSCEVPGINIFPSADQENTTALIPATHNTGGSLPDLTNIHFASPLPTPLDPEEPPFPALDSKKAGSRPKSCEVPGINIFPSADQENTTALIPATHNTGGSLPDLTNIHFASPLTPLDPEEPFPAL

TSSSSTGSLAHLGVGGAGGMNTPSSSPQHRPAVVSPLSLSTEARRQQAQQVSPTLSPLSPITQAVAMDTSSSSTGSLAHLGVGGAGGMNTPSSSPQHRPAVVSPLSLSTEARRQQAQQVSPTLSPLSPITQAVAMD

ALSLEQQLPYAFFTQTGSQQPPPQPQPPPPPPPVSQQQPPPPQVSVGLPQGGPLLPSASLTRGPQLPPALSLEQQLPYAFFTQTGSQQPPPQPQPPPPPPPVSQQQPPPPQVSVGLPQGGPLLPSASLTRGPQLPP

LSVTVPSTLPQSPTENPGQSPMGIDATSAPALQYRTSAGSPATQSPTSPVSNQGFSPGSSPQHTSTLGLSVTVPSTLPQSPTENPGQSPMGIDATSAPALQYRTSAGSPATQSPTSPVSNQGFSPGSSPQHTSTLG

SVFGDAYYEQQMTARQANALSRQLEQFNMMENAISSSSLYNPGSTLNYSQAAMMGLSGSHGGLQDPQQSVFGDAYYEQQMTARQANALSRQLEQFNMMENAISSSSLYNPGSTLNYSQAAMMGLSGSHGGLQDPQQ

LGYTGHGGIPNIILTVTGESPPSLSKELSSTLAGVSDVSFDSDHQFPLDELKIDPLTLDGLHMLNDPDLGYTGHGGIPNIILTVTGESPPSLSKELSSTLAGVSDVSFDSDHQFPLDELKIDPLTLDGLHMLNDPD

MVLADPATEDTFRMDRL(SEQ ID NO：29)MVLADPATEDTFRMDRL (SEQ ID NO: 29)

河豚CREAP1的鉴定：Identification of Pufferfish CREAP1:

在红鳍东方豚中鉴定了CREAP1。使用TBLASTN通过将人CREAP1蛋白质序列与河豚基因组(版本3)进行比对鉴定了该序列。通过比对找到了高度同源的区域。将所找到的序列进一步进行人工编辑。CREAP1 was identified in the redfin oriental porpoise. The sequence was identified by alignment of the human CREAP1 protein sequence to the pufferfish genome (version 3) using TBLASTN. Highly homologous regions were found by comparison. The found sequences were further manually edited.

河豚CREAP1氨基酸序列：Puffer fish CREAP1 amino acid sequence:

MASSNNPRKFSEKIALHNQKQAEETAAFEEVMKDLNVTRAARLQLQKTQYLQLGQNRGQYYGGSLPNVMASSSNNPRKFSEKIALHNQKQAEETAAFEEVMKDLNVTRAARLQLQKTQYLQLGQNRGQYYGGSLPNV

NQIGNGNIDLPFQVSNSVLDTSRTTRHHGLVERVYRDRNRISSPHRRPLSVDKHGRQRTNSDSALHQSNQIGNGNIDLPFQVSNSVLDTSRTTRHHGLVERVYRDRNRISSPHRRPLSVDKHGRQRTNSDSALHQS

AMNPKPHEVFAGGSQELQPKRLLLTVPGTEKSESNADKDSQEQSWDDKKSIFPSPDQELNPSVLPAAHAMNPKPHEVFAGGSQELQPKRLLLTVPGTEKSESNADKDSQEQSWDDKKSIFPSPDQELNPSVLPAAH

NTGGSLPDLTNIQFPPPLSTPLDPEDTVTFPSLSSSNSTGSLTTNLTHLGISVASHGNNGEKNIFFLKNTGGSLPDLTNIQFPPPLSTPLDPEDTVTFPSLSSSNSTGSLTTNLTHLGISVASHGNNGEKNIFFLK

TCTSCEDVYDFYFVGIPTSSQTTMTATAQRRQPPVVPLTLTSDLTLQQSPQQLSPTLSSPINITQSMKTCTSCEDVYDFYFVGIPTSSQTTMTATAQRRQPPVVPLTLTSDLTLQQSPQQLSPTLSSPINITQSMK

LSASSLQQYRNQTGSPATQSPTSPVSNQGFSPGSSPQPQHIPVVGSIFGDSFYDQQLALRQTNALSHQLSASSSLQQYRNQTGSPATQSPTSPVSNQGFSPGSSPQPQHIPVVGSIFGDSFYDQQLALRQTNALSHQ

VCEDGRRLEITHVRLSRLHAELCFCFSQLEQFNMIENPISSTSLYNQCSTLNYTQAAMMGLTGSSLQDVCEDGRRLEITHVRLSRLHAELCFCFSQLEQFNMIENPISSTSLYNQCSTLNYTQAAMMGLTGSSLQD

SQQLGYGNHGNIPNIILTISVTGESPPSLSKELTNSLAGVGDVSFDPDTQFPLDELKIDPLTLDGLHMSQQLGYGNHGNIPNIILTISVTGESPPSLSKELTNSLAGVGDVSFDPDTQFPLDELKIDPLTLDGLHM

LNDPDMVLADPATEDTFRMDRL(SEQ ID NO.30)LNDPDMVLADPATEDTFRMDRL (SEQ ID NO. 30)

河豚CREAP1DNA序列：Puffer fish CREAP1 DNA sequence:

ATGGCGTCCTCTAACAATCCTCGCAAATTTAGCGAAAAAATCGCACTGCATAACCAGAAACAAGCAGAATGGCGTCCTCTAACAATCCTCGCAAATTTAGCGAAAAAATCGCACTGCATAACCAGAAACAAGCAGA

GGAGACTGCTGCGTTCGAAGAAGTGATGAAGGACCTGAACGTCACAAGGGCTGCCCGGGTAAGACAGCGGAGACTGCTGCGTTCGAAGAAGTGATGAAGGACCTGAACGTCACAAGGGCTGCCCGGGTAAGACAGC

TGCAGTTACAGAAGACCCAGTATTTGCAACTAGGGCAGAATCGTGGACAGTACTATGGAGGCTCACTGTGCAGTTACAGAAGACCCAGTATTTGCAACTAGGGCAGAATCGTGGACAGTACTATGGAGGCTCACTG

CCCAATGTCAATCAGATTGGAAATGGCAACATTGACCTGCCTTTTCAGGTGAGCAGGACAAACTCAGACCCAATGTCAAATCAGATTGGAAATGGCAACATTGACCTGCCTTTTCAGGTGAGCAGGACAAACTCAGA

CTCAGCTTTACATCAGAGTGCCATGAATCCAAAGCCCCACGAAGTGTTTGCTGGGGGGTCGCAGGAGCCTCAGCTTTACATCAGAGTGCCATGAATCCAAAGCCCCACGAAGTGTTTGCTGGGGGGTCGCAGGAGC

TGCAGCCCAAACGACTGCTGCTAACAGTGCCTGGAACCGAAAAATCGGAATCAAACGCAGACAAAGATTGCAGCCCAAACGACTGCTGCTAACAGTGCCTGGAACCGAAAAATCGGAATCAAACGCAGACAAAGAT

TCGCAGGAGCAGTCGTGGGATGACAAAAAGAGTATTTTTCCATCACCAGACCAGGAGTTAAACCCCTCTCGCAGGAGCAGTCGTGGGATGACAAAAAGAGTATTTTTCCATCACCAGACCAGGAGTTAAACCCCTC

CGTGCTTCCAGCCGCGCACAACACCGGCGGTTCGCTCCCCGACCTGACCAACATCCAGTTCCCTCCTCCGTGCTTCCAGCCGCGCACAACACCGGCGGTTCGCTCCCCGACCTGACCAACATCCAGTTCCCTCCTC

CACTGTCCACCCCACTGGACCCCGAGGACACCGTCACCTTCCCCTCCCTCAGCTCCTCTAACAGCACACACTGTCCACCCCACTGGACCCCGAGGACACCGTCACCTTTCCCCTCCCTCAGCTCCTCTAACAGCACA

GGCAGTCTGACTACCAACCTCACCCACCTGGGCATCAGTGTGGCCAGCCATGGTAATAACGGAGAGAAGGCAGTCTGACTACCAACCTCACCCACCTGGGCATCAGTGTGGCCAGCCATGGTAATAACGGAGAGAA

AAATATATTTTTTTTAAAAACATGCACTTCATGCGAGGATGTTAAATAATATTACGACTTTTATTTTGAAATATATTTTTTTTAAAAACATGCACTTCATGCGAGGATGTTAAATAATATTACGACTTTTATTTTG

TAGGGATTCCCACTTCCTCTCAAACCACCATGACAGCAACAGCACAGCGGCGGCAACCACCCGTGGTCTAGGGATTCCCACTTCCTCTCAAAACCACCATGACAGCAACAGCACAGCGGCGGCAACCACCCGTGGTC

CCCCTCACCCTCACCTCTGACCTGACTCTTCAACAGTCCCCCCAGCAGCTTTCACCCACCCTCTCCTCCCCCTCACCCTCACCTCTGACCTGACTCTTCAACAGTCCCCCCAGCTTTTCACCCACCCTCTCCTCTC

ACCCATTAACATCACACAGAGCATGAAGCTTAGTGCTAGCTAACATTCTTCCCTCCAACAGTACCGCAACCCATTAACATCACACAGAGCATGAAGCTTAGTGCTAGCTAACATTCTTCCCTCCAACAGTACCGCA

ATCAGACTGGCTCACCAGCCACTCAGTCTCCAACCTCCCCAGTCTCCAATCAAGGCTTCTCCCCCGGCATCAGACTGGCTCACCAGCCACTCAGTCTCCAACCTCCCCAGTCTCCAATCAAGGCTTCTCCCCCGGC

AGCTCGCCTCAACCACAGCACATTCCTGTGGTGGGCAGTATATTTGGGGACTCCTTCTATGATCAGCAAGCTCGCCTCAACCACAGCACATTCCTGTGGTGGGCAGTATATTTGGGGACTCCTTCTATGATCAGCA

GTTGGCTCTGAGGCAGACCAATGCCCTTTCTCATCAGGTGTGTGAGGACGGCCGCAGGTTAGAAATAAGTTGGCTCTGAGGCAGACCAATGCCCTTTCTCATCAGGTGTGTGAGGACGGCCGCAGGTTAGAAATAA

CACACGTACGTCTCTCACGACTTCACGCCGAGCTTTGTTTTTGTTTTTCTCAGCTGGAGCAGTTCAATCACACGTACGTCTCTCACGACTTCACGCCGAGCTTTGTTTTTGTTTTTCTCAGCTGGAGCAGTTCAAT

ATGATAGAGAACCCCATCAGCTCCACCAGCCTGTACAATCAGTGCTCCACCCTTAATTACACACAGGCATGATAGAGAACCCCATCAGCTCCACCAGCCTGTACAATCAGTGCTCCACCCTTAATTACACACAGGC

AGCCATGATGGGCCTCACCGGGAGCAGCCTGCAGGACTCGCAGCAGCTCGGCTACGGCAATCACGGCAAGCCATGATGGGCCTCACCGGGAGCAGCCTGCAGGACTCGCAGCAGCTCGGCTACGGCAATCACGGCA

ACATCCCCAACATCATACTGACAATTTCAGTCACAGGGGAGTCTCCGCCGAGCCTCTCCAAAGAGCTGACATCCCCAAACATCATACTGACAATTTCAGTCACAGGGGAGTCTCCGCCGAGCCTCTCCAAAGAGCTG

ACCAACTCATTGGCCGGCGTCGGCGACGTCAGCTTTGATCCAGACACGCAGTTTCCTCTGGACGAGCTACCAACTCATTGGCCGGCGTCGGCGACGTCAGCTTTGATCCAGACACGCAGTTTCCTCTGGACGAGCT

GAAGATCGACCCGCTGACCTTGGACGGCCTGCACATGCTCAACGACCCAGACATGGTGCTGGCAGACCGAAGATCGACCCGCTGACCTTGGACGGCCTGCACATGCTCAACGACCCAGACATGGTGCTGGCAGACC

CCGCCACAGAGGACACGTTCAGGATGGACAGGCTGTAA(SEQ ID NO.31)CCGCCACAGAGGACACGTTCAGGATGGACAGGCTGTAA (SEQ ID NO. 31)

实施例13Example 13

人CREAP编码区与来自其它物种的其它CREAP蛋白质的比较Comparison of the human CREAP coding region with other CREAP proteins from other species

如图1中所示通过全部三种CREAP序列编码区的总体比对对这些序列进行了比较。每种蛋白质大小相似，CREAP2略大(与CREAP1和3的分别650和619个氨基酸相比为693个氨基酸)。基于保守性能够将这些蛋白质大致分为3个结构域。第一个是占三分之一的保守氨基末端，与CREAP1的直至氨基酸267(即氨基酸1-267)具有高度同一性。该区域在全部三种CREAP之间大致为33％的同一性。第二个结构域是跨越CREAP1氨基酸289-538的中间区域，该区域脯氨酸、甘氨酸和丝氨酸残基高度丰富。该区域分别对应CREAP1、CREAP2和CREAP3的氨基酸289-529、376-606和235-533。该区域具有很小的氨基酸同一性，但是在氨基酸构成上是相似的。最后，该蛋白质占三分之一的羧基末端(大致对应CREAP1最后78个氨基酸(CREAP1的氨基酸575-650))又是高度保守的，在全部三种蛋白质之间具有38％的氨基酸同一性。These sequences were compared by a global alignment of the coding regions of all three CREAP sequences as shown in Figure 1 . Each protein is similar in size, with CREAP2 being slightly larger (693 amino acids compared to 650 and 619 amino acids for CREAP1 and 3, respectively). These proteins can be roughly divided into three structural domains based on conservation. The first is the one-third conserved amino terminus, which shares high identity with CREAP1 up to amino acid 267 (ie, amino acids 1-267). This region is approximately 33% identical across all three CREAPs. The second domain is the middle region spanning amino acids 289-538 of CREAP1, which is highly abundant in proline, glycine and serine residues. This region corresponds to amino acids 289-529, 376-606 and 235-533 of CREAP1, CREAP2 and CREAP3, respectively. This region has little amino acid identity, but is similar in amino acid composition. Finally, the carboxy-terminal third of the protein (approximately corresponding to the last 78 amino acids of CREAP1 (amino acids 575-650 of CREAP1)) is again highly conserved, with 38% amino acid identity between all three proteins.

有趣地是，该蛋白质的最保守部分是氨基末端。超过24个氨基酸的80％同一性区域存在于全部三种蛋白质。该区域在果蝇中也是保守的并且对于CREAP功能是必需的并可能代表调节CREAP功能的关键区域。CREAP氨基末端的保守性表明该区域对于其功能是关键的。该观点由如下数据支持，所述数据表明氨基末端250个氨基酸的缺失破坏了CREAP1活性(见上面表1)。Interestingly, the most conserved part of the protein is the amino terminus. A region of 80% identity over 24 amino acids is present in all three proteins. This region is also conserved in Drosophila and is essential for CREAP function and may represent a key region regulating CREAP function. The conservation of the amino terminus of CREAP suggests that this region is critical for its function. This notion is supported by data showing that deletion of the amino-terminal 250 amino acids abolishes CREAP1 activity (see Table 1 above).

为了进一步鉴定是否绝大部分氨基末端残基都是关键的，在CREAP1中产生了最N-末端59个氨基酸的缺失。用ScaI/XhoI限制性内切酶(RocheApplied Science，Indianapolis，IN，USA)从最初的pCMV-SPORT6质粒切除CREAP1cDNA。ScaI消化的2382nt CREAP1cDNA片段缺失了CREAP1 ORF的177个核苷酸，将该片段进行凝胶纯化并亚克隆入EcoRV/SalI消化的pFLAG-CMV6B载体(BD Biosciences)。分离正确的克隆并按照常规方法进行序列验证。在启动子-报告子测定中试验了该蛋白质(δ59)。与其保守性相一致，这些残基的缺失导致CREAP活性损失80-90％(数据未显示)。To further identify whether the vast majority of amino-terminal residues are critical, a deletion of the most N-terminal 59 amino acids was made in CREAP1. CREAP1 cDNA was excised from the original pCMV-SPORT6 plasmid with ScaI/XhoI restriction enzymes (Roche Applied Science, Indianapolis, IN, USA). A ScaI digested 2382nt CREAP1 cDNA fragment lacking 177 nucleotides of the CREAP1 ORF was gel purified and subcloned into EcoRV/SalI digested pFLAG-CMV6B vector (BD Biosciences). Correct clones were isolated and sequence verified by routine methods. This protein (delta 59) was tested in a promoter-reporter assay. Consistent with their conservation, deletion of these residues resulted in an 80-90% loss of CREAP activity (data not shown).

图1中显示了人CREAP1和来自其它物种的同系物的相似性。总体来看，所描述的人CREAP的3个结构域也包含于其它CREAP序列。氨基末端是高度保守的。值得注意地是，人CREAP最氨基末端的保守氨基酸在这些蛋白质中也是高度保守的。The similarity between human CREAP1 and homologues from other species is shown in Figure 1 . Taken together, the three domains described for human CREAP are also contained in other CREAP sequences. The amino terminus is highly conserved. Notably, the most amino-terminal conserved amino acid of human CREAP is also highly conserved in these proteins.

本研究中鉴定的人CREAP1 cDNA编码预测的650个氨基酸的蛋白质。该cDNA与注释为KIAA0616的许多cDNA部分重叠但在所编码蛋白质的预测的c-末端不同。我们能够鉴定N-末端卷曲螺旋(coil-coil)结构域(氨基酸8-54)、富含丝氨酸/谷氨酰胺的结构域(氨基酸289-559)和带强负电荷的C-末端结构域(氨基酸602-643)(数据未显示)。如所显示，连同人CREAP2和CREAP3一起，在小鼠和河豚基因组中发现了编码与CREAP1高度相似的蛋白质的基因。全部人和小鼠CREAP1基因具有90％同一性。所预测的河豚蛋白质长为566个氨基酸并66％相同于人CREAP1。The human CREAP1 cDNA identified in this study encodes a predicted 650 amino acid protein. This cDNA partially overlaps with many cDNAs annotated KIAA0616 but differs at the predicted c-terminus of the encoded protein. We were able to identify an N-terminal coil-coil domain (amino acids 8-54), a serine/glutamine-rich domain (amino acids 289-559) and a strongly negatively charged C-terminal domain ( amino acids 602-643) (data not shown). As shown, along with human CREAP2 and CREAP3, genes encoding proteins highly similar to CREAP1 were found in the mouse and pufferfish genomes. All human and mouse CREAP1 genes are 90% identical. The predicted fugu protein is 566 amino acids long and 66% identical to human CREAP1.

我们还鉴定了在果蝇基因组中预测的CREAP1样基因。虽然哺乳动物和鱼类CREAP1基因与果蝇序列仅有约20％的同一性，但果蝇序列与其它CREAP1蛋白质具有相似的组织。每种蛋白质包含高度保守的氨基和羧基末端区域以及富含脯氨酸、谷氨酰胺和丝氨酸残基的中央结构域。我们已经命名果蝇预测基因为dCREAP。dCREAP的28个氨基酸的前22个与人CREAP1相同。氨基末端具有与CREB蛋白质中的磷酸化位点相似的绝对保守的共有PKA或PKC共有磷酸化位点(RKFS)。CREB中该丝氨酸(CREB1中的丝氨酸133)的磷酸化作用是通过cAMP诱导CREB依赖基因表达所需的。We also identified a predicted CREAP1-like gene in the Drosophila genome. Although the mammalian and fish CREAP1 genes share only about 20 percent identity with the Drosophila sequence, the Drosophila sequence shares a similar organization with other CREAP1 proteins. Each protein contains highly conserved amino- and carboxy-terminal regions and a central domain rich in proline, glutamine and serine residues. We have named the Drosophila predictor gene dCREAP. The first 22 of the 28 amino acids of dCREAP are identical to human CREAP1. The amino terminus has an absolutely conserved consensus PKA or PKC consensus phosphorylation site (RKFS) similar to the phosphorylation site in the CREB protein. Phosphorylation of this serine in CREB (serine 133 in CREB1) is required for the induction of CREB-dependent gene expression by cAMP.

DCREAP前32个氨基酸分别69％和84％相同并相似与人CREAP1，这再次支持了该观点，即CREAP的氨基末端对其功能是关键的。另外，dCREAP的中央结构域是具有很小同源性的较低复杂性区域。尽管预测的dCREAP编码区确实具有一些甘氨酸和脯氨酸丰富区，但在谷氨酰胺残基高度丰富方面是独特的。另外，与人CREAP相似，该蛋白质的最羧基末端是与人CREAP1高度保守的(最后30个氨基酸超过30％)。The first 32 amino acids of DCREAP are 69% and 84% identical and similar to human CREAP1, respectively, which again supports the idea that the amino terminus of CREAP is critical for its function. Additionally, the central domain of dCREAP is a region of lower complexity with little homology. Although the predicted dCREAP coding region does have some glycine and proline rich regions, it is unique in being highly rich in glutamine residues. In addition, similar to human CREAP, the most carboxyl terminus of this protein is highly conserved with human CREAP1 (more than 30% of the last 30 amino acids).

表4中显示了CREAP基因的相关性。总体而言，人CREAP基因彼此之间比dCREAP更相关。CREAP2稍微地更加相似于CREAP1和CREAP3，比CREAP1和CREAP3之间更相似，但是全部它们之间具有34-39％的同一性。发现全部人CREAP约20％相同于预测的dCREAP基因，尽管如上所示的相似性更多地是由于蛋白质高度保守的氨基和羧基末端。应该注意到在小鼠和人基因组中全部三个CREAP基因是高度保守的(数据未显示)。这表明不同同种型具有唯一和关键的功能。CREAP的进化保守性支持CREAP是CRE活性的关键调节子的观点。 HCREAP1 HCREAP2 HCREAP3 MCREAP1 FCREAP1 dCREAP HCREAP1HCREAP2HCREAP3 32 3233 893430 633160 191815 MCREAP1FCREAP1dCREAP 60 2120 The correlation of the CREAP gene is shown in Table 4. Overall, human CREAP genes were more related to each other than dCREAP. CREAP2 is slightly more similar to CREAP1 and CREAP3 than between CREAP1 and CREAP3, but all share 34-39% identity between them. All human CREAP was found to be about 20% identical to the predicted dCREAP gene, although as shown above the similarity is more due to the highly conserved amino and carboxy termini of the protein. It should be noted that all three CREAP genes are highly conserved in the mouse and human genomes (data not shown). This suggests that the different isoforms have unique and critical functions. The evolutionary conservation of CREAP supports the notion that CREAP is a key regulator of CRE activity. HCREAP1 HCREAP2 HCREAP3 MCREAP1 FCREAP1 dCREAP HCREAP1HCREAP2HCREAP3 32 3233 893430 633160 191815 MCREAP1FCREAP1dCREAP 60 2120

表4：来自不同物种的CREAP基因的氨基酸相似性。所显示数据代表全部蛋白质编码区氨基酸同一性百分比并且如上所述进行计算。百分比同一性基于使用ClustalW V1.74的自动比对。Table 4: Amino acid similarity of CREAP genes from different species. Data shown represent percent amino acid identity across protein coding regions and were calculated as described above. Percent identities are based on automated alignments using ClustalW V1.74.

实施例14Example 14

CREAP2和CREAP3的活性 Activity of CREAP2 and CREAP3

人CREAP基因之间的同源性表明它们是功能上相关的。为了研究这一点，在如上面所公开的共转染测定中测试了CREAP2和CREAP3激活由IL-8启动子和CRE-依赖启动子所驱动的基因表达的能力。简而言之，用空pCMV-SPORT6表达载体或携带CREAP1、CREAP2或CREAP3cDNA的相同载体共转染后，测定了由IL-8启动子或连接至4个CRE拷贝的最小启动子所驱动的萤光素酶报告子的表达水平。结果表明CREAP1与依赖IL-8启动子或-依赖CRE驱动的萤火虫萤光素酶基因的共转染导致萤光素酶活性急剧增加(见表5)。CREAP2或CREAP3的转染也产生了两种报告子的相似的激活。其它实验表明该活性依赖于存在于IL-8启动子中的CRE或CRE-样位点的完整性(数据未显示)。有趣地是，与CREAP1和CREAP2相比，CREAP3始终如一地显示2-4倍的更高水平诱导基因表达。因此，CREAP2和CREAP3是CRE驱动基因表达的有效激活子并且CREAP家族代表保守序列和活性的家族。此外，当过表达于HLR-CREB细胞系(Stratagene)时，全部三个CREAP家族成员都显示了激活CREB-GAL4融合蛋白质的能力，其中该细胞系携带有基因组整合的UAS-Luc报告子，这支持了如下证据，即CREAP蛋白质可以通过与结合启动子的CREB蛋白质相互作用诱导基因表达(数据未显示)。 CRE IL-8 对照 1 1 CREAP1 28.6 175.8571 CREAP2 38.6 126.8571 CREAP3 71.4 574.5714 The homology between the human CREAP genes suggests that they are functionally related. To investigate this, the ability of CREAP2 and CREAP3 to activate gene expression driven by the IL-8 promoter and the CRE-dependent promoter was tested in a co-transfection assay as disclosed above. Briefly, after co-transfection with an empty pCMV-SPORT6 expression vector or the same vectors carrying CREAP1, CREAP2, or CREAP3 cDNA, the expression of fireflies driven by the IL-8 promoter or a minimal promoter linked to four copies of CRE was determined. Expression levels of the luciferase reporter. The results showed that co-transfection of CREAP1 with either the IL-8-dependent promoter or the -CRE-dependent firefly luciferase gene resulted in a dramatic increase in luciferase activity (see Table 5). Transfection of CREAP2 or CREAP3 also produced similar activation of both reporters. Additional experiments indicated that this activity was dependent on the integrity of CRE or CRE-like sites present in the IL-8 promoter (data not shown). Interestingly, CREAP3 consistently showed 2-4 fold higher levels of induction of gene expression compared to CREAP1 and CREAP2. Thus, CREAP2 and CREAP3 are potent activators of CRE-driven gene expression and the CREAP family represents a family of conserved sequences and activities. Furthermore, all three CREAP family members showed the ability to activate the CREB-GAL4 fusion protein when overexpressed in the HLR-CREB cell line (Stratagene), which carries a genome-integrated UAS-Luc reporter, which This supports evidence that CREAP proteins can induce gene expression by interacting with promoter-bound CREB proteins (data not shown). CRE IL-8 control 1 1 CREAP1 28.6 175.8571 CREAP2 38.6 126.8571 CREAP3 71.4 574.5714

表5.CREAP基因家族对CRE驱动启动子或白细胞介素-8启动子的诱导作用。用空载体(对照)或编码三种CREAP基因的表达载体共转染由连接至CRE多拷贝上的最小启动子或白细胞介素-8启动子所驱动的萤光素酶表达构建体。萤光素酶的表达水平表示为相对于用空载体共转染所获得的水平。Table 5. Induction of CREAP gene family on CRE driven promoter or interleukin-8 promoter. Luciferase expression constructs driven by a minimal or interleukin-8 promoter linked to multiple copies of CREAP were co-transfected with empty vector (control) or expression vectors encoding the three CREAP genes. The expression level of luciferase is expressed relative to the level obtained by co-transfection with empty vector.

实施例15Example 15

CREAP1蛋白质是转录激活子 The CREAP1 protein is a transcriptional activator

一些观察表明CREAP1是转录共激活因子。首先，尽管我们还不能鉴定CREAP1中任意DNA结合活性，但是每种CREAP蛋白质含有预测的N-末端卷曲螺旋结构域(hCREAP1残基8-54)、富含丝氨酸/谷氨酰胺丰富结构域(hCREAP1残基289-559)和带负电荷的羧基末端。Several observations suggest that CREAP1 is a transcriptional coactivator. First, although we have not yet been able to identify any DNA-binding activity in CREAP1, each CREAP protein contains a predicted N-terminal coiled-coil domain (hCREAP1 residues 8-54), a serine/glutamine-rich domain (hCREAP1 residues 289-559) and a negatively charged carboxyl terminus.

为了确定CREAP蛋白质是否能够作为转录激活子起作用，将全部3种CREAP同系物的不同区域(CREAP1的氨基酸300-650、CREAP2的氨基酸296-694和CREAP3的氨基酸335-635)与表达为GAL4DNA结合结构域表达为融合蛋白质并且测试了激活连接至GAL4蛋白质结合序列的报告基因(UAS-Luc(pFR-Luc reporter)表达的能力。简而言之，通过PCR扩增所指出的CREAP1、CREAP2和CREAP3区域并亚克隆入pCMV-BD载体(Stratagene)中编码GAL4DNA结合结构域的读框内。如上所述使用Fugene6转染试剂(Roche)以75ng/孔将所选出质粒和空载体(pCMV-SPORT6)转染入HEK 293细胞。用编码由连接至5联体GAL4结合位点(UAS)的最小启动子驱动的萤火虫萤光素酶基因的pFR-Lucreporter(Stratagene)以100ng/孔进行共转染。作为阳性对照，报告子还与编码GAL4-CREB融合蛋白质的质粒(Stratagene)单独或在存在编码蛋白激酶A催化亚基的表达构建体pFC-PKA(Stratagene)时进行共转染以激活CREB激酶可诱导的激活结构域。将诱导倍数与在仅用携带GAL4 DNA结合结构域的表达载体pCMV-BD转染的细胞中所测定的报告子活性进行比较。虽然3种全长CREAP蛋白质不显著影响报告子活性，但包含CREAP1-3羧基末端部分的融合蛋白质有效诱导UAS-Luc的表达，见表6。诱导倍数 STDEV 载体CREAP1CREAP2CREAP3GAL4-CREAP1.1GAL4-CREAP2.1GAL4-CREAP3-1GAL4-CREBGAL4-CREB/PKA 1.002.4675312.471.582692.601373.881364.177.66351.4352 0.211.4788080.410.74556.19222.52263.620.3411.52481 To determine whether CREAP proteins could function as transcriptional activators, different regions of all three CREAP homologues (amino acids 300-650 of CREAP1, amino acids 296-694 of CREAP2, and amino acids 335-635 of CREAP3) were bound to DNA expressed as GAL4 Domains were expressed as fusion proteins and tested for their ability to activate the expression of a reporter gene (UAS-Luc (pFR-Luc reporter) linked to the GAL4 protein binding sequence. Briefly, the indicated CREAP1, CREAP2 and CREAP3 region and subcloned into the pCMV-BD vector (Stratagene) in the reading frame encoding the GAL4 DNA binding domain. The selected plasmid and the empty vector (pCMV-SPORT6 ) into HEK 293 cells. Co-transfected with pFR-Lucreporter (Stratagene) encoding the firefly luciferase gene driven by a minimal promoter linked to the quintuplex GAL4 binding site (UAS) at 100 ng/well As a positive control, the reporter was also co-transfected with a plasmid encoding the GAL4-CREB fusion protein (Stratagene) alone or in the presence of the expression construct pFC-PKA (Stratagene) encoding the protein kinase A catalytic subunit to activate CREB kinase Inducible activation domain. The fold induction was compared with the reporter activity measured in cells transfected with only the expression vector pCMV-BD carrying the GAL4 DNA-binding domain. Although the three full-length CREAP proteins did not significantly affect reporter activity, but fusion proteins containing the carboxy-terminal part of CREAP1-3 effectively induced the expression of UAS-Luc, see Table 6. Induction multiple STDEV Vector CREAP1CREAP2CREAP3GAL4-CREAP1.1GAL4-CREAP2.1GAL4-CREAP3-1GAL4-CREBGAL4-CREB/PKA 1.002.4675312.471.582692.601373.881364.177.66351.4352 0.211.4788080.410.74556.19222.52263.620.3411.52481

表6：CREAP1蛋白质是转录激活子的证明。测试了编码全长CREAP1、CREAP2和CREAP3以及单独或与CREAP1、CREAP2和CREAP3C-末端部分融合的Gal4 DNA结合结构域的表达构建体诱导由连接至GAL4DNA结合位点的最小启动子所控制的萤光素酶基因(pFRLuciferase)表达的能力。将所显示的对用与pFR-Luc reporter共转染的pCMV-BD载体所得到的数值标准化。Table 6: Demonstration that the CREAP1 protein is an activator of transcription. Expression constructs encoding full-length CREAP1, CREAP2, and CREAP3 and the Gal4 DNA-binding domain alone or fused to C-terminal portions of CREAP1, CREAP2, and CREAP3 were tested to induce fluorescence controlled by a minimal promoter linked to the GAL4 DNA-binding site The ability to express the luciferase gene (pFRLuciferase). Values shown are normalized to those obtained with pCMV-BD vector co-transfected with pFR-Luc reporter.

为了确定CREAP蛋白质是否能够直接激活CREB1蛋白质，将CREAP1、CREAP2和CREAP3表达构建体单独或与GAL4-CREB质粒(Stratagene)一起转染入HLR细胞系(Stratagene)，该细胞系携带整合有pFR-Luc reporter拷贝的基因组DNA。简而言之，按照制造商的说明书保持HLR细胞。如上所述使用Fugene6转染试剂(Roche)以75ng/孔用所选出的质粒和空载体(pCMV-BD)或GAL4-CREB质粒进行转染。作为阳性对照，编码蛋白激酶A催化亚基的表达构建体pFC-PKA(Stratagene)也与GAL4-CREB一起共转染。将活化倍数与在用空载体转染的细胞中所测定的报告子活性进行比较。虽然3种全长CREAP蛋白质不显著影响报告子活性，但与3种全长CREAP共转染时GA4L-CREB融合蛋白质的活性上调，表明CREB和CREAP蛋白质相互作用形成活性转录复合体。见下面表7。诱导倍数 STDEV pCMV-SPORT6GAL4-CREBGAL4-CREB/PKACREAP1GAL4-CREB/CREAP1CREAP2GAL4-CREB/CREAP2CREAP3GAL4-CREB/CREAP3 175.54687245676.35317566.122284386233.14302922.435298629177.55398542.457796272447.635808 1.732050814.424213916.864978483.116913233.13457372.0595979323.06787722.4245262436.439389 To determine whether CREAP proteins can directly activate CREB1 proteins, the CREAP1, CREAP2, and CREAP3 expression constructs were transfected alone or together with the GAL4-CREB plasmid (Stratagene) into an HLR cell line (Stratagene) harboring integrated pFR-Luc Genomic DNA copied by reporter. Briefly, HLR cells were maintained according to the manufacturer's instructions. Transfections were performed with selected plasmids and empty vector (pCMV-BD) or GAL4-CREB plasmid at 75 ng/well using Fugene6 transfection reagent (Roche) as described above. As a positive control, the expression construct pFC-PKA (Stratagene) encoding the catalytic subunit of protein kinase A was also co-transfected with GAL4-CREB. Fold activation was compared to reporter activity determined in cells transfected with empty vector. Although the three full-length CREAP proteins did not significantly affect reporter activity, the activity of the GA4L-CREB fusion protein was upregulated when co-transfected with the three full-length CREAP proteins, suggesting that CREB and CREAP proteins interact to form an active transcriptional complex. See Table 7 below. Induction multiple STDEV pCMV-SPORT6GAL4-CREBGAL4-CREB/PKACREAP1GAL4-CREB/CREAP1CREAP2GAL4-CREB/CREAP2CREAP3GAL4-CREB/CREAP3 175.54687245676.35317566.122284386233.14302922.435298629177.55398542.457796272447.635808 1.732050814.424213916.864978483.116913233.13457372.0595979323.06787722.4245262436.439389

表7：CREAP1通过激活CREB起作用。测试了全长CREAP1、CREAP2和CREAP3诱导GAL4-CREB融合蛋白质(Stratagene)活性的能力。所给出的数据对使用pCMV-BD载体所得到的数值进行标准化。全部CREAP和PKA显著诱导GAL4-CREB介导的激活。注意到当与来自表6的数据进行比较时，使用阳性对照获得的诱导倍数较低，其中表6的数据是用全部包含报告子的质粒进行瞬时转染所获得的。Table 7: CREAP1 functions by activating CREB. The ability of full-length CREAP1, CREAP2 and CREAP3 to induce the activity of the GAL4-CREB fusion protein (Stratagene) was tested. The data presented are normalized to the values obtained using the pCMV-BD vector. All CREAP and PKA significantly induce GAL4-CREB-mediated activation. Note that the fold induction obtained with the positive control is lower when compared with the data from Table 6, which was obtained by transient transfection with all plasmids containing the reporter.

为了确定是否CREAP1能够直接与CREB相互作用，使用Fugene6试剂(Roche Applied Science)按照由制造商提供的方案将CREAP1的K1和K5变体(见表1)转染入生长于100mm平皿(Falcon)中的HEK293细胞。转染40小时后，从板上将细胞刮下置于PBS中并在800μl低严紧度缓冲液中进行裂解，所述缓冲液包含：10mM HEPES pH 7.6、250mMNaCl、5mM EDTA、1mM DTT、0.1％NP-40和新鲜溶解的蛋白酶抑制剂。使用M2-琼脂糖珠(Sigma)进行免疫沉淀。在4-20％的SDS-PAGE(Invitrogen)上分离所沉淀的蛋白质并转移至硝酸纤维素膜(Invitrogen)上。使用针对CREB的抗体(Cell Signaling Technology)进行蛋白质印迹。作为阴性对照使用了编码带FLAG-标记的人组蛋白脱乙酰基酶(HDAC1)的表达构建体。我们发现包含高度保守卷曲螺旋结构域的CREAP1 N-末端170个氨基酸的片段在体内与内源性CREB1结合。表1中显示的数据证明该区域对于CREAP-介导的CRE激活是绝对必需的。To determine whether CREAP1 can directly interact with CREB, the K1 and K5 variants of CREAP1 (see Table 1) were transfected into 100 mm dishes (Falcon) using Fugene6 reagent (Roche Applied Science) following the protocol provided by the manufacturer. HEK293 cells. 40 hours after transfection, cells were scraped from the plate, placed in PBS and lysed in 800 μl low stringency buffer containing: 10 mM HEPES pH 7.6, 250 mM NaCl, 5 mM EDTA, 1 mM DTT, 0.1% NP-40 and freshly dissolved protease inhibitors. Immunoprecipitation was performed using M2-Sepharose beads (Sigma). Precipitated proteins were separated on 4-20% SDS-PAGE (Invitrogen) and transferred to nitrocellulose membranes (Invitrogen). Western blotting was performed using an antibody against CREB (Cell Signaling Technology). An expression construct encoding FLAG-tagged human histone deacetylase (HDAC1) was used as a negative control. We found that a fragment of the N-terminal 170 amino acids of CREAP1 containing a highly conserved coiled-coil domain binds endogenous CREB1 in vivo. The data shown in Table 1 demonstrate that this region is absolutely required for CREAP-mediated CRE activation.

除了最近鉴定的LIM-only蛋白质家族以外(Fimia，G.等2000，MolCellBiol 20，8613-8622)，CREAP家族可以代表CREB共激活子进化保守的分枝。有趣地是，虽然LIM-only蛋白质与CREM——一种已知的CRE抑制剂结合并提供不依赖磷酸化作用和CBP的激活功能，但我们的数据表明CREAP可以与结合至规范CRE位点的CREB1和结合至不被CREB1所识别的CRE-样元件的CREB2相互作用并因此激活不同靶基因库的表达。而且，CREAP1似乎允许表面上结合至CRE和AP-1结合位点的蛋白质之间协同作用。对CREAP1作用的阐明将解释控制对CREB激活子组织选择性反应的机制。In addition to the recently identified LIM-only protein family (Fimia, G. et al. 2000, Mol Cell Biol 20, 8613-8622), the CREAP family may represent an evolutionarily conserved branch of CREB coactivators. Interestingly, while LIM-only proteins bind CREM, a known CRE inhibitor, and provide phosphorylation- and CBP-independent activation functions, our data suggest that CREAP can bind to canonical CRE sites. CREB1 and CREB2 that bind to CRE-like elements not recognized by CREB1 interact and thus activate the expression of diverse target gene repertoires. Furthermore, CREAP1 appears to allow for synergy between proteins that surface bind to the CRE and AP-1 binding sites. Elucidation of the role of CREAP1 will explain the mechanisms governing tissue-selective responses to CREB activators.

这里描述的实验提出了明显的问题，即CRE-样位点在调节疾病期间IL-8表达中的重要性。虽然先前证明没有CRE或CRE-样位点存在于IL-8启动子，但β₂-肾上腺素能受体激动剂(β₂-AR)(其发挥作用而增加细胞内cAMP水平)在呼吸道平滑肌细胞中诱导IL-8分泌(Kavelaars A.等J.Neuroimmunol.1997Aug；77(2)：211-6)。这尤其重要，因为使用β₂-AR激动剂作为支气管扩张剂能够加重气喘并应该与抗炎类固醇结合使用(Cockcroft，D.等，1993；Lancet 342：833-837；Knox，A.J 2002；Curr.Pharm Des.1863-1869；Vathenen等，1988Lancet 1：554-558))。所给出的数据表明该作用可能直接由于通过CRE-样位点并且也许是CREAP1对IL-8转录的激活作用。The experiments described here raise obvious questions about the importance of CRE-like loci in regulating IL-8 expression during disease. Although previously demonstrated that no CRE or CRE-like sites exist on the IL-8 promoter, β ₂ -adrenergic receptor agonists (β ₂ -AR), which act to increase intracellular cAMP IL-8 secretion is induced in cells (Kavelaars A. et al. J. Neuroimmunol. 1997 Aug;77(2):211-6). This is especially important since the use of _β2 -AR agonists as bronchodilators can exacerbate asthma and should be combined with anti-inflammatory steroids (Cockcroft, D. et al., 1993; Lancet 342:833-837; Knox, AJ 2002; Curr. Pharm Des. 1863-1869; Vathenen et al., 1988 Lancet 1:554-558)). The data presented suggest that this effect may be directly due to the activation of IL-8 transcription via CRE-like sites and perhaps CREAP1.

序列表Sequence Listing

<110>诺瓦提斯公司<110> Novatis Corporation

<120>环AMP效应元件激活蛋白及其相关用途<120> Cyclic AMP Response Element Activation Protein and Related Uses

<130>4-32999P2<130>4-32999P2

<150>60/463,934<150>60/463,934

<151>2003-04-18<151>2003-04-18

<160>39<160>39

<170>FastSEQ for Windows版本4.0<170>FastSEQ for Windows version 4.0

<210>1<210>1

<211>2878<211>2878

<212>DNA<212>DNA

<213>人<213> people

<400>1<400>1

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc 60ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc 60

gtttagtgaa ccgtcagatc gcctggagac gccatccacg ctgttttgac ctccatagaa 120gtttagtgaa ccgtcagatc gcctggagac gccatccacg ctgttttgac ctccatagaa 120

gacaccggga ccgatccagc ctccggactc tagcctaggc cgcgggacgg ataacaattt 180gacaccggga ccgatccagc ctccggactc tagcctaggc cgcgggacgg ataacaattt 180

cacacaggaa acagctatga ccattaggcc tatttaggtg acactataga acaagtttgt 240cacacaggaa acagctatga ccattaggcc tattaggtg acactataga acaagtttgt 240

acaaaaaagc aggctggtac cggtccggaa ttcccgggag gaggaggagg tggcggcgag 300acaaaaaagc aggctggtac cggtccggaa ttcccggggag gaggaggagg tggcggcgag 300

aagatggcga cttcgaacaa tccgcggaaa ttcagcgaga agatcgcgct gcacaatcag 360aagatggcga cttcgaacaa tccgcggaaa ttcagcgaga agatcgcgct gcacaatcag 360

aagcaggcgg aggagacggc ggccttcgag gaggtcatga aggacctgag cctgacgcgg 420aagcaggcgg aggagacggc ggccttcgag gaggtcatga aggacctgag cctgacgcgg 420

gccgcgcggc tccagctcca gaaatcccag tacctgcaac tgggccccag ccgaggccag 480gccgcgcggc tccagctcca gaaatccccag tacctgcaac tgggccccag ccgaggccag 480

tactatggcg ggtccctgcc caacgtgaac cagatcggga gtggcaccat ggacctgccc 540tactatggcg ggtccctgcc caacgtgaac cagatcggga gtggcaccat ggacctgccc 540

ttccagccca gcggatttct gggggaggccc tggcagcgg ctcctgtctc tctgaccccc 600ttccagccca gcggatttct gggggaggccc tggcagcgg ctcctgtctc tctgacccccc 600

ttccaatcct cgggcctgga caccagccgg accacccggc accatgggct ggtggacagg 660ttccaatcct cgggcctgga caccagccgg accacccggc accatgggct ggtggacagg 660

gtgtaccggg agcgtggccg gctcggctcc ccacaccgcc ggcccctgtc agtggacaaa 720gtgtaccggg agcgtggccg gctcggctcc ccacaccgcc ggcccctgtc agtggacaaa 720

cacggacggc aggccgacag ctgcccctat ggcaccatgt acctctcacc acccgcggac 780cacggacggc aggccgacag ctgcccctat ggcaccatgt acctctcacc acccgcggac 780

accagctgga gaaggaccaa ttctgactcc gccctgcacc agagcacaat gacgcccacg 840accagctgga gaaggaccaa ttctgactcc gccctgcacc agagcacaat gacgcccacg 840

cagccagaat cctttagcag tgggtcccag gacgtgcacc agaaaagagt cttactgtta 900cagccagaat cctttagcag tgggtcccag gacgtgcacc agaaaagagt cttactgtta 900

acagtcccag gaatggaaga gaccacatca gaggcagaca aaaacctttc caagcaagca 960acagtccccag gaatggaaga gaccacatca gaggcagaca aaaacctttc caagcaagca 960

tgggacacca agaagacggg gtccaggccc aagtcctgtg aggtccccgg aatcaacatc 1020tgggacacca agaagacggg gtccaggccc aagtcctgtg aggtccccgg aatcaacatc 1020

ttcccgtctg ccgaccagga aaacactaca gccctgatcc ccgccaccca caacacaggg 1080ttcccgtctg ccgaccagga aaacactaca gccctgatcc ccgccaccca caacacaggg 1080

gggtccctgc ccgacctgac caacatccac ttcccctccc cgctcccgac cccgctggac 1140gggtccctgc ccgacctgac caacatccac ttcccctccc cgctcccgac cccgctggac 1140

cccgaggagc ccaccttccc tgcactgagc agctccagca gcaccggcaa cctcgcggcc 1200cccgaggagc ccaccttccc tgcactgagc agctccagca gcaccggcaa cctcgcggcc 1200

aacctgacgc acctgggcat cggtggcgcc ggccagggaa tgagcacacc tggctcctct 1260aacctgacgc acctgggcat cggtggcgcc ggccagggaa tgagcacacc tggctcctct 1260

ccacagcacc gcccagctgg cgtcagcccc ctgtccctga gcacagaggc aaggcgtcag 1320ccacagcacc gcccagctgg cgtcagcccc ctgtccctga gcacagaggc aaggcgtcag 1320

caggcatcgc ccaccctgtc cccgctgtca cccatcactc aggctgtagc catggacgcc 1380caggcatcgc ccaccctgtc cccgctgtca cccatcactc aggctgtagc catggacgcc 1380

ctgtctctgg agcagcagct gccctacgcc ttcttcaccc aggcgggctc ccagcagcca 1440ctgtctctgg agcagcagct gccctacgcc ttcttcaccc aggcgggctc ccagcagcca 1440

ccgccgcagc cccagccccc gccgcctcct ccacccgcgt cccagcagcc accacccccg 1500ccgccgcagc cccagccccc gccgcctcct ccacccgcgt cccagcagcc accacccccg 1500

ccacccccac aggcgcccgt ccgcctgccc cctggtggcc ccctgttgcc cagcgccagc 1560ccaccccccac aggcgcccgt ccgcctgccc cctggtggcc ccctgttgcc cagcgccagc 1560

ctgactcgtg ggccacagcc gcccccgctt gcagtcacgg taccgtcctc tctcccccag 1620ctgactcgtg ggccacagcc gcccccgctt gcagtcacgg taccgtcctc tctcccccag 1620

tcccccccag agaaccctgg ccagccatcg atggggatcg acatcgcctc ggcgccggct 1680tcccccccag agaaccctgg ccagccatcg atggggatcg acatcgcctc ggcgccggct 1680

ctgcagcagt accgcactag cgccggctcc ccggccaacc agtctcccac ctcgccagtc 1740ctgcagcagt accgcactag cgccggctcc ccggccaacc agtctcccac ctcgccagtc 1740

tccaatcaag gcttctcccc agggagctcc ccgcaacaca cttccaccct gggcagcgtg 1800tccaatcaag gcttctcccc agggagctcc ccgcaacaca cttccaccct gggcagcgtg 1800

tttggggacg cgtactatga gcagcagatg gcggccaggc aggccaatgc tctgtcccac 1860tttggggacg cgtactatga gcagcagatg gcggccaggc aggccaatgc tctgtccccac 1860

cagctggagc agttcaacat gatggagaac gccatcagct ccagcagcct gtacagcccg 1920cagctggagc agttcaacat gatggagaac gccatcagct ccagcagcct gtacagcccg 1920

ggctccacac tcaactactc gcaggcggcc atgatgggcc tcacgggcag ccacgggagc 1980ggctccacac tcaactactc gcaggcggcc atgatgggcc tcacgggcag ccacgggagc 1980

ctgccggact cgcagcaact gggatacgcc agccacagtg gcatccccaa catcatcctc 2040ctgccggact cgcagcaact gggatacgcc agccacagtg gcatccccaa catcatcctc 2040

acagtgacag gagagtcccc ccccagcctc tctaaagaac tgaccagctc tctggccggg 2100acagtgacag gagagtcccc ccccagcctc tctaaagaac tgaccagctc tctggccggg 2100

gtcggcgacg tcagcttcga ctccgacagc cagtttcccc tggacgaact caagatcgac 2160gtcggcgacg tcagcttcga ctccgacagc cagtttcccc tggacgaact caagatcgac 2160

cccctgaccc tcgacggact gcacatgctc aacgaccccg acatggttct ggccgaccca 2220cccctgaccc tcgacggact gcacatgctc aacgaccccg acatggttct ggccgaccca 2220

gccaccgagg acaccttccg gatggaccgc ctgtgagcgg gcacgccggc accctgccgc 2280gccaccgagg acaccttccg gatggaccgc ctgtgagcgg gcacgccggc accctgccgc 2280

tcagccgtcc cgacggcgcc tccccagccc ggggacggcc gtgctccgtc cctcgccaac 2340tcagccgtcc cgacggcgcc tccccagccc ggggacggcc gtgctccgtc cctcgccaac 2340

ggccgagctt gtgattctga gcttgcaatg ccgccaagcg ccccccgcca gcccgccccc 2400ggccgagctt gtgattctga gcttgcaatg ccgccaagcg ccccccgcca gcccgccccc 2400

ggttgtccac ctcccgcgaa gcccaatcgc gaggccgcga gccgggccgt ccacccaccc 2460ggttgtccac ctcccgcgaa gcccaatcgc gaggccgcga gccgggccgt ccaccccaccc 2460

gcccgcccag ggctgggctg ggatcggagg ccgtgagcct cccgcccctg cagaccctcc 2520gcccgcccag ggctgggctg ggatcggagg ccgtgagcct cccgcccctg cagaccctcc 2520

ctgcactggc tccctcgccc ccagccccgg ggcctgagcc gtcccctgta agatgcggga 2580ctgcactggc tccctcgccc ccagccccgg ggcctgagcc gtcccctgta agatgcggga 2580

agtgtcagct cccggcgtgg cgggcaggct caggggaggg gcgcgcatgg tccgccaggg 2640agtgtcagct cccggcgtgg cgggcaggct caggggaggg gcgcgcatgg tccgccaggg 2640

ctgtgggccg tggcgcattt tccgactgtt tgtccagctc tcactgcctt ccttggttcc 2700ctgtgggccg tggcgcattt tccgactgtt tgtccagctc tcactgcctt ccttggttcc 2700

cggtccccca gcccatccgc catccccagc ccgtggtcag gtagagagtg agccccacgc 2760cggtccccca gcccatccgc catccccagc ccgtggtcag gtagagagtg agccccacgc 2760

cgccccaggg aggaggcgcc agagcgcggg gcagacgcaa agtgaaataa acactatttt 2820cgccccaggg aggaggcgcc agagcgcggg gcagacgcaa agtgaaataa acactatttt 2820

gacggcaaaa aaaaaaaaaa agggcggccg ctctagagta tccctcgagg ggcccaag 2878gacggcaaaa aaaaaaaaaa agggcggccg ctctagagta tccctcgagg ggcccaag 2878

<210>2<210>2

<211>650<211>650

<212>PRT<212>PRT

<213>人<213> people

<400>2<400>2

Met Ala Thr Ser Asn Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala LeuMet Ala Thr Ser Asn Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala Leu

1 5 10 151 5 10 15

His Asn Gln Lys Gln Ala Glu Glu Thr Ala Ala Phe Glu Glu Val MetHis Asn Gln Lys Gln Ala Glu Glu Thr Ala Ala Phe Glu Glu Val Met

20 25 3020 25 30

Lys Asp Leu Ser Leu Thr Arg Ala Ala Arg Leu Gln Leu Gln Lys SerLys Asp Leu Ser Leu Thr Arg Ala Ala Arg Leu Gln Leu Gln Lys Ser

35 40 4535 40 45

Gln Tyr Leu Gln Leu Gly Pro Ser Arg Gly Gln Tyr Tyr Gly Gly SerGln Tyr Leu Gln Leu Gly Pro Ser Arg Gly Gln Tyr Tyr Gly Gly Ser

50 55 6050 55 60

Leu Pro Asn Val Asn Gln Ile Gly Ser Gly Thr Met Asp Leu Pro PheLeu Pro Asn Val Asn Gln Ile Gly Ser Gly Thr Met Asp Leu Pro Phe

65 70 75 8065 70 75 80

Gln Pro Ser Gly Phe Leu Gly Glu Ala Leu Ala Ala Ala Pro Val SerGln Pro Ser Gly Phe Leu Gly Glu Ala Leu Ala Ala Ala Pro Val Ser

85 90 9585 90 95

Leu Thr Pro Phe Gln Ser Ser Gly Leu Asp Thr Ser Arg Thr Thr ArgLeu Thr Pro Phe Gln Ser Ser Gly Leu Asp Thr Ser Arg Thr Thr Arg

100 105 110100 105 110

His His Gly Leu Val Asp Arg Val Tyr Arg Glu Arg Gly Arg Leu GlyHis His Gly Leu Val Asp Arg Val Tyr Arg Glu Arg Gly Arg Leu Gly

115 120 125115 120 125

Ser Pro His Arg Arg Pro Leu Ser Val Asp Lys His Gly Arg Gln AlaSer Pro His Arg Arg Pro Leu Ser Val Asp Lys His Gly Arg Gln Ala

130 135 140130 135 140

Asp Ser Cys Pro Tyr Gly Thr Met Tyr Leu Ser Pro Pro Ala Asp ThrAsp Ser Cys Pro Tyr Gly Thr Met Tyr Leu Ser Pro Pro Ala Asp Thr

145 150 155 160145 150 155 160

Ser Trp Arg Arg Thr Asn Ser Asp Ser Ala Leu His Gln Ser Thr MetSer Trp Arg Arg Thr Asn Ser Asp Ser Ala Leu His Gln Ser Thr Met

165 170 175165 170 175

Thr Pro Thr Gln Pro Glu Ser Phe Ser Ser Gly Ser Gln Asp Val HisThr Pro Thr Gln Pro Glu Ser Phe Ser Ser Ser Gly Ser Gln Asp Val His

180 185 190180 185 190

Gln Lys Arg Val Leu Leu Leu Thr Val Pro Gly Met Glu Glu Thr ThrGln Lys Arg Val Leu Leu Leu Thr Val Pro Gly Met Glu Glu Thr Thr

195 200 205195 200 205

Ser Glu Ala Asp Lys Asn Leu Ser Lys Gln Ala Trp Asp Thr Lys LysSer Glu Ala Asp Lys Asn Leu Ser Lys Gln Ala Trp Asp Thr Lys Lys

210 215 220210 215 220

Thr Gly Ser Arg Pro Lys Ser Cys Glu Val Pro Gly Ile Asn Ile PheThr Gly Ser Arg Pro Lys Ser Cys Glu Val Pro Gly Ile Asn Ile Phe

225 230 235 240225 230 235 240

Pro Ser Ala Asp Gln Glu Asn Thr Thr Ala Leu Ile Pro Ala Thr HisPro Ser Ala Asp Gln Glu Asn Thr Thr Ala Leu Ile Pro Ala Thr His

245 250 255245 250 255

Asn Thr Gly Gly Ser Leu Pro Asp Leu Thr Asn Ile His Phe Pro SerAsn Thr Gly Gly Ser Leu Pro Asp Leu Thr Asn Ile His Phe Pro Ser

260 265 270260 265 270

Pro Leu Pro Thr Pro Leu Asp Pro Glu Glu Pro Thr Phe Pro Ala LeuPro Leu Pro Thr Pro Leu Asp Pro Glu Glu Pro Thr Phe Pro Ala Leu

275 280 285275 280 285

Ser Ser Ser Ser Ser Thr Gly Asn Leu Ala Ala Asn Leu Thr His LeuSer Ser Ser Ser Ser Ser Thr Gly Asn Leu Ala Ala Asn Leu Thr His Leu

290 295 300290 295 300

Gly Ile Gly Gly Ala Gly Gln Gly Met Ser Thr Pro Gly Ser Ser ProGly Ile Gly Gly Ala Gly Gln Gly Met Ser Thr Pro Gly Ser Ser Pro

305 310 315 320305 310 315 320

Gln His Arg Pro Ala Gly Val Ser Pro Leu Ser Leu Ser Thr Glu AlaGln His Arg Pro Ala Gly Val Ser Pro Leu Ser Leu Ser Thr Glu Ala

325 330 335325 330 335

Arg Arg Gln Gln Ala Ser Pro Thr Leu Ser Pro Leu Ser Pro Ile ThrArg Arg Gln Gln Ala Ser Pro Thr Leu Ser Pro Leu Ser Pro Ile Thr

340 345 350340 345 350

Gln Ala Val Ala Met Asp Ala Leu Ser Leu Glu Gln Gln Leu Pro TyrGln Ala Val Ala Met Asp Ala Leu Ser Leu Glu Gln Gln Leu Pro Tyr

355 360 365355 360 365

Ala Phe Phe Thr Gln Ala Gly Ser Gln Gln Pro Pro Pro Gln Pro GlnAla Phe Phe Thr Gln Ala Gly Ser Gln Gln Pro Pro Pro Pro Gln Pro Gln

370 375 380370 375 380

Pro Pro Pro Pro Pro Pro Pro Ala Ser Gln Gln Pro Pro Pro Pro ProPro Pro Pro Pro Pro Pro Pro Pro Ala Ser Gln Gln Pro Pro Pro Pro Pro Pro

385 390 395 400385 390 395 400

Pro Pro Gln Ala Pro Val Arg Leu Pro Pro Gly Gly Pro Leu Leu ProPro Pro Gln Ala Pro Val Arg Leu Pro Pro Gly Gly Pro Leu Leu Pro

405 410 415405 410 415

Ser Ala Ser Leu Thr Arg Gly Pro Gln Pro Pro Pro Leu Ala Val ThrSer Ala Ser Leu Thr Arg Gly Pro Gln Pro Pro Pro Leu Ala Val Thr

420 425 430420 425 430

Val Pro Ser Ser Leu Pro Gln Ser Pro Pro Glu Asn Pro Gly Gln ProVal Pro Ser Ser Leu Pro Gln Ser Pro Pro Glu Asn Pro Gly Gln Pro

435 440 445435 440 445

Ser Met Gly Ile Asp Ile Ala Ser Ala Pro Ala Leu Gln Gln Tyr ArgSer Met Gly Ile Asp Ile Ala Ser Ala Pro Ala Leu Gln Gln Tyr Arg

450 455 460450 455 460

Thr Ser Ala Gly Ser Pro Ala Asn Gln Ser Pro Thr Ser Pro Val SerThr Ser Ala Gly Ser Pro Ala Asn Gln Ser Pro Thr Ser Pro Val Ser

465 470 475 480465 470 475 480

Asn Gln Gly Phe Ser Pro Gly Ser Ser Pro Gln His Thr Ser Thr LeuAsn Gln Gly Phe Ser Pro Gly Ser Ser Pro Gln His Thr Ser Thr Leu

485 490 495485 490 495

Gly Ser Val Phe Gly Asp Ala Tyr Tyr Glu Gln Gln Met Ala Ala ArgGly Ser Val Phe Gly Asp Ala Tyr Tyr Glu Gln Gln Met Ala Ala Arg

500 505 510500 505 510

Gln Ala Asn Ala Leu Ser His Gln Leu Glu Gln Phe Asn Met Met GluGln Ala Asn Ala Leu Ser His Gln Leu Glu Gln Phe Asn Met Met Glu

515 520 525515 520 525

Asn Ala Ile Ser Ser Ser Ser Leu Tyr Ser Pro Gly Ser Thr Leu AsnAsn Ala Ile Ser Ser Ser Ser Ser Leu Tyr Ser Pro Gly Ser Thr Leu Asn

530 535 540530 535 540

Tyr Ser Gln Ala Ala Met Met Gly Leu Thr Gly Ser His Gly Ser LeuTyr Ser Gln Ala Ala Met Met Gly Leu Thr Gly Ser His Gly Ser Leu

545 550 555 560545 550 555 560

Pro Asp Ser Gln Gln Leu Gly Tyr Ala Ser His Ser Gly Ile Pro AsnPro Asp Ser Gln Gln Leu Gly Tyr Ala Ser His Ser Gly Ile Pro Asn

565 570 575565 570 575

Ile Ile Leu Thr Val Thr Gly Glu Ser Pro Pro Ser Leu Ser Lys GluIle Ile Leu Thr Val Thr Gly Glu Ser Pro Pro Ser Leu Ser Lys Glu

580 585 590580 585 590

Leu Thr Ser Ser Leu Ala Gly Val Gly Asp Val Ser Phe Asp Ser AspLeu Thr Ser Ser Leu Ala Gly Val Gly Asp Val Ser Phe Asp Ser Asp

595 600 605595 600 605

Ser Gln Phe Pro Leu Asp Glu Leu Lys Ile Asp Pro Leu Thr Leu AspSer Gln Phe Pro Leu Asp Glu Leu Lys Ile Asp Pro Leu Thr Leu Asp

610 615 620610 615 620

Gly Leu His Met Leu Asn Asp Pro Asp Met Val Leu Ala Asp Pro AlaGly Leu His Met Leu Asn Asp Pro Asp Met Val Leu Ala Asp Pro Ala

625 630 635 640625 630 635 640

Thr Glu Asp Thr Phe Arg Met Asp Arg LeuThr Glu Asp Thr Phe Arg Met Asp Arg Leu

645 650645 650

<210>3<210>3

<211>32<211>32

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>3<400>3

gcccaagctt tgtgctctgc tgtctctgaa ag 32gcccaagctt tgtgctctgc tgtctctgaa ag 32

<210>4<210>4

<211>23<211>23

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>4<400>4

gccctgaggg gatgggccat cag 23gccctgaggg gatgggccat cag 23

<210>5<210>5

<211>36<211>36

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>5<400>5

cgcggatccg aagtgtgatg actcaggttt gccctg 36cgcggatccg aagtgtgatg actcaggttt gccctg 36

<210>6<210>6

<211>36<211>36

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>6<400>6

cgcggatccg aagtgtgata tctcaggttt gccctg 36cgcggatccg aagtgtgata tctcaggttt gccctg 36

<210>7<210>7

<211>54<211>54

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>7<400>7

gccctgaggg gatgggccat cagttgcaaa tcgttaactt tcctctgaca taat 54gccctgaggg gatgggccat cagttgcaaa tcgttaactt tcctctgaca taat 54

<210>8<210>8

<211>39<211>39

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>8<400>8

gccctgaggg gatgggccat cagctacgag tcgtggaat 39gccctgaggg gatgggccat cagctacgag tcgtggaat 39

<210>9<210>9

<211>47<211>47

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>9<400>9

cgcggatccg aagtgtgatg actcaggttt gccctgaggg gatgggc 47cgcggatccg aagtgtgatg actcaggttt gccctgaggg gatgggc 47

<210>10<210>10

<211>43<211>43

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>10<400>10

cagttgcaaa tcgtggaatt tcctctcgat caatgaaaag atg 43cagttgcaaa tcgtggaatt tcctctcgat caatgaaaag atg 43

<210>11<210>11

<211>39<211>39

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>11<400>11

gccctgaggg gatgggccat cagttgcaaa tcgtggaat 39gccctgaggg gatgggccat cagttgcaaa tcgtggaat 39

<210>12<210>12

<211>19<211>19

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>12<400>12

cgcctggtac cgagctctg 19cgcctggtac cgagctctg 19

<210>13<210>13

<211>19<211>19

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> primer

<400>13<400>13

acccaagatc tcgagcccg 19acccaagatc tcgagcccg 19

<210>14<210>14

<211>99<211>99

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>14<400>14

cgcctggtac cgagctctga cataatgaca taatgacata atgacataat gacataatga 60cgcctggtac cgagctctga cataatgaca taatgacata atgacataat gacataatga 60

cataattacg cgtgctagcc cgggctcgag atcttgggt 99cataattacg cgtgctagcc cgggctcgag atcttgggt 99

<210>15<210>15

<211>2520<211>2520

<212>DNA<212>DNA

<213>人<213> people

<220><220>

<221>修饰碱基<221> Modified bases

<222>(0)...(0)<222>(0)...(0)

<221>misc feature<221>misc feature

<222>2，13，2431，2453，2465，2468，2469，2479，2488，2489，2492，<222> 2, 13, 2431, 2453, 2465, 2468, 2469, 2479, 2488, 2489, 2492,

2505，2512，2514，2519，25202505, 2512, 2514, 2519, 2520

<223>n＝A，T，C或G<223>n=A, T, C or G

<400>15<400>15

antttttgta canaaaagca ggctgttacc ggtccggatt cccgggatct aggctggggc 60antttttgta canaaaagca ggctgttacc ggtccggatt cccgggatct aggctggggc 60

cgggttcgcg gtgctcgctg aggcggcggt ggctacggct ggaggagccg ggccgaggcc 120cgggttcgcg gtgctcgctg aggcggcggt ggctacggct ggaggagccg ggccgaggcc 120

gcggcggagg ccgcggctgg tactgggagg gtggcaggga gggacgggga aggaagatgg 180gcggcgggagg ccgcggctgg tactgggagg gtggcaggga gggacggggga aggaagatgg 180

cgacgtcggg ggcgaacggg cctggttcgg ccacggcctc ggcttccaat ccgcgcaaat 240cgacgtcggg ggcgaacggg cctggttcgg ccacggcctc ggcttccaat ccgcgcaaat 240

ttagtgagaa gattgcgctg cagaagcagc gtcaggccga ggagacggcg gccttcgagg 300ttagtgagaa gattgcgctg cagaagcagc gtcaggccga ggagacggcg gccttcgagg 300

aggtgatgat ggacatcggc tccacccggt tacaggccca aaaactgcga ctggcataca 360aggtgatgat ggacatcggc tccacccggt tacaggccca aaaactgcga ctggcataca 360

caaggagctc tcattatggt gggtctctgc ccaatgttaa ccagattggc tctggcctgg 420caaggagctc tcattatggt gggtctctgc ccaatgttaa ccagattggc tctggcctgg 420

ccgagttcca gagccccctc cactcacctt tggattcatc tcggagcact cggcaccatg 480ccgagttcca gagccccctc cactcacctt tggattcatc tcggagcact cggcaccatg 480

ggctggtgga acgggtgcag cgagatcctc gaagaatggt gtccccactt cgccgataca 540ggctggtgga acgggtgcag cgagatcctc gaagaatggt gtccccactt cgccgataca 540

cccgccacat tgacagctct ccctatagtc ctgcctactt atctcctccc ccagagtcta 600cccgccacat tgacagctct ccctatagtc ctgcctactt atctcctccc ccagagtcta 600

gctggcgaag gacgatggcc tggggcaatt tccctgcaga gaaggggcag ttgtttcgac 660gctggcgaag gacgatggcc tggggcaatt tccctgcaga gaaggggcag ttgtttcgac 660

taccatctgc acttaacagg acaagctctg actctgccct tcatacaagt gtgatgaacc 720taccatctgc acttaacagg acaagctctg actctgccct tcatacaagt gtgatgaacc 720

ccagtcccca ggatacctac ccaggcccca cacctcccag catcctgccc agccgacgtg 780ccagtcccca ggatacctac ccaggcccca cacctcccag catcctgccc agccgacgtg 780

ggggtattct ggatggtgaa atggacccca aagtacctgc tattgaggag aacttgctag 840ggggtattct ggatggtgaa atggaccccca aagtacctgc tattgaggag aacttgctag 840

atgacaagca tttgctgaag ccatgggatg ctaagaagct atcctcatcc tcttcccgac 900atgacaagca tttgctgaag ccatgggatg ctaagaagct atcctcatcc tcttcccgac 900

ctcggtcctg tgaagtccct ggaattaaca tctttccatc tcctgaccag cctgccaatg 960ctcggtcctg tgaagtccct ggaattaaca tctttccatc tcctgaccag cctgccaatg 960

tgcctgtcct cccacctgcc atgaacacgg ggggctccct acctgacctc accaacctgc 1020tgcctgtcct cccacctgcc atgaacacgg ggggctccct acctgacctc accaacctgc 1020

actttccccc accactgccc acccccctgg accctgaaga gacagcctac cctagcctga 1080actttccccc accactgccc accccctgg accctgaaga gacagcctac cctagcctga 1080

gtgggggcaa cagtacctcc aatttgaccc acaccatgac tcacctgggc atcagcaggg 1140gtgggggcaa cagtacctcc aatttgaccc acaccatgac tcacctgggc atcagcaggg 1140

ggcatgggcc tgggcccggc tatgatgcac caggacttca ttcacctctc agccacccat 1200ggcatgggcc tgggcccggc tatgatgcac caggacttca ttcacctctc agccacccat 1200

ccctgcagtc ctccctaagc aatcccaacc tccaggcttc cctgagcagt cctcagcccc 1260ccctgcagtc ctccctaagc aatcccaacc tccaggcttc cctgagcagt cctcagcccc 1260

agcttcaggg ctcccacagc cacccctctc tgcctgcctc ctccttggcc tgccatgtac 1320agcttcaggg ctcccacagc cacccctctc tgcctgcctc ctccttggcc tgccatgtac 1320

tgcccaccac ctccctgggc cacccctcac tcagtgctcc ggctctctcc tcctcctctt 1380tgcccaccac ctccctgggc cacccctcac tcagtgctcc ggctctctcc tcctcctctt 1380

cctcctcctc cacttcatct cctgttttgg gcgccccctc ttaccctgct tctacccctg 1440cctcctcctc cacttcatct cctgttttgg gcgccccctc ttaccctgct tctacccctg 1440

gggcctcccc ccaccaccgc cgtgtgcccc tcagccccct gagtttgctc gcgggcccag 1500gggcctcccc ccaccaccgc cgtgtgcccc tcagccccct gagtttgctc gcgggcccag 1500

ccgacgccag aaggtcccaa cagcagctgc ccaaacagtt ttcgccaaca atgtcaccca 1560ccgacgccag aaggtcccaa cagcagctgc ccaaacagtt ttcgccaaca atgtcaccca 1560

ccttgtcttc catcactcag ggcgtccccc tggataccag taaactgtcc actgaccagc 1620ccttgtcttc catcactcag ggcgtccccc tggataccag taaactgtcc actgaccagc 1620

ggttaccccc ctacccatac agctccccaa gtctggttct gcctacccag ccccacaccc 1680ggttacccccc ctacccatac agctccccaa gtctggttct gcctacccag ccccacaccc 1680

caaagtctct acagcagcca gggctgccct ctcagtcttg ttcagtgcag tcctcaggtg 1740caaagtctct acagcagcca gggctgccct ctcagtcttg ttcagtgcag tcctcaggtg 1740

ggcagccccc aggcaggcag tctcattatg ggacaccgta cccacctggg cccagtgggc 1800ggcagccccc aggcaggcag tctcattatg ggacaccgta cccacctggg cccagtgggc 1800

atgggcaaca gtcttaccac cggccaatga gtgacttcaa cctggggaat ctggagcagt 1860atgggcaaca gtcttaccac cggccaatga gtgacttcaa cctggggaat ctggagcagt 1860

tcagcatgga gagcccatca gccagcctgg tgctggatcc ccctggcttt tctgaagggc 1920tcagcatgga gagcccatca gccagcctgg tgctggatcc ccctggcttt tctgaagggc 1920

ctggattttt agggggtgag gggccaatgg gtggccccca ggatccccac accttcaacc 1980ctggattttt agggggtgag gggccaatgg gtggccccca ggatccccac accttcaacc 1980

accagaactt gacccactgt tcccgccatg gctcagggcc taacatcatc ctcacagggg 2040accagaactt gacccactgt tcccgccatg gctcagggcc taacatcatc ctcacagggg 2040

actcctctcc aggtttctct aaggagattg cagcagccct ggccggagtg cctggctttg 2100actcctctcc aggtttctct aaggagattg cagcagccct ggccggagtg cctggctttg 2100

aggtgtcagc agctggattg gagctagggc ttgggctaga agatgagctg cgcatggagc 2160aggtgtcagc agctggattg gagctagggc ttgggctaga agatgagctg cgcatggagc 2160

cactgggcct ggaagggcta aacatgctga gtgacccctg tgccctgctg cctgatcctg 2220cactgggcct ggaagggcta aacatgctga gtgacccctg tgccctgctg cctgatcctg 2220

ctgtggagga gtcattccgc agtgaccggc tccaatgagg gcacctcatc accatccctc 2280ctgtggagga gtcattccgc agtgaccggc tccaatgagg gcacctcatc accatccctc 2280

ttcttggccc catcccccac caccattcct ttcctccctt ccccctggca ggtagagact 2340ttcttggccc catcccccac caccatcct ttcctccctt ccccctggca ggtagagact 2340

ctactctctg tccccagatc ctctttctag catgaatgaa ggatgccaag aatgagaaaa 2400ctactctctg tccccagatc ctctttctag catgaatgaa ggatgccaag aatgagaaaa 2400

agcaaggggt ttgtccaggt ggcccctgaa ntctgcgcaa gggatgggcc tgnggggaac 2460agcaaggggt ttgtccaggt ggcccctgaa ntctgcgcaa gggatggggcc tgggggaac 2460

ctcanggnna gggcccaang gccacttnna anctttgaac cgtcngtctg gnanggtcnn 2520ctcanggnna gggcccaang gccacttnna anctttgaac cgtcngtctg gnagggtcnn 2520

<210>16<210>16

<211>693<211>693

<212>PRT<212>PRT

<213>人<213> people

<400>16<400>16

Met Ala Thr Ser Gly Ala Asn Gly Pro Gly Ser Ala Thr Ala Ser AlaMet Ala Thr Ser Gly Ala Asn Gly Pro Gly Ser Ala Thr Ala Ser Ala

1 5 10 151 5 10 15

Ser Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala Leu Gln Lys Gln ArgSer Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala Leu Gln Lys Gln Arg

20 25 3020 25 30

Gln Ala Glu Glu Thr Ala Ala Phe Glu Glu Val Met Met Asp Ile GlyGln Ala Glu Glu Thr Ala Ala Phe Glu Glu Val Met Met Asp Ile Gly

35 40 4535 40 45

Ser Thr Arg Leu Gln Ala Gln Lys Leu Arg Leu Ala Tyr Thr Arg SerSer Thr Arg Leu Gln Ala Gln Lys Leu Arg Leu Ala Tyr Thr Arg Ser

50 55 6050 55 60

Ser His Tyr Gly Gly Ser Leu Pro Asn Val Asn Gln Ile Gly Ser GlySer His Tyr Gly Gly Ser Leu Pro Asn Val Asn Gln Ile Gly Ser Gly

65 70 75 8065 70 75 80

Leu Ala Glu Phe Gln Ser Pro Leu His Ser Pro Leu Asp Ser Ser ArgLeu Ala Glu Phe Gln Ser Pro Leu His Ser Pro Leu Asp Ser Ser Arg

85 90 9585 90 95

Ser Thr Arg His His Gly Leu Val Glu Arg Val Gln Arg Asp Pro ArgSer Thr Arg His His Gly Leu Val Glu Arg Val Gln Arg Asp Pro Arg

100 105 110100 105 110

Arg Met Val Ser Pro Leu Arg Arg Tyr Thr Arg His Ile Asp Ser SerArg Met Val Ser Pro Leu Arg Arg Tyr Thr Arg His Ile Asp Ser Ser

115 120 125115 120 125

Pro Tyr Ser Pro Ala Tyr Leu Ser Pro Pro Pro Glu Ser Ser Trp ArgPro Tyr Ser Pro Ala Tyr Leu Ser Pro Pro Pro Glu Ser Ser Trp Arg

130 135 140130 135 140

Arg Thr Met Ala Trp Gly Asn Phe Pro Ala Glu Lys Gly Gln Leu PheArg Thr Met Ala Trp Gly Asn Phe Pro Ala Glu Lys Gly Gln Leu Phe

145 150 155 160145 150 155 160

Arg Leu Pro Ser Ala Leu Asn Arg Thr Ser Ser Asp Ser Ala Leu HisArg Leu Pro Ser Ala Leu Asn Arg Thr Ser Ser Asp Ser Ala Leu His

165 170 175165 170 175

Thr Ser Val Met Asn Pro Ser Pro Gln Asp Thr Tyr Pro Gly Pro ThrThr Ser Val Met Asn Pro Ser Pro Gln Asp Thr Tyr Pro Gly Pro Thr

180 185 190180 185 190

Pro Pro Ser Ile Leu Pro Ser Arg Arg Gly Gly Ile Leu Asp Gly GluPro Pro Ser Ile Leu Pro Ser Arg Arg Gly Gly Ile Leu Asp Gly Glu

195 200 205195 200 205

Met Asp Pro Lys Val Pro Ala Ile Glu Glu Asn Leu Leu Asp Asp LysMet Asp Pro Lys Val Pro Ala Ile Glu Glu Asn Leu Leu Asp Asp Lys

210 215 220210 215 220

His Leu Leu Lys Pro Trp Asp Ala Lys Lys Leu Ser Ser Ser Ser SerHis Leu Leu Lys Pro Trp Asp Ala Lys Lys Leu Ser Ser Ser Ser Ser

225 230 235 240225 230 235 240

Arg Pro Arg Ser Cys Glu Val Pro Gly Ile Asn Ile Phe Pro Ser ProArg Pro Arg Ser Cys Glu Val Pro Gly Ile Asn Ile Phe Pro Ser Pro

245 250 255245 250 255

Asp Gln Pro Ala Asn Val Pro Val Leu Pro Pro Ala Met Asn Thr GlyAsp Gln Pro Ala Asn Val Pro Val Leu Pro Pro Ala Met Asn Thr Gly

260 265 270260 265 270

Gly Ser Leu Pro Asp Leu Thr Asn Leu His Phe Pro Pro Pro Leu ProGly Ser Leu Pro Asp Leu Thr Asn Leu His Phe Pro Pro Pro Leu Pro

275 280 285275 280 285

Thr Pro Leu Asp Pro Glu Glu Thr Ala Tyr Pro Ser Leu Ser Gly GlyThr Pro Leu Asp Pro Glu Glu Thr Ala Tyr Pro Ser Leu Ser Gly Gly

290 295 300290 295 300

Asn Ser Thr Ser Asn Leu Thr His Thr Met Thr His Leu Gly Ile SerAsn Ser Thr Ser Asn Leu Thr His Thr Met Thr His Leu Gly Ile Ser

305 310 315 320305 310 315 320

Arg Gly His Gly Pro Gly Pro Gly Tyr Asp Ala Pro Gly Leu His SerArg Gly His Gly Pro Gly Pro Gly Tyr Asp Ala Pro Gly Leu His Ser

325 330 335325 330 335

Pro Leu Ser His Pro Ser Leu Gln Ser Ser Leu Ser Asn Pro Asn LeuPro Leu Ser His Pro Ser Leu Gln Ser Ser Leu Ser Asn Pro Asn Leu

340 345 350340 345 350

Gln Ala Ser Leu Ser Ser Pro Gln Pro Gln Leu Gln Gly Ser His SerGln Ala Ser Leu Ser Ser Pro Gln Pro Gln Leu Gln Gly Ser His Ser

355 360 365355 360 365

His Pro Ser Leu Pro Ala Ser Ser Leu Ala Cys His Val Leu Pro ThrHis Pro Ser Leu Pro Ala Ser Ser Leu Ala Cys His Val Leu Pro Thr

370 375 380370 375 380

Thr Ser Leu Gly His Pro Ser Leu Ser Ala Pro Ala Leu Ser Ser SerThr Ser Leu Gly His Pro Ser Leu Ser Ala Pro Ala Leu Ser Ser Ser Ser

385 390 395 400385 390 395 400

Ser Ser Ser Ser Ser Thr Ser Ser Pro Val Leu Gly Ala Pro Ser TyrSer Ser Ser Ser Ser Ser Thr Ser Ser Ser Pro Val Leu Gly Ala Pro Ser Tyr

405 410 415405 410 415

Pro Ala Ser Thr Pro Gly Ala Ser Pro His His Arg Arg Val Pro LeuPro Ala Ser Thr Pro Gly Ala Ser Pro His His Arg Arg Val Pro Leu

420 425 430420 425 430

Ser Pro Leu Ser Leu Leu Ala Gly Pro Ala Asp Ala Arg Arg Ser GlnSer Pro Leu Ser Leu Leu Ala Gly Pro Ala Asp Ala Arg Arg Ser Gln

435 440 445435 440 445

Gln Gln Leu Pro Lys Gln Phe Ser Pro Thr Mer Ser Pro Thr Leu SerGln Gln Leu Pro Lys Gln Phe Ser Pro Thr Mer Ser Pro Thr Leu Ser

450 455 460450 455 460

Ser Ile Thr Gln Gly Val Pro Leu Asp Thr Ser Lys Leu Ser Thr AspSer Ile Thr Gln Gly Val Pro Leu Asp Thr Ser Lys Leu Ser Thr Asp

465 470 475 480465 470 475 480

Gln Arg Leu Pro Pro Tyr Pro Tyr Ser Ser Pro Ser Leu Val Leu ProGln Arg Leu Pro Pro Tyr Pro Tyr Ser Ser Pro Ser Leu Val Leu Pro

485 490 495485 490 495

Thr Gln Pro His Thr Pro Lys Ser Leu Gln Gln Pro Gly Leu Pro SerThr Gln Pro His Thr Pro Lys Ser Leu Gln Gln Pro Gly Leu Pro Ser

500 505 510500 505 510

Gln Ser Cys Ser Val Gln Ser Ser Gly Gly Gln Pro Pro Gly Arg GlnGln Ser Cys Ser Val Gln Ser Ser Gly Gly Gln Pro Pro Gly Arg Gln

515 520 525515 520 525

Ser His Tyr Gly Thr Pro Tyr Pro Pro Gly Pro Ser Gly His Gly GlnSer His Tyr Gly Thr Pro Tyr Pro Pro Gly Pro Ser Gly His Gly Gln

530 535 540530 535 540

Gln Ser Tyr His Arg Pro Met Ser Asp Phe Asn Leu Gly Asn Leu GluGln Ser Tyr His Arg Pro Met Ser Asp Phe Asn Leu Gly Asn Leu Glu

545 550 555 560545 550 555 560

Gln Phe Ser Met Glu Ser Pro Ser Ala Ser Leu Val Leu Asp Pro ProGln Phe Ser Met Glu Ser Pro Ser Ala Ser Leu Val Leu Asp Pro Pro

565 570 575565 570 575

Gly Phe Ser Glu Gly Pro Gly Phe Leu Gly Gly Glu Gly Pro Met GlyGly Phe Ser Glu Gly Pro Gly Phe Leu Gly Gly Glu Gly Pro Met Gly

580 585 590580 585 590

Gly Pro Gln Asp Pro His Thr Phe Asn His Gln Asn Leu Thr His CysGly Pro Gln Asp Pro His Thr Phe Asn His Gln Asn Leu Thr His Cys

595 600 605595 600 605

Ser Arg His Gly Ser Gly Pro Asn Ile Ile Leu Thr Gly Asp Ser SerSer Arg His Gly Ser Gly Pro Asn Ile Ile Leu Thr Gly Asp Ser Ser

610 615 620610 615 620

Pro Gly Phe Ser Lys Glu Ile Ala Ala Ala Leu Ala Gly Val Pro GlyPro Gly Phe Ser Lys Glu Ile Ala Ala Ala Leu Ala Gly Val Pro Gly

625 630 635 640625 630 635 640

Phe Glu Val Ser Ala Ala Gly Leu Glu Leu Gly Leu Gly Leu Glu AspPhe Glu Val Ser Ala Ala Gly Leu Glu Leu Gly Leu Gly Leu Glu Asp

645 650 655645 650 655

Glu Leu Arg Met Glu Pro Leu Gly Leu Glu Gly Leu Asn Met Leu SerGlu Leu Arg Met Glu Pro Leu Gly Leu Glu Gly Leu Asn Met Leu Ser

660 665 670660 665 670

Asp Pro Cys Ala Leu Leu Pro Asp Pro Ala Val Glu Glu Ser Phe ArgAsp Pro Cys Ala Leu Leu Pro Asp Pro Ala Val Glu Glu Ser Phe Arg

675 680 685675 680 685

Ser Asp Arg Leu GlnSer Asp Arg Leu Gln

690690

<210>17<210>17

<211>17<211>17

<212>DNA<212>DNA

<213>人<213> people

<400>17<400>17

ccgtcatttc accaagc 17ccgtcatttc accaagc 17

<210>18<210>18

<211>7<211>7

<212>PRT<212>PRT

<213>人<213> people

<400>18<400>18

Glu Glu Thr Arg Ala Phe GluGlu Glu Thr Arg Ala Phe Glu

1 51 5

<210>19<210>19

<211>7<211>7

<212>PRT<212>PRT

<213>未知<213> unknown

<220><220>

<223>预测的蛋白质<223> predicted protein

<400>19<400>19

Glu Glu Thr Ala Ala Phe GluGlu Glu Thr Ala Ala Phe Glu

1 51 5

<210>20<210>20

<211>34<211>34

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>20<400>20

ccggaattcg ccatggccgc ctcgccgggc tcgg 34ccggaattcg ccatggccgc ctcgccgggc tcgg 34

<210>2l<210>2l

<211>44<211>44

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>21<400>21

ccgcgacagg gtgaggtcgg tcatgagctg ctcgaaggcc cgcg 44ccgcgacagg gtgaggtcgg tcatgagctg ctcgaaggcc cgcg 44

<210>22<210>22

<211>44<211>44

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>22<400>22

gaagcttctg aaattgaacc cgcgacaggg tgaggtcggt catg 44gaagcttctg aaattgaacc cgcgacaggg tgaggtcggt catg 44

<210>23<210>23

<211>63<211>63

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>23<400>23

tggtaaggat cctccatggt actgtgtaag gcgcagttgc tgaagcttc tgaaattgaac 60tggtaaggat cctccatggt actgtgtaag gcgcagttgc tgaagcttc tgaaattgaac 60

ccg 63ccg 63

<210>24<210>24

<211>2259<211>2259

<212>DNA<212>DNA

<213>人<213> people

<220><220>

<221>misc_feature<221>misc_feature

<222>1，13<222>1,13

<223>n＝A，T，C或G<223>n=A, T, C or G

<400>24<400>24

nttttttgta canaaaagca ggctgttacc ggtccggaat tcgccatggc cgcctcgccg 60nttttttgta canaaaagca ggctgttacc ggtccggaat tcgccatggc cgcctcgccg 60

ggctcgggca gcgccaaccc gcggaagttc agtgagaaga tcgcgctgca cacgcagaga 120ggctcgggca gcgccaaccc gcggaagttc agtgagaaga tcgcgctgca cacgcagaga 120

caggccgagg agacgcgggc cttcgagcag ctcatgaccg acctcaccct gtcgcgggtt 180caggccgagg agacgcgggc cttcgagcag ctcatgaccg acctcaccct gtcgcgggtt 180

caatttcaga agcttcagca actgcgcctt acacagtacc atggaggatc cttaccaaat 240caatttcaga agcttcagca actgcgcctt acacagtacc atggaggatc cttaccaaat 240

gtgagccagc tgcggagcaa tgcgtcagag tttcagccgt catttcacca agctgataat 300gtgagccagc tgcggagcaa tgcgtcagag tttcagccgt catttcacca agctgataat 300

gttcggggaa cccgccatca cgggctggtg gagaggccat ccaggaaccg cttccacccc 360gttcggggaa cccgccatca cgggctggtg gagaggccat ccaggaaccg cttccacccc 360

ctccaccgaa ggtctgggga caagccaggg cgacaatttg atggtagtgc ttttggagcc 420ctccaccgaa ggtctgggga caagccaggg cgacaatttg atggtagtgc ttttggagcc 420

aattattcct cacagcctct ggatgagagt tggccaaggc agcagcctcc ttggaaagac 480aattattcct cacagcctct ggatgagagt tggccaaggc agcagcctcc ttggaaagac 480

gaaaagcatc ctgggttcag gctgacatct gcacttaaca ggaccaattc tgattctgct 540gaaaagcatc ctgggttcag gctgacatct gcacttaaca ggaccaattc tgattctgct 540

cttcacacga gtgctctgag taccaagccc caggacccct atggaggagg gggccagtcg 600cttcacacga gtgctctgag taccaagccc caggacccct atggaggagg gggccagtcg 600

gcctggcctg ccccatacat ggggttttgt gatggtgaga ataatggaca tggggaagta 660gcctggcctg ccccatacat ggggttttgt gatggtgaga ataatggaca tggggaagta 660

gcatctttcc ctggcccatt gaaagaagag aatctgttaa atgttcctaa gccactgcca 720gcatctttcc ctggcccatt gaaagaagag aatctgttaa atgttcctaa gccactgcca 720

aaacaactgt gggagaccaa ggagattcag tccctgtcag gacgccctcg atcctgtgat 780aaacaactgt gggagaccaa ggagattcag tccctgtcag gacgccctcg atcctgtgat 780

gttggaggtg gcaatgcttt tccacataat ggtcaaaacc taggcctctc acccttcttg 840gttggaggtg gcaatgcttt tccacataat ggtcaaaacc taggcctctc acccttcttg 840

gggactttga acactggagg gtcattgcca gatctaacca acctccacta ctcgacaccc 900gggactttga acactggagg gtcattgcca gatctaacca acctccacta ctcgacaccc 900

ctgccagcct ccctggacac caccgaccac cactttggca gtatgagtgt ggggaatagt 960ctgccagcct ccctggacac caccgaccac cactttggca gtatgagtgt ggggaatagt 960

gtgaacaaca tcccagctgc tatgacccac ctgggtataa gaagctcctc tggtctccag 1020gtgaacaaca tcccagctgc tatgacccac ctgggtataa gaagctcctc tggtctccag 1020

agttctcgga gtaacccctc catccaagcc acgctcaata agactgtgct ttcctcttcc 1080agttctcgga gtaacccctc catccaagcc acgctcaata agactgtgct ttcctcttcc 1080

ttaaataacc acccacagac atctgttccc aacgcatctg ctcttcaccc ttcgctccgt 1140ttaaataacc accccacagac atctgttccc aacgcatctg ctcttcaccc ttcgctccgt 1140

ctgttttccc ttagcaaccc atctctttcc accacaaacc tgagcggccc gtctcgccgt 1200ctgttttccc ttagcaaccc atctctttcc accacaaacc tgagcggccc gtctcgccgt 1200

cggcagcctc ccgtcagccc tctcacgctt tctcctggcc ctgaagcaca tcaaggtttc 1260cggcagcctc ccgtcagccc tctcacgctt tctcctggcc ctgaagcaca tcaaggtttc 1260

agcagacagc tgtcttcaac cagcccactg gccccatatc ctacctccca gatggtgtcc 1320agcagacagc tgtcttcaac cagcccactg gccccatatc ctacctccca gatggtgtcc 1320

tcagaccgaa gccaactttc ctttctgccc acagaagctc aagcccaggt gtcgccgcca 1380tcagaccgaa gccaactttc ctttctgccc acagaagctc aagcccaggt gtcgccgcca 1380

cccccttacc ctgcacccca ggagctcacc cagcccctcc tgcagcagcc ccgcgcccct 1440ccccccttacc ctgcacccca ggagctcacc cagcccctcc tgcagcagcc ccgcgcccct 1440

gaggcccctg cccagcagcc ccaggcagcc tcctcactgc cacagtcaga ctttcagctt 1500gaggcccctg cccagcagcc ccaggcagcc tcctcactgc cacagtcaga ctttcagctt 1500

ctcccggccc agggctcatc tttgaccaac ttcttcccag atgtgggttt tgaccagcag 1560ctcccggccc agggctcatc tttgaccaac ttcttcccag atgtgggttt tgaccagcag 1560

tccatgaggc caggccctgc ctttcctcaa caggtgcctc tggtgcaaca aggttcccga 1620tccatgaggc caggccctgc ctttcctcaa caggtgcctc tggtgcaaca aggttcccga 1620

gaactgcagg actcttttca tttgagacca agcccgtatt ccaactgcgg gagtctcccg 1680gaactgcagg actcttttca tttgagacca agcccgtatt ccaactgcgg gagtctcccg 1680

aacaccatcc tgccagaaga ctccagcacc agcctgttca aagacctcaa cagtgcgctg 1740aacaccatcc tgccagaaga ctccagcacc agcctgttca aagacctcaa cagtgcgctg 1740

gcaggcctgc ctgaggtcag cctgaacgtg gacactccat ttccactgga agaggagctg 1800gcaggcctgc ctgaggtcag cctgaacgtg gacactccat ttccactgga agaggagctg 1800

cagattgaac ccctgagcct ggatggactc aacatgttaa gtgactccag catgggcctg 1860cagattgaac ccctgagcct ggatggactc aacatgttaa gtgactccag catgggcctg 1860

ctggacccct ctgttgaaga gacgtttcga gctgacagac tgtgaacaga aggcagtgga 1920ctggacccct ctgttgaaga gacgtttcga gctgacagac tgtgaacaga aggcagtgga 1920

acagaagaat gtttttctgc aacagccaaa atagaatgga atagaatgaa gccagctgat 1980acagaagaat gtttttctgc aacagccaaa atagaatgga atagaatgaa gccagctgat 1980

accacgggct ttcgttatct tgacatagaa ggaagcagtg ccacggctcc agggtttcag 2040accacgggct ttcgttatct tgacatagaa ggaagcagtg ccacggctcc agggtttcag 2040

atgagatccc atctcagaca ctgtggcttc ctccagatca cacagctttg tactgcctct 2100atgagatccc atctcagaca ctgtggcttc ctccagatca cacagctttg tactgcctct 2100

cccgcctgtg gccaaagtcg tgttgcagca ggcaggctgc ttggagcttc ccatgaactg 2160cccgcctgtg gccaaagtcg tgttgcagca ggcaggctgc ttggagcttc ccatgaactg 2160

gaaagctcac ctccactgca tctttttact ggccatccag tcagccgatg tgtaagagta 2220gaaagctcac ctccactgca tctttttact ggccatccag tcagccgatg tgtaagagta 2220

ggaaatactg tgtcactgga ggccctccgt agcattggg 2259ggaaatactg tgtcactgga ggccctccgt agcattggg 2259

<210>25<210>25

<211>619<211>619

<212>PRT<212>PRT

<213>人<213> people

<400>25<400>25

Met Ala Ala Ser Pro Gly Ser Gly Ser Ala Asn Pro Arg Lys Phe SerMet Ala Ala Ser Pro Gly Ser Gly Ser Ala Asn Pro Arg Lys Phe Ser

1 5 10 151 5 10 15

Glu Lys Ile Ala Leu His Thr Gln Arg Gln Ala Glu Glu Thr Arg AlaGlu Lys Ile Ala Leu His Thr Gln Arg Gln Ala Glu Glu Thr Arg Ala

20 25 3020 25 30

Phe Glu Gln Leu Met Thr Asp Leu Thr Leu Ser Arg Val Gln Phe GlnPhe Glu Gln Leu Met Thr Asp Leu Thr Leu Ser Arg Val Gln Phe Gln

35 40 4535 40 45

Lys Leu Gln Gln Leu Arg Leu Thr Gln Tyr His Gly Gly Ser Leu ProLys Leu Gln Gln Leu Arg Leu Thr Gln Tyr His Gly Gly Ser Leu Pro

50 55 6050 55 60

Asn Val Ser Gln Leu Arg Ser Asn Ala Ser Glu Phe Gln Pro Ser PheAsn Val Ser Gln Leu Arg Ser Asn Ala Ser Glu Phe Gln Pro Ser Phe

65 70 75 8065 70 75 80

His Gln Ala Asp Asn Val Arg Gly Thr Arg His His Gly Leu Val GluHis Gln Ala Asp Asn Val Arg Gly Thr Arg His His Gly Leu Val Glu

85 90 9585 90 95

Arg Pro Ser Arg Asn Arg Phe His Pro Leu His Arg Arg Ser Gly AspArg Pro Ser Arg Asn Arg Phe His Pro Leu His Arg Arg Ser Gly Asp

100 105 110100 105 110

Lys Pro Gly Arg Gln Phe Asp Gly Ser Ala Phe Gly Ala Asn Tyr SerLys Pro Gly Arg Gln Phe Asp Gly Ser Ala Phe Gly Ala Asn Tyr Ser

115 120 125115 120 125

Ser Gln Pro Leu Asp Glu Ser Trp Pro Arg Gln Gln Pro Pro Trp LysSer Gln Pro Leu Asp Glu Ser Trp Pro Arg Gln Gln Pro Pro Trp Lys

130 135 140130 135 140

Asp Glu Lys His Pro Gly Phe Arg Leu Thr Ser Ala Leu Asn Arg ThrAsp Glu Lys His Pro Gly Phe Arg Leu Thr Ser Ala Leu Asn Arg Thr

145 150 155 160145 150 155 160

Asn Ser Asp Ser Ala Leu His Thr Ser Ala Leu Ser Thr Lys Pro GlnAsn Ser Asp Ser Ala Leu His Thr Ser Ala Leu Ser Thr Lys Pro Gln

165 170 175165 170 175

Asp Pro Tyr Gly Gly Gly Gly Gln Ser Ala Trp Pro Ala Pro Tyr MetAsp Pro Tyr Gly Gly Gly Gly Gln Ser Ala Trp Pro Ala Pro Tyr Met

180 185 190180 185 190

Gly Phe Cys Asp Gly Glu Asn Asn Gly His Gly Glu Val Ala Ser PheGly Phe Cys Asp Gly Glu Asn Asn Gly His Gly Glu Val Ala Ser Phe

195 200 205195 200 205

Pro Gly Pro Leu Lys Glu Glu Asn Leu Leu Asn Val Pro Lys Pro LeuPro Gly Pro Leu Lys Glu Glu Asn Leu Leu Asn Val Pro Lys Pro Leu

210 215 220210 215 220

Pro Lys Gln Leu Trp Glu Thr Lys Glu Ile Gln Ser Leu Ser Gly ArgPro Lys Gln Leu Trp Glu Thr Lys Glu Ile Gln Ser Leu Ser Gly Arg

225 230 235 240225 230 235 240

Pro Arg Ser Cys Asp Val Gly Gly Gly Asn Ala Phe Pro His Asn GlyPro Arg Ser Cys Asp Val Gly Gly Gly Asn Ala Phe Pro His Asn Gly

245 250 255245 250 255

Gln Asn Leu Gly Leu Ser Pro Phe Leu Gly Thr Leu Asn Thr Gly GlyGln Asn Leu Gly Leu Ser Pro Phe Leu Gly Thr Leu Asn Thr Gly Gly

260 265 270260 265 270

Ser Leu Pro Asp Leu Thr Asn Leu His Tyr Ser Thr Pro Leu Pro AlaSer Leu Pro Asp Leu Thr Asn Leu His Tyr Ser Thr Pro Leu Pro Ala

275 280 285275 280 285

Ser Leu Asp Thr Thr Asp His His Phe Gly Ser Met Ser Val Gly AsnSer Leu Asp Thr Thr Asp His His Phe Gly Ser Met Ser Val Gly Asn

290 295 300290 295 300

Ser Val Asn Asn Ile Pro Ala Ala Met Thr His Leu Gly Ile Arg SerSer Val Asn Asn Ile Pro Ala Ala Met Thr His Leu Gly Ile Arg Ser

305 310 315 320305 310 315 320

Ser Ser Gly Leu Gln Ser Ser Arg Ser Asn Pro Ser Ile Gln Ala ThrSer Ser Gly Leu Gln Ser Ser Arg Ser Asn Pro Ser Ile Gln Ala Thr

325 330 335325 330 335

Leu Asn Lys Thr Val Leu Ser Ser Ser Leu Asn Asn His Pro Gln ThrLeu Asn Lys Thr Val Leu Ser Ser Ser Leu Asn Asn His Pro Gln Thr

340 345 350340 345 350

Ser Val Pro Asn Ala Ser Ala Leu His Pro Ser Leu Arg Leu Phe SerSer Val Pro Asn Ala Ser Ala Leu His Pro Ser Leu Arg Leu Phe Ser

355 360 365355 360 365

Leu Ser Asn Pro Ser Leu Ser Thr Thr Asn Leu Ser Gly Pro Ser ArgLeu Ser Asn Pro Ser Leu Ser Thr Thr Asn Leu Ser Gly Pro Ser Arg

370 375 380370 375 380

Arg Arg Gln Pro Pro Val Ser Pro Leu Thr Leu Ser Pro Gly Pro GluArg Arg Gln Pro Pro Val Ser Pro Leu Thr Leu Ser Pro Gly Pro Glu

385 390 395 400385 390 395 400

Ala His Gln Gly Phe Ser Arg Gln Leu Ser Ser Thr Ser Pro Leu AlaAla His Gln Gly Phe Ser Arg Gln Leu Ser Ser Thr Ser Pro Leu Ala

405 410 415405 410 415

Pro Tyr Pro Thr Ser Gln Met Val Ser Ser Asp Arg Ser Gln Leu SerPro Tyr Pro Thr Ser Gln Met Val Ser Ser Asp Arg Ser Gln Leu Ser

420 425 430420 425 430

Phe Leu Pro Thr Glu Ala Gln Ala Gln Val Ser Pro Pro Pro Pro TyrPhe Leu Pro Thr Glu Ala Gln Ala Gln Val Ser Pro Pro Pro Pro Tyr

435 440 445435 440 445

Pro Ala Pro Gln Glu Leu Thr Gln Pro Leu Leu Gln Gln Pro Arg AlaPro Ala Pro Gln Glu Leu Thr Gln Pro Leu Leu Gln Gln Pro Arg Ala

450 455 460450 455 460

Pro Glu Ala Pro Ala Gln Gln Pro Gln Ala Ala Ser Ser Leu Pro GlnPro Glu Ala Pro Ala Gln Gln Pro Gln Ala Ala Ser Ser Leu Pro Gln

465 470 475 480465 470 475 480

Ser Asp Phe Gln Leu Leu Pro Ala Gln Gly Ser Ser Leu Thr Asn PheSer Asp Phe Gln Leu Leu Pro Ala Gln Gly Ser Ser Leu Thr Asn Phe

485 490 495485 490 495

Phe Pro Asp Val Gly Phe Asp Gln Gln Ser Met Arg Pro Gly Pro AlaPhe Pro Asp Val Gly Phe Asp Gln Gln Ser Met Arg Pro Gly Pro Ala

500 505 510500 505 510

Phe Pro Gln Gln Val Pro Leu Val Gln Gln Gly Ser Arg Glu Leu GlnPhe Pro Gln Gln Val Pro Leu Val Gln Gln Gly Ser Arg Glu Leu Gln

515 520 525515 520 525

Asp Ser Phe His Leu Arg Pro Ser Pro Tyr Ser Asn Cys Gly Ser LeuAsp Ser Phe His Leu Arg Pro Ser Pro Tyr Ser Asn Cys Gly Ser Leu

530 535 540530 535 540

Pro Asn Thr Ile Leu Pro Glu Asp Ser Ser Thr Ser Leu Phe Lys AspPro Asn Thr Ile Leu Pro Glu Asp Ser Ser Thr Ser Leu Phe Lys Asp

545 550 555 560545 550 555 560

Leu Asn Ser Ala Leu Ala Gly Leu Pro Glu Val Ser Leu Asn Val AspLeu Asn Ser Ala Leu Ala Gly Leu Pro Glu Val Ser Leu Asn Val Asp

565 570 575565 570 575

Thr Pro Phe Pro Leu Glu Glu Glu Leu Gln Ile Glu Pro Leu Ser LeuThr Pro Phe Pro Leu Glu Glu Glu Leu Gln Ile Glu Pro Leu Ser Leu

580 585 590580 585 590

Asp Gly Leu Asn Met Leu Ser Asp Ser Ser Met Gly Leu Leu Asp ProAsp Gly Leu Asn Met Leu Ser Asp Ser Ser Met Gly Leu Leu Asp Pro

595 600 605595 600 605

Ser Val Glu Glu Thr Phe Arg Ala Asp Arg LeuSer Val Glu Glu Thr Phe Arg Ala Asp Arg Leu

610 615610 615

<210>26<210>26

<211>2992<211>2992

<212>DNA<212>DNA

<213>黑腹果蝇(drosophila melanogaster)<213> Drosophila melanogaster

<400>26<400>26

atggccaatc cgcgcaagtt cagcgagaag atcgctctgc agaagcagaa gcaggcggag 60atggccaatc cgcgcaagtt cagcgagaag atcgctctgc agaagcagaa gcaggcggag 60

ggcacagcgg aattcgagcg gatcatgaag gaggtgtatg ccacgaagag ggatgagccg 120ggcacagcgg aattcgagcg gatcatgaag gaggtgtatg ccacgaagag ggatgagccg 120

cctgcgaatc agaagatcct agacggcctt gtcggcggtc aggaggtaag ccaatcctcg 180cctgcgaatc agaagatcct agacggcctt gtcggcggtc aggaggtaag ccaatcctcg 180

ccaggcgcag gcaatgggac gggcggaggt ggcagtggtt ccggcagtgg agccagcggc 240ccaggcgcag gcaatgggac gggcggaggt ggcagtggtt ccggcagtgg agccagcggc 240

ggaggagcct caccagatgg cctgggaggc ggcggtggtt ctccgacggc ttatcgagaa 300ggaggagcct caccagatgg cctgggaggc ggcggtggtt ctccgacggc ttatcgagaa 300

tcccgagggc gcagcgtagg tgtgggtccc atgcgaagac cgtcggagcg caagcaggat 360tcccgagggc gcagcgtagg tgtgggtccc atgcgaagac cgtcggagcg caagcaggat 360

cgttcgccct acggcagcag cagtacgcaa caaaccttag acaacggcca gctaaatccg 420cgttcgccct acggcagcag cagtacgcaa caaaccttag acaacggcca gctaaatccg 420

catcttcttg gtccacctac ggcggagagt ttgtggcggc ggtccagctc cgattcggcg 480catcttcttg gtccacctac ggcggagagt ttgtggcggc ggtccagctc cgattcggcg 480

ctgcaccaaa gtgcgctggt ggcgggcttc aatagcgacg tgaactcgat gggcgccaac 540ctgcaccaaa gtgcgctggt ggcgggcttc aatagcgacg tgaactcgat gggcgccaac 540

tatcagcagc agcaacatca gcaacaacag caaccgggcc agccaagatc tcactcgccg 600tatcagcagc agcaacatca gcaacaacag caaccgggcc agccaagatc tcactcgccg 600

caccatggta taaacaggac catgagtccg caggcgcaac ggaggaagtc gccgctactg 660caccatggta taaacaggac catgagtccg caggcgcaac ggaggaagtc gccgctactg 660

cagccccatc agctgcagtt gcagcaactg caacagcagc agcaacagat gcaacatcag 720cagccccatc agctgcagtt gcagcaactg caacagcagc agcaacagat gcaacatcag 720

catcagctgc accagcagct ccaaatgcag cagctgcaac agcaccagca gcaacaccag 780catcagctgc accagcagct ccaaatgcag cagctgcaac agcaccagca gcaacaccag 780

cagcagcagc aacaacagaa cacgccatac aacaacgcca aattcacgaa tcctgtgttc 840cagcagcagc aacaacagaa cacgccatac aacaacgcca aattcacgaa tcctgtgttc 840

cggccgctgc aggatcaggt caactttgcc aacaccggct ccctgcccga tctcacggcc 900cggccgctgc aggatcaggt caactttgcc aacaccggct ccctgcccga tctcacggcc 900

cttcaaaact atggacccca gcagcagcag cagcaatccc agcaacagcc gtcgcagcaa 960cttcaaaact atggaccccca gcagcagcag cagcaatccc agcaacagcc gtcgcagcaa 960

caacagcagt tgcagcaaac cctgtcgcca gtcatgtctc cgcacaatca ccgccgcgaa 1020caacagcagt tgcagcaaac cctgtcgcca gtcatgtctc cgcacaatca ccgccgcgaa 1020

cgggatcagt cgcccagtcc gtttagtccg gcgggtggag gagggggagc aggtcccggg 1080cgggatcagt cgcccagtcc gtttagtccg gcgggtggag gaggggggagc aggtcccggg 1080

tcgccctatc agcagcaaca gcactcgccc accggaaaca cgcaacagca gcagcagcag 1140tcgccctatc agcagcaaca gcactcgccc accggaaaca cgcaacagca gcagcagcag 1140

caccaacagc ccagcaactc gccgcacctg tcctttacca atctggccac cacgcaggca 1200caccaacagc ccagcaactc gccgcacctg tcctttacca atctggccac cacgcaggca 1200

gctgttacca catttaaccc gctccccacg ctgggtccgc acaatgccac cgactaccgc 1260gctgttacca catttaaccc gctccccacg ctgggtccgc acaatgccac cgactaccgc 1260

cagccaccga atcctcctag tccacgctct tcgcccggct tgctgagcag cgtatcggcc 1320cagccaccga atcctcctag tccacgctct tcgcccggct tgctgagcag cgtatcggcc 1320

acggatctgc actccagtgc accggccagt cccatacgcc agcagcaaca ggcccatcag 1380acggatctgc actccagtgc accggccagt cccatacgcc agcagcaaca ggcccatcag 1380

cagcaacagc agcagcaaca ggcgcagcaa caacagcaac agtttgataa ctcctacaac 1440cagcaacagc agcagcaaca ggcgcagcaa caacagcaac agtttgataa ctcctacaac 1440

agtctgaata cctcgtttca caatcagttt gagattttct cgctgggcga cagcaattcc 1500agtctgaata cctcgtttca caatcagttt gagattttct cgctgggcga cagcaattcc 1500

tcgccggaac agcagggctt tgcaaataat ttcgtggccc tcgactttga cgacctgagt 1560tcgccggaac agcagggctt tgcaaataat ttcgtggccc tcgactttga cgacctgagt 1560

ggcggcggag gtggtggccc aagcgggggc ggcggcagca atggaggagg tctgaccaac 1620ggcggcggag gtggtggccc aagcgggggc ggcggcagca atggaggagg tctgaccaac 1620

ggttacaaca agccggagat gttggacttc agcgagctga gcggcagccc ggaggcgagt 1680ggttacaaca agccggagat gttggacttc agcgagctga gcggcagccc ggaggcgagt 1680

gggaacaaca accacatgcg gcgaggagtg agcaacctga acaacaacgg gttgagcaat 1740gggaacaaca accacatgcg gcgaggagtg agcaacctga acaacaacgg gttgagcaat 1740

ggtgtggtgg gatccacgca caacggcagc acaaatctaa atggagcggg aaacaacaat 1800ggtgtggtgg gatccacgca caacggcagc acaaatctaa atggagcggg aaacaacaat 1800

agcagtagtg gaggtggcac ggcgcaggat cctttgggaa taaccacttc gcctgtgccc 1860agcagtagtg gaggtggcac ggcgcaggat cctttgggaa taaccacttc gcctgtgccc 1860

tcacccttgg gctgccccag ttcaccgctg ccgataccga ttccgatgtc ggcgcaaagc 1920tcacccttgg gctgccccag ttcaccgctg ccgataccga ttccgatgtc ggcgcaaagc 1920

tcgccacagc agcagcacca ccatcatcag cagcagcaac aacagcatca tcagcagcaa 1980tcgccacagc agcagcacca ccatcatcag cagcagcaac aacagcatca tcagcagcaa 1980

caccatcagc agcagcaatt atcattatct ctgcaccatt cgccgcatca ttcgccaatg 2040caccatcagc agcagcaatt atcattatct ctgcaccat cgccgcatca ttcgccaatg 2040

cattcgccgc accatgggaa ttcaccgctt tcaagcagct cgccagtgag tcacaatgcc 2100cattcgccgc accatgggaa ttcaccgctt tcaagcagct cgccagtgag tcacaatgcc 2100

tgctccaact ccaacgtggt gatgaaccac cagcagcagc agcaacaaca tcaccaccag 2160tgctccaact ccaacgtggt gatgaaccac cagcagcagc agcaacaaca tcaccaccag 2160

caacaccatc atcagggctc ctcgcaaagt cacacgccga ccacagcgaa tataccctct 2220caacaccatc atcagggctc ctcgcaaagt cacacgccga ccacagcgaa tataccctct 2220

attatcttta gtgattactc ctccaacgcg gattatacca gggagatctt cgactccctc 2280attatcttta gtgattactc ctccaacgcg gattatacca gggagatctt cgactccctc 2280

gatctggatc tgggacagat ggacgtagcc ggtttgcaga tgctgtccga ccagaacccc 2340gatctggatc tgggacagat ggacgtagcc ggtttgcaga tgctgtccga ccagaaccccc 2340

atcatgatcg ccgatcccaa catcgaggat agttttcgac gcgacctcaa ctgatactat 2400atcatgatcg ccgatcccaa catcgaggat agttttcgac gcgacctcaa ctgatactat 2400

gaggaggctg ttgcggccat tgagagcgga gtgctgctgg aggaggacta ccaggcgctg 2460gaggaggctg ttgcggccat tgagagcgga gtgctgctgg aggaggacta ccaggcgctg 2460

ctcggatcag aggcgctggc ggatgaacag gtggtcacag tcgaggccgc cggagccgca 2520ctcggatcag aggcgctggc ggatgaacag gtggtcacag tcgaggccgc cggagccgca 2520

gcagcagtag taacagttga agaggcagcc acagttagcg agaaggacaa aaaagatttg 2580gcagcagtag taacagttga agaggcagcc acagttagcg agaaggacaa aaaagatttg 2580

gaagttgtgg aacttctggt gtccggtgtt atggatgacc tggtggactc cagtgacctg 2640gaagttgtgg aacttctggt gtccggtgtt atggatgacc tggtggactc cagtgacctg 2640

gacgaggaag tgcgcaattt ctttttttag gcagccagca agtcattttt gtcgttaaca 2700gacgaggaag tgcgcaattt ctttttttag gcagccagca agtcattttt gtcgttaaca 2700

caactgatgg aattttcgtt tttaacacag atgaggaagt gaattacgtt ttttaaacgc 2760caactgatgg aattttcgtt tttaacacag atgaggaagt gaattacgtt ttttaaacgc 2760

attcacttgc catttctcga ttaaatgcca tattacttaa gctcaggatt tacaagctta 2820attcacttgc catttctcga ttaaatgcca tattacttaa gctcaggatt tacaagctta 2820

atgcgaatta agttaatttc ggaaatgctg acgagagtga ttgcaaagtt caaaattgat 2880atgcgaatta agttaatttc ggaaatgctg acgagagtga ttgcaaagtt caaaattgat 2880

acaaattcac ttccgcaaat tcatgctgaa actgaaagtt ttctaacagt cctcaatatt 2940acaaattcac ttccgcaaat tcatgctgaa actgaaagtt ttctaacagt cctcaatatt 2940

gttatctcgt tatcgtccgt gctttcgtag ctagctccta caacaaaaat ac 2992gttatctcgt tatcgtccgt gctttcgtag ctagctccta caacaaaaat ac 2992

<210>27<210>27

<211>797<211>797

<212>PRT<212>PRT

<213>黑腹果蝇<213> Drosophila melanogaster

<400>27<400>27

Met Ala Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala Leu Gln Lys GlnMet Ala Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala Leu Gln Lys Gln

1 5 10 151 5 10 15

Lys Gln Ala Glu Gly Thr Ala Glu Phe Glu Arg Ile Met Lys Glu ValLys Gln Ala Glu Gly Thr Ala Glu Phe Glu Arg Ile Met Lys Glu Val

20 25 3020 25 30

Tyr Ala Thr Lys Arg Asp Glu Pro Pro Ala Asn Gln Lys Ile Leu AspTyr Ala Thr Lys Arg Asp Glu Pro Pro Ala Asn Gln Lys Ile Leu Asp

35 40 4535 40 45

Gly Leu Val Gly Gly Gln Glu Val Ser Gln Ser Ser Pro Gly Ala GlyGly Leu Val Gly Gly Gly Gln Glu Val Ser Gln Ser Ser Pro Gly Ala Gly

50 55 6050 55 60

Asn Gly Thr Gly Gly Gly Gly Ser Gly Ser Gly Ser Gly Ala Ser GlyAsn Gly Thr Gly Gly Gly Gly Ser Gly Ser Gly Ser Gly Ala Ser Gly

65 70 75 8065 70 75 80

Gly Gly Ala Ser Pro Asp Gly Leu Gly Gly Gly Gly Gly Ser Pro ThrGly Gly Ala Ser Pro Asp Gly Leu Gly Gly Gly Gly Gly Ser Pro Thr

85 90 9585 90 95

Ala Tyr Arg Glu Ser Arg Gly Arg Ser Val Gly Val Gly Pro Met ArgAla Tyr Arg Glu Ser Arg Gly Arg Ser Val Gly Val Gly Pro Met Arg

100 105 110100 105 110

Arg Pro Ser Glu Arg Lys Gln Asp Arg Ser Pro Tyr Gly Ser Ser SerArg Pro Ser Glu Arg Lys Gln Asp Arg Ser Pro Tyr Gly Ser Ser Ser

115 120 125115 120 125

Thr Gln Gln Thr Leu Asp Asn Gly Gln Leu Asn Pro His Leu Leu GlyThr Gln Gln Thr Leu Asp Asn Gly Gln Leu Asn Pro His Leu Leu Gly

130 135 140130 135 140

Pro Pro Thr Ala Glu Ser Leu Trp Arg Arg Ser Ser Ser Asp Ser AlaPro Pro Thr Ala Glu Ser Leu Trp Arg Arg Ser Ser Ser Asp Ser Ala

145 150 155 160145 150 155 160

Leu His Gln Ser Ala Leu Val Ala Gly Phe Asn Ser Asp Val Asn SerLeu His Gln Ser Ala Leu Val Ala Gly Phe Asn Ser Asp Val Asn Ser

165 170 175165 170 175

Met Gly Ala Asn Tyr Gln Gln Gln Gln His Gln Gln Gln Gln Gln ProMet Gly Ala Asn Tyr Gln Gln Gln Gln His Gln Gln Gln Gln Gln Pro

180 185 190180 185 190

Gly Gln Pro Arg Ser His Ser Pro His His Gly Ile Asn Arg Thr MetGly Gln Pro Arg Ser His Ser Pro His His Gly Ile Asn Arg Thr Met

195 200 205195 200 205

Ser Pro Gln Ala Gln Arg Arg Lys Ser Pro Leu Leu Gln Pro His GlnSer Pro Gln Ala Gln Arg Arg Lys Ser Pro Leu Leu Gln Pro His Gln

210 215 220210 215 220

Leu Gln Leu Gln Gln Leu Gln Gln Gln Gln Gln Gln Met Gln His GlnLeu Gln Leu Gln Gln Leu Gln Gln Gln Gln Gln Gln Gln Met Gln His Gln

225 230 235 240225 230 235 240

His Gln Leu His Gln Gln Leu Gln Met Gln Gln Leu Gln Gln His GlnHis Gln Leu His Gln Gln Leu Gln Met Gln Gln Leu Gln Gln His Gln

245 250 255245 250 255

Gln Gln His Gln Gln Gln Gln Gln Gln Gln Asn Thr Pro Tyr Asn AsnGln Gln His Gln Gln Gln Gln Gln Gln Gln Asn Thr Pro Tyr Asn Asn

260 265 270260 265 270

Ala Lys Phe Thr Asn Pro Val Phe Arg Pro Leu Gln Asp Gln Val AsnAla Lys Phe Thr Asn Pro Val Phe Arg Pro Leu Gln Asp Gln Val Asn

275 280 285275 280 285

Phe Ala Asn Thr Gly Ser Leu Pro Asp Leu Thr Ala Leu Gln Asn TyrPhe Ala Asn Thr Gly Ser Leu Pro Asp Leu Thr Ala Leu Gln Asn Tyr

290 295 300290 295 300

Gly Pro Gln Gln Gln Gln Gln Gln Ser Gln Gln Gln Pro Ser Gln GlnGly Pro Gln Gln Gln Gln Gln Gln Ser Gln Gln Gln Pro Ser Gln Gln

305 310 315 320305 310 315 320

Gln Gln Gln Leu Gln Gln Thr Leu Ser Pro Val Met Ser Pro His AsnGln Gln Gln Leu Gln Gln Thr Leu Ser Pro Val Met Ser Pro His Asn

325 330 335325 330 335

His Arg Arg Glu Arg Asp Gln Ser Pro Ser Pro Phe Ser Pro Ala GlyHis Arg Arg Glu Arg Asp Gln Ser Pro Ser Pro Phe Ser Pro Ala Gly

340 345 350340 345 350

Gly Gly Gly Gly Ala Gly Pro Gly Ser Pro Tyr Gln Gln Gln Gln HisGly Gly Gly Gly Ala Gly Pro Gly Ser Pro Tyr Gln Gln Gln Gln His

355 360 365355 360 365

Ser Pro Thr Gly Asn Thr Gln Gln Gln Gln Gln Gln His Gln Gln ProSer Pro Thr Gly Asn Thr Gln Gln Gln Gln Gln Gln Gln His Gln Gln Pro

370 375 380370 375 380

Ser Asn Ser Pro His Leu Ser Phe Thr Asn Leu Ala Thr Thr Gln AlaSer Asn Ser Pro His Leu Ser Phe Thr Asn Leu Ala Thr Thr Gln Ala

385 390 395 400385 390 395 400

Ala Val Thr Thr Phe Asn Pro Leu Pro Thr Leu Gly Pro His Asn AlaAla Val Thr Thr Phe Asn Pro Leu Pro Thr Leu Gly Pro His Asn Ala

405 410 415405 410 415

Thr Asp Tyr Arg Gln Pro Pro Asn Pro Pro Ser Pro Arg Ser Ser ProThr Asp Tyr Arg Gln Pro Pro Asn Pro Pro Ser Pro Arg Ser Ser Pro

420 425 430420 425 430

Gly Leu Leu Ser Ser Val Ser Ala Thr Asp Leu His Ser Ser Ala ProGly Leu Leu Ser Ser Val Ser Ala Thr Asp Leu His Ser Ser Ala Pro

435 440 445435 440 445

Ala Ser Pro Ile Arg Gln Gln Gln Gln Ala His Gln Gln Gln Gln GlnAla Ser Pro Ile Arg Gln Gln Gln Gln Ala His Gln Gln Gln Gln Gln

450 455 460450 455 460

Gln Gln Gln Ala Gln Gln Gln Gln Gln Gln Phe Asp Asn Ser Tyr AsnGln Gln Gln Ala Gln Gln Gln Gln Gln Gln Phe Asp Asn Ser Tyr Asn

465 470 475 480465 470 475 480

Ser Leu Asn Thr Ser Phe His Asn Gln Phe Glu Ile Phe Ser Leu GlySer Leu Asn Thr Ser Phe His Asn Gln Phe Glu Ile Phe Ser Leu Gly

485 490 495485 490 495

Asp Ser Asn Ser Ser Pro Glu Gln Gln Gly Phe Ala Asn Asn Phe ValAsp Ser Asn Ser Ser Pro Glu Gln Gln Gly Phe Ala Asn Asn Phe Val

500 505 510500 505 510

Ala Leu Asp Phe Asp Asp Leu Ser Gly Gly Gly Gly Gly Gly Pro SerAla Leu Asp Phe Asp Asp Leu Ser Gly Gly Gly Gly Gly Gly Pro Ser

515 520 525515 520 525

Gly Gly Gly Gly Ser Asn Gly Gly Gly Leu Thr Asn Gly Tyr Asn LysGly Gly Gly Gly Ser Asn Gly Gly Gly Leu Thr Asn Gly Tyr Asn Lys

530 535 540530 535 540

Pro Glu Met Leu Asp Phe Ser Glu Leu Ser Gly Ser Pro Glu Ala SerPro Glu Met Leu Asp Phe Ser Glu Leu Ser Gly Ser Pro Glu Ala Ser

545 550 555 560545 550 555 560

Gly Asn Asn Asn His Met Arg Arg Gly Val Ser Asn Leu Asn Asn AsnGly Asn Asn Asn His Met Arg Arg Gly Val Ser Asn Leu Asn Asn Asn

565 570 575565 570 575

Gly Leu Ser Asn Gly Val Val Gly Ser Thr His Asn Gly Ser Thr AsnGly Leu Ser Asn Gly Val Val Gly Ser Thr His Asn Gly Ser Thr Asn

580 585 590580 585 590

Leu Asn Gly Ala Gly Asn Asn Asn Ser Ser Ser Gly Gly Gly Thr AlaLeu Asn Gly Ala Gly Asn Asn Asn Ser Ser Ser Gly Gly Gly Thr Ala

595 600 605595 600 605

Gln Asp Pro Leu Gly Ile Thr Thr Ser Pro Val Pro Ser Pro Leu GlyGln Asp Pro Leu Gly Ile Thr Thr Ser Pro Val Pro Ser Pro Leu Gly

610 615 620610 615 620

Cys Pro Ser Ser Pro Leu Pro Ile Pro Ile Pro Met Ser Ala Gln SerCys Pro Ser Ser Pro Leu Pro Ile Pro Ile Pro Met Ser Ala Gln Ser

625 630 635 640625 630 635 640

Ser Pro Gln Gln Gln His His His His Gln Gln Gln Gln Gln Gln HisSer Pro Gln Gln Gln His His His His Gln Gln Gln Gln Gln Gln His

645 650 655645 650 655

His Gln Gln Gln His His Gln Gln Gln Gln Leu Ser Leu Ser Leu HisHis Gln Gln Gln His His His Gln Gln Gln Gln Gln Leu Ser Leu Ser Leu His

660 665 670660 665 670

His Ser Pro His His Ser Pro Met His Ser Pro His His Gly Asn SerHis Ser Pro His His Ser Pro Met His Ser Pro His His Gly Asn Ser

675 680 685675 680 685

Pro Leu Ser Ser Ser Ser Pro Val Ser His Asn Ala Cys Ser Asn SerPro Leu Ser Ser Ser Ser Pro Val Ser His Asn Ala Cys Ser Asn Ser

690 695 700690 695 700

Asn Val Val Met Asn His Gln Gln Gln Gln Gln Gln His His His GlnAsn Val Val Met Asn His Gln Gln Gln Gln Gln Gln His His His Gln

705 710 715 720705 710 715 720

Gln His His His Gln Gly Ser Ser Gln Ser His Thr Pro Thr Thr AlaGln His His His Gln Gly Ser Ser Gln Ser His Thr Pro Thr Thr Ala

725 730 735725 730 735

Asn Ile Pro Ser Ile Ile Phe Ser Asp Tyr Ser Ser Asn Ala Asp TyrAsn Ile Pro Ser Ile Ile Phe Ser Asp Tyr Ser Ser Asn Ala Asp Tyr

740 745 750740 745 750

Thr Arg Glu Ile Phe Asp Ser Leu Asp Leu Asp Leu Gly Gln Met AspThr Arg Glu Ile Phe Asp Ser Leu Asp Leu Asp Leu Gly Gln Met Asp

755 760 765755 760 765

Val Ala Gly Leu Gln Met Leu Ser Asp Gln Asn Pro Ile Met Ile AlaVal Ala Gly Leu Gln Met Leu Ser Asp Gln Asn Pro Ile Met Ile Ala

770 775 780770 775 780

Asp Pro Asn Ile Glu Asp Ser Phe Arg Arg Asp Leu AsnAsp Pro Asn Ile Glu Asp Ser Phe Arg Arg Asp Leu Asn

785 790 795785 790 795

<210>28<210>28

<211>2416<211>2416

<212>DNA<212>DNA

<213>小鼠<213> mouse

<220><220>

<221>misc feature<221>misc feature

<222>1528<222>1528

<223>n ＝A，T，C或G<223>n = A, T, C or G

<400>28<400>28

gggacgaaga gtaggagtag gaggaggcgg cgagaagatg gcgacttcga acaatccgcg 60gggacgaaga gtaggagtag gaggaggcgg cgagaagatg gcgacttcga acaatccgcg 60

gaaatttagc gagaagatcg cactgcacaa ccagaagcag gcggaggaga cggcggcctt 120gaaatttagc gagaagatcg cactgcacaa ccagaagcag gcggaggaga cggcggcctt 120

cgaggaggtc atgaaggacc tgagcctgac gcgggccgcg cggcttcagc tgcagaagtc 180cgaggaggtc atgaaggacc tgagcctgac gcgggccgcg cggcttcagc tgcagaagtc 180

ccagtacctg cagctgggcc ccagccgtgg ccagtactac ggtgggtccc tgcccaacgt 240ccagtacctg cagctgggcc ccagccgtgg ccagtactac ggtgggtccc tgcccaacgt 240

gaaccagatt ggaagcagca gcgtggacct ggccttccag accccatttc agtcctcagg 300gaaccagatt ggaagcagca gcgtggacct ggccttccag accccatttc agtcctcagg 300

cctggacacg agtcggacca cacgacatca tgggcttgtg gacagagtat atcgtgagcg 360cctggacacg agtcggacca cacgacatca tgggcttgtg gacagagtat atcgtgagcg 360

tggcagactt ggctccccgc accgtcgacc cctgtcagta gacaagcatg ggcgacaggc 420tggcagactt ggctccccgc accgtcgacc cctgtcagta gacaagcatg ggcgacaggc 420

tgacagctgc ccctatggca ccgtgtacct ctcgcctcct gcggacacca gctggaggag 480tgacagctgc ccctatggca ccgtgtacct ctcgcctcct gcggaccacca gctggaggag 480

gaccaactct gactctgccc tgcaccagag cacaatgaca cccagccagg cagagtcctt 540gaccaactct gactctgccc tgcaccagag cacaatgaca cccagccagg cagagtcctt 540

cacaggcggg tcccaggatg cgcaccagaa gagagtctta ctgctaactg tcccaggaat 600cacaggcggg tcccaggatg cgcaccagaa gagagtctta ctgctaactg tcccaggaat 600

ggaggacacc ggggctgaga cagacaagac cctttctaag cagtcatggg actcaaagaa 660ggaggacacc ggggctgaga cagacaagac cctttctaag cagtcatggg actcaaagaa 660

ggcgggttcc aggcccaagt cctgtgaggt ccccggaatc aacatctttc cgtctgcaga 720ggcgggttcc aggcccaagt cctgtgaggt ccccggaatc aacatctttc cgtctgcaga 720

ccaggagaac acaacagccc tgatccctgc cacccacaac acagggggct cccttcctga 780ccaggagaac acaacagccc tgatccctgc cacccacaac acaggggggct cccttcctga 780

cctcaccaac atccacttcg cctccccact cccgacacca ctggaccctg aggagcctcc 840cctcaccaac atccacttcg cctccccact cccgacacca ctggaccctg aggagcctcc 840

gttccctgct ctcaccagct ccagcagcac cggcagcctt gcacatctgg gcgttggcgg 900gttccctgct ctcaccagct ccagcagcac cggcagcctt gcacatctgg gcgttggcgg 900

cgcaggcggt atgaacaccc ccagctcttc tccacagcac cggccagcag tcgtcagccc 960cgcaggcggt atgaacaccc ccagctcttc tccacagcac cggccagcag tcgtcagccc 960

cctgtccctg agcacagagg ccaggcggca gcaggcccag caggtgtcac ccaccctgtc 1020cctgtccctg agcacagagg ccaggcggca gcaggcccag caggtgtcac ccaccctgtc 1020

tccgttgtca cccatcactc aggccgtggc tatggatgcc ctgtccttgg agcagcagct 1080tccgttgtca cccatcactc aggccgtggc tatggatgcc ctgtccttgg agcagcagct 1080

gccctatgcc ttcttcaccc agactggctc ccagcagcct cccccacagc cccagccacc 1140gccctatgcc ttcttcaccc agactggctc ccagcagcct cccccacagc cccagccacc 1140

gcctccacct ccaccggtat cccagcagca gccaccacct ccacaggtgt ctgtgggcct 1200gcctccacct ccaccggtat cccagcagca gccaccacct ccacagggtgt ctgtggggcct 1200

cccccagggt ggtccactgc tgcccagtgc cagcctgact cgggggcccc agctgccacc 1260cccccagggt ggtccactgc tgcccagtgc cagcctgact cgggggcccc agctgccacc 1260

actctcagtt actgtaccat ccactcttcc ccagtcccct acagagaacc caggccagtc 1320actctcagtt actgtaccat ccactcttcc ccagtcccct acagagaacc caggccagtc 1320

accaatgggg atcgatgcca cttcggcacc agctctgcag taccgcacga gtgcagggtc 1380accaatgggg atcgatgcca cttcggcacc agctctgcag taccgcacga gtgcagggtc 1380

acctgccacc cagtctccca cctctccggt ctccaaccaa ggcttctccc ctggaagctc 1440acctgccacc cagtctccca cctctccggt ctccaaccaa ggcttctccc ctggaagctc 1440

cccacagcac acgtccaccc tgggcagcgt gtttggggat gcgtactatg agcagcagat 1500cccacagcac acgtccaccc tgggcagcgt gtttggggat gcgtactatg agcagcagat 1500

gacagccagg caggccaatg ctctgtcncg ccagctggag cagttcaaca tgatggagaa 1560gacagccagg caggccaatg ctctgtcncg ccagctggag cagttcaaca tgatggagaa 1560

cgccatcagc tccagcagcc tatacaaccc gggctccaca ctcaactatt cccaggctgc 1620cgccatcagc tccagcagcc tatacaaccc gggctccaca ctcaactatt cccaggctgc 1620

catgatgggt ctgagcggga gccacggggg cctacaggac ccgcagcagc tcggctacac 1680catgatgggt ctgagcggga gccacggggg cctacaggac ccgcagcagc tcggctacac 1680

aggccacggt ggaatcccca acatcatcct cacggtgaca ggagagtcac caccgagcct 1740aggccacggt ggaatcccca acatcatcct cacggtgaca ggagagtcac caccgagcct 1740

ctctaaggaa ctgagcagca cactggcagg agtcagtgat gtcagctttg attcggacca 1800ctctaaggaa ctgagcagca cactggcagg agtcagtgat gtcagctttg attcggacca 1800

tcagtttcca ctggacgagc tgaagattga ccctctgacc ctggacggac tccatatgtt 1860tcagtttcca ctggacgagc tgaagattga ccctctgacc ctggacggac tccatatgtt 1860

gaatgaccca gacatggttt tagccgaccc agccaccgag gacaccttcc gaatggaccg 1920gaatgaccca gacatggttt tagccgaccc agccaccgag gaacaccttcc gaatggaccg 1920

cctgtgagtg gctgtgccca ccagccgccg ctggtcagtc tccaacggcg ctgccccaaa 1980cctgtgagtg gctgtgccca ccagccgccg ctggtcagtc tccaacggcg ctgccccaaa 1980

cctggggacg gcaatggcgt ccccctttgc caacggccaa gcttgtggtt ctgagcttgc 2040cctggggacg gcaatggcgt ccccctttgc caacggccaa gcttgtggtt ctgagcttgc 2040

aatgctgccc agtgcccctg ccagcccccc gccaccccgg tcgttcacct cccatgatgc 2100aatgctgccc agtgcccctg ccagcccccc gccaccccgg tcgttcacct cccatgatgc 2100

ctggcgtgcg tgaggccgct gtgtactagg ctggctatct gtctgtccat ccatctacct 2160ctggcgtgcg tgaggccgct gtgtactagg ctggctatct gtctgtccat ccatctacct 2160

ggggtcaggc tgatggccga ggctgtgagt gcctggcccc catggatgtt ccccgtgctc 2220gggtcaggc tgatggccga ggctgtgagt gcctggcccc catggatgtt ccccgtgctc 2220

gctccctcac ccctcactgg ggatgtgaga gccctcatca gatacccaaa gtgtcactca 2280gctccctcac ccctcactgg ggatgtgaga gccctcatca gatacccaaa gtgtcactca 2280

cttccagcat gtgctgtgca acggagggcc ggggcgtggg tgtggagcgc ccagaggctt 2340cttccagcat gtgctgtgca acggagggcc ggggcgtggg tgtggagcgc ccagaggctt 2340

aggtgcgcca tccattcgac tgttgtcagc tgtcactgcc ttcctccatc ctgtcccccg 2400aggtgcgcca tccattcgac tgttgtcagc tgtcactgcc ttcctccatc ctgtcccccg 2400

tcccaccgcc atccct 2416tcccaccgcc atccct 2416

<210>29<210>29

<211>629<211>629

<212>PRT<212>PRT

<213>小鼠<213> mouse

<400>29<400>29

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

Leu Pro Asn Val Asn Gln Ile Gly Ser Ser Ser Val Asp Leu Ala PheLeu Pro Asn Val Asn Gln Ile Gly Ser Ser Ser Val Asp Leu Ala Phe

65 70 75 8065 70 75 80

Gln Thr Pro Phe Gln Ser Ser Gly Leu Asp Thr Ser Arg Thr Thr ArgGln Thr Pro Phe Gln Ser Ser Gly Leu Asp Thr Ser Arg Thr Thr Arg

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

Asp Ser Cys Pro Tyr Gly Thr Val Tyr Leu Ser Pro Pro Ala Asp ThrAsp Ser Cys Pro Tyr Gly Thr Val Tyr Leu Ser Pro Pro Ala Asp Thr

130 135 140130 135 140

145 150 155 160145 150 155 160

Thr Pro Ser Gln Ala Glu Ser Phe Thr Gly Gly Ser Gln Asp Ala HisThr Pro Ser Gln Ala Glu Ser Phe Thr Gly Gly Ser Gln Asp Ala His

165 170 175165 170 175

Gln Lys Arg Val Leu Leu Leu Thr Val Pro Gly Met Glu Asp Thr GlyGln Lys Arg Val Leu Leu Leu Thr Val Pro Gly Met Glu Asp Thr Gly

180 185 190180 185 190

Ala Glu Thr Asp Lys Thr Leu Ser Lys Gln Ser Trp Asp Ser Lys LysAla Glu Thr Asp Lys Thr Leu Ser Lys Gln Ser Trp Asp Ser Lys Lys

195 200 205195 200 205

Ala Gly Ser Arg Pro Lys Ser Cys Glu Val Pro Gly Ile Asn Ile PheAla Gly Ser Arg Pro Lys Ser Cys Glu Val Pro Gly Ile Asn Ile Phe

210 215 220210 215 220

225 230 235 240225 230 235 240

Asn Thr Gly Gly Ser Leu Pro Asp Leu Thr Asn Ile His Phe Ala SerAsn Thr Gly Gly Ser Leu Pro Asp Leu Thr Asn Ile His Phe Ala Ser

245 250 255245 250 255

Pro Leu Pro Thr Pro Leu Asp Pro Glu Glu Pro Pro Phe Pro Ala LeuPro Leu Pro Thr Pro Leu Asp Pro Glu Glu Pro Pro Phe Pro Ala Leu

260 265 270260 265 270

Thr Ser Ser Ser Ser Thr Gly Ser Leu Ala His Leu Gly Val Gly GlyThr Ser Ser Ser Ser Ser Thr Gly Ser Leu Ala His Leu Gly Val Gly Gly

275 280 285275 280 285

Ala Gly Gly Met Asn Thr Pro Ser Ser Ser Pro Gln His Arg Pro AlaAla Gly Gly Met Asn Thr Pro Ser Ser Ser Pro Gln His Arg Pro Ala

290 295 300290 295 300

Val Val Ser Pro Leu Ser Leu Ser Thr Glu Ala Arg Arg Gln Gln AlaVal Val Ser Pro Leu Ser Leu Ser Thr Glu Ala Arg Arg Gln Gln Ala

305 310 315 320305 310 315 320

Gln Gln Val Ser Pro Thr Leu Ser Pro Leu Ser Pro Ile Thr Gln AlaGln Gln Val Ser Pro Thr Leu Ser Pro Leu Ser Pro Ile Thr Gln Ala

325 330 335325 330 335

Val Ala Met Asp Ala Leu Ser LeuGlu Gln Gln Leu Pro Tyr Ala PheVal Ala Met Asp Ala Leu Ser LeuGlu Gln Gln Leu Pro Tyr Ala Phe

340 345 350340 345 350

Phe Thr Gln Thr Gly Ser Gln Gln Pro Pro Pro Gln Pro Gln Pro ProPhe Thr Gln Thr Gly Ser Gln Gln Pro Pro Pro Gln Pro Gln Pro Pro

355 360 365355 360 365

Pro Pro Pro Pro Pro Val Ser Gln Gln Gln Pro Pro Pro Pro Gln ValPro Pro Pro Pro Pro Val Ser Gln Gln Gln Pro Pro Pro Pro Pro Gln Val

370 375 380370 375 380

Ser Val Gly Leu Pro Gln Gly Gly Pro Leu Leu Pro Ser Ala Ser LeuSer Val Gly Leu Pro Gln Gly Gly Pro Leu Leu Pro Ser Ala Ser Leu

385 390 395 400385 390 395 400

Thr Arg Gly Pro Gln Leu Pro Pro Leu Ser Val Thr Val Pro Ser ThrThr Arg Gly Pro Gln Leu Pro Pro Leu Ser Val Thr Val Pro Ser Thr

405 410 415405 410 415

Leu Pro Gln Ser Pro Thr Glu Asn Pro Gly Gln Ser Pro Met Gly IleLeu Pro Gln Ser Pro Thr Glu Asn Pro Gly Gln Ser Pro Met Gly Ile

420 425 430420 425 430

Asp Ala Thr Ser Ala Pro Ala Leu Gln Tyr Arg Thr Ser Ala Gly SerAsp Ala Thr Ser Ala Pro Ala Leu Gln Tyr Arg Thr Ser Ala Gly Ser

435 440 445435 440 445

Pro Ala Thr Gln Ser Pro Thr Ser Pro Val Ser Asn Gln Gly Phe SerPro Ala Thr Gln Ser Pro Thr Ser Pro Val Ser Asn Gln Gly Phe Ser

450 455 460450 455 460

Pro Gly Ser Ser Pro Gln His Thr Ser Thr Leu Gly Ser Val Phe GlyPro Gly Ser Ser Pro Gln His Thr Ser Thr Leu Gly Ser Val Phe Gly

465 470 475 480465 470 475 480

Asp Ala Tyr Tyr Glu Gln Gln Met Thr Ala Arg Gln Ala Asn Ala LeuAsp Ala Tyr Tyr Glu Gln Gln Met Thr Ala Arg Gln Ala Asn Ala Leu

485 490 495485 490 495

Ser Arg Gln Leu Glu Gln Phe Asn Met Met Glu Asn Ala Ile Ser SerSer Arg Gln Leu Glu Gln Phe Asn Met Met Glu Asn Ala Ile Ser Ser

500 505 510500 505 510

Ser Ser Leu Tyr Asn Pro Gly Ser Thr Leu Asn Tyr Ser Gln Ala AlaSer Ser Leu Tyr Asn Pro Gly Ser Thr Leu Asn Tyr Ser Gln Ala Ala

515 520 525515 520 525

Met Met Gly Leu Ser Gly Ser His Gly Gly Leu Gln Asp Pro Gln GlnMet Met Gly Leu Ser Gly Ser His Gly Gly Leu Gln Asp Pro Gln Gln

530 535 540530 535 540

Leu Gly Tyr Thr Gly His Gly Gly Ile Pro Asn Ile Ile Leu Thr ValLeu Gly Tyr Thr Gly His Gly Gly Ile Pro Asn Ile Ile Leu Thr Val

545 550 555 560545 550 555 560

Thr Gly Glu Ser Pro Pro Ser Leu Ser Lys Glu Leu Ser Ser Thr LeuThr Gly Glu Ser Pro Pro Ser Leu Ser Lys Glu Leu Ser Ser Thr Leu

565 570 575565 570 575

Ala Gly Val Ser Asp Val Ser Phe Asp Ser Asp His Gln Phe Pro LeuAla Gly Val Ser Asp Val Ser Phe Asp Ser Asp His Gln Phe Pro Leu

580 585 590580 585 590

Asp Glu Leu Lys Ile Asp Pro Leu Thr Leu Asp Gly Leu His Met LeuAsp Glu Leu Lys Ile Asp Pro Leu Thr Leu Asp Gly Leu His Met Leu

595 600 605595 600 605

Asn Asp Pro Asp Met Val Leu Ala Asp Pro Ala Thr Glu Asp Thr PheAsn Asp Pro Asp Met Val Leu Ala Asp Pro Ala Thr Glu Asp Thr Phe

610 615 620610 615 620

Arg Met Asp Arg LeuArg Met Asp Arg Leu

625625

<210>30<210>30

<211>566<211>566

<212>PRT<212>PRT

<213>红鳍东方豚(fugu rubripres)<213>Red-finned oriental dolphin (fugu rubripres)

<400>30<400>30

Met Ala Ser Ser Asn Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala LeuMet Ala Ser Ser Asn Asn Pro Arg Lys Phe Ser Glu Lys Ile Ala Leu

1 5 10 151 5 10 15

20 25 3020 25 30

Lys Asp Leu Asn Val Thr Arg Ala Ala Arg Leu Gln Leu Gln Lys ThrLys Asp Leu Asn Val Thr Arg Ala Ala Arg Leu Gln Leu Gln Lys Thr

35 40 4535 40 45

Gln Tyr Leu Gln Leu Gly Gln Asn Arg Gly Gln Tyr Tyr Gly Gly SerGln Tyr Leu Gln Leu Gly Gln Asn Arg Gly Gln Tyr Tyr Gly Gly Ser

50 55 6050 55 60

Leu Pro Asn Val Asn Gln Ile Gly Asn Gly Asn Ile Asp Leu Pro PheLeu Pro Asn Val Asn Gln Ile Gly Asn Gly Asn Ile Asp Leu Pro Phe

65 70 75 8065 70 75 80

Gln Val Ser Asn Ser Val Leu Asp Thr Ser Arg Thr Thr Arg His HisGln Val Ser Asn Ser Val Leu Asp Thr Ser Arg Thr Thr Arg His His

85 90 9585 90 95

Gly Leu Val Glu Arg Val Tyr Arg Asp Arg Asn Arg Ile Ser Ser ProGly Leu Val Glu Arg Val Tyr Arg Asp Arg Asn Arg Ile Ser Ser Pro

100 105 110100 105 110

His Arg Arg Pro Leu Ser Val Asp Lys His Gly Arg Gln Arg Thr AsnHis Arg Arg Pro Leu Ser Val Asp Lys His Gly Arg Gln Arg Thr Asn

115 120 125115 120 125

Ser Asp Ser Ala Leu His Gln Ser Ala Met Asn Pro Lys Pro His GluSer Asp Ser Ala Leu His Gln Ser Ala Met Asn Pro Lys Pro His Glu

130 135 140130 135 140

Val Phe Ala Gly Gly Ser Gln Glu Leu Gln Pro Lys Arg Leu Leu LeuVal Phe Ala Gly Gly Ser Gln Glu Leu Gln Pro Lys Arg Leu Leu Leu

145 150 155 160145 150 155 160

Thr Val Pro Gly Thr Glu Lys Ser Glu Ser Asn Ala Asp Lys Asp SerThr Val Pro Gly Thr Glu Lys Ser Glu Ser Asn Ala Asp Lys Asp Ser

165 170 175165 170 175

Gln Glu Gln Ser Trp Asp Asp Lys Lys Ser Ile Phe Pro Ser Pro AspGln Glu Gln Ser Trp Asp Asp Lys Lys Ser Ile Phe Pro Ser Pro Asp

180 185 190180 185 190

Gln Glu Leu Asn Pro Ser Val Leu Pro Ala Ala His Asn Thr Gly GlyGln Glu Leu Asn Pro Ser Val Leu Pro Ala Ala His Asn Thr Gly Gly

195 200 205195 200 205

Ser Leu Pro Asp Leu Thr Asn Ile Gln Phe Pro Pro Pro Leu Ser ThrSer Leu Pro Asp Leu Thr Asn Ile Gln Phe Pro Pro Pro Leu Ser Thr

210 215 220210 215 220

Pro Leu Asp Pro Glu Asp Thr Val Thr Phe Pro Ser Leu Ser Ser SerPro Leu Asp Pro Glu Asp Thr Val Thr Phe Pro Ser Leu Ser Ser Ser Ser

225 230 235 240225 230 235 240

Asn Ser Thr Gly Ser Leu Thr Thr Asn Leu Thr His Leu Gly Ile SerAsn Ser Thr Gly Ser Leu Thr Thr Asn Leu Thr His Leu Gly Ile Ser

245 250 255245 250 255

Val Ala Ser His Gly Asn Asn Gly Glu Lys Asn Ile Phe Phe Leu LysVal Ala Ser His Gly Asn Asn Gly Glu Lys Asn Ile Phe Phe Leu Lys

260 265 270260 265 270

Thr Cys Thr Ser Cys Glu Asp Val Tyr Asp Phe Tyr Phe Val Gly IleThr Cys Thr Ser Cys Glu Asp Val Tyr Asp Phe Tyr Phe Val Gly Ile

275 280 285275 280 285

Pro Thr Ser Ser Gln Thr Thr Met Thr Ala Thr Ala Gln Arg Arg GlnPro Thr Ser Ser Gln Thr Thr Met Thr Ala Thr Ala Gln Arg Arg Gln

290 295 300290 295 300

Pro Pro Val Val Pro Leu Thr Leu Thr Ser Asp Leu Thr Leu Gln GlnPro Pro Val Val Pro Leu Thr Leu Thr Ser Asp Leu Thr Leu Gln Gln

305 310 315 320305 310 315 320

Ser Pro Gln Gln Leu Ser Pro Thr Leu Ser Ser Pro Ile Asn Ile ThrSer Pro Gln Gln Leu Ser Pro Thr Leu Ser Ser Pro Ile Asn Ile Thr

325 330 335325 330 335

Gln Ser Met Lys Leu Ser Ala Ser Ser Leu Gln Gln Tyr Arg Asn GlnGln Ser Met Lys Leu Ser Ala Ser Ser Ser Leu Gln Gln Tyr Arg Asn Gln

340 345 350340 345 350

Thr Gly Ser Pro Ala Thr Gln Ser Pro Thr Ser Pro Val Ser Asn GlnThr Gly Ser Pro Ala Thr Gln Ser Pro Thr Ser Pro Val Ser Asn Gln

355 360 365355 360 365

Gly Phe Ser Pro Gly Ser Ser Pro Gln Pro Gln His Ile Pro Val ValGly Phe Ser Pro Gly Ser Ser Ser Pro Gln Pro Gln His Ile Pro Val Val

370 375 380370 375 380

Gly Ser Ile Phe Gly Asp Ser Phe Tyr Asp Gln Gln Leu Ala Leu ArgGly Ser Ile Phe Gly Asp Ser Phe Tyr Asp Gln Gln Leu Ala Leu Arg

385 390 395 400385 390 395 400

Gln Thr Asn Ala Leu Ser His Gln Val Cys Glu Asp Gly Arg Arg LeuGln Thr Asn Ala Leu Ser His Gln Val Cys Glu Asp Gly Arg Arg Leu

405 410 415405 410 415

Glu Ile Thr His Val Arg Leu Ser Arg Leu His Ala Glu Leu Cys PheGlu Ile Thr His Val Arg Leu Ser Arg Leu His Ala Glu Leu Cys Phe

420 425 430420 425 430

Cys Phe Ser Gln Leu Glu Gln Phe Asn Met Ile Glu Asn Pro Ile SerCys Phe Ser Gln Leu Glu Gln Phe Asn Met Ile Glu Asn Pro Ile Ser

435 440 445435 440 445

Ser Thr Ser Leu Tyr Asn Gln Cys Ser Thr Leu Asn Tyr Thr Gln AlaSer Thr Ser Leu Tyr Asn Gln Cys Ser Thr Leu Asn Tyr Thr Gln Ala

450 455 460450 455 460

Ala Met Met Gly Leu Thr Gly Ser Ser Leu Gln Asp Ser Gln Gln LeuAla Met Met Gly Leu Thr Gly Ser Ser Leu Gln Asp Ser Gln Gln Leu

465 470 475 480465 470 475 480

Gly Tyr Gly Asn His Gly Asn Ile Pro Asn Ile Ile Leu Thr Ile SerGly Tyr Gly Asn His Gly Asn Ile Pro Asn Ile Ile Leu Thr Ile Ser

485 490 495485 490 495

Val Thr Gly Glu Ser Pro Pro Ser Leu Ser Lys Glu Leu Thr Asn SerVal Thr Gly Glu Ser Pro Pro Ser Leu Ser Lys Glu Leu Thr Asn Ser

500 505 510500 505 510

Leu Ala Gly Val Gly Asp Val Ser Phe Asp Pro Asp Thr Gln Phe ProLeu Ala Gly Val Gly Asp Val Ser Phe Asp Pro Asp Thr Gln Phe Pro

515 520 525515 520 525

Leu Asp Glu Leu Lys Ile Asp Pro Leu Thr Leu Asp Gly Leu His MetLeu Asp Glu Leu Lys Ile Asp Pro Leu Thr Leu Asp Gly Leu His Met

530 535 540530 535 540

Leu Asn Asp Pro Asp Met Val Leu Ala Asp Pro Ala Thr Glu Asp ThrLeu Asn Asp Pro Asp Met Val Leu Ala Asp Pro Ala Thr Glu Asp Thr

545 550 555 560545 550 555 560

Phe Arg Met Asp Arg LeuPhe Arg Met Asp Arg Leu

565565

<210>31<210>31

<211>1602<211>1602

<212>DNA<212>DNA

<213>红鳍东方豚<213> Red-finned oriental dolphin

<400>31<400>31

atggcgtcct ctaacaatcc tcgcaaattt agcgaaaaaa tcgcac tgca taaccagaaa 60atggcgtcct ctaacaatcc tcgcaaattt agcgaaaaaa tcgcac tgca taaccagaaa 60

caagcagagg agactgctgc gttcgaagaa gtgatgaagg acctgaacgt cacaagggct 120caagcagagg agactgctgc gttcgaagaa gtgatgaagg acctgaacgt cacaagggct 120

gcccgggtaa gacagctgca gttacagaag acccagtatt tgcaactagg gcagaatcgt 180gcccgggtaa gacagctgca gttacagaag accccagtatt tgcaactagg gcagaatcgt 180

ggacagtact atggaggctc actgcccaat gtcaatcaga ttggaaatgg caacattgac 240ggacagtact atggaggctc actgcccaat gtcaatcaga ttggaaatgg caacattgac 240

ctgccttttc aggtgagcag gacaaactca gactcagctt tacatcagag tgccatgaat 300ctgccttttc aggtgagcag gacaaactca gactcagctt tacatcagag tgccatgaat 300

ccaaagcccc acgaagtgtt tgctgggggg tcgcaggagc tgcagcccaa acgactgctg 360ccaaagcccc acgaagtgtt tgctgggggg tcgcaggagc tgcagcccaa acgactgctg 360

ctaacagtgc ctggaaccga aaaatcggaa tcaaacgcag acaaagattc gcaggagcag 420ctaacagtgc ctggaaccga aaaatcggaa tcaaacgcag acaaagattc gcaggagcag 420

tcgtgggatg acaaaaagag tatttttcca tcaccagacc aggagttaaa cccctccgtg 480tcgtgggatg acaaaaagag tatttttcca tcaccagacc aggagttaaa cccctccgtg 480

cttccagccg cgcacaacac cggcggttcg ctccccgacc tgaccaacat ccagttccct 540cttccagccg cgcacaacac cggcggttcg ctccccgacc tgaccaacat ccagttccct 540

cctccactgt ccaccccact ggaccccgag gacaccgtca ccttcccctc cctcagctcc 600cctccactgt ccaccccact ggaccccgag gacaccgtca ccttcccctc cctcagctcc 600

tctaacagca caggcagtct gactaccaac ctcacccacc tgggcatcag tgtggccagc 660tctaacagca caggcagtct gactaccaac ctcacccacc tgggcatcag tgtggccagc 660

catggtaata acggagagaa aaatatattt tttttaaaaa catgcacttc atgcgaggat 720catggtaata acggagagaa aaatatattt tttttaaaaa catgcacttc atgcgaggat 720

gttaaataat attacgactt ttattttgta gggattccca cttcctctca aaccaccatg 780gttaaataat attacgactt ttattttgta gggattccca cttcctctca aaccaccatg 780

acagcaacag cacagcggcg gcaaccaccc gtggtccccc tcaccctcac ctctgacctg 840acagcaacag cacagcggcg gcaaccaccc gtggtccccc tcaccctcac ctctgacctg 840

actcttcaac agtcccccca gcagctttca cccaccctct cctcacccat taacatcaca 900actcttcaac agtcccccca gcagctttca cccaccctct cctcacccat taacatcaca 900

cagagcatga agcttagtgc tagctaacat tcttccctcc aacagtaccg caatcagact 960cagagcatga agcttagtgc tagctaacat tcttccctcc aacagtaccg caatcagact 960

ggctcaccag ccactcagtc tccaacctcc ccagtctcca atcaaggctt ctcccccggc 1020ggctcaccag ccactcagtc tccaacctcc ccagtctcca atcaaggctt ctcccccggc 1020

agctcgcctc aaccacagca cattcctgtg gtgggcagta tatttgggga ctccttctat 1080agctcgcctc aaccacagca cattcctgtg gtgggcagta tatttgggga ctccttctat 1080

gatcagcagt tggctctgag gcagaccaat gccctttctc atcaggtgtg tgaggacggc 1140gatcagcagt tggctctgag gcagaccaat gccctttctc atcaggtgtg tgaggacggc 1140

cgcaggttag aaataacaca cgtacgtctc tcacgacttc acgccgagct ttgtttttgt 1200cgcaggttag aaataacaca cgtacgtctc tcacgacttc acgccgagct ttgtttttgt 1200

ttttctcagc tggagcagtt caatatgata gagaacccca tcagctccac cagcctgtac 1260ttttctcagc tggagcagtt caatatgata gagaacccca tcagctccac cagcctgtac 1260

aatcagtgct ccacccttaa ttacacacag gcagccatga tgggcctcac cgggagcagc 1320aatcagtgct ccacccttaa ttacacacag gcagccatga tgggcctcac cgggagcagc 1320

ctgcaggact cgcagcagct cggctacggc aatcacggca acatccccaa catcatactg 1380ctgcaggact cgcagcagct cggctacggc aatcacggca acatccccaa catcatactg 1380

acaatttcag tcacagggga gtctccgccg agcctctcca aagagctgac caactcattg 1440acaatttcag tcacaggggga gtctccgccg agcctctcca aagagctgac caactcattg 1440

gccggcgtcg gcgacgtcag ctttgatcca gacacgcagt ttcctctgga cgagctgaag 1500gccggcgtcg gcgacgtcag ctttgatcca gacacgcagt ttcctctgga cgagctgaag 1500

atcgacccgc tgaccttgga cggcctgcac atgctcaacg acccagacat ggtgctggca 1560atcgacccgc tgaccttgga cggcctgcac atgctcaacg accccagacat ggtgctggca 1560

gaccccgcca cagaggacac gttcaggatg gacaggctgt aa 1602gaccccgcca cagaggacac gttcaggatg gacaggctgt aa 1602

<210>32<210>32

<211>170<211>170

<212>PRT<212>PRT

<213>人<213> people

<400>32<400>32

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

Ser Trp Arg Arg Thr Asn Ser Asp Ser AlaSer Trp Arg Arg Thr Asn Ser Asp Ser Ala

165 170165 170

<210>33<210>33

<211>356<211>356

<212>PRT<212>PRT

<213>人<213> people

<400>33<400>33

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

Gln Ala Val AlaGln Ala Val Ala

355355

<210>34<210>34

<211>494<211>494

<212>PRT<212>PRT

<213>人<213> people

<400>34<400>34

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

405 410 415405 410 415

420 425 430420 425 430

435 440 445435 440 445

450 455 460450 455 460

465 470 475 480465 470 475 480

Asn Gln Gly Phe Ser Pro Gly Ser Ser Pro Gln His Thr SerAsn Gln Gly Phe Ser Pro Gly Ser Ser Pro Gln His Thr Ser

485 490485 490

<210>35<210>35

<211>580<211>580

<212>PRT<212>PRT

<213>人<213> people

<400>35<400>35

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

405 410 415405 410 415

420 425 430420 425 430

435 440 445435 440 445

450 455 460450 455 460

465 470 475 480465 470 475 480

485 490 495485 490 495

500 505 510500 505 510

515 520 525515 520 525

530 535 540530 535 540

545 550 555 560545 550 555 560

565 570 575565 570 575

Ile Ile Leu ThrIle Ile Leu Thr

580580

<210>36<210>36

<211>481<211>481

<212>PRT<212>PRT

<213>人<213> people

<400>36<400>36

Ala Leu His Gln Ser Thr Met Thr Pro Thr Gln Pro Glu Ser Phe SerAla Leu His Gln Ser Thr Met Thr Pro Thr Gln Pro Glu Ser Phe Ser

1 5 10 151 5 10 15

Ser Gly Ser Gln Asp Val His Gln Lys Arg Val Leu Leu Leu Thr ValSer Gly Ser Gln Asp Val His Gln Lys Arg Val Leu Leu Leu Thr Val

20 25 3020 25 30

Pro Gly Met Glu Glu Thr Thr Ser Glu Ala Asp Lys Asn Leu Ser LysPro Gly Met Glu Glu Thr Thr Ser Glu Ala Asp Lys Asn Leu Ser Lys

35 40 4535 40 45

Gln Ala Trp Asp Thr Lys Lys Thr Gly Ser Arg Pro Lys Ser Cys GluGln Ala Trp Asp Thr Lys Lys Thr Gly Ser Arg Pro Lys Ser Cys Glu

50 55 6050 55 60

Val Pro Gly Ile Asn Ile Phe Pro Ser Ala Asp Gln Glu Asn Thr ThrVal Pro Gly Ile Asn Ile Phe Pro Ser Ala Asp Gln Glu Asn Thr Thr

65 70 75 8065 70 75 80

Ala Leu Ile Pro Ala Thr His Asn Thr Gly Gly Ser Leu Pro Asp LeuAla Leu Ile Pro Ala Thr His Asn Thr Gly Gly Ser Leu Pro Asp Leu

85 90 9585 90 95

Thr Asn Ile His Phe Pro Ser Pro Leu Pro Thr Pro Leu Asp Pro GluThr Asn Ile His Phe Pro Ser Pro Leu Pro Thr Pro Leu Asp Pro Glu

100 105 110100 105 110

Glu Pro Thr Phe Pro Ala Leu Ser Ser Ser Ser Ser Thr Gly Asn LeuGlu Pro Thr Phe Pro Ala Leu Ser Ser Ser Ser Ser Ser Thr Gly Asn Leu

115 120 125115 120 125

Ala Ala Asn Leu Thr His Leu Gly Ile Gly Gly Ala Gly Gln Gly MetAla Ala Asn Leu Thr His Leu Gly Ile Gly Gly Ala Gly Gln Gly Met

130 135 140130 135 140

Ser Thr Pro Gly Ser Ser Pro Gln His Arg Pro Ala Gly Val Ser ProSer Thr Pro Gly Ser Ser Pro Gln His Arg Pro Ala Gly Val Ser Pro

145 150 155 160145 150 155 160

Leu Ser Leu Ser Thr Glu Ala Arg Arg Gln Gln Ala Ser Pro Thr LeuLeu Ser Leu Ser Thr Glu Ala Arg Arg Gln Gln Ala Ser Pro Thr Leu

165 170 175165 170 175

Ser Pro Leu Ser Pro Ile Thr Gln Ala Val Ala Met Asp Ala Leu SerSer Pro Leu Ser Pro Ile Thr Gln Ala Val Ala Met Asp Ala Leu Ser

180 185 190180 185 190

Leu Glu Gln Gln Leu Pro Tyr Ala Phe Phe Thr Gln Ala Gly Ser GlnLeu Glu Gln Gln Leu Pro Tyr Ala Phe Phe Thr Gln Ala Gly Ser Gln

195 200 205195 200 205

Gln Pro Pro Pro Gln Pro Gln Pro Pro Pro Pro Pro Pro Pro Ala SerGln Pro Pro Pro Gln Pro Gln Pro Pro Pro Pro Pro Pro Pro Pro Ala Ser

210 215 220210 215 220

Gln Gln Pro Pro Pro Pro Pro Pro Pro Gln Ala Pro Va lArg Leu ProGln Gln Pro Pro Pro Pro Pro Pro Pro Pro Gln Ala Pro Va lArg Leu Pro

225 230 235 240225 230 235 240

Pro Gly Gly Pro Leu Leu Pro Ser Ala Ser Leu Thr Arg Gly Pro GlnPro Gly Gly Pro Leu Leu Pro Ser Ala Ser Leu Thr Arg Gly Pro Gln

245 250 255245 250 255

Pro Pro Pro Leu Ala Val Thr Val Pro Ser Ser Leu Pro Gln Ser ProPro Pro Pro Leu Ala Val Thr Val Pro Ser Ser Leu Pro Gln Ser Pro

260 265 270260 265 270

Pro Glu Asn Pro Gly Gln Pro Ser Met Gly Ile Asp Ile Ala Ser AlaPro Glu Asn Pro Gly Gln Pro Ser Met Gly Ile Asp Ile Ala Ser Ala

275 280 285275 280 285

Pro Ala Leu Gln Gln Tyr Arg Thr Ser Ala Gly Ser Pro Ala Asn GlnPro Ala Leu Gln Gln Tyr Arg Thr Ser Ala Gly Ser Pro Ala Asn Gln

290 295 300290 295 300

Ser Pro Thr Ser Pro Val Ser Asn Gln Gly Phe Ser Pro Gly Ser SerSer Pro Thr Ser Pro Val Ser Asn Gln Gly Phe Ser Pro Gly Ser Ser

305 310 315 320305 310 315 320

Pro Gln His Thr Ser Thr Leu Gly Ser Val Phe Gly Asp Ala Tyr TyrPro Gln His Thr Ser Thr Leu Gly Ser Val Phe Gly Asp Ala Tyr Tyr

325 330 335325 330 335

Glu Gln Gln Met Ala Ala Arg Gln Ala Asn Ala Leu Ser His Gln LeuGlu Gln Gln Met Ala Ala Arg Gln Ala Asn Ala Leu Ser His Gln Leu

340 345 350340 345 350

Glu Gln Phe Asn Met Met Glu Asn Ala Ile Ser Ser Ser Ser Leu TyrGlu Gln Phe Asn Met Met Glu Asn Ala Ile Ser Ser Ser Ser Leu Tyr

355 360 365355 360 365

Ser Pro Gly Ser Thr Leu Asn Tyr Ser Gln Ala Ala Met Met Gly LeuSer Pro Gly Ser Thr Leu Asn Tyr Ser Gln Ala Ala Met Met Gly Leu

370 375 380370 375 380

Thr Gly Ser His Gly Ser Leu Pro Asp Ser Gln Gln Leu Gly Tyr AlaThr Gly Ser His Gly Ser Leu Pro Asp Ser Gln Gln Leu Gly Tyr Ala

385 390 395 400385 390 395 400

Ser His Ser Gly Ile Pro Asn Ile Ile Leu Thr Val Thr Gly Glu SerSer His Ser Gly Ile Pro Asn Ile Ile Leu Thr Val Thr Gly Glu Ser

405 410 415405 410 415

Pro Pro Ser Leu Ser Lys Glu Leu Thr Ser Ser Leu Ala Gly Val GlyPro Pro Ser Leu Ser Lys Glu Leu Thr Ser Ser Leu Ala Gly Val Gly

420 425 430420 425 430

Asp Val Ser Phe Asp Ser Asp Ser Gln Phe Pro Leu Asp Glu Leu LysAsp Val Ser Phe Asp Ser Asp Ser Gln Phe Pro Leu Asp Glu Leu Lys

435 440 445435 440 445

Ile Asp Pro Leu Thr Leu Asp Gly Leu His Met Leu Asn Asp Pro AspIle Asp Pro Leu Thr Leu Asp Gly Leu His Met Leu Asn Asp Pro Asp

450 455 460450 455 460

Met Val Leu Ala Asp Pro Ala Thr Glu Asp Thr Phe Arg Met Asp ArgMet Val Leu Ala Asp Pro Ala Thr Glu Asp Thr Phe Arg Met Asp Arg

465 470 475 480465 470 475 480

LeuLeu

<210>37<210>37

<211>30<211>30

<212>DNA<212>DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>37<400>37

caacatggcc aatccgcgca agttcagcga 30caacatggcc aatccgcgca agttcagcga 30

<210>38<210>38

<211>29<211>29

<212>DNA<212> DNA

<213>人工序列<213> Artificial sequence

<220><220>

<223>引物<223> Primer

<400>38<400>38

tcagttgagg tcgcgtcgaa aactatcct 29tcagttgagg tcgcgtcgaa aactatcct 29

<210>39<210>39

<211>62<211>62

<212>DNA<212>DNA

<213>黑腹果蝇<213> Drosophila melanogaster

<400>39<400>39

ggagcctggc gtcagagagc ctggcgtcag agagcctggc gtcagagagc ctggcgtcag 60ggagcctggc gtcagagagc ctggcgtcag agagcctggc gtcagagagc ctggcgtcag 60

ag 62ag 62

Claims

1. the method for the pathological condition that prevention, treatment or alleviation are relevant with activation of CRE-dependent gene abnormal expression or chemokine abnormal activation, it comprises that the CREAP regulon with significant quantity is applied to the experimenter who needs it.

2. the process of claim 1 wherein that described pathological condition is a neurodegenerative disease.

3. the process of claim 1 wherein that described pathological condition is an autoimmune disease.

4. the process of claim 1 wherein that described pathological condition is an inflammatory diseases.

5. the process of claim 1 wherein that described pathological condition is selected from Alzheimer, Parkinson's disease, Huntington Chorea, osteoarthritis, psoriatic, asthma, COPD, rheumatoid arthritis, cancer, diabetes, hypertension and chronic pain.

6. the process of claim 1 wherein that described CREAP regulon suppresses the CREAP activity of proteins of any one or the multiple CREAP1 of being selected from, CREAP2 or CREAP3.

7. the method for claim 6, wherein said CREAP regulon comprises one or more at the proteinic antibody of CREAP or its fragment, wherein said antibody or its fragment can suppress described CREAP activity of proteins.

8. the method for claim 6, wherein said regulon comprises proteinic one or more peptide mimicses of CREAP, wherein said peptide mimics can suppress described CREAP activity of proteins.

9. the process of claim 1 wherein that described CREAP regulon suppresses to be selected from any one or the multiple CREAP protein expression of CREAP1, CREAP2 or CREAP3.

10. the method for claim 9, wherein said CREAP regulon comprises that one or more are selected from antisense oligonucleotide, triple helix DNA, ribozyme, RNA is fit and the material of double-stranded or single stranded RNA, wherein said agents design is for suppressing the CREAP protein expression.

11. prevention, treatment or alleviation activate with CRE-dependent gene abnormal expression or the method for the pathological condition that the chemokine abnormal activation is relevant, it comprises that the pharmaceutical composition that will comprise significant quantity CREAP regulon is applied to the experimenter who needs it.

12. the method for claim 11, wherein said pathological condition is a neurodegenerative disease.

13. the method for claim 11, wherein said pathological condition is an autoimmune disease.

14. the method for claim 11, wherein said pathological condition is an inflammatory diseases.

15. the method for claim 11, wherein said pathological condition is selected from Alzheimer, Parkinson's disease, Huntington Chorea, osteoarthritis, psoriatic, asthma, COPD, rheumatoid arthritis, cancer, diabetes, hypertension and chronic pain.

16. the method for claim 11, wherein said CREAP regulon suppress the CREAP activity of proteins of any one or the multiple CREAP1 of being selected from, CREAP2 or CREAP3.

17. the method for claim 11, wherein said CREAP regulon comprise one or more at the proteinic antibody of CREAP or its fragment, wherein said antibody or its fragment can suppress described CREAP activity of proteins.

18. the method for claim 11, wherein said CREAP regulon comprise proteinic one or more peptide mimicses of CREAP, wherein said peptide mimics can suppress described CREAP activity of proteins.

19. the method for claim 11, wherein said CREAP regulon suppress to be selected from any one or the multiple CREAP protein expression of CREAP1, CREAP2 or CREAP3.

20. the method for claim 19, wherein said CREAP regulon comprises that one or more are selected from antisense oligonucleotide, triple helix DNA, ribozyme, RNA is fit and the material of double-stranded or single stranded RNA, and wherein said agents design is for suppressing the CREAP protein expression.

21. evaluation is used to prevent, treat or alleviation activates with CRE-dependent gene abnormal expression or the method for the regulon of the pathological condition that the chemokine abnormal activation is relevant, it comprises the ability that candidate's regulon suppresses the CREAP protein active of measuring.

22. the method for claim 21, wherein said CREAP protein is selected from CREAP1, CREAP2 or CREAP3.

23. the method for claim 21, wherein said method comprise that further CREAP that mensuration identifies suppresses regulon and reverses in the body of external, the earlier external back of described pathological condition or in the body inner model and/or suffer from the ability of viewed pathologic effect in experimenter's the clinical study of described pathological condition.

24. the method for claim 21, wherein said pathological condition is a neurodegenerative disease.

25. the method for claim 21, wherein said pathological condition is an autoimmune disease.

26. the method for claim 21, wherein said pathological condition is an inflammatory diseases.

27. the method for claim 21, wherein said pathological condition is selected from Alzheimer, Parkinson's disease, Huntington Chorea, osteoarthritis, psoriatic, asthma, COPD, rheumatoid arthritis, cancer, diabetes, hypertension and chronic pain.

28. evaluation is used to prevent, treat or alleviation activates with CRE-dependent gene abnormal expression or the method for the regulon of the pathological condition that unusual chemokine activation is relevant, it comprises the ability of measuring candidate's regulon inhibition CREAP protein expression.

29. the method for claim 28, wherein said CREAP protein is selected from CREAP1, CREAP2 or CREAP3.

30. the method for claim 28, wherein said method comprise that further CREAP that mensuration identifies suppresses regulon and reverses in the body of external, the earlier external back of described pathological condition or in the body inner model and/or suffer from the ability of viewed pathologic effect in experimenter's the clinical study of described pathological condition.

31. the method for claim 28, wherein said pathological condition is a neurodegenerative disease.

32. the method for claim 28, wherein said pathological condition is an autoimmune disease.

33. the method for claim 28, wherein said pathological condition is an inflammatory diseases.

34. the method for claim 28, wherein said pathological condition is selected from Alzheimer, Parkinson's disease, Huntington Chorea, osteoarthritis, psoriatic, asthma, COPD, rheumatoid arthritis, cancer, diabetes, hypertension and chronic pain.

35. pharmaceutical composition, its one or more CREAP regulon that comprise significant quantity activate or the relevant pathological condition of unusual chemokine activation with CRE-dependent gene abnormal expression to prevent, to treat or to alleviate among the patient who needs it.

36. according to the pharmaceutical composition of claim 35, wherein said pathological condition is a neurodegenerative disease.

37. according to the pharmaceutical composition of claim 35, wherein said pathological condition is an autoimmune disease.

38. according to the pharmaceutical composition of claim 35, wherein said pathological condition is an inflammatory diseases.

39. according to the pharmaceutical composition of claim 35, wherein said pathological condition is selected from Alzheimer, Parkinson's disease, Huntington Chorea, osteoarthritis, psoriatic, asthma, COPD, rheumatoid arthritis, cancer, diabetes, hypertension and chronic pain.

40. according to the pharmaceutical composition of claim 35, wherein said CREAP regulon suppresses to be selected from any one or the multiple CREAP activity of proteins of CREAP1, CREAP2 or CREAP3.

41. the pharmaceutical composition of claim 40, wherein said CREAP regulon comprise one or more at the proteinic antibody of CREAP or its fragment, wherein said antibody or its fragment can suppress described CREAP activity of proteins.

42. the pharmaceutical composition of claim 40, wherein said CREAP regulon comprise proteinic one or more peptide mimicses of CREAP, wherein said peptide mimics can suppress described CREAP activity of proteins.

43. according to the pharmaceutical composition of claim 35, wherein said CREAP regulon suppresses to be selected from any one or the multiple CREAP protein expression of CREAP1, CREAP2 or CREAP3.

44. the pharmaceutical composition of claim 43, wherein said CREAP regulon comprises that one or more are selected from antisense oligonucleotide, triple helix DNA, ribozyme, RNA is fit and the material of double-stranded or single stranded RNA, and wherein said agents design is for suppressing CREAP genetic expression.

45. diagnosis suffers from and activates with CRE-dependent gene abnormal expression or unusual chemokine activates the related pathologies situation and may be to use the experimenter's of the suitable candidate that the CREAP regulon treats method, it comprises mensuration from the proteinic mRNA level of CREAP in described experimenter's the biological sample, and the experimenter who wherein has the mRNA level of increase compared with the control is the suitable candidate that carries out the treatment of CREAP regulon.

46. the method for claim 45, wherein said CREAP protein is selected from CREAP1, CREAP2 or CREAP3.

47. suffering from, diagnosis activates with CRE-dependent gene abnormal expression or chemokine abnormal activation related pathologies situation and may be to use the experimenter's of the suitable candidate that the CREAP regulon treats method, this method comprises detection from the proteinic level of CREAP in described experimenter's the biological sample, and the experimenter who wherein has elevated levels compared with the control is the suitable candidate that carries out the treatment of CREAP regulon.

48. the method for claim 47, wherein said CREAP protein is selected from CREAP1, CREAP2 or CREAP3.

49. prevention, treatment or alleviation and CRE-dependent gene abnormal expression activate or the method for chemokine abnormal activation related pathologies situation, it comprises:

(a) measure CREAP mRNA and/or protein level in the subject; With,

(b) will be enough to prevent, treat or the CREAP regulon of alleviating described pathological condition is applied to the CREAP mRNA that has elevated levels compared with the control and/or the experimenter of protein level.

50. the method for claim 49, wherein said pathological condition is a neurodegenerative disease.

51. the method for claim 49, wherein said pathological condition is an autoimmune disease.

52. the method for claim 49, wherein said pathological condition is an inflammatory diseases.

53. the method for claim 49, wherein said pathological condition is selected from Alzheimer, Parkinson's disease, Huntington Chorea, osteoarthritis, psoriatic, asthma, COPD, rheumatoid arthritis, cancer, diabetes, hypertension and chronic pain.

54. be used for the diagnostic kit of proteinic mRNA level of detection of biological sample CREAP and/or protein level, described test kit comprises:

(a) polynucleotide of CREAP or its fragment;

(b) with (a) in polynucleotide complementary nucleotide sequence;

(c) CREAP polypeptide, or its fragment;

(d) at the antibody of CREAP polypeptide; Or

(e) the proteinic peptide mimics of CREAP

Wherein component (a) and (b), (c), (d) or (e) can comprise etc. congruent.

55. the method for claim 54, wherein said CREAP protein is selected from CREAP1, CREAP2 or CREAP3.

56. isolated polypeptide, it comprises and is selected from SEQ ID NOs:2,16 and 25 CREAP aminoacid sequence.

57. isolated nucleic acid sequences, it comprises the nucleotide sequence of the polypeptide of the claim 56 of encoding.

58. isolated polypeptide, its by be selected from SEQ ID NOs:2,16 and 25 CREAP aminoacid sequence is formed.

59. isolated nucleic acid sequences, it comprises the nucleotide sequence of the polypeptide of the claim 58 of encoding.

60. isolating CREAP polypeptide, it is by the CREAP genes encoding of organism.

61. separated DNA, it comprises the nucleotide sequence of the CREAP polypeptide of the claim 60 of encoding.

62. carrier molecule comprises the fragment according to the isolating nucleic acid of claim 57.

63. according to the carrier molecule of claim 62, it comprises any one or multiple transcriptional control sequence.

64. host cell, it comprises the carrier molecule according to claim 63.

65. antibody or its fragment, described antibody or its fragment specific combination comprise the fragment of the polypeptide or the described polypeptide of the aminoacid sequence that is shown in claim 56.

66. according to the antibody fragment of claim 65, it is Fab or F (ab ') 2 parts.

67. according to the antibody of claim 65, it is a polyclonal antibody.

68. according to the antibody of claim 65, it is a monoclonal antibody.

69. produce method as defined polypeptide in the claim 56, it is included under the condition that is enough to the following polypeptide of expression in host cell and cultivates the wherein host cell of integrating expression vector, this expression vector comprises the polynucleotide in external source source, this polynucleotide encoding comprises the polypeptide that is selected from SEQ ID NOs:2,16 and 25 aminoacid sequence, thereby causes producing expressed polypeptide.

70. according to the method for claim 69, described method further comprises the polypeptide that reclaims described cell generation.

71. according to the method for claim 69, the polynucleotide in wherein said external source source comprise and are selected from SEQID NOs:1,15 and 24 nucleotide sequence.

72. produce method as defined polypeptide in the claim 56, it is included under the condition that is enough to the following polypeptide of expression in host cell and cultivates the wherein host cell of integrating expression vector, this expression vector comprises the polynucleotide in external source source, this polynucleotide encoding comprises the polypeptide that is selected from SEQ ID NOs:2,16 and 25 aminoacid sequence, thereby causes producing expressed polypeptide.

73. according to the method for claim 72, described method further comprises the polypeptide that reclaims described cell generation.

74. according to the method for claim 72, the polynucleotide in wherein said external source source comprise and are selected from SEQID NOs:1,15 and 24 nucleotide sequence.

75. carrier molecule, it comprises the nucleotide sequence that is selected from coding people CREAP protein fragments amino acid region 1-267,289-538,356-580 and 575-650.