HK1189386B

HK1189386B - Designed repeat proteins binding to serum albumin

Info

Publication number: HK1189386B
Application number: HK14102498.7A
Authority: HK
Inventors: Daniel Steiner; Hans Kaspar Binz; Maya Gulotti-Georgieva; Frieder W. Merz; Douglas Phillips; Ivo Sonderegger
Original assignee: Molecular Partners Ag
Priority date: 2010-11-26
Filing date: 2011-11-25
Publication date: 2022-01-14

Description

Designed repeat proteins that bind to serum albumin

技术领域Technical Field

本发明涉及设计的对血清白蛋白具有结合特异性的重复蛋白、以及编码这类血清白蛋白结合蛋白的核酸、包含这类蛋白的药物组合物、这类蛋白用于改变生物活性化合物的药代动力学的用途、和这类蛋白在疾病治疗中的用途。The present invention relates to designed repeat proteins having binding specificity for serum albumin, as well as nucleic acids encoding such serum albumin binding proteins, pharmaceutical compositions comprising such proteins, the use of such proteins for altering the pharmacokinetics of biologically active compounds, and the use of such proteins in the treatment of diseases.

背景技术Background Art

制药工业热烈渴求通过调节或增加生物活性化合物（诸如蛋白治疗剂）的体内药代动力学(PK)性质来增加它们的有效性。就通过肾清除从循环中快速地消除的生物活性化合物而言，尤其如此。肾脏通常会从循环中滤出具有低于60 kDa的表观分子量的分子。改善如此小的生物活性化合物的药代动力学性质的一个策略是，简单地增加它们的表观分子大小(即增加它们的流体动力学半径)，例如通过添加非蛋白性聚合物部分诸如聚乙二醇聚合物或糖残基，或添加蛋白性聚合物部分诸如球状蛋白或未结构化的多肽，诸如在WO 2007/103515和WO 2008/155134中描述的那些。The pharmaceutical industry is eager to increase the effectiveness of bioactive compounds (such as protein therapeutics) by regulating or increasing their in vivo pharmacokinetic (PK) properties. This is especially true for bioactive compounds that are rapidly eliminated from the circulation by renal clearance. The kidneys typically filter out molecules with an apparent molecular weight lower than 60 kDa from the circulation. One strategy for improving the pharmacokinetic properties of such small bioactive compounds is to simply increase their apparent molecular size (i.e., increase their hydrodynamic radius), for example, by adding a non-proteinaceous polymer portion such as a polyethylene glycol polymer or a sugar residue, or adding a proteinaceous polymer portion such as a globular protein or an unstructured polypeptide, such as those described in WO 2007/103515 and WO 2008/155134.

其它策略利用血清蛋白（诸如免疫球蛋白和血清白蛋白）的长循环半衰期。具有67kDa分子量的血清白蛋白是血浆中的最丰富的蛋白，以约50 mg/ml (0.6 mM)存在，且在人类中具有19天的血清半衰期。血清白蛋白帮助维持血浆pH，有助于胶体血压（colloidalblood pressure），对许多代谢物和脂肪酸起载体的作用，并充当血浆的主要药物运输蛋白。已经描述了白蛋白中的几个主要的小分子结合位点。Other strategies exploit the long circulatory half-life of serum proteins, such as immunoglobulins and serum albumin. Serum albumin, with a molecular weight of 67 kDa, is the most abundant protein in plasma, present at approximately 50 mg/ml (0.6 mM) and having a serum half-life of 19 days in humans. Serum albumin helps maintain plasma pH, contributes to colloidal blood pressure, acts as a carrier for many metabolites and fatty acids, and serves as the primary drug transporter in plasma. Several major small molecule binding sites have been described in albumin.

已经证实，与血清白蛋白的非共价结合可以延长短寿命的小分子或多肽的半衰期(WO 1991/001743)。与血清白蛋白特异性地结合并由此可以延长与其偶联的其它分子的体内半衰期的多肽包括：细菌白蛋白结合域的变体(例如WO 2005/097202和WO 2009/016043)，小肽(例如Dennis, M.S., 等人, J. Biol. Chem. 277(3), 35035-43, 2002和WO 2001/045746)和免疫球蛋白片段(例如WO 2008/043822、WO 2004/003019、WO 2008/043821、WO 2006/040153、WO 2006/122787和WO 2004/041865)。WO 2008/043822涉及除了免疫球蛋白片段以外的其它结合蛋白，诸如基于蛋白A结构域、淀粉酶抑肽（tendamistat）、纤连蛋白、脂质运载蛋白、CTLA-4、T-细胞受体、设计的锚蛋白重复序列和PDZ结构域的分子，可能制备它们来特异性地结合血清白蛋白。尽管如此，WO 2008/043822既没有公开设计的对血清白蛋白(SA)具有结合特异性的锚蛋白重复结构域的选择，也没有公开与SA特异性地结合的重复结构域的具体重复序列基序。此外，据称，通过多肽与血清白蛋白的遗传融合，可以延长所述多肽的体内半衰期(例如WO 1991/001743)。药物的体内半衰期的这种改变可能积极地改变它们的药代动力学(PK)和/或药物动力学(PD) 性质。在新的和有效的治疗剂和疾病治疗方法的开发中，这是一个关键问题。因此，本领域需要改变生物活性化合物的PK和/或PD的新途径。It has been demonstrated that non-covalent binding to serum albumin can extend the half-life of short-lived small molecules or polypeptides (WO 1991/001743). Polypeptides that specifically bind to serum albumin and can thereby extend the in vivo half-life of other molecules conjugated thereto include variants of bacterial albumin binding domains (e.g., WO 2005/097202 and WO 2009/016043), small peptides (e.g., Dennis, MS, et al., J. Biol. Chem. 277(3) , 35035-43, 2002 and WO 2001/045746) and immunoglobulin fragments (e.g., WO 2008/043822, WO 2004/003019, WO 2008/043821, WO 2006/040153, WO 2006/122787 and WO 2004/041865). WO 2008/043822 relates to binding proteins other than immunoglobulin fragments, such as molecules based on protein A domains, tendamistat, fibronectin, lipocalin, CTLA-4, T-cell receptors, designed ankyrin repeats, and PDZ domains, which may be prepared to specifically bind to serum albumin. Nevertheless, WO 2008/043822 does not disclose the selection of designed ankyrin repeat domains with binding specificity for serum albumin (SA), nor does it disclose the specific repeat sequence motifs of the repeat domains that specifically bind to SA. Furthermore, it is claimed that the in vivo half-life of polypeptides can be extended by genetically fusing them to serum albumin (e.g., WO 1991/001743). Such alterations in the in vivo half-life of drugs may positively alter their pharmacokinetic (PK) and/or pharmacodynamic (PD) properties. This is a key issue in the development of new and effective therapeutic agents and disease treatment methods. Therefore, there is a need in the art for new approaches to alter the PK and/or PD of bioactive compounds.

除了抗体以外，新颖的结合蛋白或结合域也可以用于特异性地结合靶分子(例如Binz, H.K., Amstutz, P.和Plückthun, A., Nat. Biotechnol. 23, 1257-1268,2005)。一类这样的新颖的结合蛋白或结合域是基于设计的重复蛋白或设计的重复结构域(WO 2002/020565; Binz, H.K., Amstutz, P., Kohl, A., Stumpp, M.T., Briand, C.,Forrer, P., Grütter, M.G., 和Plückthun, A., Nat. Biotechnol. 22, 575-582,2004; Stumpp, M.T., Binz, H.K和Amstutz, P., Drug Discov. Today 13, 695-701,2008)。WO 2002/020565描述了如何构建重复蛋白的大文库和它们的一般用途。尽管如此，WO 2002/020565既没有公开对SA具有结合特异性的重复结构域的选择，也没有公开与SA特异性地结合的重复结构域的具体重复序列基序。此外，WO 2002/020565没有提示，对SA具有结合特异性的重复结构域可以用于调节其它分子的PK或PD。这些设计的重复结构域利用重复蛋白的模块性质，并具有N-端和C-端加帽模块，以通过屏蔽所述结构域的疏水核心来防止设计的重复结构域发生聚集(Forrer, P., Stumpp, M.T., Binz, H.K.和Plückthun,A., FEBS letters 539, 2-6, 2003)。这些加帽模块是基于天然的鸟嘌呤-腺嘌呤-结合蛋白(GA-结合蛋白)的加帽重复序列。经显示，通过改进从GA-结合蛋白衍生出的C-端加帽重复序列，可以进一步增加这些设计的锚蛋白重复结构域的热稳定性和热力学稳定性(Interlandi, G., Wetzel, S.K, Settanni, G., Plückthun, A.和Caflisch, A., J.Mol. Biol. 375, 837-854, 2008; Kramer, M.A, Wetzel, S.K., Plückthun, A.,Mittl, P.R.E,和Grütter, M.G., J. Mol. Biol. 404, 381-391, 2010)。所述作者将共8个突变引入了该加帽模块，并通过添加3个独特氨基酸来延长了它的C-端螺旋。尽管如此，这些修饰在C-端加帽模块中的引入，导致设计的携带该突变的C-端加帽模块的重复结构域的不希望的二聚化趋势。因而，需要制备锚蛋白重复结构域的进一步优化的C-端加帽模块或C-端加帽重复序列。In addition to antibodies, novel binding proteins or binding domains can also be used to specifically bind to target molecules (e.g., Binz, HK, Amstutz, P. and Plückthun, A., Nat. Biotechnol. 23 , 1257-1268, 2005). One such novel binding protein or binding domain is based on designed repeat proteins or designed repeat domains (WO 2002/020565; Binz, HK, Amstutz, P., Kohl, A., Stumpp, MT, Briand, C., Forrer, P., Grütter, MG, and Plückthun, A., Nat. Biotechnol. 22 , 575-582, 2004; Stumpp, MT, Binz, HK and Amstutz, P., Drug Discov. Today 13 , 695-701, 2008). WO 2002/020565 describes how to construct a large library of repeat proteins and their general use. Nevertheless, WO 2002/020565 neither discloses the selection of repeat domains with binding specificity to SA, nor discloses the specific repetitive sequence motif of the repeat domains specifically combined with SA. In addition, WO 2002/020565 does not prompt that the repeat domains with binding specificity to SA can be used to regulate the PK or PD of other molecules. The repeat domains of these designs utilize the modular properties of repeat proteins and have N-terminal and C-terminal capping modules to prevent the repeat domains of design from gathering by shielding the hydrophobic core of the domain (Forrer, P., Stumpp, MT, Binz, HK and Plückthun, A., FEBS letters 539 , 2-6, 2003). These capping modules are based on the capping repetitive sequences of natural guanine-adenine-binding proteins (GA-binding proteins). It has been shown that the thermal and thermodynamic stability of these designed ankyrin repeat domains can be further increased by modifying the C-terminal capping repeat sequence derived from GA-binding proteins (Interlandi, G., Wetzel, SK, Settanni, G., Plückthun, A., and Caflisch, A., J. Mol. Biol. 375 , 837-854, 2008; Kramer, MA, Wetzel, SK, Plückthun, A., Mittl, PRE, and Grütter, MG, J. Mol. Biol. 404 , 381-391, 2010). The authors introduced a total of eight mutations into the capping module and extended its C-terminal helix by adding three unique amino acids. Nevertheless, the introduction of these modifications into the C-terminal capping module resulted in an undesirable tendency for the designed repeat domains carrying this mutated C-terminal capping module to dimerize. Therefore, there is a need to prepare further optimized C-terminal capping modules or C-terminal capping repeat sequences of ankyrin repeat domains.

利用目前可得到的方案靶向SA来调节PK和/或PD，并非总是有效的。甚至日趋明显的是，通过结合（hijacking）SA来调节分子的PK和/或PD是复杂的，且仍然没有被充分理解。Targeting SA to modulate PK and/or PD using currently available approaches is not always effective. It is even becoming increasingly apparent that modulating the PK and/or PD of molecules by hijacking SA is complex and still not fully understood.

总之，需要改进的对SA具有特异性的结合蛋白，其能够改善用于治疗癌症和其它病理学状况的治疗相关分子或多肽的PK和/或PD。In summary, there is a need for improved binding proteins specific for SA that can improve the PK and/or PD of therapeutically relevant molecules or polypeptides for the treatment of cancer and other pathological conditions.

本发明要解决的技术问题是，鉴定新颖的结合蛋白（诸如对SA具有结合特异性的重复结构域），其能够改变用于癌症和其它病理学状况的改善治疗的治疗相关分子的PK和/或PD。通过提供在权利要求书中表征的实施方案，实现了该技术问题的解决方案。The technical problem addressed by the present invention is to identify novel binding proteins (such as repeat domains with binding specificity for SA) that are capable of altering the PK and/or PD of therapeutically relevant molecules for improved treatment of cancer and other pathological conditions. This technical problem is addressed by providing the embodiments characterized in the claims.

发明概述SUMMARY OF THE INVENTION

本发明涉及包含至少一个锚蛋白重复结构域的结合蛋白，其中所述锚蛋白重复结构域对哺乳动物血清白蛋白具有结合特异性，且其中所述锚蛋白重复结构域包含锚蛋白重复模块，所述锚蛋白重复模块具有选自下述的氨基酸序列：SEQ ID NO:49、50、51和52，和这样的序列，其中在SEQ ID NO:49、50、51和52中的最多9个氨基酸被任意氨基酸替换。The present invention relates to a binding protein comprising at least one ankyrin repeat domain, wherein the ankyrin repeat domain has binding specificity for mammalian serum albumin, and wherein the ankyrin repeat domain comprises an ankyrin repeat module having an amino acid sequence selected from the group consisting of SEQ ID NOs: 49, 50, 51 and 52, and sequences wherein up to 9 amino acids in SEQ ID NOs: 49, 50, 51 and 52 are replaced by any amino acid.

在另一个实施方案中，本发明涉及包含至少一个锚蛋白重复结构域的结合蛋白，其中所述重复结构域对哺乳动物血清白蛋白具有结合特异性，且其中所述锚蛋白重复结构域包含与选自下述的一个锚蛋白重复结构域具有至少70%氨基酸序列同一性的氨基酸序列：SEQ ID NO: 17-31和43-48，其中所述锚蛋白重复结构域的在1位处的G和/或在2位处的S任选地缺失。In another embodiment, the invention relates to a binding protein comprising at least one ankyrin repeat domain, wherein the repeat domain has binding specificity for mammalian serum albumin, and wherein the ankyrin repeat domain comprises an amino acid sequence having at least 70% amino acid sequence identity to an ankyrin repeat domain selected from the group consisting of SEQ ID NOs: 17-31 and 43-48, wherein the G at position 1 and/or the S at position 2 of the ankyrin repeat domain is optionally deleted.

具体地，本发明涉及如上文定义的结合蛋白，其中所述锚蛋白重复结构域与选自下述的锚蛋白重复结构域竞争结合哺乳动物血清白蛋白：SEQ ID NO: 17-31和43-48。In particular, the present invention relates to a binding protein as defined above, wherein the ankyrin repeat domain competes for binding to mammalian serum albumin with an ankyrin repeat domain selected from the group consisting of SEQ ID NOs: 17-31 and 43-48.

此外，本发明涉及这样的包含生物活性化合物的结合蛋白，尤其是包含的生物活性化合物在哺乳动物中的终末血浆半衰期与所述未修饰的生物活性化合物的终末血浆半衰期相比高至少2倍的结合蛋白。Furthermore, the present invention relates to such binding proteins comprising a biologically active compound, in particular binding proteins comprising a biologically active compound having a terminal plasma half-life in a mammal that is at least 2 times higher than the terminal plasma half-life of the unmodified biologically active compound.

本发明另外涉及编码本发明的结合蛋白的核酸分子，以及包含一种或更多种上述结合蛋白或核酸分子的药物组合物。The present invention further relates to nucleic acid molecules encoding the binding proteins of the present invention, and pharmaceutical compositions comprising one or more of the above-mentioned binding proteins or nucleic acid molecules.

本发明另外涉及使用本发明的结合蛋白治疗病理学状况的方法。The present invention further relates to methods of treating pathological conditions using the binding proteins of the invention.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1.通过SEC对选择的DARPin的稳定性分析。 Figure 1. Stability analysis of selected DARPins by SEC.

用Superdex 200柱5/150 (图1a或图1b)或用superdex200 10/300GL (图1c)分析的对xSA具有特异性的DARPin在下述情况下的尺寸排阻色谱法(SEC)运行的洗脱曲线：温育前（图1a），在PBS中在40℃以30 mg/ml (约2 mM)温育28天以后(图1b)，或在-80℃贮存1个月以后(图1c)。如在实施例1中所述，表达和纯化所有样品。为了SEC分析，将样品稀释至500µM的浓度。用箭头指示了分子质量标准品抑酶肽(AP) 6.5 kDa、碳酸酐酶(CA) 29 kDa和伴清蛋白(CO) 75 kDa。Elution profiles of size exclusion chromatography (SEC) runs of DARPins specific for xSA, analyzed using a Superdex 200 column 5/150 (Figure 1a or 1b) or Superdex 200 10/300GL (Figure 1c), before incubation (Figure 1a), after incubation at 40°C for 28 days at 30 mg/ml (approximately 2 mM) in PBS (Figure 1b), or after storage at -80°C for one month (Figure 1c). All samples were expressed and purified as described in Example 1. For SEC analysis, samples were diluted to a concentration of 500 µM. Molecular mass standards aprotinin (AP) 6.5 kDa, carbonic anhydrase (CA) 29 kDa, and conalbumin (CO) 75 kDa are indicated by arrows.

，哺乳动物血清白蛋白，A，在280 nm的吸光度；t，按分钟计的保留时间；, mammalian serum albumin, A, absorbance at 280 nm; t, retention time in minutes;

DARPin #19 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:19)；DARPin #19 (SEQ ID NO: 19 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #20 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:20)；DARPin #20 (SEQ ID NO: 20 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #21 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:21)；DARPin #21 (SEQ ID NO: 21 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #22 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:22)；DARPin #22 (SEQ ID NO: 22 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #27 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:27)；DARPin #27 (SEQ ID NO: 27 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #28 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:28)；DARPin #28 (SEQ ID NO: 28 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #29 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:29)；DARPin #29 (SEQ ID NO: 29 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #30 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:30)。DARPin #30 (SEQ ID NO: 30 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

DARPin #43 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:43)。DARPin #43 (SEQ ID NO: 43 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

DARPin #44 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:44)。DARPin #44 (SEQ ID NO: 44 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

DARPin #45 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:45)。DARPin #45 (SEQ ID NO: 45 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

DARPin #46 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:46)。DARPin #46 (SEQ ID NO: 46 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

DARPin #47 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:47)。DARPin #47 (SEQ ID NO: 47 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

DARPin #48 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:48)。DARPin #48 (SEQ ID NO: 48 with a His-tag (SEQ ID NO: 15) fused at its N-terminus).

图2. 选择的DARPin的热稳定性。 Figure 2. Thermal stability of selected DARPins.

对xSA具有特异性的DARPin在pH 7.4的PBS中(图2a)和在pH 5.8的MES缓冲液(250mM (2-N-吗啉代)-乙磺酸pH 5.5)、150 mM NaCl以1:4 (v/v)与pH 7.4的PBS混合，并将pH调至5.8)中(图2b) 的热变性迹线(通过存在于缓冲液中的染料SYPRO Orange的荧光强度增加来跟踪)。Thermal denaturation traces (tracked by the increase in fluorescence intensity of the dye SYPRO Orange present in the buffer) of a DARPin specific for xSA in PBS at pH 7.4 ( FIG. 2 a ) and in MES buffer at pH 5.8 (250 mM (2-N-morpholino)-ethanesulfonic acid, pH 5.5), 150 mM NaCl mixed in a 1:4 (v/v) ratio with PBS at pH 7.4 and adjusted to pH 5.8) ( FIG. 2 b ).

，相对荧光单位(RFU)，在515-535 nm激发，在560-580 nm检测；T，按℃计的温度；DARPin的定义参见上文。, relative fluorescence units (RFU), excitation at 515-535 nm, detection at 560-580 nm; T, temperature in °C; see above for definition of DARPin.

图3.选择的DARPin在小鼠中的血浆清除率。 Figure 3. Plasma clearance of selected DARPins in mice.

在小鼠中评估了对MSA (小鼠血清白蛋白)具有特异性的DARPin和对照DARPin的血浆清除率。The plasma clearance of DARPins specific for MSA (mouse serum albumin) and a control DARPin was assessed in mice.

（图3a) 与对MSA没有结合特异性的DARPin #32 (参见下文)相比，包含仅一个对MSA具有结合特异性的重复结构域的DARPin。( FIG. 3 a ) DARPin comprising only one repeat domain with binding specificity for MSA, compared to DARPin #32 (see below), which has no binding specificity for MSA.

（图3b) 与对MSA没有结合特异性的DARPin #32相比，包含2个蛋白结构域(其中之一是对MSA具有结合特异性的重复结构域) 的DARPin。( FIG. 3 b ) A DARPin comprising two protein domains (one of which is a repeat domain with binding specificity for MSA) is shown in comparison to DARPin #32, which has no binding specificity for MSA.

经由His-标签（His-Tag），用^99mTc-羰基化合物标记DARPin，并静脉内地注射给小鼠。在注射后的不同时间点测量被注射的小鼠的血液的放射性，并显示为针对^99mTc的放射性衰变校正过的注射剂量的比率(%ID)。拟合曲线显示了在不同时间点处测量的放射性的非线性回归的结果——两阶段衰变(Graphpad Prism)。每个数据点代表每组2只小鼠的平均值。DARPin was labeled with a ^99m Tc-carbonyl compound via a His-Tag and injected intravenously into mice. Radioactivity in the blood of the injected mice was measured at various time points after injection and displayed as the ratio of the injected dose (%ID) corrected for the radioactive decay of ^99m Tc. The fitted curve shows the results of a nonlinear regression of the radioactivity measured at various time points—a two-stage decay (Graphpad Prism). Each data point represents the mean of two mice per group.

，针对^99mTc的放射性衰变校正过的注射剂量百分比；t，按小时计的时间；, percentage of injected dose corrected for radioactive decay of ^99m Tc; t, time in hours;

DARPin #18 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:18)；DARPin #18 (SEQ ID NO: 18 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #23 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:23)；DARPin #23 (SEQ ID NO: 23 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #25 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:25)；DARPin #25 (SEQ ID NO: 25 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #32 (对xSA没有结合特异性的阴性对照DARPin，具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:32)；DARPin #32 (negative control DARPin with no binding specificity for xSA, SEQ ID NO: 32 having a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #33 (包含2个重复结构域的DARPin，其中一个重复结构域对xSA具有结合特异性，具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:33)；DARPin #33 (a DARPin comprising two repeat domains, one of which has binding specificity for xSA, SEQ ID NO: 33 having a His-tag (SEQ ID NO: 15) fused to its N-terminus);

DARPin #35 (包含2个重复结构域的DARPin，其中一个重复结构域对xSA具有结合特异性，具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:35)；DARPin #35 (a DARPin comprising two repeat domains, one of which has binding specificity for xSA, SEQ ID NO: 35 having a His-tag (SEQ ID NO: 15) fused to its N-terminus);

DARPin #36 (包含2个重复结构域的DARPin，其中一个重复结构域对xSA具有结合特异性，具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:36)。DARPin #36 (DARPin comprising two repeat domains, one of which has binding specificity for xSA, SEQ ID NO: 36 having a His-tag (SEQ ID NO: 15) fused to its N-terminus).

图4.选择的DARPin在食蟹猴（cynomolgus monkeys）中的血浆清除率。 Figure 4. Plasma clearance of selected DARPins in cynomolgus monkeys.

在食蟹猴中评估了对CSA (食蟹猴血清白蛋白)具有特异性的DARPin和对照DARPin的血浆清除率。The plasma clearance of DARPins specific for CSA (cynomolgus serum albumin) and a control DARPin was assessed in cynomolgus monkeys.

（图4a) 将DARPin #26与对CSA没有结合特异性的DARPin #32进行了对比。( FIG. 4 a ) DARPin #26 was compared with DARPin #32, which has no binding specificity for CSA.

（图4b) 将DARPin #24、34和17与对CSA没有结合特异性的DARPin #32进行了对比。在t = 0小时，以0.5 mg/ml (DARPin #26、DARPin #24、DARPin #17和DARPin #32)或1mg/ml (DARPin #34)的浓度，将下述DARPin静脉内地注射给食蟹猴：在注射后的不同时间点，通过ELISA测量猴血浆中的DARPin浓度。曲线显示了在不同时间点测得的浓度的非线性回归的结果——两阶段衰变(Graphpad Prism)。从第二阶段，可以确定DARPin的终末血浆半衰期。每个单独数据点代表同一血清样品的2次独立ELISA测量的平均值。(Figure 4b) DARPins #24, 34, and 17 were compared with DARPin #32, which has no binding specificity for CSA. At t = 0 hours, the following DARPins were intravenously injected into cynomolgus monkeys at a concentration of 0.5 mg/ml (DARPin #26, DARPin #24, DARPin #17, and DARPin #32) or 1 mg/ml (DARPin #34). At various time points after injection, DARPin concentrations in monkey plasma were measured by ELISA. The curves show the results of a nonlinear regression of the concentrations measured at different time points—a two-stage decay (Graphpad Prism). From the second stage, the terminal plasma half-life of the DARPin was determined. Each individual data point represents the average of two independent ELISA measurements from the same serum sample.

，按nM计的DARPin浓度；t，按小时计的时间；, DARPin concentration in nM; t, time in hours;

DARPin #17 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:17)；DARPin #17 (SEQ ID NO: 17 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #24 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:24)；DARPin #24 (SEQ ID NO: 24 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #26 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:26)；DARPin #26 (SEQ ID NO: 26 with a His-tag (SEQ ID NO: 15) fused at its N-terminus);

DARPin #34 (包含2个重复结构域的DARPin，其中一个重复结构域对xSA具有结合特异性，具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:34)。DARPin #34 (DARPin comprising two repeat domains, one of which has binding specificity for xSA, SEQ ID NO: 34 having a His-tag (SEQ ID NO: 15) fused to its N-terminus).

具体实施方式DETAILED DESCRIPTION

根据本发明的结合域对哺乳动物血清白蛋白(xSA)是特异性的。优选地，根据本发明的结合域对小鼠、大鼠、狗、兔、猴或人起源的血清白蛋白是特异性的。更优选地，根据本发明的结合域对人起源的血清白蛋白(HSA)是特异性的。The binding domain according to the present invention is specific for mammalian serum albumin (xSA). Preferably, the binding domain according to the present invention is specific for serum albumin of mouse, rat, dog, rabbit, monkey or human origin. More preferably, the binding domain according to the present invention is specific for serum albumin of human origin (HSA).

术语“蛋白”指多肽，其中多肽的至少部分具有或者能够在其多肽链内和/或之间通过形成二级、三级或四级结构获得限定的三维排列。如果蛋白包含两条或更多条多肽，则每条多肽链可非共价连接或者例如通过两条多肽之间的二硫键共价连接。各自具有或者能够通过形成二级或三级结构获得限定的三维排列的蛋白部分，称为“蛋白结构域”。此类蛋白结构域为本领域技术人员熟知。The term "protein" refers to a polypeptide, wherein at least a portion of the polypeptide has or is capable of achieving a defined three-dimensional arrangement by forming secondary, tertiary, or quaternary structures within and/or between its polypeptide chains. If the protein comprises two or more polypeptides, each polypeptide chain may be non-covalently linked or covalently linked, for example, by disulfide bonds between the two polypeptides. Portions of a protein that each have or are capable of achieving a defined three-dimensional arrangement by forming secondary or tertiary structures are referred to as "protein domains." Such protein domains are well known to those skilled in the art.

在重组蛋白、重组蛋白结构域、重组结合蛋白等中使用的术语“重组”意指，所述多肽通过使用相关领域技术人员熟知的重组DNA技术制备。例如，可将编码多肽的重组DNA分子(如通过基因合成制备)克隆入细菌表达质粒(例如pQE30，Qiagen)、酵母表达质粒或哺乳动物表达质粒内。当例如将这样构建的重组细菌表达质粒插入适当的细菌(例如大肠杆菌（Escherichia coli）)时，该细菌可产生由这种重组DNA编码的多肽。相应产生的多肽称为重组多肽。The term "recombinant" as used in connection with recombinant proteins, recombinant protein domains, recombinant binding proteins, and the like, means that the polypeptide is produced using recombinant DNA techniques well known to those skilled in the relevant art. For example, a recombinant DNA molecule encoding a polypeptide (e.g., prepared by gene synthesis) can be cloned into a bacterial expression plasmid (e.g., pQE30, Qiagen), a yeast expression plasmid, or a mammalian expression plasmid. When, for example, the recombinant bacterial expression plasmid constructed in this manner is inserted into an appropriate bacterium (e.g., Escherichia coli ), the bacterium can produce the polypeptide encoded by the recombinant DNA. The resulting polypeptide is referred to as a recombinant polypeptide.

在本发明范围内，术语“多肽”是指由经肽键连接的多个(即两个或更多个)氨基酸的一条或多条链组成的分子。优选地，多肽由超过八个经肽键连接的氨基酸组成。In the context of the present invention, the term "polypeptide" refers to a molecule consisting of one or more chains of multiple (ie two or more) amino acids linked by peptide bonds. Preferably, a polypeptide consists of more than eight amino acids linked by peptide bonds.

术语“多肽标签”指连接于多肽/蛋白的氨基酸序列，其中所述氨基酸序列可用于纯化、检测或靶向所述多肽/蛋白，或者其中所述氨基酸序列改善多肽/蛋白的物理化学行为，或者其中所述氨基酸序列具备效应子功能。结合蛋白的个别多肽标签、部分和/或结构域相互之间可直接或经多肽接头连接。这些多肽标签在本领域众所周知，本领域技术人员完全可获得。多肽标签的实例是允许检测所述多肽/蛋白的小的多肽序列，例如His (例如SEQ ID NO:15的His-标签)、myc、FLAG或者Strep-标签或部分诸如酶(例如酶如碱性磷酸酶)，或者可用于靶向(诸如免疫球蛋白或其片段)和/或作为效应分子的部分。The term "polypeptide tag" refers to an amino acid sequence attached to a polypeptide/protein, wherein the amino acid sequence can be used to purify, detect or target the polypeptide/protein, or wherein the amino acid sequence improves the physicochemical behavior of the polypeptide/protein, or wherein the amino acid sequence has an effector function. Individual polypeptide tags, parts and/or domains of a binding protein may be linked to each other directly or via a polypeptide linker. Such polypeptide tags are well known in the art and are fully available to those skilled in the art. Examples of polypeptide tags are small polypeptide sequences that allow detection of the polypeptide/protein, such as His (e.g., the His-tag of SEQ ID NO: 15), myc, FLAG or Strep-tags, or parts such as enzymes (e.g., enzymes such as alkaline phosphatase), or that can be used for targeting (such as immunoglobulins or fragments thereof) and/or as part of an effector molecule.

术语“多肽接头”指氨基酸序列，它能够连接例如两个蛋白结构域，多肽标签和蛋白结构域，蛋白结构域和非多肽部分诸如聚乙二醇，或两个序列标签。相关领域技术人员已知此类另外的结构域、标签、非多肽部分和接头。在专利申请WO 2002/020565的说明书中提供实例的列表。此类接头的具体实例是各种长度的甘氨酸-丝氨酸-接头和脯氨酸-苏氨酸-接头；优选所述接头具有2至24个氨基酸的长度；更优选所述接头具有2至16个氨基酸的长度。在SEQ ID NO:16中提供了甘氨酸-丝氨酸-接头的一个实例。The term "polypeptide linker" refers to an amino acid sequence that is capable of connecting, for example, two protein domains, a polypeptide tag and a protein domain, a protein domain and a non-polypeptide moiety such as polyethylene glycol, or two sequence tags. Such additional domains, tags, non-polypeptide moieties, and linkers are known to those skilled in the relevant art. A list of examples is provided in the specification of patent application WO 2002/020565. Specific examples of such linkers are glycine-serine linkers and proline-threonine linkers of various lengths; preferably, the linkers have a length of 2 to 24 amino acids; more preferably, the linkers have a length of 2 to 16 amino acids. An example of a glycine-serine linker is provided in SEQ ID NO: 16.

术语“聚合物部分”指蛋白性聚合物部分或者非蛋白性聚合物部分。“蛋白性聚合物部分”优选是这样的多肽，其在室温(RT)以约0.1 mM浓度在磷酸盐缓冲盐水(PBS) 中贮存1月时，不形成稳定的三级结构，同时不形成超过10% ，优选不超过5%；还优选不超过2%；甚至更优选不超过1%；最优选没有可检测到的量的通过尺寸排阻色谱法(SEC)确定的寡聚体或聚集体。当使用球蛋白作为SEC的分子量标准时，此类蛋白性聚合物部分以高于其有效分子量的表观分子量在SEC中运行。优选通过SEC确定的所述蛋白性聚合物部分的表观分子量为用其氨基酸序列计算的其有效分子量的1.5x、2x或2.5x。还优选通过SEC确定的所述非蛋白性聚合物部分的表观分子量为用其分子组成计算的其有效分子量的2x、4x或8x。优选所述蛋白性聚合物部分超过50%、70%或甚至90%氨基酸以约0.1 mM浓度在PBS中在RT下不形成稳定的二级结构，通过圆二色性(CD)测量确定。最优选所述蛋白性聚合物显示无规卷曲构象的典型近UV CD-谱。本领域技术人员熟知此类CD分析。还优选的是，由超过50、优选超过100、200、300、400、500、600、700或最优选超过800个氨基酸组成的蛋白性聚合物部分。蛋白性聚合物部分的实例是XTEN®(Amunix的注册商标; WO 2007/103515)多肽，或者包含脯氨酸、丙氨酸和丝氨酸残基的多肽，如在WO 2008/155134中所述。通过用标准DNA克隆技术制备基因融合多肽，可将此类蛋白性聚合物部分共价连接于例如本发明的结合域，接着进行标准表达和纯化。The term "polymer portion" refers to a proteinaceous polymer portion or a non-proteinaceous polymer portion. A "proteinaceous polymer portion" is preferably a polypeptide that does not form a stable tertiary structure when stored in phosphate-buffered saline (PBS) at a concentration of about 0.1 mM at room temperature (RT) for one month, and does not form more than 10%, preferably no more than 5%; further preferably no more than 2%; even more preferably no more than 1%; and most preferably no detectable amount of oligomers or aggregates as determined by size exclusion chromatography (SEC). When globulins are used as molecular weight standards for SEC, such proteinaceous polymer portions are run in SEC with an apparent molecular weight higher than their effective molecular weight. Preferably, the apparent molecular weight of the proteinaceous polymer portion as determined by SEC is 1.5x, 2x, or 2.5x its effective molecular weight calculated using its amino acid sequence. Also preferably, the apparent molecular weight of the non-proteinaceous polymer portion as determined by SEC is 2x, 4x, or 8x its effective molecular weight calculated using its molecular composition. Preferably, the proteinaceous polymer portion has more than 50%, 70%, or even 90% of its amino acids that do not form a stable secondary structure at about 0.1 mM concentration in PBS at RT, as determined by circular dichroism (CD). Most preferably, the proteinaceous polymer exhibits a typical near-UV CD spectrum of a random coil conformation. Such CD analysis is well known to those skilled in the art. Also preferred are proteinaceous polymer portions consisting of more than 50, preferably more than 100, 200, 300, 400, 500, 600, 700, or most preferably more than 800 amino acids. Examples of proteinaceous polymer portions are XTEN® (a registered trademark of Amunix; WO 2007/103515) polypeptides, or polypeptides comprising proline, alanine, and serine residues, as described in WO 2008/155134. Such proteinaceous polymer portions can be covalently linked to, for example, a binding domain of the present invention by preparing a gene fusion polypeptide using standard DNA cloning techniques, followed by standard expression and purification.

本发明的聚合物部分在分子量上可宽泛地变化(即约1 kDa至约150 kDa)。优选所述聚合物部分具有至少2、更优选至少5、10、20、30、50、70或最优选至少100 kDa的分子量。优选所述聚合物部分通过多肽接头与结合域连接。The polymer moiety of the present invention can vary widely in molecular weight (i.e., from about 1 kDa to about 150 kDa). Preferably, the polymer moiety has a molecular weight of at least 2, more preferably at least 5, 10, 20, 30, 50, 70, or most preferably at least 100 kDa. Preferably, the polymer moiety is linked to the binding domain via a polypeptide linker.

非蛋白性聚合物部分的实例是羟乙基淀粉(HES)、聚乙二醇(PEG)、聚丙二醇或聚氧化烯。术语“聚乙二醇化”意指PEG部分共价连接于例如本发明的多肽。Examples of non-proteinaceous polymer moieties are hydroxyethyl starch (HES), polyethylene glycol (PEG), polypropylene glycol or polyoxyalkylenes.The term "PEGylated" means that a PEG moiety is covalently attached, eg to a polypeptide of the invention.

在具体实施方案中，PEG部分或任何其它非蛋白性聚合物可以例如经马来酰亚胺接头与半胱氨酸巯基偶联，半胱氨酸经肽接头与本文所述结合域的N-或C-端偶联。In a specific embodiment, a PEG moiety or any other non-proteinaceous polymer can be coupled to a cysteine thiol group, for example via a maleimide linker, and the cysteine is coupled to the N- or C-terminus of a binding domain described herein via a peptide linker.

术语“结合蛋白”指包含一个或多个结合域、一个或多个生物活性化合物和一个或多个聚合物部分的蛋白，如下文进一步解释。优选所述结合蛋白包含至多四个结合域。更优选所述结合蛋白包含至多两个结合域。最优选所述结合蛋白只包含一个结合域。而且，任何此类结合蛋白可包含非结合域的另外的蛋白结构域、多聚化部分、多肽标签、多肽接头和/或单个Cys残基。多聚化部分的实例是免疫球蛋白重链恒定区(其配对以提供功能性免疫球蛋白Fc域)，和亮氨酸拉链或者包含在两条此类多肽之间形成分子间二硫键的游离巯基的多肽。单个Cys残基可用于使其它部分与多肽缀合，例如通过使用本领域技术人员熟知的马来酰亚胺化学。优选地，所述结合蛋白是重组结合蛋白。也优选地，结合蛋白的结合域具有不同的靶标特异性。The term "binding protein" refers to a protein comprising one or more binding domains, one or more biologically active compounds and one or more polymer parts, as further explained below. Preferably, the binding protein comprises at most four binding domains. More preferably, the binding protein comprises at most two binding domains. Most preferably, the binding protein comprises only one binding domain. Moreover, any such binding protein may comprise additional protein domains other than the binding domain, a multimerization portion, a polypeptide tag, a polypeptide linker and/or a single Cys residue. Examples of multimerization portions are immunoglobulin heavy chain constant regions (which pair to provide functional immunoglobulin Fc domains), and leucine zippers or polypeptides comprising free sulfhydryl groups that form intermolecular disulfide bonds between two such polypeptides. A single Cys residue can be used to conjugate other moieties to the polypeptide, for example by using maleimide chemistry well known to those skilled in the art. Preferably, the binding protein is a recombinant binding protein. Also preferably, the binding domains of the binding protein have different target specificities.

术语“结合域”意指表现与蛋白支架相同的“折叠”(三维排列)且具有如下文限定的预定性质的蛋白结构域。此类结合域可通过合理的，或者最通常是组合的蛋白工程技术的本领域已知的技术获得（Binz等, 2005, 在上述引文中）。例如，具有预定性质的结合域可通过包含以下步骤的方法获得：(a)提供表现与如下文进一步限定的蛋白支架相同的折叠的不同蛋白结构域集合；和(b)筛选所述不同的集合和/或从所述不同集合选择以获得至少一种具有所述预定性质的蛋白结构域。不同的蛋白结构域集合可根据所用筛选和/或选择系统通过几种方法提供，并可包括使用本领域技术人员熟知的方法，诸如噬菌体展示或核糖体展示。优选地，所述结合域是重组结合域。The term "binding domain" means a protein domain that exhibits the same "fold" (three-dimensional arrangement) as a protein scaffold and has a predetermined property as defined below. Such binding domains can be obtained by techniques known in the art using reasonable, or most commonly combined, protein engineering techniques (Binz et al., 2005, in the above-mentioned citations). For example, a binding domain with a predetermined property can be obtained by a method comprising the following steps: (a) providing a set of different protein domains that exhibit the same fold as a protein scaffold as further defined below; and (b) screening the different sets and/or selecting from the different sets to obtain at least one protein domain with the predetermined property. The different sets of protein domains can be provided by several methods depending on the screening and/or selection system used, and can include the use of methods well known to those skilled in the art, such as phage display or ribosome display. Preferably, the binding domain is a recombinant binding domain.

术语“蛋白支架”意指具有其中高度耐受氨基酸插入、取代或缺失的暴露表面区域的蛋白。可用于产生本发明结合域的蛋白支架实例是抗体或其片段诸如单链Fv或Fab片段，来自金黄色葡萄球菌(Staphylococcus aureus)的蛋白A，来自大菜粉蝶(Pieris brassicae)的后胆色素结合蛋白或其它脂质运载蛋白，锚蛋白重复蛋白或其它重复蛋白，和人纤连蛋白。本领域技术人员已知蛋白支架(Binz等, 2005, 在上述引文中; Binz等,2004, 在上述引文中)。The term "protein scaffold" refers to a protein having exposed surface regions that are highly tolerant to amino acid insertions, substitutions, or deletions. Examples of protein scaffolds that can be used to generate the binding domains of the present invention are antibodies or fragments thereof, such as single-chain Fv or Fab fragments, protein A from Staphylococcus aureus , meta-billin-binding protein from Pieris brassicae or other lipocalins, ankyrin repeat proteins or other repeat proteins, and human fibronectin. Protein scaffolds are known to those skilled in the art (Binz et al., 2005, loc. cit.; Binz et al., 2004, loc. cit.).

术语“预定性质”指诸如结合靶标、阻断靶标、激活靶标介导的反应、酶活性和相关其它性质等性质。根据期望的性质类型，普通技术人员将能够鉴定用于进行筛选和/或选择具有期望性质的结合域的格式和必需步骤。优选所述预定性质是结合靶标。The term "predetermined property" refers to properties such as binding to a target, blocking a target, activating a target-mediated reaction, enzymatic activity, and other related properties. Depending on the type of property desired, one of ordinary skill will be able to identify the format and necessary steps for screening and/or selecting binding domains having the desired property. Preferably, the predetermined property is binding to a target.

下文关于重复蛋白的定义基于专利申请WO 2002/020565。专利申请WO 2002/020565进一步含有重复蛋白特征、技术和应用的一般描述。The following definition of repeat proteins is based on patent application WO 2002/020565. Patent application WO 2002/020565 further contains a general description of repeat protein characteristics, technology and applications.

术语“重复蛋白”指包含一个或多个重复结构域的蛋白。优选每个所述重复蛋白包含至多四个重复结构域。更优选每个所述重复蛋白包含至多两个重复结构域。最优选每个重复蛋白只包含一个重复结构域。而且，所述重复蛋白可包含另外的非重复蛋白结构域、多肽标签和/或多肽接头。The term "repeat protein" refers to a protein comprising one or more repeat domains. Preferably, each of the repeat proteins comprises at most four repeat domains. More preferably, each of the repeat proteins comprises at most two repeat domains. Most preferably, each repeat protein comprises only one repeat domain. Furthermore, the repeat protein may comprise additional non-repeat protein domains, polypeptide tags, and/or polypeptide linkers.

术语“重复结构域”指包含两个或更多个连续重复单元(模块)作为结构单元的蛋白结构域，其中所述结构单元具有相同的折叠，并紧密堆叠以产生例如具有接合疏水核心的超螺旋结构。优选地，重复结构域另外包含N-端和/或C-端加帽单元(或模块)。甚至更优选地，所述N-端和/或C-端加帽单元(或模块)是加帽重复序列。The term "repeat domain" refers to a protein domain comprising two or more consecutive repeating units (modules) as structural units, wherein the structural units have the same fold and are tightly stacked to produce, for example, a supercoiled structure with a joined hydrophobic core. Preferably, the repeat domain further comprises an N-terminal and/or C-terminal capping unit (or module). Even more preferably, the N-terminal and/or C-terminal capping unit (or module) is a capping repeat sequence.

术语“设计的重复蛋白”和“设计的重复结构域”分别指通过专利申请WO 2002/020565中解释的发明程序获得的重复蛋白或重复结构域。设计的重复蛋白和设计的重复结构域是合成的，并非出于天然。它们分别是通过对应设计的核酸的表达所得的人造蛋白或结构域。优选表达是在真核或原核细胞诸如细菌细胞中进行，或者通过使用无细胞体外表达系统进行。因此，设计的锚蛋白重复蛋白(即DARPin) 与包含至少一个锚蛋白重复结构域的本发明的结合蛋白对应。The terms "designed repeat protein" and "designed repeat domain" refer to a repeat protein or repeat domain, respectively, obtained by the inventive procedure explained in patent application WO 2002/020565. Designed repeat proteins and designed repeat domains are synthetic and not naturally occurring. They are artificial proteins or domains, respectively, obtained by expressing corresponding designed nucleic acids. Preferably, expression is performed in eukaryotic or prokaryotic cells, such as bacterial cells, or by using a cell-free in vitro expression system. Thus, a designed ankyrin repeat protein (i.e., a DARPin) corresponds to a binding protein of the present invention comprising at least one ankyrin repeat domain.

术语“结构单元”指局部有序的多肽部分，其通过沿多肽链相互靠近的两个或更多个二级结构区段之间的三维相互作用形成。此类结构单元表现结构基序。术语“结构基序”指存在于至少一个结构单元中的二级结构元件的三维排列。本领域技术人员熟知结构基序。单独的结构单元不能获得限定的三维排列；但是，它们的连续排列，例如作为重复结构域的重复模块，导致相邻单元的相互稳定，产生超螺旋结构。The term "structural unit" refers to a locally ordered portion of a polypeptide formed by three-dimensional interactions between two or more secondary structure segments located adjacent to each other along the polypeptide chain. Such structural units exhibit a structural motif. The term "structural motif" refers to the three-dimensional arrangement of secondary structure elements present in at least one structural unit. Structural motifs are well known to those skilled in the art. Structural units alone cannot achieve a defined three-dimensional arrangement; however, their continuous arrangement, for example as a repeating module of a repeating domain, results in mutual stabilization of adjacent units, resulting in a supercoiled structure.

术语“重复单元”指包含一种或多种天然存在的重复蛋白的重复序列基序的氨基酸序列，其中所述“重复单元”以多个拷贝出现，且其表现确定蛋白折叠的所有所述基序共同的限定的折叠拓扑学。这样的重复单元对应于：Forrer等人（2003, 在上述引文中）所述的重复蛋白的“重复结构单元(重复序列)”，或Binz等人（2004, 在上述引文中）所述的重复蛋白的“连续同源结构单元(重复序列)”。此类重复单元包含框架残基和相互作用残基。此类重复单元的实例是犰狳重复单元、富亮氨酸重复单元、锚蛋白重复单元、三十四肽重复单元、HEAT重复单元和富亮氨酸变体重复单元。含有两个或更多个此类重复单元的天然存在的蛋白称为“天然存在的重复蛋白”。重复蛋白的各重复单元的氨基酸序列相互之间比较时可具有显著数量的突变、取代、添加和/或缺失，但仍然基本保留重复单元的一般模式或基序。The term "repeat unit" refers to an amino acid sequence comprising a repeating sequence motif of one or more naturally occurring repeat proteins, wherein the "repeat unit" occurs in multiple copies and exhibits a defined folding topology common to all of the motifs that determine protein folding. Such repeat units correspond to the "repeating structural units (repeats)" of repeat proteins described by Forrer et al. (2003, loc. cit.) or the "continuous homologous structural units (repeats)" of repeat proteins described by Binz et al. (2004, loc. cit.). Such repeat units comprise framework residues and interacting residues. Examples of such repeat units are armadillo repeat units, leucine-rich repeat units, ankyrin repeat units, tetratricopeptide repeat units, HEAT repeat units, and leucine-rich variant repeat units. Naturally occurring proteins containing two or more such repeat units are referred to as "naturally occurring repeat proteins." The amino acid sequences of individual repeat units of a repeat protein, when compared to one another, may have a significant number of mutations, substitutions, additions, and/or deletions while still substantially retaining the general pattern or motif of the repeat unit.

术语“锚蛋白重复单元”是指这样的重复单元，它是例如Forrer等人（2003, 在上述引文中）所述的锚蛋白重复序列。锚蛋白重复序列是本领域技术人员熟知的。The term "ankyrin repeat unit" refers to a repeating unit which is an ankyrin repeat sequence as described, for example, in Forrer et al. (2003, loc. cit.) Ankyrin repeat sequences are well known to those skilled in the art.

术语“框架残基”是指重复单元的氨基酸残基，或重复模块的相应氨基酸残基，其参予折叠拓扑学，即其参予所述重复单元(或模块)的折叠或其参予与相邻单元(或模块)的相互作用。这类参予可以是与重复单元(或模块)中的其它残基的相互作用，或对α-螺旋或β-折叠中存在的多肽主链构象或形成线型多肽或环的氨基酸段的影响。The term "framework residue" refers to an amino acid residue of a repeating unit, or a corresponding amino acid residue of a repeating module, which participates in the folding topology, i.e., it participates in the folding of the repeating unit (or module) or participates in the interaction with adjacent units (or modules). Such participation can be an interaction with other residues in the repeating unit (or module), or an influence on the conformation of the polypeptide backbone present in an α-helix or β-sheet, or an amino acid stretch forming a linear polypeptide or a loop.

术语“靶标相互作用残基”是指重复单元的氨基酸残基，或重复模块的相应氨基酸残基，其参予与靶物质的相互作用。这类参予可以是与靶物质的直接相互作用，或对其它直接相互作用残基的影响，例如通过稳定重复单元(或模块)的多肽构象以允许或增强直接相互作用残基与所述靶标的相互作用。这类框架和靶标相互作用残基可以通过以下方法鉴定：对通过物理化学方法(诸如X-射线晶体学、NMR和/或CD光谱法)获得的结构数据进行分析，或与结构生物学和/或生物信息学领域的技术人员公知的已知的和相关的结构信息进行比较。The term "target interaction residue" refers to an amino acid residue of a repeating unit, or a corresponding amino acid residue of a repeating module, that participates in an interaction with a target substance. Such participation can be a direct interaction with the target substance, or an influence on other directly interacting residues, such as by stabilizing the polypeptide conformation of the repeating unit (or module) to allow or enhance the interaction of the directly interacting residue with the target. Such framework and target interaction residues can be identified by analyzing structural data obtained by physicochemical methods (such as X-ray crystallography, NMR and/or CD spectroscopy), or by comparing with known and relevant structural information known to those skilled in the art of structural biology and/or bioinformatics.

优选地，用于推断重复序列基序的重复单元是同源重复单元，其中所述重复单元包括相同的结构基序，并且其中所述重复单元的超过70%的框架残基是彼此同源的。优选地，所述重复单元的超过80%的框架残基是同源的。最优选地，所述重复单元的超过90%的框架残基是同源的。用于确定多肽之间同源性百分比的计算机程序(诸如Fasta、Blast或Gap)是本领域技术人员已知的。进一步优选地，用于推断重复序列基序的重复单元是从靶标上选择的重复结构域获得(如实施例1所述)并具有相同靶标特异性的同源重复单元。Preferably, the repeat unit for inferring the tumor-necrosis factor glycoprotein motif is a homologous repeat unit, wherein the repeat unit comprises the identical structural motif, and wherein the framework residues exceeding 70% of the repeat unit are homologous to each other. Preferably, the framework residues exceeding 80% of the repeat unit are homologous. Most preferably, the framework residues exceeding 90% of the repeat unit are homologous. Computer programs (such as Fasta, Blast or Gap) for determining homology percentage between polypeptides are well known to those skilled in the art. Further preferably, the repeat unit for inferring the tumor-necrosis factor glycoprotein motif is a homologous repeat unit obtained from the repeat domain selected on the target (as described in Example 1) and having the same target specificity.

术语“重复序列基序”是指从一个或多个重复单元或重复模块推断得出的氨基酸序列。优选地，所述重复单元或重复模块来自于对相同靶标具有结合特异性的重复结构域。这类重复序列基序包括框架残基位置和靶标相互作用残基位置。所述框架残基位置对应于重复单元(或模块)的框架残基的位置。同样，所述靶标相互作用残基位置对应于重复单元(或模块)的靶标相互作用残基的位置。重复序列基序包括固定位置和随机化位置。术语“固定位置”是指这样的重复序列基序中的氨基酸位置，其中所述位置被设置为特定氨基酸。最常见的是，这类固定位置对应于框架残基的位置和/或对某靶标特异性的靶标相互作用残基的位置。术语“随机化位置”表示这样的重复序列基序中的氨基酸位置，其中在所述氨基酸位置允许两个或更多个氨基酸，例如，其中允许任意的常见的二十种天然存在的氨基酸，或其中允许二十种天然存在氨基酸的大部分，诸如除了半胱氨酸以外的氨基酸，或除了甘氨酸、半胱氨酸和脯氨酸以外的氨基酸。最常见的是，这类随机化位置对应靶标相互作用残基的位置。但是，框架残基的一些位置也可以是随机化的。The term "repeat sequence motif" refers to an amino acid sequence inferred from one or more repeating units or repeating modules. Preferably, the repeating units or repeating modules are derived from repeating domains with binding specificity to the same target. Such repeating sequence motifs include framework residue positions and target interaction residue positions. The framework residue positions correspond to the positions of the framework residues of the repeating unit (or module). Similarly, the target interaction residue positions correspond to the positions of the target interaction residues of the repeating unit (or module). Repeating sequence motifs include fixed positions and randomized positions. The term "fixed position" refers to an amino acid position in such a repeating sequence motif, wherein the position is set to a specific amino acid. Most commonly, such fixed positions correspond to the positions of framework residues and/or target interaction residues specific to a certain target. The term "randomized position" refers to an amino acid position in such a repeating sequence motif, wherein two or more amino acids are allowed at the amino acid position, for example, wherein any of the twenty common naturally occurring amino acids are allowed, or wherein most of the twenty naturally occurring amino acids are allowed, such as amino acids other than cysteine, or amino acids other than glycine, cysteine and proline. Most commonly, such randomized positions correspond to target-interacting residues, however, some framework residues can also be randomized.

术语“折叠拓扑学”指所述重复单元或重复模块的三级结构。折叠拓扑学取决于形成α-螺旋或β-折叠的至少部分的氨基酸段，或者形成线型多肽或环，或者α-螺旋、β-折叠和/或线型多肽/环的任何组合的氨基酸段。The term "fold topology" refers to the tertiary structure of the repeating unit or repeating module. The fold topology is determined by the amino acid stretches that form at least part of an α-helix or β-sheet, or that form linear polypeptides or loops, or any combination of α-helices, β-sheets, and/or linear polypeptides/loops.

术语“连续”指这样的排列，其中重复单元或重复模块串联排列。在设计的重复蛋白中，有至少2个，通常为约2至6个，特别是至少约6个，经常是20个或更多个重复单元(或模块)。在大多数情况下，重复结构域的重复单元(或模块) 将表现高度的序列同一性(在对应位置的相同氨基酸残基)或序列相似性(氨基酸残基不同，但具有相似的物理化学性质)，以及一些氨基酸残基可能是非常保守的关键残基。但是，在重复结构域的不同重复单元(或模块)之间因氨基酸插入和/或缺失和/或取代所致的高度序列变异性将是可能的，只要维持重复单元(或模块)的共同折叠拓扑学。The term "continuously" refers to such arrangement, wherein repeat units or repeat modules are arranged in series. In the repeat protein of design, there are at least 2, generally about 2 to 6, particularly at least about 6, often 20 or more repeat units (or modules). In most cases, the repeat units (or modules) of the repeat domain will show a high degree of sequence identity (identical amino acid residues in corresponding positions) or sequence similarity (amino acid residues are different, but have similar physicochemical properties), and some amino acid residues may be very conservative key residues. However, between the different repeat units (or modules) in the repeat domain, the high degree of sequence variability due to amino acid insertion and/or disappearance and/or substitution will be possible, as long as the common folding topology of the repeat unit (or module) is maintained.

本领域技术人员熟知通过物理化学方式诸如X-射线晶体学、NMR或CD光谱学直接确定重复蛋白的折叠拓扑学的方法。在生物信息学领域充分建立了且本领域技术人员熟知鉴定和确定重复单元或重复序列基序或者鉴定包含此类重复单元或基序的相关蛋白家族的方法，诸如同源性检索(BLAST等)。精化初始重复序列基序的步骤可包含迭代过程。Methods for directly determining the folding topology of repeat proteins by physicochemical means such as X-ray crystallography, NMR or CD spectroscopy are well known to those skilled in the art. Methods for identifying and determining repeat units or repetitive sequence motifs or identifying families of related proteins comprising such repeat units or motifs are well established in the field of bioinformatics and are well known to those skilled in the art, such as homology searches (BLAST etc.). The step of refining the initial repetitive sequence motif may comprise an iterative process.

术语“重复模块”指设计的重复结构域的重复氨基酸序列，它们最初来自天然存在重复蛋白的重复单元。包含在重复结构域中的各重复模块来自天然存在重复蛋白家族或亚家族(如犰狳重复蛋白或锚蛋白重复蛋白的家族)的一种或多种重复单元。The term "repeat module" refers to the repeating amino acid sequence of a designed repeat domain that is originally derived from the repeating units of naturally occurring repeat proteins. Each repeat module contained in a repeat domain is derived from one or more repeating units of a naturally occurring repeat protein family or subfamily (e.g., the family of armadillo repeat proteins or ankyrin repeat proteins).

“重复模块”可包含具有存在于对应重复模块的所有拷贝中的氨基酸残基的位置(“固定位置”)和具有不同或“随机化”氨基酸残基的位置(“随机化位置”)。A "repeat module" may comprise positions with amino acid residues that are present in all copies of the corresponding repeat module ("fixed positions") and positions with different or "randomized" amino acid residues ("randomized positions").

根据本发明的结合蛋白包含至少一个锚蛋白重复结构域，其中所述重复结构域对哺乳动物血清白蛋白(xSA)具有结合特异性。The binding protein according to the present invention comprises at least one ankyrin repeat domain, wherein said repeat domain has binding specificity for mammalian serum albumin (xSA).

术语“对靶标具有结合特异性”、“特异性地结合靶标”或“靶标特异性”等是指，结合蛋白或结合域在PBS中与靶标结合的解离常数低于与诸如大肠杆菌麦芽糖结合蛋白(MBP)等无关蛋白结合的解离常数。优选地，在PBS中对靶标的解离常数比对MBP的相应解离常数低至少10倍、更优选10²倍、甚至更优选10³倍或最优选10⁴倍。The terms "having binding specificity for a target,""specifically binds to a target," or "target specificity" and the like mean that the binding protein or binding domain binds to the target with a dissociation constant in PBS that is lower than the dissociation constant for binding to an unrelated protein, such as E. coli maltose binding protein (MBP). Preferably, the dissociation constant for the target in PBS is at least 10-fold, more preferably 10 ^- fold, even more preferably 10 ^- fold, or most preferably 10- ^fold lower than the corresponding dissociation constant for MBP.

本发明的结合蛋白不是抗体或其片段，诸如Fab或scFv片段。本领域技术人员熟知抗体及其片段。The binding proteins of the present invention are not antibodies or fragments thereof, such as Fab or scFv fragments. Antibodies and fragments thereof are well known to those skilled in the art.

并且，本发明的结合域不包含免疫球蛋白折叠（如存在于抗体中）和/或III型纤连蛋白结构域。免疫球蛋白折叠是常见的全β蛋白折叠，由排列在两个β折叠中的约7条反向平行β-链的2层夹心组成。本领域技术人员熟知免疫球蛋白折叠。例如，此类包含免疫球蛋白折叠的结合域描述于WO 2007/080392或WO 2008/097497。Furthermore, the binding domains of the present invention do not comprise an immunoglobulin fold (such as found in antibodies) and/or a type III fibronectin domain. The immunoglobulin fold is a common all-beta protein fold consisting of a two-layer sandwich of approximately seven antiparallel beta strands arranged between two beta sheets. Those skilled in the art are familiar with the immunoglobulin fold. For example, such binding domains comprising an immunoglobulin fold are described in WO 2007/080392 or WO 2008/097497.

具体地，本发明涉及包含至少一个锚蛋白重复结构域的结合蛋白，其中所述锚蛋白重复结构域对哺乳动物血清白蛋白具有结合特异性，且其中所述锚蛋白重复结构域包含锚蛋白重复模块，所述锚蛋白重复模块具有选自下述的氨基酸序列：SEQ ID NO:49、50、51和52，和这样的序列，其中在SEQ ID NO:49、50、51和52中的最多9个氨基酸被任意氨基酸替换。In particular, the present invention relates to a binding protein comprising at least one ankyrin repeat domain, wherein the ankyrin repeat domain has binding specificity for mammalian serum albumin, and wherein the ankyrin repeat domain comprises an ankyrin repeat module having an amino acid sequence selected from the group consisting of SEQ ID NO: 49, 50, 51 and 52, and sequences wherein up to 9 amino acids in SEQ ID NO: 49, 50, 51 and 52 are replaced by any amino acid.

优选地，在SEQ ID NO:49、50、51和52中的最多8个氨基酸被其它氨基酸替换，在SEQ ID NO:49、50、51和52中更优选最多7个氨基酸、更优选最多6个氨基酸、更优选最多5个氨基酸、甚至更优选最多4个氨基酸、更优选最多3个氨基酸、更优选最多2个氨基酸、更优选最多1个氨基酸，和最优选零个氨基酸被替换。Preferably, a maximum of 8 amino acids in SEQ ID NOs: 49, 50, 51 and 52 are replaced by other amino acids, more preferably a maximum of 7 amino acids, more preferably a maximum of 6 amino acids, more preferably a maximum of 5 amino acids, even more preferably a maximum of 4 amino acids, more preferably a maximum of 3 amino acids, more preferably a maximum of 2 amino acids, more preferably a maximum of 1 amino acid, and most preferably zero amino acids are replaced in SEQ ID NOs: 49, 50, 51 and 52.

优选地，当SEQ ID NO:49、50、51和52中的氨基酸被替换时，这些氨基酸选自：A、D、E、F、H、I、K、L、M、N、Q、R、S、T、V、W和Y；更优选地选自：A、D、E、H、I、K、L、Q、R、S、T、V和Y。也优选地，氨基酸被同源氨基酸替换；即氨基酸被含有具有类似生物物理性质的侧链的氨基酸替换。例如，带负电荷的氨基酸D可以被带负电荷的氨基酸E替换，或者疏水氨基酸诸如L可以被A、I或V替换。同源氨基酸对氨基酸的替换是本领域技术人员熟知的。Preferably, when amino acids in SEQ ID NOs: 49, 50, 51, and 52 are replaced, these amino acids are selected from the group consisting of A, D, E, F, H, I, K, L, M, N, Q, R, S, T, V, W, and Y; more preferably, they are selected from the group consisting of A, D, E, H, I, K, L, Q, R, S, T, V, and Y. Also preferably, the amino acids are replaced with homologous amino acids; that is, the amino acids are replaced with amino acids having side chains with similar biophysical properties. For example, the negatively charged amino acid D can be replaced with the negatively charged amino acid E, or a hydrophobic amino acid such as L can be replaced with A, I, or V. The replacement of amino acids with homologous amino acids is well known to those skilled in the art.

优选的结合蛋白包含至少一个锚蛋白重复结构域，其中所述重复结构域在PBS中与xSA结合的解离常数(Kd) 低于10^-4M。优选地，所述重复结构域在PBS中与xSA结合的Kd低于10^-4M，更优选地低于10^-5M、10^-6M、10^-7M，或最优选地低于10^-8M。Preferred binding proteins comprise at least one ankyrin repeat domain, wherein the repeat domain binds to xSA in PBS with a dissociation constant (Kd) of less than 10 ⁻⁴ M. Preferably, the repeat domain binds to xSA in PBS with a Kd of less than 10 ⁻⁴ M, more preferably less than 10 ⁻⁵ M, 10 ⁻⁶ M, 10 ⁻⁷ M, or most preferably less than 10 ⁻⁸ M.

用于确定蛋白-蛋白相互作用的解离常数的方法，诸如基于表面等离子体共振(SPR)的技术(例如SPR平衡分析)或等温滴定量热法(ITC)，是本领域技术人员熟知的。如果在不同条件(例如，盐浓度、pH)下测量，测得的特定蛋白-蛋白相互作用的Kd值可以变化。因而，优选地用标准化的蛋白溶液和标准化的缓冲液（诸如PBS），进行Kd值的测量。Methods for determining the dissociation constant of protein-protein interactions, such as techniques based on surface plasmon resonance (SPR) (e.g., SPR equilibrium analysis) or isothermal titration calorimetry (ITC), are well known to those skilled in the art. If measured under different conditions (e.g., salt concentration, pH), the Kd value of a specific protein-protein interaction measured may vary. Thus, Kd values are preferably measured using standardized protein solutions and standardized buffers (such as PBS).

在实施例中显示了包含锚蛋白重复结构域的结合蛋白，所述锚蛋白重复结构域在PBS中以低于10^-4M的Kd与xSA结合。In the Examples, binding proteins comprising ankyrin repeat domains are shown that bind to xSA with a Kd below 10 ⁻⁴ M in PBS.

本发明的结合蛋白的锚蛋白重复结构域会结合xSA。优选的是，包含与人血清白蛋白(HSA)结合的锚蛋白重复结构域的结合蛋白。The ankyrin repeat domain of the binding protein of the present invention binds to xSA. Preferably, the binding protein comprises ankyrin repeat domain that binds to human serum albumin (HSA).

进一步优选的是，包含70-300个氨基酸、尤其是100-200个氨基酸的结合域。Further preferred are binding domains comprising 70-300 amino acids, in particular 100-200 amino acids.

进一步优选的是，不含游离Cys残基的结合蛋白或结合域。“游离Cys残基”不参与二硫键的形成。甚至更优选的是，不含任何Cys残基的结合蛋白或结合域。Further preferred are binding proteins or binding domains that do not contain free Cys residues. "Free Cys residues" do not participate in the formation of disulfide bonds. Even more preferred are binding proteins or binding domains that do not contain any Cys residues.

本发明的结合域是锚蛋白重复结构域或设计的锚蛋白重复结构域(Binz等人,2004, 在上述引文中)，优选地如在WO 2002/020565中所述。设计的锚蛋白重复结构域的实例显示在实施例中。The binding domain of the present invention is an ankyrin repeat domain or a designed ankyrin repeat domain (Binz et al., 2004, loc. cit.), preferably as described in WO 2002/020565. Examples of designed ankyrin repeat domains are shown in the Examples.

优选地，在本发明的结合蛋白中的这种锚蛋白重复结构域包含与选自下述的一个锚蛋白重复结构域具有至少70% 氨基酸序列同一性的氨基酸序列：SEQ ID NO: 21、27和46；优选地27和46。如上所定义的，所述锚蛋白重复结构域在PBS中与xSA结合的解离常数(Kd)低于10^-4M。优选地，所述重复结构域在PBS中与xSA结合的Kd低于10^-4M，更优选地低于10^-5M、10^-6M、10^-7M，或最优选地低于10^-8M。Preferably, the ankyrin repeat domain in the binding protein of the present invention comprises an amino acid sequence having at least 70% amino acid sequence identity with an ankyrin repeat domain selected from the group consisting of SEQ ID NOs: 21, 27 and 46; preferably 27 and 46. As defined above, the ankyrin repeat domain binds to xSA in PBS with a dissociation constant (Kd) of less than 10 ⁻⁴ M. Preferably, the repeat domain binds to xSA in PBS with a Kd of less than 10 ⁻⁴ M, more preferably less than 10 ⁻⁵ M, 10 ⁻⁶ M, 10 ⁻⁷ M, or most preferably less than 10 ⁻⁸ M.

优选地，在本发明的结合蛋白中的这种锚蛋白重复结构域包含与在选自下述的锚蛋白重复结构域中的“随机化重复单元”或“随机化位置”具有至少70%氨基酸序列同一性的氨基酸序列：SEQ ID NO: 17-31和43-48。Preferably, such ankyrin repeat domain in the binding protein of the present invention comprises an amino acid sequence having at least 70% amino acid sequence identity with a “randomized repeat unit” or a “randomized position” in an ankyrin repeat domain selected from the group consisting of SEQ ID NOs: 17-31 and 43-48.

优选地，作为70% 氨基酸序列同一性的替代，在本发明的结合蛋白中的这种锚蛋白重复结构域具有至少75%、更优选至少80%、更优选至少85%、更优选至少90%或最优选至少95% 的氨基酸序列同一性的氨基酸序列。Preferably, instead of 70% amino acid sequence identity, such ankyrin repeat domains in the binding proteins of the present invention have an amino acid sequence with at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90% or most preferably at least 95% amino acid sequence identity.

在一个具体实施方案中，对哺乳动物血清白蛋白具有结合特异性的结合蛋白（通过替换SEQ ID NO:49、50、51和52的锚蛋白重复模块中的最多9个氨基酸来定义，或者通过与选自SEQ ID NO: 17-31和43-48的一个锚蛋白重复结构域具有至少70% 氨基酸序列同一性来定义）在哺乳动物中的终末血浆半衰期比不与哺乳动物血清白蛋白结合的对应结合蛋白（例如SEQ ID NO:32的锚蛋白重复结构域）高至少5倍。在这样的优选的结合蛋白中，在人类中的最小终末血浆半衰期是至少1天，更优选至少3天，甚至更优选至少5天。In a specific embodiment, a binding protein with binding specificity for mammalian serum albumin (defined by replacing up to 9 amino acids in the ankyrin repeat module of SEQ ID NOs: 49, 50, 51, and 52, or by having at least 70% amino acid sequence identity with an ankyrin repeat domain selected from SEQ ID NOs: 17-31 and 43-48) has a terminal plasma half-life in mammals that is at least 5-fold greater than that of a corresponding binding protein that does not bind to mammalian serum albumin (e.g., the ankyrin repeat domain of SEQ ID NO: 32). In such preferred binding proteins, the minimum terminal plasma half-life in humans is at least 1 day, more preferably at least 3 days, and even more preferably at least 5 days.

在另一个实施方案中，本发明涉及这样的结合蛋白，其中所述锚蛋白重复结构域包含具有下述锚蛋白重复序列基序的重复模块：X₁DX₂X₃X₄X₅TPLHLAAX₆X₇GHLX₈IVEVLLKX₉GADVNA (SEQ ID NO:53)In another embodiment, the present invention relates to a binding protein, wherein the ankyrin repeat domain comprises a repeat module having the following ankyrin repeat motif: X ₁ DX ₂ X ₃ X ₄ X ₅ TPLHLAAX ₆ X ₇ GHLX ₈ IVEVLLKX ₉ GADVNA (SEQ ID NO: 53)

其中X₁、X₂、X₃、X₄、X₅、X₆、X₇、X₈和X₉、彼此独立地代表选自下述的氨基酸残基：A、D、E、F、H、I、K、L、M、N、Q、R、S、T、V、W和Y；wherein X ₁ , X ₂ , X ₃ , X ₄ , X ₅ , X ₆ , X ₇ , X ₈ and X ₉ independently represent an amino acid residue selected from the group consisting of A, D, E, F, H, I, K, L, M, N, Q, R, S, T, V, W and Y;

优选地，其中Preferably, wherein

X₁代表选自下述的氨基酸残基：A、D、M、F、S、I、T、N、Y和K；更优选K和A； _X1 represents an amino acid residue selected from the group consisting of A, D, M, F, S, I, T, N, Y and K; more preferably K and A;

X₂代表选自下述的氨基酸残基：E、K、D、F、M、N、I和Y；更优选I、E和Y； _X2 represents an amino acid residue selected from the group consisting of E, K, D, F, M, N, I and Y; more preferably I, E and Y;

X₃代表选自下述的氨基酸残基：W、R、N、T、H、K、A和F；更优选W、R和F； _X3 represents an amino acid residue selected from the group consisting of W, R, N, T, H, K, A and F; more preferably W, R and F;

X₄代表选自下述的氨基酸残基：G和S； _X4 represents an amino acid residue selected from the group consisting of G and S;

X₅代表选自下述的氨基酸残基：N、T和H； _X5 represents an amino acid residue selected from the group consisting of N, T and H;

X₆代表选自下述的氨基酸残基：N、V和R；X ₆ represents an amino acid residue selected from the group consisting of N, V and R;

X₇代表选自下述的氨基酸残基：E、Y、N、D、H、S、A、Q、T和G；更优选E、Y和N； _X7 represents an amino acid residue selected from the group consisting of E, Y, N, D, H, S, A, Q, T and G; more preferably E, Y and N;

X₈代表选自下述的氨基酸残基：E和K；X ₈ represents an amino acid residue selected from the group consisting of E and K;

X₉代表选自下述的氨基酸残基：S、A、Y、H和N；更优选Y和H；且 _X9 represents an amino acid residue selected from the group consisting of S, A, Y, H and N; more preferably Y and H; and

其中任选地最多5个在SEQ ID NO:53中用X表示的位置以外的氨基酸被任意氨基酸替换。wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 53 are replaced by any amino acid.

具体地，本发明涉及这样的结合蛋白，其中所述锚蛋白重复结构域包含具有下述锚蛋白重复序列基序的重复模块：X₁DX₂X₃GX₄TPLHLAAX₅X₆GHLEIVEVLLKX₇GADVNA (SEQ IDNO:10)In particular, the present invention relates to such binding proteins, wherein the ankyrin repeat domain comprises a repeat module having the following ankyrin repeat sequence motif: X ₁ DX ₂ X ₃ GX ₄ TPLHLAAX ₅ X ₆ GHLEIVEVLLKX ₇ GADVNA (SEQ ID NO: 10)

其中in

X₁代表选自下述的氨基酸残基：A、D、M、F、S、I、T、N、Y和K；优选K和A； _X1 represents an amino acid residue selected from the group consisting of A, D, M, F, S, I, T, N, Y and K; preferably K and A;

X₂代表选自下述的氨基酸残基：E、K、D、F、M、N、I和Y；优选I、E和Y； _X2 represents an amino acid residue selected from the group consisting of E, K, D, F, M, N, I and Y; preferably I, E and Y;

X₃代表选自下述的氨基酸残基：W、R、N、T、H、K、A和F；优选W、R和F； _X3 represents an amino acid residue selected from the group consisting of W, R, N, T, H, K, A and F; preferably W, R and F;

X₄代表选自下述的氨基酸残基：N、T和H； _X4 represents an amino acid residue selected from the group consisting of N, T and H;

X₅代表选自下述的氨基酸残基：N、V和R；X ₅ represents an amino acid residue selected from the group consisting of N, V and R;

X₆代表选自下述的氨基酸残基：E、Y、N、D、H、S、A、Q、T和G；优选E、Y和N；X ₆ represents an amino acid residue selected from the group consisting of E, Y, N, D, H, S, A, Q, T and G; preferably E, Y and N;

X₇代表选自下述的氨基酸残基：S、A、Y、H和N；优选Y和H；且 _X7 represents an amino acid residue selected from the group consisting of S, A, Y, H and N; preferably Y and H; and

其中任选地最多5个在SEQ ID NO:10中用X表示的位置以外的氨基酸被任意氨基酸替换。wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 10 are replaced by any amino acid.

在另一个实施方案中，本发明涉及这样的结合蛋白，其中所述锚蛋白重复结构域包含具有下述锚蛋白重复序列基序的重复模块：In another embodiment, the present invention relates to a binding protein, wherein the ankyrin repeat domain comprises a repeat module having the following ankyrin repeat motif:

X₁DYFX₂HTPLHLAARX₃X₄HLX₅IVEVLLKX₆GADVNA (SEQ ID NO:11)X ₁ DYFX ₂ HTPLHLAARX ₃ X ₄ HLX ₅ IVEVLLKX ₆ GADVNA (SEQ ID NO:11)

其中in

X₁代表选自下述的氨基酸残基：D、K和A；优选地K和A； _X1 represents an amino acid residue selected from the group consisting of D, K and A; preferably K and A;

X₂代表选自下述的氨基酸残基：D、G和S；优选地G和S； _X2 represents an amino acid residue selected from the group consisting of D, G and S; preferably G and S;

X₃代表选自下述的氨基酸残基：E、N、D、H、S、A、Q、T和G；优选地Q、D和N；更优选Q和N；X ₃ represents an amino acid residue selected from the group consisting of E, N, D, H, S, A, Q, T and G; preferably Q, D and N; more preferably Q and N;

X₄代表选自下述的氨基酸残基：G和D； _X4 represents an amino acid residue selected from the group consisting of G and D;

X₅代表选自下述的氨基酸残基：E、K和G；优选地E和K； _X5 represents an amino acid residue selected from the group consisting of E, K and G; preferably E and K;

X₆代表选自下述的氨基酸残基：H、Y、A和N；优选地H、A和Y；更优选A和Y；且X ₆ represents an amino acid residue selected from the group consisting of H, Y, A and N; preferably H, A and Y; more preferably A and Y; and

其中任选地最多5个在SEQ ID NO:11中用X表示的位置以外的氨基酸被任意氨基酸替换。wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 11 are replaced by any amino acid.

X₁DFX₂G X₃TPLHLAAX₄X₅GHLEIVEVLLKX₆GADVNA (SEQ ID NO:54)X ₁ DFX ₂ GX ₃ TPLHLAAX ₄ X ₅ GHLEIVEVLLKX ₆ GADVNA (SEQ ID NO:54)

其中in

X₁代表选自下述的氨基酸残基：F、S、L和K；优选S和K； _X1 represents an amino acid residue selected from the group consisting of F, S, L and K; preferably S and K;

X₂代表选自下述的氨基酸残基：V和A； _X2 represents an amino acid residue selected from the group consisting of: V and A;

X₃代表选自下述的氨基酸残基：R和K；X ₃ represents an amino acid residue selected from the group consisting of R and K;

X₄代表选自下述的氨基酸残基：S和N； _X4 represents an amino acid residue selected from the group consisting of S and N;

X₅代表选自下述的氨基酸残基：N、D、Q、S、A、T和E；优选地D和Q；X ₅ represents an amino acid residue selected from the group consisting of N, D, Q, S, A, T and E; preferably D and Q;

X₆代表选自下述的氨基酸残基：A、H、Y、S和N；优选A和H；且X ₆ represents an amino acid residue selected from the group consisting of A, H, Y, S and N; preferably A and H; and

其中任选地最多5个在SEQ ID NO:54中用X表示的位置以外的氨基酸被任意氨基酸替换。wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 54 are replaced by any amino acid.

具体地，本发明涉及这样的结合蛋白，其中所述锚蛋白重复结构域包含具有下述锚蛋白重复序列基序的重复模块：In particular, the present invention relates to binding proteins, wherein the ankyrin repeat domain comprises a repeat module having the following ankyrin repeat motif:

X₁DFX₂G X₃TPLHLAAX₄DGHLEIVEVLLKX₅GADVNA (SEQ ID NO:12)X ₁ DFX ₂ GX ₃ TPLHLAAX ₄ DGHLEIVEVLLKX ₅ GADVNA (SEQ ID NO:12)

其中in

X₁代表选自下述的氨基酸残基：F、S、L和K；优选地S和K； _X1 represents an amino acid residue selected from the group consisting of F, S, L and K; preferably S and K;

X₅代表选自下述的氨基酸残基：A、H、Y、S和N；优选地A和H；且 _X5 represents an amino acid residue selected from the group consisting of A, H, Y, S and N; preferably A and H; and

其中任选地最多5个在SEQ ID NO:12中用X表示的位置以外的氨基酸被任意氨基酸替换。wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 12 are replaced by any amino acid.

优选的是这样的结合蛋白，其中所述锚蛋白重复结构域包含具有SEQ ID NO:12的锚蛋白重复序列基序的重复模块，该重复模块前面是具有SEQ ID NO:11的锚蛋白重复序列基序的重复模块。Preferred is a binding protein, wherein the ankyrin repeat domain comprises a repeat module having ankyrin repeat motif of SEQ ID NO:12, which is preceded by a repeat module having ankyrin repeat motif of SEQ ID NO:11.

X₁DX₂X₃GTTPLHLAAVYGHLEX₄VEVLLKX₅GADVNA (SEQ ID NO:13)X ₁ DX ₂ X ₃ GTTPLHLAAVYGHLEX ₄ VEVLLKX ₅ GADVNA (SEQ ID NO:13)

其中in

X₁代表选自下述的氨基酸残基：K、A、D、M、F、S、I、T、N和Y；优选地K和A； _X1 represents an amino acid residue selected from the group consisting of K, A, D, M, F, S, I, T, N and Y; preferably K and A;

X₂代表选自下述的氨基酸残基：E、K、D、F、M、N和Y；优选地E、D和Y； _X2 represents an amino acid residue selected from the group consisting of E, K, D, F, M, N and Y; preferably E, D and Y;

X₃代表选自下述的氨基酸残基：R、N、T、H、K、A和F；优选地R和F； _X3 represents an amino acid residue selected from the group consisting of R, N, T, H, K, A and F; preferably R and F;

X₄代表选自下述的氨基酸残基：I和M； _X4 represents an amino acid residue selected from the group consisting of I and M;

X₅代表选自下述的氨基酸残基：H、Y、K、A和N；优选地K和A；且 _X5 represents an amino acid residue selected from the group consisting of H, Y, K, A and N; preferably K and A; and

其中任选地最多5个在SEQ ID NO:13中用X表示的位置以外的氨基酸被任意氨基酸替换。wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 13 are replaced by any amino acid.

X₁NETGYTPLHLADSSGHX₂EIVEVLLKX₃X₄X₅DX₆NA (SEQ ID NO:14)X ₁ NETGYTPLHLADSSGHX ₂ EIVEVLLKX ₃ X ₄ X ₅ DX ₆ NA (SEQ ID NO:14)

其中in

X₁代表选自下述的氨基酸残基：Q和K； _X1 represents an amino acid residue selected from the group consisting of Q and K;

X₂代表选自下述的氨基酸残基：L和P； _X2 represents an amino acid residue selected from the group consisting of L and P;

X₃代表选自下述的氨基酸残基：H、N、Y和A；优选地H和A； _X3 represents an amino acid residue selected from the group consisting of H, N, Y and A; preferably H and A;

X₅代表选自下述的氨基酸残基：A、V、T和S；优选地S和A；X ₅ represents an amino acid residue selected from the group consisting of A, V, T and S; preferably S and A;

X₆代表选自下述的氨基酸残基：V和F；和其中任选地最多5个在SEQ ID NO:14中用X表示的位置以外的氨基酸被任意氨基酸替换。X ₆ represents an amino acid residue selected from the group consisting of V and F; and wherein optionally up to 5 amino acids other than the positions indicated by X in SEQ ID NO: 14 are replaced by any amino acid.

优选的是这样的结合蛋白，其中所述锚蛋白重复结构域包含具有SEQ ID NO:14的锚蛋白重复序列基序的重复模块，该重复模块前面是具有SEQ ID NO:13的锚蛋白重复序列基序的重复模块。Preferred is a binding protein, wherein the ankyrin repeat domain comprises a repeat module having ankyrin repeat motif of SEQ ID NO:14, which is preceded by a repeat module having ankyrin repeat motif of SEQ ID NO:13.

术语“加帽模块”指与重复结构域的N-端或C-端重复模块融合的多肽，其中所述加帽模块与所述重复模块形成紧密的三级相互作用(即三级结构相互作用)，从而提供帽，其将所述重复模块的疏水核心在不与连续重复模块接触的侧部与溶剂隔开。所述N-端和/或C-端加帽模块可以是，或者可来自，存在于天然存在的重复蛋白中邻近重复单元的加帽单元或其它结构单元。术语“加帽单元”指天然存在的折叠多肽，其中所述多肽限定在N-端或C-端与重复单元融合的特定结构单元，其中所述多肽与所述重复单元形成紧密的三级结构相互作用，从而提供帽，其将所述重复单元的疏水核心在一侧与溶剂隔开。优选地，加帽模块或加帽单元是加帽重复序列。术语“加帽重复序列”表示这样的加帽模块或加帽单元：其具有与所述邻近重复单元(或模块)类似或相同的折叠、和/或与所述邻近重复单元(或模块)的序列相似性。加帽模块和加帽重复序列参见：WO 2002/020565，和Interlandi等人,2008 (在上述引文中)。例如，WO 2002/020565描述了具有下述氨基酸序列的N-端加帽模块(即加帽重复序列)：The term "capping module" refers to a polypeptide fused to the N-terminal or C-terminal repeat module of a repeat domain, wherein the capping module forms a tight tertiary interaction (i.e., a tertiary structural interaction) with the repeat module, thereby providing a cap that separates the hydrophobic core of the repeat module from the solvent on the side that is not in contact with the consecutive repeat module. The N-terminal and/or C-terminal capping module can be, or can be derived from, a capping unit or other structural unit present in a naturally occurring repeat protein adjacent to the repeat unit. The term "capping unit" refers to a naturally occurring folded polypeptide, wherein the polypeptide is defined as a specific structural unit fused to a repeat unit at the N-terminus or C-terminus, wherein the polypeptide forms a tight tertiary structural interaction with the repeat unit, thereby providing a cap that separates the hydrophobic core of the repeat unit from the solvent on one side. Preferably, the capping module or capping unit is a capping repeat sequence. The term "capping repeat sequence" refers to a capping module or capping unit that has a similar or identical fold to the adjacent repeat unit (or module) and/or sequence similarity to the adjacent repeat unit (or module). Capping modules and capping repeats are described in WO 2002/020565 and Interlandi et al., 2008 (cited above). For example, WO 2002/020565 describes an N-terminal capping module (i.e., a capping repeat) having the following amino acid sequence:

GSDLGKKLLEAARAGQDDEVRILMANGADVNA (SEQ ID NO:1)，和GSDLGKKLLEAARAGQDDEVRILMANGADVNA (SEQ ID NO:1), and

具有下述氨基酸序列的C-端加帽模块(即加帽重复序列)A C-terminal capping module (i.e., capping repeat sequence) having the following amino acid sequence:

QDKFGKTAFDISIDNGNEDLAEILQKLN (SEQ ID NO:2)。QDKFGKTAFDISIDNGNEDLAEILQKLN (SEQ ID NO:2).

等人, 2008 (在上述引文中) 描述了具有下述氨基酸序列的C-端加帽模块：QDKFGKTPFDLAIREGHEDIAEVLQKAA (SEQ ID NO:3)和QDKFGKTPFDLAIDNGNEDIAEVLQKAA (SEQID NO:4)。et al., 2008 (loc. cit.) described C-terminal capping modules having the following amino acid sequences: QDKFGKTPFDLAIREGHEDIAEVLQKAA (SEQ ID NO: 3) and QDKFGKTPFDLAIDNGNEDIAEVLQKAA (SEQ ID NO: 4).

例如，SEQ ID NO:17的N-端加帽模块由1-32位的氨基酸编码，SEQ ID NO:17的C-端加帽模块由99-126位的氨基酸编码。For example, the N-terminal capping module of SEQ ID NO: 17 is encoded by amino acids 1-32, and the C-terminal capping module of SEQ ID NO: 17 is encoded by amino acids 99-126.

一种优选的N-端加帽模块包含下述序列基序A preferred N-terminal capping module comprises the following sequence motif

X₁LX₂KKLLEAARAGQDDEVRILX₃AX₄GADVNA (SEQ ID NO:5)X ₁ LX ₂ KKLLEAARAGQDDEVRILX ₃ AX ₄ GADVNA (SEQ ID NO:5)

其中X₁代表氨基酸残基G、A或D；wherein X ₁ represents an amino acid residue G, A or D;

其中X₂代表氨基酸残基G或D；Wherein X ₂ represents amino acid residue G or D;

其中X₃代表氨基酸残基L、V、I、A或M；优选地，L或M；且wherein X ₃ represents an amino acid residue L, V, I, A or M; preferably, L or M; and

其中X₄代表氨基酸残基A、H、Y、K、R或N；优选地，A或N。wherein X ₄ represents an amino acid residue A, H, Y, K, R or N; preferably, A or N.

进一步优选的是，任意这样的包含N-端加帽重复序列的N-端加帽模块，其中所述加帽重复序列中的一个或多个氨基酸残基被特定氨基酸残基替换，所述特定氨基酸残基在比对相应加帽单元或重复单元时存在于对应位置处。Further preferred is any such N-terminal capping module comprising an N-terminal capping repeat sequence, wherein one or more amino acid residues in the capping repeat sequence are replaced by specific amino acid residues that are present at corresponding positions when aligning the corresponding capping units or repeat units.

一种优选的C-端加帽模块包含下述序列基序A preferred C-terminal capping module comprises the following sequence motif

X₁DKX₂GKTX₃X₄D X₅X₆X₇DX₈GX₉EDX₁₀AEX₁₁LQKAA (SEQ ID NO:6)X ₁ DKX ₂ GKTX ₃ X ₄ DX ₅ X ₆ X ₇ DX ₈ GX ₉ EDX ₁₀ AEX ₁₁ LQKAA (SEQ ID NO:6)

其中X₁代表氨基酸残基Q或K；Wherein X ₁ represents amino acid residue Q or K;

其中X₂代表氨基酸残基A、S或F；优选地，S或F；wherein X ₂ represents an amino acid residue A, S or F; preferably, S or F;

其中X₃代表氨基酸残基A或P；wherein X ₃ represents an amino acid residue A or P;

其中X₄代表氨基酸残基A或F；wherein X ₄ represents amino acid residue A or F;

其中X₅代表氨基酸残基I或L；wherein X ₅ represents amino acid residue I or L;

其中X₆代表氨基酸残基S或A；wherein X ₆ represents an amino acid residue S or A;

其中X₇代表氨基酸残基I或A；wherein X ₇ represents amino acid residue I or A;

其中X₈代表氨基酸残基A、E或N；优选地，A或N；wherein X ₈ represents an amino acid residue A, E or N; preferably, A or N;

其中X₉代表氨基酸残基N或H；wherein X ₉ represents an amino acid residue N or H;

其中X₁₀代表氨基酸残基L或I；wherein X ₁₀ represents amino acid residue L or I;

其中X₁₁代表氨基酸残基I或V；且wherein X ₁₁ represents amino acid residue I or V; and

其中如果X₄代表F、且X₇代表I、且X₈代表N或E，那么X₂不代表F。If _X4 represents F, _X7 represents I, and _X8 represents N or E, then _X2 does not represent F.

另一种优选的C-端加帽模块包含下述序列基序Another preferred C-terminal capping module comprises the following sequence motif

X₁DKX₂GKTX₃ADX₄X₅X₆DX₇GX₈EDX₉AEX₁₀LQKAA (SEQ ID NO:7)X ₁ DKX ₂ GKTX ₃ ADX ₄ X ₅ X ₆ DX ₇ GX ₈ EDX ₉ AEX ₁₀ LQKAA (SEQ ID NO:7)

其中X₄代表氨基酸残基I或L；wherein X ₄ represents amino acid residue I or L;

其中X₅代表氨基酸残基S或A；wherein X ₅ represents an amino acid residue S or A;

其中X₆代表氨基酸残基I或A；wherein X ₆ represents amino acid residue I or A;

其中X₇代表氨基酸残基A、E或N；优选地，A或N；wherein X ₇ represents an amino acid residue A, E or N; preferably, A or N;

其中X₈代表氨基酸残基N或H；wherein X ₈ represents an amino acid residue N or H;

其中X₉代表氨基酸残基L或I；且wherein X ₉ represents an amino acid residue L or I; and

其中X₁₀代表氨基酸残基I或V。wherein _X10 represents amino acid residue I or V.

X₁DKX₂GKTX₃ADX₄X₅ADX₆GX₇EDX₈AEX₉LQKAA (SEQ ID NO:8)X ₁ DKX ₂ GKTX ₃ ADX ₄ X ₅ ADX ₆ GX ₇ EDX ₈ AEX ₉ LQKAA (SEQ ID NO:8)

其中X₆代表氨基酸残基A、E或N；优选地，A或N；wherein X ₆ represents an amino acid residue A, E or N; preferably, A or N;

其中X₇代表氨基酸残基N或H；wherein X ₇ represents an amino acid residue N or H;

其中X₈代表氨基酸残基L或I；且wherein X ₈ represents an amino acid residue L or I; and

其中X₉代表氨基酸残基I或V。wherein X ₉ represents amino acid residue I or V.

优选地，这样的包含SEQ ID NO:6、7或8的序列基序的C-端加帽模块在与所述序列基序的3位相对应的位置处具有氨基酸残基A、I或K；优选，I或K。Preferably, such a C-terminal capping module comprising the sequence motif of SEQ ID NO: 6, 7 or 8 has an amino acid residue A, I or K at the position corresponding to position 3 of said sequence motif; preferably, I or K.

也优选地，这样的包含SEQ ID NO:6、7或8的序列基序的C-端加帽模块在与所述序列基序的14位相对应的位置处具有氨基酸残基R或D。Also preferably, such a C-terminal capping module comprising the sequence motif of SEQ ID NO: 6, 7 or 8 has an amino acid residue R or D at the position corresponding to position 14 of said sequence motif.

一种优选的C-端加帽模块是具有下述氨基酸序列的C-端加帽模块：QDKSGKTPADLAADAGHEDIAEVLQKAA (SEQ ID NO:9)。A preferred C-terminal capping module is a C-terminal capping module having the following amino acid sequence: QDKSGKTPADLAADAGHEDIAEVLQKAA (SEQ ID NO: 9).

进一步优选的是，具有SEQ ID NO:9的氨基酸序列的C-端加帽模块，其中Further preferred is a C-terminal capping module having the amino acid sequence of SEQ ID NO: 9, wherein

在1位的氨基酸残基是Q或K；The amino acid residue at position 1 is Q or K;

在4位的氨基酸残基是S或F；The amino acid residue at position 4 is S or F;

在9位的氨基酸残基是A或F；The amino acid residue at position 9 is A or F;

在13位的氨基酸残基是A或I；The amino acid residue at position 13 is A or I;

在15位的氨基酸残基是A、E或N；且The amino acid residue at position 15 is A, E, or N; and

其中所述C-端加帽模块不具有SEQ ID NO:2、3或4的氨基酸序列。wherein the C-terminal capping module does not have the amino acid sequence of SEQ ID NO: 2, 3 or 4.

进一步优选的是这样的C-端加帽模块，其具有在与SEQ ID NO:9或2比对时具有至少70%、优选至少75%、80%、85%、90%或最优选至少95% 氨基酸序列同一性的氨基酸序列。优选地，所述C-端加帽模块在比对时在与SEQ ID:9的4位相对应的位置处的氨基酸残基是S，所述C-端加帽模块在比对时在与SEQ ID:9的9位相对应的位置处的氨基酸残基是A，所述C-端加帽模块在比对时在与SEQ ID:9的13位相对应的位置处的氨基酸残基是A，和/或所述C-端加帽模块在比对时在与SEQ ID:9的15位相对应的位置处的氨基酸残基是A。进一步优选地，所述C-端加帽模块在比对时在与SEQ ID:9的9位相对应的位置处的氨基酸残基是A，和/或所述C-端加帽模块在比对时在与SEQ ID:9的13位相对应的位置处的氨基酸残基是A。也优选地，所述C-端加帽模块包含28个氨基酸。Further preferred is a C-terminal capping module having an amino acid sequence having at least 70%, preferably at least 75%, 80%, 85%, 90% or most preferably at least 95% amino acid sequence identity when aligned with SEQ ID NO: 9 or 2. Preferably, the amino acid residue at the position of the C-terminal capping module corresponding to position 4 of SEQ ID: 9 when aligned is S, the amino acid residue at the position of the C-terminal capping module corresponding to position 9 of SEQ ID: 9 when aligned is A, the amino acid residue at the position of the C-terminal capping module corresponding to position 13 of SEQ ID: 9 when aligned is A, and/or the amino acid residue at the position of the C-terminal capping module corresponding to position 15 of SEQ ID: 9 when aligned is A. Further preferably, the amino acid residue at the position of the C-terminal capping module corresponding to position 9 of SEQ ID: 9 when aligned is A, and/or the amino acid residue at the position of the C-terminal capping module corresponding to position 13 of SEQ ID: 9 when aligned is A. Also preferably, the C-terminal capping module comprises 28 amino acids.

进一步优选的是这样的C-端加帽模块，其具有SEQ ID NO:2或9的氨基酸序列，其中Further preferred is a C-terminal capping module having an amino acid sequence of SEQ ID NO: 2 or 9, wherein

所述C-端加帽模块的一个或多个氨基酸残基被特定氨基酸替换，所述特定氨基酸在比对对应的C-端加帽重复序列或加帽单元时存在于对应位置处，且其中One or more amino acid residues of the C-terminal capping module are replaced by specific amino acids that are present at corresponding positions when aligning corresponding C-terminal capping repeat sequences or capping units, and wherein

在4位的氨基酸残基是S；The amino acid residue at position 4 is S;

在9位的氨基酸残基是A；The amino acid residue at position 9 is A;

在13位的氨基酸残基是A；和/或者The amino acid residue at position 13 is A; and/or

在15位的氨基酸残基是A。The amino acid residue at position 15 is A.

优选地，所述C-端加帽模块的最多30%的氨基酸残基被替换，更优选最多20%和甚至更优选最多10% 的氨基酸残基被替换。也优选地，这样的C-端加帽模块是天然存在的C-端加帽重复序列。Preferably, at most 30% of the amino acid residues of the C-terminal capping module are replaced, more preferably at most 20% and even more preferably at most 10% of the amino acid residues are replaced. Also preferably, such a C-terminal capping module is a naturally occurring C-terminal capping repeat sequence.

还优选的是这样的C-端加帽模块，其包含基于SEQ ID NO:9的任意上述C-端加帽模块的1-25位或1-26位的氨基酸。Also preferred is a C-terminal capping module comprising amino acids 1-25 or 1-26 of any of the above C-terminal capping modules based on SEQ ID NO: 9.

进一步优选的是这样的C-端加帽模块，其具有不包含氨基酸N和随后的G的氨基酸序列。Further preferred is a C-terminal capping module having an amino acid sequence which does not comprise the amino acid N followed by G.

还优选的是这样的C-端加帽模块，其与基于SEQ ID NO:9的任意上述C-端加帽模块或与SEQ ID NO:9本身具有至少70%、优选至少75%、80%、85%、90%、或最优选至少95% 氨基酸序列同一性。Also preferred are C-terminal capping modules that have at least 70%, preferably at least 75%, 80%, 85%, 90%, or most preferably at least 95% amino acid sequence identity with any of the above C-terminal capping modules based on SEQ ID NO: 9 or with SEQ ID NO: 9 itself.

进一步优选的是这样的C-端加帽模块，其与SEQ ID NO:2或9具有至少70%、优选至少75%、80%、85%、90%或最优选至少95% 氨基酸序列同一性，且其中所述C-端加帽模块具有在9位处的氨基酸A；优选地，所述C-端加帽模块具有在9和13位处的氨基酸A；更优选地，所述C-端加帽模块具有在9、13和15位处的氨基酸A；最优选地，所述C-端加帽模块具有在9、13和15位处的氨基酸A和在4位处的氨基酸S。Further preferred is a C-terminal capping module having at least 70%, preferably at least 75%, 80%, 85%, 90% or most preferably at least 95% amino acid sequence identity to SEQ ID NO: 2 or 9, and wherein the C-terminal capping module has amino acid A at position 9; preferably, the C-terminal capping module has amino acid A at positions 9 and 13; more preferably, the C-terminal capping module has amino acid A at positions 9, 13 and 15; most preferably, the C-terminal capping module has amino acid A at positions 9, 13 and 15 and amino acid S at position 4.

进一步优选的是这样的C-端加帽模块，其不具有在14位处的氨基酸R，和/或不具有在15位处的氨基酸E。Further preferred is a C-terminal capping module which does not have the amino acid R at position 14 and/or does not have the amino acid E at position 15.

还优选的是这样的C-端加帽模块，其不具有与SEQ ID NO:2、3或4相同的氨基酸序列。Also preferred are C-terminal capping modules that do not have the same amino acid sequence as SEQ ID NO: 2, 3 or 4.

进一步优选的是这样的C-端加帽模块，其具有基于SEQ ID NO:9的氨基酸序列，其中所述C-端加帽模块具有在26、27和28位处的选自下述的氨基酸：A、L、R、M、K和N；更优选A、L、R和K；且最优选K、A和L。Further preferred is a C-terminal capping module having an amino acid sequence based on SEQ ID NO: 9, wherein the C-terminal capping module has amino acids at positions 26, 27 and 28 selected from the group consisting of: A, L, R, M, K and N; more preferably A, L, R and K; and most preferably K, A and L.

通过组合本领域技术人员已知的技术（诸如氨基酸序列比对、诱变和基因合成），可以用本发明的加帽模块替换重复结构域的加帽模块。例如，可以如下用SEQ ID NO:9的C-端加帽重复序列替换SEQ ID NO:17的C-端加帽重复序列：(i) 通过与SEQ ID NO:9进行序列比对，确定SEQ ID NO:17的C-端加帽重复序列(即序列位置99-126)，(ii) 用SEQ ID NO:9的序列替换确定的SEQ ID NO:17的C-端加帽重复序列的序列，(iii) 制备编码重复结构域的基因，所述重复结构域编码替换的C-端加帽模块，(iv) 在大肠杆菌的细胞质中表达修饰的重复结构域，和(v) 通过标准方式纯化修饰的重复结构域。By combining techniques known to those skilled in the art (such as amino acid sequence alignment, mutagenesis, and gene synthesis), the capping module of the repeat domain can be replaced with the capping module of the present invention. For example, the C-terminal capping repeat sequence of SEQ ID NO: 9 can be replaced with the C-terminal capping repeat sequence of SEQ ID NO: 17 as follows: (i) determining the C-terminal capping repeat sequence of SEQ ID NO: 17 (i.e., sequence positions 99-126) by sequence alignment with SEQ ID NO: 9, (ii) replacing the determined sequence of the C-terminal capping repeat sequence of SEQ ID NO: 17 with the sequence of SEQ ID NO: 9, (iii) preparing a gene encoding a repeat domain encoding the replaced C-terminal capping module, (iv) expressing the modified repeat domain in the cytoplasm of Escherichia coli, and (v) purifying the modified repeat domain by standard means.

此外，可以如下遗传地构建本发明的重复结构域：借助于基因合成，装配N-端加帽模块(即SEQ ID NO:1的N-端加帽重复序列)，继之以一个或多个重复模块(即包含SEQ IDNO:17的33-98位氨基酸残基的重复模块)和C-端加帽模块(即SEQ ID NO:9的C-端加帽重复序列)。然后可以如上所述在大肠杆菌中表达遗传地装配的重复结构域基因。In addition, the repeat domain of the present invention can be constructed genetically as follows: by gene synthesis, an N-terminal capping module (i.e., the N-terminal capping repeat sequence of SEQ ID NO: 1) is assembled, followed by one or more repeat modules (i.e., a repeat module comprising amino acid residues 33-98 of SEQ ID NO: 17) and a C-terminal capping module (i.e., the C-terminal capping repeat sequence of SEQ ID NO: 9). The genetically assembled repeat domain gene can then be expressed in E. coli as described above.

还优选的是这样的结合蛋白，其中所述锚蛋白重复结构域或设计的锚蛋白重复结构域包含具有SEQ ID NO:6、7或8的序列基序的C-端加帽模块，其中所述加帽模块具有在3位处的氨基酸I，且其中所述重复模块的前面是具有SEQ ID NO:12的锚蛋白重复序列基序的重复模块。Also preferred are binding proteins, wherein the ankyrin repeat domain or designed ankyrin repeat domain comprises a C-terminal capping module having a sequence motif of SEQ ID NO: 6, 7 or 8, wherein the capping module has amino acid 1 at position 3, and wherein the repeat module is preceded by a repeat module having ankyrin repeat sequence motif of SEQ ID NO: 12.

进一步优选的是，具有不含氨基酸C、M或N的氨基酸序列的结合蛋白、重复结构域、N-端加帽模块或C-端加帽模块。Further preferred are binding proteins, repeat domains, N-terminal capping modules or C-terminal capping modules having an amino acid sequence that does not contain amino acids C, M or N.

进一步优选的是，具有不含氨基酸N和随后的G的氨基酸序列的结合蛋白、重复结构域、N-端加帽模块或C-端加帽模块。Further preferred are binding proteins, repeat domains, N-terminal capping modules or C-terminal capping modules having an amino acid sequence that does not contain the amino acid N followed by G.

进一步优选的是，任意这样的包含C-端加帽重复序列的C-端加帽模块，其中所述加帽重复序列中的一个或多个氨基酸残基被特定氨基酸残基替换，所述特定氨基酸残基在比对相应加帽单元或重复单元时存在于对应位置处。Further preferred is any such C-terminal capping module comprising a C-terminal capping repeat sequence, wherein one or more amino acid residues in the capping repeat sequence are replaced by specific amino acid residues that are present at corresponding positions when aligning the corresponding capping units or repeat units.

进一步优选的是，包含任意这样的N-端或C-端加帽模块的结合蛋白。Further preferred are binding proteins comprising any such N-terminal or C-terminal capping modules.

这样的C-端加帽模块的氨基酸序列的实例是，在SEQ ID NO:19、21、27、28、38、40和42中的99-126位的氨基酸序列。实施例6证实，通过用本发明的加帽模块替换它们的C-端加帽模块，可以增加重复结构域的热稳定性。Examples of amino acid sequences of such C-terminal capping modules are the amino acid sequences at positions 99-126 in SEQ ID NOs: 19, 21, 27, 28, 38, 40, and 42. Example 6 demonstrates that the thermal stability of the repeat domains can be increased by replacing their C-terminal capping modules with the capping modules of the present invention.

术语“靶标”指个别分子诸如核酸分子、多肽或蛋白、碳水化合物，或者任何其它天然存在的分子，包括此类个别分子的任何部分，或者两个或更多个此类分子的复合物。靶标可以是整个细胞或组织样品，或者可以是任何非天然的分子或部分。优选靶标是天然存在或非天然的多肽或者含有化学修饰的多肽，例如通过天然或非天然的磷酸化、乙酰化或甲基化修饰。在本发明的特定应用中，靶标是xSA。The term "target" refers to an individual molecule such as a nucleic acid molecule, a polypeptide or protein, a carbohydrate, or any other naturally occurring molecule, including any portion of such an individual molecule, or a complex of two or more such molecules. The target can be a whole cell or tissue sample, or can be any non-natural molecule or portion. Preferably, the target is a naturally occurring or non-natural polypeptide or a polypeptide containing a chemical modification, such as modification by natural or non-natural phosphorylation, acetylation, or methylation. In a specific application of the present invention, the target is xSA.

术语“xSA”表示哺乳动物血清白蛋白，诸如得自小鼠、大鼠、兔、狗、猪、猴或人的血清白蛋白。术语“MSA”表示小鼠血清白蛋白(UniProtKB/Swiss-Prot基本登录号P07724)，术语“CSA”表示食蟹猴(即成束猴（macaca fascicularis）) 血清白蛋白(UniProtKB/Swiss-Prot基本登录号A2V9Z4)，术语“HSA”表示人血清白蛋白(UniProtKB/Swiss-Prot基本登录号P02768)。The term "xSA" refers to mammalian serum albumin, such as serum albumin from mouse, rat, rabbit, dog, pig, monkey, or human. The term "MSA" refers to mouse serum albumin (UniProtKB/Swiss-Prot Basic Accession No. P07724), the term "CSA" refers to cynomolgus monkey ( i.e. , macaca fascicularis ) serum albumin (UniProtKB/Swiss-Prot Basic Accession No. A2V9Z4), and the term "HSA" refers to human serum albumin (UniProtKB/Swiss-Prot Basic Accession No. P02768).

术语“共有序列”指氨基酸序列，其中所述共有序列通过多个重复单元的结构和/或序列比对获得。使用两个或更多个经结构和/或序列比对的重复单元并且在比对时允许空位，可以确定各位置最常见的氨基酸残基。共有序列是包含在各位置最常出现的氨基酸的序列。在其中两种或更多种氨基酸超过平均数地出现在单个位置的情况下，共有序列可包括那些氨基酸的子集。所述两个或更多个重复单元可取自包含在单种重复蛋白中或者来自两种或更多种不同的重复蛋白的重复单元。The term "consensus sequence" refers to an amino acid sequence, wherein the consensus sequence is obtained by structural and/or sequence alignment of a plurality of repeating units. Using two or more repeating units that have been structurally and/or sequence aligned and allowing for gaps in the alignment, the most common amino acid residues at each position can be determined. A consensus sequence is a sequence comprising the most commonly occurring amino acids at each position. In cases where two or more amino acids occur at a single position in excess of the average, the consensus sequence may comprise a subset of those amino acids. The two or more repeating units may be taken from repeating units contained in a single repeating protein or from two or more different repeating proteins.

本领域技术人员熟知共有序列及其确定方法。Those skilled in the art are familiar with consensus sequences and methods for determining them.

“共有氨基酸残基”是见于共有序列的特定位置的氨基酸。如果两种或更多种，如三、四或五种氨基酸残基以相似概率见于所述两个或更多个重复单元中，共有氨基酸可以是最常见的氨基酸之一或者所述两种或更多种氨基酸残基的组合。A "consensus amino acid residue" is an amino acid that is found at a particular position in a consensus sequence. If two or more, such as three, four or five, amino acid residues are found in two or more repeating units with similar probability, the consensus amino acid may be one of the most common amino acids or a combination of the two or more amino acid residues.

进一步优选非天然存在的加帽模块、重复模块、结合蛋白或结合域。Further preferred are non-naturally occurring capping modules, repeat modules, binding proteins or binding domains.

术语“非天然存在的”意指合成的或不来自天然，更具体来讲，术语意指用人手制造。术语“非天然存在的结合蛋白”或“非天然存在的结合域”意指所述结合蛋白或所述结合域是合成的(即通过化学合成用氨基酸制备)或重组的且不来自天然。“非天然存在的结合蛋白”或“非天然存在的结合域”分别是通过对应设计的核酸表达获得的人造蛋白或结构域。优选表达在真核或细菌细胞中进行，或者通过使用无细胞体外表达系统进行。而且，术语意指所述结合蛋白或所述结合域的序列不作为非人造序列条目存在于序列数据库，例如GenBank、EMBL-Bank或Swiss-Prot中。本领域技术人员熟知这些数据库及其它相似的序列数据库。The term "non-naturally occurring" means synthetic or not from nature, more specifically, the term means made by human hands. The term "non-naturally occurring binding protein" or "non-naturally occurring binding domain" means that the binding protein or the binding domain is synthetic (i.e., prepared by chemical synthesis with amino acids) or recombinant and not from nature. "Non-naturally occurring binding protein" or "non-naturally occurring binding domain" are artificial proteins or domains obtained by expressing nucleic acids of corresponding design, respectively. Preferably, expression is carried out in eukaryotic or bacterial cells, or by using a cell-free in vitro expression system. Moreover, the term means that the sequence of the binding protein or the binding domain does not exist as a non-artificial sequence entry in a sequence database, such as GenBank, EMBL-Bank or Swiss-Prot. Those skilled in the art are familiar with these databases and other similar sequence databases.

本发明涉及包含结合域的结合蛋白，其中所述结合域是锚蛋白重复结构域且特异性地结合xSA，其中所述结合蛋白和/或结合域在PBS中热解折叠以后具有高于40℃的中点变性温度(Tm)，并在最高10 g/L的浓度在PBS中在37℃温育1天时形成小于5% (w/w) 的不溶性聚集体。The present invention relates to a binding protein comprising a binding domain, wherein the binding domain is an ankyrin repeat domain and specifically binds to xSA, wherein the binding protein and/or binding domain has a midpoint denaturation temperature (Tm) above 40°C after thermal unfolding in PBS and forms less than 5% (w/w) insoluble aggregates when incubated in PBS at 37°C for 1 day at a concentration of up to 10 g/L.

术语“PBS”意指含有137 mM NaCl、10 mM磷酸盐和2.7 mM KCl且具有pH 7.4的磷酸盐缓冲水溶液。The term "PBS" means an aqueous phosphate-buffered saline solution containing 137 mM NaCl, 10 mM phosphate, and 2.7 mM KCl and having a pH of 7.4.

优选地，所述结合蛋白和/或结合域在pH 7.4的PBS中或在pH 5.8的MES缓冲液中热解折叠以后具有高于45℃，更优选高于50℃，更优选高于55℃，和最优选高于60℃的中点变性温度(Tm)。本发明的结合蛋白或结合域在生理条件下具有确定的二级和三级结构。这样的多肽的热解折叠会导致它的三级和二级结构的丧失，这可以通过例如圆二色性(CD)测量来跟踪。结合蛋白或结合域在热解折叠以后的中点变性温度与这样的温度相对应：通过将温度从10℃缓慢地升高至约100℃使所述蛋白或结构域热变性以后，在生理缓冲液中的协同转变的中点处的温度。在热解折叠以后的中点变性温度的确定，是本领域技术人员熟知的。结合蛋白或结合域在热解折叠以后的该中点变性温度会指示所述多肽的热稳定性。Preferably, the binding protein and/or binding domain has a midpoint denaturation temperature (Tm) of greater than 45°C, more preferably greater than 50°C, more preferably greater than 55°C, and most preferably greater than 60°C after thermal unfolding in PBS at pH 7.4 or in MES buffer at pH 5.8. The binding protein or binding domain of the present invention has a defined secondary and tertiary structure under physiological conditions. Thermal unfolding of such a polypeptide will result in the loss of its tertiary and secondary structure, which can be tracked by, for example, circular dichroism (CD) measurements. The midpoint denaturation temperature of the binding protein or binding domain after thermal unfolding corresponds to the temperature at the midpoint of the cooperative transition in a physiological buffer after thermal denaturation of the protein or domain by slowly increasing the temperature from 10°C to about 100°C. The determination of the midpoint denaturation temperature after thermal unfolding is well known to those skilled in the art. This midpoint denaturation temperature of the binding protein or binding domain after thermal unfolding will indicate the thermal stability of the polypeptide.

还优选的是这样的结合蛋白和/或结合域，其在最高20 g/L、优选地最高40 g/L、更优选最高60 g/L、甚至更优选最高80 g/L和最优选最高100 g/L的浓度，当在PBS中在37℃温育超过5天、优选地超过10天、更优选地超过20天、更优选地超过40天和最优选地超过100天时，形成小于5% (w/w)的不溶性聚集体。通过肉眼可见沉淀的出现、凝胶过滤或动态光散射（其在不溶性聚集体形成后强烈增加），可以检测不溶性聚集体的形成。通过在10’000 x g离心10分钟，可以从蛋白样品中除去不溶性聚集体。优选地，结合蛋白和/或结合域在PBS中在37℃的所述温育条件下形成小于2%、更优选地小于1%、0.5%、0.2%、0.1%或最优选地小于0.05% (w/w)的不溶性聚集体。可以如下确定不溶性聚集体的百分比：使不溶性聚集体与可溶性蛋白分离，随后通过标准定量方法确定在可溶性级分和不溶性级分中的蛋白量。Also preferred are binding proteins and/or binding domains that form less than 5% (w/w) insoluble aggregates when incubated in PBS at 37°C for more than 5 days, preferably more than 10 days, more preferably more than 20 days, more preferably more than 40 days, and most preferably more than 100 days at a concentration of up to 20 g/L, preferably up to 40 g/L, more preferably up to 60 g/L, even more preferably up to 80 g/L, and most preferably up to 100 g/L. The formation of insoluble aggregates can be detected by the appearance of a visible precipitate, gel filtration, or dynamic light scattering, which increases strongly after the formation of insoluble aggregates. Insoluble aggregates can be removed from protein samples by centrifugation at 10'000 x g for 10 minutes. Preferably, the binding protein and/or binding domain forms less than 2%, more preferably less than 1%, 0.5%, 0.2%, 0.1%, or most preferably less than 0.05% (w/w) insoluble aggregates in PBS under the described incubation conditions at 37°C. The percentage of insoluble aggregates can be determined by separating the insoluble aggregates from the soluble protein and then determining the amount of protein in the soluble and insoluble fractions by standard quantitative methods.

还优选的是，在含有100 mM二硫苏糖醇(DTT)的PBS中在37℃温育1或10小时后不丧失其自然三维结构的结合蛋白和/或结合域。Also preferred are binding proteins and/or binding domains that do not lose their native three-dimensional structure upon incubation in PBS containing 100 mM dithiothreitol (DTT) at 37°C for 1 or 10 hours.

在一个特定实施方案中，本发明涉及包含结合域的结合蛋白，所述结合域是锚蛋白重复结构域，其特异性地结合xSA，并具有如上文限定的所示或优选的中点变性温度和非聚集性质，其中所述结合蛋白在哺乳动物中的终末血浆半衰期比不结合血清蛋白（诸如xSA）的结合域高至少5倍。In a specific embodiment, the present invention relates to a binding protein comprising a binding domain which is an ankyrin repeat domain which specifically binds to xSA and has the indicated or preferred midpoint denaturation temperature and non-aggregation properties as defined above, wherein the terminal plasma half-life of the binding protein in a mammal is at least 5-fold higher than a binding domain which does not bind to a serum protein such as xSA.

优选地，所述结合域在哺乳动物中的终末血浆半衰期比不结合血清蛋白（诸如xSA）的结合域高至少10倍、更优选至少20倍、40倍、100倍、300倍或最优选至少10³倍。Preferably, the terminal plasma half-life of the binding domain in a mammal is at least 10-fold, more preferably at least 20-fold, 40-fold, 100-fold, 300-fold or most preferably at least 10 ³ -fold greater than a binding domain that does not bind to a serum protein such as xSA.

也优选地，所述结合域不结合xSA，这表现为与xSA结合的Kd高于10^-4M、更优选高于10^-3M或最优选高于10^-2M。不结合xSA的结合域的一个实例是SEQ ID NO:32的重复结构域。Also preferably, the binding domain does not bind xSA, as evidenced by binding to xSA with a Kd higher than 10 ⁻⁴ M, more preferably higher than 10 ⁻³ M or most preferably higher than 10 ⁻² M. An example of a binding domain that does not bind xSA is the repeat domain of SEQ ID NO: 32.

进一步优选地，所述结合域是重复结构域，且与SEQ ID NO:32的重复结构域相比，或者与DARPin #32、DARPin #41或DARPin #42相比，其在哺乳动物中的终末血浆半衰期高(即长)至少5倍、更优选至少10倍、20倍、40倍、100倍、300倍或最优选至少10³倍。Further preferably, the binding domain is a repeat domain and has a terminal plasma half-life in a mammal that is at least 5-fold higher (i.e., longer) than the repeat domain of SEQ ID NO: 32, or compared to DARPin #32, DARPin #41, or DARPin #42, more preferably at least 10-fold, 20-fold, 40-fold, 100-fold, 300-fold, or most preferably at least 10 ³ -fold higher.

一种优选的结合蛋白包含对HSA具有结合特异性的结合域，其在人类中具有超过1天、更优选超过3天、5天、7天、10天、15天或最优选超过20天的终末血浆半衰期。A preferred binding protein comprises a binding domain with binding specificity for HSA that has a terminal plasma half-life in humans of greater than 1 day, more preferably greater than 3 days, 5 days, 7 days, 10 days, 15 days, or most preferably greater than 20 days.

通过本领域技术人员熟知的测定(Toutain, P.L., 和Bousquet-Mélou, A., J.Vet. Pharmacol. Ther. 27(5), 427-439, 2004)，可以确定结合域的终末血浆半衰期。在实施例中给出了确定终末血浆半衰期的实例。The terminal plasma half-life of the binding domain can be determined by assays well known to those skilled in the art (Toutain, PL, and Bousquet-Mélou, A., J. Vet. Pharmacol. Ther. 27(5), 427-439, 2004). Examples of determining the terminal plasma half-life are given in the Examples.

术语药物（诸如本发明的结合蛋白或结合域）的“终末血浆半衰期”表示，施用给哺乳动物的药物在达到假平衡以后，达到血浆浓度的半数所需的时间。该半衰期没有被定义为消除施用给哺乳动物的药物的半数剂量所需的时间。The term "terminal plasma half-life" of a drug (such as a binding protein or binding domain of the invention) refers to the time required for the drug administered to a mammal to reach half its plasma concentration after reaching pseudo-equilibrium. This half-life is not defined as the time required to eliminate half of the dose of the drug administered to the mammal.

在一个具体实施方案中，本发明涉及包含结合域的结合蛋白，所述结合域是锚蛋白重复结构域，特异性地结合xSA，且所述结合蛋白包含生物活性化合物。In a specific embodiment, the present invention relates to a binding protein comprising a binding domain that is an ankyrin repeat domain that specifically binds to xSA, and the binding protein comprises a biologically active compound.

术语“生物活性化合物”表示，当施用给具有疾病的哺乳动物时，会改变所述疾病的化合物。生物活性化合物可以具有拮抗或激动性质，且可以是蛋白性生物活性化合物或非蛋白性生物活性化合物。The term "biologically active compound" refers to a compound that, when administered to a mammal having a disease, alters the disease. A biologically active compound may have antagonistic or agonistic properties and may be a proteinaceous or non-proteinaceous biologically active compound.

通过用标准DNA克隆技术制备基因融合多肽，可将这样的蛋白性生物活性化合物共价连接至例如本发明的结合域，接着进行标准表达和纯化。例如，DARPin #36包含对人生长因子(即生物活性化合物)具有结合特异性的重复结构域和随后的对HSA具有结合特异性的重复结构域。Such proteinaceous biologically active compounds can be covalently linked to, for example, a binding domain of the present invention by preparing a gene fusion polypeptide using standard DNA cloning techniques, followed by standard expression and purification. For example, DARPin #36 comprises a repeat domain with binding specificity for human growth factor (i.e., the biologically active compound) followed by a repeat domain with binding specificity for HSA.

通过化学方法，例如，经由马来酰亚胺接头与半胱氨酸巯基偶联，其中半胱氨酸经由肽接头与本文所述的结合域的N-或C-端偶联，可以将这样的非蛋白性生物活性化合物共价连接至例如本发明的结合域。Such non-proteinaceous biologically active compounds can be covalently linked to, for example, a binding domain of the invention by chemical means, for example, coupling via a maleimide linker to a cysteine thiol group, wherein the cysteine is coupled to the N- or C-terminus of a binding domain described herein via a peptide linker.

蛋白性生物活性化合物的实例是，具有独特靶标特异性的结合域(例如通过结合生长因子来中和它)、细胞因子(例如白介素)、生长因子(例如人生长激素)、抗体及其片段、激素(例如GLP-1)和任何可能的蛋白性药物。Examples of proteinaceous bioactive compounds are binding domains with unique target specificity (e.g., by binding to a growth factor and thereby neutralizing it), cytokines (e.g., interleukins), growth factors (e.g., human growth hormone), antibodies and fragments thereof, hormones (e.g., GLP-1), and any possible proteinaceous drug.

非蛋白性生物活性化合物的实例是，毒素(例如得自免疫原（ImmunoGen）的DM1)、靶向GPCR的小分子、抗生素和任何可能的非蛋白性药物。Examples of non-proteinaceous bioactive compounds are toxins (eg DM1 from ImmunoGen), small molecules targeting GPCRs, antibiotics and any possible non-proteinaceous drugs.

在一个具体实施方案中，本发明涉及包含锚蛋白重复结构域的结合蛋白，所述锚蛋白重复结构域特异性地结合xSA，且所述结合蛋白另外包含生物活性化合物，其中所述结合蛋白在哺乳动物中的终末半衰期比所述未修饰的生物活性化合物的终末半衰期高至少2倍，其中所述更高的终末半衰期由所述重复结构域赋予给所述结合蛋白。In a specific embodiment, the present invention is directed to a binding protein comprising an ankyrin repeat domain that specifically binds xSA, and said binding protein further comprises a biologically active compound, wherein the terminal half-life of said binding protein in a mammal is at least 2-fold higher than the terminal half-life of said unmodified biologically active compound, wherein said higher terminal half-life is conferred to said binding protein by said repeat domain.

优选地，所述结合蛋白在哺乳动物中的终末血浆半衰期比所述未修饰的生物活性化合物高至少5倍、更优选至少10倍、20倍、40倍、100倍、300倍或最优选至少10³倍。Preferably, the terminal plasma half-life of the binding protein in a mammal is at least 5-fold, more preferably at least 10-fold, 20-fold, 40-fold, 100-fold, 300-fold or most preferably at least 10 ³ -fold greater than that of the unmodified biologically active compound.

另一个优选的实施方案是这样的重组结合蛋白，其包含特异性地结合xSA的结合域，且其中所述结合域是锚蛋白重复结构域或设计的锚蛋白重复结构域。这类锚蛋白重复结构域可以包含1个、2个、3个或更多个内部重复模块，所述重复模块将参与结合xSA。优选地，这类锚蛋白重复结构域包含N-端加帽模块、2-4个内部重复模块和C-端加帽模块。优选地，所述结合域是锚蛋白重复结构域或设计的锚蛋白重复结构域。也优选地，所述加帽模块是加帽重复序列。Another preferred embodiment is a recombinant binding protein comprising a binding domain that specifically binds to xSA, and wherein the binding domain is an ankyrin repeat domain or a designed ankyrin repeat domain. Such ankyrin repeat domains may comprise 1, 2, 3 or more internal repeat modules that will participate in binding to xSA. Preferably, such ankyrin repeat domains comprise an N-terminal capping module, 2-4 internal repeat modules and a C-terminal capping module. Preferably, the binding domain is an ankyrin repeat domain or a designed ankyrin repeat domain. Also preferably, the capping module is a capping repeat sequence.

具体地，本发明涉及如上文定义的结合蛋白，其中所述锚蛋白重复结构域与选自下述的锚蛋白重复结构域竞争结合哺乳动物血清白蛋白：SEQ ID NO: 17-31和43-48；优选SEQ ID NO: 17-31；更优选SEQ ID NO:19、21、27和28，尤其是SEQ ID NO:19和27。In particular, the present invention relates to a binding protein as defined above, wherein the ankyrin repeat domain competes for binding to mammalian serum albumin with ankyrin repeat domain selected from the group consisting of SEQ ID NOs: 17-31 and 43-48; preferably SEQ ID NOs: 17-31; more preferably SEQ ID NOs: 19, 21, 27 and 28, in particular SEQ ID NOs: 19 and 27.

最优选的是这样的结合蛋白，其中所述锚蛋白重复结构域选自：SEQ ID NO: 17-31和43-48，其中所述锚蛋白重复结构域的在1位处的G和/或在2位处的S任选地缺失。Most preferred is a binding protein wherein the ankyrin repeat domain is selected from the group consisting of: SEQ ID NOs: 17-31 and 43-48, wherein the G at position 1 and/or the S at position 2 of the ankyrin repeat domain is optionally deleted.

也优选地，所述重复结构域与选自DARPin #17-31和43-48的结合蛋白竞争结合xSA。优选地，所述重复结构域与选自DARPin #19、21、27、28、45、46、47和48的结合蛋白竞争结合xSA。更优选地，所述重复结构域与结合蛋白DARPin #19、45、46、48或27竞争结合xSA；甚至更优选地，所述重复结构域与结合蛋白DARPin #46或27竞争结合xSA。Also preferably, the repeat domain competes for xSA binding with a binding protein selected from DARPins #17-31 and 43-48. Preferably, the repeat domain competes for xSA binding with a binding protein selected from DARPins #19, 21, 27, 28, 45, 46, 47, and 48. More preferably, the repeat domain competes for xSA binding with binding proteins DARPin #19, 45, 46, 48, or 27; even more preferably, the repeat domain competes for xSA binding with binding proteins DARPin #46 or 27.

术语“竞争结合”意指本发明的两种不同结合域不能同时与相同靶标结合，但两者各自能够与相同靶标结合。因此，这样的两种结合域竞争结合所述靶标。优选地，所述两种竞争性结合域会结合在所述靶标上的重叠或相同结合表位。本领域技术人员熟知确定两种结合域是否竞争结合靶标的方法，诸如竞争性酶联免疫吸附测定(ELISA)或竞争性SPR测量(如通过使用BioRad的Proteon仪器)。The term "competitive binding" means that two different binding domains of the present invention cannot bind to the same target at the same time, but each of them can bind to the same target. Therefore, such two binding domains compete for binding to the target. Preferably, the two competitive binding domains bind to overlapping or identical binding epitopes on the target. Methods for determining whether two binding domains compete for binding to a target are well known to those skilled in the art, such as competitive enzyme-linked immunosorbent assays (ELISAs) or competitive SPR measurements (e.g., by using a BioRad Proteon instrument).

另一个优选的实施方案是这样的结合蛋白，其包含选自下述的对xSA具有结合特异性的重复结构域：SEQ ID NO:17-31的重复结构域。优选地，所述重复结构域是SEQ IDNO:19、21、27或28的重复结构域。更优选地，所述重复结构域是SEQ ID NO:19的重复结构域。还更优选地，所述重复结构域是SEQ ID NO:21的重复结构域。还更优选地，所述重复结构域是SEQ ID NO:27的重复结构域。还更优选地，所述重复结构域是SEQ ID NO:28的重复结构域。Another preferred embodiment is a binding protein comprising a repeat domain with binding specificity for xSA selected from the group consisting of the repeat domains of SEQ ID NOs: 17-31. Preferably, the repeat domain is a repeat domain of SEQ ID NOs: 19, 21, 27, or 28. More preferably, the repeat domain is a repeat domain of SEQ ID NO: 19. Even more preferably, the repeat domain is a repeat domain of SEQ ID NO: 21. Even more preferably, the repeat domain is a repeat domain of SEQ ID NO: 27. Even more preferably, the repeat domain is a repeat domain of SEQ ID NO: 28.

进一步优选的是这样的结合蛋白，其中所述对xSA具有结合特异性的重复结构域包含与所述重复结构域集合的重复结构域具有至少70% 氨基酸序列同一性的氨基酸序列。优选地，所述氨基酸序列同一性是至少75%、更优选至少80%、更优选至少85%、更优选至少90%和最优选至少95%。Further preferred are binding proteins wherein the repeat domain with binding specificity for xSA comprises an amino acid sequence that has at least 70% amino acid sequence identity with a repeat domain of the set of repeat domains. Preferably, the amino acid sequence identity is at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, and most preferably at least 95%.

进一步优选的是这样的结合蛋白，其中所述对xSA具有结合特异性的重复结构域包含这样的重复模块，其与所述重复结构域集合的重复结构域的重复模块具有至少70%氨基酸序列同一性。优选地，所述氨基酸序列同一性是至少75%、更优选至少80%、更优选至少85%、更优选至少90%和最优选至少95%。Further preferred are binding proteins wherein the repeat domain with binding specificity for xSA comprises a repeat module that has at least 70% amino acid sequence identity with a repeat module of a repeat domain of the set of repeat domains. Preferably, the amino acid sequence identity is at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, and most preferably at least 95%.

进一步优选的是这样的结合蛋白，其中所述结合蛋白包含2个或更多个所述对xSA具有结合特异性的重复结构域。优选地，所述结合蛋白包含2个或3个所述重复结构域。所述2个或更多个重复结构域具有相同的或不同的氨基酸序列。Further preferred is a binding protein comprising two or more repeat domains having binding specificity for xSA. Preferably, the binding protein comprises two or three repeat domains. The two or more repeat domains have the same or different amino acid sequences.

在根据本发明的包含重复结构域的结合蛋白的另一个优选的实施方案中，所述重复结构域的重复模块的一个或多个氨基酸残基被比对重复单元时在对应位置上发现的氨基酸残基替换。优选地，最多30%的氨基酸残基被替换，更优选地，最多20%，和甚至更优选地，最多10%的氨基酸残基被替换。最优选地，这类重复单元是天然存在的重复单元。In another preferred embodiment of the binding protein comprising a repeat domain according to the present invention, one or more amino acid residues of the repeat module of the repeat domain are replaced by amino acid residues found at the corresponding position when aligning the repeat units. Preferably, at most 30% of the amino acid residues are replaced, more preferably at most 20%, and even more preferably at most 10% of the amino acid residues are replaced. Most preferably, such repeat units are naturally occurring repeat units.

在根据本发明的包含重复结构域的结合蛋白的另一个优选的实施方案中，所述重复结构域的N-端加帽模块的一个或多个氨基酸残基被比对N-端加帽单元时在对应位置上发现的氨基酸残基替换。优选地，最多30%的氨基酸残基被替换，更优选地，最多20%，和甚至更优选地，最多10%的氨基酸残基被替换。最优选地，这类N-端加帽单元是天然存在的N-端加帽单元。In another preferred embodiment of the binding protein comprising a repeat domain according to the present invention, one or more amino acid residues of the N-terminal capping module of the repeat domain are replaced by amino acid residues found at corresponding positions when aligning the N-terminal capping unit. Preferably, at most 30% of the amino acid residues are replaced, more preferably at most 20%, and even more preferably at most 10% of the amino acid residues are replaced. Most preferably, such N-terminal capping unit is a naturally occurring N-terminal capping unit.

在根据本发明的包含重复结构域的结合蛋白的另一个优选的实施方案中，所述重复结构域的C-端加帽模块的一个或多个氨基酸残基被比对C-端加帽单元时在对应位置上发现的氨基酸残基替换。优选地，最多30%的氨基酸残基被替换，更优选地，最多20%，和甚至更优选地，最多10%的氨基酸残基被替换。最优选地，这类C-端加帽单元是天然存在的C-端加帽单元。In another preferred embodiment of the binding protein comprising a repeat domain according to the present invention, one or more amino acid residues of the C-terminal capping module of the repeat domain are replaced by amino acid residues found at corresponding positions when aligning the C-terminal capping unit. Preferably, at most 30% of the amino acid residues are replaced, more preferably at most 20%, and even more preferably at most 10% of the amino acid residues are replaced. Most preferably, such C-terminal capping unit is a naturally occurring C-terminal capping unit.

在另一个具体实施方案中，最多30%的氨基酸残基，更优选地，最多20%，和甚至更优选地，最多10%的氨基酸残基被不是在重复单元、N-端加帽单元或C-端加帽单元的对应位置发现的氨基酸替换。In another specific embodiment, up to 30% of the amino acid residues, more preferably, up to 20%, and even more preferably, up to 10% of the amino acid residues are replaced with an amino acid not found at the corresponding position in the repeat unit, the N-terminal capping unit or the C-terminal capping unit.

在其它实施方案中，本文描述的任一种xSA结合蛋白或结构域可以与一个或多个额外部分共价结合，所述额外部分包括例如结合不同靶标以产生双特异性结合剂的部分、生物活性化合物、标记部分(例如荧光标记诸如荧光素，或放射性示踪剂)、促进蛋白纯化的部分(例如小肽标签，诸如His-或strep-标签)、提供改善的治疗效果的效应物功能的部分(例如提供抗体依赖性细胞介导的细胞毒性的抗体Fc部分，毒性蛋白部分诸如铜绿假单胞 菌(Pseudomonas aeruginosa)外毒素A (ETA)或小分子毒性剂诸如美登木素生物碱或DNA烷化剂)或提供改善的药代动力学的部分。可以根据感知的治疗需要评估改善的药代动力学。经常希望增加生物利用度和/或增加给药之间的时间，其可能通过增加给药后血清中的蛋白维持可用的时间实现。在一些情况下，希望改善蛋白血清浓度随时间的连续性(例如，减少在施用后不久的浓度和下次施用前不久的浓度之间的蛋白血清浓度差异)。In other embodiments, any of the xSA binding proteins or domains described herein can be covalently bound to one or more additional moieties, including, for example, moieties that bind to different targets to create bispecific binding agents, biologically active compounds, labeling moieties (e.g., fluorescent labels such as fluorescein, or radioactive tracers), moieties that facilitate protein purification (e.g., small peptide tags such as His- or strep-tags), moieties that provide effector functions that provide improved therapeutic efficacy (e.g., antibody Fc portions that provide antibody-dependent cell-mediated cytotoxicity, toxic protein moieties such as Pseudomonas aeruginosa exotoxin A (ETA) or small molecule toxic agents such as maytansinoids or DNA alkylating agents), or moieties that provide improved pharmacokinetics. Improved pharmacokinetics can be assessed based on perceived therapeutic need. It is often desirable to increase bioavailability and/or increase the time between doses, which may be achieved by increasing the time that a protein remains available in the serum following administration. In some cases, it is desirable to improve the continuity of protein serum concentration over time (e.g., to reduce the difference in protein serum concentration between the concentration shortly after administration and the concentration shortly before the next administration).

在另一个实施方案中，本发明涉及编码特定结合蛋白、特定N-端加帽模块或特定C-端加帽模块的核酸分子。此外，还涉及包含所述核酸分子的载体。In another embodiment, the present invention relates to a nucleic acid molecule encoding a specific binding protein, a specific N-terminal capping module or a specific C-terminal capping module. In addition, it also relates to a vector comprising the nucleic acid molecule.

此外，涉及药物组合物，其包含一种或更多种上述结合蛋白(尤其是包含重复结构域的结合蛋白)、或编码特定结合蛋白的核酸分子、以及任选的药学上可接受的载体和/或稀释剂。药学上可接受的载体和/或稀释剂是本领域技术人员已知的，将在下文更详细的解释。更进一步，涉及包括一种或更多种上述结合蛋白(尤其是包含重复结构域的结合蛋白)的诊断组合物。In addition, the present invention relates to a pharmaceutical composition comprising one or more of the above-mentioned binding proteins (especially binding proteins comprising repeat domains), or nucleic acid molecules encoding specific binding proteins, and optionally a pharmaceutically acceptable carrier and/or diluent. Pharmaceutically acceptable carriers and/or diluents are known to those skilled in the art and will be explained in more detail below. Further, the present invention relates to a diagnostic composition comprising one or more of the above-mentioned binding proteins (especially binding proteins comprising repeat domains).

药物组合物包含如上所述的结合蛋白和药学上可接受的载体、赋形剂或稳定剂例如如在Remington's Pharmaceutical Sciences第16版, Osol, A. 编[1980]中所述。技术人员已知的合适载体、赋形剂或稳定剂是盐水、林格溶液、葡萄糖（dextrose）溶液、汉克氏（Hank's）溶液、非挥发油、油酸乙酯、5%葡萄糖盐水、提高等渗性和化学稳定性的物质、缓冲剂和防腐剂。其它合适载体包括本身不引起产生对接受组合物的个体有害的抗体的任何载体，诸如蛋白、多糖、聚乳酸、聚乙醇酸、聚合氨基酸和氨基酸共聚物。药物组合物还可以是组合制剂，其包含另外的活性剂，诸如抗癌剂或抗血管生成剂。The pharmaceutical composition comprises a binding protein as described above and a pharmaceutically acceptable carrier, excipient or stabilizer, for example as described in Remington's Pharmaceutical Sciences 16th edition, Osol, A. ed. [1980]. Suitable carriers, excipients or stabilizers known to those skilled in the art are saline, Ringer's solution, dextrose solution, Hank's solution, fixed oils, ethyl oleate, 5% dextrose saline, substances that improve isotonicity and chemical stability, buffers and preservatives. Other suitable carriers include any carrier that does not itself cause the production of antibodies that are harmful to the individual receiving the composition, such as proteins, polysaccharides, polylactic acid, polyglycolic acid, polymeric amino acids and amino acid copolymers. The pharmaceutical composition can also be a combination formulation that includes another active agent, such as an anticancer agent or an anti-angiogenic agent.

将用于体内给药的制剂必须消毒或无菌。这通过过滤用无菌过滤膜容易完成。Preparations to be used for in vivo administration must be sterile or aseptic. This is readily accomplished by filtration through sterile filtration membranes.

药物组合物可通过技术人员知识范围内的任何合适方法给药。优选的给药途径是胃肠外。在胃肠外给药中，将本发明的药物配制成与如上文限定的与药学上可接受的赋形剂组合的单位剂量可注射形式，诸如溶液剂、混悬剂或乳剂。给药的剂量和方式将取决于将治疗的个体和具体疾病。一般而言，施用药物组合物，使得以1 μg/kg至20 mg/kg、更优选10μg/kg至5 mg/kg、最优选0.1至2 mg/kg的剂量施用本发明的结合蛋白。优选其以推注剂量(bolus dose)给药。也可使用连续输注，包括经渗透性微型泵连续皮下递送。如果这样的话，可将药物组合物以5至20 μg/kg/分钟、更优选7至15 μg/kg/分钟的剂量输注。The pharmaceutical composition can be administered by any suitable method within the knowledge of the skilled person. The preferred route of administration is parenteral. In parenteral administration, the drug of the present invention is formulated into a unit dose injectable form, such as a solution, suspension or emulsion, in combination with a pharmaceutically acceptable excipient as defined above. The dosage and mode of administration will depend on the individual and specific disease to be treated. In general, the pharmaceutical composition is administered so that the binding protein of the present invention is administered at a dose of 1 μg/kg to 20 mg/kg, more preferably 10 μg/kg to 5 mg/kg, and most preferably 0.1 to 2 mg/kg. It is preferably administered in a bolus dose. Continuous infusion can also be used, including continuous subcutaneous delivery via an osmotic minipump. If so, the pharmaceutical composition can be infused at a dose of 5 to 20 μg/kg/minute, more preferably 7 to 15 μg/kg/minute.

此外，考虑将任一种上述药物组合物用于病症的治疗。Furthermore, it is contemplated that any of the above-described pharmaceutical compositions may be used in the treatment of a disorder.

本发明另外提供了治疗方法。所述方法包括：给有此需要的患者施用治疗有效量的本发明的结合蛋白。The present invention further provides a method of treatment comprising administering a therapeutically effective amount of a binding protein of the present invention to a patient in need thereof.

此外，考虑到治疗哺乳动物（包括人类）的病理学状况的方法，所述方法包括：给有此需要的患者施用有效量的上述药物组合物。Furthermore, methods of treating a pathological condition in a mammal, including a human, are contemplated, comprising administering to a patient in need thereof an effective amount of the pharmaceutical composition described above.

根据本发明的结合蛋白可通过几种方法获得和/或进一步进化，诸如在噬菌体(WO1990/002809、WO 2007/006665)或细菌细胞(WO 1993/010214)表面上展示、核糖体展示(WO1998/048008)、质粒展示(WO 1993/008278)、或者通过使用共价的RNA-重复蛋白杂合构建体(WO 2000/032823)，或者例如通过蛋白互补测定(WO 1998/341120)进行的细胞内表达和选择/筛选。本领域技术人员已知此类方法。Binding proteins according to the present invention can be obtained and/or further evolved by several methods, such as display on the surface of phage (WO 1990/002809, WO 2007/006665) or bacterial cells (WO 1993/010214), ribosome display (WO 1998/048008), plasmid display (WO 1993/008278), or by using covalent RNA-repeat protein hybrid constructs (WO 2000/032823), or for example by intracellular expression and selection/screening by protein complementation assays (WO 1998/341120). Such methods are known to those skilled in the art.

用于选择/筛选根据本发明的结合蛋白的锚蛋白重复蛋白文库，可根据本领域技术人员已知的方案获得(WO 2002/020565, Binz, H.K., 等人, J. Mol. Biol., 332,489-503, 2003,和Binz等人, 2004, 在上述引文中)。在实施例1给出了此类文库用于选择xSA特异性的DARPin的用途。类似地，如上文提出的锚蛋白重复序列基序可用于建立可用于选择或筛选xSA特异性的DARPin的锚蛋白重复蛋白文库。此外，使用标准重组DNA技术(例如WO 2002/020565, Binz等人, 2003, 在上述引文中和Binz等人, 2004, 在上述引文中)，可以从本发明的重复模块和适当的加帽模块或加帽重复序列模块地（modularly）装配本发明的重复结构域(Forrer, P., 等人, FEBS letters 539, 2-6, 2003) 。Ankyrin repeat protein libraries for selecting/screening binding proteins according to the present invention can be obtained according to protocols known to those skilled in the art (WO 2002/020565, Binz, HK, et al., J. Mol. Biol., 332 , 489-503, 2003, and Binz et al., 2004, loc. cit.). The use of such libraries for selecting xSA-specific DARPins is described in Example 1. Similarly, the ankyrin repeat sequence motifs described above can be used to create ankyrin repeat protein libraries that can be used to select or screen xSA-specific DARPins. Furthermore, the repeat domains of the invention can be assembled modularly from the repeat modules of the invention and appropriate capping modules or capping repeat sequences using standard recombinant DNA techniques (e.g., WO 2002/020565, Binz et al., 2003, loc. cit. and Binz et al., 2004, loc. cit.) (Forrer, P., et al., FEBS letters 539 , 2-6, 2003).

本发明不限于在实施例描述的特定实施方案。按照下文描述的概要可使用和处理其它来源。The present invention is not limited to the specific embodiments described in the examples. Other sources can be used and processed according to the outline described below.

实施例Example

下文公开的所有起始原料和试剂已为本领域技术人员所知，且可商购得到或者可用众所周知的技术制备。All starting materials and reagents disclosed below are known to those skilled in the art and are either commercially available or can be prepared using well-known techniques.

材料Material

化学品购自Fluka (瑞士)。寡核苷酸来自Microsynth (瑞士)。除非另外说明，DNA聚合酶、限制酶和缓冲液来自New England Biolabs (美国)或Fermentas (立陶宛)。克隆和蛋白制备株是大肠杆菌XL1-blue (Stratagene, 美国)或BL21 (Novagen, 美国)。购买得自不同物种的纯化的血清白蛋白和血清(例如从Sigma-Aldrich, 瑞士或InnovativeResearch, 美国)。使用标准的生物素化试剂和方法(Pierce, 美国)，将生物素部分与纯化的血清白蛋白的伯胺偶联，以化学方式获得不同物种的生物素化的血清白蛋白。Chemicals were purchased from Fluka (Switzerland). Oligonucleotides were from Microsynth (Switzerland). Unless otherwise stated, DNA polymerases, restriction enzymes, and buffers were from New England Biolabs (USA) or Fermentas (Lithuania). Cloning and protein preparation strains were Escherichia coli XL1-blue (Stratagene, USA) or BL21 (Novagen, USA). Purified serum albumin and serum from various species were purchased (e.g., from Sigma-Aldrich, Switzerland or Innovative Research, USA). Biotinylated serum albumin from various species was chemically obtained by coupling the biotin moiety to primary amines of purified serum albumin using standard biotinylation reagents and methods (Pierce, USA).

分子生物学Molecular Biology

除非另外说明，根据所述方案(Sambrook J., Fritsch E.F. 和Maniatis T.,Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 1989,New York)，执行方法。Unless otherwise stated, methods were performed according to the described protocols (Sambrook J., Fritsch E.F. and Maniatis T., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 1989, New York).

设计的锚蛋白重复蛋白文库Designed ankyrin repeat protein library

描述了N2C和N3C设计的锚蛋白重复蛋白文库(WO 2002/020565; Binz等人.2003, 在上述引文中; Binz等人. 2004, 在上述引文中)。在N2C和N3C中的数字描述了存在于N-端和C-端加帽模块之间的随机化重复模块的数目。用于定义重复单元和模块内的位置的命名法是基于：Binz等人. 2004, 在上述引文中，修改是，将锚蛋白重复模块和锚蛋白重复单元的边缘移动一个氨基酸位置。例如，Binz等人. 2004 (在上述引文中) 的锚蛋白重复模块的1位对应于本公开内容的锚蛋白重复模块的2位，因此，Binz等人. 2004（在上述引文中）的锚蛋白重复模块的33位对应于本公开内容的下一锚蛋白重复模块的1位。Ankyrin repeat protein libraries based on N2C and N3C designs are described (WO 2002/020565; Binz et al. 2003, loc. cit.; Binz et al. 2004, loc. cit.). The numbers in N2C and N3C describe the number of randomized repeat modules present between the N-terminal and C-terminal capping modules. The nomenclature used to define repeat units and positions within the modules is based on that of Binz et al. 2004, loc. cit., with the modification that the ankyrin repeat modules and the edges of the ankyrin repeat units are shifted by one amino acid position. For example, position 1 of the ankyrin repeat module of Binz et al. 2004 (loc. cit.) corresponds to position 2 of the ankyrin repeat module of the present disclosure. Thus, position 33 of the ankyrin repeat module of Binz et al. 2004 (loc. cit.) corresponds to position 1 of the next ankyrin repeat module of the present disclosure.

所有DNA序列都通过测序确认，所有所述蛋白的计算分子量都通过质谱确认。All DNA sequences were confirmed by sequencing, and the calculated molecular weights of all proteins were confirmed by mass spectrometry.

实施例1: 包含对xSA具有结合特异性的重复结构域的结合蛋白的选择 Example 1: Selection of Binding Proteins Comprising Repeated Domains with Binding Specificity for xSA

使用核糖体展示(Hanes, J.和Plückthun, A., PNAS 94, 4937-42, 1997)，从由Binz等人. 2004 (在上述引文中)描述的N2C或N3C DARPin文库选择出许多对xSA具有结合特异性的设计的锚蛋白重复蛋白(DARPin)。通过粗提取ELISA，评估了所选克隆对特异性的(xSA；即MSA、HSA或CSA)和非特异性的(MBP，大肠杆菌麦芽糖结合蛋白)靶标的结合，提示成功选择xSA结合蛋白。SEQ ID NO:17-31的重复结构域构成所选结合蛋白的氨基酸序列，所述结合蛋白包含对xSA具有结合特异性的重复结构域。所选结合剂（binder）的序列分析揭示了某些所选结合剂家族固有的特异性锚蛋白重复序列基序。在SEQ ID NO:11-14中提供了存在于对xSA具有结合特异性的重复结构域中的这类锚蛋白重复序列基序。Using ribosome display (Hanes, J. and Plückthun, A., PNAS 94 , 4937-42, 1997), a number of designed ankyrin repeat proteins (DARPins) with binding specificity for xSA were selected from the N2C or N3C DARPin library described by Binz et al. 2004 (loc. cit.). Selected clones were evaluated for binding to specific (xSA; i.e., MSA, HSA, or CSA) and nonspecific (MBP, E. coli maltose binding protein) targets by crude extract ELISA, indicating successful selection of xSA binding proteins. The repeat domains of SEQ ID NOs: 17-31 constitute the amino acid sequences of the selected binding proteins, which contain repeat domains with binding specificity for xSA. Sequence analysis of the selected binders revealed specific ankyrin repeat sequence motifs inherent to some of the selected binder families. Such ankyrin repeat motifs present in the repeat domains having binding specificity for xSA are provided in SEQ ID NOs: 11-14.

通过核糖体展示选择血清白蛋白特异性的锚蛋白重复蛋白Selection of serum albumin-specific ankyrin repeat proteins by ribosome display

用HSA、CSA或MSA作为靶蛋白，使用如描述的设计的锚蛋白重复蛋白文库(WO2002/020565, Binz等人, 2003, 在上述引文中和Binz等人, 2004, 在上述引文中)和建立的方案(Zahnd, C., Amstutz, P. 和Plückthun, A., Nat. Methods 4, 69-79,2007)，通过核糖体展示(Hanes和Plückthun, 在上述引文中) 进行了血清白蛋白特异性的锚蛋白重复蛋白的选择。使用建立的方案(Binz等人. 2004, 在上述引文中)，用N2C和N3CDARPin文库两者对HSA、CSA或MSA(包括用Neutravidin或抗生蛋白链菌素固定的HSA或MSA的生物素化变体)进行核糖体展示选择循环。每个选择循环后的反转录(RT)-PCR循环数恒定地从40减至30，因应富含结合剂的收率作出调节。对HSA、CSA或MSA的4个选择循环得到了微摩尔至纳摩尔-亲和力DARPin库，如通过对单一克隆的ELISA和SPR测量所揭示的。通过本领域技术人员熟知的方法(例如通过易错PCR使DARPin克隆多样化，并如上所述选择和筛选改进的结合剂)，使用亲和力成熟，进一步提高某些DARPin的亲和力。Selection of serum albumin-specific ankyrin repeat proteins was performed by ribosome display (Hanes and Plückthun, loc. cit.) using HSA, CSA, or MSA as target proteins using ankyrin repeat protein libraries designed as described (WO 2002/020565, Binz et al., 2003, loc. cit. and Binz et al., 2004, loc. cit.) and established protocols (Zahnd, C., Amstutz, P. and Plückthun, A., Nat. Methods 4 , 69-79, 2007). Ribosome display selection cycles were performed on HSA, CSA, or MSA (including biotinylated variants of HSA or MSA immobilized with Neutravidin or streptavidin) using both N2C and N3CDARPin libraries using established protocols (Binz et al., 2004, loc. cit.). The number of reverse transcription (RT)-PCR cycles after each selection cycle was constantly reduced from 40 to 30, adjusted for the yield of enriched binders. Four selection cycles for HSA, CSA, or MSA yielded a library of DARPins with micromolar to nanomolar affinities, as revealed by ELISA and SPR measurements of individual clones. The affinity of certain DARPins was further improved using affinity maturation by methods well known to those skilled in the art (e.g., diversification of DARPin clones by error-prone PCR and selection and screening of improved binders as described above).

如通过粗提取ELISA所示，经选择的克隆特异性地结合血清白蛋白Selected clones specifically bound serum albumin as shown by crude extract ELISA

利用标准方案，用DARPin表达细胞的粗制大肠杆菌提取物，通过酶联免疫吸附测定(ELISA)鉴定各种所选特异性地结合xSA的DARPin。通过核糖体展示，将所选克隆克隆入pQE30 (Qiagen)表达载体内，转化入大肠杆菌XL1-Blue (Stratagene)内，然后在装有1 ml生长培养基(2YT，含有1% 葡萄糖和100µg/ml氨苄西林)的96深孔板(每个克隆在单个孔内)中，在37℃培养过夜。在新的96深孔板中，用100µl过夜培养物接种1 ml含有50µg/ml氨苄西林的新鲜2YT。在37℃温育2小时后，用IPTG (1 mM终浓度)诱导表达，并继续3小时。将细胞收集，再悬浮于100µl B-PERII (Pierce)中，在室温振荡温育15分钟。然后加入900µl PBS-TC (补充了0.25% 酪蛋白水解物、0.1% 吐温20®、pH 7.4的PBS)，离心除去细胞碎片。将100µl各裂解的克隆施加于NeutrAvidin包被的MaxiSorp板的孔内，所述孔装有通过其生物素部分固定的xSA或不相关MBP，并在室温温育1小时。用PBS-T (补充了0.1% 吐温20®、pH7.4的PBS)充分洗涤后，用标准ELISA程序使板显影，所述程序使用单克隆抗-RGS(His)₄抗体(34650, Qiagen)作为第一抗体，并使用与碱性磷酸酶缀合的多克隆山羊抗小鼠抗体(A3562, Sigma)作为第二试剂。然后用4-硝基苯基磷酸二钠(4NPP, Fluka)作为碱性磷酸酶的底物，检测结合。在405 nm测量颜色显影。通过此类粗制细胞提取物ELISA对几百个克隆的筛选，揭示了超过一百种对xSA具有特异性的不同DARPin。选择这些结合蛋白用于进一步分析。在SEQ ID NO:17-31、37-40和43-48中提供了特异性地结合xSA的所选重复结构域的氨基酸序列实例。Using standard protocols, crude E. coli extracts from DARPin-expressing cells were used to identify each selected DARPin that specifically binds to xSA by enzyme-linked immunosorbent assay (ELISA). Selected clones were cloned into the pQE30 (Qiagen) expression vector by ribosome display, transformed into E. coli XL1-Blue (Stratagene), and cultured overnight at 37°C in a 96-deep-well plate (each clone in a single well) containing 1 ml of growth medium (2YT containing 1% glucose and 100 µg/ml ampicillin). In a new 96-deep-well plate, 100 µl of the overnight culture was inoculated with 1 ml of fresh 2YT containing 50 µg/ml ampicillin. After incubation at 37°C for 2 hours, expression was induced with IPTG (1 mM final concentration) for another 3 hours. Cells were harvested, resuspended in 100 µl of B-PERII (Pierce), and incubated with shaking at room temperature for 15 minutes. 900µl of PBS-TC (PBS supplemented with 0.25% casein hydrolysate, 0.1% Tween 20®, pH 7.4) was then added, and cell debris was removed by centrifugation. 100µl of each lysed clone was applied to the wells of a NeutrAvidin-coated MaxiSorp plate containing xSA or an irrelevant MBP immobilized via its biotin moiety and incubated at room temperature for 1 hour. After extensive washing with PBS-T (PBS supplemented with 0.1% Tween 20®, pH 7.4), the plate was developed using a standard ELISA procedure using a monoclonal anti-RGS(His) ₄ antibody (34650, Qiagen) as the primary antibody and a polyclonal goat anti-mouse antibody conjugated to alkaline phosphatase (A3562, Sigma) as the secondary reagent. Binding was then detected using disodium 4-nitrophenyl phosphate (4NPP, Fluka) as the substrate for alkaline phosphatase. Color development was measured at 405 nm. Screening of several hundred clones by such crude cell extract ELISA revealed over a hundred different DARPins specific for xSA. These binding proteins were selected for further analysis. Examples of amino acid sequences of selected repeat domains that specifically bind to xSA are provided in SEQ ID NOs: 17-31, 37-40, and 43-48.

从对xSA具有结合特异性的所选重复结构域推断重复序列基序（motives）Inference of repeat sequence motifs from selected repeat domains with binding specificity for xSA

通过本领域技术人员已知的序列分析工具(WO 2002/020565; Forrer等人,2003, 在上述引文中; Forrer, P., Binz, H.K., Stumpp, M.T.和Plückthun, A.,ChemBioChem, 5(2), 183-189, 2004)，进一步分析对xSA具有结合特异性的所选重复结构域的氨基酸序列。但是，与其中使用天然存在的重复基序来推断重复序列基序的WO 2002/020565相反，这里从对xSA具有结合特异性的所选重复结构域的重复单元推断重复序列基序。从而确定了包含共同重复序列基序的所选重复结构域家族。在SEQ ID NO:11-14中提供了对xSA具有结合特异性的重复结构域中存在的这类重复序列基序。The amino acid sequences of the selected repeat domains with binding specificity for xSA were further analyzed using sequence analysis tools known to those skilled in the art (WO 2002/020565; Forrer et al., 2003, loc. cit.; Forrer, P., Binz, HK, Stumpp, MT and Plückthun, A., ChemBioChem, 5(2) , 183-189, 2004). However, in contrast to WO 2002/020565, in which naturally occurring repeat motifs were used to infer repeat sequence motifs, repeat sequence motifs were inferred from the repeat units of the selected repeat domains with binding specificity for xSA. This resulted in the identification of a family of selected repeat domains containing a common repeat sequence motif. Such repeat sequence motifs present in the repeat domains with binding specificity for xSA are provided in SEQ ID NOs: 11-14.

DARPin的高水平和可溶性表达 High-level and soluble expression of DARPins

为了进一步分析，将如上所述在粗制细胞提取ELISA中显示出特异性xSA结合的所选克隆在大肠杆菌BL21或XL1-Blue细胞中表达，并用标准方案利用其His-标签纯化。用25ml静止过夜的培养物(LB，1% 葡萄糖，100 mg/l的氨苄西林；37℃) 接种1 l培养物(相同培养基)。在600 nm吸光度为0.7时，用0.5 mM IPTG诱导培养物，并在37℃温育4小时。将培养物离心，使所得沉淀再悬浮于40 ml TBS500 (50 mM Tris–HCl, 500 mM NaCl, pH 8)中，并超声处理。将裂解物再次离心，将甘油(10% (v/v)终浓度)和咪唑(20 mM终浓度)加入所得上清液中。根据生产商的说明书(QIAgen, 德国)，用Ni-次氮基三乙酸柱(2.5 ml柱体积)纯化蛋白。可替换地，根据本领域技术人员已知的标准树脂和方案，通过阴离子交换色谱法和随后的尺寸排阻色谱法，纯化不含6xHis-标签的DARPin或所选重复结构域。从1升大肠杆菌培养物中可纯化至多200 mg对血清白蛋白具有结合特异性的高度可溶性DARPin，根据SDS-15% PAGE估计的纯度> 95%。这样纯化的DARPin用于进一步表征。For further analysis, selected clones that demonstrated specific xSA binding in the crude cell extract ELISA as described above were expressed in Escherichia coli BL21 or XL1-Blue cells and purified using their His-tag using standard protocols. 1 liter of culture (same medium) was inoculated with 25 ml of a stationary overnight culture (LB, 1% glucose, 100 mg/l ampicillin; 37°C). The culture was induced with 0.5 mM IPTG at an absorbance of 0.7 at 600 nm and incubated at 37°C for 4 hours. The culture was centrifuged, and the resulting pellet was resuspended in 40 ml of TBS500 (50 mM Tris–HCl, 500 mM NaCl, pH 8) and sonicated. The lysate was centrifuged again, and glycerol (10% (v/v) final concentration) and imidazole (20 mM final concentration) were added to the resulting supernatant. The protein was purified using a Ni-nitrilotriacetic acid column (2.5 ml column volume) according to the manufacturer's instructions (QIAgen, Germany). Alternatively, DARPins or selected repeat domains without a 6xHis tag can be purified by anion exchange chromatography followed by size exclusion chromatography using standard resins and protocols known to those skilled in the art. Up to 200 mg of highly soluble DARPins with binding specificity for serum albumin can be purified from 1 liter of E. coli culture with a purity estimated at >95% by SDS-15% PAGE. The purified DARPins were used for further characterization.

实施例2: 对xSA具有结合特异性的DARPin的稳定性分析和尺寸排阻色谱法 Example 2: Stability Analysis and Size Exclusion Chromatography of DARPins with Binding Specificity for xSA

如上所述使用它们的His-标签将对xSA具有结合特异性的DARPin #19-22和DARPin #27-30纯化至接近同质，并以30 mg/ml (约2 mM) 在PBS中在40℃储存28天(稳定性研究)。在第0天(图1a)和第28天(图1b)，取出样品，稀释至500µM，并通过尺寸排阻色谱法(SEC)进行分析，以评估它们的表观分子量和随时间的稳定性(即它们的聚集、多聚化或降解趋势)。DARPins #19-22 and DARPin #27-30, which have binding specificity for xSA, were purified to near homogeneity using their His-tags as described above and stored at 30 mg/ml (approximately 2 mM) in PBS at 40°C for 28 days (stability study). On day 0 (Figure 1a) and day 28 (Figure 1b), samples were removed, diluted to 500 µM, and analyzed by size exclusion chromatography (SEC) to assess their apparent molecular weight and stability over time (i.e., their tendency to aggregate, multimerize, or degrade).

在另一个实验(图1c)中，如上所述使用它们的His-标签将对xSA具有结合特异性的DARPin #19和43-48纯化至接近同质，并以约100 mg/ml在PBS中在-80℃储存28天，稀释至500µM，并通过尺寸排阻色谱法(SEC)进行分析，用于表征(即它们的聚集、多聚化或降解趋势)。值得注意的是，与第一个分析系列相比，使用更大的柱(参见下文)。In another experiment (Figure 1c), DARPins #19 and 43-48 with binding specificity for xSA were purified to near homogeneity using their His-tags as described above and stored at -80°C in PBS at approximately 100 mg/ml for 28 days, diluted to 500 µM, and analyzed by size exclusion chromatography (SEC) for characterization (i.e., their tendency to aggregate, multimerize, or degrade). Of note, a larger column was used compared to the first analytical series (see below).

尺寸排阻色谱法(SEC)Size Exclusion Chromatography (SEC)

使用HPLC系统(Agilent 1200系列)，使用Superdex 200 5/150柱(图1a和图1b)或Superdex 200 10/300GL柱(图1c) (GE Healthcare)，在20℃进行分析型SEC。Superdex200 5/150柱具有3.0 ml的床体积和1.08 ml的外水体积(使用蓝葡聚糖经实验确定)。Superdex 200 10/300GL柱具有24 ml的床体积和约8 ml的外水体积。根据本领域技术人员已知的标准规程，进行测量。在0.2 ml/min的流速和15巴的最大压力(Superdex 200 5/150)，或者在0.6 ml/min的流速和18巴的最大压力(Superdex 200 10/300GL)，在PBS中运行。将蛋白样品在PBS中稀释至约20-500µM，过滤(0.22µm)，并将20-100µl稀释的样品注射到柱上进行分离。通过读出在280 nm的吸光度，记录蛋白样品的洗脱曲线。使用分子量为6.5 kDa的抑酶肽(AP)、分子量为29 kDa的碳酸酐酶(CA)和分子量为75 kDa的伴清蛋白(CO) 作为标准蛋白来得到校正曲线，从该校正曲线可以确定样品蛋白的表观分子量。Analytical SEC is carried out at 20 ℃ using an HPLC system (Agilent 1200 series) using a Superdex 200 5/150 column (Fig. 1a and Fig. 1b) or a Superdex 200 10/300GL column (Fig. 1c) (GE Healthcare). The Superdex 200 5/150 column has a bed volume of 3.0 ml and an external water volume of 1.08 ml (determined experimentally using blue dextran). The Superdex 200 10/300GL column has a bed volume of 24 ml and an external water volume of approximately 8 ml. Measured according to standard procedures known to those skilled in the art. At a flow rate of 0.2 ml/min and a maximum pressure of 15 bar (Superdex 200 5/150), or at a flow rate of 0.6 ml/min and a maximum pressure of 18 bar (Superdex 200 10/300GL), run in PBS. Protein samples were diluted to approximately 20-500 µM in PBS, filtered (0.22 µm), and 20-100 µl of the diluted sample was injected onto the column for separation. The elution profile of the protein sample was recorded by reading the absorbance at 280 nm. A calibration curve was generated using aprotinin (AP) with a molecular weight of 6.5 kDa, carbonic anhydrase (CA) with a molecular weight of 29 kDa, and conalbumin (CO) with a molecular weight of 75 kDa as standard proteins. The apparent molecular weight of the sample protein was determined from this calibration curve.

结果显示在图1中。DARPin #19-22和27-30在稳定性研究的第0天和第28天显示出不能辨别的SEC色谱图(即不能辨别的洗脱曲线)。结论是，所有DARPin在测定条件下作为单体洗脱，且DARPin #19-23和27-30在PBS中在40℃稳定至少1个月(即它们的洗脱曲线没有揭示任何聚集、多聚化或降解趋势)。The results are shown in Figure 1. DARPins #19-22 and 27-30 showed indistinguishable SEC chromatograms (i.e., indistinguishable elution profiles) on days 0 and 28 of the stability study. It was concluded that all DARPins eluted as monomers under the assay conditions, and that DARPins #19-23 and 27-30 were stable in PBS at 40°C for at least 1 month (i.e., their elution profiles did not reveal any tendency toward aggregation, multimerization, or degradation).

实施例3: 对xSA具有结合特异性的DARPin的热稳定性 Example 3: Thermal stability of DARPins with binding specificity for xSA

用基于荧光的热稳定性测定(Niesen, F.H., Nature Protocols 2(9): 2212-2221, 2007)，分析了对xSA具有特异性的DARPin的热稳定性。由此，通过对蛋白的疏水部分（其随着蛋白解折叠而暴露）具有亲和力的染料(例如SYPRO Orange; Invitrogen, 目录号S6650) 的荧光的增加，测量蛋白(即这样的DARPin)解折叠时的温度。在由此得到的荧光转变中点(从较低荧光强度至较高荧光强度)处的温度则对应于分析的蛋白的中点变性温度(Tm)。The thermal stability of DARPins specific for xSA was analyzed using a fluorescence-based thermal stability assay (Niesen, FH, Nature Protocols 2(9) : 2212-2221, 2007). The temperature at which a protein (i.e., such a DARPin) unfolds is measured by the increase in fluorescence of a dye (e.g., SYPRO Orange; Invitrogen, catalog number S6650) that has affinity for the hydrophobic portion of the protein (which is exposed as the protein unfolds). The temperature at the midpoint of the resulting fluorescence transition (from lower fluorescence intensity to higher fluorescence intensity) corresponds to the midpoint denaturation temperature (Tm) of the analyzed protein.

基于荧光的热稳定性测定Fluorescence-based thermal stability assay

使用实时PCR仪器(即与CFX96光学系统(BioRad)相组合的C1000热循环仪(BioRad))，使用SYPRO Orange作为荧光染料，测量了DARPin的热变性。在含有1x SYPROOrange (从5’000x SYPRO Orange储备溶液稀释, Invitrogen)的pH 7.4的PBS或pH 5.8的MES缓冲液中，制备了80µM浓度的DARPin，并将50µl这样的蛋白溶液或仅缓冲液加入白色96-孔PCR板(Bio-Rad)中。用Microseal 'B' Adhesive Seals (Bio-Rad)密封所述板，并在实时PCR仪器中以0.5℃的增量从20℃加热至95℃，包括在每个温度增量以后的25秒保温步骤，在DARPin的热变性之后，测量样品在每个温度增量处的相对荧光单位。使用实时PCR仪器的通道2(即在515-535 nm激发和在560-580 nm检测)，测量板孔中的相对荧光单位，并减去用仅缓冲液得到的对应值。从由此得到的热变性转变中点，可以确定分析的DARPin的Tm值。Thermal denaturation of DARPins was measured using a real-time PCR instrument (i.e., a C1000 Thermal Cycler (BioRad) coupled with a CFX96 optical system (BioRad)) using SYPRO Orange as the fluorescent dye. DARPins were prepared at an 80 µM concentration in either PBS (pH 7.4) or MES buffer (pH 5.8) containing 1x SYPRO Orange (diluted from a 5'000x SYPRO Orange stock solution, Invitrogen), and 50 µl of this protein solution or buffer alone was added to a white 96-well PCR plate (Bio-Rad). The plate was sealed with Microseal 'B' Adhesive Seals (Bio-Rad) and heated in a real-time PCR instrument from 20°C to 95°C in 0.5°C increments, including a 25-second incubation step after each temperature increment. Following thermal denaturation of the DARPins, the relative fluorescence units of the samples at each temperature increment were measured. Using channel 2 of the real-time PCR instrument (i.e., excitation at 515-535 nm and detection at 560-580 nm), the relative fluorescence units in the wells were measured and the corresponding values obtained with buffer alone were subtracted. From the resulting thermal denaturation transition midpoint, the Tm value of the analyzed DARPin can be determined.

在图2和图3中显示了DARPin在pH7.4的PBS或pH 5.8的MES-缓冲液中的热变性结果，所述热变性通过SYPRO Orange的荧光强度增加来跟踪。测得的热变性转变证实，分析的所有对xSA具有结合特异性的DARPin都具有远高于40℃的Tm值(在pH 7.4和pH 5.8两者)。The results of thermal denaturation of DARPins in PBS at pH 7.4 or MES buffer at pH 5.8, which was followed by an increase in the fluorescence intensity of SYPRO Orange, are shown in Figures 2 and 3. The measured thermal denaturation transitions confirmed that all analyzed DARPins with binding specificity for xSA had Tm values well above 40°C (at both pH 7.4 and pH 5.8).

实施例4：通过表面等离子体共振分析表征对xSA具有结合特异性的DARPin Example 4: Characterization of DARPins with Binding Specificity for xSA by Surface Plasmon Resonance Analysis

经由它们的His-标签，在流动池中将对xSA具有结合特异性的DARPin固定至包被的α-RGS-His抗体(Qiagen, 目录号34650)上，并分析人、食蟹猴(cyno)、小鼠、大鼠、兔和狗血清白蛋白与固定的DARPin的相互作用。DARPins with binding specificity for xSA were immobilized via their His-tags to coated α-RGS-His antibodies (Qiagen, Cat. No. 34650) in a flow cell, and the interaction of human, cynomolgus monkey (cyno), mouse, rat, rabbit, and dog serum albumin with the immobilized DARPins was analyzed.

表面等离子体共振(SPR) 分析Surface Plasmon Resonance (SPR) Analysis

使用ProteOn仪器(BioRad)测量SPR，并根据本领域技术人员已知的标准规程进行测量。运行缓冲液为pH 7.4的PBS，其含有0.01% 吐温20®。将抗-RGS-His抗体共价地固定至GLC芯片(BioRad)上，达到约2000共振单位(RU)的水平。然后通过在300 s中注射150µl 1µM DARPin溶液(流速= 30µl/min)，进行DARPin在抗体包被的芯片上的固定化。然后如下测量与不同物种的血清白蛋白的相互作用：在60秒中注射100µl体积的运行缓冲液(含有0.01% 吐温的PBS)，其含有浓度为400 nM、200 nM、100 nM、50 nM的不同血清白蛋白(结合速率测量)，随后让运行缓冲液流动10-30分钟(流速= 100µl/min) (解离速率测量)。从注射血清白蛋白以后得到的RU迹线减去未包被的参考池和参考注射(即仅注射运行缓冲液)的信号(即共振单位(RU)值)(双重参考)。从得自结合速率和解离速率测量的SRP迹线，可以确定对应的DARPin血清白蛋白相互作用的结合速率和解离速率。SPR measurements were performed using a ProteOn instrument (BioRad) according to standard protocols known to those skilled in the art. The running buffer was PBS, pH 7.4, containing 0.01% Tween 20®. Anti-RGS-His antibodies were covalently immobilized on a GLC chip (BioRad) to a level of approximately 2000 resonance units (RU). DARPin immobilization on the antibody-coated chip was then performed by injecting 150 µl of a 1 µM DARPin solution over 300 s (flow rate = 30 µl/min). Interactions with serum albumin from different species were then measured as follows: a 100 µl volume of running buffer (PBS containing 0.01% Tween) containing serum albumin at concentrations of 400 nM, 200 nM, 100 nM, and 50 nM was injected over 60 seconds (on-rate measurement), followed by a 10-30 minute flow of running buffer (flow rate = 100 µl/min) (off-rate measurement). The signals (i.e., resonance unit (RU) values) of an uncoated reference cell and a reference injection (i.e., running buffer alone) were subtracted from the RU traces obtained after serum albumin injection (double referencing). The association and dissociation rates of the corresponding DARPin-serum albumin interactions can be determined from the SRP traces obtained from the association and dissociation rate measurements.

结果总结在表1和表2中。使用本领域技术人员已知的标准规程，从估测的结合速率和解离速率计算出解离常数(Kd)，并发现是在约3nM至约300 nM范围内。尽管分析的所有DARPin都结合人和食蟹猴血清白蛋白，兔、小鼠、大鼠和狗血清白蛋白仅被这些DARPin的子集结合。The results are summarized in Tables 1 and 2. The dissociation constant (Kd) was calculated from the estimated association and dissociation rates using standard procedures known to those skilled in the art and was found to be in the range of about 3 nM to about 300 nM. Although all DARPins analyzed bound to human and cynomolgus monkey serum albumin, rabbit, mouse, rat, and dog serum albumin were bound by only a subset of these DARPins.

实施例5: 对xSA具有结合特异性的DARPin的终末血浆半衰期 Example 5: Terminal plasma half-life of DARPins with binding specificity for xSA

根据本领域技术人员已知的标准规程(Toutain, 等人, 在上述引文中)，确定DARPin在小鼠和食蟹猴(成束猴, 也缩写为“cyno”) 中的终末血浆半衰期。将一定量的DARPin静脉内地注射进哺乳动物中，并通过跟踪DARPin的血浆浓度，跟踪DARPin随时间而变化的从血浆清除。DARPin浓度最初下降，直到达到假平衡(α-期)，随后是DARPin血浆浓度的指数式进一步下降(β-期)。然后可以从该β-期计算出DARPin终末血浆半衰期。The terminal plasma half-life of DARPins in mice and cynomolgus monkeys (also abbreviated as "cyno") is determined according to standard procedures known to those skilled in the art (Toutain et al., loc. cit.). A predetermined amount of DARPin is injected intravenously into the mammals, and the clearance of the DARPin from the plasma is tracked over time by tracking the DARPin plasma concentration. The DARPin concentration initially decreases until a pseudo-equilibrium is reached (the α-phase), followed by an exponential further decrease in the DARPin plasma concentration (the β-phase). The terminal plasma half-life of the DARPin can then be calculated from this β-phase.

DARPin在小鼠中的血浆清除的确定 Determination of plasma clearance of DARPins in mice

为了评估对xSA具有结合特异性的DARPin的血浆清除，将试验蛋白进行放射性标记，并注射进首次用于实验的Balb/c小鼠的尾静脉中。注射下述的DARPin：DARPin#19、DARPin#21、DARPin #23、DARPin #25、DARPin #18、、DARPin #32、DARPin #35、DARPin #36、DARPin #33、DARPin #34、DARPin #37、DARPin #38、DARPin #43、DARPin#44、DARPin#45、DARPin#46、DARPin#47和DARPin#48。如以前所述(Waibel, R., 等人, NatureBiotechnol. 17(9), 897-901, 1999)，用^99mTc-羰基复合物对DARPin进行放射性标记。将DARPin (40µg)与^99mTc-羰基(0.8-1.6 m Ci)一起温育1 h，然后在PBS (pH 7.4)中稀释至400µl。给每只小鼠静脉内注射100µl由此得到的标记的DARPin溶液(相当于10µg蛋白和0.2-0.4 m Ci)。在初次注射以后1 h、4 h、24 h和48 h，采集小鼠的血液样品，并测量样品的放射性。在特定时间点测得的放射性水平，是在该时间点时在血浆中仍然存在的DARPin的量的直接量度。%注射剂量是，在特定时间点测得的小鼠全血(就18 g小鼠而言，1.6 ml)的总放射性相对于针对^99mTc的放射性衰变校正过的注射样品的总放射性的百分比。To evaluate the plasma clearance of DARPins with binding specificity for xSA, the test proteins were radiolabeled and injected into the tail vein of naive Balb/c mice. The following DARPins were injected: DARPin #19, DARPin #21, DARPin #23, DARPin #25, DARPin #18, DARPin #32, DARPin #35, DARPin #36, DARPin #33, DARPin #34, DARPin #37, DARPin #38, DARPin #43, DARPin #44, DARPin #45, DARPin #46, DARPin #47, and DARPin #48. DARPins were radiolabeled with ^99m Tc-carbonyl complexes as previously described (Waibel, R., et al., Nature Biotechnol. 17(9) , 897-901, 1999). DARPin (40 µg) was incubated with ^99m Tc-carbonyl (0.8-1.6 m Ci) for 1 hour and then diluted to 400 µl in PBS (pH 7.4). Each mouse was injected intravenously with 100 µl of the resulting labeled DARPin solution (equivalent to 10 µg protein and 0.2-0.4 m Ci). Blood samples were collected from the mice 1, 4, 24, and 48 hours after the initial injection, and the radioactivity in the samples was measured. The radioactivity level measured at a specific time point is a direct measure of the amount of DARPin still present in the plasma at that time point. The % injected dose is the percentage of the total radioactivity measured in the mouse whole blood (1.6 ml for an 18 g mouse) at a specific time point relative to the total radioactivity in the injected sample corrected for radioactive decay of ^99m Tc.

与对xSA没有结合特异性的DARPin #32相比，对MSA具有结合特异性的DARPin在小鼠中具有大幅增加的终末血浆半衰期(图4)。DARPin#19、DARPin#21、DARPin#23、DARPin#33、DARPin #37、DARPin #43、DARPin#44、DARPin#45、DARPin#46、DARPin#47和DARPin#48在小鼠中具有约2–2.5天的终末血浆半衰期。Compared to DARPin #32, which has no binding specificity for xSA, DARPins with binding specificity for MSA had significantly increased terminal plasma half-lives in mice (Figure 4). DARPin #19, DARPin #21, DARPin #23, DARPin #33, DARPin #37, DARPin #43, DARPin #44, DARPin #45, DARPin #46, DARPin #47, and DARPin #48 had terminal plasma half-lives of approximately 2–2.5 days in mice.

DARPin在食蟹猴中的血浆清除的确定 Determination of plasma clearance of DARPin in cynomolgus monkeys

将在PBS中稀释的DARPin作为快速推注注射进食蟹猴的头静脉中。注射下述DARPin：DARPin #26 (0，5 mg/kg)、DARPin #24 (0.5 mg/kg)、DARPin #17 (0.5 mg/kg)、DARPin #34 (1 mg/kg)和DARPin #32 (0.5 mg/kg)。在注射后的不同时间点，从采自动物股静脉的血液制备血浆。然后使用本领域技术人员已知的标准方案和具有已知DARPin浓度的适当DARPin标准曲线，通过夹心ELISA确定血浆样品中的DARPin浓度。DARPins diluted in PBS were injected as a bolus into the cephalic vein of cynomolgus monkeys. The following DARPins were injected: DARPin #26 (0.5 mg/kg), DARPin #24 (0.5 mg/kg), DARPin #17 (0.5 mg/kg), DARPin #34 (1 mg/kg), and DARPin #32 (0.5 mg/kg). At various time points after injection, plasma was prepared from blood drawn from the femoral vein of the animals. DARPin concentrations in the plasma samples were then determined by sandwich ELISA using standard protocols known to those skilled in the art and appropriate DARPin standard curves with known DARPin concentrations.

在用抗-DARPin的特异性兔单克隆抗体包被的MaxiSorp ELISA平板上，在PBS-C(含有0.25% 酪蛋白的PBS, pH 7.4)中对食蟹猴的血浆样品进行系列稀释。在用PBS-T (补充了0.1% 吐温20®的PBS，pH 7.4) 充分洗涤以后，用被辣根过氧化物酶HRP标记的单克隆抗-RGS(His)4抗体(Qiagen)使平板显影。然后使用100µl BM-Blue POD底物(RocheDiagnostics)，检测结合。通过加入50µl 1 M H₂SO₄，停止反应，并测量在450 nm的吸光度(和减去在620 nm的吸光度)。通过对在猴血清中稀释的DARPin的标准曲线进行单指数回归(GraphPad Prism)，计算血浆样品中的DARPin的浓度。通过对注射最多240 h测得的浓度值进行非线性回归(两阶段衰变)，计算DARPin的血浆终末半衰期。第二(β)阶段的半衰期对应于终末血浆半衰期。Cynomolgus monkey plasma samples were serially diluted in PBS-C (PBS containing 0.25% casein, pH 7.4) on MaxiSorp ELISA plates coated with anti-DARPin-specific rabbit monoclonal antibodies. After extensive washing with PBS-T (PBS supplemented with 0.1% Tween 20®, pH 7.4), the plates were developed with a horseradish peroxidase (HRP)-conjugated monoclonal anti-RGS(His)4 antibody (Qiagen). Binding was detected using 100 µl of BM-Blue POD substrate (Roche Diagnostics). The reaction was stopped by adding 50 µl _of 1 M _H₂SO₄ , and the absorbance at 450 nm was measured (and the absorbance at 620 nm was subtracted). The DARPin concentration in the plasma samples was calculated by single exponential regression (GraphPad Prism) against a standard curve of DARPin diluted in monkey serum. The terminal plasma half-life of DARPins was calculated by nonlinear regression (two-phase decay) of concentrations measured up to 240 h after injection. The second (β) phase half-life corresponds to the terminal plasma half-life.

与对xSA没有结合特异性的DARPin #32相比，对xSA具有结合特异性的DARPin在食蟹猴中具有增加的终末血浆半衰期(图4、表3)。DARPin#19、DARPin#21、DARPin #43、DARPin#44、DARPin#45、DARPin#46、DARPin#47和DARPin#48在食蟹猴中具有约10-15天的终末血浆半衰期。Compared to DARPin #32, which has no binding specificity for xSA, DARPins with binding specificity for xSA had increased terminal plasma half-lives in cynomolgus monkeys ( Figure 4 , Table 3 ). DARPin #19, DARPin #21, DARPin #43, DARPin #44, DARPin #45, DARPin #46, DARPin #47, and DARPin #48 had terminal plasma half-lives of approximately 10-15 days in cynomolgus monkeys.

实施例6：具有改进的C-端加帽模块的DARPin的较高热稳定性 Example 6: Higher thermal stability of DARPins with improved C-terminal capping modules

如在实施例3中所述，用基于荧光的热稳定性测定来分析DARPin的热稳定性。可替换地，通过CD光谱测定法分析DARPin的热稳定性；即通过本领域技术人员熟知的技术跟踪它在222 nm的圆二色性(CD) 信号，从而测量它的热变性。在Jasco J-715仪器(Jasco, 日本)中在222 nm记录样品的CD信号，同时使用1℃/min的温度增量，将浓度为0.02 mM的蛋白（在pH 7.4的PBS中）缓慢地从20℃加热至95℃。这是跟踪DARPin的变性的有效方式，因为DARPin主要由α螺旋组成，所述α螺旋在解折叠后在222 nm处表现出它们的CD信号的强烈变化。观察到的DARPin的这样测量的CD信号迹线的转变中点对应于它的Tm值。As described in Example 3, the thermal stability of DARPins was analyzed using a fluorescence-based thermal stability assay. Alternatively, the thermal stability of a DARPin was analyzed by CD spectrometry; that is, its thermal denaturation was measured by tracking its circular dichroism (CD) signal at 222 nm using techniques well known to those skilled in the art. The sample's CD signal was recorded at 222 nm in a Jasco J-715 instrument (Jasco, Japan) while slowly heating a 0.02 mM concentration of the protein (in PBS, pH 7.4) from 20°C to 95°C using a temperature increment of 1°C/min. This is an effective way to track the denaturation of DARPins, as DARPins are primarily composed of α-helices, which exhibit a strong change in their CD signal at 222 nm upon unfolding. The midpoint of the observed transition of the DARPin's CD signal trace measured in this manner corresponds to its Tm value.

使用基于荧光的热稳定性测定，将DARPin #37 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:37)的热稳定性与DARPin #38 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:38)的热稳定性进行了对比。除了它们的重复结构域的C-端加帽模块以外，这2种DARPin具有相同的氨基酸序列。DARPin #38的重复结构域包含如本文所述的改进的C-加帽模块，但是DARPin #37不包含。测得的DARPin #37和DARPin#38在pH 7.4的PBS中的Tm值分别为约63℃和约73℃。测得的DARPin #37和DARPin #38在pH5.8的MES缓冲液中的Tm值分别为约54.5℃和约66℃。The thermal stability of DARPin #37 (SEQ ID NO: 37 with a His-tag (SEQ ID NO: 15) fused to its N-terminus) was compared to that of DARPin #38 (SEQ ID NO: 38 with a His-tag (SEQ ID NO: 15) fused to its N-terminus) using a fluorescence-based thermal stability assay. These two DARPins have identical amino acid sequences, except for the C-terminal capping module of their repeat domains. The repeat domain of DARPin #38 contains the improved C-capping module described herein, while DARPin #37 does not. The Tm values of DARPin #37 and DARPin #38 were measured in PBS at pH 7.4 at approximately 63°C and approximately 73°C, respectively. The Tm values of DARPin #37 and DARPin #38 were measured in MES buffer at pH 5.8 at approximately 54.5°C and approximately 66°C, respectively.

使用基于荧光的热稳定性测定，将DARPin #39 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:39) 的热稳定性与DARPin #40 (具有融合在它的N-端处的His-标签(SEQ ID NO:15)的SEQ ID NO:40)的热稳定性进行了对比。除了它们的重复结构域的C-端加帽模块以外，这2种DARPin具有相同的氨基酸序列。DARPin #40的重复结构域包含如本文所述的改进的C-加帽模块，但是DARPin #39不包含。测得的DARPin #39和DARPin #40在pH 5.8的MES缓冲液中的Tm值分别为约51℃和约55℃。Using a fluorescence-based thermostability assay, the thermostability of DARPin #39 (SEQ ID NO: 39 with a His-tag (SEQ ID NO: 15) fused to its N-terminus) was compared to that of DARPin #40 (SEQ ID NO: 40 with a His-tag (SEQ ID NO: 15) fused to its N-terminus). These two DARPins have identical amino acid sequences, except for the C-terminal capping module of their repeat domains. The repeat domain of DARPin #40 contains the improved C-capping module described herein, while DARPin #39 does not. The T values of DARPin #39 and DARPin #40 were measured to be approximately 51°C and approximately 55°C, respectively, in MES buffer at pH 5.8.

使用CD光谱法，将DARPin #41 (SEQ ID NO:41)的热稳定性与DARPin #42 (SEQID NO:42)的热稳定性进行了对比。除了它们的重复结构域的C-端加帽模块以外，这2种DARPin具有相同的氨基酸序列。DARPin #42的重复结构域包含如本文所述的改进的C-加帽模块，但是DARPin #41不包含。测得的DARPin #41和DARPin #42在pH 7.4的PBS中的Tm值分别为约59.5℃和约73℃。The thermal stability of DARPin #41 (SEQ ID NO:41) was compared with that of DARPin #42 (SEQ ID NO:42) using CD spectroscopy. These two DARPins share the same amino acid sequence, except for the C-terminal capping module of their repeat domains. The repeat domain of DARPin #42 contains the improved C-capping module described herein, while DARPin #41 does not. The Tm values of DARPin #41 and DARPin #42 in PBS at pH 7.4 were measured to be approximately 59.5°C and approximately 73°C, respectively.

序列表Sequence Listing

<110> Molecular Partners AG<110> Molecular Partners AG

Steiner, DanielSteiner, Daniel

Binz, Hans KasparBinz, Hans Kaspar

Gulotti-Georgieva, Maya Gulotti-Georgieva, Maya

Merz, Frieder W.Merz, Frieder W.

Phillips, DouglasPhillips, Douglas

Sonderegger, IvoSonderegger, Ivo

<120> 设计的与血清白蛋白结合的重复蛋白<120> Designed repeat proteins that bind to serum albumin

<130> P393A<130> P393A

<150> EP10192711.9<150> EP10192711.9

<151> 2010-11-26<151> 2010-11-26

<160> 54<160> 54

<170> PatentIn 3.5版<170> PatentIn version 3.5

<210> 1<210> 1

<211> 32<211> 32

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 1<400> 1

Gly Ser Asp Leu Gly Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly GlnGly Ser Asp Leu Gly Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly Gln

1 5 10 151 5 10 15

Asp Asp Glu Val Arg Ile Leu Met Ala Asn Gly Ala Asp Val Asn AlaAsp Asp Glu Val Arg Ile Leu Met Ala Asn Gly Ala Asp Val Asn Ala

20 25 3020 25 30

<210> 2<210> 2

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 2<400> 2

Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile Asp Asn GlyGln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile Asp Asn Gly

1 5 10 151 5 10 15

Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu AsnAsn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu Asn

20 2520 25

<210> 3<210> 3

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 3<400> 3

Gln Asp Lys Phe Gly Lys Thr Pro Phe Asp Leu Ala Ile Arg Glu GlyGln Asp Lys Phe Gly Lys Thr Pro Phe Asp Leu Ala Ile Arg Glu Gly

1 5 10 151 5 10 15

His Glu Asp Ile Ala Glu Val Leu Gln Lys Ala AlaHis Glu Asp Ile Ala Glu Val Leu Gln Lys Ala Ala

20 2520 25

<210> 4<210> 4

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 4<400> 4

Gln Asp Lys Phe Gly Lys Thr Pro Phe Asp Leu Ala Ile Asp Asn GlyGln Asp Lys Phe Gly Lys Thr Pro Phe Asp Leu Ala Ile Asp Asn Gly

1 5 10 151 5 10 15

Asn Glu Asp Ile Ala Glu Val Leu Gln Lys Ala AlaAsn Glu Asp Ile Ala Glu Val Leu Gln Lys Ala Ala

20 2520 25

<210> 5<210> 5

<211> 30<211> 30

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<223> Xaa可以是任意天然存在的氨基酸<223> Xaa can be any naturally occurring amino acid

<220><220>

<221> misc_feature<221> misc_feature

<222> (3)..(3)<222> (3)..(3)

<220><220>

<221> misc_feature<221> misc_feature

<222> (22)..(22)<222> (22)..(22)

<220><220>

<221> misc_feature<221> misc_feature

<222> (24)..(24)<222> (24)..(24)

<400> 5<400> 5

Xaa Leu Xaa Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly Gln Asp AspXaa Leu Xaa Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly Gln Asp Asp

1 5 10 151 5 10 15

Glu Val Arg Ile Leu Xaa Ala Xaa Gly Ala Asp Val Asn AlaGlu Val Arg Ile Leu Xaa Ala Xaa Gly Ala Asp Val Asn Ala

20 25 3020 25 30

<210> 6<210> 6

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (4)..(4)<222> (4)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (8)..(9)<222> (8)..(9)

<220><220>

<221> misc_feature<221> misc_feature

<222> (11)..(13)<222> (11)..(13)

<220><220>

<221> misc_feature<221> misc_feature

<222> (15)..(15)<222> (15)..(15)

<220><220>

<221> misc_feature<221> misc_feature

<222> (17)..(17)<222> (17)..(17)

<220><220>

<221> misc_feature<221> misc_feature

<222> (20)..(20)<222> (20)..(20)

<220><220>

<221> misc_feature<221> misc_feature

<222> (23)..(23)<222> (23)..(23)

<400> 6<400> 6

Xaa Asp Lys Xaa Gly Lys Thr Xaa Xaa Asp Xaa Xaa Xaa Asp Xaa GlyXaa Asp Lys Xaa Gly Lys Thr Xaa Xaa Asp Xaa Xaa Xaa Asp Xaa Gly

1 5 10 151 5 10 15

Xaa Glu Asp Xaa Ala Glu Xaa Leu Gln Lys Ala AlaXaa Glu Asp Xaa Ala Glu Xaa Leu Gln Lys Ala Ala

20 2520 25

<210> 7<210> 7

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (4)..(4)<222> (4)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (8)..(8)<222> (8)..(8)

<220><220>

<221> misc_feature<221> misc_feature

<222> (11)..(13)<222> (11)..(13)

<220><220>

<221> misc_feature<221> misc_feature

<222> (15)..(15)<222> (15)..(15)

<220><220>

<221> misc_feature<221> misc_feature

<222> (17)..(17)<222> (17)..(17)

<220><220>

<221> misc_feature<221> misc_feature

<222> (20)..(20)<222> (20)..(20)

<220><220>

<221> misc_feature<221> misc_feature

<222> (23)..(23)<222> (23)..(23)

<400> 7<400> 7

Xaa Asp Lys Xaa Gly Lys Thr Xaa Ala Asp Xaa Xaa Xaa Asp Xaa GlyXaa Asp Lys Xaa Gly Lys Thr Xaa Ala Asp Xaa Xaa Xaa Asp Xaa Gly

1 5 10 151 5 10 15

20 2520 25

<210> 8<210> 8

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (4)..(4)<222> (4)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (8)..(8)<222> (8)..(8)

<220><220>

<221> misc_feature<221> misc_feature

<222> (11)..(12)<222> (11)..(12)

<220><220>

<221> misc_feature<221> misc_feature

<222> (15)..(15)<222> (15)..(15)

<220><220>

<221> misc_feature<221> misc_feature

<222> (17)..(17)<222> (17)..(17)

<220><220>

<221> misc_feature<221> misc_feature

<222> (20)..(20)<222> (20)..(20)

<220><220>

<221> misc_feature<221> misc_feature

<222> (23)..(23)<222> (23)..(23)

<400> 8<400> 8

Xaa Asp Lys Xaa Gly Lys Thr Xaa Ala Asp Xaa Xaa Ala Asp Xaa GlyXaa Asp Lys Xaa Gly Lys Thr Xaa Ala Asp Xaa Xaa Ala Asp Xaa Gly

1 5 10 151 5 10 15

20 2520 25

<210> 9<210> 9

<211> 28<211> 28

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 9<400> 9

Gln Asp Lys Ser Gly Lys Thr Pro Ala Asp Leu Ala Ala Asp Ala GlyGln Asp Lys Ser Gly Lys Thr Pro Ala Asp Leu Ala Ala Asp Ala Gly

1 5 10 151 5 10 15

20 2520 25

<210> 10<210> 10

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (3)..(4)<222> (3)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (6)..(6)<222> (6)..(6)

<220><220>

<221> misc_feature<221> misc_feature

<222> (14)..(15)<222> (14)..(15)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(27)<222> (27)..(27)

<400> 10<400> 10

Xaa Asp Xaa Xaa Gly Xaa Thr Pro Leu His Leu Ala Ala Xaa Xaa GlyXaa Asp Xaa Xaa Gly Xaa Thr Pro Leu His Leu Ala Ala Xaa Xaa Gly

1 5 10 151 5 10 15

His Leu Glu Ile Val Glu Val Leu Leu Lys Xaa Gly Ala Asp Val AsnHis Leu Glu Ile Val Glu Val Leu Leu Lys Xaa Gly Ala Asp Val Asn

20 25 3020 25 30

AlaAla

<210> 11<210> 11

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (5)..(5)<222> (5)..(5)

<220><220>

<221> misc_feature<221> misc_feature

<222> (15)..(16)<222> (15)..(16)

<220><220>

<221> misc_feature<221> misc_feature

<222> (19)..(19)<222> (19)..(19)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(27)<222> (27)..(27)

<400> 11<400> 11

Xaa Asp Tyr Phe Xaa His Thr Pro Leu His Leu Ala Ala Arg Xaa XaaXaa Asp Tyr Phe Xaa His Thr Pro Leu His Leu Ala Ala Arg Xaa Xaa

1 5 10 151 5 10 15

His Leu Xaa Ile Val Glu Val Leu Leu Lys Xaa Gly Ala Asp Val AsnHis Leu Xaa Ile Val Glu Val Leu Leu Lys Xaa Gly Ala Asp Val Asn

20 25 3020 25 30

AlaAla

<210> 12<210> 12

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (4)..(4)<222> (4)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (6)..(6)<222> (6)..(6)

<220><220>

<221> misc_feature<221> misc_feature

<222> (14)..(14)<222> (14)..(14)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(27)<222> (27)..(27)

<400> 12<400> 12

Xaa Asp Phe Xaa Gly Xaa Thr Pro Leu His Leu Ala Ala Xaa Asp GlyXaa Asp Phe Xaa Gly Xaa Thr Pro Leu His Leu Ala Ala Xaa Asp Gly

1 5 10 151 5 10 15

20 25 3020 25 30

AlaAla

<210> 13<210> 13

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (3)..(4)<222> (3)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (20)..(20)<222> (20)..(20)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(27)<222> (27)..(27)

<400> 13<400> 13

Xaa Asp Xaa Xaa Gly Thr Thr Pro Leu His Leu Ala Ala Val Tyr GlyXaa Asp Xaa Xaa Gly Thr Thr Pro Leu His Leu Ala Ala Val Tyr Gly

1 5 10 151 5 10 15

His Leu Glu Xaa Val Glu Val Leu Leu Lys Xaa Gly Ala Asp Val AsnHis Leu Glu Xaa Val Glu Val Leu Leu Lys Xaa Gly Ala Asp Val Asn

20 25 3020 25 30

AlaAla

<210> 14<210> 14

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (18)..(18)<222> (18)..(18)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(29)<222> (27)..(29)

<220><220>

<221> misc_feature<221> misc_feature

<222> (31)..(31)<222> (31)..(31)

<400> 14<400> 14

Xaa Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser Ser GlyXaa Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser Ser Gly

1 5 10 151 5 10 15

His Xaa Glu Ile Val Glu Val Leu Leu Lys Xaa Xaa Xaa Asp Xaa AsnHis Xaa Glu Ile Val Glu Val Leu Leu Lys Xaa Xaa Xaa Asp Xaa Asn

20 25 3020 25 30

AlaAla

<210> 15<210> 15

<211> 10<211> 10

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 15<400> 15

Met Arg Gly Ser His His His His His HisMet Arg Gly Ser His His His His His

1 5 101 5 10

<210> 16<210> 16

<211> 20<211> 20

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 16<400> 16

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser GlyGly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly

1 5 10 151 5 10 15

Gly Gly Gly SerGly Gly Gly Ser

2020

<210> 17<210> 17

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 17<400> 17

1 5 10 151 5 10 15

20 25 3020 25 30

Ala Asp Tyr Phe Gly His Thr Pro Leu His Leu Ala Ala Arg Asp GlyAla Asp Tyr Phe Gly His Thr Pro Leu His Leu Ala Ala Arg Asp Gly

35 40 4535 40 45

His Leu Lys Ile Val Glu Val Leu Leu Lys Tyr Gly Ala Asp Val AsnHis Leu Lys Ile Val Glu Val Leu Leu Lys Tyr Gly Ala Asp Val Asn

50 55 6050 55 60

Ala Ser Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn AspAla Ser Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn Asp

65 70 75 8065 70 75 80

Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala Asp ValGly His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala Asp Val

85 90 9585 90 95

Asn Ala Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile AspAsn Ala Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile Asp

100 105 110100 105 110

Asn Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu AsnAsn Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu Asn

115 120 125115 120 125

<210> 18<210> 18

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 18<400> 18

1 5 10 151 5 10 15

20 25 3020 25 30

Ala Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asp GlyAla Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asp Gly

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

Asn Ala Gln Asp Ile Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile AspAsn Ala Gln Asp Ile Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile Asp

100 105 110100 105 110

115 120 125115 120 125

<210> 19<210> 19

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 19<400> 19

1 5 10 151 5 10 15

20 25 3020 25 30

Lys Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asn GlyLys Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asn Gly

35 40 4535 40 45

His Leu Lys Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp Val AsnHis Leu Lys Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp Val Asn

50 55 6050 55 60

Ala Lys Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn AspAla Lys Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn Asp

65 70 75 8065 70 75 80

85 90 9585 90 95

Asn Ala Gln Asp Ile Phe Gly Lys Thr Pro Ala Asp Ile Ala Ala AspAsn Ala Gln Asp Ile Phe Gly Lys Thr Pro Ala Asp Ile Ala Ala Asp

100 105 110100 105 110

Ala Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu AsnAla Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu Asn

115 120 125115 120 125

<210> 20<210> 20

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 20<400> 20

Gly Ser Asp Leu Asp Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly GlnGly Ser Asp Leu Asp Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly Gln

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 21<210> 21

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 21<400> 21

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp Val AsnHis Leu Glu Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp Val Asn

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 22<210> 22

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 22<400> 22

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 23<210> 23

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 23<400> 23

1 5 10 151 5 10 15

Asp Asp Glu Val Arg Ile Leu Leu Ala Ala Gly Ala Asp Val Asn AlaAsp Asp Glu Val Arg Ile Leu Leu Ala Ala Gly Ala Asp Val Asn Ala

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 24<210> 24

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 24<400> 24

1 5 10 151 5 10 15

Asp Asp Glu Val Arg Ile Leu Met Ala Ser Gly Ala Asp Val Asn AlaAsp Asp Glu Val Arg Ile Leu Met Ala Ser Gly Ala Asp Val Asn Ala

20 25 3020 25 30

35 40 4535 40 45

His Leu Lys Ile Val Glu Val Leu Leu Lys Tyr Gly Ala Asp Val AspHis Leu Lys Ile Val Glu Val Leu Leu Lys Tyr Gly Ala Asp Val Asp

50 55 6050 55 60

65 70 75 8065 70 75 80

Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Asp Ala Asp ValGly His Leu Glu Ile Val Glu Val Leu Leu Lys His Asp Ala Asp Val

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 25<210> 25

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 25<400> 25

Gly Ser Asp Leu Gly Lys Glu Leu Leu Glu Ala Ala Arg Ala Gly GlnGly Ser Asp Leu Gly Lys Glu Leu Leu Glu Ala Ala Arg Ala Gly Gln

1 5 10 151 5 10 15

20 25 3020 25 30

Ala Asp Tyr Phe Gly His Thr Pro Leu His Leu Ala Ala Arg Asn GlyAla Asp Tyr Phe Gly His Thr Pro Leu His Leu Ala Ala Arg Asn Gly

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

Asn Ala Gln Asp Ile Phe Gly Lys Thr Ala Phe Glu Ile Ser Ile AspAsn Ala Gln Asp Ile Phe Gly Lys Thr Ala Phe Glu Ile Ser Ile Asp

100 105 110100 105 110

115 120 125115 120 125

<210> 26<210> 26

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 26<400> 26

1 5 10 151 5 10 15

Asp Asp Glu Val Arg Phe Leu Met Ala Asn Gly Ala Asp Val Asn AlaAsp Asp Glu Val Arg Phe Leu Met Ala Asn Gly Ala Asp Val Asn Ala

20 25 3020 25 30

Ala Asp Glu Arg Gly Thr Thr Pro Leu His Leu Ala Ala Val Tyr GlyAla Asp Glu Arg Gly Thr Thr Pro Leu His Leu Ala Ala Val Tyr Gly

35 40 4535 40 45

His Leu Glu Ile Val Glu Val Leu Leu Lys Asn Gly Ala Asp Val AsnHis Leu Glu Ile Val Glu Val Leu Leu Lys Asn Gly Ala Asp Val Asn

50 55 6050 55 60

Ala Gln Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser SerAla Gln Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser Ser

65 70 75 8065 70 75 80

Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Ser Ala Asp ValGly His Leu Glu Ile Val Glu Val Leu Leu Lys His Ser Ala Asp Val

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 27<210> 27

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 27<400> 27

1 5 10 151 5 10 15

20 25 3020 25 30

Lys Asp Glu Arg Gly Thr Thr Pro Leu His Leu Ala Ala Val Tyr GlyLys Asp Glu Arg Gly Thr Thr Pro Leu His Leu Ala Ala Val Tyr Gly

35 40 4535 40 45

50 55 6050 55 60

Ala Lys Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser SerAla Lys Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser Ser

65 70 75 8065 70 75 80

Gly His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp ValGly His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp Val

85 90 9585 90 95

Asn Ala Gln Asp Lys Ser Gly Lys Thr Pro Ala Asp Leu Ala Ala AspAsn Ala Gln Asp Lys Ser Gly Lys Thr Pro Ala Asp Leu Ala Ala Asp

100 105 110100 105 110

115 120 125115 120 125

<210> 28<210> 28

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 28<400> 28

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 29<210> 29

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 29<400> 29

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 30<210> 30

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 30<400> 30

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

Gly His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Ser Ala Asp ValGly His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Ser Ala Asp Val

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 31<210> 31

<211> 157<211> 157

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 31<400> 31

1 5 10 151 5 10 15

20 25 3020 25 30

Val Asp Ile Trp Gly Asn Thr Pro Leu His Leu Ala Ala Asn Glu GlyVal Asp Ile Trp Gly Asn Thr Pro Leu His Leu Ala Ala Asn Glu Gly

35 40 4535 40 45

His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala Asp Val AsnHis Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala Asp Val Asn

50 55 6050 55 60

Ala Leu Asp His Trp Gly Asp Thr Pro Leu His Leu Ala Ala Met TrpAla Leu Asp His Trp Gly Asp Thr Pro Leu His Leu Ala Ala Met Trp

65 70 75 8065 70 75 80

85 90 9585 90 95

Asn Ala Leu Asp Asn Asn Gly Phe Thr Pro Leu His Leu Gly Tyr GlyAsn Ala Leu Asp Asn Asn Gly Phe Thr Pro Leu His Leu Gly Tyr Gly

100 105 110100 105 110

115 120 125115 120 125

Ala Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile Asp AsnAla Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile Asp Asn

130 135 140130 135 140

Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu AsnGly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu Asn

145 150 155145 150 155

<210> 32<210> 32

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 32<400> 32

1 5 10 151 5 10 15

20 25 3020 25 30

Arg Asp Ser Thr Gly Trp Thr Pro Leu His Leu Ala Ala Pro Trp GlyArg Asp Ser Thr Gly Trp Thr Pro Leu His Leu Ala Ala Pro Trp Gly

35 40 4535 40 45

His Pro Glu Ile Val Glu Val Leu Leu Lys Asn Gly Ala Asp Val AsnHis Pro Glu Ile Val Glu Val Leu Leu Lys Asn Gly Ala Asp Val Asn

50 55 6050 55 60

Ala Ala Asp Phe Gln Gly Trp Thr Pro Leu His Leu Ala Ala Ala ValAla Ala Asp Phe Gln Gly Trp Thr Pro Leu His Leu Ala Ala Ala Val

65 70 75 8065 70 75 80

Gly His Leu Glu Ile Val Glu Val Leu Leu Lys Tyr Gly Ala Asp ValGly His Leu Glu Ile Val Glu Val Leu Leu Lys Tyr Gly Ala Asp Val

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 33<210> 33

<211> 271<211> 271

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 33<400> 33

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

Ala Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu Gly Gly GlyAla Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu Gly Gly Gly

115 120 125115 120 125

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly GlyGly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly

130 135 140130 135 140

Ser Arg Ser Asp Leu Gly Lys Lys Leu Leu Glu Ala Ala Arg Ala GlySer Arg Ser Asp Leu Gly Lys Lys Leu Leu Glu Ala Ala Arg Ala Gly

145 150 155 160145 150 155 160

Gln Asp Asp Glu Val Arg Ile Leu Met Ala Asn Gly Ala Asp Val AsnGln Asp Asp Glu Val Arg Ile Leu Met Ala Asn Gly Ala Asp Val Asn

165 170 175165 170 175

Ala Lys Asp Lys Asp Gly Tyr Thr Pro Leu His Leu Ala Ala Arg GluAla Lys Asp Lys Asp Gly Tyr Thr Pro Leu His Leu Ala Ala Arg Glu

180 185 190180 185 190

195 200 205195 200 205

Asn Ala Lys Asp Lys Asp Gly Tyr Thr Pro Leu His Leu Ala Ala ArgAsn Ala Lys Asp Lys Asp Gly Tyr Thr Pro Leu His Leu Ala Ala Arg

210 215 220210 215 220

Glu Gly His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Gly Ala AspGlu Gly His Leu Glu Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp

225 230 235 240225 230 235 240

Val Asn Ala Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser IleVal Asn Ala Gln Asp Lys Phe Gly Lys Thr Ala Phe Asp Ile Ser Ile

245 250 255245 250 255

Asp Asn Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu AsnAsp Asn Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu Asn

260 265 270260 265 270

<210> 34<210> 34

<211> 271<211> 271

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 34<400> 34

1 5 10 151 5 10 15

20 25 3020 25 30

Ala Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asn GlyAla Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asn Gly

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

Asn Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu Gly Gly GlyAsn Gly Asn Glu Asp Leu Ala Glu Ile Leu Gln Lys Leu Gly Gly Gly

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

<210> 35<210> 35

<211> 271<211> 271

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 35<400> 35

1 5 10 151 5 10 15

20 25 3020 25 30

Lys Asp Lys Asp Gly Tyr Thr Pro Leu His Leu Ala Ala Arg Glu GlyLys Asp Lys Asp Gly Tyr Thr Pro Leu His Leu Ala Ala Arg Glu Gly

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

Gln Asp Asp Glu Val Arg Ile Leu Leu Ala Ala Gly Ala Asp Val AsnGln Asp Asp Glu Val Arg Ile Leu Leu Ala Ala Gly Ala Asp Val Asn

165 170 175165 170 175

Ala Lys Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg AsnAla Lys Asp Tyr Phe Ser His Thr Pro Leu His Leu Ala Ala Arg Asn

180 185 190180 185 190

Gly His Leu Lys Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp ValGly His Leu Lys Ile Val Glu Val Leu Leu Lys Ala Gly Ala Asp Val

195 200 205195 200 205

Asn Ala Lys Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala AsnAsn Ala Lys Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn

210 215 220210 215 220

Asp Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala AspAsp Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala Asp

225 230 235 240225 230 235 240

Val Asn Ala Gln Asp Ile Phe Gly Lys Thr Pro Ala Asp Ile Ala AlaVal Asn Ala Gln Asp Ile Phe Gly Lys Thr Pro Ala Asp Ile Ala Ala

245 250 255245 250 255

Asp Ala Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu AsnAsp Ala Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu Asn

260 265 270260 265 270

<210> 36<210> 36

<211> 302<211> 302

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 36<400> 36

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

Ala Val Asp Ile Trp Gly Asn Thr Pro Leu His Leu Ala Ala Asn GluAla Val Asp Ile Trp Gly Asn Thr Pro Leu His Leu Ala Ala Asn Glu

180 185 190180 185 190

195 200 205195 200 205

Asn Ala Leu Asp His Trp Gly Asp Thr Pro Leu His Leu Ala Ala MetAsn Ala Leu Asp His Trp Gly Asp Thr Pro Leu His Leu Ala Ala Met

210 215 220210 215 220

Trp Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala AspTrp Gly His Leu Glu Ile Val Glu Val Leu Leu Lys His Gly Ala Asp

225 230 235 240225 230 235 240

Val Asn Ala Leu Asp Asn Asn Gly Phe Thr Pro Leu His Leu Gly TyrVal Asn Ala Leu Asp Asn Asn Gly Phe Thr Pro Leu His Leu Gly Tyr

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

<210> 37<210> 37

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 37<400> 37

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 38<210> 38

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 38<400> 38

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

Asn Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu AsnAsn Gly His Glu Asp Ile Ala Glu Val Leu Gln Lys Leu Asn

115 120 125115 120 125

<210> 39<210> 39

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 39<400> 39

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 40<210> 40

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 40<400> 40

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

Asn Ala Gln Asp Lys Ser Gly Lys Thr Pro Ala Asp Ile Ala Ala AspAsn Ala Gln Asp Lys Ser Gly Lys Thr Pro Ala Asp Ile Ala Ala Asp

100 105 110100 105 110

115 120 125115 120 125

<210> 41<210> 41

<211> 103<211> 103

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 41<400> 41

Met Arg Gly Ser His His His His His His Gly Ser Asp Leu Gly LysMet Arg Gly Ser His His His His His Gly Ser Asp Leu Gly Lys

1 5 10 151 5 10 15

Lys Leu Leu Glu Ala Ala Arg Ala Gly Gln Asp Asp Glu Val Arg IleLys Leu Leu Glu Ala Ala Arg Ala Gly Gln Asp Asp Glu Val Arg Ile

20 25 3020 25 30

Leu Met Ala Asn Gly Ala Asp Val Asn Ala Lys Asp Lys Asp Gly TyrLeu Met Ala Asn Gly Ala Asp Val Asn Ala Lys Asp Lys Asp Gly Tyr

35 40 4535 40 45

Thr Pro Leu His Leu Ala Ala Arg Glu Gly His Leu Glu Ile Val GluThr Pro Leu His Leu Ala Ala Arg Glu Gly His Leu Glu Ile Val Glu

50 55 6050 55 60

Val Leu Leu Lys Ala Gly Ala Asp Val Asn Ala Gln Asp Lys Phe GlyVal Leu Leu Lys Ala Gly Ala Asp Val Asn Ala Gln Asp Lys Phe Gly

65 70 75 8065 70 75 80

Lys Thr Ala Phe Asp Ile Ser Ile Asp Asn Gly Asn Glu Asp Leu AlaLys Thr Ala Phe Asp Ile Ser Ile Asp Asn Gly Asn Glu Asp Leu Ala

85 90 9585 90 95

Glu Ile Leu Gln Lys Leu AsnGlu Ile Leu Gln Lys Leu Asn

100100

<210> 42<210> 42

<211> 103<211> 103

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 42<400> 42

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

Val Leu Leu Lys Ala Gly Ala Asp Val Asn Ala Gln Asp Lys Ser GlyVal Leu Leu Lys Ala Gly Ala Asp Val Asn Ala Gln Asp Lys Ser Gly

65 70 75 8065 70 75 80

Lys Thr Pro Ala Asp Leu Ala Ala Asp Asn Gly His Glu Asp Ile AlaLys Thr Pro Ala Asp Leu Ala Ala Asp Asn Gly His Glu Asp Ile Ala

85 90 9585 90 95

Glu Val Leu Gln Lys Ala AlaGlu Val Leu Gln Lys Ala Ala

100100

<210> 43<210> 43

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 43<400> 43

1 5 10 151 5 10 15

Asp Asp Glu Val Arg Ile Leu Leu Lys Ala Gly Ala Asp Val Asn AlaAsp Asp Glu Val Arg Ile Leu Leu Lys Ala Gly Ala Asp Val Asn Ala

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 44<210> 44

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 44<400> 44

1 5 10 151 5 10 15

Asp Asp Glu Val Arg Glu Leu Leu Lys Ala Gly Ala Asp Val Asn AlaAsp Asp Glu Val Arg Glu Leu Leu Lys Ala Gly Ala Asp Val Asn Ala

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 45<210> 45

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 45<400> 45

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 46<210> 46

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 46<400> 46

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 47<210> 47

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 47<400> 47

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 48<210> 48

<211> 126<211> 126

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 48<400> 48

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

<210> 49<210> 49

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 49<400> 49

1 5 10 151 5 10 15

20 25 3020 25 30

AlaAla

<210> 50<210> 50

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 50<400> 50

Lys Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn Asp GlyLys Asp Phe Ala Gly Lys Thr Pro Leu His Leu Ala Ala Asn Asp Gly

1 5 10 151 5 10 15

20 25 3020 25 30

AlaAla

<210> 51<210> 51

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 51<400> 51

1 5 10 151 5 10 15

20 25 3020 25 30

AlaAla

<210> 52<210> 52

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<400> 52<400> 52

Lys Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser Ser GlyLys Asn Glu Thr Gly Tyr Thr Pro Leu His Leu Ala Asp Ser Ser Gly

1 5 10 151 5 10 15

His Leu Glu Ile Val Glu Val Leu Leu Lys His Ser Ala Asp Val AsnHis Leu Glu Ile Val Glu Val Leu Leu Lys His Ser Ala Asp Val Asn

20 25 3020 25 30

AlaAla

<210> 53<210> 53

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (3)..(6)<222> (3)..(6)

<220><220>

<221> misc_feature<221> misc_feature

<222> (14)..(15)<222> (14)..(15)

<220><220>

<221> misc_feature<221> misc_feature

<222> (19)..(19)<222> (19)..(19)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(27)<222> (27)..(27)

<400> 53<400> 53

Xaa Asp Xaa Xaa Xaa Xaa Thr Pro Leu His Leu Ala Ala Xaa Xaa GlyXaa Asp Xaa Xaa Xaa Xaa Thr Pro Leu His Leu Ala Ala Xaa Xaa Gly

1 5 10 151 5 10 15

20 25 3020 25 30

AlaAla

<210> 54<210> 54

<211> 33<211> 33

<212> PRT<212> PRT

<213> 人工序列<213> Artificial sequence

<220><220>

<223> 合成的构建体<223> Synthetic constructs

<220><220>

<221> misc_feature<221> misc_feature

<222> (1)..(1)<222> (1)..(1)

<220><220>

<221> misc_feature<221> misc_feature

<222> (4)..(4)<222> (4)..(4)

<220><220>

<221> misc_feature<221> misc_feature

<222> (6)..(6)<222> (6)..(6)

<220><220>

<221> misc_feature<221> misc_feature

<222> (14)..(15)<222> (14)..(15)

<220><220>

<221> misc_feature<221> misc_feature

<222> (27)..(27)<222> (27)..(27)

<400> 54<400> 54

Xaa Asp Phe Xaa Gly Xaa Thr Pro Leu His Leu Ala Ala Xaa Xaa GlyXaa Asp Phe Xaa Gly Xaa Thr Pro Leu His Leu Ala Ala Xaa Xaa Gly

1 5 10 151 5 10 15

20 25 3020 25 30

AlaAla

Claims

1. A binding protein comprising at least one ankyrin repeat domain, wherein the ankyrin repeat domain is specific for binding to mammalian serum albumin, and wherein the ankyrin repeat domain comprises an ankyrin repeat module consisting of an amino acid sequence selected from the following: SEQ ID NO: 49, 50, 51 and 52.

2. A binding protein comprising at least one ankyrin repeating domain, wherein the repeating domain is specific for binding to mammalian serum albumin, and wherein the ankyrin repeating domain is selected from SEQ ID NO: 17-31 and 43-48.

3. The binding protein of claim 1 or 2, wherein the ankyrin repeat domain has a terminal plasma half-life in mammals that is at least 5 times longer than that of the ankyrin repeat domain of SEQ ID NO:32.

4. The binding protein of claim 1 or 2, wherein the ankyrin repeating domain has a terminal plasma half-life of at least 1 day in humans.

5. The binding protein of claim 1 or 2, wherein the binding protein further comprises a bioactive compound, and wherein the terminal plasma half-life of the binding protein in mammals is at least twice as long as the terminal plasma half-life of the bioactive compound alone, wherein the at least twice as long terminal plasma half-life is conferred by the ankyrin repeating domain of the binding protein.

6. The binding protein of claim 1 or 2, wherein the ankyrin repeat domain competes with the ankyrin repeat domain selected from SEQ ID NO: 17-31 and 43-48 for binding to mammalian serum albumin.

7. The binding protein of claim 1, wherein the ankyrin repeat module comprises the amino acid sequence of SEQ ID NO:50.

8. The binding protein of claim 1, wherein the ankyrin repeat module consists of the amino acid sequence of SEQ ID NO:49.

9. The binding protein of claim 8, wherein the ankyrin repeating domain further comprises ankyrin repeating modules consisting of an amino acid sequence selected from SEQ ID NO:50, and such sequences wherein up to two amino acids in SEQ ID NO:50 are replaced by other amino acids.

10. The binding protein of claim 2, wherein the ankyrin repeat domain is the ankyrin repeat domain of SEQ ID NO: 46.

11. The binding protein of any one of claims 1, 2 and 7-10, wherein the ankyrin repeating domain has a dissociation constant of less than ^10⁻⁷ M with human serum albumin in PBS.

12. A nucleic acid encoding the binding protein as described in any one of claims 1-11.

13. A pharmaceutical composition comprising a binding protein as described in any one of claims 1-11 or a nucleic acid as described in claim 12, and optionally a pharmaceutically acceptable carrier and/or diluent.