[go: up one dir, main page]

WO1997038710A1 - Synthesis of human procollagens and collagens in recombinant dna systems - Google Patents

Synthesis of human procollagens and collagens in recombinant dna systems Download PDF

Info

Publication number
WO1997038710A1
WO1997038710A1 PCT/US1997/007300 US9707300W WO9738710A1 WO 1997038710 A1 WO1997038710 A1 WO 1997038710A1 US 9707300 W US9707300 W US 9707300W WO 9738710 A1 WO9738710 A1 WO 9738710A1
Authority
WO
WIPO (PCT)
Prior art keywords
collagen
subunit
hydroxylase
cells
prolyl
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US1997/007300
Other languages
French (fr)
Other versions
WO1997038710A9 (en
Inventor
Kari I. Kivirikki
Taina Pihlajaniemi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ACADEMY OF FINLAND
Fibrogen Inc
Original Assignee
ACADEMY OF FINLAND
Fibrogen Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ACADEMY OF FINLAND, Fibrogen Inc filed Critical ACADEMY OF FINLAND
Priority to JP9537462A priority Critical patent/JP2000508544A/en
Priority to AU30566/97A priority patent/AU3056697A/en
Publication of WO1997038710A1 publication Critical patent/WO1997038710A1/en
Publication of WO1997038710A9 publication Critical patent/WO1997038710A9/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/78Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2799/00Uses of viruses
    • C12N2799/02Uses of viruses as vector
    • C12N2799/021Uses of viruses as vector for the expression of a heterologous nucleic acid
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2799/00Uses of viruses
    • C12N2799/02Uses of viruses as vector
    • C12N2799/021Uses of viruses as vector for the expression of a heterologous nucleic acid
    • C12N2799/026Uses of viruses as vector for the expression of a heterologous nucleic acid where the vector is derived from a baculovirus

Definitions

  • the present invention is directed to the recombinant production of procollagen, collagen and fragments thereof.
  • the ExtraCeUular Matrix The most abundant component of the extracellular matrix is collagen. Collagen molecules are generally the result of the trimeric assembly of three polypeptide chains contaimng, in their primary sequence,
  • the three polypeptide chains comprising collagen undergo various post-translational 5 modifications which permit the formation of these triple helical domains (Van der Rest et al. , Adv. Mol. Cell Biol. 6: 1-67 (1993)).
  • the proline residues of collagen are hydroxylated into 4- hydroxyproline, thereby allowing for the formation of interchain hydrogen bonds by the enzyme prolyl 4-hydroxylase (Kivirikko e*t al. , Post-translational modifications of proteins (Harding, J. J., Crabbe, M. J. C, eds) pp. 1-51 , CRC Press, Boca Raton, FL (1992)).
  • the triple-helical molecule is then further processed to render collagens.
  • the N-propeptide and C-propeptide comprising the collagen precursor molecule
  • procollagen are cleaved during post-translational events by the enzymes N-proteinase and C-proteinase, respectively.
  • collagen can contribute significantly to the
  • Collagen Types Nineteen distinct collagen types have been identified in vertebrates, including bovine, ovine, porcine, chicken and human collagens. These collagen types are numbered by Roman numerals and the chains found in each collagen type are identified with Arabic numerals. A detailed description of structure 25 and biological functions of the various different types of naturally occurring collagens can be found, among other places, in Ayad et al. , The Extracellular Matrix Facts
  • Type I collagen is the major fibrillar collagen of bone and skin.
  • Type I collagen is a heterotrimeric molecule comprising two ⁇ l(I) chains and one ⁇ 2(I) chain. Details on preparing purified type I collagen can be found, among other places, in Miller et al. , Methods In Enzymology 82:33-64 (1982), Academic Press.
  • Type II collagen is a homotrimeric collagen comprising three identical ⁇ (II) chains.
  • Purified Type II collagen may be prepared from tissues by, among other methods, the procedure described in Miller et al. , Methods In Enzymology. 82:33-64 (1982), Academic Press.
  • Type III collagen is a major fibrillar collagen found in skin and vascular tissues.
  • Type III collagen is a homotrimeric collagen comprising three identical ⁇ (III) chains. Methods for purifying type III collagen from tissues can be found in, among other places, Byers et al. , Biochemistry 13_: 5243-5248 (1974) and Miller et al. , Methods in Enzymology 82:33-64 (1982), Academic Press.
  • Type IV collagen is found in basement membranes in the form of a sheet rather than fibrils.
  • the most common form of type IV collagen contains two ⁇ l(IV) chains and one ⁇ 2(IV) chain.
  • the particular chains comprising type IV collagen are tissue-specific.
  • Type IV collagen may be purified by, among other methods, the procedures described in Furuto et al. , Methods in Enzymology 144:41-61 (1987), Academic Press.
  • Type V collagen is a fibrillar collagen found in, primarily, bones, tendon, cornea, skin, and blood vessels. Type V collagen exists in both homotrimeric and heterotrimeric forms.
  • One type of type V collagen is a heterotrimer of two ⁇ l(V) chains and ⁇ 2(V).
  • Another type of type V collagen is a heterotrimer of ⁇ l(V), c ⁇ 2(V), and ⁇ 3(V).
  • Yet another type of type V collagen is a homotrimer of ⁇ l(V).
  • Type VI collagen has a small triple helical region and two large non-collagenous remainder portions.
  • Type VI collagen is a heterotrimer comprising ⁇ l(VI), ⁇ 2(VI), and ⁇ 3(VI) chains.
  • Type VI collagen is found in many connective tissues. Descriptions of how to purify type VI collagen from natural sources can be found, among other places, in Wu et al. , Biochem. J.
  • Type VII collagen is a fibrillar collagen found in particular epithelial tissues.
  • Type VII is a homotrimeric molecule of three ⁇ l(VII) chains. Descriptions of how to purify type VII collagen from tissue can be found in, among other places, Lundstrom et al., J. Biol. Chem. 261:9042-9048 (1986), and Bentz et al , Proc. Natl. Acad. Sci. USA 80:3168-3172 (1983).
  • Type VIII collagen can be found in Descemet's membrane in the cornea.
  • Type VIII collagen is a heterotrimer comprising two ⁇ l(VIII) chains and one ⁇ 2(VIII) chain, although other chain compositions have been reported.
  • Methods for the purification of type VIII collagen from nature can be found, among other places, in Benya et al. , J. Biol. Chem. 261:4160-4169 (1986), and Kapoor et al., Biochemistry 25:3930-3937 (1986).
  • Type IX collagen is a fibril associated collagen which can be found in cartilage and vitreous humor.
  • Type IX collagen is a heterotrimeric molecule comprising ⁇ l(IX), ⁇ 2(IX), and ⁇ 3(IX) chains.
  • Procedures for purifying type IX collagen can be found, among other places, in Duance et al., Biochem. J. 221 :885-889 (1984), Ayad et al. , Biochem. J. 262:753-761 (1989), Grant et al., The Control of Tissue Damage. Glauert, A. M. , Ed. , El Sevier, Amsterdam, pp. 3-28 (1988).
  • Type X collagen is a homotrimeric compound of ⁇ l(X) chains. Type X collagen has been isolated from, among other tissues, hypertrophic cartilage found in growth plates.
  • Type XI collagen can be found in cartilaginous tissues associated with type II and type IX collagens, as well as other locations in the body.
  • Type XI collagen is a heterotrimeric molecule comprising ⁇ l(XI), ⁇ 2(XI), and ⁇ 3(XI) chains. Methods for purifying type XI collagen can be found, among other places, in Grant et al., In The Control of Tissue Damage. Glauert, A. M., ed., El Savier, Amsterdam, pp.3-28 (1988).
  • Type XII collagen is a fibril associated collagen found primarily associated with type I collagen.
  • Type XII collagen is a homotrimeric molecule comprising three ⁇ l(XII) chains. Methods for purifying type XII collagen and variants thereof can be found, among other places, in Dublet et al , J. Biol. Chem. 264:13150-13156 (1989), Lundstrum et al. , J. Biol. Chem. 267:20087-20092 (1992), Watt et al., L Biol. Chem. 267:20093-20099 (1992).
  • Type XIII is a non-fibrillar collagen found, among other places, in skin, intestine, bone, cartilage, and striated muscle. A detailed description of the type
  • XIII collagen may be found, among other places, in Juvonen et al. J. Biol. Chem. 267:24700-24707 (1992).
  • Type XIV is a fibril associated collagen.
  • Type XIV collagen is a homotrimeric molecule comprising three ⁇ l(XIV) chains.
  • XIV collagen can be found, among other places, in Aubert-Foucher et al., J. Biol. Chem. 266:19759-19764 (1992) and Watt et al. , J. Biol. Chem. 267:20093-20099 (1992).
  • Type XV collagen is homologous in structure to type XVIII collagen.
  • Type XVI collagen is a fibril associated collagen, found in skin, lung fibrobiast, keratinocytes, and elsewhere. Information on the structure of type XVI collagen and the gene encoding type XVI can be found, among elsewhere, in Pan et al., Proc. Natl. Acad. Sci. USA 1989:6565-6569 (1992), and Yamaguchi et al , L 25
  • Type XVII collagen is a hemidesmosal transmembrane collagen. Information on the structure of type XVII collagen and the gene encoding type XVII collagen can be found, among elsewhere, in Li et al , J. Biol. Chem. 268(12):8825-8834 (1993),
  • Type XVIII collagen is similar in structure to type XV collagen and can be isolated from the liver. Descriptions of the structures and isolation of type XVIII collagen from natural sources can be found, among other places, in Rehn et al. ,
  • Type XIX collagen' s gene structure classify it as another member of the
  • Type XIX mRNA was recently isolated from rhabdomyosarcoma cell. Descriptions of the structures and isolation of type XIX collagen can be found, among other places, in Inoguchi et al. , J. Biochem. 117: 137-146 (1995), Yoshioka et al , Genomics 13:884-886 (1992), Myers et al , L Biol. Chem. 289: 18549-18557 (1994).
  • Prolyl 4-hydroxylase is an important post-translational enzyme necessary for the synthesis of procollagen or collagen by cells. The enzyme is required to hydroxy late prolyl residues in the Y-position of the repeating -Gly-X-Y- sequences to 4-hydroxyproline. Prockop et al , N. Engl. J. Med. 3Jl:376-386 (1984). Unless an appropriate number of Y-position prolyl residues are hydroxy lated to 4-hydroxyproline by prolyl 4- hydroxylase, the newly synthesized chains cannot fold into a triple-helical conformation at 37°C. Moreover, if the hydroxylation does not occur, the polypeptides remain non-helical, are poorly secreted by cells, and cannot self- assemble into collagen fibrils.
  • Prolyl-4-hydroxylase from vertebrates is an al ⁇ 2 tetramer. Berg et al. , L .
  • the a subunits "63 kDa) contain the catalytic sites involved in the hydroxylation of prolyl residues but are insoluble in the absence of ⁇ subunits.
  • the ⁇ subunits ("55 kDa) were found to be identical to the protein disulfide isomerase, which catalyzes thiol/disulfide interchange in a protein substrate, leading to the formation of the set of disulfide bonds which permit establishment of the most stable state of the protein.
  • the ⁇ subunits retain 50% of protein disulfide isomerase activity when part of the prolyl-4-hydroxylase tetramer. Pihlajaniemi et al. , Embo J.
  • N-proteinase N-proteinase, lysyl oxidase, and lysyl hydroxylase.
  • the present invention comprises the expression of at least one nucleic acid sequence encoding a collagen chain, and at least one nucleic acid sequence encoding a collagen post-translational enzyme.
  • the present invention provides for methods of expressing at least a single procoUagen or collagen gene (or other nucleic acid molecule) or a number of different procoUagen or collagen genes (or other nucleic acid molecule) within a cell. Further, it is contemplated that there can be one or more copies of a single procoUagen or collagen gene (or other nucleic acid molecule) or of the number of different such genes introduced into cells (/. e. , transformation or transduction) and expressed.
  • the present invention provides that these cells can be transformed or transfected with nucleic acids encoding collagen and enzymes that modify collagen so that they express at least one procoUagen or collagen chain, preferably human, that will assemble into a homotrimer or heterotrimer procoUagen or collagen.
  • the method utilizes a procoUagen or collagen gene (or other nucleic acid molecule) transfected into and expressed within cells which are a mutant, variant, hybrid or recombinant gene (or other nucleic acid molecule).
  • a mutant, variant, hybrid or recombinant gene may include, for example, a mutation which provides unique restriction sites for cleavage of the hybrid gene.
  • such mutations provide one or more unique restriction sites do not alter the amino acid sequence encoded by the nucleic acid molecule, but merely provide unique restriction sites useful for manipulation of the molecule.
  • the modified molecule would be made up of a number of discrete regions, or D-regions, flanked by unique restriction sites. These discrete regions of the molecule are herein referred to as cassettes.
  • cassettes designated as Dl through D4.4 are shown in Figure 4.
  • Molecules formed of multiple copies of a cassette are another variant of the present gene which is encompassed by the present invention.
  • Recombinant or mutant nucleic acid molecules or cassettes which provide desired characteristics such as resistance to endogenous enzymes such as collagenase are also encompassed by the present invention.
  • a novel feature of the methods of the invention is that relatively large amounts of a human procoUagen or collagen can be synthesized in a recombinant cell culture system that does not make any other procoUagen or collagen.
  • Systems that make other procoUagens or collagens are preferred because of the extreme difficulty of separating the product of the endogenous genes for procoUagen or collagen from recombinant collagen products.
  • purification of procoUagen including human, bovine, porcine, chicken and other mammalian collagens, is greatly facilitated.
  • the amounts of protein synthesized by the methods of the present invention are high relative to other systems used in the art.
  • procoUagens synthesized are correctly folded proteins so that they exhibit the normal triple-helical conformation characteristic of procoUagens and collagens. Therefore, the procoUagens can be used to generate stable collagen by cleavage of the procoUagens with proteases.
  • the present invention provides methods for the production of procoUagens or collagens derived solely from transformed or transfected procoUagen and collagen genes, such methods are not limited, however, to the production of procoUagen and collagen derived solely from transformed or transfected genes.
  • Vectors are also directed to vectors and plasmids used in the methods of the invention.
  • Such vectors and/or plasmids are comprised of the nucleic acid sequence encoding the desired procoUagens and collagens and necessary promoters, and other sequences necessary for the proper expression of such procoUagens and collagens.
  • the vectors and plasmids of the present invention further include at least one sequence encoding one or more post-translational enzymes.
  • the present invention further comprises cells in which a procoUagen or collagen, either alone or in combination with one or more post translational enzymes, is expressed both as mRNA and as a protein.
  • a procoUagen or collagen is expressed in mammalian cells, insect cells, or yeast cells.
  • other cells including plant cells and algae, can be manufactured.
  • cells such as mammalian, insect and yeast cells, which may not naturally produce sufficient amounts of post- translational enzymes, are transformed with at least one set of genes coding for a post-translational enzyme, such as prolyl 4-hydroxylase, C-proteinase, N-proteinase, lysyl oxidase or lysyl hydroxylase.
  • a post-translational enzyme such as prolyl 4-hydroxylase, C-proteinase, N-proteinase, lysyl oxidase or lysyl hydroxylase.
  • Polypeptides comprising the recombinant polypeptides expressed according to the methods of the present invention, including fusion products produced from chimeric genes wherein, for example, relevant epitopes of collagen or procoUagen can be manufactured for therapeutic and other uses.
  • the polypeptides of the present invention further include deglycosolated, unglycosolated and partially glycosolated collagens and procoUagens.
  • recombinant collagens of the present invention will not produce allergic responses in the mammals to which they are administered provided that the recombinant collagen is manufactured utilizing the nucleic acid sequence encoding such mammal's native collagen .
  • the recombinant collagen is manufactured utilizing the nucleic acid sequence encoding such mammal's native collagen .
  • collagen of the present invention prepared from cultured cells should be of a higher quality than collagen obtained from animal sources, and should form larger and more tightly packed proteins.
  • Figure 1 is a photograph showing analysis by polyacrylamide gel electrophoresis in SDS of the proteins secreted into medium by HT-1080 cells that were transfected with a gene construct containing the promoter, first exon and most of the first intron of the human COLI A 1 gene linked to 30 kb fragment containing all of COL2A1 except the first two exons.
  • Figure 2 is a photograph evidencing the secretion type II procoUagen into the medium from cells described in Figure 1 was folded into a correct native conformation.
  • Figure 3 is a photograph showing analysis of medium of HT-1080 cells co-transfected with a gene for COLI A 1 and a gene for COL1A2.
  • Figure 4 is a schematic representation of the cDNA for the pro ⁇ l(l) chain of human type I procoUagen that has been modified to contain artificial sites for cleavage by specific restriction endonucleases.
  • Figure 5 is a photograph showing analysis by nondenaturing 7.5% polyacrylamide gel electrophoresis (lanes 1-3) and 10% polyacrylamide gel electrophoresis in SDS (lanes 4-6) of purified chick prolyl 4-hydroxylase (lanes 1 and 4) and the proteins secreted into medium by Sf9 cells expressing the gene for the a-subunit and the B- subunit of human prolyl 4-hydroxylase and infected with a58/B virus (lanes 2 and 5) or with a59/B virus (lanes 3 and 6). a58/B and a59/B differ by a stretch of 64 base pairs.
  • Figure 6 is a gel showing the expression of recombinant human type III procoUagen in Sf9 and High Five cells.
  • Figure 7 is a gel showing the expression of recombinant human type I procoUagen in insect cells, analyzed on a silver stained, 5% SDS-PAGE gel.
  • Lane 1 is a pepsin digested sample from cells expressing only the pro cd chain of type I procoUagen.
  • Lane 2 is a pepsin digested sample from cells coexpressing pro ⁇ l and pro ⁇ 2 chains of type I procoUagen.
  • Figure 8 is a gel showing the expression of recombinant human type II procoUagen in insect cells, analyzed on a coomassie stained 5% SDS-PAGE gel.
  • Figure 9 is an SDS-PAGE analysis under reducing and nonreducing conditions of purified type III collagen. The gel was stained with Coomassie
  • Figure 10 is a non-reducing SDS-PAGE analysis of trimer formation of the pro ⁇ l (III) chains expressed in High Five insect cells. The samples were electrophoresed on 5% SDS-PAGE under nonreducing conditions and analyzed by Coomassie staining. Lane 1, molecular weight markers; lane 2, cell extract; lane 3, cell extract digested with pepsin; lane 4, proteins soluble in 1 % SDS. The positions of the trimeric pro ⁇ l (III) and ⁇ l (III) chains are shown by arrows.
  • Figure 11 is an analysis of the thermal stability of the recombinant human type III collagen produced in insect cells by a brief protease digestion.
  • collagen refers to any one of the collagen types I-XIX, as well as any novel collagens produced according to the methods of this invention.
  • the term also encompasses both procoUagen and mature collagen assembled as hetero- and homo-trimers, and any single chain polypeptides of procoUagen or 5 collagen for any of the collagen types, and any heterotrimers of any combination of the collagen constructs of the invention.
  • collagen is meant to encompasses all of the foregoing, unless the context dictates otherwise.
  • procoUagen refers to any one of the collagen types I-XIX, as well ° as any novel collagens produced by this invention, that possess additional C-terminal and/or N-terminal peptides that assist in trimer assembly, solubility, purification or other function, and then are subsequently cleaved by N-proteinase, C-proteinase or other proteins.
  • collagen subunit refers to the amino acid sequence of one polypeptide chain of a collagen protein encoded by a single gene, as well as derivatives, including deletion derivatives, conservative substitutions, etc.
  • a “fusion protein” is a protein in which peptide sequences from different proteins are covalently linked together.
  • collagen post-translational enzyme refers to any enzyme that modifies a procoUagen, collagen, or components comprising a collagen molecule, and encompasses, but is not limited to, prolyl-4-hydroxylase, C-proteinase, N-proteinase, lysyl hydroxylase, and lysyl oxidase.
  • prolyl-4-hydroxylase C-proteinase
  • N-proteinase lysyl hydroxylase
  • lysyl oxidase lysyl oxidase.
  • infection refers to the introduction of nucleic acids into an 15 organism by use of a virus or viral vector, and preferably, baculovirus or Semliki Forest virus.
  • transformation means introducing DNA into an organism so that the DNA is replicable, either as an extrachromosomal element, or by chromosomal 2 Q integration.
  • transfection refers to the taking up of an expression vector by a host cell, whether or not any coding sequences are in fact expressed.
  • stringent conditions refers to those hybridizing conditions that (1) employ low ionic strength and high temperature for washing, for
  • purified denotes that the indicated collagen or procoUagen is present in the substantial absence of other biological macromolecules, e.g. , polynucleotides, proteins, and the like.
  • purified as used herein preferably means at least 95% by weight, more preferably at least 99.8% by weight, of the indicated biological macromolecules present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 1000 daltons, can be present).
  • isolated refers to a protein molecule separated not only from other proteins that are present in the natural source of the protein, but also from other proteins, and preferably refers to a protein found in the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a solution of the same.
  • isolated and purified do not encompass proteins present in their natural source.
  • polynucleotide sequences which encode any collagen subunit, or functional equivalents thereof may be used to generate recombinant DNA molecules that direct the expression of that subunit of _ collagen, or a functional equivalent thereof, in appropriate host cells.
  • Preferred embodiments of the invention relate to polynucleotide sequences encoding human collagens or functional equivalents thereof.
  • Preferred embodiments of the invention also include the polynucleotide sequences of collagen subunits of type I - type IV, type XIII, type XV, and type XVIII, or functional equivalents thereof. 5
  • nucleic acid sequences encoding the known collagen types have been generally described in the art. See, e.g. , Fukai et al , Methods of Enzymology
  • New collagens/procollagens or known collagens/procollagens from which nucleic acid sequence is not available may be 0 obtained from cDNA libraries prepared from tissues believed to possess a "novel" type of collagen and to express the novel collagen at a detectable level.
  • a cDNA library could be constructed by obtaining polyadenylated mRNA from a cell line known to express the novel collagen, or a cDNA library previously 5 made to the tissue/cell type could be used.
  • the cDNA library is screened with appropriate nucleic acid probes, and/or the library is screened with suitable polyclonal or monoclonal antibodies that specificaUy recognize other collagens.
  • nucleic acid probes include oligonucleotide probes that encode known portions of the novel collagen from the same or different species.
  • suitable probes include, without limitation, oligonucleotides, cDNAs, or fragments thereof 5 that encode the same or similar gene, and/or homologous genomic DNAs or fragments thereof. Screening the cDNA or genomic library with the selected probe may be accomplished using standard procedures known to those in the art, such as those described in Chapters 10-12 of Sambrook et al. , Molecular Cloning: A
  • Altered DNA sequences which may be used in accordance with the invention include deletions, additions or substitutions of different nucleotide residues resulting in a sequence that encodes the same or a functionally equivalent gene product.
  • the gene product itself may contain deletions, additions or substitutions of amino acid residues within a collagen sequence, which result in a functionally equivalent
  • nucleic acid sequences of the invention may be engineered in order to alter the collagen coding sequence for a variety of ends including, but not limited to, alterations which modify processing and expression of the gene product.
  • alternative secretory signals may be substituted for the native human secretory signal and/or mutations may be introduced using techniques which are well known in the art, e.g. , site-directed mutagenesis, to insert new restriction sites, to alter glycosylation patterns, phosphorylation, etc. Additionally, when expressing in
  • the polynucleotides encoding the collagens of the invention may be modified in the silent position of any triplet amino acid codon so as to better conform to the codon preference of the particular host organism.
  • the nucleic acid sequences of the invention are further directed to sequences which encode variants of the described collagens and fragments.
  • These amino acid sequence variants of native collagens and collagen fragments may be prepared by methods known in the art by introducing appropriate nucleotide changes into a native or variant collagen encoding polynucleotide.
  • the amino acid sequence variants of collagen are preferably constructed by mutating the polynucleotide to give an amino acid sequence that does not occur in nature. These amino acid alterations can be made at sites that differ in collagens from different species (variable positions) or in highly conserved regions (constant regions).
  • Sites at such locations will typically be modified in series, e.g. , by substituting first with conservative choices (e.g. , hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant choices (e.g. , hydrophobic amino acid to a charged amino acid), and then deletions or insertions may be made at the target site.
  • conservative choices e.g. , hydrophobic amino acid to a different hydrophobic amino acid
  • more distant choices e.g. , hydrophobic amino acid to a charged amino acid
  • Amino acids are divided into groups based on the properties of their side chains (polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipatic nature): (1) hydrophobic (leu, met, ala, ile), (2) neutral hydrophobic
  • amino acid position that are within the same group as the "native" amino acid.
  • amino acid position that are in a group that is closely related to the "native" amino acid (e.g. , neutral hydrophobic to weakly basic).
  • non-conservative changes encompass variants of an amino acid position that are in a group that is distantly related to the "native" amino acid (e.g.
  • Amino acid sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are typically contiguous.
  • Amino acid insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as well as intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal sequences necessary for secretion or for intracellular targeting in different host cells.
  • polynucleotides encoding a collagen are changed via site-directed mutagenesis.
  • This method uses oligonucleotide sequences that encode the polynucleotide sequence of the desired amino acid variant, as well as a sufficient adjacent nucleotide on both sides of the changed amino acid to form a stable duplex on either side of the site of being changed.
  • site-directed mutagenesis is well known to those of skill in the art and this technique is exemplified by publications such as, Edelman et al. , DNA 2:183 (1983).
  • a versatile and efficient method for producing site-specific changes in a polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982).
  • PCR may also be used to create amino acid sequence variants of a collagen.
  • primer(s) that differs slightly in sequence from the corresponding region in the template DNA can generate the desired amino acid variant.
  • PCR amplification results in a population of product DNA fragments that differ from the polynucleotide template encoding the collagen at the position specified by the primer. The product DNA fragments replace the corresponding region in the plasmid and this gives the desired amino acid variant.
  • a further technique for generating amino acid variants is the cassette mutagenesis technique described in Wells et al , Gene 34:315 (1985); and other mutagenesis techniques well known in the art, such as, for example, the techniques in Sambrook et al. , supra, and Current Protocols in Molecular Biology. Ausubel et al. , supra.
  • a collagen sequence may be ligated to a heterologous sequence to encode a fusion protein.
  • a fusion protein may be engineered to contain a cleavage site located between an (3(IX) collagen sequence and the heterologous protein sequence, so that the (3(IX) collagen may be cleaved away from the heterologous moiety.
  • DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be used in the practice of the invention for the cloning and expression of these collagen proteins.
  • DNA sequences include those which are capable of hybridizing to the appropriate human collagen sequence under stringent conditions.
  • collagens are structural proteins comprised of one or more collagen subunits which together form at least one triple-helical domain.
  • a variety of enzymes are utilized in order to transform the collagen subunits into procoUagen or other precursor molecules and then mature collagen.
  • Such enzymes include prolyl-4-hydroxylase, C-proteinase, N-proteinase, lysyl oxidase and lysye hydroxylase.
  • Prolyl 4-hydroxylase plays a central role in the biosynthesis of all collagens, 0 as the 4-hydroxyproline residues stabilize the folding of the newly synthesized polypeptide chains, into triple-helical molecules.
  • pro ⁇ l chains of type III procoUagen were expressed in insect cells, without recombinant prolyl 4-hydroxylase, considerable amounts of procoUagen were made in the cells, and the pro ⁇ l chains formed triple-helical molecules as indicated by the resistance of the collagenous domains of the collagen to protease Q degradation at 22°C.
  • the Tm of the triple helices of such molecules was about 6°C lower than procoUagen produced in the presence of the recombinant prolyl
  • Lysyl hydroxylase an ⁇ 2 homodimer, catalyzes the post-translation 5 modification of collagen to form hydroxy lysine in collagens. See generally,
  • Kivirikko et al Post-Translational Modifications of Proteins. Harding, J.J. , and Crabbe, M.J.C., eds. , CRC Press, Boca Raton, FL (1992); Kivirikko, Principles of Medical Biology. Vol. 3 Cellular Organelles and the Extracellular Matrix. Bittar, E.E., and Bittar, N., eds., JAI Press, Greenwich, Great Britain (1995).
  • C-proteinase processes the assembled procoUagen by cleaving off the C-terminal ends of the procoUagens that assist in assembly of, but are not part of, the triple helix of the collagen molecule. See generally, Kadler et al , J. Biol. Chem. 262: 15969-15701 (1987), Kadler et al , Ann. NY Acad. Sci. 580:214-224 (1990).
  • N-proteinase processes the assembled procoUagen by cleaving off the N-terminal ends of the procoUagens that assist in the assembly of, but are not part of, the collagen triple helix. See generally, Hojima et al. , J. Biol. Chem. 269:11381-11390 (1994).
  • Lysyl oxidase is an extracellular copper enzyme that catalyzes the oxidative 5 deamination of the e-amino group in certain lysine and hydroxy lysine residues to form a reactive aldehyde. These aldehydes then undergo an aldol condensation to form aldols, which cross links collagen fibrils.
  • Kivirikko Principles of 0 Medical Biology. Vol. 3 Cellular Organelles and the Extracellular Matrix. Bittar, E.E. , and Bittar, N., eds., JAI Press, Greenwich, Great Britain (1995), Kagan, Path. Res. Pract.
  • nucleic acid sequences encoding a number of these post-translational enzymes have been reported. See e.g. Vuori et al. , Proc. Natl. Acad. Sci. USA 89:7467-7470 (1992); Kessler et al , Science 271:360-362 (1996).
  • the nucleic acid ° sequences encoding the various post-translational enzymes may also be determined according to the methods generally described above and include use of appropriate probes and nucleic acid libraries.
  • the nucleotide sequence encoding the collagen, or a functional equivalent is inserted into an appropriate expression vector, i.e. , a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence, or in the case of an RNA viral vector, the necessary elements for replication and translation.
  • an appropriate expression vector i.e. , a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence, or in the case of an RNA viral vector, the necessary elements for replication and translation.
  • a variety of host-expression vector systems may be utilized to express a collagen coding sequence. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing a procoUagen or collagen coding sequence; yeast or filamentous fungi transformed with recombinant yeast or fungi expression vectors containing a procoUagen or collagen coding sequence; insect cell systems infected with recombinant virus expression vectors (e.g. , baculovirus) containing sequence encoding the procoUagen or collagen of the invention; plant cell systems infected with recombinant virus expression vectors (e.g.
  • plasmid expression vectors e.g. , cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV
  • the expression elements of these systems vary in their strength and specificities.
  • any of a number of suitable transcription and translation elements including constitutive and inducible promoters, may be used in the expression vector.
  • inducible promoters such as pL of bacteriophage ⁇ , plac, ptrp, ptac (pt ⁇ -lac hybrid promoter) and the like may be used; when cloning in insect cell systems, promoters such as the baculovirus polyhedron promoter may be used; when cloning in plant cell systems, promoters derived from the genome of plant cells (e.g. , heat shock promoters; the promoter for the small subunit of RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant viruses (e.g.
  • the 35S RNA promoter of CaMV; the coat protein promoter of TMV may be used; when cloning in mammalian cell systems, promoters derived from the genome of mammalian cells (e.g. , metallothionein promoter) or from mammalian viruses (e.g. , the adenovirus late promoter; the vaccinia virus 7.5 K promoter) may be used; when generating cell lines that contain multiple copies of a collagen DNA,
  • promoters derived from the genome of mammalian cells e.g. , metallothionein promoter
  • mammalian viruses e.g. , the adenovirus late promoter; the vaccinia virus 7.5 K promoter
  • SV40-, BPV- and EBV-based vectors may be used with an appropriate selectable marker.
  • a number of expression vectors may be advantageously selected depending upon the use intended for the collagen expressed. For example, when large quantities of the collagens of the invention are to be produced for the generation of antibodies, vectors which direct the expression of high levels of fusion protein products that are readily purified may be desirable.
  • Such vectors include, but are not limited to, the E. coli expression vector pUR278 (Ruther et al , EMBO J.
  • pGEX vectors may also be used to express foreign polypeptides as fusion proteins with glutathione S- transferase
  • GST GST
  • fusion proteins are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione.
  • the pGEX vectors are designed to include thrombin or factor Xa protease cleavage sites so that the cloned polypeptide of interest can be released from the GST moiety.
  • a preferred expression system is a yeast expression system.
  • yeast a number of vectors containing constitutive or inducible promoters may be used.
  • Current Protocols in Molecular Biology Vol. 2, Ed. Ausubel et al , Greene Publish. Assoc. & Wiley Interscience, Ch. 13 (1988); Grant et al ,
  • a particularly preferred system useful for cloning and expression of the collagen proteins of the invention uses host cells from the yeast Pichia.
  • Species of non-Saccharomyces yeast such as Pichia pastoris appear to have special advantages in producing high yields of recombinant protein in scaled up procedures.
  • a Pichia expression kit is available from Invitrogen Corporation (San Diego, CA).
  • methanol responsive genes in methy lotrophic yeasts such as Pichia pastoris, the expression of each being controlled by methanol responsive regulatory regions (also referred to as promoters). Any of such methanol responsive promoters are suitable for use in the practice of the present invention.
  • Examples of specific regulatory regions include the promoter for the primary alcohol oxidase gene from Pichia pastoris AOX1, the promoter for the secondary alcohol oxidase gene from P. pastoris AXO2, the promoter for the dihydroxyacetone synthase gene from P. pastoris (DAS), the promoter for the P40 gene from P. pastoris, the promoter for the catalase gene from P. pastoris, and the like.
  • Typical expression in Pichia pastoris is obtained by the promoter from the tightly regulated AOX1 gene. See Ellis et al. , Mol. Cell. Biol. 5: 1111 (1985) and
  • MOX methanol oxidase
  • DAS dihydroxyacetone synthase
  • FMHD formate dehydrogenase
  • MOX methanol oxidase
  • DAS dihydroxyacetone synthase
  • FMHD formate dehydrogenase
  • These enzymes can constitute up to 30-40% of the total cell protein.
  • the genes encoding MOX, DAS, and FMDH production are controlled by very strong promoters which are induced by growth on methanol and repressed by growth on glucose. Any or all three of these promoters may be used to obtain high level expression of heterologous genes in H. potymorpha.
  • the gene encoding a collagen of the invention is cloned into an expression vector under the control of an inducible H. polymorpha promoter.
  • a polynucleotide encoding a signal sequence for secretion in yeast such as the 5. cerevisiae prepro-mating factor ⁇ l, is fused in frame with the coding sequence for the collagen of the invention.
  • the expression vector preferably contains an auxotrophic marker gene, such as URA3 or LEU2, which may be used to complement die deficiency of an auxotrophic host.
  • the expression vector is then used to transform H. polymorpha host cells using techniques known to those of skill in the art.
  • An interesting and useful feature of H. polymorpha transformation is the spontaneous integration of up to 100 copies of the expression vector into the genome. In most cases, the integrated DNA forms multimers exhibiting a head-to-tail arrangement.
  • the integrated foreign DNA has been shown to be mitotically stable in several recombinant strains, even under non-selective conditions. This phenomena of high copy integration further adds to the high productivity potential of the system.
  • Filamentous fungi may also be used to produce the collagens of the instant invention.
  • Vectors for expressing and/or secreting recombinant proteins in filamentous fungi are well known, and one of skill in the art could use these vectors to express recombinant collagen.
  • the expression of sequences encoding the collagens of the invention may be driven by any of a number of promoters.
  • viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al , Namre 3_1Q:511-514 (1984), or the coat protein promoter of TMV (Takamatsu et al , EMBO J. £:307-311 (1987)) may be used; alternatively, plant promoters such as the small subunit of RUBISCO (Coruzzi et al. , EMBO J.
  • An alternative expression system which could be used to express the collagens 5 of the invention is an insect system.
  • Autographa californica nuclear polyhidrosis virus (AcNPV) is used as a vector to express foreign genes.
  • the virus grows in Spodoptera frugiperda cells.
  • Coding sequence for the collagens of the invention may be cloned into non-essential regions (for example the o polyhedron gene) of the virus and placed under control of an AcNPV promoter (for example, the polyhedron promoter).
  • Successful insertion of a collagen coding sequence will result in inactivation of the polyhedron gene and production of non-occluded recombinant virus (i.e. , virus lacking the proteinaceous coat coded for by the polyhedron gene).
  • These recombinant viruses are then used to infect 5
  • a number of viral based expression systems may be utilized.
  • coding sequence for the collagens of the invention may be ligated to an adenovirus transcription/translation control complex, e.g. , the late promoter and tripartite leader 5 sequence.
  • This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g.
  • region El or E3 will result in a recombinant virus that is viable and capable of expressing collagen in infected hosts.
  • the vaccinia 7.5 K promoter may be used.
  • Mackett et al Proc. Natl. Acad. Sci. USA 79:7415-7419 (1982); Mackett et al , J. Virol. 49:857-864 (1984); Panicali et al , Proc. Natl. Acad. Sci. USA 79:4927-4931 (1982).
  • Specific initiation signals may also be required for efficient translation of inserted collagen coding sequences. These signals include the ATG initiation codon and adjacent sequences. In cases where the entire collagen gene, including its own initiation codon and adjacent sequences, is inserted into the appropriate expression vector, no additional translational control signals may be needed. However, in cases where only a portion of a collagen coding sequence is inserted, exogenous translational control signals, including the ATG initiation codon, must be provided. Furthermore, the initiation codon must be in phase with the reading frame of the collagen coding sequence to ensure translation of the entire insert. These exogenous translational control signals and initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (see Bittner et al. , Methods in Enzvmol. 153:516-544 (1987)).
  • the collagens of the invention are expressed as secreted proteins.
  • the engineered cells used for expression of the proteins are non-human host cells, it is often advantageous to replace the human secretory signal peptide of the collagen protein with an alternative secretory signal peptide which is more efficiently recognized by the host cell's secretory targeting machinery.
  • the appropriate secretory signal sequence is particularly important in obtaining optimal fungal expression of mammalian genes. For example, in methylotrophic yeasts, a DNA sequence encoding the in-reading frame 5. cerevisiae ⁇ -mating factor pre-pro sequence may be inserted at the amino-terminal end of the coding sequence.
  • the ⁇ MF pre-pro sequence is a leader sequence contained in the ⁇ MF precursor molecule, and includes the lys-arg encoding sequence which is necessary for proteolytic processing and secretion (see, e.g. , Brake et al , Proc. Natl. Acad. Sci. USA 81:4642 (1984)).
  • Other signal sequences for prokaryotic, yeast, fungi, insect or mammalian cells are well known in the art, and one of ordinary skill could easily select a signal sequence appropriate for the host cell of choice.
  • the vectors of this invention may autonomously replicate in the host cell, or may integrate into the host chromosome.
  • Suitable vectors with autonomously replicating sequences are well known for a variety of bacteria (e.g. , the ars from pBR322 functions in the majority of gram negative bacteria), yeast (the 2 ⁇ plasmid ars), and various viral replications sequences for both prokaryotes and eukaryotes (prokaryote: ⁇ , T-even phages, M13, etc; eukaryote: adenovirus, SV40, polyoma, VSV or BPV, vaccina, etc.).
  • Vectors may integrate into the host cell genome when they have a DNA sequence that is homologous to a sequence found in the host cell's genomic DNA.
  • the vectors of the invention also encode a selection gene, also termed a selectable marker, that encodes a product necessary for the host cell to grow and survive under certain conditions.
  • Selection genes include genes encoding (1) a protein that confers resistance to an antibiotic or other toxin (e.g. , tetracyclme, _ ampicillin, neomycin, methotrexate, etc.), and (2) a protein that complements an auxotrophic requirement of the host cell, etc.
  • selection genes include: the herpes simplex virus thymidine kinase (Wigler et al , Cell 11:223 (1977)), hypoxanthine-guanine phosphoribosyltransferase (Szybalska et al , Proc.
  • trpB which allows cells to utilize indole in place of tryptophan
  • hisD which allows cells to utilize histinol in place of histidine
  • ODC omithine decarboxylase
  • DFMO 2-(difluoromethyl)-DL-ornithine
  • promoters and enhancers are untranslated sequences located upstream from the start codon of the strucmral gene that control the transcription of the nucleic acid under its control.
  • Inducible promoters are promoters that alter their level of transcription initiation in response to a change in culmre conditions, e.g. , the presence or absence of a nutrient.
  • a change in culmre conditions e.g. , the presence or absence of a nutrient.
  • Promoters useful in the present invention include, but are not limited to, the following: (prokaryote) (1) the lactose promoter, the alkaline phosphatase promoter, the tryptophan promoter, and hybrid promoters such as the tac promoter, (yeast) (2) the promoter for 3 -phosphoglycerate kinase, other glycolytic enzyme promoters (hexokinase, pyruvate decarboxylase, phophofructosekinase, glucose -6-phosphate isomerase, etc.), the promoter for alcohol dehydrogenase, the metallothionein promoter, the maltose promoter, and the galactose promoter,
  • eukaryotic (3) virtually all eukaryotic genes have an AT-rich region located approximately 25 to 30 bases upstream from the site where transcription is initiated
  • suitable eukaryotic promoters include: promoters from the viruses polyoma, fowlpox, adenovirus, bovine papilloma virus, avian sarcoma virus, cytomegalovirus, retroviruses, SV40, and promoters from the target eukaryote including: the glucoamylase promoter from Aspergillus, the actin promoter or an immunoglobin promoter from a mammal, and native collagen promoters. See, e.g. , de Boer et al, Proc. Natl.
  • Enhancers are cis-acting elements, usually about from 10 to 300 bp, that act to increase the rate of transcription initiation at a promoter. Many enhancers are known for both eukaryotes and prokaryotes, and one of ordinary skill could select an appropriate enhancer for the host cell of interest. See, e.g. , Yaniv, Nature 297:17-18 (1982) for 5 eukaryotic enhancers.
  • a host cell strain may be chosen which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific fashion desired. Such modifications (e.g. , glycosylation) and processing (e.g. , Q cleavage) of protein products may be important for the function of the protein.
  • Different host cells have characteristic and specific mechanisms for die post-translational processing and modification of proteins. Appropriate cells lines or host systems can be chosen to ensure the correct modification and processing of the foreign protein expressed.
  • eukaryotic host cells which possess the 5 cellular machinery for proper processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be used.
  • mammalian host cells include, but are not limited to, CHO, VERO, BHK, HeLa, COS, MDCK, 293,
  • host cells may be engineered to express various enzymes to 0 ensure the proper processing of the collagen molecules.
  • the gene for prolyl-4-hydroxylase may be coexpressed with the collagen gene in the host cell.
  • cell lines which stably express the collagens 5 of the invention may be engineered.
  • host cells can be transformed with collagen encoding DNA controlled by appropriate expression control elements (e.g. , promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker.
  • appropriate expression control elements e.g. , promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.
  • engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are switched to a selective media.
  • the selectable marker in the recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid into their chromosomes and grow to form foci which in mm can be cloned and expanded into cell lines.
  • This method may advantageously be used to engineer cell lines which express a desired collagen.
  • Host cells are transfected or preferably infected or transformed with 5 the above-described expression vectors, and culmred in nutrient media appropriate for selecting transductants or transformants containing the collagen encoding vector.
  • the host cells which contain the coding sequence and which express the biologically active gene product may be identified by at least four general approaches; (a) DNA-DNA or DNA-RNA hybridization; (b) the presence or absence of "marker" gene functions; (c) assessing the level of transcription as measured by the expression of collagen mRNA transcripts in the host cell; and (d) detection of the gene product as measured by immunoassay or by its biological activity.
  • the presence of the collagen coding sequence inserted in 5 the expression vector can be detected by DNA-DNA or DNA-RNA hybridization using probes comprising nucleotide sequences that are homologous to the collagen coding sequence, respectively, or portions or derivatives tiiereof.
  • the recombinant expression vector/host system can be ° identified and selected based upon the presence or absence of certain "marker" gene functions (e.g. , thymidine kinase activity, resistance to antibiotics, resistance to methotrexate, transformation phenotype, occlusion body formation in baculovirus, etc.).
  • certain "marker” gene functions e.g. , thymidine kinase activity, resistance to antibiotics, resistance to methotrexate, transformation phenotype, occlusion body formation in baculovirus, etc.
  • a marker gene can be placed in tandem with the collagen sequence under the control of the same or different promoter used to control the expression of the collagen coding sequence.
  • Expression of the marker in response to induction or selection indicates expression of the collagen coding sequence.
  • transcriptional activity of the collagen coding region can be assessed by hybridization assays.
  • RNA can be isolated and analyzed by Northern blot using a probe homologous to the collagen coding sequence or particular portions thereof.
  • total nucleic acids of the host cell may be extracted and assayed for hybridization to such probes.
  • the expression of a collagen protein product can be assessed immunologically, for example by Western blots, immunoassays such as radioimmuno-precipitation, enzyme-linked immunoassays and the like.
  • the expressed collagen of the invention which is preferably secreted into the culture medium, is purified to homogeneity by chromatography.
  • the recombinant collagen protein is purified by size exclusion chromatography.
  • other purification techniques known in the art can also be used, including ion exchange chromatography, and reverse-phase chromatography. See, e.g. , Maniatis et al , Molecular Cloning A Laboratory Manual. Cold Spring
  • Example 1 Synthesis of Human Type ⁇ ProcoUagen
  • a recombinant COLI A 1 gene construct employed in the present invention comprised a fragment of the 5'- end of COL1A1 having a promotor, exon 1 and intron 1 fused to exons 3 through 54 of a COL2A1 gene.
  • the hybrid construct was transfected into HT-1080 cells. These cells were co-transfected with a neomycin-resistance gene and grown in the presence of the neomycin analog G418. The hybrid construct was used to generate transfected cells.
  • a series of clones were obtained that synthesized mRNA for human type II procoUagen.
  • the cells were incubated with [ 14 C] proline so that the medium proteins could be analyzed by autoradiography (storage phosphor film analyzer).
  • lane 1 shows that the unpurified medium proteins are comprised of three major polypeptide chains.
  • the medium proteins contained the expected type II procoUagen comprised of pro ⁇ l (II) chains together with pro ⁇ l (IV) and pro ⁇ 2(IV) chains of type IV collagen normally synthesized by the cells.
  • the upper two are pro ⁇ l (IV) and pro ⁇ 2 (IV) chains of type IV collagen that are synthesized by cells not transfected by the construct.
  • the third band is the pro ⁇ l (II) chains of human type II procoUagen synthesized from the construct.
  • Lanes 2 and 3 are the same medium protein after chromatography of the medium on an ion exchange column. As indicated in Lanes 2 and 3, the type II procoUagen was readily purified by a single step of ion exchange chromatography.
  • the type II procoUagen secreted into the medium was correctly folded by a protease-thermal stability test.
  • the medium proteins were digested at the temperatures indicated with a high concentration of trypsin and chymotrypsin under conditions in which conectly folded triple-helical procoUagen or collagen resists digestion but unfolded or incorrectly folded procoUagen of collagen is digested to small fragments.
  • the products of the digestion were than analyzed by polyacrylamide gel electrophoresis in SDS and fluorography. The results show that the type II procoUagen resisted digestion up to 43°C, the normal temperature at which type II procoUagen unfolds. Therefore, the type II procoUagen is correctly folded and can be used to generate collagen fibrils.
  • B. Example 2 Synthesis of Human Type I ProcoUagen
  • HT-1080 cells were co-transfected with a
  • COLI A 1 gene and a COL1A2 gene were both genes and a cytomegalic virus promoter linked to a full-length cDNA.
  • the COL1A2 gene construct but not the
  • COLI A 1 gene construct contained a neomycin-resistance gene.
  • the cells were selected for expression of the COL1A2- neomycin resistance gene construct by growth in the presence of the neomycin-analog G418.
  • the medium was then examined for expression of the COLI A 1 with a specific polyclonal antibody for human pro ⁇ l(l) chains.
  • the COLIA2 was linked to an active neomycin-resistance gene but the COLI A 1 was not.
  • the cells were screened for expression of the COLlA2-neomycin resistance gene construct with the neomycin analog G418.
  • the medium was analyzed for expression of the COLI A 1 by Western blotting with a polyclonal antibody specific for the human pro ⁇ l(I) chain.
  • lane 1 indicates that the medium proteins contained pro ⁇ (I) chains ( ⁇ l(I) and ⁇ 2(I)).
  • Lane 2 is an authentic standard of type I procoUagen containing pro ⁇ l(I) and pro ⁇ 2(I) chains and partially processed pC ⁇ l(I) chains. The results demonstrate that the cells synthesized human type procoUagen that contained pro ⁇ l(I) chains, presumably in the form of the normal heterotrimer with the composition two pro ⁇ (I) chains and one pro ⁇ 2(I) chain.
  • TABLE I presents a summary of some of the DNA constructs containing human procoUagen genes. The constructs were assembled from discrete fragments of the genes or cDNAs from the genes together with appropriate promoter fragments. TABLE I
  • a cosmid plasmid clone containing the gene construct was cleaved with a restriction endonuclease to release the construct from the vector.
  • a plasmid vector comprising a neomycin resistance gene, (Law et al , Mol. Cell. Biol. 3:2110-2115 (1983)) was linearized by cleavage with
  • the cells were then transferred to DMEM medium containing newborn calf serum for 24 hours and then to the same medium containing 450 ⁇ g/ml of G418. Incubation in the medium containing G418 was continued for about 4 weeks with a change of medium every third day.
  • G418-resistant cells were either pooled or separate clones obtained by isolating foci with a plastic cylinder and subcultured.
  • polyclonal antibodies were prepared in rabbits using a 23-residue synthetic peptide that had an amino acid sequence found in the COOH-terminal telopeptide of type II collagen. See generally,
  • COOH-terminal polypeptide of the pro ⁇ (I) chain were employed. See generally, Olsen et a/. , J. Biol. Chem. 266: 1117-1121 (1991).
  • Culmre medium from pooled clones or individual clones was removed and separately precipitated by the addition of solid ammonium sulfate to 30% saturation and precipitates were collected by centrifugation at 14,000 x g and then dialyzed against a buffer containing 0.15 M NaCl, 0.5 mM EDTA, 0.5 mM
  • Aliquots of the samples were heated to 10°C for 5 minutes in 1 % SDS, 50 mM DTT and 10% (v/v) glycerol, and separated by electrophoresis on 6% polyacrylamide gels using a mini-gel apparatus (Holford SE250, Holford Scientific) run at 125 V for 90 minutes. Separated proteins were electroblotted from the polyacrylamide gel at 40 V for 90 minutes onto a supported nitrocellulose membrane (Schleicher and Schuell).
  • the transferred proteins were reacted for 30 minutes with the polyclonal antibodies at a 1:500 (v/v) dilution. Proteins reacting with the antibodies were detected with a secondary anti-rabbit IgG antibody coupled to alkaline phosphatase (Promega Biotech) for 30 minutes. Alkaline phosphatase was visualized with NBT/BCIP (Promega Biotech) as directed by the manufacturer.
  • the medium proteins were digested with high concentrations of proteases under conditions in which only correctly folded procoUagens and collagens resist digestion.
  • the cell layer from a 25 cm flask was scraped into 0.5 ml of modified Krebs II medium containing 10 mM EDTA and 0.1 % Nonidet P-40 (Sigma). The cells were vigorously agitated in a Vortex mixer for 1 minute and immediately cooled to 4°C. The supernatant was transferred to new tubes.
  • the sample was preincubated at the temperamre indicated for 10 minutes and the digestion was carried out at the same temperature for 2 minutes.
  • a 0.1 volume of the modified Krebs II medium containing 1 mg/ml trypsin and 2.5 mg/ml ⁇ -chyrnotrypsin (Boehringer Manheim) was added.
  • the digestion was stopped by adding a 0.1 volume of 5 mg/ml soybean trypsin inhibitor (Sigma).
  • the sample was rapidly immersed in boiling water for 2 minutes with the concomitant addition of a 0.2 volume of 5 x electrophoresis sample buffer that consisted of 10% SDS, 50% glycerol, and 0.012% bromphenol blue in 0.625 M Tris-HCl buffer (pH 6.8). Samples were applied to SDS gels with prior reduction by incubating for 3 minutes in boiling water after the addition of 2% 2-mercaptoethanol. Electrophoresis was performed using the discontinuous system of Laemli, Namre 227:680-685 (1979), with minor modifications described by de Wet et al , J. Biol. Chem. 258:7721-7728 (1983).
  • Sf9 cells were grown on glass slides and fixed in 100% ethanol at -20°C. Alternatively, cells in monolayer were detached, washed twice with a solution of 0.15 M NaCl and 0.02 M phosphate, pH 7.4 (washing solution), suspended in cold ethanol and spread on silanated (Maples, J.A., (1985), Am. J.
  • Triton X-100 extracts of cell homogenates were analyzed for prolyl 4-hydroxylase activity with an assay based on the hydroxylation-coupled decarboxylation of 2-oxo [1-14C] glutarate (Kivirikko et al. ,
  • the amount of the purified type III collagen was determined by using the Sircol collagen assay (Biocolor). Amino acid analysis of the purified type III collagen was performed in an Applied Biosystems 421 Amino Acid Analyzer.
  • DNA and some cDNA for the pro ⁇ l (I) chain of human type I procoUagen was the starting material.
  • the DNA sequence of the hybrid gene was analyzed and the codons for amino acids that formed the junctions between the repeating D-periods were modified in ways that did not change the amino acids encoded but did create unique sites for cleavage of the hybrid gene by restriction endonucleases.
  • the D3-period of pro ⁇ l (I) is excised using Srfl and Nael restriction nucleases.
  • the bases coding for the amino acids found in the collagenase recognition site present in the D3 period are modified so that they code for a different amino acid sequence.
  • the cassette is amplified and reinserted in the gene. Expression of the gene in an appropriate host cell will result in type I collagen which cannot be cleaved by collagenase.
  • a D2 period cassette (of the pro ⁇ l (I) chain) is excised from the gene described above by digestion with Smal.
  • the gene is reassembled to provide a gene having a specific 5 in-frame deletion of the codons for the D-2 period.
  • Multiple copies of one or more D-cassettes may be inserted at the engineered sites to provide multiple copies of desired regions of procoUagen or collagen.
  • the baculovirus transfer vector pVl ⁇ 58 was constructed by digesting a pBluescript (Stratagene) vector containing in the Small site the full-length cDNA for the ⁇ subunit of human prolyl
  • the cDNA clones P ⁇ -58 and P ⁇ -59 differ by a stretch of 64 bp.
  • the pVL ⁇ vector was constructed by litigation of an EcoRI-BamHI fragment of a full-length cDNA for the ⁇ subunit of human prolyl 4-hydroxylase, S-l 38
  • the resultant viral pool in the supernatant of the transfected cells was collected 4 days later and used for plaque assay.
  • Recombinant occlusion-negative plaques were subjected to three rounds of plaque purification to generate recombinant viruses totally free of contaminating wild-type virus.
  • the screening procedure and isolation of the recombinant viruses essentially followed by the method of Summers and Smith, supra.
  • the resulting recombinant viruses from pVL ⁇ 58, pVL ⁇ 59, and pvLfi were designated as the ⁇ 58 virus, ⁇ 59 virus and ⁇ virus, respectively.
  • Sf9 cells were cultured in TNM-FH medium (Sigma) supplemented with 10% 5 fetal bovine serum at 27°C either as monolayers or in suspension in spinner flasks (Techne).
  • TNM-FH medium Sigma
  • 5 fetal bovine serum at 27°C either as monolayers or in suspension in spinner flasks (Techne).
  • Sf9 cells seeded at a density of 106 cells per ml were injected at a multiplicity of 5-10 with recombinant viruses when the ⁇ 58, ⁇ 59, or ⁇ virus was used alone.
  • the ⁇ and ⁇ viruses were used for Q infection in ratios of 1: 10-10: 1 when producing the prolyl 4-hydroxylase tetramer.
  • the cells were harvested 72 hours after infection, homogenized in 0.01 M Tris, pH 7.8/0.1 M NaCl/0.1 M glycine/lO ⁇ M dithiothreitol/0.1 % Triton X-100, and centrifuged. The resulting supernatants were analyzed by SDS/10% PAGE or nondenaturing 7.5% PAGE and assayed for enzyme activities. The cell pellets were 5 further solubilized in 1 % SDS and analyzed by SDS/10% PAGE. The cell medium at 24-96 hours postinfection was also analyzed by SDS/10% PAGE to identify any secretion of the resultant proteins into the medium. The cells in these experiments were grown in TNM-FH medium without serum.
  • the Km values were determined by varying the concentrations of one substrate in the presence of fixed concentration of the second, while the concentrations of the other substrates were held constant (Myllyla et al , Eur. J. Biochem. 80:349-357 (1977)). Protein disulfide-isomerase activity of the ⁇ subunit was measured by glutathione: insulin transhydrogenase assay (Carmichael et al , J. Biol. Chem. 252:7163-7167 (1977)). Western blot analysis was performed using a monoclonal antibody, 5B5, to the ⁇ subunit of human prolyl 4-hydroxylase (Hoyhtya et al. , Eur. J. Biochem.
  • Prolyl 4-hydroxylase was purified by a procedure consisting of poly (L-proline) affinity chromatography, DEAE-cellulose chromatography, and gel filtration (Kivirikko et al , Methods in Enzvmologv 144:96-114 (1987)).
  • Figure 5 presents analysis of the prolyl 4-hydroxylase synthesized by the insect cells after purification of the protein by affinity-column chromatography.
  • the recombinant enzyme When examined by polyacrylamide gel electrophoresis in a nondenaturing gel, the recombinant enzyme co-migrated with the tetrameric and active form of the normal enzyme purified from chick embryos. After the purified recombinant enzyme was reduced, the (- and (- subunits were detected.
  • lanes 1-3 are protein separated under non-denaturing conditions and showing tetramers of the two kinds of subunits.
  • Lanes 4-6 are the same samples separated under denaturing 5 conditions so that the two subunits appear as separate bonds.
  • the Km values were determined by varying the concentration of one substrate in the presence of fixed concentrations of the second while the concentration of the other ° substrates were held constant.
  • Michales-Mento (Km) values for the recombinant enzyme were essentially the same as for the authentic normal enzyme from chick embryos.
  • transfected insect cells Since the transfected insect cells synthesize large amounts of active prolyl 5 4-hydroxylase, they are appropriate cells to transfect with genes of the present invention coding for procoUagens and collagens so as to obtain synthesis of large amounts of the procoUagens and collagens. Transfection of the cells with genes of the present invention is performed as described in Example 3. 0
  • the yeast Saccharomyces cerevisiae can be used with any of a large number of expression vectors.
  • One of the most commonly employed expression 5 vectors is the multi-copy 2 ⁇ plasmid that contains sequences for propagation both in yeast and E. coli, a yeast promoter and terminator for efficient transmission of the foreign gene.
  • Typical examples of such vectors based on 2 ⁇ plasmids are pWYG4 that has the 2 ⁇ ORI-STB elements, the GALI remoter, and the 2 ⁇ D gene ° terminator.
  • an Ncol cloning site is used insert the gene for either the ( or ( subunit of prolyl 4-hydroxylase, and provide the ATG start codon for either the ⁇ or ⁇ subunit.
  • the expression vector can be pWYG7L that has intact 2 ⁇ ORI, STB, REP1 and REP2, the GAL7 promoter, and uses the FLP 5 terminator.
  • the gene for either the ( or ( subunit of prolyl 4- hydroxylase is inserted in the polylinker with its 5' ends at a BamHI or Ncol site.
  • the vector containing the prolyl 4-hydroxylase gene is transformed into S. cerevisiae either after removal of the cell wall to produce spheroplasts that take up DNA on treatment with calcium and polyethylene glycol or by treatment of intact cells with lithium ions.
  • DNA can be introduced by electroporation.
  • Transformants can be selected by using host yeast cells that are auxotrophic for leucine, tryptophane, uracil or histidine together with selectable marker genes such as LEU2, TRO1, URA3, HIS3 or LEU2-D.
  • Selection of the prolyl 4-hydroxylase genes driven by the galactose promoters can be induced by growing the culmre on a non-repressing, non-inducing sugar so that very rapid induction follows addition of galactose; by growing the culmre in glucose medium and then removing the glucose by centrifugation and washing the cells before resuspension in galactose medium; and by growing the cells in medium containing both glucose and galactose so that the glucose is preferentially metabolized before galactose-induction can occur.
  • Expression of the genes for prolyl 4-hydroxylase and procoUagens or collagens can also be in non- Saccharomyces yeast such as Pichia pastoris that appear to have special advantages in producing high yields of recombinant protein in scaled-up procedures.
  • Typical expression in the methylotroph P. pastoris is obtained by the promoter from the tightly regulated AOX1 gene that encodes for alcohol oxidase and can be induced to give high levels of recombinant protein driven by the promoter after addition of methanol to the cultures. Since P. Pastoris has no native plasmids, the yeast is employed with expression vectors designed for chromosomal integration and genes such as HIS4 are used for selection.
  • genes for procoUagens and collagens described herein are achieved under conditions where the recombinant protein is adequately hydroxylated by prolyl 4-hydroxylase and, therefore, can fold into a stable helix that is required for the normal biological function of the proteins in forming fibrils.
  • the 3' end of the collagen DNA was synthesized from 4195 bp downstream (EcoRI site) of the translation initiation codon to stop codon (4401 bp) of the translation by PCR (see Ala-Kokko et al, 1989 Biochem J.. 260:509-516 5 accession number X144207) using pBluescript-SM38. Notl and Xbal sites were created in the 3' end of the fragment.
  • pBluescript-SM38 was digested with
  • the 5' end of the collagen D ⁇ A was synthesized from 73bp downstream of the translation initiation codon to 176 bp (BamHI site) by PCR (for sequences, see
  • the 3' end of the collage was created from 4195 bp downstream
  • Hpal was digested by Hpal.
  • the Hpal fragment of ARG4 gene was inserted into the EcoRV sites of pAO815 (Invitrogen) to create a vector pARG815, which contains the ARG4 gene instead of the HIS4 gene.
  • EcoRI -fragment was inserted into the EcoRI site of PAO815 (Invitrogen) to create single expression cassette vector.
  • the 5' end of the ⁇ -subunit was synthesized from the translation initiation codon to 689 bp downstream (Hindlll site) by PCR. Hindlll and Smal sites were created in 5' end of the fragment.
  • Plasmid pA-59 (see Vuori et al) was digested by Hindllland the large fragment (approximately 4.9 kb) was isolated. The large fragment (approximately 4.9 kb) and the 5' PCR fragment were ligated with T4 ligase, to give plasmid pA-59/15.
  • Plasmid pA-59/15 was digested with Pstland BamHI, the large fragment (approximately 3.9 kb) was isolated, and the large fragment and 3' PCR fragment were ligated with T4 ligase, to give plasmid pA-59/3.
  • Plasmid pA-59 was digested with Smal and the Smal-Smal ⁇ -subunit fragment (1 bp-1605 bp) was ligated into EcoRI site of the pARG815.
  • the ⁇ single cassette vector was digested by Bglll-BamHl to excise the expression cassette and the expression cassette was reinserted into one for the BamHI site of pARG815 ⁇ expression vector.
  • the vector contains two expression cassettes: one for the ⁇ -subunit and one for the ⁇ -subunit.
  • the ⁇ -subunit without its signal sequence was synthesized by PCR from 52 bp downstream of the translation initiation codon to the translation stop codon. ⁇ coRI restriction sites were created in 5' and 3' ends. This PCR fragment was cloned into the ⁇ coRI site of pSP72 (Promega).
  • pVLClAl The baculovirus transfer vector, was constructed using the eukaryotic expression vector CMV- COLIAI (Geddis et al , Matrix
  • CMV-COL1A1 contains the sequences coding for the full length cDNA sequence of the ⁇ l chain of the human procoUagen I (COLIAI).
  • pVLClA2 The baculovirus transfer vector was constructed using the vector pUC-HP2010 (Kuivaniem et al , Biochem. J. 252:633-640 (1988)) and the polyhedrin-based baculovirus transfer vector pVL 1392 (Luckow et al. , Virology 170:31-39 (1989)).
  • pUC-HP2010 contains the sequences coding for the full length cDNA sequence of the ⁇ 2 chain of the human procoUagen I (COL1A2) in the Sphl site of pUC19.
  • pUC-HP2010 is digested with Sphl, the GTAC overhang is removed with T4
  • the BgHl-BamHl fragment from pSP72-ClA2T has the full length COL1A2 sequence plus six bp 5' untranslated, and 278 bp 3' untranslated, and this fragment is cloned into the BgHl-BamHl sites of pVL1392 to give pVLClA2.
  • pVLC3Al A Bglll site was created 16 bp upstream of the translation initiation codon to a full- length cDNA including 92 bp 5' untranslated region and 715 bp 3' untranslated region for the pro ⁇ l chain of human type III procoUagen in the plasmid pBS-SM38 (derived from sequences presented in Ala-Kokko et al Biochem. J. 260: 509-516 (1989), and GenBank accession number X 14420) by PCR, to give the plasmid pBS-C3Al .
  • pBS-C3Al was digested with Bglll and Xbal restriction enzymes and the Bglll/Xbal fragment containing the full-length cDNA of pro ⁇ l chain of human type III procoUagen including 16 bp 5' untranslated region, and 715 bp 3' untranslated region, was then ligated to pVL1392 (Luckow et al.
  • pVLC3A15'UT/C2Al The baculovirus transfer vector was constructed using the sequences presented in Baldwin et al , Biochem. J. 262:521-528 (1989) resulting in the vector pG ⁇ MC2Al and the polyhedrin-based baculovirus transfer vector pVL
  • pGEMC2Al contains the sequences coding for exon I from type I collagen, and type II collagen starts from exon 2B.
  • pGEMC2Al is digested with Xbal-Dral to generate a fragment with the full length cDNA fusion, and six bp 5' untranslated region and 396 bp 3' untranslated region, and this fragment is cloned into the Xbal-Smal sites of pVL1392 to give the plasmid pVLClAl/C2Al.
  • the 5' untranslated region was then changed to GATCTGATATT by cloning an oligonucleotide into the BgHl-Xbal sites of the COL
  • PNLC3A1NP/C2A1 pGEMC2Al is digested with Xbal-BamHl and the full length cDNA fusion is cloned into the Xbal-BamHl sites of pBS(SK-) to give the plasmid pBSClAl/C2Al.
  • pBSClAl/C2Al is digested with BgHl-Narl to generate a full length cDNA without the N-propeptide, the N-propeptide with 16 bp 5' untranslated from type III collagen was synthesized by PCR using the plasmid pBS-C3Al as a template.
  • N-propeptide were as follows: 5' oligo
  • telopeptide from type II collagen was synthesized by oligonucleotides (chemical synthesis). The following oligonucleotides were used
  • Niecht K*ln (based on the sequence published by Brazel et al. , Eur. J. Biochem. 168:529-536 (1987), and Soininen et al , FEBS Lett. 225:188-194 (1987)) and the polyhedrin-based baculovirus transfer vector pVL 1392 (Luckow et al. , Virology 170:31-39 (1989)).
  • ⁇ lCMVC was digested with Clal to generate a full length cDNA with 18 bp 5' untranslated and 203 bp 3' untranslated, and this fragment was blunt ended using
  • Klenow polymerase (Pharmacia Biotech) and a mixture of dNTPS and cloned into the
  • pVLE26 The baculovirus transfer vector was constructed using the cDNA clone E-26 in vector pBluescript (SK-) (Pihlajaniemi et al. , J. Biol. Chem.
  • the cDNA clone E-26 encodes the ⁇ l chain of human type XIII collagen that is ligated into the EcoRI site of pBS(SK-) (construct termed clone ⁇ -26).
  • the E-26 clone is described in, for example, Pihlajaniemi et al , J. Biol. Chem. 265: 16922-16928 (1990).
  • the cDNA E-26 was obtained from a ⁇ gtll cDNA library derived from human umbilical vein endothelial cells (Clontech), and the insert was released by digestion with EcoRI. This EcoRI fragment was ligated into the EcoRI site of pBR322 to give the clone ⁇ -26. Clone ⁇ -26 is digested with EcoRI to generate the ⁇ -26 cDNA covering type XIII coding sequences. 123 bp 5' untranslated region and 117 bp 3' untranslated region are included, and this fragment is cloned into the EcoRI site of pVL1392 to give the plasmid pVL ⁇ 26.
  • pVLhuXIII The baculovirus transfer vector was constructed using clone E-26 (Pihlajaniemi et al , J. Biol. Chem. 265: 16922-16928 (1990)), genomic human type XIII collagen sequences (Tikka et al, J. Biol. Chem. 266:17713-17719 (1991)) and the polyhedrin-based baculovirus transfer vector pVL1932 (Luckow et al. Virology 170:31-39 (1989)).
  • pBShuXIII A clone called pBShuXIII was constructed and it contains the clone E-26 of the ⁇ l chain of human type XIII collagen with the 5' end of genomic human type XIII collagen covering nucleotides 1-272 from the type XIII collagen gene generated by PCR, in the Notl-EcoRl site of pBS(SK-) to give the full-length cDNA of type XIII collagen (Tikka et al , J. Biol. Chem. 266: 17713-17719 (1991)).
  • the 5' end of the genomic human type XIII collagen was generated using CL412 (a lambda clone isolated from a human genomic library (Clontech)) as the template and the PCR primers: 5' primer (5'-ATGCGGCCGCACGCGAGAGGATGGTAGC-3'), and 3' primer (5'- TAGCTGTCTCCATTTGCTGCTC-3').
  • the 5'-PCR-primer included a new Notl restriction site preceding the type XIII sequences, which was used as well as a Pstl site between nucleotides 216 and 217 (Tikka et al, J. Biol. Chem.
  • pVLmoXIII The baculovirus transfer vector was constructed using the vector pBSmoXIII and the polyhedrin-based baculovirus transfer vector pVL1392, which is described in Luckow et al , Virology 170:31-39 (1989).
  • pBSmoXIII consists of the cD ⁇ A clone 689 in pBluescript encoding the ⁇ l chain of mouse type XIII collagen wherein the 5' end was generated by PCR and the 3' end by ligation of a fragment from the plasmid moC-2.
  • Clone 689 is a cD ⁇ A derived from mouse spleen R ⁇ A as follows: total mouse spleen R ⁇ A is reacted with reverse transcriptase and the primer 5'-ACACACACAGGCCAGT-3'.
  • the reverse transcriptase products are then used as a template for a first PCR reaction with the primers: 5' primer (5'-ATGAATTCGCCAGTCCCAGGTTAGAGGCA-3'), and 3' primer (5'-ATGAATTCAAGTTCTACTCGCGTAGGCGC-3'), and these products were used as a template for second PCR reaction with the primers: 5' primer (5'-ATGAATTCGTTCCAGCAGCCTTGGACTG GTAAGC-3'), and 3' primer (5'-ATGAATTCCCGAAGATGTCTCCAGGATGT- 3').
  • the PCR fragment covers nucleotides 466-969 from the cD ⁇ A sequences for mouse ⁇ l chain of type XIII collagen.
  • cD ⁇ A clone GUT 219.2.4 was used as a template with the PCR primers: 5' primer (5'- ATAAGCTTGAATTCCGAGGGCATGGTGGCGG-3'), and 3' primer (5'-CGAGGCCCGACGATGGACAT-3').
  • GUT 219.2.4 was obtained from a cD ⁇ A library derived from newborn mouse gut R ⁇ A using random hexamers as primers and the You-Prime-cD ⁇ A synthesis kit (Pharmacia) by probmg with reverse transcriptase - PCR clones of the ⁇ C1 to ⁇ C4 domain of mouse type XIII collagen.
  • RT-PCR clones were obtained as follows: newborn mouse gut DNA was used as a template with the primers: 5' primer (5'-ACCTTTGGCCCTGGGGGCGCAGGGAGC-3'), and 3' primer
  • Plasmid P40-1 contains a clone with the translation intiation region and coding sequences up to the BamHI site at nucleotide 1419.
  • mouse type XIII collagen was added to plasmid P40-1 as follows.
  • a mouse type XIII collagen cDNA, MOABCD.5 was used to obtain the 590 bp Stul-Sacl fragment.
  • MOABCD.5 was built from the cDNA clones GUT 229A, GUT 219.1.4, RG.6, and 18 to cover the coding sequences of mouse type XIII collagen except for the alternatively spliced exons 4A, 4B, 12, 13, and 33.
  • Clones GUT 229A and GUT 219.1.4 are obtained from a cDNA library produced from newborn mouse gut RNA using random hexamers as primers and the You-Prime-cDNA synthesis kit (Pharmacia) by probing with reverse transcriptase - PCR clones of the NCI to NC4 domain of mouse type XIII collagen. These RT-PCR clones were obtained as follows: newborn mouse gut DNA was used as a template with the primers: 5' primer (5'-ACCTTTGGCCCTGGGGGCGCAGGGAGC-3'), and 3' primer (5'-AGGGAGAGAAAGGCGATGCTGGCA-3') to produce the M91 fragment.
  • the M91 fragment was used to design primers for subsequent RT-PCR reactions in both the 5' and 3' directions using combinations of primers of mouse and human origin.
  • Clone RG.6 was obtained using 3'-RACE-PCR.
  • the reverse transcriptase reaction was carried out using mouse gut poly(A+) RNA as the template and the primer (5'-GACTGC AGTCGACATCGATTTTTTTTTTTTTTTTTTTTT-3'), followed by a first round of PCR using primers: MR- 15 (5'- GCCTCCAGGAATGAAGGGAGAAGT-3'), and primer Tail (5'-GACTCGAGTCGACATCG-3'), and a second round of PCR using the primers: MR-1 (5'-GGGGGAGAGGGGGAAGAA-3'), and primer Tail
  • the GUT 229A, GUT 219.2.4, RG.6 and 18 inserts were liberated from their respective library vectors by using Notl. These Notl fragments were ligated into the
  • MOABCD.5 was constructed as follows: Clone pBluescript-GUT 219.1.4 was digested with BamHI and EcoRI, and the resulting 960 bp fragment was ligated into fi ⁇ mHI/EcoRI pBluescript-GUT 229A, to give clone MOAB.3. Plasmid pBluescript-I8 was digested with Stul and Hmdlll and the resulting 310 bp fragment was ligated into Stul/ Hindlll digested MOAB.3, to give the clone MOABC.5.
  • the plasmid pBluescript-RG.6 was digested with Xbal and Hindlll, and the resulting 250 bp fragment was ligated into Xbal/Hindl ⁇ l digested MOABC.5, to give the clone MOABCD.5
  • MOABCD.5 was digested with Stul and S ⁇ cl, and the ensuing 673 bp
  • Stul-Sacl fragment was ligated into the Stul and S ⁇ cl sites of clone 689 (plasmid
  • Plasmid moC-2 was digested with with BamHI and
  • pVLC15Al The baculovirus transfer vector was constructed using a PCR fragment covering nucleotides 14 to 1374 of type XV procoUagen cD ⁇ A (Kivirikko et al , J. Biol. Chem. 269: 4773-4779, (1994)).
  • cD ⁇ As for type XV procoUagen were made from human umbilical cord RNA using standard techniques described in, for example, Maniatis et al. , Molecular Cloning A Laboratory Manual. Cold Spring Harbor Laboratory, N.Y. (1989) and Ausubel et al , Current Protocols in Molecular Biology. Greene Publishing Associates and Wiley Interscience, N.Y. (1989).
  • the PCR fragment covering nucleotides 14 to 1374 of type XV procoUagen was made with the PCR primers: 5' primer (5'-GATATCACCCTT CGTCCTCCGCTAAGCTC-3'), and 3' primer (5'-GAATTCTGGCC
  • the PCR fragment contains an EcoRV linker sequence at the 5' end and an EcoRI linker sequence at the 3' end.
  • the PCR fragment is digested with EcoRV and EcoRI, and ligated into the EcoRV-EcoRI sites of pBluescript (SK-).
  • This construct was digested by Sphl (cleaving in the PCR fragment at sequences corresponding to nucleotide 1355 of sequences presented in Kivirikko et al , J. Biol. Chem. 269:4773-4779 (1994)) and EcoRI (digesting at the polylinker of pBluescript).
  • M18K The baculovirus transfer vector was constructed using the polyhedron-based baculovirus transfer vector pVL 1393 (Invitrogen) and pBluescript
  • SK M18kok.ll (pBsM18kok. ll), which is made from the clones SXT-5B5, MM-103
  • the cDNA SXT-5B5 was identified and cloned from a ⁇ gtlO cDNA library made from mouse embryo (Clonetech) as follows.
  • the library was screened using a probe G2 for murine type XIII collagen to identify the clone M ⁇ -1.
  • G2 had been generated by RT-PCR using newborn mouse gut RNA as template, and primers: 5' primer MR-6 (5'-CCGGTGAGCCTGCTTGTCCT-3'), and 3' primer MR-11 (5'-ATGCCATCGGAGGAGGCAG-3').
  • the PCR product was ligated into the vector PCR- 1000 (Invitrogen) and the construct was further digested with EcoRI-Hindlll to give the probe G2.
  • ME-1 covers 2.3-kB of the mouse ⁇ l(XVIII) mRNA (described in Rehn et al., Proc. Nat'l. Acad. Sci. USA 91: 4234- 4238 (1994)) and it was used to rescreen the mouse embryo library and identify the clone SXT-5B5, which was isolated by digesting with EcoRI, and ligating the SXT-5B5 insert into the EcoRI site of pBluescript SK to give the plasmid ⁇ Bs(SK)SXT-5B5 containing 540 bp extreme 5' sequence of the mouse ⁇ l (XVIII) chain clone SXT-5 (SXT-5 is described in Rehn et al. , Proc.
  • the cDNA clone MM-103 was obtained from a ⁇ gtlO cDNA library of poly(A) RNA from adult mouse liver (BALP/c strain) isolated by the guanidium thiocyanate method (Chomczynski et al. , Anal. Biochem. 162: 156-159 (1987)), followed by two rounds of oligo(dT)-cellulose chromatography.
  • a cDNA library was constructed from this RNA using an oligo(dT) primer and the Time-Saver-cDNA synthesis kit (Pharmacia).
  • This library is screened with a probe from ME-1, and he clone MM-103 was isolated by digesting with Notl. The insert MM-103 was ligated into the Nod site of pBluescript SK to give pBs(SK)MM-103. (Rehn et al., J. Biol.
  • the cDNA clone MM-21.3 was identified and cloned from a ⁇ gtlO cDNA library, which had been made from adult mouse liver (BALB /c strain) poly (A) RNA using oligo(dT) method (described above, see also Rehn et al., J. Biol. Chem.
  • the plasmid pBs(SK)SXT-5B5 was digested with EcoRI and the resulting 540 bp fragment was further cloned into EcoRI-digested 5 kB fragment (insert + Bluescript) of plasmid pBs(SK)MM-21.3 to generate plasmid pBsM18kc.AB.pBsM18kc.AB was digested with EcoRV and Nsil, resulting in a 2.5 kB fragment, and plasmid pBsMM 103 was digested with Nsil and Notl resulting in a 1.5 kb fragment.
  • pBsM18kok. ll which contains the full-length cDNA of the shortest variant of the ⁇ l chain of mouse type XVIII collagen (1315 amino acid residues) including 22 bp 5' untranslated region and 180 bp 3' untranslated region.
  • pBsM18kok. ll was digested with EcoRV-Notl, and this fragment is cloned into the Smal-Notl sites of pVl .1393 to give the plasmid M18K.
  • M18VA2K The baculovirus transfer vector was constructed using the polyhedron-based baculovirus transfer vector pVL 1393 (Invitrogen), and pBsv2.5 which was built from cDNA clones PE17.24 (Rehn et al., J. Biol. Chem. 269: 13929-13953 (1994)) and PX4.3 (Rehn et al. , J. Biol. Chem. 269:13929-13953 (1994)), and plasmid pBsM18kok.l l (described previously, see the construct M18K) to generate pBsM18VA2K.
  • the cDNA clone PEI 7.24 is isolated from a cDNA pool made from 18.5 day-old mouse embryo poly(a). RNA with the primer
  • Plasmid v2.5 was built from clones PE17.24 and PX4.3 by digesting the pBsPE17.24 with EcoRI and ligating into the resulting 4.8 kB fragment (insert +
  • Bluescript the EcoRI digested 90 bp fragment of PX4.3 to cover the long 764 residues form of the type XVIII collagen NCI domain.
  • the plasmid v 2.5 was digested with Clal and the resulting 1.5 kB fragment was ligated into a Clal-digested 7.3 kB fragment (insert + Bluescript, the other Clal site in Bluescript) of pBsM18kok.
  • M18NC1 The baculovirus transfer vector was constructed using the cDNA clones SXT-5B5 (described above, see construct M18K) and ME-1 (Rehn et al., Proc. Nat'l. Acad. Sci. 91 :4234-4238(1994)), and the polyhedron- based baculovirus transfer vector pVL 1393 (Invitrogen). SXT-5B5 was identified and cloned as described previously (see construct M18K). ME-1 covers 2.3 kB of the mouse ⁇ (XVIII) mRNA (described in Rehn et al. , Proc. Nat'l. Acad. Sci.
  • NCI N-terminal noncollagenous domain
  • a stop codon is generated to the 3' end of the NCI domain by PCR, using ME-1 as template and the primers: 5' primer - T7, 17-mer primer (5'-AATACGACTCACTATAG-3'), and the 3' primer M18Bacl (5'- GAAGGGGCTTGATAAATGAGGATCCAT-3') including an in-frame stop codon and a BamHI digestion site.
  • the 400 bp PCR product was digested with EcoRI and BamHI and ligated to EcoRI-BamHI-digested pBluescript SK to give plasmid pBsNCII, ⁇ BsSXT-5B5 was digested with EcoRI and the resulting 540 bp fragment was further cloned into the EcoRI- digested pBsNCIL to give the plasmid pBsM18NCl, encoding the NCI domain and 22 bp of 5' untranslated sequences.
  • pBsM18Ncl is digested with EcoRV-Notl and the resulting fragment is cloned into the Smal-Notl sites of the pVI 1393 to give the plasmid M18NC1.
  • M18VA2N The baculovirus transfer vector was constructed using the polyhedron-based baculovirus transfer vector pVL1393 (Invitrogen), and the plasmid pBsM18NCl (described previously, see construct M18NC1), and the plasmid pBsV2.5 (see the construct M18VA2K). Plasmid pBsV2.5 was digested with Clal and the resulting 1.5 kB fragment was cloned into the Clal-digested 4.8 kB fragment of pBsM18NCl to generate the plasmid pBsM18VA2.3 encoding the longest variant aminoterminal noncollagenous domain (NC1-764) of type XVIII collagen al chain.
  • pBsM18VA2.3 is digested with EcoRV-Notl and the resulting fragment is cloned into the Smal-Notl sites of pVL1393 to give the plasmid M18VA2N.
  • M18C The baculovirus transfer vector was constructed using the vector pBluescript (SK)MM-103 (described previously, see the construct M18K) and the polyhedron-based baculovirus transfer vector pVL 1393 (Invitrogen).
  • pBluescript(SK)MM-103 encodes the cDNA for the C-terminus of the a 1 chain of mouse type XVIII collagen in the Notl site of pBluescript SK.
  • pBs(SK)MM-103 was digested with EcoRI-Notl which generates a cDNA fragment covering nucleotides 2802-4080 (see, Rehn et al., J. Biol. Chem. 269:13929-13953 (1994)) with a translation initiation codon at nucleotides 3010-3012 corresponding to the C-terminal noncollagenous domain (amino acid residues 997-1315) with 180 bp of the 3' untranslated region. This fragment is cloned into the EcoRI-Notl sites of the pVL1393 to give the plasmid M18C.
  • the baculovirus transfer vector was constructed using the polyhedrin-based baculovirus transfer vector pVL 1392, and the vector pBS(SK-)S138 which contains the full length cDNA for the ⁇ -subunit of human prolyl 4- hydroxylase in the EcoRI site (Pihlajaniemi et al, ⁇ MBO. J. 6:643
  • the ⁇ -subunit clone HB-95 was obtained from a human hepatoma ⁇ gtll cDNA expression library screened with purified antibodies against human prolyl
  • the vector pBS(SK-)S138 was constructed by identifying the clone S138 (human prolyl 4- hydroxylase ⁇ -subunit) from a ⁇ gtl l library derived from human placenta (Clontech) using HB-95 as a probe for the ⁇ -subunit of human prolyl 4-hydroxylase, releasing the insert from the identified ⁇ gtl 1 clone with EcoRI, and inserting the EcoRI fragment into the EcoRI site of pBS(SK-) (Stratagene) to give pBS(SK-)S138.
  • pSB(SK-)S138 was digested with EcoRI-fi ⁇ mHI to generate the full length cDNA plus 44 bp 5' untranslated and 207 bp 3' untranslated, and this fragment was cloned into the EcoRI- ⁇ /nHI sites of ⁇ VL1392 (Vuori et al , Proc. Natl. Acad. Sci.
  • the baculovirus transfer vector was constructed using the vector pBS(SK-)PA59 which contains the full length cDNA for human prolyl 4-hydroxylase ⁇ -subunit in the Smal site (Helakoski et al, , Proc. Nat'l. Acad. Sci. USA
  • the cone PA59 (human prolyl 4-hydroxylase ⁇ -subunit) was obtained as follows.
  • HTA-2 a 36-mer oligonucleotide derived from the HTA-2 sequence (nucleotides 1430-1465 of the ⁇ -subunit) was used to screen a human placenta ⁇ gtll library (Clontech).
  • the vector pBS(SK-)PA59 was constructed by releasing the ⁇ gtl 1 insert from the clone PA59 by digestion with HihPl and Accl, blunt ending the PA59 fragment with Klenow (Pharmacia Biotech), and cloning the blunt ended PA59 fragment into the Smal site of pBS(SK-) (Stratagene) to give pBS(SK-)PA59.
  • pBS(SK-)PA59 was digested with Pstl and BamHI to generate Pstl-Pstl and Pstl-BamHI fragments containing the full length cDNA plus 61 bp 5' untranslated region, and 551 bp 3' untranslated region, and these fragments are cloned into the Pstl- BamHI sites of pVL1392 (Vuori et al , Proc. Natl. Acad. Sci. USA 89:7467-7470 (1992)) to give the plasmid pVL ⁇ .
  • p2Bac ⁇ pBS(KS-)S138 was constructed by digesting pBS(SK-)S138 with
  • pBS(KS-)S138 was digested with B ⁇ mHI to give the full length ⁇ -subunit of human prolyl 4- hydroxylase including 44 bp 5' untranslated region and 207 bp 3' untranslated region. This fragment was cloned into the B ⁇ mHI site of p2Bac to give p2Bac ⁇ .
  • pBS(SK-)PA59 was mutated by PCR to place a Notl site 46 bp upstream of the initiation codon for the ⁇ -subunit of prolyl 4-hydroxylase to give the plasmid pBS(SK-)PA59/5'UT ⁇ otI as follows.
  • the second PCR product is digested with Clal and Afl ⁇ l, and ligated into pBS(SK-)PA59 to generate pBS(SK-)PA59/5'UTNotI.
  • pBS(SK-)PA59/5'UTNotI is digested with Notl to generate a fragment with the full length ⁇ -subunit of prolyl
  • Recombinant human collagens I, II, III, IV, XIII, XV, and _ XVIII have been expressed in insect cells by means of baculovirus expression vectors.
  • pVLC3Al is a recombinant expression vector encoding the full pro ⁇ l chain of human type III collagen. Similar baculovirus expression vectors pVL ⁇ , pVL ⁇ , and p2Bac ⁇ were created for the expression of 0 human prolyl 4-hydroxylase in insect cells. The constructs were transfected in various combinations into insect cells using a BaculoGold* transfection kit
  • Insect cells (Sf9 or High Five, Invitrogen) were cultured in T ⁇ M-FH medium 5 (Sigma) supplemented with 10% fetal bovine serum (BioClear) or in a serum-free HyQ CCM3 medium (HyClone) either as monolayers or in suspension in shaker flasks at 27°C.
  • T ⁇ M-FH medium 5 Sigma
  • HyQ CCM3 medium HyClone
  • insect cells seeded at a density 5-6 x 105/ml were infected at a multiplicity of 5-10 with the recombinant virus and at a o multiplicity of 1 with the viruses for the ( subunit and ( subunit of human prolyl 4-hydroxylase (Vuori et al , Proc. ⁇ atl. Acad. Sci.
  • Ascorbate 80 ⁇ g/ml was added daily to the culture medium.
  • the cells were harvested 48- 120 h after infection, washed with a solution of 0.15 M ⁇ aCl and 0.02 M phosphate, pH 7.4, homogenized in a 0.3 M ⁇ aCl, 0.2% Triton X-100 and 0.07 M Tris buffer, pH 7.4, and centrifuged at 10,000 x g for 20 min. The remaining cell pellet that was insoluble in the homogenization buffer was further solubilized in 1 % SDS and analyzed by SDS- PAGE1.
  • the cell culture medium was concentrated 10 times in an uitrafiltration cell (Cmicon) with a PM-100 membrane.
  • Sf9 and High Five cells were infected with a recombinant baculovirus coding for the pro ⁇ l (III) chains, harvested 72 h after infection, homogenized in a buffer containing 0.2% Triton X-100 and centrifuged. Aliquots of the Triton X-100 soluble protein fraction and the concentrated cell culmre medium were then analyzed either without pepsin treatment of after treatment with pepsin for lh at 22°C. The samples were electrophoresed on 8% SDS-PAGE and analyzed by Coomassie staining in A and by Western blotting using an antibody to the
  • Lane 1 sets forth molecular weight markers; lanes 2-3, cell extracts; and lanes 4-5, media from Sf9 cell culmres; lanes 6-7, cell extracts; and lanes 8-9, media from High Five cell culmres. Samples in the odd numbered lanes were digested with pepsin.
  • N-propeptide of human type III procoUagen (Farmos Diagnostica) and a colorimetric method for 4-hydroxyproline (Kivirikko et al , Anal. Biochem. 19:249-255 (1967)).
  • pro ⁇ l (III) could be seen by Western blotting in samples of the Triton X-100 soluble proteins (Fig. 6B, lanes 2 and 6) and cell culmre media (Fig. 6B, lanes 4 and 8) in both Sf9 and High Five cells. After the pepsin digestion the (1 chains of type III collagen were seen in the High Five cells in the Coomassie stained gel (Fig. 6A, lane 7). The pepsin resistant (1(111) chains were not detected in the Western blot (Fig. 6B, lanes 3, 5, 7 and 9) since the antibody used reacts only with the N-propeptides of the pro ⁇ l(III) chains, which were apparently digested by pepsin.
  • type III collagen in the samples was calculated by multiplying the N-propeptide values obtain by 7 and the 4-hydroxyproline values by, 8. AU measurements were made 72 h after the infection.
  • TABLE II provides: First, the amount of 4-hydroxyproline formed was in all experiments distinctly higher in cells infected with the prolyl 4-hydroxylase-coding viruses than in their absence. Second, the expression level obtained in High Five cells was consistently higher than that obtained in Sf9 cells. Third, in cells coinfected with the prolyl 4-hydroxylase-coding viruses the level of type III collagen produced was always higher when calculated from the 4-hydroxyproline values than from the radioimmuno assay values, suggesting either that some of the N-propeptides of type III procoUagen were degraded or that some of the fully 4-hydroxylated pro ⁇ l (III) chains remained nontriple-helical.
  • the highest type III collagen expression values were in the High Five cells that also expressed prolyl 4-hydroxylase, the amount of cellular type III collagen in these cells being about 41-81 ⁇ g/5 x 106 cells (TABLE III).
  • Baculovirus expression vectors pVLClAl and pVLClA2 were created for the expression of the pro ⁇ l chain and the pro ⁇ 2 chain of human collagen I, and pVLC3A15'UT/C2Al was created for the expression of the pro ⁇ l chain of human collagen II.
  • insect cells were cultured, and recombinant collagen produced following the procedures supra.
  • pro ⁇ l (I), and pro ⁇ l (I) and pro ⁇ 2 (I) in the presence of prolyl 4-hydroxylase, and following pepsin digestion of the supernatants from cell homogenates could be seen in silver-stained 5% SDS-PAGE. See Figure 7, lanes (DIA 1).
  • the silver-stained SDS PAGE revealed the formation of triple-helical procoUagen I in these cells.
  • Homotrimeric collagen can be separated from heterotrimeric collagen I on a metal chelate affinity column through the use of a histidine-tag to the C-terminal domain of the pro ⁇ 2 chain.
  • pro ⁇ l (II) in the presence of prolyl 4-hydroxylase could be seen in coomassie stained 5 % SDS PAGE. See Figure 8 (wherein lane 1 depicts the expression of a homotrimer of type I collagen; lane 2 is a standard sample of type II procoUagen; lane 6 is a standard sample of type III procoUagen; and lanes 3-5 compare three different constructs of human type II procoUagen containing varying amounts of human procoUagen type III.
  • Lane 3 is type II procoUagen with the C-terminal end of type III procoUagen; lane 4 is type II procoUagen with the N-terminal non-collagenous region from type III procoUagen; and lane 5 is type II procoUagen with the N- and C-terminal regions of type III procoUagen).
  • baculovirus vectors for the expression of human type II collagen were constructed. In one of these vectors, the 5' untranslated region of human type II collagen was replaced with human type III collagen 5' untranslated region. In another vector, the entire human type II collagen gene was expressed. In another insect expression vector, the N-propeptide of type II collagen was replaced with an N-propeptide of type III collagen. AU three of those vectors were found to express human type II collagen in varying levels. Expression was detected by Coomassie Blue stain SDS-PAGE and by Western blot analysis.
  • pVLC4Al is a recombinant baculovirus expression vector encoding the pro ⁇ l chain of human collagen IV.
  • pVLhuXIII is a recombinant baculovirus vector encoding the pro ⁇ l chain of human collagen XIII.
  • pVLC15Al is a recombinant expression vector encoding the pro ⁇ l chain of human collagen XV.
  • M18K and M18VA2K are recombinant expression vectors encoding two variants of the pro ⁇ l chain of human collagen type XVIII.
  • insect cells were cultured and recombinant collagen produced following the procedures supra.
  • pVLC4Al, pVLhuXIII, pVLC15Al, M18K, and M18VA2K have been transformed into insect cells, and the recombinant collagens have been successfully expressed.
  • Insect cells expressing the recombinant type III procoUagen were washed with a solution of 0.15 M NaCl and 0.02 M phosphate, pH 7.4, homogenized in a cold 0.2 M. NaCl, 0.1 % Triton X-100 and 0.05 M Tris buffer, pH 7.4 (20 x 106 cells/ml), incubated on ice for 30 min, and centrifuged at 16,000 x g for 30 min. Unless otherwise mentioned, all the following steps were performed at 4°C.
  • the supernatant was chromatographed on a DEAE cellulose column (DE-52, Whatman) equilibrated and eluted with a 0.2 M NaCl and 0.05 M Tris buffer, pH 7.4, the void volume being collected.
  • the pH of the sample was lowered to 2.0-2.5, and the sample was digested with a final concentration of 150 ⁇ g/ml of pepsin for 1 h at 22°C.
  • Pepsin was irreversibly inactivated by neutralization of the sample followed by an overnight incubation on ice.
  • the recombinant type III collagen was precipitated by adding solid NaCl to a final concentration of 2 M and centrifugation at 16,000 x g for 1 h.
  • the pellet was dissolved in a 0.5 M NaCl, 0.5 M urea, and 0.05 M Tris buffer, pH 7.4, for 1 day, and the sample was digested with pepsin as above for a second time.
  • the sample was then chromatographed on a Sephacryl HR-500 gel filtration column (Pharmacia), eluted with a solution of 0.2 M NaCl and 0.05 M Tris, pH 7.4, dialyzed against 0.1 M acetic acid and lyophilized.
  • Type III procoUagen was expressed in High Five cells cultured either as monolayers or in suspension in shaker flasks. The cells were harvested 72 h after infection, homogenized in a buffer containing 0.1 % Triton X-100 and centrifuged, and the supernatant of the cell homogenate was passed through a DEAE cellulose column to remove nucleic acids. The flow through fractions containing the type III procoUagen were pooled and digested with pepsin. This converted the type III procoUagen to type III collagen and digested most of the noncollagenous proteins. The type III collagen was then concentrated by salt precipitation, solubilized and treated with pepsin as above.
  • the type III collagen was finally separated from pepsin and other remaining contaminants by gel filtration on a Sephacryl S 500-HR column. The fractions containing the type III collagen were pooled, dialyzed and lyophilized. The purified type III collagen was analyzed by 5 % SDS-PAGE under reducing ( Figure 9, lane 2) and nonreducing ( Figure 9, lane 3) conditions. No contaminants were seen in the Coomassie stained gel and the type III collagen (1 chains were disulfide-bonded. Amino acid and CD spectmm analysis were performed on the purified type III collagen. The amino acid composition of the recombinant type III obtained corresponded well with the amino acid composition reported for human type III collagen.
  • the High Five cells gave consistently higher production rates than Sf9 cells, 5 the highest production rates seen in High Five cells culmred in monolayers ranging up to about 80 ⁇ g of cellular recombinant human type III collagen/5 x 106 cells, which conesponds to about 120 ⁇ g of type III procoUagen.
  • the highest amount of cellular type _ III collagen produced ranged up to about 40 mg/l, corresponding to about 60 mg/l of type III procoUagen.
  • the thermal stability of the type III collagen expressed under different cell culmre conditions was studied by using digestion with a mixture of trypsin and chymotrypsin after heating to various temperatures (Bruckner, et al. , Anal. Biochem. 110:360-368 (1981)).
  • High Five cells were infected with vimses coding for the pro ⁇ l (III) chains and the ( and ( subunits of human prolyl 4-hydroxylase.
  • the cells were harvested 72 h after infection, homogenized in a buffer containing 0.2% Triton X-100 and centrifuged. In these experiments, ascorbate was either added daily to the cell culture medium as usual or omitted during the infection.
  • Triton X-100 soluble proteins were first digested with pepsin for 1 h at 22°C to convert type III procoUagen to type III collagen (Pihlajaniemi et al , EMBO J. 6:643-649 (1987)), and the trypsin/chymotrypsin digestion was then performed for aliquots of the pepsin-treated samples. The samples were then electrophoresed on 8% SDS-PAGE and analyzed by Coomassie staining. Figure 11 provides the results of this thermal stability for a variety of collagen products.
  • the cells were infected only with the vims coding for the pro ⁇ l (III) chains, and ascorbate was omitted from the culmre medium;
  • panel B the cells were infected only with the vims coding for the pro ⁇ l (III) chains, and ascorbate was present in the culmre medium as usually;
  • panel C the cells were coinfected with vimses coding for the pro ⁇ l (III) chains, and the ( and ( subunits of prolyl 4-hydroxylase, but ascorbate was omitted from the culmre medium;
  • panel D the cells were infected with the three vimses, and ascorbate was present in the culture medium.
  • Lane P shows a sample digested with pepsin without subsequent trypsin/chymotrypsin digestion
  • lanes 27-42 show samples treated with the trypsin/chymotrypsin mixture at the temperatures indicated.
  • the arrows show the position of the (1 (III) chains.
  • Collagen Types I and II Purification and analvsis of Collagen Types I and II.
  • Collagens types I and II were purified as described supra.
  • the recombinant type II human collagen expressed from the recombinant insect cells was found to exhibit resistance to trypsin and chymotrypsin digestion. These protease digestion experiments indicated that triple helical type II human collagen was formed in the recombinant insect cells.
  • the thermal stability of the recombinant type II human collagen expressed from the recombinant insect cells was measured and compared with native type I human collagen. These results indicated that the recombinant type II collagen had a triple helical structure.
  • the Tm of the recombinant type II collagen was up to about 40°C.
  • Example 11 Expression of Recombinant CoUagen Genes in Yeast Cells Expressing Recombinant Genes for Prolyl 4-Hydroxylase
  • the 3' end of the COL III gene was synthesized by PCR from the 4195 bp downstream (EcoRI site) of the translation initiation codon to the stop codon (4401 bp) using pBluescript SM38 as a template and the PCR primers: 5' primer
  • pBS-SM38 is derived from sequences presented in Ala-Kokko et al Biochem. J. 260: 509-516 (1989)), and GenBank accession number X14420) to give the plasmid pBluescript-SM38/B.
  • the 5' end of the Col III gene was synthesized from 73 bp downstream of the translation initiation codon to 176 bp (BamHI site) by PCR (for sequences, see
  • pBluescript SM38 Ala-Kokko et al , Biochem.. J. 260:509-516 (1989)) using pBluescript SM38 as the template and the PCR primers: 5' primer (5'-GCGATCGATGC GGCCGCGCAGGAAGCTGTTGAAGGAGG-3'), and 3' primer (5'-GAGAA CGGATCCTGAGTCAC-3'). Clal and Notl sites were created in the 5' end of the PCR fragment.
  • pBluescript-SM38/B was digested with Clal and ⁇ mHI, and the fragments from this digest and the 5' PCR fragment were ligated with T4 ligase to give the plasmid pBluescript-SM38/ll.
  • pBluescript-SM38/l l was digested by Notl and the Notl-Notl collagen fragment (73-4401 bp) was cloned in frame with the ⁇ -factor signal sequence in the yeast expression vector pPIC9 (Invitrogen) to give the plasmid pPIC9COLIII. ⁇ HIL-D2/colIII.
  • the 3' end of the COL III gene was synthesized by PCR from the 4195 bp downstream (EcoRI site) of the translation initiation codon to the stop codon (4401 bp) using pBluescript-SM38 as the template D ⁇ A and the primers:
  • Xbal fragment from pBluescript-C3Al/10, containing collagen sequences from (nucleotides - 16 to 4401) is ligated into the EcoRI site of pHIL-D2 (Invitrogen) to give plasmid PHII-D2/colHI.
  • pYM25 was digested with Hpal and the fragment containing the ARG4 gene of Saccharomyces cerevisiae was isolated and cloned into the EcoRV sites of pAO815 (Invitrogen) replacing the HIS4 gene with ARG4, to give the plasmid pARG815.
  • a cDNA of the ⁇ subunit of human prolyl 4-hydroxylase (Vuori et al.
  • This PCR fragment was digested with EcoRI and cloned into pBluescript SK, to give pBluescript SK ⁇ /20.
  • pBluescript SKB/20 was digested with EcoRI and this fragment was cloned into the EcoRI site of pAO815 (Invitrogen), to give the plasmid pAO815 ⁇ which has a single expression cassette for the ⁇ -subunit of prolyl 4-hydroxylase.
  • pARG815 ⁇ The 5' end of the ⁇ -subunit of prolyl 4-hydroxylase was synthesized by PCR from the translation initiation codon to 689 bp downstream (HmdIII site), and HmdIII and Sm ⁇ l sites were created in the 5' end of the fragment.
  • pBS(SK-)PA59 was used as the template DNA with the primers: 5' primer (5'- GCGAAGCTTCCCGGGATGATCTGGTATATATTA-3'), and 3' primer (5'-GGATCTAGTTCAAGAAGCTT-3').
  • pA-59 (Vuori et al , Proc. Nat'l. Acad. Sci. USA 89:7467-7470 (1992)) was digested with HmdIII and the large fragment was isolated and ligated with the 5' PCR fragment to give pA-59/15.
  • the 3' end of the ⁇ -subunit was synthesized by PCR from 1373 bp (Pstl site) downstream of the translation initiation codon to the translation stop codon, and Smal and B ⁇ m ⁇ I sites were created in the 3' end of the fragment.
  • pBS(SK-)PA59 was used as the template DNA with the primers: 5' primer
  • pA-59/15 was digested with Pstl and BamHI, and the large fragment was isolated, and ligated with the 3' PCR fragment to give pA-59/3.
  • pA-59/3 was digested with Smal and the Smal-Smal ⁇ -subunit fragment was cloned into the EcoRI site of pARG815, to give pARG815 ⁇ .
  • pARG815 ⁇ 5'-GCGGGATCCCCCGGGTCATTCCAATTCTGACAACG-3'.
  • ⁇ AO815 ⁇ was digested with Bglll and BamHI to excise the expression cassette, and the expression cassette is cloned into the BamHI site of pARG815 ⁇ to give the vector pARG815 ⁇ fi.
  • pAO815 ⁇ - is similar to pAO815 ⁇ , but contains two cassettes of the ⁇ subunit of the human prolyl 4-hydroxylase gene.
  • pAO815 ⁇ was digested with BgUl and BamHI to excise the expression cassette, and the expression cassette is cloned into the BamHI site of pARG815 ⁇ to give the vector ⁇ ARG815 ⁇ .
  • the ⁇ -subunit without its signal sequence was synthesized by PCR from 52 bp downstream of the translation initiation codon to the translation stop codon. EcoRI restriction sites were created in 5' and 3' ends. This PCR fragment was cloned into the EcoRI site of ⁇ SP72 (Promega).
  • Pichia pastoris host strain GS200 his4 arg4 was stably transformed with combinations of the plasmid described supra and related plasmids to produce the following recombinant strains.
  • P. pastoris Col Ill ⁇ - carries the human Col III gene with ⁇ -MFSS and both subunits of the human Prolyl 4- hydroxylase.
  • P. pastoris nCol III - is similar to P. pastoris nCol III ⁇ , but uses the native Col III signal sequence.
  • P. pastoris ⁇ - carries both subunits of human prolyl 4-hydroxylase.
  • P. pastoris ⁇ contains human prolyl 4-hydroxylase, wherein the ⁇ : ⁇ gene ratio fs 1:2.
  • P. pastoris ⁇ contains the human prolyl 4-hydroxylase ⁇ gene.
  • P. pastoris ⁇ contains the human prolyl 4-hydroxylase ⁇ gene.
  • the P. pastoris strains described in paragraph 5 were grown in rotary shakers to an 0D600 of 5.0. Samples were taken and mn on PAGE gels. Western blots were performed and analyzed with antibodies against proCol III N- terminal peptide, the ⁇ -subunit of human prolyl 4-hydroxylase and the ⁇ -subunit of human prolyl 4-hydroxylase. The Western blots described in paragraph 6 demonstrated that both human collagen III and human prolyl 4- hydroxylase were produced in P. pastoris.
  • Pepsin digestion experiments were performed to test for triple helical structure in the human collagen produced in P. pastoris. Whereas most proteins are degraded by the proteolytic enzyme pepsin, the triple helical region of collagen is pepsin resistant.
  • the collagen from cell lysates of P. pastoris Col Ill ⁇ were digested with pepsin, and the digestion products were separated by SDS-PAGE. The results of these experiments indicated that triple helical human collagen III was produced in the recombinant P. pastoris cells.
  • pSFVmoXIII The Semliki Forest expression vector was constructed using the vector pBSmoXIII generated based on clones and sequences as described for pVLmoXIII above (Rehn et al, submitted; Peltonen et al, submitted) and the eukaryotic expression vector pSFV-1 (Liljestrom et al , Bio/tecnologv 9:1356-1361 (1991)).
  • pBSmoXIII is digested with EcoRI to generate the full-length type XIII collagen variant with seven bp 5' untranlsated region and 288 bp 3' untranslated region, and this fragment is made blunt ended with Klenow, and cloned into the Smal site of pSFV-1 to give the plasmid pSFVmoXIII.
  • pSFVmoXIII plasmid was used to produce RNA by in vitro transcription using M ⁇ GAscript 8 in vitro transcription kit by Ambion. Baby hamster kidney (BNK) cells transfected with the RNA as described in Lilegestrom et al. , Current Protocols in Molecular Biology 2:16-20 (1991). Synthesis of full-length chains for mouse type XIII collagen were observed in the BHK cells by Western blotting of SDS-polyacrylamide gel- fractionated cell extracts.
  • Semliki Forest vims is preferred as the vims because it has a broad host range such that infection of the above mentioned mammalian cell lines will also be possible. More specifically, it is expected that the use of the Semliki Forest vims can be used in a wide range of hosts, as the system is not based on chromosomal integration, and therefore it will be a quick way of obtaining modifications of the recombinant collagens in studies aiming at identifying stmcmre-function relationships and testing the effects of various hybrid molecules. In addition, it is expected that use of the Semliki Forest vims will be used in a wide range of hosts, as the system is not based on chromosomal integration, and therefore it will be a quick way of obtaining modifications of the recombinant collagens in studies aiming at identifying stmcmre-function relationships and testing the effects of various hybrid molecules. In addition, it is expected that use of the Semliki Forest vims will be used in a wide range of hosts
  • HeLa cells and the vaccinia vims-based expression system can also be used to express collagens in mammalian cells, and will preferably be used to expresst type
  • IV collagens as homo- and hetero- trimer isoforms of the six type IV collagen 2 resort0_ chains.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biochemistry (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biophysics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Toxicology (AREA)
  • General Engineering & Computer Science (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

Methods of making collagen with hosts, and vectors that express collagen, and collagen post-translation enzymes are disclosed. Collagen post-translation enzymes include prolyl-4-hydroxylase, lysyl hydroxylase, lysyl oxidase, C-proteinase, and N-proteinase, and these enzymes increase the yield of properly folded, recombinant collagen in non-mammalian hosts. The collagens produced by these methods, hosts, and vectors include both homotrimer and heterotrimer collagen made from single or multiple collagen genes, respectively.

Description

SYNTHESIS OF HUMAN PROCOLLAGENS AND COLLAGENS IN RECOMBINANT DNA SYSTEMS
RELATED APPUCATIONS
This application is a continuation-in-part of United States Applications, Serial No. 08/631,336, filed April 12, 1996 ("the '336 Application"), which is a continuation-in-part of United States Applications, Serial No. 08/211,820, filed August 11, 1994 ("the '820 Application"). The '820 Application is a U.S. National Application, pursuant to 35 U.S.C. § 371, of PCT Application Serial No. PCT/US92/09061, filed October 22, 1992, which is a continuation-in-part of U.S. Application No. 07/780,899, filed October 23, 1991, now abandoned. The '860 Application is a continuation-in-part of the '820 Application and United States Application Serial No. 08/210,063, filed March 16, 1994, which is a U.S. National Application, pursuant to 35U.S.C.§ 371, of PCT Application Serial No. PCT/US92/22333, filed June 10, 1992, which is a continuation of US Application
Serial No. 07/713,945, filed June 12, 1991, now abandoned. Each of these applications is incorporated herein by reference. Portions of the invention described herein were made in the course of research supported in part by NIH grants
AR38188 and AR39740. The Government may have certain rights in this invention.
/. FIELD OF THE INVENTION
The present invention is directed to the recombinant production of procollagen, collagen and fragments thereof.
//. BACKGROUND OF THE INVENTION
The ExtraCeUular Matrix. The most abundant component of the extracellular matrix is collagen. Collagen molecules are generally the result of the trimeric assembly of three polypeptide chains contaimng, in their primary sequence,
(-Gly-X-Y-)n repeats which allow for the formation of triple helical domains (van der
Rest et al. FASEB J. 5:2814-2823 (1991)). During their biosynthesis, the three polypeptide chains comprising collagen undergo various post-translational 5 modifications which permit the formation of these triple helical domains (Van der Rest et al. , Adv. Mol. Cell Biol. 6: 1-67 (1993)). For example, the proline residues of collagen are hydroxylated into 4- hydroxyproline, thereby allowing for the formation of interchain hydrogen bonds by the enzyme prolyl 4-hydroxylase (Kivirikko e*t al. , Post-translational modifications of proteins (Harding, J. J., Crabbe, M. J. C, eds) pp. 1-51 , CRC Press, Boca Raton, FL (1992)). The triple-helical molecule is then further processed to render collagens. For example, the N-propeptide and C-propeptide comprising the collagen precursor molecule,
15 "procollagen, " are cleaved during post-translational events by the enzymes N-proteinase and C-proteinase, respectively.
As a consequence of the diverse structural and functional properties of collagen in its various forms or "types," collagen can contribute significantly to the
_ Q high diversity of the extracellular matrix.
Collagen Types. Nineteen distinct collagen types have been identified in vertebrates, including bovine, ovine, porcine, chicken and human collagens. These collagen types are numbered by Roman numerals and the chains found in each collagen type are identified with Arabic numerals. A detailed description of structure 25 and biological functions of the various different types of naturally occurring collagens can be found, among other places, in Ayad et al. , The Extracellular Matrix Facts
Book. Academic Press, San Diego, CA; Burgeson, R. E., and Nirnmi, "Collagen types: Molecular Structure and Tissue Distribution," Clin. Orthop. 282:250-272
30 (1992); Kielty, C IA. et al. , "The Collagen Family: Structure, Assembly And Organization In The Extracellular Matrix," in Connective Tissue And Its Heritable Disorders. Molecular Genetics. And Medical Aspects. Royce, P. M. and Steinmann, B., Eds., Wiley-Liss, NY, pp. 103-147 (1993).
35 Type I collagen is the major fibrillar collagen of bone and skin. Type I collagen is a heterotrimeric molecule comprising two αl(I) chains and one α2(I) chain. Details on preparing purified type I collagen can be found, among other places, in Miller et al. , Methods In Enzymology 82:33-64 (1982), Academic Press.
Type II collagen is a homotrimeric collagen comprising three identical α(II) chains. Purified Type II collagen may be prepared from tissues by, among other methods, the procedure described in Miller et al. , Methods In Enzymology. 82:33-64 (1982), Academic Press.
Type III collagen is a major fibrillar collagen found in skin and vascular tissues. Type III collagen is a homotrimeric collagen comprising three identical α(III) chains. Methods for purifying type III collagen from tissues can be found in, among other places, Byers et al. , Biochemistry 13_: 5243-5248 (1974) and Miller et al. , Methods in Enzymology 82:33-64 (1982), Academic Press.
Type IV collagen is found in basement membranes in the form of a sheet rather than fibrils. The most common form of type IV collagen contains two αl(IV) chains and one α2(IV) chain. The particular chains comprising type IV collagen are tissue-specific. Type IV collagen may be purified by, among other methods, the procedures described in Furuto et al. , Methods in Enzymology 144:41-61 (1987), Academic Press.
Type V collagen is a fibrillar collagen found in, primarily, bones, tendon, cornea, skin, and blood vessels. Type V collagen exists in both homotrimeric and heterotrimeric forms. One type of type V collagen is a heterotrimer of two αl(V) chains and α2(V). Another type of type V collagen is a heterotrimer of αl(V), cκ2(V), and α3(V). Yet another type of type V collagen is a homotrimer of αl(V).
Methods for isolating type V collagen from natural sources can be found, among other places, in Elstrow et al. , Collagen Rel. Res. 3:181-193 (1983) and Abedin et al., Biosci. Rep. 2:493-502 (1982). Type VI collagen has a small triple helical region and two large non-collagenous remainder portions. Type VI collagen is a heterotrimer comprising αl(VI), α2(VI), and α3(VI) chains. Type VI collagen is found in many connective tissues. Descriptions of how to purify type VI collagen from natural sources can be found, among other places, in Wu et al. , Biochem. J. 248:373-381 (1987), and Kielty et al, J. Cell Sci. 99:797-807. Type VII collagen is a fibrillar collagen found in particular epithelial tissues. Type VII is a homotrimeric molecule of three αl(VII) chains. Descriptions of how to purify type VII collagen from tissue can be found in, among other places, Lundstrom et al., J. Biol. Chem. 261:9042-9048 (1986), and Bentz et al , Proc. Natl. Acad. Sci. USA 80:3168-3172 (1983).
Type VIII collagen can be found in Descemet's membrane in the cornea. Type VIII collagen is a heterotrimer comprising two αl(VIII) chains and one α2(VIII) chain, although other chain compositions have been reported. Methods for the purification of type VIII collagen from nature can be found, among other places, in Benya et al. , J. Biol. Chem. 261:4160-4169 (1986), and Kapoor et al., Biochemistry 25:3930-3937 (1986).
Type IX collagen is a fibril associated collagen which can be found in cartilage and vitreous humor. Type IX collagen is a heterotrimeric molecule comprising αl(IX), α2(IX), and α3(IX) chains. Procedures for purifying type IX collagen can be found, among other places, in Duance et al., Biochem. J. 221 :885-889 (1984), Ayad et al. , Biochem. J. 262:753-761 (1989), Grant et al., The Control of Tissue Damage. Glauert, A. M. , Ed. , El Sevier, Amsterdam, pp. 3-28 (1988).
Type X collagen is a homotrimeric compound of αl(X) chains. Type X collagen has been isolated from, among other tissues, hypertrophic cartilage found in growth plates.
Type XI collagen can be found in cartilaginous tissues associated with type II and type IX collagens, as well as other locations in the body. Type XI collagen is a heterotrimeric molecule comprising αl(XI), α2(XI), and α3(XI) chains. Methods for purifying type XI collagen can be found, among other places, in Grant et al., In The Control of Tissue Damage. Glauert, A. M., ed., El Savier, Amsterdam, pp.3-28 (1988).
Type XII collagen is a fibril associated collagen found primarily associated with type I collagen. Type XII collagen is a homotrimeric molecule comprising three αl(XII) chains. Methods for purifying type XII collagen and variants thereof can be found, among other places, in Dublet et al , J. Biol. Chem. 264:13150-13156 (1989), Lundstrum et al. , J. Biol. Chem. 267:20087-20092 (1992), Watt et al., L Biol. Chem. 267:20093-20099 (1992).
Type XIII is a non-fibrillar collagen found, among other places, in skin, intestine, bone, cartilage, and striated muscle. A detailed description of the type
XIII collagen may be found, among other places, in Juvonen et al. J. Biol. Chem. 267:24700-24707 (1992).
Type XIV is a fibril associated collagen. Type XIV collagen is a homotrimeric molecule comprising three αl(XIV) chains. Methods for isolating type
XIV collagen can be found, among other places, in Aubert-Foucher et al., J. Biol. Chem. 266:19759-19764 (1992) and Watt et al. , J. Biol. Chem. 267:20093-20099 (1992).
Type XV collagen is homologous in structure to type XVIII collagen.
15 Information about the structure and isolation of natural type XV collagen can be found, among other places, in Myers et al., Proc. Natl. Acad. Sci. USA
89: 10144-10148 (1992), Huebner et al., Genomics 14:220-224 (1992), Kivirikko et al., J. Biol. Chem. 269:4773-4779 (1994), and Muragaki, J. Biol. Chem.
20 264:4042-4046 (1994).
Type XVI collagen is a fibril associated collagen, found in skin, lung fibrobiast, keratinocytes, and elsewhere. Information on the structure of type XVI collagen and the gene encoding type XVI can be found, among elsewhere, in Pan et al., Proc. Natl. Acad. Sci. USA 1989:6565-6569 (1992), and Yamaguchi et al , L 25
Biochem. 112:856-863 (1992).
Type XVII collagen is a hemidesmosal transmembrane collagen. Information on the structure of type XVII collagen and the gene encoding type XVII collagen can be found, among elsewhere, in Li et al , J. Biol. Chem. 268(12):8825-8834 (1993),
30 and McGrath et al. , Nat. Genet. ϋ(l):83-86 (1995).
Type XVIII collagen is similar in structure to type XV collagen and can be isolated from the liver. Descriptions of the structures and isolation of type XVIII collagen from natural sources can be found, among other places, in Rehn et al. ,
35 Proc. Natl. Acad. Sci USA 9.1:4234-4238 (1994), Oh et al, Proc. Natl. Acad. Sci USA 91:4229- 4233 (1994), Rehn et al, J. Biol. Chem. 269: 13924-13935 (1994), and Oh et al, Genomics 19:994-999 (1994).
Type XIX collagen' s gene structure classify it as another member of the
FACIT collagenous family. Type XIX mRNA was recently isolated from rhabdomyosarcoma cell. Descriptions of the structures and isolation of type XIX collagen can be found, among other places, in Inoguchi et al. , J. Biochem. 117: 137-146 (1995), Yoshioka et al , Genomics 13:884-886 (1992), Myers et al , L Biol. Chem. 289: 18549-18557 (1994).
Post-Translational Enzymes. Prolyl 4-hydroxylase is an important post-translational enzyme necessary for the synthesis of procollagen or collagen by cells. The enzyme is required to hydroxy late prolyl residues in the Y-position of the repeating -Gly-X-Y- sequences to 4-hydroxyproline. Prockop et al , N. Engl. J. Med. 3Jl:376-386 (1984). Unless an appropriate number of Y-position prolyl residues are hydroxy lated to 4-hydroxyproline by prolyl 4- hydroxylase, the newly synthesized chains cannot fold into a triple-helical conformation at 37°C. Moreover, if the hydroxylation does not occur, the polypeptides remain non-helical, are poorly secreted by cells, and cannot self- assemble into collagen fibrils.
Prolyl-4-hydroxylase from vertebrates is an alβ2 tetramer. Berg et al. , L.
Biol. Chem. 248:1175-1192 (1973); Tuderman et al. , Eur. J. Biochem. 52:9-16
(1975). The a subunits "63 kDa) contain the catalytic sites involved in the hydroxylation of prolyl residues but are insoluble in the absence of β subunits. The β subunits ("55 kDa) were found to be identical to the protein disulfide isomerase, which catalyzes thiol/disulfide interchange in a protein substrate, leading to the formation of the set of disulfide bonds which permit establishment of the most stable state of the protein. The β subunits retain 50% of protein disulfide isomerase activity when part of the prolyl-4-hydroxylase tetramer. Pihlajaniemi et al. , Embo J. 6:643-649 (1987); Parkkonen et al , Biochem. J. 256: 1005-1011 (1988); Koivu et al , J. Biol. Chem. 262:6447-6449 (1987)). Recently, active recombinant human enzyme has been produced in insect cells by simultaneously expressing the α and β subunits in Sf9 cells. Vuori et al , Proc. Natl. Acad. Sci. USAE89: 7467-7470 (1992). In addition to prolyl-4-hydroxylase, other collagen post-translational enzymes have been identified and reported in the literature, including C-proteinase,
N-proteinase, lysyl oxidase, and lysyl hydroxylase.
Attempts to Express Collagen. Expression of many exogenous genes is 5 readily obtained in a variety of recombinant host- vector systems. Expression, however, becomes difficult to obtain if the final formation of the protein requires extensive post-translational processing. This is the likely reason that, prior to the present invention, expression of properly formed collagen in a fully recombinant system has not been reported. See Prockop et al , N. Engl. J. Med. 311:376-386 (1984).
Notably, rescue experiments in two different systems that synthesized only one of the two chains for type I procoUagen have been reported. Specifically, it was 5 found that a gene for the human fibrillar procoUagen proαl(I) chain, the COLI A 1 gene, can be expressed in mouse fibroblasts and the chains used to assemble molecules of type I procoUagen, the precursor of type I collagen. However, the reports are limited to the proα2(I) chains of mouse origin. Hence, the type I Q procoUagen synthesized is a hybrid molecule of human and mouse origin.
Similarly, expression of a rat exogenous proα2(I) gene to generate type I rat procoUagen have been reported. Thus, synthesis of a recombinant procoUagen molecule in which all three chains are derived from exogenous genes was not obtained in the art. 5
Failure to obtain expression of genes for human collagens has made it impossible to prepare human procollagens and collagens that have a number of therapeutic uses in man and that will not produce the undesirable immune responses that have been encountered with use of collagen from animal sources. Also, many 0 types of collagen are only available in trace quantities in tissues and can only be obtained in significant quantities by recombinant production.
5 ///. SUMMARY OF THE INVENTION
Methods. The present invention comprises the expression of at least one nucleic acid sequence encoding a collagen chain, and at least one nucleic acid sequence encoding a collagen post-translational enzyme.
More specifically, the present invention provides for methods of expressing at least a single procoUagen or collagen gene (or other nucleic acid molecule) or a number of different procoUagen or collagen genes (or other nucleic acid molecule) within a cell. Further, it is contemplated that there can be one or more copies of a single procoUagen or collagen gene (or other nucleic acid molecule) or of the number of different such genes introduced into cells (/. e. , transformation or transduction) and expressed. The present invention provides that these cells can be transformed or transfected with nucleic acids encoding collagen and enzymes that modify collagen so that they express at least one procoUagen or collagen chain, preferably human, that will assemble into a homotrimer or heterotrimer procoUagen or collagen.
In one embodiment of the present invention, the method utilizes a procoUagen or collagen gene (or other nucleic acid molecule) transfected into and expressed within cells which are a mutant, variant, hybrid or recombinant gene (or other nucleic acid molecule). Such mutant, variant, hybrid or recombinant gene may include, for example, a mutation which provides unique restriction sites for cleavage of the hybrid gene.
In a further embodiment of the present invention, such mutations provide one or more unique restriction sites do not alter the amino acid sequence encoded by the nucleic acid molecule, but merely provide unique restriction sites useful for manipulation of the molecule. Thus, the modified molecule would be made up of a number of discrete regions, or D-regions, flanked by unique restriction sites. These discrete regions of the molecule are herein referred to as cassettes. For example, cassettes designated as Dl through D4.4 are shown in Figure 4. Molecules formed of multiple copies of a cassette are another variant of the present gene which is encompassed by the present invention. Recombinant or mutant nucleic acid molecules or cassettes which provide desired characteristics such as resistance to endogenous enzymes such as collagenase are also encompassed by the present invention.
A novel feature of the methods of the invention is that relatively large amounts of a human procoUagen or collagen can be synthesized in a recombinant cell culture system that does not make any other procoUagen or collagen. Systems that make other procoUagens or collagens are preferred because of the extreme difficulty of separating the product of the endogenous genes for procoUagen or collagen from recombinant collagen products. Using methods of the present invention, purification of procoUagen, including human, bovine, porcine, chicken and other mammalian collagens, is greatly facilitated. Moreover, it has been demonstrated that the amounts of protein synthesized by the methods of the present invention are high relative to other systems used in the art. Other novel features of the methods of the present invention are that procoUagens synthesized are correctly folded proteins so that they exhibit the normal triple-helical conformation characteristic of procoUagens and collagens. Therefore, the procoUagens can be used to generate stable collagen by cleavage of the procoUagens with proteases.
The present invention provides methods for the production of procoUagens or collagens derived solely from transformed or transfected procoUagen and collagen genes, such methods are not limited, however, to the production of procoUagen and collagen derived solely from transformed or transfected genes.
Vectors. The present invention is also directed to vectors and plasmids used in the methods of the invention. Such vectors and/or plasmids are comprised of the nucleic acid sequence encoding the desired procoUagens and collagens and necessary promoters, and other sequences necessary for the proper expression of such procoUagens and collagens. In a preferred embodiment, the vectors and plasmids of the present invention further include at least one sequence encoding one or more post-translational enzymes.
It is an object of the invention to construct expression vectors for various host cells that contain collagen genes from human and other sources, and to construct expression vectors that contain various collagen post- translation modification enzymes.
Cells. The present invention further comprises cells in which a procoUagen or collagen, either alone or in combination with one or more post translational enzymes, is expressed both as mRNA and as a protein. Preferably, the procoUagen or collagen (types I-XIX), and/or the post-translational enzyme, is expressed in mammalian cells, insect cells, or yeast cells. Notwithstanding these preferred embodiments, other cells, including plant cells and algae, can be manufactured.
In preferred embodiments of the present invention, cells such as mammalian, insect and yeast cells, which may not naturally produce sufficient amounts of post- translational enzymes, are transformed with at least one set of genes coding for a post-translational enzyme, such as prolyl 4-hydroxylase, C-proteinase, N-proteinase, lysyl oxidase or lysyl hydroxylase.
Polypeptides. The invention comprises the recombinant polypeptides expressed according to the methods of the present invention, including fusion products produced from chimeric genes wherein, for example, relevant epitopes of collagen or procoUagen can be manufactured for therapeutic and other uses. The polypeptides of the present invention further include deglycosolated, unglycosolated and partially glycosolated collagens and procoUagens.
An advantage of recombinant collagens of the present invention is that these collagens will not produce allergic responses in the mammals to which they are administered provided that the recombinant collagen is manufactured utilizing the nucleic acid sequence encoding such mammal's native collagen . For example, it is expected that humans will be to tolerate the administration of human collagen, as compared to collagen derived from other mammals (e.g. , bovine derived collagen). Moreover, collagen of the present invention prepared from cultured cells should be of a higher quality than collagen obtained from animal sources, and should form larger and more tightly packed proteins. IV. BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a photograph showing analysis by polyacrylamide gel electrophoresis in SDS of the proteins secreted into medium by HT-1080 cells that were transfected with a gene construct containing the promoter, first exon and most of the first intron of the human COLI A 1 gene linked to 30 kb fragment containing all of COL2A1 except the first two exons.
Figure 2 is a photograph evidencing the secretion type II procoUagen into the medium from cells described in Figure 1 was folded into a correct native conformation.
Figure 3 is a photograph showing analysis of medium of HT-1080 cells co-transfected with a gene for COLI A 1 and a gene for COL1A2.
Figure 4 is a schematic representation of the cDNA for the proαl(l) chain of human type I procoUagen that has been modified to contain artificial sites for cleavage by specific restriction endonucleases.
Figure 5 is a photograph showing analysis by nondenaturing 7.5% polyacrylamide gel electrophoresis (lanes 1-3) and 10% polyacrylamide gel electrophoresis in SDS (lanes 4-6) of purified chick prolyl 4-hydroxylase (lanes 1 and 4) and the proteins secreted into medium by Sf9 cells expressing the gene for the a-subunit and the B- subunit of human prolyl 4-hydroxylase and infected with a58/B virus (lanes 2 and 5) or with a59/B virus (lanes 3 and 6). a58/B and a59/B differ by a stretch of 64 base pairs.
Figure 6 is a gel showing the expression of recombinant human type III procoUagen in Sf9 and High Five cells.
Figure 7 is a gel showing the expression of recombinant human type I procoUagen in insect cells, analyzed on a silver stained, 5% SDS-PAGE gel. Lane 1 is a pepsin digested sample from cells expressing only the pro cd chain of type I procoUagen. Lane 2 is a pepsin digested sample from cells coexpressing proαl and proα2 chains of type I procoUagen.
Figure 8 is a gel showing the expression of recombinant human type II procoUagen in insect cells, analyzed on a coomassie stained 5% SDS-PAGE gel. Figure 9 is an SDS-PAGE analysis under reducing and nonreducing conditions of purified type III collagen. The gel was stained with Coomassie
Brilliant Blue. The reduced type III collagen sample is shown in lane 2 and the nonreduced sample in lane 3. Molecular weight markers were run in lane 1. The positions of the trimeric αl (III) chains and the monomeric αl (III) chains are shown by arrows.
Figure 10 is a non-reducing SDS-PAGE analysis of trimer formation of the proαl (III) chains expressed in High Five insect cells. The samples were electrophoresed on 5% SDS-PAGE under nonreducing conditions and analyzed by Coomassie staining. Lane 1, molecular weight markers; lane 2, cell extract; lane 3, cell extract digested with pepsin; lane 4, proteins soluble in 1 % SDS. The positions of the trimeric proαl (III) and αl (III) chains are shown by arrows. Figure 11 is an analysis of the thermal stability of the recombinant human type III collagen produced in insect cells by a brief protease digestion.
V. DETAILED DESCRIPTION OF THE INVENTION 0 A. Definitions;
The term "collagen" refers to any one of the collagen types I-XIX, as well as any novel collagens produced according to the methods of this invention. The term also encompasses both procoUagen and mature collagen assembled as hetero- and homo-trimers, and any single chain polypeptides of procoUagen or 5 collagen for any of the collagen types, and any heterotrimers of any combination of the collagen constructs of the invention. The term "collagen" is meant to encompasses all of the foregoing, unless the context dictates otherwise.
The term "procoUagen" refers to any one of the collagen types I-XIX, as well ° as any novel collagens produced by this invention, that possess additional C-terminal and/or N-terminal peptides that assist in trimer assembly, solubility, purification or other function, and then are subsequently cleaved by N-proteinase, C-proteinase or other proteins. 5 The term "collagen subunit" refers to the amino acid sequence of one polypeptide chain of a collagen protein encoded by a single gene, as well as derivatives, including deletion derivatives, conservative substitutions, etc.
A "fusion protein" is a protein in which peptide sequences from different proteins are covalently linked together.
The term "collagen post-translational enzyme" refers to any enzyme that modifies a procoUagen, collagen, or components comprising a collagen molecule, and encompasses, but is not limited to, prolyl-4-hydroxylase, C-proteinase, N-proteinase, lysyl hydroxylase, and lysyl oxidase. The term "collagen post-translational enzyme" is meant to encompass all of the foregoing, unless the context dictates otherwise.
The term "infection" refers to the introduction of nucleic acids into an 15 organism by use of a virus or viral vector, and preferably, baculovirus or Semliki Forest virus.
The term "transformation" means introducing DNA into an organism so that the DNA is replicable, either as an extrachromosomal element, or by chromosomal 2 Q integration.
The term "transfection" refers to the taking up of an expression vector by a host cell, whether or not any coding sequences are in fact expressed.
The phrase "stringent conditions" as used herein refers to those hybridizing conditions that (1) employ low ionic strength and high temperature for washing, for
25 example, 0.015 M NaCl/0.0015 M sodium citrate/0.1% SDS at 50°C; or (2) employ during hybridization a denaturing agent such as formamide, for example, 50%
(vol/vol) formamide with 0.1 % bovine serum albumin/0.1 % Ficoll/0.1 % polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl,
30 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M Sodium citrate), 5 x Denhardt's solution, sonicated salmon sperm DNA (50 g/ml), 0.1 % SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC and 0.1 % SDS.
35 The term "purified" as used herein denotes that the indicated collagen or procoUagen is present in the substantial absence of other biological macromolecules, e.g. , polynucleotides, proteins, and the like. The term "purified" as used herein preferably means at least 95% by weight, more preferably at least 99.8% by weight, of the indicated biological macromolecules present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 1000 daltons, can be present).
The term "isolated" as used herein refers to a protein molecule separated not only from other proteins that are present in the natural source of the protein, but also from other proteins, and preferably refers to a protein found in the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a solution of the same. The terms "isolated" and "purified" do not encompass proteins present in their natural source.
5 B. Nucleic Acids Related To The Present Invention
In accordance with the invention, polynucleotide sequences which encode any collagen subunit, or functional equivalents thereof, may be used to generate recombinant DNA molecules that direct the expression of that subunit of _ collagen, or a functional equivalent thereof, in appropriate host cells. Preferred embodiments of the invention relate to polynucleotide sequences encoding human collagens or functional equivalents thereof. Preferred embodiments of the invention also include the polynucleotide sequences of collagen subunits of type I - type IV, type XIII, type XV, and type XVIII, or functional equivalents thereof. 5
The nucleic acid sequences encoding the known collagen types have been generally described in the art. See, e.g. , Fukai et al , Methods of Enzymology
245:3-28 (1994) and references cited therein. New collagens/procollagens or known collagens/procollagens from which nucleic acid sequence is not available may be 0 obtained from cDNA libraries prepared from tissues believed to possess a "novel" type of collagen and to express the novel collagen at a detectable level. For example, a cDNA library could be constructed by obtaining polyadenylated mRNA from a cell line known to express the novel collagen, or a cDNA library previously 5 made to the tissue/cell type could be used. The cDNA library is screened with appropriate nucleic acid probes, and/or the library is screened with suitable polyclonal or monoclonal antibodies that specificaUy recognize other collagens.
Appropriate nucleic acid probes include oligonucleotide probes that encode known portions of the novel collagen from the same or different species. Other suitable probes include, without limitation, oligonucleotides, cDNAs, or fragments thereof 5 that encode the same or similar gene, and/or homologous genomic DNAs or fragments thereof. Screening the cDNA or genomic library with the selected probe may be accomplished using standard procedures known to those in the art, such as those described in Chapters 10-12 of Sambrook et al. , Molecular Cloning: A
Laboratory Manual. New York, Cold Spring Harbor Laboratory Press, 1989. Other means for identifying novel collagens involve known techniques of recombinant DNA technology, such as by direct expression cloning or using the polymerase chain reaction (PCR) as described in U.S. Patent No. 4,683,195, issued 28 July 1987, or
15 in section 14 of Sambrook et al. , Molecular Cloning: A Laboratory Manual. Second Edition, Cold Spring Harbor Laboratory Press, New York, 1989, or in Chapter 15 of Current Protocols in Molecular Biology. Ausubel et al. eds. , Greene Publishing Associates and Wiley-Interscience 1991.
_ . Altered DNA sequences which may be used in accordance with the invention include deletions, additions or substitutions of different nucleotide residues resulting in a sequence that encodes the same or a functionally equivalent gene product. The gene product itself may contain deletions, additions or substitutions of amino acid residues within a collagen sequence, which result in a functionally equivalent
25 collagen.
The nucleic acid sequences of the invention may be engineered in order to alter the collagen coding sequence for a variety of ends including, but not limited to, alterations which modify processing and expression of the gene product. For
30 example, alternative secretory signals may be substituted for the native human secretory signal and/or mutations may be introduced using techniques which are well known in the art, e.g. , site-directed mutagenesis, to insert new restriction sites, to alter glycosylation patterns, phosphorylation, etc. Additionally, when expressing in
35 non-human cells, the polynucleotides encoding the collagens of the invention may be modified in the silent position of any triplet amino acid codon so as to better conform to the codon preference of the particular host organism.
The nucleic acid sequences of the invention are further directed to sequences which encode variants of the described collagens and fragments. These amino acid sequence variants of native collagens and collagen fragments may be prepared by methods known in the art by introducing appropriate nucleotide changes into a native or variant collagen encoding polynucleotide. There are two variables in the construction of amino acid sequence variants: the location of the mutation and the nature of the mutation. The amino acid sequence variants of collagen are preferably constructed by mutating the polynucleotide to give an amino acid sequence that does not occur in nature. These amino acid alterations can be made at sites that differ in collagens from different species (variable positions) or in highly conserved regions (constant regions). Sites at such locations will typically be modified in series, e.g. , by substituting first with conservative choices (e.g. , hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant choices (e.g. , hydrophobic amino acid to a charged amino acid), and then deletions or insertions may be made at the target site.
Amino acids are divided into groups based on the properties of their side chains (polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipatic nature): (1) hydrophobic (leu, met, ala, ile), (2) neutral hydrophobic
(cys, ser, thr), (3) acidic (asp, glu), (4) weakly basic (asn, gin, his), (5) strongly basic (lys, arg), (6) residues mat influence chain orientation (gly, pro), and (7) aromatic (trp, tyr, phe). Conservative changes encompass variants of an amino acid position that are within the same group as the "native" amino acid. Moderately conservative changes encompass variants of an amino acid position that are in a group that is closely related to the "native" amino acid (e.g. , neutral hydrophobic to weakly basic). Non-conservative changes encompass variants of an amino acid position that are in a group that is distantly related to the "native" amino acid (e.g. , hydrophobic to strongly basic or acidic). Amino acid sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as well as intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal sequences necessary for secretion or for intracellular targeting in different host cells.
In a preferred method, polynucleotides encoding a collagen are changed via site-directed mutagenesis. This method uses oligonucleotide sequences that encode the polynucleotide sequence of the desired amino acid variant, as well as a sufficient adjacent nucleotide on both sides of the changed amino acid to form a stable duplex on either side of the site of being changed. In general, the techniques of site-directed mutagenesis are well known to those of skill in the art and this technique is exemplified by publications such as, Edelman et al. , DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982).
PCR may also be used to create amino acid sequence variants of a collagen. When small amounts of template DNA are used as starting material, primer(s) that differs slightly in sequence from the corresponding region in the template DNA can generate the desired amino acid variant. PCR amplification results in a population of product DNA fragments that differ from the polynucleotide template encoding the collagen at the position specified by the primer. The product DNA fragments replace the corresponding region in the plasmid and this gives the desired amino acid variant.
A further technique for generating amino acid variants is the cassette mutagenesis technique described in Wells et al , Gene 34:315 (1985); and other mutagenesis techniques well known in the art, such as, for example, the techniques in Sambrook et al. , supra, and Current Protocols in Molecular Biology. Ausubel et al. , supra.
In another embodiment of the invention, a collagen sequence may be ligated to a heterologous sequence to encode a fusion protein. For example, a fusion protein may be engineered to contain a cleavage site located between an (3(IX) collagen sequence and the heterologous protein sequence, so that the (3(IX) collagen may be cleaved away from the heterologous moiety.
Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be used in the practice of the invention for the cloning and expression of these collagen proteins. Such DNA sequences include those which are capable of hybridizing to the appropriate human collagen sequence under stringent conditions.
C. Collagen Modifying Polypeptides And Corresponding Nucleic Acid Sequences
As naturally produced, collagens are structural proteins comprised of one or more collagen subunits which together form at least one triple-helical domain.
A variety of enzymes are utilized in order to transform the collagen subunits into procoUagen or other precursor molecules and then mature collagen. Such enzymes include prolyl-4-hydroxylase, C-proteinase, N-proteinase, lysyl oxidase and lysye hydroxylase.
Prolyl 4-hydroxylase plays a central role in the biosynthesis of all collagens, 0 as the 4-hydroxyproline residues stabilize the folding of the newly synthesized polypeptide chains, into triple-helical molecules. Prockop et al. , Annu. Rev.
Biochem. 64:403-434 (1995); Kivirikko et al , "Post-Translational Modifications of
Proteins," pp. 1-51 (1992); Kivirikko et al , FASEB J. 3: 1609-1617 (1989). For 5 example, when the proαl chains of type III procoUagen were expressed in insect cells, without recombinant prolyl 4-hydroxylase, considerable amounts of procoUagen were made in the cells, and the proαl chains formed triple-helical molecules as indicated by the resistance of the collagenous domains of the collagen to protease Q degradation at 22°C. However, the Tm of the triple helices of such molecules was about 6°C lower than procoUagen produced in the presence of the recombinant prolyl
4- hydroxylase. Also, the level of expression of type III collagen was lower in the absence of recombinant prolyl 4- hydroxylase than in its presence.
Lysyl hydroxylase, an α2 homodimer, catalyzes the post-translation 5 modification of collagen to form hydroxy lysine in collagens. See generally,
Kivirikko et al , Post-Translational Modifications of Proteins. Harding, J.J. , and Crabbe, M.J.C., eds. , CRC Press, Boca Raton, FL (1992); Kivirikko, Principles of Medical Biology. Vol. 3 Cellular Organelles and the Extracellular Matrix. Bittar, E.E., and Bittar, N., eds., JAI Press, Greenwich, Great Britain (1995).
C-proteinase processes the assembled procoUagen by cleaving off the C-terminal ends of the procoUagens that assist in assembly of, but are not part of, the triple helix of the collagen molecule. See generally, Kadler et al , J. Biol. Chem. 262: 15969-15701 (1987), Kadler et al , Ann. NY Acad. Sci. 580:214-224 (1990).
N-proteinase processes the assembled procoUagen by cleaving off the N-terminal ends of the procoUagens that assist in the assembly of, but are not part of, the collagen triple helix. See generally, Hojima et al. , J. Biol. Chem. 269:11381-11390 (1994).
Lysyl oxidase is an extracellular copper enzyme that catalyzes the oxidative 5 deamination of the e-amino group in certain lysine and hydroxy lysine residues to form a reactive aldehyde. These aldehydes then undergo an aldol condensation to form aldols, which cross links collagen fibrils. Information on the DNA and protein sequence of lysyl oxidase can found, among elsewhere, in Kivirikko, Principles of 0 Medical Biology. Vol. 3 Cellular Organelles and the Extracellular Matrix. Bittar, E.E. , and Bittar, N., eds., JAI Press, Greenwich, Great Britain (1995), Kagan, Path. Res. Pract. 190: 910-919 (1994), Kenyon et al , J. Biol. Chem. 268(25): 18435-18437 (1993), Wu et al , J. Biol. Chem. 267(34): 24199-24206 (1992), Mariani et al , Matrix L2(3):242-248 (1992), and Hamalainen et al , Genomics U(3):508-516 (1991).
The nucleic acid sequences encoding a number of these post-translational enzymes have been reported. See e.g. Vuori et al. , Proc. Natl. Acad. Sci. USA 89:7467-7470 (1992); Kessler et al , Science 271:360-362 (1996). The nucleic acid ° sequences encoding the various post-translational enzymes may also be determined according to the methods generally described above and include use of appropriate probes and nucleic acid libraries.
5 D. Host- Vector Systems for Expressing Recombinant Collagen
In order to express the collagens and related collagen post-translational enzymes of the invention, the nucleotide sequence encoding the collagen, or a functional equivalent, is inserted into an appropriate expression vector, i.e. , a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence, or in the case of an RNA viral vector, the necessary elements for replication and translation.
Methods which are well known to those skilled in the art can be used to construct expression vectors containing a collagen coding sequence for the collagens of the invention and appropriate transcriptional/translational control signals. These methods include in vitro recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic recombination. See, for example, the techniques described in Maniatis et al , Molecular Cloning A Laboratory Manual. Cold Spring Harbor Laboratory, N.Y. (1989) and Ausubel et al , Current Protocols in Molecular Biology. Greene Publishing Associates and Wiley Interscience, N.Y. (1989).
A variety of host-expression vector systems may be utilized to express a collagen coding sequence. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing a procoUagen or collagen coding sequence; yeast or filamentous fungi transformed with recombinant yeast or fungi expression vectors containing a procoUagen or collagen coding sequence; insect cell systems infected with recombinant virus expression vectors (e.g. , baculovirus) containing sequence encoding the procoUagen or collagen of the invention; plant cell systems infected with recombinant virus expression vectors (e.g. , cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression vectors (e.g. , Ti plasmid) containing a procoUagen or collagen coding sequence; or animal cell systems. The expression elements of these systems vary in their strength and specificities. Depending on the host/ vector system utilized, any of a number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used in the expression vector. For example, when cloning in bacterial systems, inducible promoters such as pL of bacteriophage λ, plac, ptrp, ptac (ptφ-lac hybrid promoter) and the like may be used; when cloning in insect cell systems, promoters such as the baculovirus polyhedron promoter may be used; when cloning in plant cell systems, promoters derived from the genome of plant cells (e.g. , heat shock promoters; the promoter for the small subunit of RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant viruses (e.g. , the 35S RNA promoter of CaMV; the coat protein promoter of TMV) may be used; when cloning in mammalian cell systems, promoters derived from the genome of mammalian cells (e.g. , metallothionein promoter) or from mammalian viruses (e.g. , the adenovirus late promoter; the vaccinia virus 7.5 K promoter) may be used; when generating cell lines that contain multiple copies of a collagen DNA,
SV40-, BPV- and EBV-based vectors may be used with an appropriate selectable marker. In bacterial systems a number of expression vectors may be advantageously selected depending upon the use intended for the collagen expressed. For example, when large quantities of the collagens of the invention are to be produced for the generation of antibodies, vectors which direct the expression of high levels of fusion protein products that are readily purified may be desirable. Such vectors include, but are not limited to, the E. coli expression vector pUR278 (Ruther et al , EMBO J.
2: 1791 (1983)), in which the collagen coding sequence may be ligated into the vector in frame with the lac Z coding region so that a hybrid AS-lac Z protein is produced; pIN vectors (Inouye et al , Nucleic Acids Res. 13:3101-3109 (1985); Van Heeke et al , J. Biol. Chem. 264:5503-5509 (1989)); and the like. pGEX vectors may also be used to express foreign polypeptides as fusion proteins with glutathione S- transferase
(GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione. The pGEX vectors are designed to include thrombin or factor Xa protease cleavage sites so that the cloned polypeptide of interest can be released from the GST moiety.
A preferred expression system is a yeast expression system. In yeast, a number of vectors containing constitutive or inducible promoters may be used. For a review see, Current Protocols in Molecular Biology. Vol. 2, Ed. Ausubel et al , Greene Publish. Assoc. & Wiley Interscience, Ch. 13 (1988); Grant et al ,
Expression and Secretion Vectors for Yeast, in Methods in Enzymology. Ed. Wu &
Grossman, Acad. Press, N.Y. 15.3:516-544 (1987); Glover, DNA Cloning. Vol. II,
IRL Press, Wash., D.C , Ch. 3 (1986); Bitter, Heterologous Gene Expression in
Yeast, in Methods in Enzymology. Eds. Berger & Kimmel, Acad. Press, N.Y. 152:673-684 (1987); and The Molecular Biology of the Yeast Saccharomyces. Eds. Strathern et al , Cold Spring Harbor Press, Vols. I and II (1982).
A particularly preferred system useful for cloning and expression of the collagen proteins of the invention uses host cells from the yeast Pichia. Species of non-Saccharomyces yeast such as Pichia pastoris appear to have special advantages in producing high yields of recombinant protein in scaled up procedures. Additionally, a Pichia expression kit is available from Invitrogen Corporation (San Diego, CA). There are a number of methanol responsive genes in methy lotrophic yeasts such as Pichia pastoris, the expression of each being controlled by methanol responsive regulatory regions (also referred to as promoters). Any of such methanol responsive promoters are suitable for use in the practice of the present invention. Examples of specific regulatory regions include the promoter for the primary alcohol oxidase gene from Pichia pastoris AOX1, the promoter for the secondary alcohol oxidase gene from P. pastoris AXO2, the promoter for the dihydroxyacetone synthase gene from P. pastoris (DAS), the promoter for the P40 gene from P. pastoris, the promoter for the catalase gene from P. pastoris, and the like.
Typical expression in Pichia pastoris is obtained by the promoter from the tightly regulated AOX1 gene. See Ellis et al. , Mol. Cell. Biol. 5: 1111 (1985) and
U.S. Patent No. 4,855,231. This promoter can be induced to produce high levels of recombinant protein after addition of methanol to the culture. By subsequent manipulations of the same cells, expression of genes for the collagens of the invention described herein is achieved under conditions where the recombinant protein is adequately hydroxylated by prolyl 4-hydroxylase and, therefore, can fold into a stable helix that is required for the normal biological function of the proteins in forming fibrils. Another particularly preferred yeast expression system makes use of the methy lotrophic yeast Hansenula potymorpha. Growth on methanol results in the induction of key enzymes of the methanol metabolism, namely MOX (methanol oxidase), DAS (dihydroxyacetone synthase) and FMHD (formate dehydrogenase). These enzymes can constitute up to 30-40% of the total cell protein. The genes encoding MOX, DAS, and FMDH production are controlled by very strong promoters which are induced by growth on methanol and repressed by growth on glucose. Any or all three of these promoters may be used to obtain high level expression of heterologous genes in H. potymorpha. The gene encoding a collagen of the invention is cloned into an expression vector under the control of an inducible H. polymorpha promoter. If secretion of the product is desired, a polynucleotide encoding a signal sequence for secretion in yeast, such as the 5. cerevisiae prepro-mating factor αl, is fused in frame with the coding sequence for the collagen of the invention. The expression vector preferably contains an auxotrophic marker gene, such as URA3 or LEU2, which may be used to complement die deficiency of an auxotrophic host. The expression vector is then used to transform H. polymorpha host cells using techniques known to those of skill in the art. An interesting and useful feature of H. polymorpha transformation is the spontaneous integration of up to 100 copies of the expression vector into the genome. In most cases, the integrated DNA forms multimers exhibiting a head-to-tail arrangement. The integrated foreign DNA has been shown to be mitotically stable in several recombinant strains, even under non-selective conditions. This phenomena of high copy integration further adds to the high productivity potential of the system.
Filamentous fungi may also be used to produce the collagens of the instant invention. Vectors for expressing and/or secreting recombinant proteins in filamentous fungi are well known, and one of skill in the art could use these vectors to express recombinant collagen.
In cases where plant expression vectors are used, the expression of sequences encoding the collagens of the invention may be driven by any of a number of promoters. For example, viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al , Namre 3_1Q:511-514 (1984), or the coat protein promoter of TMV (Takamatsu et al , EMBO J. £:307-311 (1987)) may be used; alternatively, plant promoters such as the small subunit of RUBISCO (Coruzzi et al. , EMBO J. 3: 1671-1680 (1984); Broglie et al , Science 224:838-843 (1984); or heat shock promoters, e.g. , soybean hspl7.5-E or hspl7.3-B (Gurley et al. , Mol. Cell. Biol. 6:559-565 (1986) may be used. These constructs can be introduced into plant cells using Ti plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, microinjection, electroporation, etc. For reviews of such techniques see, for example, Weissbach & Weissbach, Methods for Plant Molecular Biology. Academic Press, NY, Section VIII, pp. 421-463 (1988); and Grierson & Corey, Plant Molecular Biology. 2d Ed. , Blackie, London, Ch. 7-9 (1988).
An alternative expression system which could be used to express the collagens 5 of the invention is an insect system. In one such system, Autographa californica nuclear polyhidrosis virus (AcNPV) is used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. Coding sequence for the collagens of the invention may be cloned into non-essential regions (for example the o polyhedron gene) of the virus and placed under control of an AcNPV promoter (for example, the polyhedron promoter). Successful insertion of a collagen coding sequence will result in inactivation of the polyhedron gene and production of non-occluded recombinant virus (i.e. , virus lacking the proteinaceous coat coded for by the polyhedron gene). These recombinant viruses are then used to infect 5
Spodoptera frugiperda cells in which the inserted gene is expressed, (see, e.g. ,
Smith et al , J. Virol. 46:584 (1983); Smith, U.S. Patent No. 4,215,051). Further examples of this expression system may be found in Current Protocols in Molecular
Biology. Vol. 2, Ed. Ausubel et al. , Greene Publish. Assoc. & Wiley Interscience. 0 In mammalian host cells, a number of viral based expression systems may be utilized. In cases where an adenovirus is used as an expression vector, coding sequence for the collagens of the invention may be ligated to an adenovirus transcription/translation control complex, e.g. , the late promoter and tripartite leader 5 sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g. , region El or E3) will result in a recombinant virus that is viable and capable of expressing collagen in infected hosts. (See, e.g. , Logan & Shenk, Proc. Natl. Acad. Sci. USA 81:3655- 3659 (1984)). Alternatively, the vaccinia 7.5 K promoter may be used. (See, e.g. , Mackett et al , Proc. Natl. Acad. Sci. USA 79:7415-7419 (1982); Mackett et al , J. Virol. 49:857-864 (1984); Panicali et al , Proc. Natl. Acad. Sci. USA 79:4927-4931 (1982).
Specific initiation signals may also be required for efficient translation of inserted collagen coding sequences. These signals include the ATG initiation codon and adjacent sequences. In cases where the entire collagen gene, including its own initiation codon and adjacent sequences, is inserted into the appropriate expression vector, no additional translational control signals may be needed. However, in cases where only a portion of a collagen coding sequence is inserted, exogenous translational control signals, including the ATG initiation codon, must be provided. Furthermore, the initiation codon must be in phase with the reading frame of the collagen coding sequence to ensure translation of the entire insert. These exogenous translational control signals and initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (see Bittner et al. , Methods in Enzvmol. 153:516-544 (1987)).
Preferably, the collagens of the invention are expressed as secreted proteins.
When the engineered cells used for expression of the proteins are non-human host cells, it is often advantageous to replace the human secretory signal peptide of the collagen protein with an alternative secretory signal peptide which is more efficiently recognized by the host cell's secretory targeting machinery. The appropriate secretory signal sequence is particularly important in obtaining optimal fungal expression of mammalian genes. For example, in methylotrophic yeasts, a DNA sequence encoding the in-reading frame 5. cerevisiae α-mating factor pre-pro sequence may be inserted at the amino-terminal end of the coding sequence. The αMF pre-pro sequence is a leader sequence contained in the αMF precursor molecule, and includes the lys-arg encoding sequence which is necessary for proteolytic processing and secretion (see, e.g. , Brake et al , Proc. Natl. Acad. Sci. USA 81:4642 (1984)). Other signal sequences for prokaryotic, yeast, fungi, insect or mammalian cells are well known in the art, and one of ordinary skill could easily select a signal sequence appropriate for the host cell of choice.
The vectors of this invention may autonomously replicate in the host cell, or may integrate into the host chromosome. Suitable vectors with autonomously replicating sequences ("ars") are well known for a variety of bacteria (e.g. , the ars from pBR322 functions in the majority of gram negative bacteria), yeast (the 2μ plasmid ars), and various viral replications sequences for both prokaryotes and eukaryotes (prokaryote: λ, T-even phages, M13, etc; eukaryote: adenovirus, SV40, polyoma, VSV or BPV, vaccina, etc.). Vectors may integrate into the host cell genome when they have a DNA sequence that is homologous to a sequence found in the host cell's genomic DNA. The vectors of the invention also encode a selection gene, also termed a selectable marker, that encodes a product necessary for the host cell to grow and survive under certain conditions. Typical selection genes include genes encoding (1) a protein that confers resistance to an antibiotic or other toxin (e.g. , tetracyclme, _ ampicillin, neomycin, methotrexate, etc.), and (2) a protein that complements an auxotrophic requirement of the host cell, etc. Other examples of selection genes include: the herpes simplex virus thymidine kinase (Wigler et al , Cell 11:223 (1977)), hypoxanthine-guanine phosphoribosyltransferase (Szybalska et al , Proc.
Natl. Acad. Sci. USA 48:2026 (1962)), and adenine phosphoribosyltransferase 5
(Lowy et al. , Cell 22:817 (1980)) genes that can be employed in tk-, hgprt- or aprt- cells, respectively. Also, antimetabolite resistance can be used as the basis of selection for dhfr, which confers resistance to methotrexate (Wigler et al. , Natl.
Acad. Sci. USA 77:3567 (1980); O'Hare et al , Proc. Natl. Acad. Sci. USA 78:1527 0 (1981)); gpt, which confers resistance to mycophenolic acid (Mulligan et al , Proc. Natl. Acad. Sci. USA 78:2072 (1981)); neo, which confers resistance to the aminoglycoside G-418 (Colberre-Garapin et al , J. Mol. Biol. 150: 1 (1981)); and hygro, which confers resistance to hygromycin (Santerre et al , Gene 30:147 (1984)). 5 Recently, additional selectable genes have been described, namely trpB, which allows cells to utilize indole in place of tryptophan; hisD, which allows cells to utilize histinol in place of histidine (Hartman et al. , Proc. Natl. Acad. Sci. USA 85:8047 (1988)); and ODC (omithine decarboxylase) which confers resistance to the omithine decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue L., In: Current Communications in Molecular Biology. Cold Spring Harbor Laboratory, Ed. (1987)).
Further regulatory elements necessary for the expression vectors of the invention include sequences for initiating transcription, e.g. , promoters and enhancers. Promoters are untranslated sequences located upstream from the start codon of the strucmral gene that control the transcription of the nucleic acid under its control. Inducible promoters are promoters that alter their level of transcription initiation in response to a change in culmre conditions, e.g. , the presence or absence of a nutrient. One of skill in the art would know of a large number of promoters that would be recognized in host cells suitable for the present invention. These promoters are operably linked to the DNA encoding the collagen by removing the promoter from its native gene and placing the collagen encoding DNA 3' of the promoter sequence. Promoters useful in the present invention include, but are not limited to, the following: (prokaryote) (1) the lactose promoter, the alkaline phosphatase promoter, the tryptophan promoter, and hybrid promoters such as the tac promoter, (yeast) (2) the promoter for 3 -phosphoglycerate kinase, other glycolytic enzyme promoters (hexokinase, pyruvate decarboxylase, phophofructosekinase, glucose -6-phosphate isomerase, etc.), the promoter for alcohol dehydrogenase, the metallothionein promoter, the maltose promoter, and the galactose promoter,
(eukaryotic) (3) virtually all eukaryotic genes have an AT-rich region located approximately 25 to 30 bases upstream from the site where transcription is initiated, examples of suitable eukaryotic promoters include: promoters from the viruses polyoma, fowlpox, adenovirus, bovine papilloma virus, avian sarcoma virus, cytomegalovirus, retroviruses, SV40, and promoters from the target eukaryote including: the glucoamylase promoter from Aspergillus, the actin promoter or an immunoglobin promoter from a mammal, and native collagen promoters. See, e.g. , de Boer et al, Proc. Natl. Acad. Sci. USA 80:21-25 (1983), Hitzeman et al. , L Biol. Chem. 255:2073 (1980), Fiers et al , Nature 273:113 (1978), Mulligan and Berg, Science 209: 1422-1427 (1980), Pavlakis et al , Proc. Natl. Acad. Sci. USA 78:7398-7402 (1981), Greenway et al , Gene 18:355-360 (1982), Gray et al , Nature 295:503-508 (1982), Reyes et al , Nature 222:598-601 (1982), Canaani and Berg, Proc. Natl. Acad. Sci. USA 79:5166-5170 (1982), Gorman et al , Proc. Natl. Acad. Sci. USA 79:6777-6781 (1982), Nunberg et al , Mol. and Cell. Biol. ϋ(4):2306-2315 (1984).
Transcription of the collagen encoding DNA from the promoter is often increased by inserting an enhancer sequence in the vector. Enhancers are cis-acting elements, usually about from 10 to 300 bp, that act to increase the rate of transcription initiation at a promoter. Many enhancers are known for both eukaryotes and prokaryotes, and one of ordinary skill could select an appropriate enhancer for the host cell of interest. See, e.g. , Yaniv, Nature 297:17-18 (1982) for 5 eukaryotic enhancers.
In addition, a host cell strain may be chosen which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific fashion desired. Such modifications (e.g. , glycosylation) and processing (e.g. , Q cleavage) of protein products may be important for the function of the protein.
Different host cells have characteristic and specific mechanisms for die post-translational processing and modification of proteins. Appropriate cells lines or host systems can be chosen to ensure the correct modification and processing of the foreign protein expressed. To this end, eukaryotic host cells which possess the 5 cellular machinery for proper processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be used. Such mammalian host cells include, but are not limited to, CHO, VERO, BHK, HeLa, COS, MDCK, 293,
WI38, etc. Additionally, host cells may be engineered to express various enzymes to 0 ensure the proper processing of the collagen molecules. For example, the gene for prolyl-4-hydroxylase may be coexpressed with the collagen gene in the host cell.
For long-term, high-yield production of recombinant proteins, stable expression is preferred. For example, cell lines which stably express the collagens 5 of the invention may be engineered. Rather than using expression vectors which contain viral origins of replication, host cells can be transformed with collagen encoding DNA controlled by appropriate expression control elements (e.g. , promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following the introduction of foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are switched to a selective media. The selectable marker in the recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid into their chromosomes and grow to form foci which in mm can be cloned and expanded into cell lines. This method may advantageously be used to engineer cell lines which express a desired collagen.
E. Infection. Transformation and Transfection
Host cells are transfected or preferably infected or transformed with 5 the above-described expression vectors, and culmred in nutrient media appropriate for selecting transductants or transformants containing the collagen encoding vector.
The host cells which contain the coding sequence and which express the biologically active gene product may be identified by at least four general approaches; (a) DNA-DNA or DNA-RNA hybridization; (b) the presence or absence of "marker" gene functions; (c) assessing the level of transcription as measured by the expression of collagen mRNA transcripts in the host cell; and (d) detection of the gene product as measured by immunoassay or by its biological activity.
In the first approach, the presence of the collagen coding sequence inserted in 5 the expression vector can be detected by DNA-DNA or DNA-RNA hybridization using probes comprising nucleotide sequences that are homologous to the collagen coding sequence, respectively, or portions or derivatives tiiereof.
In the second approach, the recombinant expression vector/host system can be ° identified and selected based upon the presence or absence of certain "marker" gene functions (e.g. , thymidine kinase activity, resistance to antibiotics, resistance to methotrexate, transformation phenotype, occlusion body formation in baculovirus, etc.). For example, if the collagen coding sequence is inserted within a marker gene 5 sequence of the vector, recombinant cells containing collagen coding sequence can be identified by the absence of the marker gene function. Alternatively, a marker gene can be placed in tandem with the collagen sequence under the control of the same or different promoter used to control the expression of the collagen coding sequence.
Expression of the marker in response to induction or selection indicates expression of the collagen coding sequence.
In the third approach, transcriptional activity of the collagen coding region can be assessed by hybridization assays. For example, RNA can be isolated and analyzed by Northern blot using a probe homologous to the collagen coding sequence or particular portions thereof. Alternatively, total nucleic acids of the host cell may be extracted and assayed for hybridization to such probes.
In the fourth approach, the expression of a collagen protein product can be assessed immunologically, for example by Western blots, immunoassays such as radioimmuno-precipitation, enzyme-linked immunoassays and the like.
F. Purification of Collagens
The expressed collagen of the invention, which is preferably secreted into the culture medium, is purified to homogeneity by chromatography. In one embodiment, the recombinant collagen protein is purified by size exclusion chromatography. However, other purification techniques known in the art can also be used, including ion exchange chromatography, and reverse-phase chromatography. See, e.g. , Maniatis et al , Molecular Cloning A Laboratory Manual. Cold Spring
Harbor Laboratory, N.Y. (1989), Ausubel et al , Current Protocols in Molecular
Biology. Greene Publishing Associates and Wiley Interscience, N.Y. (1989), and
Scopes, Protein Purification: Principles and Practice. Springer- Verlag New York,
Inc., NY (1994).
The present invention is further illustrated by the following examples, which are not intended to be limited in any way.
VI. EXAMPLES
A. Example 1: Synthesis of Human Type π ProcoUagen A recombinant COLI A 1 gene construct employed in the present invention comprised a fragment of the 5'- end of COL1A1 having a promotor, exon 1 and intron 1 fused to exons 3 through 54 of a COL2A1 gene. The hybrid construct was transfected into HT-1080 cells. These cells were co-transfected with a neomycin-resistance gene and grown in the presence of the neomycin analog G418. The hybrid construct was used to generate transfected cells.
A series of clones were obtained that synthesized mRNA for human type II procoUagen. To analyze the synthesized proteins, the cells were incubated with [14C] proline so that the medium proteins could be analyzed by autoradiography (storage phosphor film analyzer).
As set forth at Figure 1, lane 1 shows that the unpurified medium proteins are comprised of three major polypeptide chains. Specifically, the medium proteins contained the expected type II procoUagen comprised of proαl (II) chains together with proαl (IV) and proα2(IV) chains of type IV collagen normally synthesized by the cells. The upper two are proαl (IV) and proα2 (IV) chains of type IV collagen that are synthesized by cells not transfected by the construct. The third band is the proαl (II) chains of human type II procoUagen synthesized from the construct. Lanes
2 and 3 are the same medium protein after chromatography of the medium on an ion exchange column. As indicated in Lanes 2 and 3, the type II procoUagen was readily purified by a single step of ion exchange chromatography.
The type II procoUagen secreted into the medium was correctly folded by a protease-thermal stability test. As evidenced at Figure 2, the medium proteins were digested at the temperatures indicated with a high concentration of trypsin and chymotrypsin under conditions in which conectly folded triple-helical procoUagen or collagen resists digestion but unfolded or incorrectly folded procoUagen of collagen is digested to small fragments. The products of the digestion were than analyzed by polyacrylamide gel electrophoresis in SDS and fluorography. The results show that the type II procoUagen resisted digestion up to 43°C, the normal temperature at which type II procoUagen unfolds. Therefore, the type II procoUagen is correctly folded and can be used to generate collagen fibrils. B. Example 2: Synthesis of Human Type I ProcoUagen
As a second example, HT-1080 cells were co-transfected with a
COLI A 1 gene and a COL1A2 gene. Both genes consisted of a cytomegalic virus promoter linked to a full-length cDNA. The COL1A2 gene construct but not the
COLI A 1 gene construct contained a neomycin-resistance gene. The cells were selected for expression of the COL1A2- neomycin resistance gene construct by growth in the presence of the neomycin-analog G418. The medium was then examined for expression of the COLI A 1 with a specific polyclonal antibody for human proαl(l) chains.
More specifically, the COLIA2 was linked to an active neomycin-resistance gene but the COLI A 1 was not. The cells were screened for expression of the COLlA2-neomycin resistance gene construct with the neomycin analog G418. The medium was analyzed for expression of the COLI A 1 by Western blotting with a polyclonal antibody specific for the human proαl(I) chain. As set forth in Figure 3, lane 1 indicates that the medium proteins contained proα(I) chains (αl(I) and α2(I)). Lane 2 is an authentic standard of type I procoUagen containing proαl(I) and proα2(I) chains and partially processed pCαl(I) chains. The results demonstrate that the cells synthesized human type procoUagen that contained proαl(I) chains, presumably in the form of the normal heterotrimer with the composition two proα(I) chains and one proα2(I) chain.
These results demonstrated that the cells synthesized human type I procoUagen that was probably comprised of the normal heterotrimeric structure of two proαl (I) chains and one proα2(I) chain.
TABLE I presents a summary of some of the DNA constructs containing human procoUagen genes. The constructs were assembled from discrete fragments of the genes or cDNAs from the genes together with appropriate promoter fragments. TABLE I
Figure imgf000035_0001
C. Example 3: Cell Transfections
For cell transfection experiments, a cosmid plasmid clone containing the gene construct was cleaved with a restriction endonuclease to release the construct from the vector. A plasmid vector comprising a neomycin resistance gene, (Law et al , Mol. Cell. Biol. 3:2110-2115 (1983)) was linearized by cleavage with
BamHI. The two samples were mixed in a ratio of approximately 10: 1 gene construct to neomycin resistant gene, and the mixture was then used for cotransfection of HT-1080 cells by calcium phosphate coprecipitation (Sambrook et al , Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Laboratory Press, 2d Edition (1989)). DNA in the calcium phosphate solution was layered onto cultured cells without 10μg of chimeric gene construct per 100 ml plate of preconfluent cells. Cells were incubated in DMEM containing 10% newborn calf serum for 10 hours. The samples were subjected to glycerol shock by adding a 15% glycerol solution for 3 minutes. The cells were then transferred to DMEM medium containing newborn calf serum for 24 hours and then to the same medium containing 450 μg/ml of G418. Incubation in the medium containing G418 was continued for about 4 weeks with a change of medium every third day. G418-resistant cells were either pooled or separate clones obtained by isolating foci with a plastic cylinder and subcultured.
D. Example 4: Western Blotting
For assay of expression of the COL2A1 gene, polyclonal antibodies were prepared in rabbits using a 23-residue synthetic peptide that had an amino acid sequence found in the COOH-terminal telopeptide of type II collagen. See generally,
Cheah et al. , Proc. Natl. Acad. Sci. USA 82:2555-2559 (1985). The antibody did not react by Western blot analysis with proα chains of human type I procoUagen or collagen, human type II procoUagen or collagen, or murine type I procoUagen. For assay of expression of the COLIAI genes, polyclonal antibodies that reacted with the
COOH-terminal polypeptide of the proα(I) chain were employed. See generally, Olsen et a/. , J. Biol. Chem. 266: 1117-1121 (1991).
Culmre medium from pooled clones or individual clones was removed and separately precipitated by the addition of solid ammonium sulfate to 30% saturation and precipitates were collected by centrifugation at 14,000 x g and then dialyzed against a buffer containing 0.15 M NaCl, 0.5 mM EDTA, 0.5 mM
N-ethylmaleimide, 0.1 mM and p-aminobenzamidine, and 50 mM Tris-HCl (pH 7.4 at 4°C). Aliquots of the samples were heated to 10°C for 5 minutes in 1 % SDS, 50 mM DTT and 10% (v/v) glycerol, and separated by electrophoresis on 6% polyacrylamide gels using a mini-gel apparatus (Holford SE250, Holford Scientific) run at 125 V for 90 minutes. Separated proteins were electroblotted from the polyacrylamide gel at 40 V for 90 minutes onto a supported nitrocellulose membrane (Schleicher and Schuell). The transferred proteins were reacted for 30 minutes with the polyclonal antibodies at a 1:500 (v/v) dilution. Proteins reacting with the antibodies were detected with a secondary anti-rabbit IgG antibody coupled to alkaline phosphatase (Promega Biotech) for 30 minutes. Alkaline phosphatase was visualized with NBT/BCIP (Promega Biotech) as directed by the manufacturer.
E. Example 5: In vitro Analysis Of Recombinant Collagen. /. Assembly Of Recombinant Collagen: Protease Digestion.
To demonstrate that the procoUagens synthesized and secreted in the medium by the transfected cells were correctly folded, the medium proteins were digested with high concentrations of proteases under conditions in which only correctly folded procoUagens and collagens resist digestion. For digestion with a combination of trypsin and chymotrypsin, the cell layer from a 25 cm flask was scraped into 0.5 ml of modified Krebs II medium containing 10 mM EDTA and 0.1 % Nonidet P-40 (Sigma). The cells were vigorously agitated in a Vortex mixer for 1 minute and immediately cooled to 4°C. The supernatant was transferred to new tubes. The sample was preincubated at the temperamre indicated for 10 minutes and the digestion was carried out at the same temperature for 2 minutes. For the digestion, a 0.1 volume of the modified Krebs II medium containing 1 mg/ml trypsin and 2.5 mg/ml α-chyrnotrypsin (Boehringer Manheim) was added. The digestion was stopped by adding a 0.1 volume of 5 mg/ml soybean trypsin inhibitor (Sigma). For analysis of the digestion products, the sample was rapidly immersed in boiling water for 2 minutes with the concomitant addition of a 0.2 volume of 5 x electrophoresis sample buffer that consisted of 10% SDS, 50% glycerol, and 0.012% bromphenol blue in 0.625 M Tris-HCl buffer (pH 6.8). Samples were applied to SDS gels with prior reduction by incubating for 3 minutes in boiling water after the addition of 2% 2-mercaptoethanol. Electrophoresis was performed using the discontinuous system of Laemli, Namre 227:680-685 (1979), with minor modifications described by de Wet et al , J. Biol. Chem. 258:7721-7728 (1983).
2. Double Immunostaining Of Sβ Cells.
Sf9 cells were grown on glass slides and fixed in 100% ethanol at -20°C. Alternatively, cells in monolayer were detached, washed twice with a solution of 0.15 M NaCl and 0.02 M phosphate, pH 7.4 (washing solution), suspended in cold ethanol and spread on silanated (Maples, J.A., (1985), Am. J.
Clin. Pathol. 83:356- 363) glass slides. Cells were incubated with 1 % bovine serum albumin in 0.15 M NaCl and 0.02 M phosphate, pH 7.4, for 15 min followed by incubation for 30 min in a 1 :50 dilution of a mouse monoclonal antibody to the β subunit (5B5, Dako) and a rabbit polyclonal antibody to the α subunit of human prolyl 4-hydroxylase in the above bovine serum albumin-containing solution. Cells were washed with the washing solution 4 times for 20 min and incubated in a 1 : 10 dilution of a sheep anti-mouse Ig-rhodamine F(ab)2 fragment (Boehringer Mannheim) and a sheep anti- rabbit IgG fluorescein F(ab)2 fragment (Boehringer Mannheim) in the bovine serum albumin-containing solution for 30 min, washed with the washing solution, rinsed with distilled water and mounted using Glycergel (Dako). The samples were photographed using a Leitz Aristoplan microscope equipped with ep-illuminator and filters for fluorescein isothiocyanate and tetramethyl rhodamine B isothiocyanate fluorescence.
To study the efficiency of a multiple baculovirus infection, immunocytochemical staining of insect cells was used. Sf9 cells were coinfected with two recombinant viruses coding for the ( and ( subunits of prolyl 4-hydroxylase and immunostained with antibodies to these two subunits (Fig. 3). When the analysis was performed 48 h after infection, 87% of all cells were found to express at least one of the two types of subunit, 90% of cells expressing one type of subunit also expressing the other type. 3. Prolyl 4-Hydroxylase Activity Assay.
The 0.2% Triton X-100 extracts of cell homogenates were analyzed for prolyl 4-hydroxylase activity with an assay based on the hydroxylation-coupled decarboxylation of 2-oxo [1-14C] glutarate (Kivirikko et al. ,
Methods Enzvmol. 82:245-304 (1982)). As reported previously (Veijola et al , L Biol. Chem. 269:26746-26753 (1994)), a significant level of prolyl 4-hydroxylase activity was found in both Sf9 and High Five cells, the activity in High Five cells being distinctly higher than that in Sf9 cells (TABLE I). Infection of the cells with a virus coding for the proαl (III) chains had only minor effects on this activity, whereas the activity in cells infected with the virus coding for the proαl (III) chain together with viruses coding for the two types of subunit of human prolyl 4-hydroxylase was markedly higher (TABLE I).
4. Assay For Measuring Collagen.
The amount of the purified type III collagen was determined by using the Sircol collagen assay (Biocolor). Amino acid analysis of the purified type III collagen was performed in an Applied Biosystems 421 Amino Acid Analyzer.
F. Example 6: Specifically Engineered ProcoUagens and Collagens
As indicated in Figure 4, a hybrid gene consisting of some genomic
DNA and some cDNA for the proαl (I) chain of human type I procoUagen was the starting material. The DNA sequence of the hybrid gene was analyzed and the codons for amino acids that formed the junctions between the repeating D-periods were modified in ways that did not change the amino acids encoded but did create unique sites for cleavage of the hybrid gene by restriction endonucleases.
1. Recombinant ProcoUagen Or Collagen
The D3-period of proαl (I) is excised using Srfl and Nael restriction nucleases. The bases coding for the amino acids found in the collagenase recognition site present in the D3 period are modified so that they code for a different amino acid sequence. The cassette is amplified and reinserted in the gene. Expression of the gene in an appropriate host cell will result in type I collagen which cannot be cleaved by collagenase.
2. ProcoUagen Or Collagen Deletion Mutants
A D2 period cassette (of the proαl (I) chain) is excised from the gene described above by digestion with Smal. The gene is reassembled to provide a gene having a specific 5 in-frame deletion of the codons for the D-2 period.
3. ProcoUagen Or Collagen Addition Mutants
Multiple copies of one or more D-cassettes may be inserted at the engineered sites to provide multiple copies of desired regions of procoUagen or collagen.
G. Example 7: Expression Of Human Prolyl 4-Hydroxylase In A Recombinant DNA System
To obtain expression of the two genes for prolyl 4-hydroxylase in insect cells, the following procedures were canied out. The baculovirus transfer vector pVlα58 was constructed by digesting a pBluescript (Stratagene) vector containing in the Small site the full-length cDNA for the α subunit of human prolyl
4- hydroxylase, Pα-58 (Helaakoski et al, Proc. Natl. Acad. Sci. USA 86,
4392-4396 (1989)), with Pstl and BamHI, the cleavage sites which closely flank the Smal site. The resulting Pstl-Pstl and Pstl-BamHI fragments containing 61 bp of the 5' untranslated sequence, the whole coding region, and 551 bp of the 3' untranslated sequence were cloned to the Pstl-BamHI site for the baculovirus transfer vector pVL1392 (Luckow et al , Virology 170_:31-39 (1989)). The baculovirus transfer vector pVLα59 was similarly constructed from pVL1392 and another cDNA clone,
Pα-59 (Helaakoski et al , supra), encoding the α subunit of human prolyl
4-hydfoxylase. The cDNA clones Pα-58 and Pα-59 differ by a stretch of 64 bp.
The pVLβ vector was constructed by litigation of an EcoRI-BamHI fragment of a full-length cDNA for the β subunit of human prolyl 4-hydroxylase, S-l 38
(Pihlajaniemi et al , EMBO J. 6:643-649 (1987)) containing 44 bp of the 5' untranslated sequence, the whole coding region, and 207 bp of the 3' untranslated sequence to EcoRI/BamHI-digested pVL1392. Recombinant baculovirus transfer vectors were cotransfected into Sf9 cells (Summers et al , Tex. Agric. EXP. St. Bull. 1555: 1-56 (1987)) with wild-type Autographa californica nuclear polyhedrosis virus (AcNPV) DNA by calcium phosphate transfection. The resultant viral pool in the supernatant of the transfected cells was collected 4 days later and used for plaque assay. Recombinant occlusion-negative plaques were subjected to three rounds of plaque purification to generate recombinant viruses totally free of contaminating wild-type virus. The screening procedure and isolation of the recombinant viruses essentially followed by the method of Summers and Smith, supra. The resulting recombinant viruses from pVLα58, pVLα59, and pvLfi were designated as the α58 virus, α59 virus and β virus, respectively.
Sf9 cells were cultured in TNM-FH medium (Sigma) supplemented with 10% 5 fetal bovine serum at 27°C either as monolayers or in suspension in spinner flasks (Techne). To produce recombinant proteins, Sf9 cells seeded at a density of 106 cells per ml were injected at a multiplicity of 5-10 with recombinant viruses when the α58, α59, or β virus was used alone. The α and β viruses were used for Q infection in ratios of 1: 10-10: 1 when producing the prolyl 4-hydroxylase tetramer. The cells were harvested 72 hours after infection, homogenized in 0.01 M Tris, pH 7.8/0.1 M NaCl/0.1 M glycine/lOμM dithiothreitol/0.1 % Triton X-100, and centrifuged. The resulting supernatants were analyzed by SDS/10% PAGE or nondenaturing 7.5% PAGE and assayed for enzyme activities. The cell pellets were 5 further solubilized in 1 % SDS and analyzed by SDS/10% PAGE. The cell medium at 24-96 hours postinfection was also analyzed by SDS/10% PAGE to identify any secretion of the resultant proteins into the medium. The cells in these experiments were grown in TNM-FH medium without serum. ° When the time course of protein expression was examined, Sf9 cells infected with recombinant viruses were labeled with [35S]methionine (10 μCi/μl; Amersham; 1 Ci=37CBq) for 2 hours at various time points between 24 and 50 hours after infection and collected for analysis by SDS/10% PAGE. To determine the maximal 5 accumulation of recombinant protein, cells were harvested at various times from 24 to 96 hours after infection and analyzed on by SDS/10% PAGE. Both the 0.1 % Triton X-100- and 1 % SDS-soluble fractions of the cells were analyzed. Prolyl 4- hydroxylase activity was assayed by a method based on the decarboxylation of
2-oxo[l-14C]glutarate (Kivirikko et al, Methods in Enzvmologv 82:245-304 (1982)).
The Km values were determined by varying the concentrations of one substrate in the presence of fixed concentration of the second, while the concentrations of the other substrates were held constant (Myllyla et al , Eur. J. Biochem. 80:349-357 (1977)). Protein disulfide-isomerase activity of the β subunit was measured by glutathione: insulin transhydrogenase assay (Carmichael et al , J. Biol. Chem. 252:7163-7167 (1977)). Western blot analysis was performed using a monoclonal antibody, 5B5, to the β subunit of human prolyl 4-hydroxylase (Hoyhtya et al. , Eur. J. Biochem. 141:477-482 (1984)). Prolyl 4-hydroxylase was purified by a procedure consisting of poly (L-proline) affinity chromatography, DEAE-cellulose chromatography, and gel filtration (Kivirikko et al , Methods in Enzvmologv 144:96-114 (1987)).
Figure 5 presents analysis of the prolyl 4-hydroxylase synthesized by the insect cells after purification of the protein by affinity-column chromatography. When examined by polyacrylamide gel electrophoresis in a nondenaturing gel, the recombinant enzyme co-migrated with the tetrameric and active form of the normal enzyme purified from chick embryos. After the purified recombinant enzyme was reduced, the (- and (- subunits were detected. As set forth in Figure 5, lanes 1-3 are protein separated under non-denaturing conditions and showing tetramers of the two kinds of subunits. Lanes 4-6 are the same samples separated under denaturing 5 conditions so that the two subunits appear as separate bonds.
TABLE II presented data on the enzymic activity of the recombinant enzyme.
The Km values were determined by varying the concentration of one substrate in the presence of fixed concentrations of the second while the concentration of the other ° substrates were held constant.
5 TABLE II
Figure imgf000043_0001
0
As indicated, the Michales-Mento (Km) values for the recombinant enzyme were essentially the same as for the authentic normal enzyme from chick embryos.
Since the transfected insect cells synthesize large amounts of active prolyl 5 4-hydroxylase, they are appropriate cells to transfect with genes of the present invention coding for procoUagens and collagens so as to obtain synthesis of large amounts of the procoUagens and collagens. Transfection of the cells with genes of the present invention is performed as described in Example 3. 0
H. Example 8: Expression of Recombinant Collagen Genes in
Saccharomyces cerevisiae Yeast Expressing Recombinant Genes for
Prolyl 4-Hydroxylase
The yeast Saccharomyces cerevisiae can be used with any of a large number of expression vectors. One of the most commonly employed expression 5 vectors is the multi-copy 2μ plasmid that contains sequences for propagation both in yeast and E. coli, a yeast promoter and terminator for efficient transmission of the foreign gene. Typical examples of such vectors based on 2 μ plasmids are pWYG4 that has the 2 μ ORI-STB elements, the GALI remoter, and the 2μ D gene ° terminator. In this vector an Ncol cloning site is used insert the gene for either the ( or ( subunit of prolyl 4-hydroxylase, and provide the ATG start codon for either the α or β subunit. As another example, the expression vector can be pWYG7L that has intact 2μ ORI, STB, REP1 and REP2, the GAL7 promoter, and uses the FLP 5 terminator. In this vector, the gene for either the ( or ( subunit of prolyl 4- hydroxylase is inserted in the polylinker with its 5' ends at a BamHI or Ncol site. The vector containing the prolyl 4-hydroxylase gene is transformed into S. cerevisiae either after removal of the cell wall to produce spheroplasts that take up DNA on treatment with calcium and polyethylene glycol or by treatment of intact cells with lithium ions. Alternatively, DNA can be introduced by electroporation.
Transformants can be selected by using host yeast cells that are auxotrophic for leucine, tryptophane, uracil or histidine together with selectable marker genes such as LEU2, TRO1, URA3, HIS3 or LEU2-D. Expression of the prolyl 4-hydroxylase genes driven by the galactose promoters can be induced by growing the culmre on a non-repressing, non-inducing sugar so that very rapid induction follows addition of galactose; by growing the culmre in glucose medium and then removing the glucose by centrifugation and washing the cells before resuspension in galactose medium; and by growing the cells in medium containing both glucose and galactose so that the glucose is preferentially metabolized before galactose-induction can occur. Further manipulations of the transformed cells are performed as described above to incorporate genes for both subunits of prolyl 4-hydroxylase and desired collagen or procoUagen genes into the cells to achieve expression of collagen and procoUagen that is adequately hydroxylated by prolyl 4-hydroxylase to fold into a stable triple helical conformation and therefore accompanied by the requisite folding associated with normal biological function.
I. Example 9Expression of Recombinant Collagen Genes in Pichia pastoris Yeast Expressing Recombinant Genes for Prolyl
4-Hydroxylase
Expression of the genes for prolyl 4-hydroxylase and procoUagens or collagens can also be in non- Saccharomyces yeast such as Pichia pastoris that appear to have special advantages in producing high yields of recombinant protein in scaled-up procedures. Typical expression in the methylotroph P. pastoris is obtained by the promoter from the tightly regulated AOX1 gene that encodes for alcohol oxidase and can be induced to give high levels of recombinant protein driven by the promoter after addition of methanol to the cultures. Since P. Pastoris has no native plasmids, the yeast is employed with expression vectors designed for chromosomal integration and genes such as HIS4 are used for selection. By subsequent manipulations of the same cells, expression of genes for procoUagens and collagens described herein is achieved under conditions where the recombinant protein is adequately hydroxylated by prolyl 4-hydroxylase and, therefore, can fold into a stable helix that is required for the normal biological function of the proteins in forming fibrils.
The following vectors have been constructed according to the disclosures set forth herein:
1. Human collagen type III without its own signal sequence.
The 3' end of the collagen DNA was synthesized from 4195 bp downstream (EcoRI site) of the translation initiation codon to stop codon (4401 bp) of the translation by PCR (see Ala-Kokko et al, 1989 Biochem J.. 260:509-516 5 accession number X144207) using pBluescript-SM38. Notl and Xbal sites were created in the 3' end of the fragment. pBluescript-SM38 was digested with
ΕcoRl-Xbal and the large fragment (approximately 7.2 kb) was isolated. This large
ΕcoRl-Xbal fragment (approximately 7.2 kb) and the 3' collage PCR ΕcoRI-Xbal Q fragment were ligated with T4 ligase to give the plasmid pBluescript-SM38B.
The 5' end of the collagen DΝA was synthesized from 73bp downstream of the translation initiation codon to 176 bp (BamHI site) by PCR (for sequences, see
Ala-Kokko et al, 1989 Biochem J. , 260, 509-516). Clal and Notl sites were created in the 5' end of the fragment. pbluescript-SM38/B was digested by Claland BamHI 5 and the large fragment (approximately 6.3 kb) and the 862 bp BamHl-BamHl collagen fragment were isolated. The 5' PCR collagen fragment BamHl-Clal, large fragment BamHl-Clal (approximately 6.3 kb) and 862 bp BamHl-BamHl fragments were ligated with T4 ligase (triple ligation), to give plasmid pBluescript-SM38/ll . 0 pBluescriρt-SM38 11 was digested by Νotl. Then Notl-Notl collagen fragment (73 bp-4401 bp) was cloned in frame with α-factor signal sequence into the pPIC9 (Invitrogen), to give plasmid pPIC9Col III(clone7).
5 2. Human collagen type III with its own signal sequence
The 3' end of the collage was created from 4195 bp downstream
(EcoRIsite) of the translation initiation codon to the stop codon of the translation by
PCR using pBluescript-SM38. An Xbal site was created in the 3' end of the fragment. The ensuing pBluescript-C3Al plasmid was digested by ΕcoRI and Xbal, large fragment (approximately 7.2 kb) was isolated and the large fragment (approximately 7.2 kb) and the 3' PCR collagen fragment were ligated with T4, to give plasmid pBluescript-C3Al/10. A Bg/ll site was created 16 bp upstream of the translation initiation codon (Lamberg et al, 1996). BgΛl-Xbal collagen fragment (-16 bp - 4401 bp) of pBluescript-C3Al/10 was then ligated into the EcoRI site of the pHIL-D2 (Invitrogen) to give plasmid pHII- D2/colIII.
3. Human prolyl 4-hydroxylase
A vector, pYM25, which contains ARG4 gene of Saccharomyces
Cerevisiae, was digested by Hpal. The Hpal fragment of ARG4 gene was inserted into the EcoRV sites of pAO815 (Invitrogen) to create a vector pARG815, which contains the ARG4 gene instead of the HIS4 gene.
The β-subunit cDNA of prolyl 4-hydroxylase (Vuori et al, 1992 PNAS. , 89,
7467-7470) was synthesized from the translation initiation codon to the stop codon by
PCR. EcoRI restriction sites were created in 5' and 3' ends. The C-terminal ΕR retention peptide sequence-KDΕL-was modified to yeast ΕR retention signal-HDΕL-(CAC GAT GAA CTG) by PCR. The EcoRI-EcoRI -fragment was inserted into the pBluescript SK, to give plasmid pBluescript SKβ/20. Then this
EcoRI -fragment was inserted into the EcoRI site of PAO815 (Invitrogen) to create single expression cassette vector. The 5' end of the α-subunit was synthesized from the translation initiation codon to 689 bp downstream (Hindlll site) by PCR. Hindlll and Smal sites were created in 5' end of the fragment. Plasmid pA-59 (see Vuori et al) was digested by Hindllland the large fragment (approximately 4.9 kb) was isolated. The large fragment (approximately 4.9 kb) and the 5' PCR fragment were ligated with T4 ligase, to give plasmid pA-59/15. The 3' end was synthesized from 1373 bp (Pstl site) downstream of the translation initiation to the translation stop codon. Smal and BamHI sites were created in 3' end of the fragment. Plasmid pA-59/15 was digested with Pstland BamHI, the large fragment (approximately 3.9 kb) was isolated, and the large fragment and 3' PCR fragment were ligated with T4 ligase, to give plasmid pA-59/3. Plasmid pA-59 was digested with Smal and the Smal-Smal α-subunit fragment (1 bp-1605 bp) was ligated into EcoRI site of the pARG815.
The β single cassette vector was digested by Bglll-BamHl to excise the expression cassette and the expression cassette was reinserted into one for the BamHI site of pARG815α expression vector. Thereby the vector contains two expression cassettes: one for the α-subunit and one for the β-subunit.
The β-subunit without its signal sequence was synthesized by PCR from 52 bp downstream of the translation initiation codon to the translation stop codon. ΕcoRI restriction sites were created in 5' and 3' ends. This PCR fragment was cloned into the ΕcoRI site of pSP72 (Promega).
J. Example 10: Expression of Recombinant Collagen Genes in Insect
Cells Expressing Recombinant Genes for Prolyl 4-Hydroxylase
1. Construction of Recombinant Vectors Containing CoUagen Genes. pVLClAl : The baculovirus transfer vector, was constructed using the eukaryotic expression vector CMV- COLIAI (Geddis et al , Matrix
13:399-405 (1993)) and the polyhedrin-based baculovirus transfer vector pVL 1392
(Luckow et al , Virology 170:31-39 (1989)). CMV-COL1A1 contains the sequences coding for the full length cDNA sequence of the αl chain of the human procoUagen I (COLIAI).
Digestion of CMV-COL1A1 with Xbal generates the full length cDNA for
COLIAI including six bp 5' untranslated, and 222 bp 3' untranslated, and this fragment is cloned into the Xbal site of pVL1392 to give the plasmid pVLClAl. pVLClA2: The baculovirus transfer vector was constructed using the vector pUC-HP2010 (Kuivaniem et al , Biochem. J. 252:633-640 (1988)) and the polyhedrin-based baculovirus transfer vector pVL 1392 (Luckow et al. , Virology 170:31-39 (1989)). pUC-HP2010 contains the sequences coding for the full length cDNA sequence of the α2 chain of the human procoUagen I (COL1A2) in the Sphl site of pUC19. pUC-HP2010 is digested with Sphl, the GTAC overhang is removed with T4
DNA Polymerase, and the blunt ended fragment is cloned into the EcoRV site of pSP72 (Promega). A Bglll site is made six bp upstream of the translation initiation site by PCR, to give the plasmid pSP72-ClA2T. The full length cDNA for COL1A2 is generated by cutting pSP72-ClA2T with BgHl-BamHl. The BgHl-BamHl fragment from pSP72-ClA2T has the full length COL1A2 sequence plus six bp 5' untranslated, and 278 bp 3' untranslated, and this fragment is cloned into the BgHl-BamHl sites of pVL1392 to give pVLClA2. pVLC3Al: A Bglll site was created 16 bp upstream of the translation initiation codon to a full- length cDNA including 92 bp 5' untranslated region and 715 bp 3' untranslated region for the proαl chain of human type III procoUagen in the plasmid pBS-SM38 (derived from sequences presented in Ala-Kokko et al Biochem. J. 260: 509-516 (1989), and GenBank accession number X 14420) by PCR, to give the plasmid pBS-C3Al . pBS-C3Al was digested with Bglll and Xbal restriction enzymes and the Bglll/Xbal fragment containing the full-length cDNA of proαl chain of human type III procoUagen including 16 bp 5' untranslated region, and 715 bp 3' untranslated region, was then ligated to pVL1392 (Luckow et al.
Virology 170:31-39 (1989)) to give the plasmid pVLC3Al . pVLC3A15'UT/C2Al : The baculovirus transfer vector was constructed using the sequences presented in Baldwin et al , Biochem. J. 262:521-528 (1989) resulting in the vector pGΕMC2Al and the polyhedrin-based baculovirus transfer vector pVL
1392 (Luckow et al. , Virology 170:31-39 (1989)). pGEMC2Al contains the sequences coding for exon I from type I collagen, and type II collagen starts from exon 2B. pGEMC2Al is digested with Xbal-Dral to generate a fragment with the full length cDNA fusion, and six bp 5' untranslated region and 396 bp 3' untranslated region, and this fragment is cloned into the Xbal-Smal sites of pVL1392 to give the plasmid pVLClAl/C2Al. The 5' untranslated region was then changed to GATCTGATATT by cloning an oligonucleotide into the BgHl-Xbal sites of the COL
II vector.
PNLC3A1NP/C2A1: pGEMC2Al is digested with Xbal-BamHl and the full length cDNA fusion is cloned into the Xbal-BamHl sites of pBS(SK-) to give the plasmid pBSClAl/C2Al. pBSClAl/C2Al is digested with BgHl-Narl to generate a full length cDNA without the N-propeptide, the N-propeptide with 16 bp 5' untranslated from type III collagen was synthesized by PCR using the plasmid pBS-C3Al as a template. The oligonucleotides used to synthesize the type-Ill
N-propeptide were as follows: 5' oligo
(5'-TACTCTAGACTCAGATCTGATATT-3') and 3' oligo (5'-
GGGAGAATAGTTCTGAGGACCAGT-3'). The 35 bp fragment of telopeptide from type II collagen was synthesized by oligonucleotides (chemical synthesis). The following oligonucleotides were used
5'-CAGATGGCTGGAGGATTTGATGAAAAGG CT GGTGG-3'; and
5 ' -CGCC ACC AGCCTTTTC ATC AAATCCTCC AGCC ATCTG-3 ' . These fragments were ligated into pBSClAl/C2Al digested with Bglϊl-Narl. This hybrid full length cDNA was excised with BgHl-Dral and cloned into the Bglϊl-Notl (the Notl site is blunt ended by filing in the overhangs with klenow and dΝTPs) sites of pVL1392 to give the plasmid pVLC3AlΝP/C2Al. pVLC4Al : The baculovirus transfer vector was constructed using the vector αlCMVC which was constructed by R. Niecht K*ln (based on the sequence published by Brazel et al. , Eur. J. Biochem. 168:529-536 (1987), and Soininen et al , FEBS Lett. 225:188-194 (1987)) and the polyhedrin-based baculovirus transfer vector pVL 1392 (Luckow et al. , Virology 170:31-39 (1989)). αlCMVC was digested with Clal to generate a full length cDNA with 18 bp 5' untranslated and 203 bp 3' untranslated, and this fragment was blunt ended using
Klenow polymerase (Pharmacia Biotech) and a mixture of dNTPS and cloned into the
Smal site of pVL1392 to give the plasmid pVLC4Al. pVLE26: The baculovirus transfer vector was constructed using the cDNA clone E-26 in vector pBluescript (SK-) (Pihlajaniemi et al. , J. Biol. Chem.
265:16922-16928 (1990)) and the polyhedrin-based transfer vector pVL1392 (Luckow et al, Virology 170:31-39 (1989)). The cDNA clone E-26 encodes the αl chain of human type XIII collagen that is ligated into the EcoRI site of pBS(SK-) (construct termed clone Ε-26). The E-26 clone is described in, for example, Pihlajaniemi et al , J. Biol. Chem. 265: 16922-16928 (1990). The cDNA E-26 was obtained from a λgtll cDNA library derived from human umbilical vein endothelial cells (Clontech), and the insert was released by digestion with EcoRI. This EcoRI fragment was ligated into the EcoRI site of pBR322 to give the clone Ε-26. Clone Ε-26 is digested with EcoRI to generate the Ε-26 cDNA covering type XIII coding sequences. 123 bp 5' untranslated region and 117 bp 3' untranslated region are included, and this fragment is cloned into the EcoRI site of pVL1392 to give the plasmid pVLΕ26. pVLhuXIII: The baculovirus transfer vector was constructed using clone E-26 (Pihlajaniemi et al , J. Biol. Chem. 265: 16922-16928 (1990)), genomic human type XIII collagen sequences (Tikka et al, J. Biol. Chem. 266:17713-17719 (1991)) and the polyhedrin-based baculovirus transfer vector pVL1932 (Luckow et al. Virology 170:31-39 (1989)). A clone called pBShuXIII was constructed and it contains the clone E-26 of the αl chain of human type XIII collagen with the 5' end of genomic human type XIII collagen covering nucleotides 1-272 from the type XIII collagen gene generated by PCR, in the Notl-EcoRl site of pBS(SK-) to give the full-length cDNA of type XIII collagen (Tikka et al , J. Biol. Chem. 266: 17713-17719 (1991)).
The 5' end of the genomic human type XIII collagen was generated using CL412 (a lambda clone isolated from a human genomic library (Clontech)) as the template and the PCR primers: 5' primer (5'-ATGCGGCCGCACGCGAGAGGATGGTAGC-3'), and 3' primer (5'- TAGCTGTCTCCATTTGCTGCTC-3'). The 5'-PCR-primer included a new Notl restriction site preceding the type XIII sequences, which was used as well as a Pstl site between nucleotides 216 and 217 (Tikka et al, J. Biol. Chem. 266:17713-17719 (1991)), for cloning the 5'-PCR-product into the clone E-26 digested with Notl and Pstl (Pihlajaniemi et al , J. Biol. Chem. 265:16922-16928 (1990)). pBShuXIII is digested with Notl-EcoRl to generate the full-length cDΝA with 10 bp 5' untranslated region and 117 bp 3' untranslated region, and this fragment is cloned into the Notl-EcoRl sites of pVL1392 to give the plasmid pVLhuXϋl. pVLmoXIII: The baculovirus transfer vector was constructed using the vector pBSmoXIII and the polyhedrin-based baculovirus transfer vector pVL1392, which is described in Luckow et al , Virology 170:31-39 (1989). pBSmoXIII consists of the cDΝA clone 689 in pBluescript encoding the αl chain of mouse type XIII collagen wherein the 5' end was generated by PCR and the 3' end by ligation of a fragment from the plasmid moC-2. Clone 689 is a cDΝA derived from mouse spleen RΝA as follows: total mouse spleen RΝA is reacted with reverse transcriptase and the primer 5'-ACACACACAGGCCAGT-3'. The reverse transcriptase products are then used as a template for a first PCR reaction with the primers: 5' primer (5'-ATGAATTCGCCAGTCCCAGGTTAGAGGCA-3'), and 3' primer (5'-ATGAATTCAAGTTCTACTCGCGTAGGCGC-3'), and these products were used as a template for second PCR reaction with the primers: 5' primer (5'-ATGAATTCGTTCCAGCAGCCTTGGACTG GTAAGC-3'), and 3' primer (5'-ATGAATTCCCGAAGATGTCTCCAGGATGT- 3'). The PCR fragment covers nucleotides 466-969 from the cDΝA sequences for mouse αl chain of type XIII collagen. cDΝA clone GUT 219.2.4 was used as a template with the PCR primers: 5' primer (5'- ATAAGCTTGAATTCCGAGGGCATGGTGGCGG-3'), and 3' primer (5'-CGAGGCCCGACGATGGACAT-3'). GUT 219.2.4 was obtained from a cDΝA library derived from newborn mouse gut RΝA using random hexamers as primers and the You-Prime-cDΝA synthesis kit (Pharmacia) by probmg with reverse transcriptase - PCR clones of the ΝC1 to ΝC4 domain of mouse type XIII collagen.
These RT-PCR clones were obtained as follows: newborn mouse gut DNA was used as a template with the primers: 5' primer (5'-ACCTTTGGCCCTGGGGGCGCAGGGAGC-3'), and 3' primer
(5'-AGGGAGAGAAAGGCGATGCTGGCA-3') to produce the M91 fragment. The
M91 fragment was used to design primers for subsequent RT-PCR reactions in both the 5' and 3' directions using combinations of primers of mouse and human origin. Clone 689 was digested with EcoRI and BamHI, and this 557 bp EcoRI-βαmHI fragment was ligated into the EcoRI and BamHI sites of pBluescript to give the plasmid P40-5. Plasmid P40-5 was digested with Hindlll and Bbvϊl and ligated with the 5' -PCR fragment digested with Hindlll and Bbvll to give plasmid P40-1. Plasmid P40-1 contains a clone with the translation intiation region and coding sequences up to the BamHI site at nucleotide 1419.
The 3' end of the mouse type XIII collagen was added to plasmid P40-1 as follows. A mouse type XIII collagen cDNA, MOABCD.5 was used to obtain the 590 bp Stul-Sacl fragment. MOABCD.5 was built from the cDNA clones GUT 229A, GUT 219.1.4, RG.6, and 18 to cover the coding sequences of mouse type XIII collagen except for the alternatively spliced exons 4A, 4B, 12, 13, and 33. Clones GUT 229A and GUT 219.1.4 are obtained from a cDNA library produced from newborn mouse gut RNA using random hexamers as primers and the You-Prime-cDNA synthesis kit (Pharmacia) by probing with reverse transcriptase - PCR clones of the NCI to NC4 domain of mouse type XIII collagen. These RT-PCR clones were obtained as follows: newborn mouse gut DNA was used as a template with the primers: 5' primer (5'-ACCTTTGGCCCTGGGGGCGCAGGGAGC-3'), and 3' primer (5'-AGGGAGAGAAAGGCGATGCTGGCA-3') to produce the M91 fragment. The M91 fragment was used to design primers for subsequent RT-PCR reactions in both the 5' and 3' directions using combinations of primers of mouse and human origin. Clone RG.6 was obtained using 3'-RACE-PCR. The reverse transcriptase reaction was carried out using mouse gut poly(A+) RNA as the template and the primer (5'-GACTGC AGTCGACATCGATTTTTTTTTTTTTTTTT-3'), followed by a first round of PCR using primers: MR- 15 (5'- GCCTCCAGGAATGAAGGGAGAAGT-3'), and primer Tail (5'-GACTCGAGTCGACATCG-3'), and a second round of PCR using the primers: MR-1 (5'-GGGGGAGAGGGGGAAGAA-3'), and primer Tail
(5'-GACTCGAGTCGACATCG-3'). These products are ligated to the vector pCRII (Invitrogen), and clone RG.6 was isolated from this library using the PCR clones of the NCI to NC4 domain of mouse type XIII collagen as probes. Clone 18 was obtained using RT-PCR. The reverse transcription reaction used newborn mouse GUT RNA as template and oligo(dT)17 as a primer, followed by a first round of PCR using primers: M-ll (5'-ATGCCATCGGAGGAGGCAG-3') and MR-13
(5'-GTTCCAGCAGCCTTGGACTGGTAAGC-3'), and a second round of PCR with the primers: MR-15 (5'-GCCTCCAGGAATGAAGGGAGAAGT-3') and MR-13
(5'-GTTCCAGCAGCCTTGGACTGGTAAGC-3'). These products were ligated into pCRlOOO (Invitrogen) and clone 18 was obtained by probing with the M91 fragment.
The GUT 229A, GUT 219.2.4, RG.6 and 18 inserts were liberated from their respective library vectors by using Notl. These Notl fragments were ligated into the
Notl site of plasmid pBluescript to give the clones GUT 229A, GUT 219.1.4 and RG.6.
MOABCD.5 was constructed as follows: Clone pBluescript-GUT 219.1.4 was digested with BamHI and EcoRI, and the resulting 960 bp fragment was ligated into fiαmHI/EcoRI pBluescript-GUT 229A, to give clone MOAB.3. Plasmid pBluescript-I8 was digested with Stul and Hmdlll and the resulting 310 bp fragment was ligated into Stul/ Hindlll digested MOAB.3, to give the clone MOABC.5. The plasmid pBluescript-RG.6 was digested with Xbal and Hindlll, and the resulting 250 bp fragment was ligated into Xbal/Hindlϊl digested MOABC.5, to give the clone MOABCD.5
MOABCD.5 was digested with Stul and Sαcl, and the ensuing 673 bp
Stul-Sacl fragment was ligated into the Stul and Sαcl sites of clone 689 (plasmid
P40) to give plasmid moC-2. Plasmid moC-2 was digested with with BamHI and
Sαcl, and this 1504 bp BamHl-Sacl fragment is ligated to the BamHI and Sαcl sites of plasmid P40-1 to give the plasmid pBSmoXIII. pBSmoXIII is digested with
EcoRI to generate a full-length type XIII collagen variant with seven base pairs 5' untranslated and 288 base pairs 3' untranslated, and this fragment was cloned into the EcoRI site of pVL1392 to give the plasmid pVLmoXIII. Another alternatively spliced full-length cDΝA variant for the 1 chain of mouse type XIII collagen was constructed and is termed pVLmoXIII(-Ε12/-Ε13). This construction is identical to pVLmoXIII, except that it lacks the sequence that encodes exon 12. pVLC15Al: The baculovirus transfer vector was constructed using a PCR fragment covering nucleotides 14 to 1374 of type XV procoUagen cDΝA (Kivirikko et al , J. Biol. Chem. 269: 4773-4779, (1994)). cDΝAs for type XV procoUagen were made from human umbilical cord RNA using standard techniques described in, for example, Maniatis et al. , Molecular Cloning A Laboratory Manual. Cold Spring Harbor Laboratory, N.Y. (1989) and Ausubel et al , Current Protocols in Molecular Biology. Greene Publishing Associates and Wiley Interscience, N.Y. (1989). Using the cDNAs as a template, the PCR fragment covering nucleotides 14 to 1374 of type XV procoUagen was made with the PCR primers: 5' primer (5'-GATATCACCCTT CGTCCTCCGCTAAGCTC-3'), and 3' primer (5'-GAATTCTGGCC
TCCACTTCCCCAGGCAT-3'). The PCR fragment contains an EcoRV linker sequence at the 5' end and an EcoRI linker sequence at the 3' end. The PCR fragment is digested with EcoRV and EcoRI, and ligated into the EcoRV-EcoRI sites of pBluescript (SK-). This construct was digested by Sphl (cleaving in the PCR fragment at sequences corresponding to nucleotide 1355 of sequences presented in Kivirikko et al , J. Biol. Chem. 269:4773-4779 (1994)) and EcoRI (digesting at the polylinker of pBluescript). An Sphl-EcoΕtl fragment of clone SK5-3 covering nucleotides 1355-4330 in Kivirikko et al , J. Biol. Chem. 269:4773-4779 (1994), was ligated with the above Sphl and EcoRI digested construct resulting in construct pBShuXV. The clone SK5-3 was isolated from a λgtll cDNA library derived from human placenta (Clontech), the SK5-3 insert was released by digestion with EcoRI, and this insert was ligated into the EcoRI site of pBluescript (SK-) to give SK5-3. pBShuXV is digested with ΕcoRV (cleaving at pBluescript polylinker) and Hindi
(cleaving at nucleotide 4309 of type XV collagen cDNA sequences) to generate the full length cDNA for COL XV including 76 bp 5' untranslated region, and 53 bp 3' untranslated region, and this fragment is cloned in the Smal site of pV 11392
(Luckow et al, Virology 170:31-39 (1989)) to give the plasmid pVLCUAl .
M18K: The baculovirus transfer vector was constructed using the polyhedron-based baculovirus transfer vector pVL 1393 (Invitrogen) and pBluescript
SK M18kok.ll (pBsM18kok. ll), which is made from the clones SXT-5B5, MM-103
(Rehn et al., J. Biol. Chem. 269:13929-13953 (1994)), and MM 21.3.
The cDNA SXT-5B5 was identified and cloned from a λgtlO cDNA library made from mouse embryo (Clonetech) as follows. The library was screened using a probe G2 for murine type XIII collagen to identify the clone MΕ-1. G2 had been generated by RT-PCR using newborn mouse gut RNA as template, and primers: 5' primer MR-6 (5'-CCGGTGAGCCTGCTTGTCCT-3'), and 3' primer MR-11 (5'-ATGCCATCGGAGGAGGCAG-3'). The PCR product was ligated into the vector PCR- 1000 (Invitrogen) and the construct was further digested with EcoRI-Hindlll to give the probe G2. ME-1 covers 2.3-kB of the mouse αl(XVIII) mRNA (described in Rehn et al., Proc. Nat'l. Acad. Sci. USA 91: 4234- 4238 (1994)) and it was used to rescreen the mouse embryo library and identify the clone SXT-5B5, which was isolated by digesting with EcoRI, and ligating the SXT-5B5 insert into the EcoRI site of pBluescript SK to give the plasmid ρBs(SK)SXT-5B5 containing 540 bp extreme 5' sequence of the mouse αl (XVIII) chain clone SXT-5 (SXT-5 is described in Rehn et al. , Proc. Nat'l. Acad. Sci. USA 91:4234-4238 (1994)). The cDNA clone MM-103 was obtained from a λgtlO cDNA library of poly(A) RNA from adult mouse liver (BALP/c strain) isolated by the guanidium thiocyanate method (Chomczynski et al. , Anal. Biochem. 162: 156-159 (1987)), followed by two rounds of oligo(dT)-cellulose chromatography. A cDNA library was constructed from this RNA using an oligo(dT) primer and the Time-Saver-cDNA synthesis kit (Pharmacia). This library is screened with a probe from ME-1, and he clone MM-103 was isolated by digesting with Notl. The insert MM-103 was ligated into the Nod site of pBluescript SK to give pBs(SK)MM-103. (Rehn et al., J. Biol.
Chem. 269: 13953(1994)).
The cDNA clone MM-21.3 was identified and cloned from a λgtlO cDNA library, which had been made from adult mouse liver (BALB /c strain) poly (A) RNA using oligo(dT) method (described above, see also Rehn et al., J. Biol. Chem.
269: 13929-13953 (1994)), by screening the library with probes SXT-5B5 and ME-1. The clone was digested with Notl and ligated into Notl site of pBluescript SK to give the plasmid pBs(SK)MM-21.3, which covers nucleotides 360-2572 of the mouse αl(XVIII) chain (Rehn et al., Proc. Nat'l. Acad. Sci. 91:4234-4238(1994)).
The plasmid pBs(SK)SXT-5B5 was digested with EcoRI and the resulting 540 bp fragment was further cloned into EcoRI-digested 5 kB fragment (insert + Bluescript) of plasmid pBs(SK)MM-21.3 to generate plasmid pBsM18kc.AB.pBsM18kc.AB was digested with EcoRV and Nsil, resulting in a 2.5 kB fragment, and plasmid pBsMM 103 was digested with Nsil and Notl resulting in a 1.5 kb fragment. These two fragments were ligated into EcoRV-Notl- digested vector pBluescript to give plasmid pBsM18kok. ll, which contains the full-length cDNA of the shortest variant of the αl chain of mouse type XVIII collagen (1315 amino acid residues) including 22 bp 5' untranslated region and 180 bp 3' untranslated region. pBsM18kok. ll was digested with EcoRV-Notl, and this fragment is cloned into the Smal-Notl sites of pVl .1393 to give the plasmid M18K.
M18VA2K: The baculovirus transfer vector was constructed using the polyhedron-based baculovirus transfer vector pVL 1393 (Invitrogen), and pBsv2.5 which was built from cDNA clones PE17.24 (Rehn et al., J. Biol. Chem. 269: 13929-13953 (1994)) and PX4.3 (Rehn et al. , J. Biol. Chem. 269:13929-13953 (1994)), and plasmid pBsM18kok.l l (described previously, see the construct M18K) to generate pBsM18VA2K.
The cDNA clone PEI 7.24 is isolated from a cDNA pool made from 18.5 day-old mouse embryo poly(a). RNA with the primer
(5'-GATGGCAAATAGCACCC-3'). The cDNA from this synthesis are ligated into λgtlO vectors, and the products are screened using a probe from ME-1. The clone PEI 7.24 was identified in this way and the insert was isolated by digesting with Notl, and the PEI 7.24 insert was ligated into the Notl site of pBluescript SK to give the pBs(SK)PE17.24.
Plasmid v2.5 was built from clones PE17.24 and PX4.3 by digesting the pBsPE17.24 with EcoRI and ligating into the resulting 4.8 kB fragment (insert +
Bluescript) the EcoRI digested 90 bp fragment of PX4.3 to cover the long 764 residues form of the type XVIII collagen NCI domain. To construct the full-length cDNA encoding the longest variant αl(XVIII) chain (1774 amino acid residues), including 3 bp 5' untranslated region and 180 bp 3' untranslated region, the plasmid v 2.5 was digested with Clal and the resulting 1.5 kB fragment was ligated into a Clal-digested 7.3 kB fragment (insert + Bluescript, the other Clal site in Bluescript) of pBsM18kok. ll (plasmid described previously, see construct M18K) resulting in the clone pBsM18VA2K. pBsM18VA2K was further digested with EcoRV and Notl, and the resulting fragment was cloned into the Smal-Notl sites of pVL1393 to give M18VA2K.
M18NC1: The baculovirus transfer vector was constructed using the cDNA clones SXT-5B5 (described above, see construct M18K) and ME-1 (Rehn et al., Proc. Nat'l. Acad. Sci. 91 :4234-4238(1994)), and the polyhedron- based baculovirus transfer vector pVL 1393 (Invitrogen). SXT-5B5 was identified and cloned as described previously (see construct M18K). ME-1 covers 2.3 kB of the mouse α (XVIII) mRNA (described in Rehn et al. , Proc. Nat'l. Acad. Sci. 91:4234-4238 (1994)), and it encodes the N-terminal noncollagenous domain (NCI) of the shortest variant of the mouse type XVIII collagen αl chain (characterizing and isolating of the clone ME-1 described previously, see construct M18K).
A stop codon is generated to the 3' end of the NCI domain by PCR, using ME-1 as template and the primers: 5' primer - T7, 17-mer primer (5'-AATACGACTCACTATAG-3'), and the 3' primer M18Bacl (5'- GAAGGGGCTTGATAAATGAGGATCCAT-3') including an in-frame stop codon and a BamHI digestion site. The 400 bp PCR product was digested with EcoRI and BamHI and ligated to EcoRI-BamHI-digested pBluescript SK to give plasmid pBsNCII, ρBsSXT-5B5 was digested with EcoRI and the resulting 540 bp fragment was further cloned into the EcoRI- digested pBsNCIL to give the plasmid pBsM18NCl, encoding the NCI domain and 22 bp of 5' untranslated sequences. pBsM18Ncl is digested with EcoRV-Notl and the resulting fragment is cloned into the Smal-Notl sites of the pVI 1393 to give the plasmid M18NC1.
M18VA2N: The baculovirus transfer vector was constructed using the polyhedron-based baculovirus transfer vector pVL1393 (Invitrogen), and the plasmid pBsM18NCl (described previously, see construct M18NC1), and the plasmid pBsV2.5 (see the construct M18VA2K). Plasmid pBsV2.5 was digested with Clal and the resulting 1.5 kB fragment was cloned into the Clal-digested 4.8 kB fragment of pBsM18NCl to generate the plasmid pBsM18VA2.3 encoding the longest variant aminoterminal noncollagenous domain (NC1-764) of type XVIII collagen al chain. pBsM18VA2.3 is digested with EcoRV-Notl and the resulting fragment is cloned into the Smal-Notl sites of pVL1393 to give the plasmid M18VA2N. M18C: The baculovirus transfer vector was constructed using the vector pBluescript (SK)MM-103 (described previously, see the construct M18K) and the polyhedron-based baculovirus transfer vector pVL 1393 (Invitrogen). pBluescript(SK)MM-103 encodes the cDNA for the C-terminus of the a 1 chain of mouse type XVIII collagen in the Notl site of pBluescript SK. pBs(SK)MM-103 was digested with EcoRI-Notl which generates a cDNA fragment covering nucleotides 2802-4080 (see, Rehn et al., J. Biol. Chem. 269:13929-13953 (1994)) with a translation initiation codon at nucleotides 3010-3012 corresponding to the C-terminal noncollagenous domain (amino acid residues 997-1315) with 180 bp of the 3' untranslated region. This fragment is cloned into the EcoRI-Notl sites of the pVL1393 to give the plasmid M18C.
2. Construction of Recombinant Vectors Containing Collagen
Modifying Enzymes. pVLβ: The baculovirus transfer vector was constructed using the polyhedrin-based baculovirus transfer vector pVL 1392, and the vector pBS(SK-)S138 which contains the full length cDNA for the β-subunit of human prolyl 4- hydroxylase in the EcoRI site (Pihlajaniemi et al, ΕMBO. J. 6:643
(1987)). The β-subunit clone HB-95 was obtained from a human hepatoma λgtll cDNA expression library screened with purified antibodies against human prolyl
4-hydroxylase. The vector pBS(SK-)S138 was constructed by identifying the clone S138 (human prolyl 4- hydroxylase β-subunit) from a λgtl l library derived from human placenta (Clontech) using HB-95 as a probe for the β-subunit of human prolyl 4-hydroxylase, releasing the insert from the identified λgtl 1 clone with EcoRI, and inserting the EcoRI fragment into the EcoRI site of pBS(SK-) (Stratagene) to give pBS(SK-)S138. pSB(SK-)S138 was digested with EcoRI-fiαmHI to generate the full length cDNA plus 44 bp 5' untranslated and 207 bp 3' untranslated, and this fragment was cloned into the EcoRI-βα/nHI sites of ρVL1392 (Vuori et al , Proc. Natl. Acad. Sci.
USA 89:7467-7470 (1992)) to give the plasmid pVLβ pVLα: The baculovirus transfer vector was constructed using the vector pBS(SK-)PA59 which contains the full length cDNA for human prolyl 4-hydroxylase α-subunit in the Smal site (Helakoski et al, , Proc. Nat'l. Acad. Sci. USA
86:4392-4396 (1989)) and the polyhedrin-based baculovirus transfer vector pVL
1392. The cone PA59 (human prolyl 4-hydroxylase β-subunit) was obtained as follows. An oligonucleotide mixture which encodes a peptide
(Gln-Val-Ala-Asn-Tyr-Gly) from the α-subunit of prolyl 4-hydroxylase was used to screen a cDNA library from HT 1080 cells. One positive clone, HTA-2, was obtained, and a 36-mer oligonucleotide derived from the HTA-2 sequence (nucleotides 1430-1465 of the α-subunit) was used to screen a human placenta λgtll library (Clontech). Two positive clones, PA-11 and PA-15, were isolated, and the full length clone, PA59, was obtained by rescreening the placenta library with these clones.
The vector pBS(SK-)PA59 was constructed by releasing the λgtl 1 insert from the clone PA59 by digestion with HihPl and Accl, blunt ending the PA59 fragment with Klenow (Pharmacia Biotech), and cloning the blunt ended PA59 fragment into the Smal site of pBS(SK-) (Stratagene) to give pBS(SK-)PA59. pBS(SK-)PA59 was digested with Pstl and BamHI to generate Pstl-Pstl and Pstl-BamHI fragments containing the full length cDNA plus 61 bp 5' untranslated region, and 551 bp 3' untranslated region, and these fragments are cloned into the Pstl- BamHI sites of pVL1392 (Vuori et al , Proc. Natl. Acad. Sci. USA 89:7467-7470 (1992)) to give the plasmid pVLα. p2Bacβ: pBS(KS-)S138 was constructed by digesting pBS(SK-)S138 with
EcoRI to release the S138 clone, and then inserting the S138 fragment into the EcoRI site ofpBS(KS-)S138. pBS(KS-)S138 was digested with BαmHI to give the full length β-subunit of human prolyl 4- hydroxylase including 44 bp 5' untranslated region and 207 bp 3' untranslated region. This fragment was cloned into the BαmHI site of p2Bac to give p2Bacβ. pBS(SK-)PA59 was mutated by PCR to place a Notl site 46 bp upstream of the initiation codon for the α-subunit of prolyl 4-hydroxylase to give the plasmid pBS(SK-)PA59/5'UTΝotI as follows. The plasmid pBS(SK-)PA59 and the primers: 5' primer (S'-GCCCTCGCGGCCGCCTTTCCAGGT-S'), and 3' primer
(5'-TGACATATCCTTAAGGACCAGTTC-3'), are used in a first PCR reaction, followed by a second PCR reaciton using pBS(SK-)PA59 and the primers: 5' primer
(5'- CGAGGTATCGATAAGCTTG-3'), and 3' primer (fragment from the first PCR reaction). The second PCR product is digested with Clal and Aflϊl, and ligated into pBS(SK-)PA59 to generate pBS(SK-)PA59/5'UTNotI. pBS(SK-)PA59/5'UTNotI is digested with Notl to generate a fragment with the full length α-subunit of prolyl
4-hydroxylase including 46 bp 5' untranslated region and 551 bp 3' untranslated region. This fragment is cloned into the Notl site of p2Bacβ to give the plasmid p2Bacβ.
3. Expression of Recombinant Collagen Genes in Insect CeUs with Prolyl-4-Hydroxylase.
Recombinant human collagens I, II, III, IV, XIII, XV, and _ XVIII have been expressed in insect cells by means of baculovirus expression vectors.
Expression of Collagen Type III. pVLC3Al is a recombinant expression vector encoding the full proαl chain of human type III collagen. Similar baculovirus expression vectors pVLα, pVLβ, and p2Bacβ were created for the expression of 0 human prolyl 4-hydroxylase in insect cells. The constructs were transfected in various combinations into insect cells using a BaculoGold* transfection kit
(Pharmigen).
Insect cells (Sf9 or High Five, Invitrogen) were cultured in TΝM-FH medium 5 (Sigma) supplemented with 10% fetal bovine serum (BioClear) or in a serum-free HyQ CCM3 medium (HyClone) either as monolayers or in suspension in shaker flasks at 27°C. To produce recombinant proteins, insect cells seeded at a density 5-6 x 105/ml were infected at a multiplicity of 5-10 with the recombinant virus and at a o multiplicity of 1 with the viruses for the ( subunit and ( subunit of human prolyl 4-hydroxylase (Vuori et al , Proc. Νatl. Acad. Sci. USA 89:7467-7470 (1992)). Ascorbate (80 μg/ml) was added daily to the culture medium. The cells were harvested 48- 120 h after infection, washed with a solution of 0.15 M ΝaCl and 0.02 M phosphate, pH 7.4, homogenized in a 0.3 M ΝaCl, 0.2% Triton X-100 and 0.07 M Tris buffer, pH 7.4, and centrifuged at 10,000 x g for 20 min. The remaining cell pellet that was insoluble in the homogenization buffer was further solubilized in 1 % SDS and analyzed by SDS- PAGE1. The cell culture medium was concentrated 10 times in an uitrafiltration cell (Cmicon) with a PM-100 membrane. Aliquots of the supernatants of the cell homogenates and the concentrated cell culture medium were analyzed by denaturing SDS-PAGE, foUowed by staining with Coomassie Brilliant Blue or Western blotting with an antibody to the N-propeptide of human type III procoUagen.
More specifically, Sf9 and High Five cells were infected with a recombinant baculovirus coding for the proαl (III) chains, harvested 72 h after infection, homogenized in a buffer containing 0.2% Triton X-100 and centrifuged. Aliquots of the Triton X-100 soluble protein fraction and the concentrated cell culmre medium were then analyzed either without pepsin treatment of after treatment with pepsin for lh at 22°C. The samples were electrophoresed on 8% SDS-PAGE and analyzed by Coomassie staining in A and by Western blotting using an antibody to the
N-propeptide of human type III procoUagen in B. As set forth in Figure 6, Lane 1 sets forth molecular weight markers; lanes 2-3, cell extracts; and lanes 4-5, media from Sf9 cell culmres; lanes 6-7, cell extracts; and lanes 8-9, media from High Five cell culmres. Samples in the odd numbered lanes were digested with pepsin.
Because the antibody used in the Western blotting reacts only with the N-propeptide of type III procoUagen, it does not recognize pepsin digested samples. The arrows indicate the proαl (III) and αl αl (III) chains.
Other aliquots were studied by a radioimmuno assay for the trimeric
N-propeptide of human type III procoUagen (Farmos Diagnostica) and a colorimetric method for 4-hydroxyproline (Kivirikko et al , Anal. Biochem. 19:249-255 (1967)).
Still further aliquots were digested with pepsin for lh at 22°C (Bruckner et al , Anal.
Biochem. 110:360-368 (1981)), and the thermal stability of the pepsin-resistant recombinant type III collagen was measured by rapid digestion with a mixture of trypsin and chymotrypsin.
The expression level of proαl (III) could be seen by Western blotting in samples of the Triton X-100 soluble proteins (Fig. 6B, lanes 2 and 6) and cell culmre media (Fig. 6B, lanes 4 and 8) in both Sf9 and High Five cells. After the pepsin digestion the (1 chains of type III collagen were seen in the High Five cells in the Coomassie stained gel (Fig. 6A, lane 7). The pepsin resistant (1(111) chains were not detected in the Western blot (Fig. 6B, lanes 3, 5, 7 and 9) since the antibody used reacts only with the N-propeptides of the proαl(III) chains, which were apparently digested by pepsin.
Sf9 and High Five cells were infected with the virus coding for the proαl (III) chains either with or without viruses coding for the two types of subunit of prolyl 4-hydroxylase (TABLE III). The expression level of total type III procoUagen was measured with a radioimmuno assay for the trimeric N-propeptide, and the amount of 4-hydroxyproline formed in the cells was determined by a colorimeric assay. Both values were used to calculate the amount of type III collagen produced by assuming that all the proαl (III) chains formed triple-helical molecules and that all the hydroxylatable proline residues in the proαl (III) chains had been converted to 4- hydroxyproline. Based on the known structure of type III procoUagen and the amount of 4-hydroxyproline in type III collagen, the amount of type III collagen in the samples was calculated by multiplying the N-propeptide values obtain by 7 and the 4-hydroxyproline values by, 8. AU measurements were made 72 h after the infection.
A considerable variation was found in the values obtained in different experiments as shown in TABLE II. Notwithstanding this variation, TABLE II provides: First, the amount of 4-hydroxyproline formed was in all experiments distinctly higher in cells infected with the prolyl 4-hydroxylase-coding viruses than in their absence. Second, the expression level obtained in High Five cells was consistently higher than that obtained in Sf9 cells. Third, in cells coinfected with the prolyl 4-hydroxylase-coding viruses the level of type III collagen produced was always higher when calculated from the 4-hydroxyproline values than from the radioimmuno assay values, suggesting either that some of the N-propeptides of type III procoUagen were degraded or that some of the fully 4-hydroxylated proαl (III) chains remained nontriple-helical. The highest type III collagen expression values were in the High Five cells that also expressed prolyl 4-hydroxylase, the amount of cellular type III collagen in these cells being about 41-81 μg/5 x 106 cells (TABLE III). The amount of type III collagen secreted into the culmre medium, when measured with the radioimmuno assay, was about 25-50% of total in Sf9 cells and about 10-30% of total in High Five cells.
Experiments were also performed in which High Five cells were grown in suspension in shaker flasks. A similar effect of prolyl 4-hydroxylase-coding viruses was seen in these experiments as above. The highest expression levels found in such experiments have ranged up to about 40 mg of type III collagen produced per liter of culture in 72 h, about 80-90% of the collagen produced being found in the cell pellet, and 10-20% in the medium.
TABLE ffl
PROLYL 4-HYDROXYLASE ACTIVITY OF TRITON X-100 EXTRACTS FROM INSECT CELLS EXPRESSING PROAL CHAINS OF HUMAN TYPE III PROCOLLAGEN WITH OR WITHOUT THE A AND B SUBUNITS OF PROLYL 4-HYDROXYLASE.
Figure imgf000063_0001
The cells expressed either no recombinant polypeptide or only the proαl (III) chains or the latter plus the α and β subunits of prolyl 4-hydroxylase. The analysis was performed 72 h after the infection. The values are given as dpm/10 μl of the Triton extract, mean of duplicate values obtained in three experiments for High Five cells, and mean of duplicate values in one experiment for Sf9 cells.
Expression of Collagen Types I and II. Baculovirus expression vectors pVLClAl and pVLClA2 were created for the expression of the proαl chain and the proα2 chain of human collagen I, and pVLC3A15'UT/C2Al was created for the expression of the proαl chain of human collagen II.
Unless otherwise specified, insect cells were cultured, and recombinant collagen produced following the procedures supra.
The expression level of proαl (I), and proαl (I) and proα2 (I) in the presence of prolyl 4-hydroxylase, and following pepsin digestion of the supernatants from cell homogenates could be seen in silver-stained 5% SDS-PAGE. See Figure 7, lanes (DIA 1). The silver-stained SDS PAGE revealed the formation of triple-helical procoUagen I in these cells. Homotrimeric collagen can be separated from heterotrimeric collagen I on a metal chelate affinity column through the use of a histidine-tag to the C-terminal domain of the proα2 chain.
The expression level of proαl (II) in the presence of prolyl 4-hydroxylase could be seen in coomassie stained 5 % SDS PAGE. See Figure 8 (wherein lane 1 depicts the expression of a homotrimer of type I collagen; lane 2 is a standard sample of type II procoUagen; lane 6 is a standard sample of type III procoUagen; and lanes 3-5 compare three different constructs of human type II procoUagen containing varying amounts of human procoUagen type III. Lane 3 is type II procoUagen with the C-terminal end of type III procoUagen; lane 4 is type II procoUagen with the N-terminal non-collagenous region from type III procoUagen; and lane 5 is type II procoUagen with the N- and C-terminal regions of type III procoUagen).
Several baculovirus vectors for the expression of human type II collagen were constructed. In one of these vectors, the 5' untranslated region of human type II collagen was replaced with human type III collagen 5' untranslated region. In another vector, the entire human type II collagen gene was expressed. In another insect expression vector, the N-propeptide of type II collagen was replaced with an N-propeptide of type III collagen. AU three of those vectors were found to express human type II collagen in varying levels. Expression was detected by Coomassie Blue stain SDS-PAGE and by Western blot analysis.
Expression of Collagen Types IV. XIII. and XVIII. pVLC4Al is a recombinant baculovirus expression vector encoding the proαl chain of human collagen IV. pVLhuXIII is a recombinant baculovirus vector encoding the proαl chain of human collagen XIII. pVLC15Al is a recombinant expression vector encoding the proαl chain of human collagen XV. M18K and M18VA2K are recombinant expression vectors encoding two variants of the proαl chain of human collagen type XVIII. Unless otherwise specified, insect cells were cultured and recombinant collagen produced following the procedures supra. pVLC4Al, pVLhuXIII, pVLC15Al, M18K, and M18VA2K have been transformed into insect cells, and the recombinant collagens have been successfully expressed.
4. Purification And Analysis Of Recombinant Collagen.
Purification of Recombinant Type III Collagen. The properties of the purified human type III collagen produced in insect cells were found to be very similar to those of the type III collagen extracted from carious tissues (Kielty et al , Connective Tissue and Its Heritable Disorders: Molecular. Genetic and Medical
Aspects pp.E103-147 (1993); Kivirikko, Ann. Med. 25: 113-125 (1993); van der Rest et al, Adv. Mol. Cell. Biol. 6:1-67 (1993); Brewton et al, Extracellular Matrix
Assembly and Structure pp. 129-170 (1994); Pihlajaniemi et al , Prog. Nucleic Acid Res. Mol. Biol. 50:225-262 (1995); Prockop et al , Annu. Rev. Biochem.
64:403-434 (1995)). In particular, the content of 4-hydroxyproline and the Tm of the triple helices, when determined by CD spectra, were found to be virtually identical to those of the authentic type III collagen. The content of hydroxylysine in the recombinant collagen was found to be about one-half of that of type III collagen extracted from various tissues, indicating that insect cells must have a considerable level of lysyl hydroxylase activity.
Insect cells expressing the recombinant type III procoUagen were washed with a solution of 0.15 M NaCl and 0.02 M phosphate, pH 7.4, homogenized in a cold 0.2 M. NaCl, 0.1 % Triton X-100 and 0.05 M Tris buffer, pH 7.4 (20 x 106 cells/ml), incubated on ice for 30 min, and centrifuged at 16,000 x g for 30 min. Unless otherwise mentioned, all the following steps were performed at 4°C. The supernatant was chromatographed on a DEAE cellulose column (DE-52, Whatman) equilibrated and eluted with a 0.2 M NaCl and 0.05 M Tris buffer, pH 7.4, the void volume being collected. The pH of the sample was lowered to 2.0-2.5, and the sample was digested with a final concentration of 150 μg/ml of pepsin for 1 h at 22°C. Pepsin was irreversibly inactivated by neutralization of the sample followed by an overnight incubation on ice. The recombinant type III collagen was precipitated by adding solid NaCl to a final concentration of 2 M and centrifugation at 16,000 x g for 1 h. The pellet was dissolved in a 0.5 M NaCl, 0.5 M urea, and 0.05 M Tris buffer, pH 7.4, for 1 day, and the sample was digested with pepsin as above for a second time. The sample was then chromatographed on a Sephacryl HR-500 gel filtration column (Pharmacia), eluted with a solution of 0.2 M NaCl and 0.05 M Tris, pH 7.4, dialyzed against 0.1 M acetic acid and lyophilized.
Type III procoUagen was expressed in High Five cells cultured either as monolayers or in suspension in shaker flasks. The cells were harvested 72 h after infection, homogenized in a buffer containing 0.1 % Triton X-100 and centrifuged, and the supernatant of the cell homogenate was passed through a DEAE cellulose column to remove nucleic acids. The flow through fractions containing the type III procoUagen were pooled and digested with pepsin. This converted the type III procoUagen to type III collagen and digested most of the noncollagenous proteins. The type III collagen was then concentrated by salt precipitation, solubilized and treated with pepsin as above. The type III collagen was finally separated from pepsin and other remaining contaminants by gel filtration on a Sephacryl S 500-HR column. The fractions containing the type III collagen were pooled, dialyzed and lyophilized. The purified type III collagen was analyzed by 5 % SDS-PAGE under reducing (Figure 9, lane 2) and nonreducing (Figure 9, lane 3) conditions. No contaminants were seen in the Coomassie stained gel and the type III collagen (1 chains were disulfide-bonded. Amino acid and CD spectmm analysis were performed on the purified type III collagen. The amino acid composition of the recombinant type III obtained corresponded well with the amino acid composition reported for human type III collagen. The only exception was the amount of hydroxy lysine, which was 3 residues/1000 amino acids in the recombinant type III collagen instead of 5/1000 amino acids in the authentic human type III collagen. The melting temperamre of the recombinant type III collagen determined by CD spectmm analysis was 40°C.
The High Five cells gave consistently higher production rates than Sf9 cells, 5 the highest production rates seen in High Five cells culmred in monolayers ranging up to about 80 μg of cellular recombinant human type III collagen/5 x 106 cells, which conesponds to about 120 μg of type III procoUagen. When the High Five cells were culmred in suspension in shaker flasks, the highest amount of cellular type _ III collagen produced ranged up to about 40 mg/l, corresponding to about 60 mg/l of type III procoUagen.
Conformational Integrity of the Recombinant Type III Collagen. Association of the proαl (UI) chains into trimers was studied by using SDS-PAGE analysis under 5 nonreducing conditions. High Five cells were coinfected with vimses coding for the proαl (III) chains and the ( and ( subunits of human prolyl 4-hydroxylase. The cells were harvested 72 h after infection, homogenized in a buffer containing 0.2% Triton
X-100, centrifuged, and the remaining cell pellets were further solubilized in 1% 0 SDS. Aliquots of the Triton soluble proteins were treated with pepsin for 1 h at 22°C. Essentially all the proαl (III) chains synthesized were found as disulfide-bonded trimers based on the disappearance of a protein band of a high molecular weight (Figure 10, lane 2). After pepsin digestion the band corresponding 5 to the recombinant type III procoUagen was converted to a band corresponding to type III collagen, and the protein remained in the form of the trimer, thus indicating the existence of disulfide bonds between the (1 (III) chains (Figure 10, lane 3).
Virtually all the type III procoUagen expressed was soluble in the Triton
X-100-containing homogenization buffer, as no band corresponding to type III procoUagen was seen in the Triton X-100-insoluble, SDS-soluble fraction (Figure 10, lane 4).
The thermal stability of the type III collagen expressed under different cell culmre conditions was studied by using digestion with a mixture of trypsin and chymotrypsin after heating to various temperatures (Bruckner, et al. , Anal. Biochem. 110:360-368 (1981)). High Five cells were infected with vimses coding for the proαl (III) chains and the ( and ( subunits of human prolyl 4-hydroxylase. The cells were harvested 72 h after infection, homogenized in a buffer containing 0.2% Triton X-100 and centrifuged. In these experiments, ascorbate was either added daily to the cell culture medium as usual or omitted during the infection. The Triton X-100 soluble proteins were first digested with pepsin for 1 h at 22°C to convert type III procoUagen to type III collagen (Pihlajaniemi et al , EMBO J. 6:643-649 (1987)), and the trypsin/chymotrypsin digestion was then performed for aliquots of the pepsin-treated samples. The samples were then electrophoresed on 8% SDS-PAGE and analyzed by Coomassie staining. Figure 11 provides the results of this thermal stability for a variety of collagen products. As set forth in panel A, the cells were infected only with the vims coding for the proαl (III) chains, and ascorbate was omitted from the culmre medium; panel B, the cells were infected only with the vims coding for the proαl (III) chains, and ascorbate was present in the culmre medium as usually; panel C, the cells were coinfected with vimses coding for the proαl (III) chains, and the ( and ( subunits of prolyl 4-hydroxylase, but ascorbate was omitted from the culmre medium; and panel D, the cells were infected with the three vimses, and ascorbate was present in the culture medium. Lane P shows a sample digested with pepsin without subsequent trypsin/chymotrypsin digestion, lanes 27-42 show samples treated with the trypsin/chymotrypsin mixture at the temperatures indicated. The arrows show the position of the (1 (III) chains. As evidenced by these results, when the proαl (III) chains were expressed without the presence of prolyl
4-hydroxylase and ascorbate, the Tm of type III collagen was found to be at about 32-34°C (Figure 11-4). The presence of either ascorbate of prolyl 4- hydroxylase without the other had virtually no increasing effect on the thermal stability (Figure I IB and 11C). In contrast, when the proαl (III) chains were produced in the presence of both prolyl 4-hydroxylase and ascorbate, the Tm of type III collagen was increased considerably, being at about 38-40°C (Figure HD).
Purification and analvsis of Collagen Types I and II. Collagens types I and II were purified as described supra. The recombinant type II human collagen expressed from the recombinant insect cells was found to exhibit resistance to trypsin and chymotrypsin digestion. These protease digestion experiments indicated that triple helical type II human collagen was formed in the recombinant insect cells.
The thermal stability of the recombinant type II human collagen expressed from the recombinant insect cells was measured and compared with native type I human collagen. These results indicated that the recombinant type II collagen had a triple helical structure. The Tm of the recombinant type II collagen was up to about 40°C.
A. Example 11: Expression of Recombinant CoUagen Genes in Yeast Cells Expressing Recombinant Genes for Prolyl 4-Hydroxylase
1. Construction of Recombinant Vectors Containing CoUagen Genes. pPIC9ColIII. This plasmid contains the human Col III gene joined to the α-mating factor secretion signal (α-MFSS) (and containing a deletion of the native human secretion signal).
The 3' end of the COL III gene was synthesized by PCR from the 4195 bp downstream (EcoRI site) of the translation initiation codon to the stop codon (4401 bp) using pBluescript SM38 as a template and the PCR primers: 5' primer
(5'-GAAGGTGAATTCAAGGCTGA-3'), and 3' primer (5'-GCGTCTAGAGCGG
CCGCTTATAAAAAGCAAACAGGGCC-3'). Notl and Xbal sites were created in the 3' end of the PCR fragment. The PCR fragment was digested with EcoRI and
Xbal and cloned into the EcoRI and Xbal sites of pBluescript-SM38 (pBS-SM38 is derived from sequences presented in Ala-Kokko et al Biochem. J. 260: 509-516 (1989)), and GenBank accession number X14420) to give the plasmid pBluescript-SM38/B.
The 5' end of the Col III gene was synthesized from 73 bp downstream of the translation initiation codon to 176 bp (BamHI site) by PCR (for sequences, see
Ala-Kokko et al , Biochem.. J. 260:509-516 (1989)) using pBluescript SM38 as the template and the PCR primers: 5' primer (5'-GCGATCGATGC GGCCGCGCAGGAAGCTGTTGAAGGAGG-3'), and 3' primer (5'-GAGAA CGGATCCTGAGTCAC-3'). Clal and Notl sites were created in the 5' end of the PCR fragment. pBluescript-SM38/B was digested with Clal and βαmHI, and the fragments from this digest and the 5' PCR fragment were ligated with T4 ligase to give the plasmid pBluescript-SM38/ll. pBluescript-SM38/l l was digested by Notl and the Notl-Notl collagen fragment (73-4401 bp) was cloned in frame with the α-factor signal sequence in the yeast expression vector pPIC9 (Invitrogen) to give the plasmid pPIC9COLIII. ρHIL-D2/colIII. The 3' end of the COL III gene was synthesized by PCR from the 4195 bp downstream (EcoRI site) of the translation initiation codon to the stop codon (4401 bp) using pBluescript-SM38 as the template DΝA and the primers:
5' primer (5'-GAAGGTGAATTCAAGGCTGA-3'), and the 3' primer (5'-
GCGTCTAGATTATAAAAAGCAAACAGGGCC-3'). An Xbal site was created in the 3' end of the PCR fragment. pBluescript-C3Al was digested with EcoRI and
Xbal and the large fragment isolated, and the 3' PCR fragment is digested with
EcoRI and Xbal. These two fragments and the digested pBluescript-C3Al vector are ligated with T4 ligase to give pBluescript-C3Al/10. A BgHl site was created 16 bp upstream of the translation initiation codon in pBluescript- C3A1/10 and the BgUl -
Xbal fragment from pBluescript-C3Al/10, containing collagen sequences from (nucleotides - 16 to 4401) is ligated into the EcoRI site of pHIL-D2 (Invitrogen) to give plasmid PHII-D2/colHI. pAO815β. pYM25 was digested with Hpal and the fragment containing the ARG4 gene of Saccharomyces cerevisiae was isolated and cloned into the EcoRV sites of pAO815 (Invitrogen) replacing the HIS4 gene with ARG4, to give the plasmid pARG815. A cDNA of the β subunit of human prolyl 4-hydroxylase (Vuori et al. , Proc. Nat'l. Acad. Sci. USA 89:7467-7470 (1992)) was synthesized by PCR from the translation initiation codon to the stop codon, and EcoRI sites were created in the 5' and 3' ends of the PCR fragment. pVL1392/_HDΕL (Vuori et al., 1992, EMBO J. ϋ:4213- 4217) was used as the template DNA with the primers: 5' primer (5'-GGCGAATTCATGCTGCGCCGCGCTCTGCT-3'), and 3' primer (5'-GCGGAA TTCTTACAGTTCATCGTGCACAGC-3'). This PCR fragment was digested with EcoRI and cloned into pBluescript SK, to give pBluescript SKβ/20. pBluescript SKB/20 was digested with EcoRI and this fragment was cloned into the EcoRI site of pAO815 (Invitrogen), to give the plasmid pAO815β which has a single expression cassette for the β-subunit of prolyl 4-hydroxylase. pARG815α. The 5' end of the α-subunit of prolyl 4-hydroxylase was synthesized by PCR from the translation initiation codon to 689 bp downstream (HmdIII site), and HmdIII and Smαl sites were created in the 5' end of the fragment. pBS(SK-)PA59 was used as the template DNA with the primers: 5' primer (5'- GCGAAGCTTCCCGGGATGATCTGGTATATATTA-3'), and 3' primer (5'-GGATCTAGTTCAAGAAGCTT-3'). pA-59 (Vuori et al , Proc. Nat'l. Acad. Sci. USA 89:7467-7470 (1992)) was digested with HmdIII and the large fragment was isolated and ligated with the 5' PCR fragment to give pA-59/15.
The 3' end of the α-subunit was synthesized by PCR from 1373 bp (Pstl site) downstream of the translation initiation codon to the translation stop codon, and Smal and BαmΗI sites were created in the 3' end of the fragment. pBS(SK-)PA59 was used as the template DNA with the primers: 5' primer
(5'-AGTGATGTGTCTGCAGGAGGAGC- 3'), and 3' primer
(5'-GCGGGATCCCCCGGGTCATTCCAATTCTGACAACG-3'). pA-59/15 was digested with Pstl and BamHI, and the large fragment was isolated, and ligated with the 3' PCR fragment to give pA-59/3. pA-59/3 was digested with Smal and the Smal-Smal α-subunit fragment was cloned into the EcoRI site of pARG815, to give pARG815α. pARG815αβ. ρAO815β was digested with Bglll and BamHI to excise the expression cassette, and the expression cassette is cloned into the BamHI site of pARG815α to give the vector pARG815αfi. pAO815ββ - is similar to pAO815αβ, but contains two cassettes of the β subunit of the human prolyl 4-hydroxylase gene. pAO815β was digested with BgUl and BamHI to excise the expression cassette, and the expression cassette is cloned into the BamHI site of pARG815αβ to give the vector ρARG815αββ.
The β-subunit without its signal sequence was synthesized by PCR from 52 bp downstream of the translation initiation codon to the translation stop codon. EcoRI restriction sites were created in 5' and 3' ends. This PCR fragment was cloned into the EcoRI site of ρSP72 (Promega).
2. Expression of Recombinant Collagen Genes in Yeast CeUs with Prolyl-4-Hydroxylase.
Pichia pastoris host strain GS200 his4 arg4 was stably transformed with combinations of the plasmid described supra and related plasmids to produce the following recombinant strains.
P. pastoris Col Illαβ - carries the human Col III gene with α-MFSS and both subunits of the human Prolyl 4- hydroxylase.
P. pastoris nCol III - is similar to P. pastoris nCol III αβ, but uses the native Col III signal sequence. P. pastoris αβ - carries both subunits of human prolyl 4-hydroxylase.
P. pastoris αββ contains human prolyl 4-hydroxylase, wherein the α:β gene ratio fs 1:2.
P. pastoris α contains the human prolyl 4-hydroxylase α gene. P. pastoris β contains the human prolyl 4-hydroxylase β gene.
The P. pastoris strains described in paragraph 5 were grown in rotary shakers to an 0D600 of 5.0. Samples were taken and mn on PAGE gels. Western blots were performed and analyzed with antibodies against proCol III N- terminal peptide, the α-subunit of human prolyl 4-hydroxylase and the β-subunit of human prolyl 4-hydroxylase. The Western blots described in paragraph 6 demonstrated that both human collagen III and human prolyl 4- hydroxylase were produced in P. pastoris.
Pepsin digestion experiments were performed to test for triple helical structure in the human collagen produced in P. pastoris. Whereas most proteins are degraded by the proteolytic enzyme pepsin, the triple helical region of collagen is pepsin resistant. The collagen from cell lysates of P. pastoris Col Illαβ were digested with pepsin, and the digestion products were separated by SDS-PAGE. The results of these experiments indicated that triple helical human collagen III was produced in the recombinant P. pastoris cells.
Experiments were performed to measure human prolyl 4-hydroxylase activity in the P. pastoris strains described above. P. pastoris has no intrinsic prolyl 4-hydroxylase activity. The assay were performed with 14C labelled proline, essentially as described by Kivirikko in Methods in Enzvmologv. Volume 82. pgs. 245-304, Academic Press, San Diego, CA. Prolyl 4-hydroxylase activity was found in the recombinant cells.
B. Example 12: Expression of Recombinant CoUagen Genes in
Mammalian Cells Expressing Recombinant Genes for Prolyl 4-Hydroxylase
1. Construction of a Recombinant Semliki Forest Virus Vectors Containmg CoUagen Genes. pSFVmoXIII: The Semliki Forest expression vector was constructed using the vector pBSmoXIII generated based on clones and sequences as described for pVLmoXIII above (Rehn et al, submitted; Peltonen et al, submitted) and the eukaryotic expression vector pSFV-1 (Liljestrom et al , Bio/tecnologv 9:1356-1361 (1991)). pBSmoXIII is digested with EcoRI to generate the full-length type XIII collagen variant with seven bp 5' untranlsated region and 288 bp 3' untranslated region, and this fragment is made blunt ended with Klenow, and cloned into the Smal site of pSFV-1 to give the plasmid pSFVmoXIII. pSFVmoXIII plasmid was used to produce RNA by in vitro transcription using MΕGAscript8 in vitro transcription kit by Ambion. Baby hamster kidney (BNK) cells transfected with the RNA as described in Lilegestrom et al. , Current Protocols in Molecular Biology 2:16-20 (1991). Synthesis of full-length chains for mouse type XIII collagen were observed in the BHK cells by Western blotting of SDS-polyacrylamide gel- fractionated cell extracts.
Efficient expression of other collagen genes in cells of higher eukaryotes will 5 be based on the above- described Semliki Forest vims vector. Semliki Forest vims is preferred as the vims because it has a broad host range such that infection of the above mentioned mammalian cell lines will also be possible. More specifically, it is expected that the use of the Semliki Forest vims can be used in a wide range of hosts, as the system is not based on chromosomal integration, and therefore it will be a quick way of obtaining modifications of the recombinant collagens in studies aiming at identifying stmcmre-function relationships and testing the effects of various hybrid molecules. In addition, it is expected that use of the Semliki Forest vims will
15 yield very high recombinant expression levels, over 10 ug/lxlO6 cells.
HeLa cells and the vaccinia vims-based expression system can also be used to express collagens in mammalian cells, and will preferably be used to expresst type
IV collagens as homo- and hetero- trimer isoforms of the six type IV collagen 2 „0_ chains.
AU patents, patents applications, and publications cited are incorporated herein by reference.
The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the invention. Indeed, various modifications of the
25 above-descnbed makes for carrying out the invention which are obvious to those skilled in the field of immunology, biochemistry, or related fields are intended to be within the scope of the following claims.
30
35

Claims

CLAIMSWHAT IS CLAIMED IS:
1. A method for producing a collagen polypeptide, wherein said collagen is selected from the group comprising collagen types IV, V, VI, VII, VIII, IX, X, XI, XII, XIII, XIV, XV, XVI, XVII, XVIII, and XIX, comprising: a. culturing a host cell, wherein said host cell has been infected, transfected or transformed with (i) a first expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a collagen subunit; and (ii) a second expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes at least one collagen post-translational enzyme or subunit thereof; and b. purifying said collagen polypeptide.
2. The method of Claim 1 wherein the host cell is selected from the group consisting of a yeast cell, a plant cell, an insect cell and a mammalian cell.
3. The method of Claim 1 wherein the host cell is further infected, transfected or transformed with a third expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a second collagen subunit.
4. The method of Claim 3 wherein the host cell is further infected, transfected or transformed with a fourth expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a third collagen subunit.
5. The method of Claim 1 wherein said collagen post-translational enzyme is selected from the group consisting of prolyl-4-hydroxylase, lysyl oxidase, lysyl hydroxylase, C-proteinase, and N-proteinase.
6. The method of Claim 1 wherein the collagen post-translational enzyme subunit is selected from the group consisting of an alpha subunit of prolyl-4-hydroxylase and a beta subunit of prolyl-4-hydroxylase.
7. A method for producing a procoUagen polypeptide, wherein said procoUagen is selected from the group comprising collagen types IV, V, VI, VII, VIII, IX, X, XI, XII, XIII, XIV, XV, XVI, XVII, XVIII, and XIX, comprising: a. culturing a host cell, wherein said host cell has been infected, transfected or transformed with: (i) a first expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a collagen subunit; and (ii) a second expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes at least one collagen post-translational enzyme or subunit thereof; and b. purifying said procoUagen polypeptide.
8. The method of Claim 7 wherein the host cell is selected from the group consisting of a yeast cell, a plant cell, an insect cell and a mammalian cell.
9. The method of Claim 7 wherein the host cell is further infected, transfected or transformed with a third expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a second collagen subunit.
10. The method of Claim 9 wherein the host cell is further infected, transfected or transformed with a fourth expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a third collagen subunit.
11. The method of Claim 7 wherein said collagen post-translational enzyme is selected from the group consisting of prolyl-4-hydroxylase, lysyl oxidase and lysyl hydroxylase.
12. The method of Claim 7 wherein the collagen post-translational enzyme subunit is selected from the group consisting of an alpha subunit of prolyl-4-hydroxylase and a beta subunit of prolyl-4-hydroxylase.
13. A collagen polypeptide, wherein said collagen is selected from the group comprising collagen types IV, V, VI, VII, VIII, IX, X, XI, XII, XIII, XIV, XV, XVI, XVII, XVIII, and XIX, manufactured according to a method comprising: a. culturing a host cell, wherein said host cell has been infected, transfected or transformed with: (i) a first expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a collagen subunit; and (ii) a second expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes at least one collagen post-translational enzyme or subunit thereof; and b. purifying said collagen polypeptide.
14. The collagen polypeptide of Claim 13 wherein the host cell is selected from the group consisting of a yeast cell, a plant cell, an insect cell and a mammalian cell.
15. The collagen polypeptide of Claim 13 wherein the host cell is further infected, transfected or transformed with a third expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a second collagen subunit.
16. The collagen polypeptide of Claim 15 wherein the host cell is further infected, transfected or transformed with a fourth expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a third collagen subunit.
17. The collagen polypeptide of Claim 13 wherein said collagen post-translational enzyme is selected from the group consisting of prolyl-4-hydroxylase, lysyl oxidase, lysyl hydroxylase, C-proteinase, and N- proteinase.
18. The collagen polypeptide of Claim 13 wherein the collagen post-translational enzyme subunit is selected from the group consisting of an alpha subunit of prolyl-4-hydroxylase and a beta subunit of prolyl-4- hydroxylase.
19. The collagen polypeptide of Claim 13 wherein said polypeptide is not glycosolated.
20. The collagen polypeptide of Claim 13 wherein said polypeptide is partially deglycosolated.
21. A host cell which has been infected, transfected or transformed with:
(i) a first expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes a collagen subunit; and (ii) a second expression vector comprising a polynucleotide molecule having a nucleic acid sequence which encodes at least one collagen post-translational enzyme or subunit thereof.
22. The host cell of Claim 21 wherein said host cell is further infected, transfected or transformed with a third expression vector comprising a second collagen subunit.
23. The host cell of Claim 22 wherein said host cell is further infected, transfected or transformed with a fourth expression vector comprising a third collagen subunit.
PCT/US1997/007300 1996-04-12 1997-04-11 Synthesis of human procollagens and collagens in recombinant dna systems Ceased WO1997038710A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP9537462A JP2000508544A (en) 1996-04-12 1997-04-11 Synthesis of human procollagen and collagen in a recombinant DNA system
AU30566/97A AU3056697A (en) 1996-04-12 1997-04-11 Synthesis of human procollagens and collagens in recombinant dna systems

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US63133696A 1996-04-12 1996-04-12
US08/631,336 1996-04-12

Publications (2)

Publication Number Publication Date
WO1997038710A1 true WO1997038710A1 (en) 1997-10-23
WO1997038710A9 WO1997038710A9 (en) 1997-12-31

Family

ID=24530765

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1997/007300 Ceased WO1997038710A1 (en) 1996-04-12 1997-04-11 Synthesis of human procollagens and collagens in recombinant dna systems

Country Status (3)

Country Link
JP (1) JP2000508544A (en)
AU (1) AU3056697A (en)
WO (1) WO1997038710A1 (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL1007908C2 (en) * 1997-12-24 1999-06-25 Fuji Photo Film Bv Silver halide emulsions with recombinant collagen suitable for photographic administration and their preparation.
DE19834909A1 (en) * 1998-08-03 2000-02-17 Mpb Cologne Gmbh Molecular Pla Fiber proteins and their production
US6150081A (en) * 1997-12-24 2000-11-21 Fuji Photo Film B.V. Silver halide emulsions with recombinant collagen suitable for photographic application and also the preparation thereof
WO2001034646A3 (en) * 1999-11-12 2001-12-06 Fibrogen Inc Recombinant gelatins
WO2001068868A3 (en) * 2000-03-15 2002-01-31 Fibrogen Inc Alpha(iii) subunit of prolyl 4-hydroxylase
JP2002537785A (en) * 1999-03-04 2002-11-12 ピーピーエル セラピューティクス (スコットランド) リミテッド Genetic modification of somatic cells and their use
EP0992586A3 (en) * 1998-10-09 2004-04-21 United States Surgical Corporation Extracellular matrix proteins with modified amino acid
AU2005246967B2 (en) * 1999-11-12 2008-09-04 Fibrogen, Inc. Recombinant gelatins
WO2011059092A1 (en) 2009-11-16 2011-05-19 住友化学株式会社 Transformant which produces collagen wherein both lysine residue and proline residue are hydroxylated
WO2011115070A1 (en) 2010-03-15 2011-09-22 住友化学株式会社 Transformant which produces glycine repeat protein
US8058512B2 (en) 2001-06-05 2011-11-15 David N. Radin Gene expression and production of TGF-β proteins including bioactive mullerian inhibiting substance from plants
AU2006231795B2 (en) * 2005-03-31 2012-02-16 Fuso Pharmaceutical Industries, Ltd. Process for production of protein having triple-helical structure
US20130116412A1 (en) * 2010-04-02 2013-05-09 Daniel M. Pinkas Production of Post-Translationally Hydroxylated Recombinant Proteins in Bacteria
US8455717B2 (en) 2004-09-29 2013-06-04 Collplant Ltd. Collagen producing plants and methods of generating and using same
AU2012200573B2 (en) * 2005-03-31 2013-11-07 Fuso Pharmaceutical Industries, Ltd. "Process for production of protein having triple-helical structure"
CN109456989A (en) * 2018-10-31 2019-03-12 陕西慧康生物科技有限责任公司 A kind of raising Pichia pastoris secretion expression carrier construction method
CN111334512A (en) * 2019-12-06 2020-06-26 肽源(广州)生物科技有限公司 Recombinant human-like collagen containing hydroxyproline and hydroxylysine and production method thereof
US11028148B2 (en) 2017-09-28 2021-06-08 Geltor, Inc. Recombinant collagen and elastin molecules and uses thereof
US11168126B2 (en) 2019-04-12 2021-11-09 Geltor, Inc. Recombinant elastin and production thereof
US11174300B2 (en) 2020-01-24 2021-11-16 Geltor, Inc. Animal-free dietary collagen
WO2022237224A1 (en) * 2021-05-13 2022-11-17 江苏创健医疗科技有限公司 Recombinant human type xvii collagen, preparation method and application thereof
CN116218864A (en) * 2023-04-10 2023-06-06 山东多美康生物医药有限公司 A kind of recombinant type III humanized collagen α1 and its expression vector and application
CN116948014A (en) * 2023-07-18 2023-10-27 山西锦波生物医药股份有限公司 Method for biosynthesis of human structural material type VI collagen
WO2023231489A1 (en) * 2022-06-01 2023-12-07 山西锦波生物医药股份有限公司 Preparation method for biosynthetic human body structural material
CN118109506A (en) * 2024-03-08 2024-05-31 湖南诺合新生物科技有限公司 Recombinant type I human collagen and preparation method and application thereof
WO2025156530A1 (en) * 2024-01-26 2025-07-31 深圳粒影生物科技有限公司 Recombinant type xvii collagen having triple helix structure and use thereof

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11384135B2 (en) * 2017-09-22 2022-07-12 Modern Meadow, Inc. Recombinant yeast strains
WO2025116037A1 (en) * 2023-12-01 2025-06-05 株式会社UniBio Modified p4hb

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993007889A1 (en) * 1991-10-23 1993-04-29 Thomas Jefferson University Synthesis of human procollagens and collagens in recombinant dna systems
US5593859A (en) * 1991-10-23 1997-01-14 Thomas Jefferson University Synthesis of human procollagens and collagens in recombinant DNA systems

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993007889A1 (en) * 1991-10-23 1993-04-29 Thomas Jefferson University Synthesis of human procollagens and collagens in recombinant dna systems
US5405757A (en) * 1991-10-23 1995-04-11 Thomas Jefferson University Synthesis of human procollagens and collagens in recombinant DNA systems
US5593859A (en) * 1991-10-23 1997-01-14 Thomas Jefferson University Synthesis of human procollagens and collagens in recombinant DNA systems

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
AMER. J. OF RESPIRATORY CELL AND MOLECULAR BIOLOGY, 1989, Vol. 1, ADAMS S.L., "Collagen Gene Expression", pages 161-168. *
CURRENT OPINION IN CELL BIOLOGY, 1993, Vol. 5, MAYNE et al., "New Members of the Collagen Superfamily", pages 883-890. *
GENOMICS, 1990, Vol. 8, RYAN et al., "The Human Type ll Procollagen Gene: Identification of an Additional Protein-Coding Domain and Location of Potential Regulatory Sequences in the Promoter and First Intron", pages 41-48. *
J. BIOL. CHEM., 11 November 1994, Vol. 269, No. 45, HELAAKOSKI et al., "Structure and Expression of the Human Gene for the Alpha Subunit of Prolyl 4-Hydroxylase", pages 27847-27854. *

Cited By (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6958223B2 (en) 1996-06-03 2005-10-25 United States Surgical Corporation Methods for producing extracellular matrix proteins
US8188230B2 (en) 1997-12-24 2012-05-29 Fuji Film Manufacturing Europe B.V. Method for recombinant microorganism expression and isolation of collagen-like polypeptides
EP0926543A1 (en) * 1997-12-24 1999-06-30 Fuji Photo Film B.V. Silver halide emulsions with recombinant collagen suitable for photographic application and also the preparation thereof
US6150081A (en) * 1997-12-24 2000-11-21 Fuji Photo Film B.V. Silver halide emulsions with recombinant collagen suitable for photographic application and also the preparation thereof
NL1007908C2 (en) * 1997-12-24 1999-06-25 Fuji Photo Film Bv Silver halide emulsions with recombinant collagen suitable for photographic administration and their preparation.
US8084579B2 (en) 1997-12-24 2011-12-27 Fuji Film Manufacturing Europe B.V. Composition comprising a purified collagen-like polypeptide suitable as a peptizer
JP2009045069A (en) * 1997-12-24 2009-03-05 Fujifilm Manufacturing Europe Bv Silver halide emulsion with recombinant collagen suitable for photographic application, and preparation method therefor
DE19834909A1 (en) * 1998-08-03 2000-02-17 Mpb Cologne Gmbh Molecular Pla Fiber proteins and their production
WO2000008142A3 (en) * 1998-08-03 2000-06-08 Mpb Cologne Gmbh Molecular Pla Fibrous proteins and the production thereof
EP0992586A3 (en) * 1998-10-09 2004-04-21 United States Surgical Corporation Extracellular matrix proteins with modified amino acid
JP2011188873A (en) * 1999-03-04 2011-09-29 Revivicor Inc Genetic modification of somatic cell and use thereof
JP2002537785A (en) * 1999-03-04 2002-11-12 ピーピーエル セラピューティクス (スコットランド) リミテッド Genetic modification of somatic cells and their use
CN1309737C (en) * 1999-11-12 2007-04-11 法布罗根股份有限公司 Recombinant gelatins
AU2005246967B2 (en) * 1999-11-12 2008-09-04 Fibrogen, Inc. Recombinant gelatins
EP2011874A1 (en) * 1999-11-12 2009-01-07 Fibrogen, Inc. Recombinant gelatin in vaccines
JP2003513988A (en) * 1999-11-12 2003-04-15 ファイブローゲン、インコーポレーテッド Recombinant gelatin in vaccines
EP2345721A1 (en) * 1999-11-12 2011-07-20 Fibrogen, Inc. Recombinant gelatin in vaccines
WO2001034801A3 (en) * 1999-11-12 2002-01-31 Fibrogen Inc Recombinant gelatin in vaccines
WO2001034646A3 (en) * 1999-11-12 2001-12-06 Fibrogen Inc Recombinant gelatins
WO2001068868A3 (en) * 2000-03-15 2002-01-31 Fibrogen Inc Alpha(iii) subunit of prolyl 4-hydroxylase
US8058512B2 (en) 2001-06-05 2011-11-15 David N. Radin Gene expression and production of TGF-β proteins including bioactive mullerian inhibiting substance from plants
US10626408B2 (en) 2004-09-29 2020-04-21 Collplant Ltd. Collagen producing plants and methods of generating and using same
US9783816B2 (en) 2004-09-29 2017-10-10 Collplant Ltd. Collagen producing plants and methods of generating and using same
US8455717B2 (en) 2004-09-29 2013-06-04 Collplant Ltd. Collagen producing plants and methods of generating and using same
AU2012200573B2 (en) * 2005-03-31 2013-11-07 Fuso Pharmaceutical Industries, Ltd. "Process for production of protein having triple-helical structure"
US9018354B2 (en) 2005-03-31 2015-04-28 Fuso Pharmaceutical Industries, Ltd. Methods of producing proteins having triple-helix structure
AU2006231795B9 (en) * 2005-03-31 2012-04-12 Fuso Pharmaceutical Industries, Ltd. Process for production of protein having triple-helical structure
AU2006231795B2 (en) * 2005-03-31 2012-02-16 Fuso Pharmaceutical Industries, Ltd. Process for production of protein having triple-helical structure
WO2011059092A1 (en) 2009-11-16 2011-05-19 住友化学株式会社 Transformant which produces collagen wherein both lysine residue and proline residue are hydroxylated
WO2011115070A1 (en) 2010-03-15 2011-09-22 住友化学株式会社 Transformant which produces glycine repeat protein
US20130116412A1 (en) * 2010-04-02 2013-05-09 Daniel M. Pinkas Production of Post-Translationally Hydroxylated Recombinant Proteins in Bacteria
US11028148B2 (en) 2017-09-28 2021-06-08 Geltor, Inc. Recombinant collagen and elastin molecules and uses thereof
US11041015B2 (en) 2017-09-28 2021-06-22 Geltor, Inc. Recombinant collagen and elastin molecules and uses thereof
US11180541B2 (en) 2017-09-28 2021-11-23 Geltor, Inc. Recombinant collagen and elastin molecules and uses thereof
US11214609B2 (en) 2017-09-28 2022-01-04 Geltor, Inc. Recombinant collagen and elastin molecules and uses thereof
CN109456989A (en) * 2018-10-31 2019-03-12 陕西慧康生物科技有限责任公司 A kind of raising Pichia pastoris secretion expression carrier construction method
US11168126B2 (en) 2019-04-12 2021-11-09 Geltor, Inc. Recombinant elastin and production thereof
US12269865B2 (en) 2019-04-12 2025-04-08 Geltor, Inc. Recombinant elastin and production thereof
CN111334512B (en) * 2019-12-06 2023-10-13 肽源(广州)生物科技有限公司 Recombinant human-like collagen containing hydroxyproline and hydroxylysine and production method thereof
CN111334512A (en) * 2019-12-06 2020-06-26 肽源(广州)生物科技有限公司 Recombinant human-like collagen containing hydroxyproline and hydroxylysine and production method thereof
US11332505B2 (en) 2020-01-24 2022-05-17 Geltor, Inc. Animal-free dietary collagen
US11174300B2 (en) 2020-01-24 2021-11-16 Geltor, Inc. Animal-free dietary collagen
WO2022237224A1 (en) * 2021-05-13 2022-11-17 江苏创健医疗科技有限公司 Recombinant human type xvii collagen, preparation method and application thereof
US11787852B2 (en) 2021-05-13 2023-10-17 Jiangsu Trautec Medical Technology Co., Ltd. Recombinant human type XVII collagen, and preparation method and use thereof
WO2023231489A1 (en) * 2022-06-01 2023-12-07 山西锦波生物医药股份有限公司 Preparation method for biosynthetic human body structural material
CN116218864A (en) * 2023-04-10 2023-06-06 山东多美康生物医药有限公司 A kind of recombinant type III humanized collagen α1 and its expression vector and application
CN116218864B (en) * 2023-04-10 2024-01-09 山东多美康生物医药有限公司 Recombinant III type humanized collagen alpha 1 and expression vector and application thereof
CN116948014A (en) * 2023-07-18 2023-10-27 山西锦波生物医药股份有限公司 Method for biosynthesis of human structural material type VI collagen
CN116948014B (en) * 2023-07-18 2024-05-10 山西锦波生物医药股份有限公司 Method for biosynthesizing type VI collagen, a structural material for human body
CN118388630A (en) * 2023-07-18 2024-07-26 山西锦波生物医药股份有限公司 Method for biosynthesizing type VI collagen, a structural material for human body
WO2025015767A1 (en) * 2023-07-18 2025-01-23 山西锦波生物医药股份有限公司 Method for biosynthesis of human structural material type vi collagen
WO2025156530A1 (en) * 2024-01-26 2025-07-31 深圳粒影生物科技有限公司 Recombinant type xvii collagen having triple helix structure and use thereof
CN118109506A (en) * 2024-03-08 2024-05-31 湖南诺合新生物科技有限公司 Recombinant type I human collagen and preparation method and application thereof

Also Published As

Publication number Publication date
JP2000508544A (en) 2000-07-11
AU3056697A (en) 1997-11-07

Similar Documents

Publication Publication Date Title
WO1997038710A1 (en) Synthesis of human procollagens and collagens in recombinant dna systems
WO1997038710A9 (en) Synthesis of human procollagens and collagens in recombinant dna systems
US20020142391A1 (en) Synthesis of human procollagens and collagens in recombinant DNA systems
US20050164345A1 (en) Synthesis of human procollagens and collagens in recombinant DNA systems
CA2121698C (en) Synthesis of human procollagens and collagens in recombinant dna systems
US5593859A (en) Synthesis of human procollagens and collagens in recombinant DNA systems
US20080139477A1 (en) Method for producing, in yeast, a hydroxylated triple helical protein, and yeast host cells useful in said method
JP2019010096A (en) Yeast strain and method for producing collagen
JPH08510897A (en) Extracellular signal-regulated kinase, sequences, manufacturing methods and uses
US5928922A (en) α2 subunit of prolyl-4-hydroxylase, nucleic acid sequences encoding such subunit and methods for producing the same
US5773248A (en) Nucleic acid encoding a human α3(IX) collagen protein and method of producing the protein recombinantly
CN100335631C (en) Stable expression of triple helical proteins
US7951562B2 (en) Method for glycosylating hydroxylysine residues in polypeptides or peptides
US20030064436A1 (en) Method for producing, in yeast, a hydroxylated triple helical protein, and yeast host cells useful in said method
AU7170300A (en) Synthesis of human procollagens and collagens in recombinant DNA systems
HK1010334B (en) Synthesis of human procollagens and collagens in recombinant dna systems
AU780813B2 (en) Stable expression of triple helical proteins
HK1024263A (en) Synthesis of human procollagens and collagens in recombinant dna systems
Lee The roles of tropoelastin and the microfibrillar proteins in elastic fiber formation by chondrocytes
KR20050092657A (en) Gene which is specifically expressed in muscle

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AU AZ BA BB BG BR BY CA CN CU CZ EE GE GH HU IL IS JP KG KP KR KZ LC LK LR LT LV MD MG MK MN MX NO NZ PL RO RU SG SI SK TJ TM TR TT UA US UZ VN YU AM AZ BY KG KZ MD RU TJ TM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH KE LS MW SD SZ UG AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 12 AND 66,DESCRIPTION,REPLACED BY NEW PAGES BEARING THE SAME NUMBER;AFTER RECTIFICATION OF OBVIOUS ERRORS AS AUTHORIZED BY THE INTERNATIONAL SEARCHING AUTHORITY;PAGES 1/11-6/11 AND 9/11-11/11,DRAWINGS,REPLACED BY NEW PAGES 1/10-6/10,9/10 AND 10/10;PAGES 7/11 AND 8/11 RENUMBERED AS PAGES 7/10 AND 8/10;DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: CA

122 Ep: pct application non-entry in european phase