Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention are described clearly and completely below. Where specific conditions are not given in the examples, the experiments were carried out under conventional conditions or under the conditions recommended by the manufacturer. Reagents or instruments for which no manufacturer is indicated are conventional, commercially available products.
Definitions of terms
As used herein, "protein phosphorylation" refers to the process, catalyzed by protein kinases, of transferring the phosphate group of ATP to amino acid residues (serine, threonine and tyrosine) of a substrate protein, or the binding of GTP in response to a signal. The analysis of protein phosphorylation sites is difficult in the prior art because protein phosphorylation is an unstable, dynamic process in vivo, phosphorylated proteins are of low intracellular abundance, and the phosphate groups of phosphorylated proteins are easily lost during separation and, being electronegative, are difficult to protonate.
As used herein, the term "phosphorylated antibody" refers to an antibody raised against a phosphorylation site of a substrate in its phosphorylated state; that is, the immunogen is phosphorylated, and the antibody produced using the phosphorylated immunogen is referred to as a phosphorylated antibody. The prior art has the disadvantages that a specific phosphorylated antibody directed against a single site of a specific protein is difficult to prepare, the preparation is costly and time-consuming, and the preparation may not succeed.
As used herein, the term "reversible phosphorylation site" refers to a site on a protein at which phosphorylation can occur reversibly. Phosphorylation is catalyzed by a protein kinase, which transfers a phosphate group of ATP or GTP to an amino acid residue of the protein; the reverse process, dephosphorylation, is catalyzed by a protein phosphatase.
Reference herein to "most possible reversible phosphorylation sites" means a set of sites sufficient that, after all of these sites are subjected to permanent non-phosphorylation site mutation, the vector containing the resulting fragment can be used as a negative control, that is, the protein it expresses does not react with the phosphorylated antibody.
Reference herein to "permanent non-phosphorylation site mutation" and "permanent phosphorylation site mutation" means mutating the amino acid at a reversible phosphorylation site to a permanently non-phosphorylated amino acid and to a permanently phosphorylated amino acid, respectively.
The embodiment of the invention provides a method for identifying protein phosphorylation sites, which can detect the phosphorylation modification state of a single site of a protein without a phosphorylated antibody specific to that single site.
Specifically, the method comprises the following steps: the protein expressed by expression vector 1 is separated by an immunoprecipitation experiment, and an immunoblotting experiment is carried out with a phosphorylated antibody to detect the modification state of the phosphorylation site to be detected, wherein expression vector 1 contains an amino acid sequence fragment 1 having the site to be detected.
Amino acid sequence fragment 1 is a fragment of the protein to be detected in which most possible reversible phosphorylation sites, except the site to be detected, have been subjected to permanent non-phosphorylation site mutation.
Specifically, in expression vector 1, the purpose of carrying out permanent non-phosphorylation site mutation on most possible reversible phosphorylation sites other than the site to be detected is to eliminate the possibility of phosphorylation at the other reversible phosphorylation sites. A phosphorylated antibody that is not specific to a single site can then only detect the site to be detected. Under this condition, the phosphorylation state of the site to be detected can be determined without a phosphorylated antibody specific to a single site of a given protein.
In an alternative embodiment, the method further comprises separating the protein expressed by the expression vector 2 through an immunoprecipitation experiment, and performing a Western Blot experiment using a phosphorylated antibody to detect the modification of the phosphorylation site to be detected, wherein the expression vector 2 contains an amino acid sequence fragment 2 having the phosphorylation site to be detected.
Amino acid sequence fragment 2 is a fragment of the protein to be detected in which most possible reversible phosphorylation sites, including the site to be detected, have been subjected to permanent non-phosphorylation site mutation. The protein expressed by expression vector 2 serves as the negative control for the method.
In an alternative embodiment, the method further comprises separating the protein expressed by the expression vector 3 through an immunoprecipitation experiment, and performing a Western Blot experiment using a phosphorylated antibody to detect the modification of the phosphorylation site to be detected, wherein the expression vector 3 contains the amino acid sequence fragment 3 having the phosphorylation site to be detected.
Amino acid sequence fragment 3 is a fragment of the protein to be detected in which the site to be detected has been subjected to permanent phosphorylation site mutation, and most possible reversible phosphorylation sites other than the site to be detected have been subjected to permanent non-phosphorylation site mutation. The protein expressed by expression vector 3 serves as the positive control for the method.
In the case of expression vectors 1-3, the method comprises separating proteins expressed by the expression vectors 1-3, and performing Western Blot experiments using phosphorylated antibodies to detect the modification of the phosphorylated sites to be detected. In addition, "amino acid sequence fragment 1, amino acid sequence fragment 2, and amino acid sequence fragment 3" in the specification of the present application are all amino acid sequence fragments of a protein to be detected, and the differences are only in the treatment modes of phosphorylation sites on the sequence fragments.
The protein expressed by expression vector 1 is used as the wild-type sample for the site to be detected, the protein expressed by expression vector 2 is used as the negative control for the site to be detected, and the protein expressed by expression vector 3 is used as the positive control for the site to be detected. In the embodiment with expression vectors 1-3, the detection results can be compared with one another to assess differences in the degree of modification of the phosphorylation site.
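The relationship between the three constructs can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the claimed method: the function name design_constructs and the dictionary keys are hypothetical, the Ser/Thr to Ala and Tyr to Phe choice follows Example 2 (the text also allows alanine or phenylalanine more generally), and aspartic acid is used for the permanent phosphorylation mutation although glutamic acid is an equally valid option according to the text.

```python
# Substitution rules taken from the text (one of several permitted choices):
# Ser/Thr -> Ala and Tyr -> Phe for the permanent non-phosphorylation mutation,
# Asp for the permanent phosphorylation mutation (Glu is the stated alternative).
NON_PHOSPHO = {"S": "A", "T": "A", "Y": "F"}
PERMANENT_PHOSPHO = "D"


def design_constructs(sites, target):
    """Return the substitutions carried by expression vectors 1, 2 and 3.

    sites  -- list of (residue, position) tuples for all candidate sites
    target -- the (residue, position) tuple of the site to be detected
    """
    if target not in sites:
        raise ValueError("target site must be one of the candidate sites")

    def block(site):
        residue, position = site
        return f"{residue}{position}{NON_PHOSPHO[residue]}"

    others = [block(s) for s in sites if s != target]
    residue, position = target
    return {
        # Vector 1: every other site blocked, target left unmutated.
        "vector_1": others,
        # Vector 2 (negative control): every site blocked, target included.
        "vector_2": [block(s) for s in sites],
        # Vector 3 (positive control): other sites blocked, target set to Asp.
        "vector_3": others + [f"{residue}{position}{PERMANENT_PHOSPHO}"],
    }


# The ten ATGL sites named in the text, with Ser47 as the site to be detected.
ATGL_SITES = [("S", 47), ("S", 87), ("T", 101), ("T", 210), ("T", 372),
              ("Y", 378), ("S", 393), ("S", 396), ("S", 406), ("S", 430)]

constructs = design_constructs(ATGL_SITES, ("S", 47))
for name, substitutions in constructs.items():
    print(name, substitutions)
```

The three lists differ only at the site to be detected, which is why a difference in antibody signal between the three expressed proteins can be attributed to that site.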
In an alternative embodiment, the phosphorylated antibody is a phosphorylated antibody that is non-specific for a single site.
Preferably, the phosphorylated antibody comprises an antibody against any one or more of the following amino acids in their phosphorylated state: serine, threonine and tyrosine.
Further, antibodies against a single phosphorylated amino acid include serine phosphorylated antibodies, threonine phosphorylated antibodies and tyrosine phosphorylated antibodies; antibodies against several phosphorylated amino acids include serine/threonine phosphorylated antibodies, serine/tyrosine phosphorylated antibodies, tyrosine/threonine phosphorylated antibodies, and tyrosine/serine/threonine phosphorylated antibodies.
In an alternative embodiment, the means of screening for reversible phosphorylation sites on the test protein sequence is a phosphoproteomic analysis. It should be noted that the potential reversible phosphorylation sites can also be identified by screening in other ways, such as by searching the available literature.
In alternative embodiments, the permanent non-phosphorylation site mutation is a mutation of a serine, threonine, and/or tyrosine to an alanine or phenylalanine.
Specifically, "mutation of serine, threonine and/or tyrosine to alanine or phenylalanine" covers two cases. When a permanent non-phosphorylation site mutation is made at a single reversible phosphorylation site, the serine, threonine or tyrosine at that site is mutated to alanine or phenylalanine. When permanent non-phosphorylation site mutations are made at multiple phosphorylation sites on a sequence, the serine, threonine and tyrosine residues at most possible reversible phosphorylation sites on the sequence are mutated to alanine or phenylalanine.
The permanent phosphorylation site mutation is a mutation of serine, threonine and/or tyrosine to aspartic acid or glutamic acid. The permanent phosphorylation site mutation is to be understood by analogy with the permanent non-phosphorylation site mutation.
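As a minimal sketch of how such substitutions act on a sequence, the following Python function applies mutations written in the usual "S47A" notation to a one-letter protein sequence. The function name apply_substitutions and the eight-residue peptide are hypothetical and serve only as an illustration; they are not taken from the specification.

```python
import re


def apply_substitutions(protein, substitutions):
    """Apply point substitutions such as 'S3A' to a one-letter protein sequence.

    Positions are 1-based. The wild-type residue is checked before it is
    replaced, so a wrong position or residue raises an error instead of
    silently producing the wrong mutant.
    """
    residues = list(protein)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        wild_type = match.group(1)
        position = int(match.group(2))
        mutant = match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(
                f"expected {wild_type} at position {position}, "
                f"found {residues[position - 1]}"
            )
        residues[position - 1] = mutant
    return "".join(residues)


# Hypothetical 8-residue peptide used only for illustration: M1 K2 S3 A4 T5 Y6 L7 E8.
toy_peptide = "MKSATYLE"

# Permanent non-phosphorylation mutations at Ser3 and Tyr6.
blocked = apply_substitutions(toy_peptide, ["S3A", "Y6F"])
# Permanent phosphorylation mutation at Ser3, Tyr6 still blocked.
permanent = apply_substitutions(toy_peptide, ["S3D", "Y6F"])
print(blocked, permanent)
```

Checking the wild-type residue before replacing it guards against off-by-one numbering errors, which matter here because each construct differs from the others at a single position.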
In an alternative embodiment, the method is used to identify the phosphorylation site of triglyceride lipase.
Enzymes responsible for triglyceride metabolism include adipose triglyceride lipase (ATGL), hormone-sensitive lipase (HSL) and monoglyceride lipase (MGL). HSL and MGL have been studied for more than 40 years, whereas ATGL was discovered only in recent years and research on it is still in progress.
ATGL was originally found in adipose tissue and has since been found to be widely present in many tissues (including liver, cardiac muscle and skeletal muscle). ATGL is the rate-limiting enzyme of the first step of triglyceride hydrolysis. Under the action of ATGL, triglycerides are hydrolyzed to diglycerides (DAG); the diglycerides are then hydrolyzed by HSL to monoglycerides, which are finally hydrolyzed by MGL to glycerol and fatty acids. Although phosphorylation of ATGL plays an important role in the synthesis and breakdown of triglycerides, the lack of antibodies directed against single-site phosphorylation of ATGL has limited the study of how ATGL phosphorylation modifications relate to its function. The phosphorylation sites of ATGL can be analyzed rapidly and effectively with the method provided by the embodiment of the application.
In an alternative embodiment, the phosphorylation site of the triglyceride lipase is any one selected from the group consisting of Ser47, Ser87, Thr101, Thr210, Thr372, Tyr378, Ser393, Ser396, Ser406 and Ser430.
In an alternative embodiment, the phosphorylation site of the triglyceride lipase is Ser47.
The features and properties of the present invention are described in further detail below with reference to examples.
Example 1
The present example provides a method for identifying a phosphorylation site of a protein, which includes the following steps.
(1) Determination of phosphorylation sites of a test sample
Most possible reversible phosphorylation sites of the protein to be detected are identified by literature search or by phosphoproteomic analysis.
(2) Construction of expression vectors 1-3
Expression vectors 1-3 are plasmids containing amino acid sequence fragments 1-3, respectively. Amino acid sequence fragments 1-3 are sequence fragments of the protein to be detected that contain the site to be detected, and they have the same sequence length.
Amino acid sequence fragment 1 is a fragment of the protein to be detected in which most possible reversible phosphorylation sites, except the site to be detected, have been subjected to permanent non-phosphorylation site mutation;
amino acid sequence fragment 2 is a fragment of the protein to be detected in which most possible reversible phosphorylation sites, including the site to be detected, have been subjected to permanent non-phosphorylation site mutation;
and amino acid sequence fragment 3 is a fragment of the protein to be detected in which the site to be detected has been subjected to permanent phosphorylation site mutation, and most possible reversible phosphorylation sites other than the site to be detected have been subjected to permanent non-phosphorylation site mutation.
(3) Detection of phosphorylation sites
Host cells are transfected separately with each of the expression vectors 1-3 from step (2) and cultured, and the proteins expressed by the host cells are obtained. The proteins expressed by expression vectors 1-3 are separated by an immunoprecipitation experiment, and an immunoblotting experiment is carried out with a phosphorylated antibody to detect the modification state of the phosphorylation site to be detected.
Example 2
This example provides a method for detecting ATGL single-site phosphorylation, and the phosphorylation site of Ser47 is used as a research object in this example.
(1) Screening for possible phosphorylation sites of ATGL
Mouse ATGL is reported in the literature to carry 10 phosphorylation sites: Ser47, Ser87, Thr101, Thr210, Thr372, Tyr378, Ser393, Ser396, Ser406 and Ser430.
(2) Construction of expression vectors
In this example, the Ser47 phosphorylation site was taken as the site under study. A plasmid carrying permanent non-phosphorylation mutations at all 10 sites and controlled by the CMV promoter (tandem FLAG-FLAG tag, p9-ATGL-FLAG) was designed as the negative control, i.e., expression vector 2. The ATGL vector map is shown in FIG. 1. The base sequence encoding amino acid sequence fragment 2 is shown as SEQ ID No. 2.
The amino acid sequence fragment in this plasmid contains 10 permanent non-phosphorylation mutations: S47A, S87A, T101A, T210A, T372A, Y378F, S393A, S396A, S406A and S430A (A is the abbreviation for alanine and F is the abbreviation for phenylalanine). It should be noted that, in other embodiments, Y378 may be mutated to alanine in the negative control, and the other 9 phosphorylation sites may be mutated to phenylalanine.
Then, a Ser47 permanently phosphorylated plasmid (p10-ATGL-S47D-FLAG) and a Ser47 wild-type plasmid (p10-ATGL-S47-FLAG) were constructed on the basis of the negative control.
Specifically, the Ser47 permanently phosphorylated plasmid is expression vector 3, and the base sequence encoding amino acid sequence fragment 3 is shown as SEQ ID No. 3; the Ser47 wild-type plasmid is expression vector 1, and the base sequence encoding amino acid sequence fragment 1 is shown as SEQ ID No. 1.
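The assignment of SEQ ID Nos. 1-3 to the three vectors can be checked directly against the sequence listing by reading the codon for residue 47 (nucleotides 139-141). The short Python sketch below does this using only the first 180 nucleotides of each sequence as printed in the listing; the helper name codon_at is hypothetical, and the standard genetic code is assumed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_at(cdna, residue_number):
    """Return (codon, amino acid) for a 1-based residue number in a coding sequence."""
    seq = "".join(cdna.split()).upper()   # drop the spaces used in the listing
    start = 3 * (residue_number - 1)
    codon = seq[start:start + 3]
    if len(codon) != 3:
        raise ValueError("residue number lies outside the supplied sequence")
    return codon, CODON_TABLE[codon]


# Nucleotides 1-120 are identical in SEQ ID Nos. 1-3.
SHARED_1_120 = (
    "atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc "
    "taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc "
)
# Nucleotides 121-180, copied from the sequence listing.
NT_121_180 = {
    "SEQ ID No. 1": "actcacatct acggagcctc ggcaggggcg ctcaccgcca cagcgctggt cactggggcc",
    "SEQ ID No. 2": "actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc",
    "SEQ ID No. 3": "actcacatct acggagccga cgcaggggcg ctcaccgcca cagcgctggt cactggggcc",
}

residue_47 = {name: codon_at(SHARED_1_120 + tail, 47) for name, tail in NT_121_180.items()}
for name, (codon, amino_acid) in residue_47.items():
    print(name, codon, amino_acid)
```

Residue 47 reads as serine in SEQ ID No. 1, alanine in SEQ ID No. 2 and aspartic acid in SEQ ID No. 3, which matches the roles of wild-type sample, negative control and positive control, respectively.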
It should be noted that, in the examples, when Ser87 is the research site, the operation process of the method is the same as Ser47, except that in the plasmid construction, the positive control is p10-ATGL-S87D-FLAG, the wild type plasmid is p10-ATGL-S87-FLAG, and the rest sites are analogized in turn.
(3) Detection of phosphorylation sites
3.1 cell culture and plasmid transfection
HL-1 cells, a mouse-derived cardiomyocyte cell line, were transfected separately with the negative plasmid (expression vector 2), the positive plasmid (expression vector 3) and the wild-type plasmid (expression vector 1) obtained in step (2). The cells were grown in Claycomb medium (Sigma) supplemented with 10% FBS and 1% penicillin and streptomycin, and cultured at 37 ℃ under 5% CO2.
3.2 harvesting of cells
After 24-48 h of protein expression following plasmid transfection, the culture medium was discarded, the culture dish was washed 1-2 times with PBS pre-cooled to 4 ℃, and the liquid was aspirated completely. At 4 ℃ or on ice, an appropriate amount of cell IP lysis buffer pre-cooled to 4 ℃ (supplemented with 10 mM NaF, 10 mM Na3VO4 and protease inhibitor) was added, the cells were harvested with a cell scraper and lysed at 4 ℃ or on ice for 30 minutes, the lysate was centrifuged at 12000 g for 30 minutes, and the supernatant was collected.
A small amount of lysate was set aside for Western blot analysis, and the remaining lysate was added to 50 μL of FLAG magnetic beads (Sigma-Aldrich M8823) and incubated at 4 ℃ for 4 h with slow shaking. After incubation, the beads were separated with a magnetic rack and the supernatant was aspirated. The beads were washed 1-2 times with 1 mL of lysis buffer. Finally, 15 μL of 2× SDS loading buffer was added and the mixture was denatured by boiling for 10 min. Total protein was separated by 10% SDS-PAGE and transferred to a PVDF membrane. The membrane was blocked at 4 ℃ for 8 h in TBS buffer containing 0.1% Tween 20 and 5% bovine serum albumin, and then incubated with the primary antibody at 4 ℃ for 8 h. The membrane was then incubated with a horseradish peroxidase-conjugated secondary antibody for 8 h and detected with an ECL system.
The ATGL protein was separated by FLAG magnetic bead immunoprecipitation, silver staining and immunoblotting as shown in FIG. 2. In FIG. 2, Input is a transfected cell lysate; output is supernatant after immunoprecipitation of transfected cell lysate; wash is protein washed by FLAG magnetic beads after immunoprecipitation; beads are FLAG magnetic bead binding protein. The Heavy chain refers to the Heavy chain of the FLAG antibody on the magnetic beads, and the Light chain refers to the Light chain of the FLAG antibody on the magnetic beads; ATGL-FLAG refers to FLAG-tagged ATGL protein expressed by an expression vector.
The silver staining results show that the protein bound to the FLAG magnetic beads is mainly the FLAG-tagged ATGL protein expressed by the expression vector. The immunoblotting results show that the phosphorylation modification is positive for vector 3, negative for vector 2, and weakly positive for vector 1.
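One possible way to express such a comparison numerically is to place the signal of vector 1 on a scale defined by the two controls. The Python sketch below is an assumption for illustration only: the specification describes a qualitative comparison and does not prescribe any quantification, the function name relative_signal is hypothetical, and the intensity values are invented placeholders rather than measured data.

```python
def relative_signal(wild_type, negative, positive):
    """Scale the vector 1 band intensity between the two controls.

    Returns 0.0 when the wild-type signal equals the negative control
    (vector 2) and 1.0 when it equals the positive control (vector 3).
    All three inputs are band intensities in the same arbitrary units.
    """
    if positive <= negative:
        # The controls did not behave as expected, so no scale can be defined.
        raise ValueError("positive control must give a stronger signal than the negative control")
    return (wild_type - negative) / (positive - negative)


# Hypothetical densitometry values in arbitrary units, for illustration only.
example = relative_signal(wild_type=30.0, negative=10.0, positive=110.0)
print(example)
```

A check of this kind also makes the role of the controls explicit: if the positive control does not exceed the negative control, the blot cannot be interpreted and the function refuses to return a value.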
Example 3
This example provides a method for detecting ATGL single-site phosphorylation in which each of the phosphorylation sites Ser47, Ser87, Thr101, Thr210, Thr372, Tyr378, Ser393, Ser396, Ser406 and Ser430 is taken in turn as the site under study; the procedure is the same as in Example 2.
Expression vectors 1 corresponding to 10 sites to be detected are prepared respectively for the 10 phosphorylation sites, and the sequence information refers to table 1.
TABLE 1 sequence information
HL-1 cells were transfected with expression vector 1 for each of the 10 ATGL phosphorylation sites, protein was collected after 48 hours, and the ATGL protein was separated by FLAG magnetic bead immunoprecipitation. For the silver staining and immunoblotting results, please refer to FIG. 3.
The silver staining results show that the protein bound to the FLAG magnetic beads is mainly the FLAG-tagged ATGL protein expressed by the expression vector. The immunoblotting results show the phosphorylation modification of the proteins expressed by expression vector 1 for each of the ten ATGL phosphorylation sites.
As can be seen from FIG. 3, the phosphorylation modifications of the protein expressed by the expression vector 1 corresponding to each of the 10 phosphorylation sites of ATGL protein were detected by the reaction system. In conclusion, the method provided in example 1 can effectively detect the phosphorylation state of the target site. ATGL is a key enzyme in triglyceride metabolism, and phosphorylation is a major pathway for regulating ATGL activity. The method can effectively replace ATGL single-site phosphorylation antibody, and is convenient for various functional researches on ATGL. The same principle can be applied to other proteins, and provides a powerful tool for researching protein phosphorylation modification.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
SEQUENCE LISTING
<110> Beijing Thoracic Hospital Affiliated to Capital Medical University
Beijing Tuberculosis and Thoracic Tumor Research Institute
<120> a method for identifying protein phosphorylation sites
<160> 21
<170> PatentIn version 3.5
<210> 1
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 1
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagcctc ggcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 2
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 2
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 3
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 3
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccga cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 4
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 4
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatccctc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 5
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 5
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccga cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 6
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 6
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
accctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 7
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 7
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gacctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 8
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 8
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacacc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 9
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 9
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgac agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 10
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 10
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag cagacgggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 11
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 11
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggacggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 12
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 12
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gtatctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 13
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 13
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca ggagctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 14
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 14
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgccttcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 15
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 15
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgaca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 16
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 16
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactgtctga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 17
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 17
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggacga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 18
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 18
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccagtctct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 19
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 19
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggacct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgcc ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 20
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 20
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctctca ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461
<210> 21
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 21
atgttcccga gggagaccaa gtggaacatc tcattcgctg gctgcggctt cctcggggtc 60
taccacattg gcgtggcctc ctgcctccgt gagcacgcgc ccttcctggt ggccaacgcc 120
actcacatct acggagccgc cgcaggggcg ctcaccgcca cagcgctggt cactggggcc 180
tgcctgggtg aagcaggtgc caacattatt gaggtgtcca aggaggcccg gaagcggttc 240
ctgggtcctc tgcatcccgc cttcaacctg gtgaagacca tccgtggctg tctactaaag 300
gccctgcctg ctgattgcca tgagcgcgcc aatggacgcc tgggcatctc cctgactcgt 360
gtttcagacg gagagaacgt catcatatcc cactttagct ccaaggatga gctcatccag 420
gccaatgtct gcagcacatt tatcccggtg tactgtggcc tcattcctcc taccctccaa 480
ggggtgcgct atgtggatgg cggcatttca gacaacttgc cactttatga gctgaagaat 540
accatcacag tgtccccatt ctcaggcgag agtgacatct gccctcagga cagctccacc 600
aacatccacg agcttcgcgt caccaacgcc agcatccagt tcaaccttcg caatctctac 660
cgcctctcga aggctctctt cccgccagag cccatggtcc tccgagagat gtgcaaacag 720
ggctacagag atggacttcg attccttagg aggaatggcc tactgaacca acccaaccct 780
ttgctggcac tgcccccagt tgtcccccag gaagaggatg cagaggaagc tgctgtggtg 840
gaggagaggg ctggagagga ggatcaattg cagccttata gaaaagatcg aattctagag 900
cacctgcctg ccagactcaa tgaggccctg ctggaggcct gtgtggaacc aaaggacctg 960
atgaccaccc tttccaacat gctaccagtg cgcctggcaa cggccatgat ggtgccctat 1020
actctgccgc tggagagtgc agtgtccttc accatccgct tgttggagtg gctgcctgat 1080
gtccctgaag atatccggtg gatgaaagag caggccggta gcatctgcca gttcctggtg 1140
atgagggcca agaggaaatt gggtgaccat ctgcctgcca gactggccga gcaggtggaa 1200
ctgcgacgtg cccaggccct gccctctgtg ccactgtctt gcgccaccta cagtgaggcc 1260
ctacccaact gggtacgaaa caacctcgac ctgggggacg cgctggccaa gtgggaagaa 1320
tgccagcgtc agctactgct gggtctcttc tgcaccaatg tggccttccc gccggatgcc 1380
ttgcgcatgc gcgcacctgc cagccccact gccgcagatc ctgccacccc acaggatcca 1440
cctggcctcc cgccttgctg a 1461