[go: up one dir, main page]

CN110468188B - Tag sequence set for next generation sequencing and its design method and application - Google Patents

Tag sequence set for next generation sequencing and its design method and application Download PDF

Info

Publication number
CN110468188B
CN110468188B CN201910779115.8A CN201910779115A CN110468188B CN 110468188 B CN110468188 B CN 110468188B CN 201910779115 A CN201910779115 A CN 201910779115A CN 110468188 B CN110468188 B CN 110468188B
Authority
CN
China
Prior art keywords
dna
artificial sequence
sequence
tag
tag sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910779115.8A
Other languages
Chinese (zh)
Other versions
CN110468188A (en
Inventor
许腾
曾伟奇
唐毅
李永军
王小锐
苏杭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Vision Gene Technology Co ltd
Guangzhou Weiyuan Medical Equipment Co ltd
Guangzhou Weiyuan Medical Laboratory Co ltd
Weiyuan Medical Technology (Huzhou) Co.,Ltd.
Original Assignee
Guangzhou Vision Gene Technology Co ltd
Guangzhou Weiyuan Medical Laboratory Co ltd
Shenzhen Weiyuan Medical Technology Co ltd
Weiyuan Shenzhen Medical Research Center Co ltd
Guangzhou Weiyuan Medical Equipment Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Vision Gene Technology Co ltd, Guangzhou Weiyuan Medical Laboratory Co ltd, Shenzhen Weiyuan Medical Technology Co ltd, Weiyuan Shenzhen Medical Research Center Co ltd, Guangzhou Weiyuan Medical Equipment Co ltd filed Critical Guangzhou Vision Gene Technology Co ltd
Priority to CN201910779115.8A priority Critical patent/CN110468188B/en
Publication of CN110468188A publication Critical patent/CN110468188A/en
Application granted granted Critical
Publication of CN110468188B publication Critical patent/CN110468188B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a tag sequence set for second generation sequencing, and a design method and application thereof, belonging to the technical field of molecular biology. The length of each tag sequence in the tag sequence set is 6-10, the GC content in each tag sequence is 40-60%, the number of continuous bases in each tag sequence is 1-3 at the maximum, and the first two bases of each tag sequence are not G at the same time. The tag sequence set is used for second generation sequencing, and the number of finally obtained tag sequence sets is high through screening and combining the tag sequences, so that the requirement of large sample size detection can be met, and high-quality sequencing data can be accurately split while the sequencing requirement is met.

Description

用于二代测序的标签序列集及其设计方法和应用Tag sequence set for next generation sequencing and its design method and application

技术领域technical field

本发明涉及分子生物学领域,特别是涉及一种用于二代测序的标签序列集及其设计方法和应用。The present invention relates to the field of molecular biology, in particular to a tag sequence set for next-generation sequencing and its design method and application.

背景技术Background technique

二代测序(又称下一代测序,NGS)技术在发现和转化研究的诸多方面已逐渐成为一种被广泛采用的技术,二代测序由于高通量的特性,能够同时对多个样本同时进行测序,而区分样本的方法就是运用标签序列(index)进行区分。Next-generation sequencing (also known as next-generation sequencing, NGS) technology has gradually become a widely adopted technology in many aspects of discovery and translational research. Sequencing, and the method of distinguishing samples is to use the index sequence (index) to distinguish.

目前标签序列大多采用测序平台(比如Illumina)提供的序列,但是测序平台给的标签序列数目有限,再进行大规模样本测序或者双端测序的时候就需要更多量的标签序列。At present, most of the tag sequences are provided by sequencing platforms (such as Illumina), but the number of tag sequences provided by the sequencing platform is limited, and more tag sequences are required for large-scale sample sequencing or paired-end sequencing.

目前单独设计标签序列缺少一种系统的设计标签序列的方法和基于严格筛选而产生的标签序列,自己合成的标签序列就有可能不符合测序要求而导致测序失败或者测序数据难以拆分。At present, there is a lack of a systematic method for designing tag sequences and tag sequences generated based on strict screening. The self-synthesized tag sequences may not meet the sequencing requirements, resulting in sequencing failure or difficulty in splitting the sequencing data.

发明内容Contents of the invention

基于此,有必要针对上述问题,提供一种用于二代测序的标签序列集,此类标签序列能够在符合测序要求同时能够正确拆分出高质量的测序数据。Based on this, it is necessary to address the above problems and provide a tag sequence set for next-generation sequencing, which can correctly split high-quality sequencing data while meeting the sequencing requirements.

一种用于二代测序的标签序列集,所述标签序列集中每个标签序列的长度为6-10,每个标签序列中的GC含量为40-60%,每个标签序列中连续碱基数最大为1-3,每个标签序列的前两个碱基不同时为G。A tag sequence set for next-generation sequencing, the length of each tag sequence in the tag sequence set is 6-10, the GC content in each tag sequence is 40-60%, and the consecutive bases in each tag sequence The maximum number is 1-3, and the first two bases of each tag sequence are not G at the same time.

上述标签序列集中的标签序列,可用于文库构建,随后利用测序平台测序。可以理解的,如将设计的标签序列在反向引物上使用,需要对标签序列进行反向互补。The tag sequences in the above tag sequence set can be used for library construction and then sequenced using a sequencing platform. It can be understood that if the designed tag sequence is used on the reverse primer, reverse complementation of the tag sequence is required.

在其中一个实施例中,所述标签序列的长度为8,每个标签序列中连续碱基数最大为2。将标签序列的长度控制为8,由于片段越长,容错率越大,但是长片段也意味着更长的测序时间和更多的试剂消耗,从实践考虑,将最大长度控制为8是一个较好的长度;同时控制其连续碱基数最大为2,可以在最大限度的保证碱基含量均衡的同时减少因为测序产生的碱基误读。In one embodiment, the length of the tag sequence is 8, and the maximum number of consecutive bases in each tag sequence is 2. Control the length of the tag sequence to 8, because the longer the fragment, the greater the error tolerance rate, but the long fragment also means longer sequencing time and more reagent consumption. From a practical point of view, it is more difficult to control the maximum length to 8. Good length; at the same time, the number of consecutive bases is controlled to a maximum of 2, which can maximize the balance of base content and reduce base misreading caused by sequencing.

在其中一个实施例中,所述标签序列的GC含量为50%。GC含量是在4种碱基中,鸟嘌呤(G)和胞嘧啶(C)所占的比率。In one embodiment, the GC content of the tag sequence is 50%. The GC content is the ratio of guanine (G) and cytosine (C) among the four bases.

在其中一个实施例中,将所述标签序列集中的每个标签序列进行两两对比,对于相同位置的碱基,相同的记为0分,不同的记为1分,计算总分,所述每个标签序列的总分≥4。上述以积分限定标签序列的目的是为了增大两个标签之间的汉明距离(Hammingdistance),可以更好得区分两条序列。例如,当两条序列分别为:TGGTTACC、CAACCAGA,只有第6个碱基一样,同为“A”,所以标签总分为7,而再如两条序列:CGATCACA、CAATCAGA,标签总分为2,可见前一组的差异大于后一组,能够更好的区分序列,由于错配而产生识别的错误大大降低。In one of the embodiments, each tag sequence in the tag sequence set is pairwise compared, and for the bases at the same position, the same is recorded as 0 points, and the different ones are recorded as 1 point, and the total score is calculated. Total score ≥4 for each tag sequence. The purpose of defining the tag sequence by the integral above is to increase the Hamming distance (Hammingdistance) between the two tags, so that the two sequences can be better distinguished. For example, when the two sequences are: TGGTTACC, CAACCAGA, only the 6th base is the same as "A", so the total score of the label is 7, and another example is two sequences: CGATCACA, CAATCAGA, the total score of the label is 2 , it can be seen that the difference of the former group is greater than that of the latter group, which can better distinguish the sequences, and the recognition errors due to mismatches are greatly reduced.

在其中一个实施例中,所述每个标签序列的总分为4。具有如下有点优点:1,最大限度的保证标签序列的差异性;2,能够得到足够用于生产的标签序列。In one of the embodiments, the total score of each tag sequence is 4. The method has the following advantages: 1. To ensure the maximum diversity of tag sequences; 2. To obtain enough tag sequences for production.

在其中一个实施例中,所述标签序列选自如SEQ ID No.1-SEQ ID No.142所示的核苷酸序列。上述标签序列能够在符合测序要求同时能够正确拆分出高质量的测序数据。具体如下表所示:In one embodiment, the tag sequence is selected from the nucleotide sequences shown in SEQ ID No.1-SEQ ID No.142. The above-mentioned tag sequence can correctly split high-quality sequencing data while meeting the sequencing requirements. The details are shown in the following table:

表1.标签序列Table 1. Tag sequence

在其中一个实施例中,所述测序标签在相同位置的碱基含量比为:A=20-30%,B=20-30%,C=20-30%,D=20-30%。将相同位置的碱基含量控制在上述范围,具有如下优点:既能保证碱基平衡,避免同一位置有些信号过高产生的光衍;又可避免同一位置有些信号过低而漏读。In one embodiment, the base content ratios of the sequencing tags at the same position are: A=20-30%, B=20-30%, C=20-30%, D=20-30%. Controlling the base content at the same position within the above range has the following advantages: it can not only ensure the base balance, avoid the light diffraction caused by some signals at the same position that are too high, but also avoid some signals at the same position that are too low and miss reading.

本发明还公开了上述的用于二代测序的标签序列集在病原微生物的DNA和/或RNAPCR产物测序中的应用。The present invention also discloses the application of the above-mentioned tag sequence set for next-generation sequencing in the sequencing of DNA and/or RNA PCR products of pathogenic microorganisms.

本发明还公开了上述的用于二代测序的标签序列集的设计方法,包括以下步骤:The present invention also discloses the above-mentioned method for designing a tag sequence set for next-generation sequencing, including the following steps:

1)列出预定标签长度的所有碱基组合的序列;1) List the sequences of all base combinations of the predetermined tag length;

2)挑选其中符合预定GC含量的序列;2) Selecting sequences that meet the predetermined GC content;

3)筛选上述序列中连续碱基数最大为2的序列;3) Screen the sequence with the maximum number of consecutive bases in the above sequence being 2;

4)筛选上述序列中出前两个碱基不同时为G的序列;4) Screen out the sequences whose first two bases are different from G in the above sequence;

5)将上述序列进行两两对比,对于相同位置的碱基,相同的记为0分,不同的记为1分,计算总分,挑选总分≥4的序列;5) Compare the above sequences pairwise. For the bases at the same position, the same base is scored as 0 points, and the different bases are scored as 1 point. The total score is calculated, and the sequence with a total score ≥ 4 is selected;

6)根据所需标签序列数量,在上述序列中挑选序列,使所得测序标签在相同位置的碱基含量比为:A=20-30%,B=20-30%,C=20-30%,D=20-30%。6) According to the number of required tag sequences, select sequences from the above sequences, so that the base content ratio of the obtained sequencing tags at the same position is: A=20-30%, B=20-30%, C=20-30% , D = 20-30%.

与现有技术相比,本发明具有以下有益效果:Compared with the prior art, the present invention has the following beneficial effects:

本发明的一种用于二代测序的标签序列集,通过对标签序列的筛选和组合,使最终获得标签序列集数量较高,可实现大样本量检测的要求,并能够在符合测序要求同时能够正确拆分出高质量的测序数据。A tag sequence set for next-generation sequencing of the present invention, through the screening and combination of tag sequences, the number of tag sequence sets finally obtained is relatively high, which can meet the requirements of large sample size detection, and can meet the sequencing requirements at the same time It can correctly split high-quality sequencing data.

具体实施方式Detailed ways

为了便于理解本发明,下面将参照实施例对本发明进行更全面的描述。但是,本发明可以以许多不同的形式来实现,并不限于本文所描述的实施例。相反地,提供这些实施例的目的是使对本发明的公开内容的理解更加透彻全面。In order to facilitate understanding of the present invention, the present invention will be described more fully below with reference to Examples. However, the present invention can be embodied in many different forms and is not limited to the embodiments described herein. On the contrary, these embodiments are provided to make the understanding of the disclosure of the present invention more thorough and comprehensive.

除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。本文所使用的术语“和/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field of the invention. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.

实施例1Example 1

采用本发明的标签序列集进行二代测序,步骤如下:Using the tag sequence set of the present invention to carry out next-generation sequencing, the steps are as follows:

1、合成接头引物。1. Synthesize adapter primers.

按照下表合成接头引物,并稀释成10μM。Adapter primers were synthesized according to the table below and diluted to 10 μM.

表2.引物序列表Table 2. List of primer sequences

注:上表序列中下划线部分表示标签序列,其中SEQ ID No.146-148采用的是反向互补序列。Note: The underlined part in the sequence in the above table indicates the tag sequence, in which SEQ ID No.146-148 uses the reverse complementary sequence.

2、样本提取。2. Sample extraction.

按照Qiagen QIAamp Viral RNA Mini Kit(货号:52906)方法,提取不同样本类型的RNA,用Qubit 3.0荧光定量仪准确定量RNA样本浓度。结果如下表所示。According to the method of Qiagen QIAamp Viral RNA Mini Kit (Cat. No.: 52906), RNA of different sample types was extracted, and the concentration of RNA samples was accurately quantified with a Qubit 3.0 fluorescence quantitator. The results are shown in the table below.

表3.各样本提取和选用indexTable 3. Extraction and selection index of each sample

3、文库构建与上机测序。3. Library construction and on-machine sequencing.

用VAHTS RNA建库试剂盒(NR603)进行RNA文库构建后在Illumina NEXTSEQ550上机检测。所得上机数据结果质控如下表所示。VAHTS RNA library construction kit (NR603) was used for RNA library construction and detection on Illumina NEXTSEQ550. The quality control results of the machine data obtained are shown in the table below.

表4.上机数据结果质控结果表Table 4. Quality control results table of machine data results

样本编号sample number Total readsTotal reads Clean readsClean reads Adapter ratio(%)Adapter ratio(%) Q20(%)Q20(%) Q30(%)Q30(%) M474M474 1572908615729086 1534030915340309 2.472.47 95.7295.72 93.8293.82 M483M483 1629259116292591 1559011915590119 4.314.31 95.5195.51 93.5493.54 M522M522 3345539733455397 3341837333418373 0.110.11 96.7696.76 95.1795.17

从上述结果可以看出,数据的Q30>93%,Adaptor比例小于5%,表明在极低起始量的情况下依然满足建库和上机要求。From the above results, it can be seen that the Q30 of the data is > 93%, and the Adapter ratio is less than 5%, indicating that the requirements for library construction and computer use are still met under the condition of extremely low initial amount.

实施例2(双index)Example 2 (double index)

采用本发明的标签序列集进行二代测序,与实施例1基本相同,不同在于,所采用的标签序列如下表所示。Using the tag sequence set of the present invention to carry out next-generation sequencing is basically the same as in Example 1, except that the tag sequences used are shown in the table below.

表5.引物序列表Table 5. List of primer sequences

注:上表序列中下划线部分表示标签序列,其中SEQ ID No.149-180采用的是反向互补序列。Note: The underlined part in the sequence in the above table indicates the tag sequence, in which SEQ ID No.149-180 uses the reverse complementary sequence.

测序方法:Sequencing method:

1、合成表5中的接头引物,并稀释成10μM.1. Synthesize the linker primers in Table 5 and dilute to 10 μM.

2、样本提取。2. Sample extraction.

按照北京天根生化有限公司微量样本基因组DNA提取试剂盒(DP316)方法,提取脑脊液的DNA,用Qubit 3.0荧光定量仪准确定量RNA样本浓度。结果如下表所示。According to the method of Genomic DNA Extraction Kit (DP316) from Beijing Tiangen Biochemical Co., Ltd., the DNA of cerebrospinal fluid was extracted, and the concentration of RNA samples was accurately quantified by Qubit 3.0 fluorescence quantification instrument. The results are shown in the table below.

表6.各样本提取和选用indexTable 6. Extraction and selection index of each sample

3、文库构建与上机测序。3. Library construction and on-machine sequencing.

用VAHTS建库试剂盒IndexuePrep DNA Library Prep Kit V2 for Illumina(TD503-01)进行DNA文库构建后在Illumina NEXTSEQ550上机测序。所得上机数据结果质控如下表所示。The VAHTS library construction kit IndexuePrep DNA Library Prep Kit V2 for Illumina (TD503-01) was used for DNA library construction and sequenced on Illumina NEXTSEQ550. The quality control results of the machine data obtained are shown in the table below.

表7.质控结果Table 7. Quality control results

样本编号sample number Total readsTotal reads Clean readsClean reads Adapter ratio(%)Adapter ratio(%) Q20(%)Q20(%) Q30(%)Q30(%) TM113TM113 11,452,81611,452,816 11,377,94911,377,949 0.650.65 97.3197.31 96.0896.08 TM114TM114 11,939,76411,939,764 11,838,14611,838,146 0.850.85 97.3797.37 96.1496.14 TM115TM115 15,922,03115,922,031 15,770,61615,770,616 0.950.95 97.397.3 96.0696.06 TM116TM116 10,488,03110,488,031 10,423,28410,423,284 0.620.62 97.4497.44 96.2596.25 TM213TM213 13,888,42213,888,422 13,835,66613,835,666 0.380.38 97.797.7 96.6596.65 TM214TM214 10,528,66210,528,662 10,509,32910,509,329 0.180.18 97.4597.45 96.2996.29 TM215TM215 9,891,3609,891,360 9,875,6859,875,685 0.160.16 96.4496.44 94.5894.58 TM216TM216 12,910,38612,910,386 12,845,90612,845,906 0.50.5 97.5397.53 96.4196.41 TM313TM313 12,297,89912,297,899 12,221,93112,221,931 0.620.62 96.7296.72 95.1395.13 TM314TM314 11,923,85411,923,854 11,858,06411,858,064 0.550.55 97.5597.55 96.4396.43 TM315TM315 11,674,47911,674,479 11,625,81611,625,816 0.420.42 97.2297.22 95.9195.91 TM316TM316 11,751,19011,751,190 11,692,21511,692,215 0.50.5 97.5497.54 96.3996.39 TM413TM413 11,591,67711,591,677 11,528,76911,528,769 0.540.54 96.0196.01 94.0494.04 TM414TM414 12,954,70412,954,704 12,903,73012,903,730 0.390.39 97.6597.65 96.5996.59 TM415TM415 9,792,7479,792,747 9,765,4689,765,468 0.280.28 96.4196.41 94.6194.61 TM416TM416 10,679,32110,679,321 10,638,16910,638,169 0.390.39 97.2997.29 96.0396.03 TM513TM513 12,730,90212,730,902 12,668,13212,668,132 0.490.49 97.497.4 96.1796.17 TM514TM514 10,705,55310,705,553 10,671,83910,671,839 0.310.31 97.6697.66 96.5996.59 TM515TM515 11,601,83211,601,832 11,544,59411,544,594 0.490.49 97.3697.36 96.1496.14 TM516TM516 11,656,42011,656,420 11,604,49311,604,493 0.450.45 97.797.7 96.6596.65 TM613TM613 11,945,47811,945,478 11,895,75011,895,750 0.420.42 97.5497.54 96.496.4 TM614TM614 12,593,74912,593,749 12,538,15012,538,150 0.440.44 97.2697.26 95.9595.95 TM615TM615 12,368,34412,368,344 12,179,40412,179,404 1.531.53 97.6697.66 96.5796.57 TM616TM616 10,984,52110,984,521 10,910,52810,910,528 0.670.67 97.5597.55 96.4296.42 TM713TM713 14,895,00214,895,002 14,827,59614,827,596 0.450.45 97.3397.33 96.0596.05 TM714TM714 11,387,09911,387,099 11,332,41811,332,418 0.480.48 97.3997.39 96.1996.19 TM715TM715 11,193,70011,193,700 11,115,50211,115,502 0.70.7 97.4597.45 96.2496.24 TM716TM716 10,350,48110,350,481 10,292,85910,292,859 0.560.56 97.3897.38 96.1496.14 TM813TM813 9,882,7669,882,766 9,843,3239,843,323 0.40.4 97.2497.24 95.9495.94 TM814TM814 9,236,6159,236,615 9,202,8069,202,806 0.370.37 97.0997.09 95.7195.71 TM815TM815 10,289,60610,289,606 10,243,41710,243,417 0.450.45 97.0497.04 95.6495.64 TM816TM816 12,072,23612,072,236 11,928,80011,928,800 1.191.19 97.3897.38 96.1696.16

表8.数据拆分大小Table 8. Data split size

从上述结果可以看出,数据的Q30>95%,Adaptor比例小于2%,i5和i7拆出的数据均衡,表明所设计的接头序列能够满足大规模的建库的要求。From the above results, it can be seen that the Q30 of the data is >95%, the Adaptor ratio is less than 2%, and the data extracted from i5 and i7 are balanced, indicating that the designed linker sequence can meet the requirements of large-scale library construction.

综上所述,本发明的标签序列集能够克服现有标签数量少,难以实现大样本量同时检测,同时容错率低,容易导致数据拆分错误,产生假阳性或者假阴性的问题。并且部分标签序列存在连续序列过长,测序过程难以判读,以及GC碱基平衡性差,易导致数据质量差等问题。同时通过设计合适的汉明距离,降低了机器读出碱基的错误率。既具有标签序列集数量大,可实现大样本量检测的要求,又能够在符合测序要求同时能够正确拆分出高质量的测序数据的优点。To sum up, the label sequence set of the present invention can overcome the problems of the small number of existing labels, the difficulty in realizing the simultaneous detection of a large sample size, and the low error tolerance rate, which easily leads to data splitting errors, resulting in false positives or false negatives. In addition, some tag sequences have too long continuous sequences, which are difficult to interpret during the sequencing process, and the GC base balance is poor, which easily leads to poor data quality and other problems. At the same time, by designing an appropriate Hamming distance, the error rate of the machine reading bases is reduced. It not only has the advantages of a large number of tag sequence sets, which can meet the requirements of large sample size detection, but also can correctly split high-quality sequencing data while meeting the sequencing requirements.

以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above-mentioned embodiments can be combined arbitrarily. To make the description concise, all possible combinations of the technical features in the above-mentioned embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, should be considered as within the scope of this specification.

以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation modes of the present invention, and the descriptions thereof are relatively specific and detailed, but should not be construed as limiting the patent scope of the invention. It should be noted that, for those skilled in the art, several modifications and improvements can be made without departing from the concept of the present invention, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the patent for the present invention should be based on the appended claims.

序列表sequence listing

<110> 广州微远基因科技有限公司<110> Guangzhou Weiyuan Gene Technology Co., Ltd.

<120> 用于二代测序的标签序列集及其设计方法和应用<120> Tag sequence set for next generation sequencing and its design method and application

<160> 212<160> 212

<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0

<210> 1<210> 1

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 1<400> 1

aatccacc 8aatccacc 8

<210> 2<210> 2

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 2<400> 2

taagcacg 8taagcacg 8

<210> 3<210> 3

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 3<400> 3

caaccaga 8caaccaga 8

<210> 4<210> 4

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 4<400> 4

agcatgag 8agcatgag 8

<210> 5<210> 5

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 5<400> 5

gaacctag 8gaacctag 8

<210> 6<210> 6

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 6<400> 6

aatcgtgg 8aatcgtgg 8

<210> 7<210> 7

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 7<400> 7

taaggtgc 8taaggtgc 8

<210> 8<210> 8

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 8<400> 8

aggtccta 8aggtccta 8

<210> 9<210> 9

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 9<400> 9

caacgtct 8caacgtct 8

<210> 10<210> 10

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 10<400> 10

gaacgatc 8gaacgatc 8

<210> 11<210> 11

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 11<400> 11

aacacagg 8aacacagg 8

<210> 12<210> 12

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 12<400> 12

taccaagc 8taccaagc 8

<210> 13<210> 13

<211> 8<211> 8

<212> DNA<212> DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 13<400> 13

aggtggat 8aggtggat 8

<210> 14<210> 14

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 14<400> 14

catactgc 8catactgc 8

<210> 15<210> 15

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 15<400> 15

gatgcagt 8gatgcagt8

<210> 16<210> 16

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 16<400> 16

aacagtcc 8aacagtcc 8

<210> 17<210> 17

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 17<400> 17

taccttcg 8taccttcg 8

<210> 18<210> 18

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 18<400> 18

catagacg 8catagacg 8

<210> 19<210> 19

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 19<400> 19

gatggtca 8gatggtca 8

<210> 20<210> 20

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 20<400> 20

aactccac 8aactccac 8

<210> 21<210> 21

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 21<400> 21

tacgacag 8tacgacag 8

<210> 22<210> 22

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 22<400> 22

cattcctg 8cattcctg 8

<210> 23<210> 23

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 23<400> 23

gagtagga 8gagtagga 8

<210> 24<210> 24

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 24<400> 24

aactggtg 8aactggtg 8

<210> 25<210> 25

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 25<400> 25

tacgtgtc 8tacgtgtc 8

<210> 26<210> 26

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 26<400> 26

cattggac 8cattggac 8

<210> 27<210> 27

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 27<400> 27

gagttcct 8gagttcct8

<210> 28<210> 28

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 28<400> 28

aaccacca 8aaccacca 8

<210> 29<210> 29

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 29<400> 29

tagcagtg 8tagcagtg8

<210> 30<210> 30

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 30<400> 30

catgagct 8catgagct 8

<210> 31<210> 31

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 31<400> 31

gtatgctg 8gtatgctg8

<210> 32<210> 32

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 32<400> 32

aacctggt 8aacctggt 8

<210> 33<210> 33

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 33<400> 33

tagctcac 8tagctcac 8

<210> 34<210> 34

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 34<400> 34

catgtcga 8catgtcga 8

<210> 35<210> 35

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 35<400> 35

gtagaacc 8gtagaacc 8

<210> 36<210> 36

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 36<400> 36

aagacgtc 8aagacgtc 8

<210> 37<210> 37

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 37<400> 37

ttcacacc 8ttcacacc 8

<210> 38<210> 38

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 38<400> 38

cacacgaa 8cacacgaa 8

<210> 39<210> 39

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 39<400> 39

gtagttgg 8gtagttgg 8

<210> 40<210> 40

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 40<400> 40

aagagcag 8aagagcag 8

<210> 41<210> 41

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 41<400> 41

ttcagtgg 8ttcagtgg8

<210> 42<210> 42

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 42<400> 42

cacagctt 8cacagctt 8

<210> 43<210> 43

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 43<400> 43

gttcctct 8gttcctct 8

<210> 44<210> 44

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 44<400> 44

aagtctcg 8aagtctcg 8

<210> 45<210> 45

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 45<400> 45

ttcgaggt 8ttcgaggt 8

<210> 46<210> 46

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 46<400> 46

cactatgg 8cactatgg 8

<210> 47<210> 47

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 47<400> 47

gttcgaga 8gttcgaga 8

<210> 48<210> 48

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 48<400> 48

aagtgagc 8aagtgagc 8

<210> 49<210> 49

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 49<400> 49

ttcgtcca 8ttcgtcca 8

<210> 50<210> 50

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 50<400> 50

cactcact 8cactcact 8

<210> 51<210> 51

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 51<400> 51

gtcaagtg 8gtcaagtg 8

<210> 52<210> 52

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 52<400> 52

aaggacgt 8aaggacgt 8

<210> 53<210> 53

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 53<400> 53

ttgacgag 8ttgacgag 8

<210> 54<210> 54

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 54<400> 54

caggaatc 8caggaatc 8

<210> 55<210> 55

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 55<400> 55

gtcatcac 8gtcatcac 8

<210> 56<210> 56

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 56<400> 56

aaggtgca 8aaggtgca 8

<210> 57<210> 57

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 57<400> 57

ttgagctc 8ttgagctc 8

<210> 58<210> 58

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 58<400> 58

caggttag 8caggttag 8

<210> 59<210> 59

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 59<400> 59

gtctcgat 8gtctcgat 8

<210> 60<210> 60

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 60<400> 60

atacctgc 8atacctgc 8

<210> 61<210> 61

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 61<400> 61

ttgcatcc 8ttgcatcc 8

<210> 62<210> 62

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 62<400> 62

ctaagagc 8ctaagagc 8

<210> 63<210> 63

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 63<400> 63

gtgttgtc 8gtgttgtc 8

<210> 64<210> 64

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 64<400> 64

atacgacg 8atacgacg 8

<210> 65<210> 65

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 65<400> 65

ttgctagg 8ttgctagg 8

<210> 66<210> 66

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 66<400> 66

ctacacgt 8ctacacgt 8

<210> 67<210> 67

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 67<400> 67

gtgcacaa 8gtgcacaa 8

<210> 68<210> 68

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 68<400> 68

atccagac 8atccagac 8

<210> 69<210> 69

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 69<400> 69

tcatcagc 8tcatcagc 8

<210> 70<210> 70

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 70<400> 70

ctactgca 8ctactgca 8

<210> 71<210> 71

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 71<400> 71

gcaatacg 8gcaatacg 8

<210> 72<210> 72

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 72<400> 72

atcctctg 8atcctctg 8

<210> 73<210> 73

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 73<400> 73

tcatgtcg 8tcatgtcg8

<210> 74<210> 74

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 74<400> 74

ctagagag 8ctagagag 8

<210> 75<210> 75

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 75<400> 75

gcaactgt 8gcaactgt8

<210> 76<210> 76

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 76<400> 76

atcgatcg 8atcgatcg 8

<210> 77<210> 77

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 77<400> 77

tcacactc 8tcacactc 8

<210> 78<210> 78

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 78<400> 78

ctagtctc 8ctagtctc 8

<210> 79<210> 79

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 79<400> 79

gcagacat 8gcagacat 8

<210> 80<210> 80

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 80<400> 80

atcgtagc 8atcgtagc 8

<210> 81<210> 81

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 81<400> 81

tcactgag 8tcactgag 8

<210> 82<210> 82

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 82<400> 82

cttccaag 8cttccaag 8

<210> 83<210> 83

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 83<400> 83

gcagtgta 8gcagtgta 8

<210> 84<210> 84

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 84<400> 84

acaacctg 8acaacctg 8

<210> 85<210> 85

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 85<400> 85

tctaccac 8tctaccac 8

<210> 86<210> 86

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 86<400> 86

cttcgttc 8cttcgttc 8

<210> 87<210> 87

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 87<400> 87

gctaatcc 8gctaatcc 8

<210> 88<210> 88

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 88<400> 88

acaaggac 8acaaggac 8

<210> 89<210> 89

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 89<400> 89

tctaggtg 8tctaggtg8

<210> 90<210> 90

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 90<400> 90

ctctgtca 8ctctgtca 8

<210> 91<210> 91

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 91<400> 91

acacagct 8acacagct 8

<210> 92<210> 92

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 92<400> 92

tctcaacg 8tctcaacg 8

<210> 93<210> 93

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 93<400> 93

ctgatggt 8ctgatggt 8

<210> 94<210> 94

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 94<400> 94

acactcga 8acactcga 8

<210> 95<210> 95

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 95<400> 95

tctcttgc 8tctcttgc 8

<210> 96<210> 96

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 96<400> 96

ctgtaacg 8ctgtaacg 8

<210> 97<210> 97

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 97<400> 97

acagaagg 8acagaagg 8

<210> 98<210> 98

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 98<400> 98

tcctcctt 8tcctcctt 8

<210> 99<210> 99

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 99<400> 99

ctgtctac 8ctgtctac 8

<210> 100<210> 100

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 100<400> 100

acagttcc 8acagttcc 8

<210> 101<210> 101

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 101<400> 101

tcctggaa 8tcctggaa 8

<210> 102<210> 102

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 102<400> 102

ctggatga 8ctggatga 8

<210> 103<210> 103

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 103<400> 103

acttcgag 8acttcgag 8

<210> 104<210> 104

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 104<400> 104

tcgagagt 8tcgagagt 8

<210> 105<210> 105

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 105<400> 105

ccatccaa 8ccatccaa 8

<210> 106<210> 106

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 106<400> 106

acttgctc 8acttgctc 8

<210> 107<210> 107

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 107<400> 107

tcgtacca 8tcgtacca 8

<210> 108<210> 108

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 108<400> 108

ccatggtt 8ccatggtt 8

<210> 109<210> 109

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 109<400> 109

actgctga 8actgctga 8

<210> 110<210> 110

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 110<400> 110

tgatcgtg 8tgatcgtg8

<210> 111<210> 111

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 111<400> 111

cctcagaa 8cctcagaa 8

<210> 112<210> 112

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 112<400> 112

actggact 8actggact 8

<210> 113<210> 113

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 113<400> 113

tgatgcac 8tgatgcac 8

<210> 114<210> 114

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 114<400> 114

cctctctt 8cctctctt 8

<210> 115<210> 115

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 115<400> 115

accaagga 8accaagga 8

<210> 116<210> 116

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 116<400> 116

tgacagga 8tgacagga 8

<210> 117<210> 117

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 117<400> 117

cctgattg 8cctgattg 8

<210> 118<210> 118

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 118<400> 118

accatcct 8accatcct 8

<210> 119<210> 119

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 119<400> 119

tgactcct 8tgactcct8

<210> 120<210> 120

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 120<400> 120

cctgtaac 8cctgtaac 8

<210> 121<210> 121

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 121<400> 121

acctaacc 8acctaacc 8

<210> 122<210> 122

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 122<400> 122

tgtactcg 8tgtactcg 8

<210> 123<210> 123

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 123<400> 123

cgatatcc 8cgatatcc 8

<210> 124<210> 124

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 124<400> 124

acctgtgt 8acctgtgt8

<210> 125<210> 125

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 125<400> 125

tgttgagg 8tgttgagg 8

<210> 126<210> 126

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 126<400> 126

cgagtagt 8cgagt 8

<210> 127<210> 127

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 127<400> 127

acgattgg 8acgattgg 8

<210> 128<210> 128

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 128<400> 128

tgtgagac 8tgtgagac 8

<210> 129<210> 129

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 129<400> 129

cgtaacca 8cgtaacca 8

<210> 130<210> 130

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 130<400> 130

acgacaca 8acgacaca 8

<210> 131<210> 131

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 131<400> 131

tgtgtctg 8tgtgtctg8

<210> 132<210> 132

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 132<400> 132

cgtacgtt 8cgtacgtt 8

<210> 133<210> 133

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 133<400> 133

agacacag 8agacacag 8

<210> 134<210> 134

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 134<400> 134

tgctctga 8tgctctga 8

<210> 135<210> 135

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 135<400> 135

cgtggata 8cgtggata 8

<210> 136<210> 136

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 136<400> 136

agactgtc 8agactgtc 8

<210> 137<210> 137

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 137<400> 137

tggaagct 8tggaagct 8

<210> 138<210> 138

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 138<400> 138

agagcaac 8agagcaac 8

<210> 139<210> 139

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 139<400> 139

tggatcga 8tggatcga 8

<210> 140<210> 140

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 140<400> 140

agaggttg 8agaggttg 8

<210> 141<210> 141

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 141<400> 141

tggttacc 8tggttacc 8

<210> 142<210> 142

<211> 8<211> 8

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 142<400> 142

agcaactc 8agcaactc 8

<210> 143<210> 143

<211> 70<211> 70

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 143<400> 143

aatgatacgg cgaccaccga gatctacact gtactcgaca ctctttccct acacgacgct 60aatgatacgg cgaccaccga gatctacact gtactcgaca ctctttccct acacgacgct 60

cttccgatct 70cttccgatct70

<210> 144<210> 144

<211> 70<211> 70

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 144<400> 144

aatgatacgg cgaccaccga gatctacacc attcctgaca ctctttccct acacgacgct 60aatgatacgg cgaccaccga gatctacacc attcctgaca ctctttccct acacgacgct 60

cttccgatct 70cttccgatct70

<210> 145<210> 145

<211> 70<211> 70

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 145<400> 145

aatgatacgg cgaccaccga gatctacacc aggaatcaca ctctttccct acacgacgct 60aatgatacgg cgaccaccga gatctacacc aggaatcaca ctctttccct acacgacgct 60

cttccgatct 70cttccgatct70

<210> 146<210> 146

<211> 66<211> 66

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 146<400> 146

caagcagaag acggcatacg agatcgagac ttgtgactgg agttcagacg tgtgctcttc 60caagcagaag acggcatacg agatcgagac ttgtgactgg agttcagacg tgtgctcttc 60

cgatct 66cgatct 66

<210> 147<210> 147

<211> 66<211> 66

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 147<400> 147

caagcagaag acggcatacg agattccttg gtgtgactgg agttcagacg tgtgctcttc 60caagcagaag acggcatacg agattccttg gtgtgactgg agttcagacg tgtgctcttc 60

cgatct 66cgatct 66

<210> 148<210> 148

<211> 66<211> 66

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 148<400> 148

caagcagaag acggcatacg agatactctc gagtgactgg agttcagacg tgtgctcttc 60caagcagaag acggcatacg agatactctc gagtgactgg agttcagacg tgtgctcttc 60

cgatct 66cgatct 66

<210> 149<210> 149

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 149<400> 149

caagcagaag acggcatacg agatgtctgg atgtctcgtg ggctcgg 47caagcagaag acggcatacg agatgtctgg atgtctcgtg ggctcgg 47

<210> 150<210> 150

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 150<400> 150

caagcagaag acggcatacg agatgctgat gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatgctgat gagtctcgtg ggctcgg 47

<210> 151<210> 151

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 151<400> 151

caagcagaag acggcatacg agattgcagt aggtctcgtg ggctcgg 47caagcagaag acggcatacg agattgcagt aggtctcgtg ggctcgg 47

<210> 152<210> 152

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 152<400> 152

caagcagaag acggcatacg agatcgtatt gcgtctcgtg ggctcgg 47caagcagaag acggcatacg agatcgtatt gcgtctcgtg ggctcgg 47

<210> 153<210> 153

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 153<400> 153

caagcagaag acggcatacg agatcagagg atgtctcgtg ggctcgg 47caagcagaag acggcatacg agatcagagg atgtctcgtg ggctcgg 47

<210> 154<210> 154

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 154<400> 154

caagcagaag acggcatacg agatcgacat gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatcgacat gagtctcgtg ggctcgg 47

<210> 155<210> 155

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 155<400> 155

caagcagaag acggcatacg agatctctct aggtctcgtg ggctcgg 47caagcagaag acggcatacg agatctctct aggtctcgtg ggctcgg 47

<210> 156<210> 156

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 156<400> 156

caagcagaag acggcatacg agatacagtt gcgtctcgtg ggctcgg 47caagcagaag acggcatacg agatacagtt gcgtctcgtg ggctcgg 47

<210> 157<210> 157

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 157<400> 157

caagcagaag acggcatacg agatgagtgt gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatgagtgt gagtctcgtg ggctcgg 47

<210> 158<210> 158

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 158<400> 158

caagcagaag acggcatacg agatgctacg atgtctcgtg ggctcgg 47caagcagaag acggcatacg agatgctacg atgtctcgtg ggctcgg 47

<210> 159<210> 159

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 159<400> 159

caagcagaag acggcatacg agatctcagt gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatctcagt gagtctcgtg ggctcgg 47

<210> 160<210> 160

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 160<400> 160

caagcagaag acggcatacg agatcttgga aggtctcgtg ggctcgg 47caagcagaag acggcatacg agatcttgga aggtctcgtg ggctcgg 47

<210> 161<210> 161

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 161<400> 161

caagcagaag acggcatacg agattacact gcgtctcgtg ggctcgg 47caagcagaag acggcatacg agattaacact gcgtctcgtg ggctcgg 47

<210> 162<210> 162

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 162<400> 162

caagcagaag acggcatacg agatcaggtt gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agatcaggtt gtgtctcgtg ggctcgg 47

<210> 163<210> 163

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 163<400> 163

caagcagaag acggcatacg agatgaacga aggtctcgtg ggctcgg 47caagcagaag acggcatacg agatgaacga aggtctcgtg ggctcgg 47

<210> 164<210> 164

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 164<400> 164

caagcagaag acggcatacg agatggatta gcgtctcgtg ggctcgg 47caagcagaag acggcatacg agatggatta gcgtctcgtg ggctcgg 47

<210> 165<210> 165

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 165<400> 165

caagcagaag acggcatacg agatgtcctt gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agatgtcctt gtgtctcgtg ggctcgg 47

<210> 166<210> 166

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 166<400> 166

caagcagaag acggcatacg agatcaccta gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatcaccta gagtctcgtg ggctcgg 47

<210> 167<210> 167

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 167<400> 167

caagcagaag acggcatacg agattgacag aggtctcgtg ggctcgg 47caagcagaag acggcatacg agattgacag aggtctcgtg ggctcgg 47

<210> 168<210> 168

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 168<400> 168

caagcagaag acggcatacg agatagctgt gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agatagctgt gtgtctcgtg ggctcgg 47

<210> 169<210> 169

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 169<400> 169

caagcagaag acggcatacg agatcgttga gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatcgttga gagtctcgtg ggctcgg 47

<210> 170<210> 170

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 170<400> 170

caagcagaag acggcatacg agattcgagt gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agattcgagt gtgtctcgtg ggctcgg 47

<210> 171<210> 171

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 171<400> 171

caagcagaag acggcatacg agatgcaaga gagtctcgtg ggctcgg 47caagcagaag acggcatacg agatgcaaga gagtctcgtg ggctcgg 47

<210> 172<210> 172

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 172<400> 172

caagcagaag acggcatacg agatcgttac aggtctcgtg ggctcgg 47caagcagaag acggcatacg agatcgttac aggtctcgtg ggctcgg 47

<210> 173<210> 173

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 173<400> 173

caagcagaag acggcatacg agataaggag gagtctcgtg ggctcgg 47caagcagaag acggcatacg agataaggag gagtctcgtg ggctcgg 47

<210> 174<210> 174

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 174<400> 174

caagcagaag acggcatacg agatggaact gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agatggaact gtgtctcgtg ggctcgg 47

<210> 175<210> 175

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 175<400> 175

caagcagaag acggcatacg agatttccag gagtctcgtg ggctcgg 47caagcagaag acggcatacg agattccag gagtctcgtg ggctcgg 47

<210> 176<210> 176

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 176<400> 176

caagcagaag acggcatacg agattcatcc aggtctcgtg ggctcgg 47caagcagaag acggcatacg agattcatcc aggtctcgtg ggctcgg 47

<210> 177<210> 177

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 177<400> 177

caagcagaag acggcatacg agatctcgaa gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agatctcgaa gtgtctcgtg ggctcgg 47

<210> 178<210> 178

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 178<400> 178

caagcagaag acggcatacg agatgagcaa gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agatgagcaa gtgtctcgtg ggctcgg 47

<210> 179<210> 179

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 179<400> 179

caagcagaag acggcatacg agataaccat gggtctcgtg ggctcgg 47caagcagaag acggcatacg agataaccat gggtctcgtg ggctcgg 47

<210> 180<210> 180

<211> 47<211> 47

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 180<400> 180

caagcagaag acggcatacg agattcagca gtgtctcgtg ggctcgg 47caagcagaag acggcatacg agattcagca gtgtctcgtg ggctcgg 47

<210> 181<210> 181

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 181<400> 181

aatgatacgg cgaccaccga gatctacaca atccacctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca atccacctcg tcggcagcgt c 51

<210> 182<210> 182

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 182<400> 182

aatgatacgg cgaccaccga gatctacact aagcacgtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact aagcacgtcg tcggcagcgt c 51

<210> 183<210> 183

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 183<400> 183

aatgatacgg cgaccaccga gatctacacc aaccagatcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc aaccagatcg tcggcagcgt c 51

<210> 184<210> 184

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 184<400> 184

aatgatacgg cgaccaccga gatctacaca gcatgagtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca gcatgagtcg tcggcagcgt c 51

<210> 185<210> 185

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 185<400> 185

aatgatacgg cgaccaccga gatctacacg aacctagtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg aacctagtcg tcggcagcgt c 51

<210> 186<210> 186

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 186<400> 186

aatgatacgg cgaccaccga gatctacaca atcgtggtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca atcgtggtcg tcggcagcgt c 51

<210> 187<210> 187

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 187<400> 187

aatgatacgg cgaccaccga gatctacact aaggtgctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact aaggtgctcg tcggcagcgt c 51

<210> 188<210> 188

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 188<400> 188

aatgatacgg cgaccaccga gatctacaca ggtcctatcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca ggtcctatcg tcggcagcgt c 51

<210> 189<210> 189

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 189<400> 189

aatgatacgg cgaccaccga gatctacacc aacgtcttcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc aacgtcttcg tcggcagcgt c 51

<210> 190<210> 190

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 190<400> 190

aatgatacgg cgaccaccga gatctacaca acacaggtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca acacagggtcg tcggcagcgt c 51

<210> 191<210> 191

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 191<400> 191

aatgatacgg cgaccaccga gatctacact accaagctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact accaagctcg tcggcagcgt c 51

<210> 192<210> 192

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 192<400> 192

aatgatacgg cgaccaccga gatctacaca ggtggattcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca ggtggattcg tcggcagcgt c 51

<210> 193<210> 193

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 193<400> 193

aatgatacgg cgaccaccga gatctacacc atactgctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc atactgctcg tcggcagcgt c 51

<210> 194<210> 194

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 194<400> 194

aatgatacgg cgaccaccga gatctacacg atgcagttcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg atgcagttcg tcggcagcgt c 51

<210> 195<210> 195

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 195<400> 195

aatgatacgg cgaccaccga gatctacaca acagtcctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca acagtcctcg tcggcagcgt c 51

<210> 196<210> 196

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 196<400> 196

aatgatacgg cgaccaccga gatctacact accttcgtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact accttcgtcg tcggcagcgt c 51

<210> 197<210> 197

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 197<400> 197

aatgatacgg cgaccaccga gatctacacc atagacgtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc atagacgtcg tcggcagcgt c 51

<210> 198<210> 198

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 198<400> 198

aatgatacgg cgaccaccga gatctacacg atggtcatcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg atggtcatcg tcggcagcgt c 51

<210> 199<210> 199

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 199<400> 199

aatgatacgg cgaccaccga gatctacaca actccactcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca actccactcg tcggcagcgt c 51

<210> 200<210> 200

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 200<400> 200

aatgatacgg cgaccaccga gatctacacg agtaggatcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg agtaggatcg tcggcagcgt c 51

<210> 201<210> 201

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 201<400> 201

aatgatacgg cgaccaccga gatctacaca actggtgtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca actggtgtcg tcggcagcgt c 51

<210> 202<210> 202

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 202<400> 202

aatgatacgg cgaccaccga gatctacacc attggactcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc attggactcg tcggcagcgt c 51

<210> 203<210> 203

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 203<400> 203

aatgatacgg cgaccaccga gatctacacg agttccttcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg agttccttcg tcggcagcgt c 51

<210> 204<210> 204

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 204<400> 204

aatgatacgg cgaccaccga gatctacaca accaccatcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca accaccatcg tcggcagcgt c 51

<210> 205<210> 205

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 205<400> 205

aatgatacgg cgaccaccga gatctacact agcagtgtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact agcagtgtcg tcggcagcgt c 51

<210> 206<210> 206

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 206<400> 206

aatgatacgg cgaccaccga gatctacacc atgagcttcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc atgagcttcg tcggcagcgt c 51

<210> 207<210> 207

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence (Artificial Sequence)

<400> 207<400> 207

aatgatacgg cgaccaccga gatctacacg tatgctgtcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg tatgctgtcg tcggcagcgt c 51

<210> 208<210> 208

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 208<400> 208

aatgatacgg cgaccaccga gatctacaca acctggttcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacaca acctggttcg tcggcagcgt c 51

<210> 209<210> 209

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 209<400> 209

aatgatacgg cgaccaccga gatctacact agctcactcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact agctcactcg tcggcagcgt c 51

<210> 210<210> 210

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 210<400> 210

aatgatacgg cgaccaccga gatctacacg tagaacctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacg tagaacctcg tcggcagcgt c 51

<210> 211<210> 211

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 211<400> 211

aatgatacgg cgaccaccga gatctacact tcacacctcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacact tcacacctcg tcggcagcgt c 51

<210> 212<210> 212

<211> 51<211> 51

<212> DNA<212>DNA

<213> 人工序列(Artificial Sequence)<213> Artificial Sequence

<400> 212<400> 212

aatgatacgg cgaccaccga gatctacacc acacgaatcg tcggcagcgt c 51aatgatacgg cgaccaccga gatctacacc acacgaatcg tcggcagcgt c 51

Claims (1)

1.一种用于二代测序的标签序列集,其特征在于,包括如SEQ ID No.1-SEQ ID No.142所示的核苷酸序列及其反向互补序列。1. A tag sequence set for next-generation sequencing, characterized in that it includes the nucleotide sequences shown in SEQ ID No. 1-SEQ ID No. 142 and their reverse complementary sequences.
CN201910779115.8A 2019-08-22 2019-08-22 Tag sequence set for next generation sequencing and its design method and application Active CN110468188B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910779115.8A CN110468188B (en) 2019-08-22 2019-08-22 Tag sequence set for next generation sequencing and its design method and application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910779115.8A CN110468188B (en) 2019-08-22 2019-08-22 Tag sequence set for next generation sequencing and its design method and application

Publications (2)

Publication Number Publication Date
CN110468188A CN110468188A (en) 2019-11-19
CN110468188B true CN110468188B (en) 2023-08-22

Family

ID=68513393

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910779115.8A Active CN110468188B (en) 2019-08-22 2019-08-22 Tag sequence set for next generation sequencing and its design method and application

Country Status (1)

Country Link
CN (1) CN110468188B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113444769B (en) * 2020-03-28 2023-06-23 深圳人体密码基因科技有限公司 Construction method and application of DNA tag sequence
CN113517026B (en) * 2021-06-16 2022-08-19 苏州拉索生物芯片科技有限公司 Method and system for generating label sequence applied to biological product, intelligent terminal and computer readable storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102115789A (en) * 2010-12-15 2011-07-06 厦门大学 Nucleic acid label for second-generation high-flux sequencing and design method thereof
WO2012037876A1 (en) * 2010-09-21 2012-03-29 深圳华大基因科技有限公司 Dna tag and application thereof
CN102653784A (en) * 2011-03-03 2012-09-05 深圳华大基因科技有限公司 Tag used for multiple nucleic acid sequencing and application method thereof
CN102757956A (en) * 2012-07-18 2012-10-31 云南省烟草农业科学研究院 Tobacco genome molecular marker probe and sequence collective group as well as acquiring method and application thereof
CN105986015A (en) * 2015-02-05 2016-10-05 大连晶泰生物技术有限公司 Method and kit for detecting one or more target sequence of multiple samples based on high-throughput sequencing
CN108018607A (en) * 2016-10-28 2018-05-11 深圳华大基因股份有限公司 A kind of sequence label for lifting microarray dataset library fractionation rate mixes storehouse method and apparatus
WO2019071051A1 (en) * 2017-10-04 2019-04-11 The Broad Institute, Inc. Crispr effector system based diagnostics

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2900935A1 (en) * 2006-05-15 2007-11-16 Centre Nat Rech Scient RECOMBINANT HYPERACTIVE TRANSPOSITION SYSTEMS DERIVED FROM TRANSPOSON MOS-1
WO2012122571A1 (en) * 2011-03-10 2012-09-13 Gen-Probe Incorporated Methods and compositions for the selection and optimization of oligonucleotide tag sequences
WO2013033721A1 (en) * 2011-09-02 2013-03-07 Atreca, Inc. Dna barcodes for multiplexed sequencing
WO2018204423A1 (en) * 2017-05-01 2018-11-08 Illumina, Inc. Optimal index sequences for multiplex massively parallel sequencing

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012037876A1 (en) * 2010-09-21 2012-03-29 深圳华大基因科技有限公司 Dna tag and application thereof
CN102115789A (en) * 2010-12-15 2011-07-06 厦门大学 Nucleic acid label for second-generation high-flux sequencing and design method thereof
CN102653784A (en) * 2011-03-03 2012-09-05 深圳华大基因科技有限公司 Tag used for multiple nucleic acid sequencing and application method thereof
CN102757956A (en) * 2012-07-18 2012-10-31 云南省烟草农业科学研究院 Tobacco genome molecular marker probe and sequence collective group as well as acquiring method and application thereof
CN105986015A (en) * 2015-02-05 2016-10-05 大连晶泰生物技术有限公司 Method and kit for detecting one or more target sequence of multiple samples based on high-throughput sequencing
CN108018607A (en) * 2016-10-28 2018-05-11 深圳华大基因股份有限公司 A kind of sequence label for lifting microarray dataset library fractionation rate mixes storehouse method and apparatus
WO2019071051A1 (en) * 2017-10-04 2019-04-11 The Broad Institute, Inc. Crispr effector system based diagnostics

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Improved Lower Bounds of DNA Tags Based on a Modified Genetic Algorithm;Bin Wang等;《PLoS ONE》;20150218;第10卷(第2期);摘要,第2页第2-3段,第3-4页方法部分,第5-8页结果部分,附件数据 *

Also Published As

Publication number Publication date
CN110468188A (en) 2019-11-19

Similar Documents

Publication Publication Date Title
CN106367485B (en) Double label connector groups of a kind of more positioning for detecting gene mutation and its preparation method and application
CN105525357B (en) The construction method and kit of a kind of sequencing library and application
CN106555226B (en) A kind of method and kit constructing high-throughput sequencing library
CN105238859B (en) A kind of method for obtaining chicken full-length genome high density SNP marker site
CN111748551B (en) Blocking sequence, capture kit, library hybridization capture method and library construction method
CN111808854A (en) Equilibrium linker with molecular barcode and method for rapid construction of transcriptome library
CN102061526A (en) DNA (deoxyribonucleic acid) library and preparation method thereof as well as method and device for detecting single nucleotide polymorphisms (SNPs)
CN108517567B (en) Adapters, primer sets, kits and library construction methods for cfDNA library construction
CN106283202A (en) A kind of supper-fast DNA library based on illumina secondary order-checking platform builds test kit
CN109971827A (en) The banking process of plasma dna and build library kit
WO2021253372A1 (en) High-compatibility pcr-free library building and sequencing method
CN110157785A (en) A kind of unicellular RNA sequencing library construction method
CN110438121A (en) Adapters, adapter libraries and their applications
WO2019062289A1 (en) Probe and method applying the same for enriching target region in high-throughput sequencing
US20170218446A1 (en) Cell characterisation
CN107829146A (en) Primer group for constructing 16SrRNA gene amplicon sequencing library and construction method
CN110468188B (en) Tag sequence set for next generation sequencing and its design method and application
CN107904669A (en) A kind of construction method of unicellular sequencing library that methylates and its application
CN117004756A (en) MNP marker sites, primer compositions and kits for identification of Osmanthus fragrans varieties and their applications
CN105524920B (en) TN5 for Ion Proton microarray datasets builds the primer sets, kit and banking process in library
WO2023202030A1 (en) Method for constructing high-throughput sequencing library of small rna
CN112359093A (en) Method and kit for preparing and expressing and quantifying free miRNA library in blood
WO2017185758A1 (en) Primer, probe, kit, and method for microchimerism assay and individual recognition
CN111667883A (en) A forensic mixed DNA analysis method based on complex microhaplotype pyrosequencing map analysis
CN114574595B (en) Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200616

Address after: 510130 No. 301, building G10, South China new material innovation park, self compiled building 3, No. 31, Kefeng Road, Guangzhou high tech Industrial Development Zone, Guangdong Province

Applicant after: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Applicant after: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Applicant after: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Applicant after: Shenzhen Weiyuan Medical Technology Co.,Ltd.

Address before: 510130 Three South China New Materials Innovation Park G10 Building 303, No. 31 Kefeng Road, Guangzhou High-tech Industrial Development Zone, Guangdong Province

Applicant before: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20201010

Address after: 510130 No. 301, building G10, South China new material innovation park, self compiled building 3, No. 31, Kefeng Road, Guangzhou high tech Industrial Development Zone, Guangdong Province

Applicant after: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Applicant after: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Applicant after: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Applicant after: Shenzhen Weiyuan Medical Technology Co.,Ltd.

Applicant after: Weiyuan (Shenzhen) Medical Research Center Co.,Ltd.

Address before: 510130 No. 301, building G10, South China new material innovation park, self compiled building 3, No. 31, Kefeng Road, Guangzhou high tech Industrial Development Zone, Guangdong Province

Applicant before: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Applicant before: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Applicant before: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Applicant before: Shenzhen Weiyuan Medical Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230823

Address after: Room 301, G10, South China new material innovation park, building 3, No. 31, Kefeng Road, Guangzhou hi tech Industrial Development Zone, Guangdong 510130

Patentee after: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Patentee after: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Patentee after: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Patentee after: Shenzhen Weiyuan Medical Technology Co.,Ltd.

Address before: Room 301, G10, South China new material innovation park, building 3, No. 31, Kefeng Road, Guangzhou hi tech Industrial Development Zone, Guangdong 510130

Patentee before: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Patentee before: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Patentee before: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Patentee before: Shenzhen Weiyuan Medical Technology Co.,Ltd.

Patentee before: Weiyuan (Shenzhen) Medical Research Center Co.,Ltd.

TR01 Transfer of patent right
CP03 Change of name, title or address

Address after: Room 301, G10, South China new material innovation park, building 3, No. 31, Kefeng Road, Guangzhou hi tech Industrial Development Zone, Guangdong 510130

Patentee after: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Country or region after: China

Patentee after: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Patentee after: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Patentee after: Weiyuan Medical Technology (Huzhou) Co.,Ltd.

Address before: Room 301, G10, South China new material innovation park, building 3, No. 31, Kefeng Road, Guangzhou hi tech Industrial Development Zone, Guangdong 510130

Patentee before: Guangzhou Weiyuan Medical Equipment Co.,Ltd.

Country or region before: China

Patentee before: GUANGZHOU VISION GENE TECHNOLOGY Co.,Ltd.

Patentee before: Guangzhou Weiyuan medical laboratory Co.,Ltd.

Patentee before: Shenzhen Weiyuan Medical Technology Co.,Ltd.

CP03 Change of name, title or address