WO2023116373A1 - 一种生成标记的核酸分子群的方法及其试剂盒 - Google Patents
一种生成标记的核酸分子群的方法及其试剂盒 Download PDFInfo
- Publication number
- WO2023116373A1 WO2023116373A1 PCT/CN2022/135363 CN2022135363W WO2023116373A1 WO 2023116373 A1 WO2023116373 A1 WO 2023116373A1 CN 2022135363 W CN2022135363 W CN 2022135363W WO 2023116373 A1 WO2023116373 A1 WO 2023116373A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- primer
- nucleic acid
- oligonucleotide
- region
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1065—Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1096—Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
Definitions
- This application relates to the technical field of transcriptome sequencing and biomolecular spatial information detection. Specifically, the present application relates to a method for positionally marking nucleic acid molecules, and a method for constructing a library of nucleic acid molecules for transcriptome sequencing. In addition, the present application also relates to a library of nucleic acid molecules constructed by the method, and a kit for implementing the method.
- the present application provides a new method for generating labeled nucleic acid molecule groups, and a method for constructing a nucleic acid molecule library and performing high-throughput sequencing based on the method.
- the application provides a method of generating a population of labeled nucleic acid molecules, comprising the steps of:
- nucleic acid array includes a solid support, and the solid support is coupled with multiple oligonucleotide probes; each oligonucleotide probe comprises At least one copy; and, the oligonucleotide probe comprises or consists of: a consensus sequence X1, a tag sequence Y and a consensus sequence X2 from a 5' to a 3' direction, wherein,
- oligonucleotide probes have different tag sequences Y, and the tag sequence Y has a unique nucleotide sequence corresponding to the position of the oligonucleotide probe on the solid support;
- RNA for example, mRNA
- pretreatment comprising the following steps:
- RNA for example, mRNA
- the cDNA chain includes the RNA (for example, mRNA) complementary cDNA sequence, and 3' terminal overhang
- the primer A contains a capture sequence A
- the capture sequence A can anneal with the RNA to be captured (for example, mRNA) and initiate an extension reaction
- primer B annealing primer B to the cDNA strand generated in (a), and performing an extension reaction to generate a first extension product, which is the first nucleic acid molecule to be labeled, thereby generating a first population of nucleic acid molecules
- the primer B comprises a consensus sequence B, a 3' end overhang complementary sequence, and an optional tag sequence B; the 3' end overhang complementary sequence is located at 3 of the primer B ' end; said consensus sequence B is located upstream of said 3' end overhanging complementary sequence (e.g., at the 5' end
- RNA for example, mRNA
- primer A' contains a consensus sequence A and a capture sequence A, and the capture sequence A can anneal to the RNA (eg, mRNA) to be captured and initiate an extension reaction; the consensus sequence A is located upstream of the capture sequence A (for example, at the 5' end of the primer A');
- primer B' combine primer B' with the cDNA generated in (a) The strands are annealed and extended to generate a first extension product; wherein, the primer B' comprises a consensus sequence B, a 3' end overhang complementary sequence, and an optional tag sequence B; the 3' end overhang A complementary sequence is located at the 3' end of the primer B'; the consensus sequence B is located upstream of the complementary sequence overhanging the 3' end (eg, at the
- step (2) (i) implementing annealing conditions to the product of step (2), such that the oligonucleotide probe anneals to the first nucleic acid molecule to be labeled at the corresponding position of the oligonucleotide probe (for example annealing in situ), And carry out an extension reaction to generate an extension product, which is a second nucleic acid molecule with a position marker, thereby generating a second nucleic acid molecule population; wherein, the consensus sequence X2 of the oligonucleotide probe or a partial sequence thereof (a) capable of annealing to the complementary sequence of the consensus sequence B or a partial sequence thereof of the first extension product obtained in step (2)(i), or, (b) capable of annealing to the first extension product obtained in step (2)(ii); Annealing to the complementary sequence of said consensus sequence A or a partial sequence thereof of two extension products; or,
- the bridging oligonucleotide pair is composed of a first bridging oligonucleotide and a second bridging oligonucleotide, and the first bridging oligonucleotide and the second bridging oligonucleotide are each independently comprising: a first region and a second region, and optionally a third region located between the first region and the second region, the first region being located upstream (e.g., the 5' end) of the second region; wherein ,
- the first region of the first bridging oligonucleotide is capable of annealing to the first region of the second bridging oligonucleotide; the second region of the first bridging oligonucleotide is capable of annealing to the oligonucleotide Annealing to the consensus sequence X2 of the acid probe or a partial sequence thereof;
- the second region (a) of the second bridging oligonucleotide can anneal to the complementary sequence of the consensus sequence B of the first extension product obtained in step (2)(i) or a partial sequence thereof, or, (b ) capable of annealing to the complementary sequence of the consensus sequence A or a partial sequence thereof of the second extension product obtained in step (2)(ii);
- the first bridging oligonucleotide and the second bridging oligonucleotide of the bridging oligonucleotide pair each of the bridging oligonucleotides is in single-stranded form, or the first bridging oligonucleotide and the second bridging oligonucleotide of the pair of bridging oligonucleotides are in a partially double-stranded form by annealing to each other;
- ligation reaction ligation of nucleic acid molecules that hybridize to the first and second regions of the same first bridging oligonucleotide, and/or, hybridizing to the first and second regions of the same second bridging oligonucleotide
- the nucleic acid molecules of the region are connected; and an extension reaction is carried out; wherein, the connection reaction and the extension reaction are carried out in any order;
- the obtained reaction product is the second nucleic acid molecule with a position marker, thereby generating the second nucleic acid molecule population.
- step (3)(ii) of the method in step (3)(ii) of the method:
- the step comprises: using nucleic acid ligase to hybridize to the nucleic acid molecules of the first region and the second region of the same first bridging oligonucleotide; or,
- the first bridging oligonucleotide comprises a first region, a second region and a third region therebetween, the first region and the second region that will hybridize to the same first bridging oligonucleotide
- the step of ligating the nucleic acid molecule of the region includes: using a nucleic acid polymerase (for example, without 5' to 3' end exonuclease activity or strand displacement activity) to carry out a polymerization reaction using the third region as a template, and using a nucleic acid ligase to hybridize linked to the nucleic acid molecules of the first region, the third region and the second region of the same first bridging oligonucleotide;
- the step comprising: using nucleic acid ligase to hybridize to the nucleic acid molecules of the first region and the second region of the same second bridging oligonucleotide; or,
- the second bridging oligonucleotide comprises a first region, a second region and a third region therebetween, the first region and the second region that will hybridize to the same second bridging oligonucleotide
- the step of ligating the nucleic acid molecule of the region includes: using a nucleic acid polymerase (for example, without 5' to 3' end exonuclease activity or strand displacement activity) to carry out a polymerization reaction using the third region as a template, and using a nucleic acid ligase to hybridize The nucleic acid molecules at the first region, the third region and the second region of the same second bridging oligonucleotide are linked.
- each oligonucleotide probe comprises one copy.
- each oligonucleotide probe comprises multiple copies.
- the region where each of the oligonucleotide probes is coupled to the solid support is referred to as a microspot. It is easy to understand that when each oligonucleotide probe is one copy, each micro-dot is coupled with a probe, and the oligonucleotide probes of different micro-dots have different label sequences Y; when each oligonucleotide When the nucleotide probe contains multiple copies, each micro-dot is coupled with multiple probes, the oligonucleotide probes in the same micro-dot have the same label sequence Y, and the oligonucleotide probes in different micro-dots have Different label sequences Y.
- the solid support comprises a plurality of microdots, each microdot is coupled to an oligonucleotide probe, and each oligonucleotide probe may comprise one or more copies.
- the solid support comprises a plurality (eg, at least 10, at least 10 2 , at least 10 3 , at least 10 4 , at least 10 5 , at least 10 6 , at least 10 7 , at least 10 8 , or more) microdots; in certain embodiments, the solid support comprises at least 10 4 (eg, at least 10 4 , at least 10 5 , at least 10 6 , at least 10 7 , at least 10 8 , at least 10 9 , at least 10 10 , at least 10 11 , or at least 10 12 ) microdots/square millimeter.
- the solid support comprises at least 10 4 (eg, at least 10 4 , at least 10 5 , at least 10 6 , at least 10 7 , at least 10 8 , at least 10 9 , at least 10 10 , at least 10 11 , or at least 10 12 ) microdots/square millimeter.
- the interval between adjacent microdots is less than 100 ⁇ m, less than 50 ⁇ m, less than 10 ⁇ m, less than 5 ⁇ m, less than 1 ⁇ m, less than 0.5 ⁇ m, less than 0.1 ⁇ m, less than 0.05 ⁇ m, or less than 0.01 ⁇ m.
- the microdots have a size (e.g., equivalent diameter) of less than 100 ⁇ m, less than 50 ⁇ m, less than 10 ⁇ m, less than 5 ⁇ m, less than 1 ⁇ m, less than 0.5 ⁇ m, less than 0.1 ⁇ m, less than 0.05 ⁇ m, or less than 0.01 ⁇ m .
- the method comprises step (1), step (2)(i) and step (3); wherein, in step (2)(i)(b), the primer B contains a consensus sequence B, Complementary sequence of 3' end overhang, and tag sequence B.
- the first extension product described in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the primer A formed by using the primer A as the reverse transcription primer and the RNA complementary cDNA sequence, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B.
- each copy of the second nucleic acid molecule derived from the same oligonucleotide probe has a different tag sequence B as UMI.
- Embodiments comprising step (1), step (2)(i) and step (3)(i)
- the method comprises step (1), step (2)(i) and step (3)(i); wherein, the consensus sequence X2 or a partial sequence thereof can be combined with the consensus sequence B
- the complementary sequence or partial sequence thereof is annealed;
- the extension product obtained in step (3)(i) is a labeled nucleic acid molecule, which comprises: the first strand containing the first nucleic acid molecule sequence to be labeled, and/or , the second strand containing the oligonucleotide probe sequence.
- partial sequence of XX (sequence) or “partial sequence of XX (sequence)" means the nucleotide sequence of at least one segment of "XX (sequence)".
- the entire nucleotide sequence of the consensus sequence X2 can anneal to the complementary sequence of the consensus sequence B or the nucleotide sequence of a partial segment of the complementary sequence of the consensus sequence B, and the consensus sequence X2 It is also possible to anneal with the complementary sequence of the consensus sequence B or the nucleotide sequence of a partial segment of the complementary sequence of the consensus sequence B with the nucleotide sequence of a partial segment thereof.
- annealing means that in the two nucleotide sequences that are annealed to each other, each base in one nucleotide sequence can pair with the base in the other nucleotide sequence without mismatching or a gap; or, in two nucleotide sequences that anneal to each other, most of the bases in one nucleotide sequence can pair with the bases in the other nucleotide sequence, which allows mismatches or gaps ( For example, a mismatch or gap of one or several nucleotides). That is, the two nucleotide sequences that can be annealed can be either completely complementary or partially complementary. Unless otherwise indicated herein or clearly contradicted by the context, the description of "annealing" here applies to the entire text.
- the first strand comprises from the 5' end to the 3' end: a cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, and the overhanging sequence at the 3' end , the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, the tag sequence B, the 3' The complementary sequence of the terminal overhang sequence, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer.
- Embodiment comprising step (1), step (2)(i) and step (3)(i): a chain
- the consensus sequence X2 or a partial sequence thereof can anneal to the complementary sequence of the consensus sequence B or a partial sequence thereof (for example, a 3' end partial sequence), and in step (2)(i) The complementary sequence of said consensus sequence B of the first extension product has a 3' free end.
- the extension product obtained in step (3)(i) is a labeled nucleic acid molecule comprising the first strand.
- the first strand comprises from the 5' end to the 3' end: a cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, and the overhanging sequence at the 3' end , the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- step (3)(i) the oligonucleotide probe cannot initiate an extension reaction (eg, the 3' end is blocked).
- the capture sequence A of the primer A is a random oligonucleotide sequence.
- the first extension product described in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the primer A formed by using the primer A as the reverse transcription primer and the RNA complementary cDNA sequence, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B.
- the first strand comprises from the 5' end to the 3' end: a cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, and the overhanging sequence at the 3' end , the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- the capture sequence A of the primer A is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A further contains a consensus sequence A, and an optional tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the consensus sequence A is located upstream of the capture sequence A (eg, at the 5' end of the primer A).
- the first extension product in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the consensus sequence A, the optional tag sequence A, and the
- the primer A is the cDNA sequence complementary to the RNA formed by the reverse transcription primer, the overhang sequence at the 3' end, the complementary sequence of the tag sequence B, and the complementary sequence of the consensus sequence B.
- the first strand comprises from the 5' end to the 3' end: the consensus sequence A, optionally the tag sequence A, the primer A formed with the primer A as a reverse transcription primer and the The cDNA sequence complementary to the RNA, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1 sequence.
- Embodiment comprising step (1), step (2)(i) and step (3)(i): two chains
- the consensus sequence X2 or a partial sequence thereof can anneal to the complementary sequence of the consensus sequence B or a partial sequence thereof, and the oligonucleotide probe
- the consensus sequence X2 of has a 3' free end.
- the extension product obtained in step (3)(i) is a labeled nucleic acid molecule comprising the second strand.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, the tag sequence B, the 3' The complementary sequence of the terminal overhang sequence, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer.
- the first extension product obtained in step (2)(i) cannot initiate an extension reaction (eg, the 3' end is blocked).
- the capture sequence A of the primer A is a random oligonucleotide sequence.
- the first extension product described in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the primer A formed by using the primer A as the reverse transcription primer and the RNA complementary cDNA sequence, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, the tag sequence B, the 3' The complementary sequence of the terminal overhang sequence, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer.
- the capture sequence A of the primer A is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A further contains a consensus sequence A, and an optional tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the consensus sequence A is located upstream of the capture sequence A (eg, at the 5' end of the primer A).
- the first extension product in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the consensus sequence A, optionally the tag sequence A, The cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, the overhang sequence at the 3' end, the complementary sequence of the tag sequence B, and the complementary sequence of the consensus sequence B.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, the tag sequence B, the 3' The complementary sequence of the terminal overhang sequence, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, the optional complementary sequence of the tag sequence A, the complementary sequence of the consensus sequence A sequence.
- the term "UMI” refers to "Unique Molecular Identifier, a unique molecular label", which can be used to perform qualitative and/or quantitative nucleic acid molecules. Unless otherwise indicated herein or clearly contradicted by the context, the present application does not limit the position and quantity of the UMI or its complementary sequence in the nucleic acid molecule. For example, when the cDNA chain contains the UMI or its complementary sequence, the UMI or its complementary sequence can be located at the 3' end of the cDNA sequence in the cDNA chain, or at the 5' end of the cDNA sequence, or The UMI or its complement is contained at both the 3' end and the 5' end.
- the UMI or its complementary sequence can be located at the 3' end of the complementary sequence of the cDNA sequence in the complementary strand of the cDNA strand, or at the end of the complementary sequence of the cDNA sequence.
- the 5' end may also contain the UMI or its complementary sequence at both the 3' end and the 5' end.
- the UMI can be introduced via primer A, and/or via primer B.
- the UMI can be introduced via primer A', and/or via primer B'.
- step (1) An exemplary embodiment of the present application comprising step (1), step (2)(i) and step (3)(i) is described in detail as follows:
- An exemplary scheme for preparing a cDNA strand containing a complementary sequence of UMI at the 3' end using RNA (such as mRNA) in the sample as a template comprises the following steps (as shown in Figure 2):
- RNA molecules (for example, mRNA molecules) in the permeabilized sample are reverse-transcribed using reverse transcriptase (for example, reverse transcriptase with terminal transfer activity) and primer A to generate cDNA, and 3 An overhang (eg, an overhang comprising 3 cytosine nucleotides) is added to the ' end.
- reverse transcriptase for example, reverse transcriptase with terminal transfer activity
- primer A primer A to generate cDNA
- 3 An overhang eg, an overhang comprising 3 cytosine nucleotides
- 3 reverse transcriptases having terminal transfer activity can be used for the reverse transcription reaction.
- the reverse transcriptase used does not have RNaseH activity.
- the primer A comprises a poly(T) sequence and a consensus sequence A (CA).
- a poly(T) sequence is located at the 3' end of the primer A to initiate reverse transcription.
- the primer A comprises a random oligonucleotide sequence that can be used to capture RNA without a poly(A) tail.
- the random oligonucleotide sequence is located at the 3' end of the primer A to initiate reverse transcription.
- primer B to anneal or hybridize with the cDNA strand, said primer B comprising a consensus sequence B (CB), a unique molecular tag sequence (UMI) and the complementary sequence of the 3' end overhang of the cDNA.
- CB consensus sequence B
- UMI unique molecular tag sequence
- the nucleic acid fragment hybridized or annealed to the primer B can be extended using the UMI sequence and the consensus sequence B as templates, thereby generating a complementary primer carrying the UMI sequence at the 3' end. sequence, the nucleic acid molecule of the complementary sequence of the consensus sequence B.
- the consensus sequence B is located upstream of the UMI sequence (for example, the 5' end), and the sequence complementary to the 3' end overhang of the cDNA strand is located at the 3' end of the primer B.
- the primer B may include GGG at its 3' end.
- the nucleotides of the primer B can also be modified (for example, using locked nucleic acid) to enhance the complementary pairing between the primer B and the 3' end overhang of the cDNA strand.
- nucleic acid polymerases for example, DNA polymerase or reverse transcriptase
- DNA polymerase or reverse transcriptase can be used to carry out the extension reaction, as long as it can use the sequence of the primer B or a partial sequence thereof as a template to extend the captured Nucleic acid fragments (reverse transcription products) are sufficient.
- reverse transcriptase enzyme as in the previous reverse transcription step can be used to extend the captured nucleic acid fragment (reverse transcription product).
- this step is performed simultaneously with step (1) (eg, in the same reaction system).
- the method optionally further comprises step (3): adding RNaseH to digest the RNA strand in the RNA/cDNA hybrid duplex to form a cDNA single strand.
- said method does not comprise said step (3).
- the exemplary structure of the cDNA strand prepared by the above exemplary embodiment comprises: consensus sequence A, cDNA sequence, 3' end overhang sequence, complementary sequence of UMI sequence, and complementary sequence of consensus sequence B.
- the performance scheme includes the following steps (as shown in Figure 4):
- the consensus sequence X2 of the ChIP-seq or a partial sequence thereof can anneal to the complementary sequence of the consensus sequence B of the cDNA strand obtained in the above step 1 or a partial sequence thereof.
- the cDNA strand is annealed or hybridized with ChIP-seq, and under the action of polymerase, a new nucleic acid molecule containing ChIP-seq information (ie, a nucleic acid molecule marked with ChIP-seq) is formed.
- the exemplary structure of the new nucleic acid molecule containing chip sequence information formed by the above exemplary embodiment comprises: a consensus sequence A from the 5' end to the 3' end, a cDNA sequence, an overhang sequence at the 3' end, and the complement of the UMI sequence sequence, the complementary sequence of the consensus sequence B, the complementary sequence of the tag sequence Y, and the nucleic acid strand of the complementary sequence of the consensus sequence X1 and/or its complementary nucleic acid strand.
- Embodiment comprising step (1), step (2)(i) and step (3)(ii)
- the method comprises step (1), step (2)(i) and step (3)(ii); wherein the second region of the second bridging oligonucleotide is capable of combining with step (2) Annealing to the complementary sequence of the consensus sequence B of the first extension product obtained in (i) or a partial sequence thereof;
- the reaction product obtained in step (3)(ii) is a labeled nucleic acid molecule, which comprises: The first strand of the first nucleic acid molecule sequence to be labeled, and/or, the second strand containing the oligonucleotide probe sequence.
- the second region of the second bridging oligonucleotide can be compatible with the complementary sequence of the consensus sequence B or the complementary sequence of the consensus sequence B of the first extension product obtained in step (2)(i).
- the nucleotide sequences of the partial segments are annealed.
- the first strand comprises from the 5' end to the 3' end: a cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, and the overhanging sequence at the 3' end , the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, optionally the complementary sequence of the third region of the second bridging oligonucleotide, the first bridging oligonucleotide sequence, The complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- the complementary sequence of the third region of nucleotides, the second bridging oligonucleotide sequence, the tag sequence B, the complementary sequence of the 3' end overhang sequence, formed by using the primer A as a reverse transcription primer The complement of the cDNA sequence complementary to said RNA.
- Embodiment comprising step (1), step (2)(i) and step (3)(ii): a chain
- the second region of the second bridging oligonucleotide can be complementary to the consensus sequence B of the first extension product obtained in step (2)(i) or a partial sequence thereof (such as , 3' end partial sequence) annealing, and the second region of the first bridging oligonucleotide has a 3' free end.
- the reaction product obtained in step (3)(ii) is a labeled nucleic acid molecule comprising the first strand.
- the first strand comprises from the 5' end to the 3' end: a cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, and the overhanging sequence at the 3' end , the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, optionally the complementary sequence of the third region of the second bridging oligonucleotide, the first bridging oligonucleotide sequence, The complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- the second region of the first bridging oligonucleotide is located at the 3' end of the first bridging oligonucleotide.
- the first region of the first bridging oligonucleotide is located at the 5' end of the first bridging oligonucleotide.
- said first bridging oligonucleotide does not contain said third region, and/or said second bridging oligonucleotide does not contain said third region.
- the 5' end of the first bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the first bridging oligonucleotide contains a free -OH.
- step (3)(ii) the second bridging oligonucleotide is unable to initiate an extension reaction (eg, the 3' end is blocked), and/or, the oligonucleotide The probe cannot initiate an extension reaction (eg, the 3' end is blocked).
- the capture sequence A of the primer A is a random oligonucleotide sequence.
- the first extension product described in step (2)(i)(b) of the method sequentially comprises from the 5' end to the 3' end: The cDNA sequence complementary to the RNA, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B.
- the first strand comprises from the 5' end to the 3' end: a cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, and the overhanging sequence at the 3' end , the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, optionally the complementary sequence of the third region of the second bridging oligonucleotide, the first bridging oligonucleotide sequence, The complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- the capture sequence A of the primer A is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A further contains a consensus sequence A, and an optional tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the first extension product in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the consensus sequence A, optionally the tag sequence A, The cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, the overhang sequence at the 3' end, the complementary sequence of the tag sequence B, and the complementary sequence of the consensus sequence B.
- the first strand comprises from the 5' end to the 3' end: the consensus sequence A, optionally the tag sequence A, the primer A formed with the primer A as a reverse transcription primer and the The cDNA sequence complementary to the RNA, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B, optionally the third region of the second bridging oligonucleotide
- step (3)(ii) at the positions corresponding to the first bridging oligonucleotide, the second bridging oligonucleotide, the oligonucleotide probe and the oligonucleotide probe
- the nucleic acid molecules hybridized to the first region and the second region of the same first bridging oligonucleotide are ligated, and/or, the nucleic acid molecules hybridized to the same second bridging oligonucleotide
- the ligation reaction process of connecting the nucleic acid molecules of the first region and the second region and the extension reaction described in step (3)(ii) can be carried out in any order, as long as the second nucleic acid molecule with a position marker can be obtained.
- nucleic acid molecules that hybridize to the first region and the second region of the same second bridging oligonucleotide can be ligated, and the first The bridging oligonucleotide initiates the extension reaction resulting in the first strand being obtained.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- the first strand can be obtained in the following exemplary ways:
- the polymerase used in the extension reaction preferably has strand displacement activity or 5' to 3' excision activity.
- said first bridging oligonucleotide can be obtained by initiating an extension reaction with said first bridging oligonucleotide and then ligating nucleic acid molecules hybridizing to the first and second regions of the same second bridging oligonucleotide. first chain.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- Embodiment comprising step (1), step (2)(i) and step (3)(ii): two strands
- the second region of the second bridging oligonucleotide is capable of annealing to the consensus sequence B complementary sequence or a partial sequence thereof of the first extension product obtained in step (2)(i), and
- the second region of the second bridging oligonucleotide has a 3' free end.
- the reaction product obtained in step (3)(ii) is a labeled nucleic acid molecule comprising said second strand.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- the complementary sequence of the third region of nucleotides, the second bridging oligonucleotide sequence, the tag sequence B, the complementary sequence of the 3' end overhang sequence, formed by using the primer A as a reverse transcription primer The complement of the cDNA sequence complementary to said RNA.
- the second region of the second bridging oligonucleotide is located at the 3' end of the second bridging oligonucleotide.
- the first region of the second bridging oligonucleotide is located at the 5' end of the second bridging oligonucleotide.
- said first bridging oligonucleotide does not contain said third region, and/or said second bridging oligonucleotide does not contain said third region.
- the 5' end of the second bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the second bridging oligonucleotide contains a free -OH.
- step (3)(ii) the first bridging oligonucleotide is unable to initiate an extension reaction (eg, the 3' end is blocked), and/or, step (2)(i ) The first extension product obtained cannot initiate the extension reaction (for example, the 3' end is blocked).
- the capture sequence A of the primer A is a random oligonucleotide sequence.
- the first extension product described in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the primer A formed by using the primer A as the reverse transcription primer and the RNA complementary cDNA sequence, the 3' end overhang sequence, the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- the complementary sequence of the third region of nucleotides, the second bridging oligonucleotide sequence, the tag sequence B, the complementary sequence of the 3' end overhang sequence, formed by using the primer A as a reverse transcription primer The complement of the cDNA sequence complementary to said RNA.
- the capture sequence A of the primer A is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A further contains a consensus sequence A, and an optional tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the first extension product in step (2)(i)(b) sequentially comprises from the 5' end to the 3' end: the consensus sequence A, optionally the tag sequence A, The cDNA sequence complementary to the RNA formed by using the primer A as a reverse transcription primer, the overhang sequence at the 3' end, the complementary sequence of the tag sequence B, and the complementary sequence of the consensus sequence B.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- the complementary sequence of the third region of nucleotides, the second bridging oligonucleotide sequence, the tag sequence B, the complementary sequence of the 3' end overhang sequence, formed by using the primer A as a reverse transcription primer The complementary sequence of the cDNA sequence complementary to the RNA, optionally the complementary sequence of the tag sequence A, the complementary sequence of the consensus sequence A.
- step (3)(ii) at the positions corresponding to the first bridging oligonucleotide, the second bridging oligonucleotide, the oligonucleotide probe and the oligonucleotide probe
- the nucleic acid molecules hybridized to the first region and the second region of the same first bridging oligonucleotide are ligated, and/or, the nucleic acid molecules hybridized to the same second bridging oligonucleotide
- the ligation reaction process of connecting the nucleic acid molecules of the first region and the second region and the extension reaction described in step (3)(ii) can be carried out in any order, as long as the second nucleic acid molecule with a position marker can be obtained.
- nucleic acid molecules hybridized to the first region and the second region of the same first bridging oligonucleotide can be connected, and the second The bridging oligonucleotide initiates the extension reaction, resulting in the second strand.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- the second chain can be obtained in the following exemplary ways:
- said second bridging oligonucleotide can be obtained by initiating an extension reaction with said second bridging oligonucleotide and then ligating nucleic acid molecules hybridizing to the first and second regions of the same first bridging oligonucleotide. second chain.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- step (1) An exemplary embodiment of the present application comprising step (1), step (2)(i) and step (3)(ii) is described in detail as follows:
- An exemplary scheme for preparing a cDNA chain using RNA (such as mRNA) in a sample as a template comprises the following steps (as shown in Figure 2):
- RNA molecules (for example, mRNA molecules) in the permeabilized sample are reverse-transcribed using reverse transcriptase (for example, reverse transcriptase with terminal transfer activity) and primer A to generate cDNA, and 3 An overhang (eg, an overhang comprising 3 cytosine nucleotides) is added to the ' end.
- reverse transcriptase for example, reverse transcriptase with terminal transfer activity
- primer A primer A to generate cDNA
- 3 An overhang eg, an overhang comprising 3 cytosine nucleotides
- 3 reverse transcriptases having terminal transfer activity can be used for the reverse transcription reaction.
- the reverse transcriptase used does not have RNaseH activity.
- the primer A comprises a poly(T) sequence and a consensus sequence A (CA).
- a poly(T) sequence is located at the 3' end of the primer A to initiate reverse transcription.
- the primer A comprises a random oligonucleotide sequence that can be used to capture RNA without a poly(A) tail.
- the random oligonucleotide sequence is located at the 3' end of the primer A to initiate reverse transcription.
- primer B to anneal or hybridize with the cDNA strand, said primer B comprising a consensus sequence B (CB), a unique molecular tag sequence (UMI) and the complementary sequence of the 3' end overhang of the cDNA.
- CB consensus sequence B
- UMI unique molecular tag sequence
- the nucleic acid fragment hybridized or annealed to the primer B can be extended using the UMI sequence and the consensus sequence B as templates, thereby generating a complementary primer carrying the UMI sequence at the 3' end. sequence, the nucleic acid molecule of the complementary sequence of the consensus sequence B.
- the consensus sequence B is located upstream of the UMI sequence (for example, the 5' end), and the sequence complementary to the 3' end overhang of the cDNA strand is located at the 3' end of the primer B.
- the primer B may include GGG at its 3' end.
- the nucleotides of the primer B can also be modified (for example, using locked nucleic acid) to enhance the complementary pairing between the primer B and the 3' end overhang of the cDNA strand.
- nucleic acid polymerases for example, DNA polymerase or reverse transcriptase
- DNA polymerase or reverse transcriptase can be used to carry out the extension reaction, as long as it can use the sequence of the primer B or a partial sequence thereof as a template to extend the captured Nucleic acid fragments (reverse transcription products) are sufficient.
- reverse transcriptase enzyme as in the previous reverse transcription step can be used to extend the captured nucleic acid fragment (reverse transcription product).
- this step is performed simultaneously with step (1) (eg, in the same reaction system).
- the method optionally further comprises step (3): adding RNaseH to digest the RNA strand in the RNA/cDNA hybrid duplex to form a cDNA single strand.
- said method does not comprise said step (3).
- the exemplary structure of the cDNA strand prepared by the above exemplary embodiment comprises: consensus sequence A, cDNA sequence, 3' end overhang sequence, complementary sequence of UMI sequence, and complementary sequence of consensus sequence B.
- the sexual scheme includes the following steps (as shown in Figure 3):
- a bridging oligonucleotide pair consisting of a first bridging oligonucleotide and a second bridging oligonucleotide, wherein each of the first bridging oligonucleotide and the second bridging oligonucleotide is independently Including: a first region (P1) and a second region (P2), the first region is located upstream of the second region (for example, the 5' end); wherein,
- the first region of the first bridging oligonucleotide is capable of annealing to the first region of the second bridging oligonucleotide; the second region of the first bridging oligonucleotide is capable of annealing to the oligonucleotide Annealing to the consensus sequence X2 of the acid probe or a partial sequence thereof;
- the second region of the second bridging oligonucleotide can anneal to the complementary sequence of the consensus sequence B or a partial sequence thereof in the cDNA strand obtained in step 1 above.
- the first bridging oligonucleotide contains spacer nucleotides between the first region and the second region, such as 1-5nt or 5-10nt spacer nucleotides, that is, the first bridging oligonucleotide
- a bridging oligonucleotide sequence contains a third region located between the first region and the second region.
- the first region and the second region in the first bridging oligonucleotide are adjacently connected without redundant nucleotides, that is, the first bridging oligonucleotide
- the nucleotide sequence does not contain a third region located between the first region and the second region.
- the second bridging oligonucleotide contains spacer nucleotides between the first region and the second region, such as 1-5nt or 5-10nt spacer nucleotides, that is, the second bridging oligonucleotide
- the second bridging oligonucleotide sequence contains a third region located between the first region and the second region.
- the first region and the second region in the second bridging oligonucleotide are adjacently connected without redundant nucleotides, that is, the second bridging oligonucleotide
- the nucleotide sequence does not contain a third region located between the first region and the second region.
- the exemplary structure of the new nucleic acid molecule containing chip sequence information formed by the above exemplary embodiment comprises: a consensus sequence A from the 5' end to the 3' end, a cDNA sequence, an overhang sequence at the 3' end, and the complement of the UMI sequence sequence, the complementary sequence of the consensus sequence B, the first bridging oligonucleotide sequence, the complementary sequence of the tag sequence Y, and the nucleic acid strand of the complementary sequence of the consensus sequence X1 and/or its complementary nucleic acid strand.
- the method comprises step (1), step (2)(ii) and step (3).
- the first extension product comprises from the 5' end to the 3' end: the consensus sequence A, reverse transcribed by the primer A' The cDNA sequence complementary to the RNA formed by the primers, the overhang sequence at the 3' end, the optional complementary sequence of the tag sequence B, and the complementary sequence of the consensus sequence B.
- the extension primer is the primer B' or primer B" or a random primer, wherein the primer B" can be combined with the consensus sequence The complementary sequence of B, or a portion thereof, anneals and is able to initiate an extension reaction.
- the second extension product comprises from the 5' end to the 3' end: a cDNA sequence complementary to the cDNA sequence formed by extending the extension primer Sequence, the complementary sequence of the consensus sequence A.
- Embodiments comprising step (1), step (2)(ii) and step (3)(i)
- the method comprises step (1), step (2)(ii) and step (3)(i); wherein, the consensus sequence X2 or a partial sequence thereof can be combined with the consensus sequence A
- the complementary sequence or partial sequence thereof is annealed;
- the extension product obtained in step (3)(i) is a labeled nucleic acid molecule, which comprises: the first strand containing the first nucleic acid molecule sequence to be labeled, and/or , the second strand containing the oligonucleotide probe sequence.
- the consensus sequence X2 can be annealed with the complementary sequence of the consensus sequence A or the nucleotide sequence of a partial segment of the complementary sequence of the consensus sequence A with its overall nucleotide sequence, the consensus sequence X2 can also anneal to the complementary sequence of the consensus sequence A or the nucleotide sequence of a partial segment of the complementary sequence of the consensus sequence A with the nucleotide sequence of its partial segment.
- the first strand comprises from the 5' end to the 3' end: the sequence of the first nucleic acid molecule to be labeled, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1 .
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, and the first nucleic acid molecule to be labeled Sequence complementary cDNA sequences.
- Embodiment comprising step (1), step (2)(ii) and step (3)(i): a chain
- the consensus sequence X2 or a partial sequence thereof can anneal to the complementary sequence of the consensus sequence A or a partial sequence thereof (for example, a partial sequence at the 3' end); obtained in step (3)(i)
- the extension product is a labeled nucleic acid molecule, which includes a first strand containing the sequence of the first nucleic acid molecule to be labeled.
- step (3)(i) the oligonucleotide probe cannot initiate an extension reaction (eg, the 3' end is blocked).
- the capture sequence A of the primer A' is a random oligonucleotide sequence.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhanging sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, and the complementary sequence of the consensus sequence A.
- the first strand comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, the complementary sequence of the overhang sequence at the 3' end, and
- the primer A' is the complementary sequence of the cDNA sequence complementary to the RNA formed by the reverse transcription primer, the complementary sequence of the consensus sequence A, the complementary sequence of the tag sequence Y, and the complementary sequence of the consensus sequence X1.
- each copy of the first strand derived from the same oligonucleotide probe has a different complementary sequence of capture sequence A as the UMI.
- the capture sequence A of the primer A' is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A' also contains a tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhang sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, the complementary sequence of the tag sequence A, the consensus sequence A complementary sequence.
- the first strand comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, the complementary sequence of the overhang sequence at the 3' end, and
- the primer A' is the complementary sequence of the cDNA sequence complementary to the RNA formed by the reverse transcription primer, the complementary sequence of the tag sequence A, the complementary sequence of the consensus sequence A, the complementary sequence of the tag sequence Y, The complementary sequence of said consensus sequence X1.
- each copy of the first strand derived from the same oligonucleotide probe has a different complementary sequence of the tag sequence A as the UMI.
- Embodiment comprising step (1), step (2)(ii) and step (3)(i): two strands
- the consensus sequence X2 or a partial sequence thereof can anneal to the complementary sequence of the consensus sequence A or a partial sequence thereof; obtained in step (3)(i)
- the extension product of is a labeled nucleic acid molecule comprising a second strand comprising the oligonucleotide probe sequence.
- the second extension product obtained in step (2)(ii) cannot initiate an extension reaction (eg, the 3' end is blocked).
- the capture sequence A of the primer A' is a random oligonucleotide sequence.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhanging sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, and the complementary sequence of the consensus sequence A.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, and the first nucleic acid molecule to be labeled cDNA sequence complementary to the sequence, the 3' end overhang sequence, optionally the complementary sequence of the tag sequence B, the complementary sequence of the consensus sequence B.
- each copy of the second strand derived from the same oligonucleotide probe has a different capture sequence A as UMI.
- the capture sequence A of the primer A' is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A' also contains a tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhang sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, the complementary sequence of the tag sequence A, the consensus sequence A complementary sequence.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, the tag sequence A, and the to-be
- each copy of the second strand derived from the same oligonucleotide probe has a different tag sequence A as UMI.
- step (1) An exemplary embodiment of the present application comprising step (1), step (2)(ii) and step (3)(i) is described in detail as follows:
- An exemplary scheme for preparing a cDNA strand complementary strand containing a complementary sequence of UMI at the 3' end using RNA (such as mRNA) in the sample as a template comprises the following steps (as shown in Figure 5):
- RNA molecules (for example, mRNA molecules) in the permeabilized sample are reverse-transcribed using reverse transcriptase (for example, reverse transcriptase with terminal transfer activity) and primer A' to generate cDNA, and An overhang (eg, an overhang comprising 3 cytosine nucleotides) is added at the 3' end.
- reverse transcriptase for example, reverse transcriptase with terminal transfer activity
- primer A' primer A' to generate cDNA
- An overhang eg, an overhang comprising 3 cytosine nucleotides
- Various reverse transcriptases having terminal transfer activity can be used for the reverse transcription reaction.
- the reverse transcriptase used does not have RNaseH activity.
- the primer A' comprises a poly(T) sequence, a UMI sequence, and a consensus sequence A(CA).
- a poly(T) sequence is located at the 3' end of the primer A' to initiate reverse transcription, and the consensus sequence A is located upstream (eg 5' end) of the UMI sequence.
- the primer A' comprises a random oligonucleotide sequence and a consensus sequence A, which can be used to capture RNA without a ploy A tail.
- the random oligonucleotide sequence is located at the 3' end of the primer A' to initiate reverse transcription.
- sequence complementary to the 3' end overhang of the cDNA strand is located at the 3' end of the primer B'.
- the primer B' may include GGG at its 3' end.
- the nucleotides of the primer B' can also be modified (for example, using a locked nucleic acid) to enhance the complementary pairing between the primer B' and the 3' end overhang of the cDNA strand.
- nucleic acid polymerases for example, DNA polymerase or reverse transcriptase
- DNA polymerase or reverse transcriptase can be used to carry out the extension reaction, as long as it can be captured using the sequence of the primer B' or a partial sequence thereof as a template extension Nucleic acid fragments (reverse transcription products) can be.
- reverse transcriptase enzyme as in the previous reverse transcription step can be used to extend the captured nucleic acid fragment (reverse transcription product).
- this step is performed simultaneously with step (1) (eg, in the same reaction system).
- the method optionally further comprises step (3): adding RNaseH to digest the RNA strand in the RNA/cDNA hybrid duplex to form a cDNA single strand.
- said method does not comprise said step (3).
- extension primer the cDNA strand obtained in the previous step is used as a template for an extension reaction to obtain an extension product;
- the extension primer is the primer B', a random primer, or a primer B", and the primer B" can be combined with The consensus sequence B or a partial sequence thereof anneals and is capable of initiating an extension reaction.
- the exemplary structure of the cDNA strand complementary chain prepared by the above exemplary embodiment comprises: consensus sequence B, complementary sequence of 3' end overhang, complementary sequence of cDNA sequence, complementary sequence of UMI sequence, and complementary sequence of consensus sequence A sequence.
- the consensus sequence X2 of the ChIP-seq or a partial sequence thereof can anneal to the complementary sequence of the consensus sequence A or a partial sequence thereof of the complementary strand of the cDNA strand obtained in step 1 above.
- the complementary strand of the cDNA chain is annealed or hybridized with the ChIP-seq, and under the action of the polymerase, a new nucleic acid molecule containing the ChIP-seq information (that is, a nucleic acid molecule labeled with the ChIP-seq) is formed.
- the exemplary structure of the new nucleic acid molecule containing chip sequence information formed by the above exemplary embodiment comprises: from the 5' end to the 3' end containing the consensus sequence B, the complementary sequence of the 3' end overhang, the cDNA sequence Complementary sequence, the complementary sequence of the UMI sequence, the complementary sequence of the consensus sequence A, the complementary sequence of the tag sequence Y, and the nucleic acid strand of the complementary sequence of the consensus sequence X1 and/or its complementary nucleic acid strand.
- Embodiments comprising step (1), step (2)(ii) and step (3)(ii)
- the method comprises step (1), step (2)(ii) and step (3)(ii); wherein the second region of the second bridging oligonucleotide is capable of combining with step (2) Annealing of the complementary sequence of the consensus sequence A of the second extension product obtained in (ii) or a partial sequence thereof;
- the reaction product obtained in step (3)(ii) is a labeled nucleic acid molecule, which comprises: The first strand of the first nucleic acid molecule sequence to be labeled, and/or, the second strand containing the oligonucleotide probe sequence.
- the second region of the second bridging oligonucleotide can be the complementary sequence of the consensus sequence A of the second extension product obtained in step (2)(ii) or a partial region of the complementary sequence of the consensus sequence A
- the nucleotide sequences of the segments are annealed.
- the first strand comprises from the 5' end to the 3' end: the sequence of the first nucleic acid molecule to be labeled, optionally the third region of the second bridging oligonucleotide complementary sequence, the first bridging oligonucleotide sequence, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- the consensus sequence X1 the tag sequence Y
- the consensus sequence X2 optionally the first bridging oligo
- Embodiment comprising step (1), step (2)(ii) and step (3)(ii): a chain
- the second region of the second bridging oligonucleotide can be compatible with the complementary sequence of the consensus sequence A of the second extension product obtained in step (2)(ii) or the 3' end portion thereof The sequences anneal and the second region of the first bridging oligonucleotide has a 3' free end.
- the reaction product obtained in step (3)(ii) is a labeled nucleic acid molecule comprising the first strand.
- the second region of the first bridging oligonucleotide is located at the 3' end of the first bridging oligonucleotide.
- the first region of the first bridging oligonucleotide is located at the 5' end of the first bridging oligonucleotide. In certain embodiments, said first bridging oligonucleotide does not contain said third region, and/or said second bridging oligonucleotide does not contain said third region.
- the 5' end of the first bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the first bridging oligonucleotide contains a free -OH.
- step (3)(ii) the second bridging oligonucleotide is unable to initiate an extension reaction (eg, the 3' end is blocked), and/or, the oligonucleotide The probe cannot initiate an extension reaction (eg, the 3' end is blocked).
- the capture sequence A of the primer A' is a random oligonucleotide sequence.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhanging sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, and the complementary sequence of the consensus sequence A.
- the first strand comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, the complementary sequence of the overhang sequence at the 3' end, and the Primer A' is the complementary sequence of the cDNA sequence complementary to the RNA formed by the reverse transcription primer, the complementary sequence of the consensus sequence A, and optionally the complementary sequence of the third region of the second bridging oligonucleotide, The first bridging oligonucleotide sequence, the complementary sequence of the tag sequence Y, the complementary sequence of the consensus sequence X1.
- each copy of the first strand derived from the same oligonucleotide probe has a different complementary sequence of capture sequence A as the UMI.
- the capture sequence A of the primer A' is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A' also contains a tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhang sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, the complementary sequence of the tag sequence A, the consensus sequence A complementary sequence.
- the first strand comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, the complementary sequence of the overhang sequence at the 3' end, and the Primer A' is the complementary sequence of the cDNA sequence complementary to the RNA formed by the reverse transcription primer, the complementary sequence of the tag sequence A, the complementary sequence of the consensus sequence A, and optionally the second bridging oligonucleotide
- each copy of the first strand derived from the same oligonucleotide probe has a different complementary sequence of the tag sequence A as the UMI.
- step (3)(ii) at the positions corresponding to the first bridging oligonucleotide, the second bridging oligonucleotide, the oligonucleotide probe and the oligonucleotide probe
- the nucleic acid molecules hybridized to the first region and the second region of the same first bridging oligonucleotide are ligated, and/or, the nucleic acid molecules hybridized to the same second bridging oligonucleotide
- the ligation reaction process of connecting the nucleic acid molecules of the first region and the second region and the extension reaction described in step (3)(ii) can be carried out in any order, as long as the second nucleic acid molecule with a position marker can be obtained.
- nucleic acid molecules that hybridize to the first region and the second region of the same second bridging oligonucleotide can be ligated, and the first The bridging oligonucleotide initiates the extension reaction resulting in the first strand being obtained.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- the first strand can be obtained in the following exemplary ways:
- the polymerase used in the extension reaction preferably has strand displacement activity or 5' to 3' excision activity.
- said first bridging oligonucleotide can be obtained by initiating an extension reaction with said first bridging oligonucleotide and then ligating nucleic acid molecules hybridizing to the first and second regions of the same second bridging oligonucleotide. first chain.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- Embodiment comprising step (1), step (2)(ii) and step (3)(ii): two chains
- the second region of the second bridging oligonucleotide is capable of annealing to the complementary sequence of the consensus sequence A or a partial sequence thereof of the second extension product obtained in step (2)(ii), And the second region of the second bridging oligonucleotide has a 3' free end.
- the reaction product obtained in step (3)(ii) is a labeled nucleic acid molecule comprising said second strand.
- the second region of the second bridging oligonucleotide is located at the 3' end of the second bridging oligonucleotide.
- the first region of the second bridging oligonucleotide is located at the 5' end of the second bridging oligonucleotide.
- said first bridging oligonucleotide does not contain said third region, and/or said second bridging oligonucleotide does not contain said third region.
- the 5' end of the second bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the second bridging oligonucleotide contains a free -OH.
- step (3)(ii) the first bridging oligonucleotide cannot initiate an extension reaction (eg, the 3' end is blocked), and/or, step (2)(ii) ) The second extension product obtained cannot initiate the extension reaction (for example, the 3' end is blocked).
- the capture sequence A of the primer A' is a random oligonucleotide sequence.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhanging sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, and the complementary sequence of the consensus sequence A.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- each copy of the second strand derived from the same oligonucleotide probe has a different capture sequence A as UMI.
- the capture sequence A of the primer A' is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A' also contains a tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A.
- the extension primer is the primer B'.
- the second extension product comprises from the 5' end to the 3' end: the consensus sequence B, optionally the tag sequence B, The complementary sequence of the overhang sequence at the 3' end, the complementary sequence of the cDNA sequence complementary to the RNA formed by using the primer A' as a reverse transcription primer, the complementary sequence of the tag sequence A, the consensus sequence A complementary sequence.
- the second strand comprises from the 5' end to the 3' end: the consensus sequence X1, the tag sequence Y, the consensus sequence X2, optionally the first bridging oligo
- each copy of the second strand derived from the same oligonucleotide probe has a different tag sequence A as UMI.
- step (3)(ii) at the positions corresponding to the first bridging oligonucleotide, the second bridging oligonucleotide, the oligonucleotide probe and the oligonucleotide probe
- the nucleic acid molecules hybridized to the first region and the second region of the same first bridging oligonucleotide are ligated, and/or, the nucleic acid molecules hybridized to the same second bridging oligonucleotide
- the ligation reaction process of connecting the nucleic acid molecules of the first region and the second region and the extension reaction described in step (3)(ii) can be carried out in any order, as long as the second nucleic acid molecule with a position marker can be obtained.
- nucleic acid molecules hybridized to the first region and the second region of the same first bridging oligonucleotide can be connected, and the second The bridging oligonucleotide initiates the extension reaction, resulting in the second strand.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- the second chain can be obtained in the following exemplary ways:
- said second bridging oligonucleotide can be obtained by initiating an extension reaction with said second bridging oligonucleotide and then ligating nucleic acid molecules hybridizing to the first and second regions of the same first bridging oligonucleotide. second chain.
- the polymerase used in the extension reaction preferably does not have strand displacement activity or 5' to 3' excision activity.
- step (1) An exemplary embodiment of the present application comprising step (1), step (2)(ii) and step (3)(ii) is described in detail as follows:
- An exemplary scheme for preparing a cDNA strand complementary chain using RNA (such as mRNA) in a sample as a template comprises the following steps (as shown in FIG. 5 ):
- RNA molecules (for example, mRNA molecules) in the permeabilized sample are reverse-transcribed using reverse transcriptase (for example, reverse transcriptase with terminal transfer activity) and primer A' to generate cDNA, and An overhang (eg, an overhang comprising 3 cytosine nucleotides) is added at the 3' end.
- reverse transcriptase for example, reverse transcriptase with terminal transfer activity
- primer A' primer A' to generate cDNA
- An overhang eg, an overhang comprising 3 cytosine nucleotides
- Various reverse transcriptases having terminal transfer activity can be used for the reverse transcription reaction.
- the reverse transcriptase used does not have RNaseH activity.
- the primer A' comprises a poly(T) sequence, a UMI sequence, and a consensus sequence A(CA).
- a poly(T) sequence is located at the 3' end of the primer A' to initiate reverse transcription, and the consensus sequence A is located upstream (e.g., at the 5' end) of the UMI sequence.
- the primer A' comprises a random oligonucleotide sequence and a consensus sequence A, which can be used to capture RNA without a ploy A tail.
- the random oligonucleotide sequence is located at the 3' end of the primer A' to initiate reverse transcription.
- sequence complementary to the 3' end overhang of the cDNA strand is located at the 3' end of the primer B'.
- the primer B' may include GGG at its 3' end.
- the nucleotides of the primer B' can also be modified (for example, using a locked nucleic acid) to enhance the complementary pairing between the primer B' and the 3' end overhang of the cDNA strand.
- nucleic acid polymerases for example, DNA polymerase or reverse transcriptase
- DNA polymerase or reverse transcriptase can be used to carry out the extension reaction, as long as it can be captured using the sequence of the primer B' or a partial sequence thereof as a template extension Nucleic acid fragments (reverse transcription products) can be.
- reverse transcriptase enzyme as in the previous reverse transcription step can be used to extend the captured nucleic acid fragment (reverse transcription product).
- this step is performed simultaneously with step (1) (eg, in the same reaction system).
- the method optionally further comprises step (3): adding RNaseH to digest the RNA strand in the RNA/cDNA hybrid duplex to form a cDNA single strand.
- said method does not comprise said step (3).
- extension primer the cDNA strand obtained in the previous step is used as a template for an extension reaction to obtain an extension product;
- the extension primer is the primer B', a random primer, or a primer B", and the primer B" can be combined with The consensus sequence B or a partial sequence thereof anneals and is capable of initiating an extension reaction.
- the exemplary structure of the cDNA strand complementary chain prepared by the above exemplary embodiment comprises: consensus sequence B, complementary sequence of 3' end overhang, complementary sequence of cDNA sequence, complementary sequence of UMI sequence, and complementary sequence of consensus sequence A sequence.
- a bridging oligonucleotide pair consisting of a first bridging oligonucleotide and a second bridging oligonucleotide, wherein each of the first bridging oligonucleotide and the second bridging oligonucleotide is independently Including: a first region (P1) and a second region (P2), the first region is located upstream of the second region (for example, the 5' end); wherein,
- the first region of the first bridging oligonucleotide is capable of annealing to the first region of the second bridging oligonucleotide; the second region of the first bridging oligonucleotide is capable of annealing to the oligonucleotide Annealing to the consensus sequence X2 of the acid probe or a partial sequence thereof;
- the second region of the second bridging oligonucleotide can anneal to the complementary sequence of the consensus sequence A in the complementary strand of the cDNA strand obtained in the above step 1 or a partial sequence thereof.
- the first bridging oligonucleotide contains spacer nucleotides between the first region and the second region, such as 1-5nt or 5-10nt spacer nucleotides, that is, the first bridging oligonucleotide
- a bridging oligonucleotide sequence contains a third region located between the first region and the second region.
- the first region and the second region in the first bridging oligonucleotide are adjacently connected without redundant nucleotides, that is, the first bridging oligonucleotide
- the nucleotide sequence does not contain a third region located between the first region and the second region.
- the first region and the second region in the second bridging oligonucleotide include spacer nucleotides, such as 1-5nt or 5-10nt spacer nucleotides, that is, the second bridging oligonucleotide
- the second bridging oligonucleotide sequence contains a third region located between the first region and the second region.
- the first region and the second region in the second bridging oligonucleotide are adjacently connected without redundant nucleotides, that is, the second bridging oligonucleotide
- the nucleotide sequence does not contain a third region located between the first region and the second region.
- the first bridging oligonucleotide, the second bridging oligonucleotide and the chip sequence are annealed or hybridized to the complementary strand of the cDNA strand obtained in step 1 above, and then hybridized to the same first bridging oligonucleotide by DNA ligase and/or, link nucleic acid molecules of the first and second regions that hybridize to the same second bridging oligonucleotide. Subsequently, under the action of DNA polymerase, new nucleic acid molecules containing ChIP-seq information (ie, ChIP-seq-labeled nucleic acid molecules) are formed. The concatenation process and polymerization process are performed in any order.
- the exemplary structure of the new nucleic acid molecule containing chip sequence information formed by the above exemplary embodiment comprises: from the 5' end to the 3' end containing the consensus sequence B, the complementary sequence of the 3' end overhang, the cDNA sequence complementary sequence, the complementary sequence of the UMI sequence, the complementary sequence of the consensus sequence A, the first bridging oligonucleotide sequence, the complementary sequence of the tag sequence Y, and the complementary sequence of the consensus sequence X1 A nucleic acid strand and/or its complementary nucleic acid strand.
- step (2)(i)(b) of the method the cDNA strand anneals to the primer B via its 3' end overhang, and, in the presence of a nucleic acid polymerase (e.g., Under the action of DNA polymerase or reverse transcriptase), the cDNA chain is extended using the primer B as a template to generate the first extension product.
- a nucleic acid polymerase e.g., Under the action of DNA polymerase or reverse transcriptase
- step (2)(ii)(b) of the method the cDNA strand is annealed to the primer B' via its 3' end overhang, and, in the presence of a nucleic acid polymerase (e.g. , DNA polymerase or reverse transcriptase), the cDNA chain is extended using the primer B' as a template to generate the first extension product.
- a nucleic acid polymerase e.g. , DNA polymerase or reverse transcriptase
- the 3' terminal overhang has at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9 , at least 10 or more nucleotides in length. In certain embodiments, the 3' terminal overhang is a 3' terminal overhang of 2-5 cytosine nucleotides (eg, a CCC overhang).
- step (2) before performing the pretreatment, the biological sample is permeabilized.
- the biological sample is a tissue sample.
- the tissue sample is a tissue section.
- the tissue sections are prepared from fixed tissue, eg, formalin-fixed paraffin-embedded (FFPE) tissue or deep-frozen tissue.
- FFPE formalin-fixed paraffin-embedded
- each cell of the biological sample occupies one or more microspots in the nucleic acid array (i.e., each cell is individually associated with One or more micropoint contacts in the nucleic acid array).
- performing reverse transcription in step (2) includes using reverse transcriptase.
- the reverse transcriptase has terminal transfer activity.
- the reverse transcriptase is capable of synthesizing a cDNA strand using RNA (eg, mRNA) as a template, and adding an overhang at the 3' end of the cDNA strand.
- RNA eg, mRNA
- the reverse transcriptase is capable of adding at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7 nucleotides in length to the 3' end of the cDNA strand. , an overhang of at least 8, at least 9, at least 10 or more nucleotides.
- the reverse transcriptase is capable of adding an overhang of 2-5 cytosine nucleotides (eg, a CCC overhang) at the 3' end of the cDNA strand.
- an overhang of 2-5 cytosine nucleotides eg, a CCC overhang
- the reverse transcriptase is selected from the group consisting of M-MLV reverse transcriptase, HIV-1 reverse transcriptase, AMV reverse transcriptase, telomerase reverse transcriptase, and transposases having the above transposase activity variants, modifications and derivatives.
- steps (2) and (3) have one or more features selected from:
- the primer A, primer A', primer B, primer B', the first bridging oligonucleotide, and the first bridging oligonucleotide each independently comprise or consist of naturally occurring nucleotides (such as deoxyribose nucleotides or ribonucleotides), modified nucleotides, non-natural nucleotides, or any combination thereof; in certain embodiments, the primer A, primer A' is capable of initiating an extension reaction ;
- the primer B comprises a modified nucleotide (such as a locked nucleic acid); in some embodiments, the 3' end of the primer B comprises one or more modified nucleotides (such as a locked nucleic acid);
- the primer B' comprises a modified nucleotide (such as a locked nucleic acid); in some embodiments, the 3' end of the primer B' comprises one or more modified nucleotides (such as a locked nucleic acid );
- the tag sequence A and the tag sequence B each independently have a length of 5-200 (eg, 5-30nt, 6-15nt);
- the consensus sequence A and the consensus sequence B each independently have 10-200nt (such as 10-100nt, 20-100nt, 25-100nt, 5-10nt, 10-15nt, 15-20nt, 20-50nt, 20 -30nt, 30-40nt, 40-50nt, 50-100nt) length;
- the primer A, primer A', primer B, and primer B' each independently have 4-200nt (such as 5-200nt, 15-230nt, 26-115nt, 10-130nt, 10-20nt, 20-50nt , 20-30nt, 30-40nt, 40-50nt, 50-100nt, 100-150nt, 150-200nt) length;
- the first region of the first bridging oligonucleotide, and the second region each independently have 3-100nt (such as 20-100nt, 3-10nt, 10-15nt, 15-20nt, 20-70nt, 20 -30nt, 30-40nt, 40-50nt, 50-100nt) length;
- the first region of the second bridging oligonucleotide, the second region each independently has 3-100nt (such as 20-100nt, 3-10nt, 10-15nt, 15-20nt, 20-70nt, 20 -30nt, 30-40nt, 40-50nt, 50-100nt) length;
- the third region of the first bridging oligonucleotide, the third region of the second bridging oligonucleotide each independently has 0-50nt (such as Ont, 0-10nt, 10-15nt, 15 -20nt, 20-30nt, 30-40nt, 40-50nt) length;
- the first bridging oligonucleotide and the second bridging oligonucleotide each independently have 6-200nt (such as 20-100nt, 20-70nt, 6-15nt, 15-20nt, 20-30nt, 30nt -40nt, 40-50nt, 50-100nt, 100-150nt, 150-200nt) length;
- the poly(T) sequence includes at least 5, or at least 20 (eg, 6-100, 10-50) deoxythymidine residues;
- the random oligonucleotide sequence has a length of 5-200 (eg 5nt, 5-30nt, 6-15nt).
- the method further comprises: (4) recovering and purifying the second population of nucleic acid molecules.
- the obtained second population of nucleic acid molecules and/or complements thereof are used for constructing a transcriptome library or for transcriptome sequencing.
- the oligonucleotide probes in step (1) have one or more characteristics selected from the following:
- consensus sequence X1, tag sequence Y and consensus sequence X2 each independently comprise or consist of naturally occurring nucleotides (such as deoxyribonucleotides or ribonucleotides), modified nucleotides, non- Natural nucleotides (such as peptide nucleic acid (PNA) or locked nucleic acid), or any combination thereof;
- naturally occurring nucleotides such as deoxyribonucleotides or ribonucleotides
- modified nucleotides such as peptide nucleic acid (PNA) or locked nucleic acid
- the consensus sequence X1, the tag sequence Y and the consensus sequence X2 each independently have 2-200nt (such as 10-200nt, 25-100nt, 10-30nt, 10-100nt, 5-10nt, 10-15nt, 15 -20nt, 20-30nt, 30-40nt, 40-50nt, 50-100nt) length.
- 2-200nt such as 10-200nt, 25-100nt, 10-30nt, 10-100nt, 5-10nt, 10-15nt, 15 -20nt, 20-30nt, 30-40nt, 40-50nt, 50-100nt
- the oligonucleotide probe is coupled to the solid support via a linker.
- the linker is a linking group capable of reacting with an activating group, and the surface of the solid support is linked with an activating group.
- the linker comprises -SH, -DBCO, or -NHS.
- the nucleic acid array in step (1) has one or more characteristics selected from the following:
- the oligonucleotide probes coupled on the same solid support have the same consensus sequence X1 and/or the same consensus sequence X2; (2) the oligonucleotide probes
- the consensus sequence X1 of the nucleotide probes comprises a cleavage site; in some embodiments, the cleavage site can be selected from a nicking enzyme (nicking enzyme) enzyme cleavage, USER enzyme cleavage, photocleavage, chemical cleavage or CRISPR cut or fractured by means of resection.
- nicking enzyme nicking enzyme
- the nucleic acid array in step (1) is provided by steps comprising:
- each vector sequence comprising at least one copy (for example, multiple copies) of the vector sequence, the vector sequence comprising from the 5' to 3' direction: the complementary sequence of the consensus sequence X2, The complementary sequence of the tag sequence Y and the fixed sequence; wherein, the complementary sequences of the tag sequence Y of each carrier sequence are different from each other;
- extension product comprises or consists of: a consensus sequence X1, a tag sequence Composed of Y and consensus sequence X2;
- steps (3) and (4) are performed in any order;
- the fixed sequence of the carrier sequence also includes a cleavage site, and the cleavage can be selected from nicking enzyme enzyme cleavage, USER enzyme cleavage, light cleavage, chemical cleavage or CRISPR cleavage;
- the cleavage site contained in the fixed sequence of the carrier sequence is cut to digest the carrier sequence, so that the extension product in step (3) is separated from the template (i.e. the carrier sequence) forming the extension product, so that the oligo Nucleotide probes are attached to the surface of a solid support such as a chip.
- the method further includes separating the extension product in step (3) from the template forming the extension product (ie, the carrier sequence) by high temperature denaturation.
- each vector sequence is a DNB formed from a concatemer of multiple copies of the vector sequence.
- the various vector sequences are provided in step (1) by the following steps:
- each vector template sequence as a template to perform a nucleic acid amplification reaction to obtain an amplification product of each vector template sequence, the amplification product comprising at least one copy of the vector sequence; in certain embodiments, Rolling circle replication is performed to obtain DNBs formed from concatemers of the vector sequences.
- the solid phase support in step (1) has one or more characteristics selected from the following:
- the solid support is selected from latex beads, dextran beads, polystyrene surfaces, polypropylene surfaces, polyacrylamide gels, gold surfaces, glass surfaces, chips, sensors, electrodes and silicon wafers; In some embodiments, the solid support is a chip;
- the solid support is planar, spherical or porous
- the solid phase support can be used as a sequencing platform, such as a sequencing chip; in some embodiments, the solid phase support is a sequencing chip for Illumina, MGI or Thermo Fisher sequencing platforms; and
- the solid support is capable of releasing all the compounds spontaneously or upon exposure to one or more stimuli (e.g., temperature change, pH change, exposure to a specific chemical substance or phase, exposure to light, reducing agent, etc.) oligonucleotide probes.
- stimuli e.g., temperature change, pH change, exposure to a specific chemical substance or phase, exposure to light, reducing agent, etc.
- the present application also provides a method for constructing a library of nucleic acid molecules, which includes,
- step (c) optionally, amplifying and/or enriching the product of step (b);
- a library of nucleic acid molecules is thereby obtained.
- the library of nucleic acid molecules is used for sequencing, e.g., transcriptome sequencing, e.g., single cell transcriptome sequencing (e.g., 5' or 3' transcriptome sequencing).
- sequencing e.g., transcriptome sequencing, e.g., single cell transcriptome sequencing (e.g., 5' or 3' transcriptome sequencing).
- the method before performing step (b), further comprises a step (pre-b): amplifying and/or enriching the population of labeled nucleic acid molecules.
- step (pre-b) the population of labeled nucleic acid molecules is subjected to a nucleic acid amplification reaction to generate an enriched product.
- the amplification reaction is performed using at least primer C and/or primer D, wherein the primer C is capable of hybridizing or annealing to the complementary sequence of the consensus sequence X1 or a partial sequence thereof, and Initiate an extension reaction; the primer D can hybridize or anneal to the nucleic acid molecular chain containing the tag sequence Y in the labeled nucleic acid molecule population, and initiate an extension reaction.
- the nucleic acid amplification reaction in step (pre-b) is performed using a nucleic acid polymerase (eg, DNA polymerase, eg, DNA polymerase with strand displacement activity and/or high fidelity).
- a nucleic acid polymerase eg, DNA polymerase, eg, DNA polymerase with strand displacement activity and/or high fidelity.
- step (b) of the method the nucleic acid molecule is randomly disrupted with a transposase and adapters are added.
- the nucleic acid molecule obtained in the previous step is randomly interrupted with a transposase, and a first linker and a second linker are respectively added to both ends of the fragment.
- the transposase is selected from Tn5 transposase, MuA transposase, Sleeping Beauty transposase, Mariner transposase, Tn7 transposase, Tn10 transposase, Ty1 transposase, Tn552 transposase, and variants, modified products and derivatives having the transposition activity of the above-mentioned transposases.
- the transposase is a Tn5 transposase.
- step (c) at least primer C' and/or primer D' are used to amplify the product of step (b), wherein said primer C' is capable of combining with said first adapter hybridizes or anneals and initiates an extension reaction, said primer D' is capable of hybridizing or annealing to said second adapter and initiates an extension reaction.
- step (c) at least the product of step (b) is amplified using the primer C and/or primer D'; wherein, the primer D' can be combined with the first The adapter or second adapter hybridizes or anneals and initiates an extension reaction.
- the present application also provides a method for performing transcriptome sequencing on cells in a sample, comprising:
- test kit comprising:
- nucleic acid array for labeling nucleic acids comprising a solid support coupled to a plurality of oligonucleotide probes; each oligonucleotide probe comprising at least one copy; and , the oligonucleotide probe comprises or consists of: consensus sequence X1, tag sequence Y and consensus sequence X2 from the 5' to 3' direction, wherein,
- oligonucleotide probes have different tag sequences Y, and the tag sequence Y has a unique nucleotide sequence corresponding to the position of the oligonucleotide probe on the solid support;
- the primer A contains a capture sequence A capable of annealing to the RNA to be captured (eg, mRNA) and initiating an extension reaction;
- the primer B comprises a consensus sequence B, a 3' end overhang complementary sequence, and an optional tag sequence B; in some embodiments, the 3' end overhang complementary sequence is located at the 3' end of the primer B ; In some embodiments, the consensus sequence B is located upstream of the complementary sequence of the 3' end overhang (for example, at the 5' end of the primer B); wherein, the 3' end overhang refers to The RNA captured by the capture sequence A of the primer A is one or more non-template nucleotides contained in the 3' end of the cDNA chain generated by template reverse transcription;
- the primer A' contains a consensus sequence A and a capture sequence A; in some embodiments, the capture sequence A is located at the 3' end of the primer A'; in some embodiments, the consensus sequence A is located at Upstream of the capture sequence A (for example, at the 5' end of the primer A');
- the primer B' comprises a consensus sequence B, a 3' end overhanging complementary sequence, and an optional tag sequence B; in some embodiments, the 3' end overhanging complementary sequence is located at the 3' end of the primer B' ' end; in some embodiments, the consensus sequence B is located upstream of the complementary sequence of the 3' end overhang (for example, at the 5' end of the primer B'); wherein the 3' end overhang Refers to one or more non-template nucleotides contained in the 3' end of the cDNA chain generated by reverse transcription using the RNA captured by the capture sequence A of the primer A' as a template.
- each oligonucleotide probe comprises one copy.
- each oligonucleotide probe comprises multiple copies.
- the region where each of the oligonucleotide probes is coupled to the solid support is referred to as a microspot. It is easy to understand that when each oligonucleotide probe is one copy, each micro-dot is coupled with a probe, and the oligonucleotide probes of different micro-dots have different label sequences Y; when each oligonucleotide When the nucleotide probe contains multiple copies, each micro-dot is coupled with multiple probes, the oligonucleotide probes in the same micro-dot have the same label sequence Y, and the oligonucleotide probes in different micro-dots have Different label sequences Y.
- the solid support comprises a plurality of microdots, each microdot is coupled to an oligonucleotide probe, and each oligonucleotide probe may comprise one or more copies.
- the solid support comprises a plurality (eg, at least 10, at least 10 2 , at least 10 3 , at least 10 4 , at least 10 5 , at least 10 6 , at least 10 7 , at least 10 8 , or more) microdots; in certain embodiments, the solid support comprises at least 10 4 (eg, at least 10 4 , at least 10 5 , at least 10 6 , at least 10 7 , at least 10 8 , at least 10 9 , at least 10 10 , at least 10 11 , or at least 10 12 ) microdots/square millimeter.
- the solid support comprises at least 10 4 (eg, at least 10 4 , at least 10 5 , at least 10 6 , at least 10 7 , at least 10 8 , at least 10 9 , at least 10 10 , at least 10 11 , or at least 10 12 ) microdots/square millimeter.
- the interval between adjacent microdots is less than 100 ⁇ m, less than 50 ⁇ m, less than 10 ⁇ m, less than 5 ⁇ m, less than 1 ⁇ m, less than 0.5 ⁇ m, less than 0.1 ⁇ m, less than 0.05 ⁇ m, or less than 0.01 ⁇ m.
- the microdots have a size (e.g., equivalent diameter) of less than 100 ⁇ m, less than 50 ⁇ m, less than 10 ⁇ m, less than 5 ⁇ m, less than 1 ⁇ m, less than 0.5 ⁇ m, less than 0.1 ⁇ m, less than 0.05 ⁇ m, or less than 0.01 ⁇ m .
- the kit comprises: a nucleic acid array for labeling nucleic acids as described in (i), a primer set of primer A and primer B as described in (ii), and, (iii ) a first bridging oligonucleotide and a second bridging oligonucleotide; wherein the first bridging oligonucleotide and the second bridging oligonucleotide each independently comprise: a first region and a second region , and optionally a third region located between the first region and the second region, the first region being located upstream (e.g., the 5' end) of the second region; wherein,
- the first region of the first bridging oligonucleotide is capable of annealing to the first region of the second bridging oligonucleotide; the second region of the first bridging oligonucleotide is capable of annealing to the oligonucleotide Annealing to the consensus sequence X2 of the acid probe or a partial sequence thereof;
- the second region of the second bridging oligonucleotide can anneal to the complementary sequence of the consensus sequence B of the primer B or a partial sequence thereof.
- the capture sequence A of the primer A is a random oligonucleotide sequence.
- the capture sequence A of the primer A is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A further comprises a consensus sequence A and an optional tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A, and the consensus sequence A is located upstream of the primer A (eg, the 5' end).
- the primer B contains the consensus sequence B, the complementary sequence of the 3' end overhang, and the tag sequence B.
- the primer B comprises modified nucleotides (eg, locked nucleic acids).
- the 3' end of primer B comprises one or more modified nucleotides (eg, locked nucleic acids).
- the second region of the second bridging oligonucleotide is capable of annealing to the complementary sequence of the consensus sequence B of the primer B or a partial sequence thereof (for example, a 3' end partial sequence).
- the second region of the first bridging oligonucleotide is located at the 3' end of the first bridging oligonucleotide.
- the first region of the first bridging oligonucleotide is located at the 5' end of the first bridging oligonucleotide.
- the 5' end of the first bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the first bridging oligonucleotide contains a free -OH.
- the second bridging oligonucleotide is incapable of initiating an extension reaction (e.g., the 3' end is blocked), and/or, the oligonucleotide probe is incapable of initiating an extension reaction (e.g., 3' ends are blocked).
- the second region of the second bridging oligonucleotide is capable of annealing to the complementary sequence of the consensus sequence B of the primer B or a partial sequence thereof.
- the second region of the second bridging oligonucleotide is located at the 3' end of the second bridging oligonucleotide.
- the first region of the second bridging oligonucleotide is located at the 5' end of the second bridging oligonucleotide.
- the 5' end of the second bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the second bridging oligonucleotide contains a free -OH.
- the first bridging oligonucleotide is incapable of initiating an extension reaction (e.g., the 3' end is blocked).
- the kit comprises: a nucleic acid array for labeling nucleic acids as described in (i), and a primer set of primer A and primer B as described in (ii).
- the capture sequence A of the primer A is a random oligonucleotide sequence.
- the capture sequence A of the primer A is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A further comprises a consensus sequence A and an optional tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A, and the consensus sequence A is located upstream (eg, the 5' end) of the primer A.
- the primer B contains the consensus sequence B, the complementary sequence of the 3' end overhang, and the tag sequence B.
- the primer B comprises modified nucleotides (eg, locked nucleic acids).
- the 3' end of primer B comprises one or more modified nucleotides (eg, locked nucleic acids).
- the oligonucleotide probe is capable (e.g. 3' end contains a free -OH) or incapable of initiating an extension reaction (e.g. 3' end is blocked).
- the kit comprises: a nucleic acid array for labeling nucleic acids as described in (i), a primer set of primer A' and primer B' as described in (ii), and, (iii) a first bridging oligonucleotide and a second bridging oligonucleotide; wherein, the first bridging oligonucleotide and the second bridging oligonucleotide each independently comprise: a first region and a second bridging oligonucleotide Two regions, and optionally a third region located between the first region and the second region, the first region being located upstream (eg 5' end) of the second region; wherein,
- the first region of the first bridging oligonucleotide is capable of annealing to the first region of the second bridging oligonucleotide; the second region of the first bridging oligonucleotide is capable of annealing to the oligonucleotide Annealing to the consensus sequence X2 of the acid probe or a partial sequence thereof;
- the second region of the second bridging oligonucleotide can anneal to the complementary sequence of the consensus sequence A of the primer A' or a partial sequence thereof.
- the capture sequence A of the primer A' is a random oligonucleotide sequence.
- the capture sequence A of the primer A' is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A' further comprises a tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A', and the consensus sequence A is located upstream of the tag sequence A (for example, at the 5' end of the primer A').
- the primer B' comprises modified nucleotides (e.g., locked nucleic acids).
- the 3' end of primer B' comprises one or more modified nucleotides (e.g., locked nucleic acids).
- the kit further comprises a primer B" or a random primer, the primer B" can anneal to the complementary sequence of the consensus sequence B or a partial sequence thereof, and can initiate an extension reaction.
- the second region of the second bridging oligonucleotide is capable of annealing to the complementary sequence of the consensus sequence A of the primer A' or a partial sequence thereof (for example, a partial sequence at the 3' end).
- the second region of the first bridging oligonucleotide is located at the 3' end of the first bridging oligonucleotide.
- the first region of the first bridging oligonucleotide is located at the 5' end of the first bridging oligonucleotide.
- the 5' end of the first bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the first bridging oligonucleotide contains a free -OH.
- the second bridging oligonucleotide is incapable of initiating an extension reaction (e.g., the 3' end is blocked), and/or, the oligonucleotide probe is incapable of initiating an extension reaction (e.g., 3' ends are blocked).
- the second region of the second bridging oligonucleotide is capable of annealing to the complementary sequence of the consensus sequence A of the primer A' or a partial sequence thereof.
- the second region of the second bridging oligonucleotide is located at the 3' end of the second bridging oligonucleotide.
- the first region of the second bridging oligonucleotide is located at the 5' end of the second bridging oligonucleotide.
- the 5' end of the second bridging oligonucleotide contains a phosphorylation modification.
- the 3' end of the second bridging oligonucleotide contains a free -OH.
- the first bridging oligonucleotide is incapable of initiating an extension reaction (e.g., the 3' end is blocked).
- the kit comprises: a nucleic acid array for labeling nucleic acids as described in (i), and a primer set of primer A' and primer B' as described in (ii).
- the capture sequence A of the primer A' is a random oligonucleotide sequence.
- the capture sequence A of the primer A' is a poly(T) sequence or a specific sequence for a specific target nucleic acid.
- the primer A' further comprises a tag sequence A, such as a random oligonucleotide sequence.
- the capture sequence A is located at the 3' end of the primer A', and the consensus sequence A is located upstream of the tag sequence A (for example, at the 5' end of the primer A').
- the primer B' contains the consensus sequence B, the complementary sequence of the 3' end overhang, and the tag sequence B.
- the primer B' comprises modified nucleotides (e.g., locked nucleic acids).
- the 3' end of primer B' comprises one or more modified nucleotides (e.g., locked nucleic acids).
- the kit further comprises a primer B" or a random primer, the primer B" can anneal to the complementary sequence of the consensus sequence B or a partial sequence thereof, and can initiate an extension reaction.
- the oligonucleotide probe is capable (eg 3' end contains a free -OH) or incapable of initiating an extension reaction (eg 3' end is blocked).
- the kit has one or more features selected from:
- the oligonucleotide probes, primer A, primer A', primer B, primer B', primer B", random primers, the first bridging oligonucleotide, and the second bridging oligonucleotide are independent Contains or consists of naturally occurring nucleotides (such as deoxyribonucleotides or ribonucleotides), modified nucleotides, non-natural nucleotides, or any combination thereof;
- the oligonucleotide probes each independently have 15-300nt (such as 15-200nt, 15-20nt, 20-30nt, 30-40nt, 40-50nt, 50-100nt, 100-150nt, 150- 200nt) in length;
- primer A, primer A', primer B, primer B', primer B", random primers each independently have 4-200nt (such as 5-200nt, 15-230nt, 26-115nt, 10-130nt, 10-20nt, 20-50nt, 20-30nt, 30-40nt, 40-50nt, 50-100nt, 100-150nt, 150-200nt) length;
- the first bridging oligonucleotide and the second bridging oligonucleotide each independently have 6-200nt (such as 20-100nt, 20-70nt, 6-15nt, 15-20nt, 20-30nt , 30-40nt, 40-50nt, 50-100nt, 100-150nt, 150-200nt) length;
- the oligonucleotide probes coupled to the same solid support have the same consensus sequence X1 and/or the same consensus sequence X2;
- the consensus sequence X1 of the oligonucleotide probe comprises a cleavage site; Cut or fragmented by photoablation, chemical ablation, or CRISPR ablation.
- the kit further comprises reverse transcriptase, nucleic acid ligase, nucleic acid polymerase and/or transposase.
- the reverse transcriptase has terminal transfer activity.
- the reverse transcriptase is capable of synthesizing a cDNA strand using RNA (eg, mRNA) as a template, and adding the 3' end overhang at the 3' end of the cDNA strand.
- the reverse transcriptase is capable of adding at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7 nucleotides in length to the 3' end of the cDNA strand. , an overhang of at least 8, at least 9, at least 10 or more nucleotides.
- the reverse transcriptase is capable of adding an overhang of 2-5 cytosine nucleotides (eg, a CCC overhang) at the 3' end of the cDNA strand.
- the reverse transcriptase is selected from the group consisting of M-MLV reverse transcriptase, HIV-1 reverse transcriptase, AMV reverse transcriptase, telomerase reverse transcriptase, and transposases having the above transposase activity variants, modifications and derivatives.
- the nucleic acid polymerase has no 5' to 3' exonucleating activity or strand displacement activity.
- the nucleic acid polymerase has 5' to 3' exonucleation activity or strand displacement activity.
- the transposase is selected from Tn5 transposase, MuA transposase, Sleeping Beauty transposase, Mariner transposase, Tn7 transposase, Tn10 transposase, Ty1 transposase, Tn552 transposase, and variants, modified products and derivatives having the transposition activity of the above-mentioned transposases.
- the kit further comprises: the primer C, the primer D, the primer C' and/or the primer D'.
- the kit further comprises the primer C, the primer D and the primer D'.
- said kit further comprises said primer C, said primer D, said primer C' and said primer D'.
- the kit further comprises: reagents for nucleic acid hybridization, reagents for nucleic acid extension, reagents for nucleic acid amplification, reagents for recovering or purifying nucleic acids, reagents for A reagent for constructing a transcriptome sequencing library, a reagent for sequencing (such as next-generation sequencing or third-generation sequencing), or any combination thereof.
- the present application also provides the above-mentioned method for generating a labeled nucleic acid molecule population or the use of the above-mentioned kit for constructing a library of nucleic acid molecules or for performing transcriptome sequencing.
- DNB DNA nanoball, DNA nanoball
- RCA rolling circle amplification
- the RCA product is a multi-copy single-stranded DNA sequence, which can form a similar "spherical” structure due to the interaction force between the bases of the internal DNA sequence.
- the library molecules are circularized to form single-stranded circular DNA, and then the single-stranded circular DNA can be amplified by multiple orders of magnitude using rolling circle amplification technology, and the resulting amplification product is called DNB.
- a "population of nucleic acid molecules” refers to, for example, nucleic acid molecules derived directly or indirectly from target nucleic acid molecules (e.g., DNA double-stranded molecules, RNA/cDNA hybrid double-stranded molecules, DNA single-stranded molecules, or RNA single-stranded molecules) groups or collections.
- the population of nucleic acid molecules comprises a library of nucleic acid molecules comprising sequences qualitatively and/or quantitatively representative of target nucleic acid molecule sequences.
- the population of nucleic acid molecules comprises a subset of a library of nucleic acid molecules.
- a "library of nucleic acid molecules” refers to labeled nucleic acid molecules (e.g., labeled DNA double-stranded molecules, labeled RNA/cDNA hybrid double-stranded molecules, labeled DNA Single-stranded molecules, or labeled RNA single-stranded molecules) or a collection or population of fragments thereof, wherein the combination of labeled nucleic acid molecules or fragments thereof in the collection or population exhibits qualitative and/or quantitative representation of the resulting The sequence of the target nucleic acid molecule sequence of the labeled nucleic acid molecule.
- the library of nucleic acid molecules is a sequencing library.
- the library of nucleic acid molecules can be used to construct a sequencing library.
- cDNA or "cDNA strand” refers to a primer that anneals to an RNA molecule of interest, catalyzed by RNA-dependent DNA polymerase or reverse transcriptase, using at least a portion of the RNA molecule of interest as a template
- the "complementary DNA” synthesized by the extension of DNA (this process is also called “reverse transcription”).
- the synthesized cDNA molecule is "homologous” or “complementary” or “base paired” or “complexed” with at least a portion of the template.
- upstream is used to describe the relative positional relationship of two nucleic acid sequences (or two nucleic acid molecules), and has the meaning generally understood by those skilled in the art.
- the expression “one nucleic acid sequence is located upstream of another nucleic acid sequence” means that when arranged in the 5' to 3' direction, the former is located in a more forward position (i.e., closer to the 5' end) than the latter Location).
- downstream has the opposite meaning of "upstream”.
- Tag Sequence Y As used herein, "Tag Sequence Y”, “Tag Sequence A”, “Tag Sequence B”, “Consensus Sequence X1”, “Consensus Sequence X2”, “Consensus Sequence A”, “Consensus Sequence B”, etc.
- the joined nucleic acid molecule or a derivative product of the joined nucleic acid molecule provides means for identification, recognition, and/or molecular manipulation or biochemical manipulation (e.g., by providing A site for annealing an oligonucleotide, such as a primer for DNA polymerase extension or an oligonucleotide for a non-target nucleic acid component of a capture reaction or ligation reaction) glycosides.
- the oligonucleotides may consist of consecutive at least two (preferably about 6 to 100, but there is no firm limit to the length of the oligonucleotides, the exact size depends on many factors which in turn depend on the oligonucleotide
- the final function or use of acid) nucleotides can also be composed of multiple oligonucleotides in continuous or discontinuous arrangement.
- the oligonucleotide sequence may be unique for each nucleic acid molecule it ligates, or it may be unique for a certain class of nucleic acid molecules it ligates.
- the oligonucleotide sequence can be reversibly or irreversibly joined to the polynucleotide sequence to be "labeled” by any means including ligation, hybridization or other methods.
- the process of joining the oligonucleotide sequence to a nucleic acid molecule is sometimes referred to herein as "labeling" and a nucleic acid molecule undergoing labeling or containing a labeling sequence is referred to as a "labeled nucleic acid molecule" or "labeled nucleic acid molecule”. .
- Nucleic acids or polynucleotides of the present invention may include one or more modified nucleobases, sugar moieties or internucleoside linkages.
- nucleic acids or polynucleotides that contain modified bases, sugar moieties, or internucleoside linkages include, but are not limited to: (1) changes in Tm; (3) providing a moiety for attaching a label; (4) providing a label or a label quencher; or (5) providing a moiety for attaching another molecule in solution or bound to a surface, such as Biotin.
- oligonucleotides such as primers can be synthesized such that the random portion comprises one or more conformationally constrained nucleic acid analogs, such as but not limited to, wherein a ribose ring is attached 2' One or more ribonucleic acid analogues in which the -O atom is "locked" to the methylene bridge of the 4'-C atom; these modified nucleotides result in an increase in the Tm or melting temperature of each nucleotide monomer by approx. 2 degrees C to about 8 degrees C.
- conformationally constrained nucleic acid analogs such as but not limited to, wherein a ribose ring is attached 2'
- ribonucleic acid analogues in which the -O atom is "locked" to the methylene bridge of the 4'-C atom
- one indicator of the use of modified nucleotides in the method may be the nucleoside comprising the modification
- Acidic oligonucleotides can be digested by single-strand-specific RNases.
- a nucleic acid base in a single nucleotide at one or more positions in a polynucleotide or oligonucleotide may include guanine, adenine, uracil, thymine, or cytosine.
- one or more of the nucleic acid bases may comprise modified bases such as, but not limited to, xanthine, allyamino-uracil, allyamino-thymine Nucleosides, hypoxanthine, 2-aminoadenine, 5-propynyluracil, 5-propynylcytosine, 4-thiouracil, 6-thioguanine, nitrogen-uracil and deaza-uracil, thymidine, cytosine, adenine, or guanine.
- modified bases such as, but not limited to, xanthine, allyamino-uracil, allyamino-thymine Nucleosides, hypoxanthine, 2-aminoadenine, 5-propynyluracil, 5-propynylcytosine, 4-thiouracil, 6-thioguanine, nitrogen-uracil and deaza-uracil, thymidine, cytosine, adenine,
- nucleic acid bases may comprise nucleic acid bases derivatized with a biotin moiety, a digoxigenin moiety, a fluorescent or chemiluminescent moiety, a quencher moiety, or some other moiety.
- the invention is not limited to the listed nucleic acid bases; the list given shows examples of a wide range of bases that can be used in the methods of the invention.
- one or more of the sugar moieties may include 2'-deoxyribose, or alternatively, one or more of the sugar moieties may include some other sugar moiety, such as But not limited to: Ribose or 2'-fluoro-2'-deoxyribose or 2'-O-methyl-ribose that provide resistance to some nucleases, or can be passed with visible, fluorescent, infrared fluorescent 2'-amino 2'-deoxyribose or 2'-azido- 2'-deoxyribose.
- internucleoside linkages of nucleic acids or polynucleotides of the invention may be phosphodiester linkages, or alternatively, one or more of the internucleoside linkages may include modified linkages such as, but not limited to: Phosphate, phosphorodithioate, phosphoroselenate, or phosphorodiselenate linkages, which are resistant to some nucleases.
- terminal transfer activity refers to the ability to catalyze the template-independent addition (or “tailing") of one or more deoxyribonucleoside triphosphates (dNTPs) or a single dideoxyribonucleoside triphosphate to Activity of the 3' end of the cDNA.
- dNTPs deoxyribonucleoside triphosphates
- Examples of reverse transcriptases having terminal transfer activity include, but are not limited to, M-MLV reverse transcriptase, HIV-1 reverse transcriptase, AMV reverse transcriptase, telomerase reverse transcriptase, and reverse transcriptases having said reverse transcriptase Variants, modified products and derivatives with recording activity and terminal transfer activity. Described reverse transcriptase does not have or has RNase activity (particularly RNase H activity).
- the reverse transcriptase used to reverse transcribe RNA to generate cDNA does not have RNase activity (particularly RNase H activity).
- the reverse transcriptase used to reverse transcribe RNA to generate cDNA has terminal transfer activity and does not have RNase activity (particularly RNase H activity).
- nucleic acid polymerase with "strand displacement activity” means that, in the process of elongating a new nucleic acid strand, if it encounters a downstream nucleic acid strand complementary to the template strand, it can continue the extension reaction and replace the nucleic acid strand complementary to the template strand.
- nucleic acid polymerase having "5' to 3' exonuclease activity” refers to a nucleic acid polymerase capable of catalyzing the hydrolysis of 3, 5- Phosphodiester bond, nucleic acid polymerase that degrades nucleotides.
- a nucleic acid polymerase (or DNA polymerase) with "high fidelity” means that, during the process of amplifying nucleic acid, the probability of introducing a wrong nucleotide (i.e., the error rate) is lower than that of the wild-type Taq enzyme (for example, the nucleic acid polymerase (or DNA polymerase) of Taq enzyme whose sequence is shown in UniProt Accession: P19821.1).
- annealing As used herein, the terms “annealing”, “annealing”, “annealing”, “hybridizing” or “hybridizing” and the like refer to the presence of sufficient complementarity to form a complex via Watson-Crick base pairing. Complexes are formed between nucleotide sequences.
- nucleic acid sequences that are “complementary to” or “complementary” or “hybridize” or “anneal” to each other should be able to form or form sufficiently stable “hybrids" or “hybrids” that serve the intended purpose. "Complex".
- every nucleic acid base within the sequence represented by one nucleic acid molecule is capable of base pairing or pairing or complexing with every nucleic acid base within the sequence represented by a second nucleic acid molecule such that the two nucleic acid molecules or one of them
- Corresponding sequences shown are “complementary” or “anneal” or “hybridize” to each other.
- the terms “complementary” or “complementarity” are used when referring to a sequence of nucleotides related by the base pairing rules. For example, the sequence 5'-A-G-T-3' is complementary to the sequence 3'-T-C-A-5'.
- Complementarity can be "partial,” wherein only some of the nucleic acid bases match according to the base pairing rules. Alternatively, there may be “perfect” or “total” complementarity between nucleic acids. The degree of complementarity between nucleic acid strands has a significant effect on the efficiency and strength of hybridization between nucleic acid strands. This is particularly important in amplification reactions and detection methods that rely on hybridization of nucleic acids.
- the term “homology” refers to the degree of complementarity of one nucleic acid sequence to another nucleic acid sequence. There may be partial or complete homology (ie, complementarity).
- a partially complementary sequence is one that at least partially inhibits hybridization of a fully complementary sequence to a target nucleic acid and is referred to using the functional term "substantially homologous". Inhibition of hybridization of a perfectly complementary sequence to a target sequence can be examined under low stringency conditions using a hybridization assay (eg, Southern or Northern blot, solution hybridization, etc.). Substantially homologous sequences or probes will compete or inhibit binding (ie, hybridization) of a fully homologous sequence to a target under conditions of low stringency. This is not to say that low stringency conditions are conditions that allow non-specific binding; low stringency conditions require that the binding of two sequences to each other is a specific (ie selective) interaction.
- a hybridization assay eg, Southern or Northern blot, solution hybridization, etc.
- the absence of non-specific binding can be tested by using a second target that lacks complementarity or has only a low degree of complementarity (eg, less than about 30% complementarity). In cases of little or no specific binding, the probe will not hybridize to the nucleic acid target.
- substantially homologous when used in reference to a double-stranded nucleic acid sequence, such as a cDNA or genomic clone, means hybridizable to one or both strands of the double-stranded nucleic acid sequence under low stringency conditions as described herein any oligonucleotide or probe.
- the terms “anneal” or “hybridize” are used when referring to the pairing of complementary nucleic acid strands.
- Hybridization and the strength of hybridization are affected by a number of factors well known in the art, including the degree of complementarity between the nucleic acids, including the stringency of conditions affected by conditions such as salt concentration, the degree of hybridization formed The Tm (melting temperature) of the body, the presence of other components (eg, the presence or absence of polyethylene glycol or betaine), the molarity of the hybridized strands, and the G:C content of the nucleic acid strands.
- the solid support can spontaneously or upon exposure to one or more stimuli (e.g., temperature change, pH change, exposure to a particular chemical species or phase, exposure to light, reducing agent, etc.)
- the oligonucleotide probe is released. It will be appreciated that the oligonucleotide probe may be released by cleavage of the bond between the oligonucleotide probe and the solid support, or by degradation of the solid support itself. Oligonucleotide probes, or both, which allow or are accessible to other reagents.
- Addition of various types of labile bonds to the solid support can result in a solid support capable of responding to different stimuli.
- Each type of labile bond can be sensitive to relevant stimuli (eg, chemical stimuli, light, temperature, etc.), so that the release of substances attached to the solid support through each labile bond can be controlled by applying appropriate stimuli.
- labile bonds that can be coupled to solid supports include ester bonds (for example, cleavable with acids, bases, or hydroxylamine), ortho Diol bonds (e.g., cleavable by sodium periodate), Diels-Alder bonds (e.g., cleavable by heat), sulfone bonds (e.g., cleavable by bases), silane Ether bonds (e.g., cleavable by acids), glycosidic bonds (e.g., cleavable by amylases), peptide bonds (e.g., cleavable by proteases), or phosphodiester bonds (e.g., cleavable by nucleases (e.g., DNA Enzyme) cleavage)).
- ester bonds for example, cleavable with acids, bases, or hydroxylamine
- ortho Diol bonds e.g., cleavable by sodium periodate
- Diels-Alder bonds
- the solid support can be activated spontaneously or upon exposure to one or more stimuli (e.g., temperature). degradable, destructible or soluble upon exposure to a change in pH, change in pH, exposure to a particular chemical species or phase, exposure to light, reducing agents, etc.).
- a solid support can be soluble such that the material components of the solid support dissolve upon exposure to a particular chemical or environmental change (eg, a change in temperature or a change in pH).
- the solid support degrades or dissolves under elevated temperature and/or alkaline conditions.
- the solid support can be thermally degradable such that when the solid support is exposed to an appropriate temperature change (eg, heating), the solid support degrades. Degradation or dissolution of a solid support bound to a substance (eg, an oligonucleotide probe) can result in the release of the substance from the solid support.
- an appropriate temperature change eg, heating
- transposase and reverse transcriptase and “nucleic acid polymerase” refer to protein molecules or aggregates of protein molecules responsible for catalyzing specific chemical and biological reactions.
- the methods, compositions or kits of the invention are not limited to the use of a particular transposase, reverse transcriptase or nucleic acid polymerase from a particular source.
- the methods, compositions, or kits of the invention include any transposase, reverse transcriptase, or nucleic acid polymerase from any source that has equivalent enzymatic activity to the particular enzyme of the particular method, composition, or kit disclosed herein.
- the method of the present invention also includes the following embodiment: wherein any specific enzyme provided and used in the steps of the method is replaced by a combination of two or more enzymes, the two or more enzymes When used in combination, whether used separately in a stepwise fashion or together simultaneously, the reaction mixture produces the same results as those obtained with that one particular enzyme.
- the methods, buffers and reaction conditions provided herein, including those in the Examples, are presently preferred for embodiments of the methods, compositions and kits of the invention.
- other enzyme storage buffers, reaction buffers and reaction conditions using some of the enzymes of the invention are known in the art and may also be suitable for use in the invention and are included herein.
- the present application provides a new method for generating labeled nucleic acid molecule populations, and based on the method for constructing a nucleic acid molecule library and performing high-throughput sequencing, thereby realizing high-precision subcellular spatial positioning of samples.
- the method of the present application has one or more beneficial technical effects selected from the following:
- the probes of traditional nucleic acid arrays (such as chips) used for spatial transcriptome sequencing contain fixed capture sequences, usually specific capture sequences can only capture specific target nucleic acid molecules corresponding to them, for example, when the capture sequence is poly (T), corresponding to capturing target nucleic acid molecules containing poly(A). If the target nucleic acid molecule changes, the probe sequence containing the capture sequence needs to be modified accordingly, that is, the entire nucleic acid array (such as a chip) needs to be modified, which is costly and inefficient in practical applications.
- the nucleic acid array (such as a chip) of the present application does not contain a capture sequence, and the capture sequence exists in a reverse transcription primer independent of the nucleic acid array (that is, the capture sequence and the probe are independent of each other). Nucleotides effectuate attachment to the probe.
- the present application can design corresponding capture sequences for different target nucleic acid molecules without changing the probe sequence (that is, without changing the nucleic acid array (such as a chip)). Changes in acid enable capture of different target nucleic acid molecules.
- RNA permeabilization is generally performed first to release intracellular RNA. Excessive permeabilization will cause RNA to diffuse to adjacent cells Even the periphery of the tissue sample is captured by the probe, so that the in situ capture of mRNA cannot be achieved. If the permeabilization is not complete, the capture efficiency of mRNA will be affected.
- the nucleic acid array (such as a chip) does not contain a capture sequence (only spatial information), and the purpose of tissue permeabilization is to allow reverse transcription primers to enter cells and hybridize with mRNA in situ, without the need for intense permeabilization reagent treatment, thereby Can reduce the spread of the sample.
- Fig. 1 shows an exemplary structure of a chip used for capturing and labeling nucleic acid molecules in this application, which includes: a chip and oligonucleotide probes (also called chip sequences) coupled to the chip.
- oligonucleotide probes also called chip sequences
- Each oligonucleotide probe contains a label sequence Y corresponding to its position on the chip, and the coupling area between each oligonucleotide probe and the chip can be called a micro spot.
- Each oligonucleotide probe can be single or multiple copies.
- FIG. 2 shows an exemplary scheme for preparing a cDNA chain using RNA (such as mRNA) in a sample as a template, and an exemplary structure of the cDNA chain.
- RNA such as mRNA
- Figure 3 shows an exemplary scheme 1 for marking the 3' end of a cDNA strand with the complementary sequence of ChIP-seq to form a new nucleic acid molecule (ie, a ChIP-seq-labeled nucleic acid molecule) containing ChIP-seq information, and the ChIP-seq-containing Exemplary structures of novel nucleic acid molecules of sequence information.
- CA consensus sequence A
- CB consensus sequence B
- X1 consensus sequence X1
- Y tag sequence Y
- X2 consensus sequence X2
- P1 first region
- P2 second region.
- Figure 4 shows an exemplary scheme 2 for marking the 3' end of a cDNA strand with the complementary sequence of ChIP-seq to form a new nucleic acid molecule containing ChIP-seq information (that is, a nucleic acid molecule labeled with ChIP-seq), and the ChIP-seq-containing Exemplary structures of novel nucleic acid molecules of sequence information.
- CA consensus sequence A
- CB consensus sequence B
- X1 consensus sequence X1
- Y tag sequence Y
- X2 consensus sequence X2.
- FIG. 5 shows an exemplary scheme for preparing a complementary strand of a cDNA chain using RNA (such as mRNA) in a sample as a template, and an exemplary structure of the complementary strand of the cDNA strand.
- RNA such as mRNA
- Figure 6 shows an exemplary scheme 1 for marking the 3' end of the complementary strand of the cDNA strand with the complementary sequence of ChIP-seq to form a new nucleic acid molecule (that is, a nucleic acid molecule marked by ChIP-seq) containing ChIP-seq information, and, the Exemplary structures of novel nucleic acid molecules containing ChIP-seq information.
- CA consensus sequence A
- CB consensus sequence B
- X1 consensus sequence X1
- Y tag sequence Y
- X2 consensus sequence X2
- P1 first region
- P2 second region.
- Figure 7 shows an exemplary scheme 2 for marking the 3' end of the complementary strand of the cDNA strand with the complementary sequence of the ChIP-seq to form a new nucleic acid molecule (that is, a nucleic acid molecule marked by the ChIP-seq) containing the ChIP-seq information, and, the Exemplary structures of novel nucleic acid molecules containing ChIP-seq information.
- CA consensus sequence A
- CB consensus sequence B
- X1 consensus sequence X1
- Y tag sequence Y
- X2 consensus sequence X2.
- Fig. 8 shows the length distribution of cDNA amplification products prepared in Example 2.
- Fig. 9 shows the spatial expression map of the mouse brain slice obtained from the sequencing analysis in Example 3.
- DNBSEQ sequencing kit purchased from MGI, catalog number 1000019840 was used to prepare DNA nanoballs (DNB). Specific embodiments are briefly described below.
- reaction system 40 ⁇ L of the reaction system shown in Table 2 was configured.
- the reaction system was placed in a PCR instrument, and the reaction was carried out according to the following reaction conditions: 95°C for 3 minutes, 40°C for 3 minutes.
- After the reaction put the reaction product on ice, add 40 ⁇ L mixed enzyme I and 2 ⁇ L mixed enzyme II (from DNBSEQ sequencing kit), 1 ⁇ L ATP (100 mM stock solution, obtained from Thermo Fisher), and 0.1 ⁇ L T4ligase (obtained from NEB, Cat. No.: M0202S).
- the above reaction system was placed in a PCR instrument and reacted at 30° C. for 20 minutes to generate DNB.
- the DNB was loaded onto the BGISEQ SEQ 500 sequencing chip according to the method described in the BGISEQ 500 high-throughput sequencing reagent set (SE50) (purchased from MGI, catalog number: 1000012551).
- the sequencing chip add the MDA reagent in the BGISEQ 500 PE50 sequencing kit (purchased from MGI, 1000012554), incubate at 37°C for 30min, and then wash the chip with 5XSSC.
- Chip surface modified with N3-PEG3500-NHS (the modification reagent was purchased from sigma, product number: JKA5086). After incubation for 30 minutes, pump into the DBCO-modified chip sequence to synthesize primers (sequence shown in SEQ ID NO: 3), and overnight at room temperature Incubation.
- the DNB was sequenced according to the instructions of the BGISEQ-500 high-throughput sequencing reagent kit, and the read length of SE was set to 25bp.
- the above-mentioned DBCO-modified sequence is extended to obtain the chain grown after sequencing, and the chain is decoded to obtain the position sequence information corresponding to the DNB.
- the chain grown after sequencing continues to extend: on the basis of the above step 3, continue to carry out the cPAS reaction of 15 bases to obtain the chip sequence (SEQ ID NO: 8, which contains the consensus sequence X1 (SEQ ID NO: 4), Tag sequence Y, consensus sequence X2 (SEQ ID NO:5)).
- Chip dicing cut the prepared chip into several small pieces, adjust the size of the slice according to the needs of the experiment, soak the chip in 50mM Tris buffer with pH 8.0, and keep it at 4°C for use.
- Example 2 cDNA in situ synthesis and amplification
- the mouse tissue sections were made according to the standard method of frozen sections, and the frozen sections were pasted on the chip prepared in Example 1. After 30 min of frozen methanol fixation, the tissues were permeabilized with 0.5% Triton X-100. Use 5X SSC to wash the chip twice at room temperature, configure 200 ⁇ L of the reverse transcriptase reaction system shown in Table 3, add the reaction solution to the chip, fully cover it, and react at 42°C for 90min-180min. Reverse transcriptase will use mRNA as a template to synthesize cDNA with primers containing polyT (sequence shown in SEQ ID NO: 6, which contains consensus sequence A (CA) and polyT sequence), and add CCC overhang.
- SEQ ID NO: 6 which contains consensus sequence A (CA) and polyT sequence
- the reverse transcriptase will use the consensus sequence B and the UMI sequence as templates to continue to extend the cDNA chain, so that the 3' end of the cDNA will carry the complementary sequence of the consensus sequence B and the complementary sequence of the UMI sequence.
- the synthetic cDNA strand comprises the following sequence structure: the sequence of the reverse transcription primer (SEQ ID NO:6)-cDNA sequence-c(TSO) sequence (complementary sequence of SEQ ID NO:7).
- the chip was cleaned with 5X SSC.
- Prepare 200 ⁇ L of Bst polymerization reaction solution (purchased from NEB, M0275S) according to the instructions, pump it into the chip, and react at 65°C for 60 minutes to obtain double-stranded nucleic acid containing position information (namely, tag sequence Y(Y) or its complementary sequence c(Y)).
- Molecule one strand of which contains the following sequence structure: cDNA sequence - complementary sequence of TSO sequence - first bridging oligonucleotide sequence - complementary sequence of ChIP-seq partial sequence.
- cDNA concentration take 20ng cDNA (obtained in step 4 of Example 2), add 0.5 ⁇ M Tn5 interrupting enzyme and corresponding buffer (purchased from BGI, catalog number 10000028493; Tn5 interrupting enzyme coating method according to Stereomics library preparation kit- S1 operation), mix well to form a 20 ⁇ L reaction system, react at 55°C for 10 minutes, add 5 ⁇ L 0.1% SDS and mix at room temperature for 5 minutes to end the Tn5 interruption step.
- Tn5 interrupting enzyme and corresponding buffer purchased from BGI, catalog number 10000028493; Tn5 interrupting enzyme coating method according to Stereomics library preparation kit- S1 operation
- the reaction conditions are as follows: 95°C for 3 minutes, 40°C for 3 minutes; after the reaction is completed, put it on ice, add 40 ⁇ L of the mixed enzyme I required for DNB preparation in the DNBSEQ sequencing kit, and 2 ⁇ L of the mixed enzyme II, and 1 ⁇ L ATP, 0.1 ⁇ L T4Ligase, after mixing, put the above reaction system in a PCR instrument at 30°C, and react for 20 minutes to form DNB.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Analytical Chemistry (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Plant Pathology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims (63)
- 一种生成标记的核酸分子群的方法,其包括下述步骤:(1)提供:生物样本和核酸阵列;其中,所述核酸阵列包括固相支持物,所述固相支持物偶联有多种寡核苷酸探针;每种寡核苷酸探针包含至少一个拷贝;并且,所述寡核苷酸探针从5’到3’的方向上包含或者由:共有序列X1,标签序列Y和共有序列X2组成,其中,不同种寡核苷酸探针具有不同的标签序列Y,所述标签序列Y具有与该种寡核苷酸探针在固相支持物的位置相对应的独一无二的核苷酸序列;(2)将所述生物样本与所述核酸阵列接触,以使得所述生物样本中的RNA(例如,mRNA)的位置被对应至核酸阵列上所述寡核苷酸探针的位置;对所述生物样本中的RNA(例如,mRNA)进行预处理以生成第一核酸分子群,所述预处理包括以下步骤:(i)(a)用引物A对所述生物样本的RNA(例如,mRNA)进行逆转录,生成cDNA链,所述cDNA链包含以所述引物A为逆转录引物形成的与所述RNA(例如,mRNA)互补的cDNA序列,以及3’末端悬突;其中,所述引物A含有捕获序列A,所述捕获序列A能与待捕获的RNA(例如,mRNA)退火并起始延伸反应;和,(b)将引物B与(a)中生成的所述cDNA链进行退火,并进行延伸反应,生成第一延伸产物,所述第一延伸产物即为待标记的第一核酸分子,从而生成第一核酸分子群;其中,所述引物B包含共有序列B,3’末端悬突互补序列,以及任选的标签序列B;所述3’末端悬突互补序列位于所述引物B的3’末端;所述共有序列B位于所述3’末端悬突互补序列的上游(例如位于所述引物B的5’端);或,(ii)(a)用引物A’对所述生物样本的RNA(例如,mRNA)进行逆转录,生成cDNA链;所述cDNA链包含以所述引物A’为逆转录引物形成的与所述RNA(例如,mRNA)互补的cDNA序列,以及3’末端悬突;其中,所述引物A’含有共有序列A和捕获序列A,所述捕获序列A能与待捕获的RNA(例如,mRNA)退火并起始延伸反应;所述共有序列A位于所述捕获序列A的上游(例如位于所述引物A’的5’端);(b)将引物B’与(a)中生成的所述cDNA链进行退火,并进行延伸反应,生成第一延伸产物;其中,所述引物B’包含共有序列B,3’末端悬突互补序列,以及任选的标签序列B;所述3’末端悬突互补序列位于所述引物B’的3’末端;所述共有序列B位于所述3’末端悬突互补序列的上游(例如位于所述引物B’的5’端);和,(c)提供延伸引物,以第一延伸产物为模板进行延伸反应,生成第二延伸产物,所述第二延伸产物即为待标记的第一核酸分子,从而生成第一核酸分子群;(3)将前一步骤获得的第一核酸分子群通过包含选自下列的步骤生成第二核酸分子群:(i)向步骤(2)的产物实施退火条件,使得所述寡核苷酸探针与所述寡核苷酸探针对应位置的待标记的第一核酸分子退火(例如原位退火),并进行延伸反应,生成延伸产物,所述延伸产物即为具有位置标记的第二核酸分子,从而生成第二核酸分子群;其中,所述寡核苷酸探针的共有序列X2或其部分序列(a)能与步骤(2)(i)获得的第一延伸产物的所述共有序列B的互补序列或其部分序列退火,或者,(b)能与步骤(2)(ii)获得的第二延伸产物的所述共有序列A的互补序列或其部分序列退火;或,(ii)在允许退火的条件下,将桥接寡核苷酸对与所述寡核苷酸探针以及前一步骤获得的第一核酸分子群接触,使得所述桥接寡核苷酸对与所述寡核苷酸探针以及所述寡核苷酸探针对应位置的待标记的第一核酸分子退火(例如原位退火),其中,所述桥接寡核苷酸对由第一桥接寡核苷酸和第二桥接寡核苷酸组成,所述第一桥接寡核苷酸和所述第二桥接寡核苷酸各自独立地包括:第一区域和第二区域,以及任选的位于第一区域和第二区域之间的第三区域,所述第一区域位于所述第二区域的上游(例如5’端);其中,所述第一桥接寡核苷酸的第一区域能与所述第二桥接寡核苷酸的第一区域退火;所述第一桥接寡核苷酸的第二区域能与所述寡核苷酸探针的共有序列X2或其部分序列退火;所述第二桥接寡核苷酸的第二区域(a)能与步骤(2)(i)获得的第一延伸产物的所述共有序列B的互补序列或其部分序列退火,或者,(b)能与步骤(2)(ii)获得的第二延伸产物的所述共有序列A的互补序列或其部分序列退火;其中,将所述桥接寡核苷酸对与所述第一核酸分子群、所述寡核苷酸探针接触时,所述桥接寡核苷酸对的第一桥接寡核苷酸和第二桥接寡核苷酸各自以单链的形式存在,或者,所述桥接寡核苷酸对的第一桥接寡核苷酸和第二桥接寡核苷酸以彼此退火形成部分双链的形式存在;进行连接反应:将杂交于同一第一桥接寡核苷酸的第一区域和第二区域的核酸分子连接,和/或,将杂交于同一第二桥接寡核苷酸的第一区域和第二区域的核酸分子连接;并进行延伸反应;其中,所述连接反应与延伸反应以任意顺序进行;所获得的反应产物即为具有位置标记的第二核酸分子,从而生成所述第二核酸分子群。
- 权利要求1的方法,其中,步骤(3)(ii)中:(1)当所述第一桥接寡核苷酸的第一区域和第二区域相邻时,所述将杂交于同一第一桥接寡核苷酸的第一区域和第二区域的核酸分子连接的步骤包括:使用核酸连接酶将杂交于同一第一桥接寡核苷酸的第一区域和第二区域的核酸分子连接;或者,当所述第一桥接寡核苷酸包括第一区域、第二区域以及位于两者之间的第三区域时,所述将杂交于同一第一桥接寡核苷酸的第一区域和第二区域的核酸分子连接的步骤包括:使用核酸聚合酶(例如,无5’至3’端外切酶活性或链置换活性)以所述第三区域为模板进行聚合反应,使用核酸连接酶将杂交于同一第一桥接寡核苷酸的第一区域、第三区域和第二区域的核酸分子连接;和/或(2)当所述第二桥接寡核苷酸的第一区域和第二区域相邻时,所述将杂交于同一第二桥接寡核苷酸的第一区域和第二区域的核酸分子连接的步骤包括:使用核酸连接酶将杂交于同一第二桥接寡核苷酸的第一区域和第二区域的核酸分子连接;或者,当所述第二桥接寡核苷酸包括第一区域、第二区域以及位于两者之间的第三区域时,所述将杂交于同一第二桥接寡核苷酸的第一区域和第二区域的核酸分子连接的步骤包括:使用核酸聚合酶(例如,无5’至3’端外切酶活性或链置换活性)以所述第三区域为模板进行聚合反应,使用核酸连接酶将杂交于同一第二桥接寡核苷酸的第一区域、第三区域和第二区域的核酸分子连接。
- 权利要求1或2的方法,其包括步骤(1)、步骤(2)(i)和步骤(3);其中,步骤(2)(i)(b)中,所述引物B含有共有序列B,3’末端悬突互补序列,以及标签序列B;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第二核酸分子具有不同的所述标签序列B作为UMI。
- 权利要求3的方法,其包括步骤(1)、步骤(2)(i)和步骤(3)(i);其中,所述共有序列X2或其部分序列能与所述共有序列B的互补序列或其部分序列退火;步骤(3)(i)中获得的延伸产物即为标记的核酸分子,其包含:含有所述待标记的第一核酸分子序列的第一链,和/或,含有所述寡核苷酸探针序列的第二链。
- 权利要求4的方法,其中,所述共有序列X2或其部分序列能与所述共有序列B的互补序列或其部分序列退火,并且步骤(2)(i)中的第一延伸产物的所述共有序列B的互补序列具有3’自由端;其中,步骤(3)(i)中获得的延伸产物即为标记的核酸分子,其包含所述第一链;优选地,步骤(3)(i)中,所述寡核苷酸探针不能起始延伸反应(例如3’端是封闭的)。
- 权利要求5的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为随机寡核苷酸序列。
- 权利要求5的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;优选地,所述引物A还含有共有序列A,以及任选的标签序列A,例如为随机寡核苷酸序列。
- 权利要求4的方法,其中,所述共有序列X2或其部分序列能与所述共有序列B的互补序列或其部分序列退火,并且所述寡核苷酸探针的所述共有序列X2具有3’自由端;其中,步骤(3)(i)中获得的延伸产物即为标记的核酸分子,其包含所述第二链;优选地,步骤(2)(i)获得的第一延伸产物不能起始延伸反应(例如3’端是封闭的)。
- 权利要求8的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为随机寡核苷酸序列。
- 权利要求8的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;优选地,所述引物A还含有共有序列A,以及任选的标签序列A,例如为随机寡核苷酸序列。
- 权利要求3的方法,其包括步骤(1)、步骤(2)(i)和步骤(3)(ii);其中,所述第二桥接寡核苷酸的第二区域能与步骤(2)(i)获得的第一延伸产物的所述共有序列B的互补序列或其部分序列退火;步骤(3)(ii)中获得的反应产物即为标记的核酸分子,其包含:含有所述待标记的第一核酸分子序列的第一链,和/或,含有所述寡核苷酸探针序列的第二链。
- 权利要求11的方法,其中,所述第二桥接寡核苷酸的第二区域能与步骤(2)(i)获得的第一延伸产物的所述共有序列B的互补序列或其部分序列退火,并且所述第一桥接寡核苷酸的第二区域具有3’自由端;其中,步骤(3)(ii)中获得的反应产物即为标记的核酸分子,其包含所述第一链;优选地,所述第一桥接寡核苷酸具备以下特征的一项或多项:i)所述第一桥接寡核苷酸的第二区域位于所述第一桥接寡核苷酸的3’末端;ii)所述第一桥接寡核苷酸的第一区域位于所述第一桥接寡核苷酸的5’末端;iii)所述第一桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第一桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第二桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的),和/或,所述寡核苷酸探针不能起始延伸反应(例如3’端是封闭的)。
- 权利要求12的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为随机寡核苷酸序列。
- 权利要求12的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;优选地,所述引物A还含有共有序列A,以及任选的标签序列A,例如为随机寡核苷酸序列。
- 权利要求11的方法,其中,所述第二桥接寡核苷酸的第二区域能与步骤(2)(i)获得的第一延伸产物的所述共有序列B互补序列或其部分序列退火,并且所述第二桥接寡核苷酸的第二区域具有3’自由端;其中,步骤(3)(ii)中获得的反应产物即为标记的核酸分子,其包含所述第二链;优选地,所述第二桥接寡核苷酸具备以下特征的一项或多项:i)所述第二桥接寡核苷酸的第二区域位于所述第二桥接寡核苷酸的3’末端;ii)所述第二桥接寡核苷酸的第一区域位于所述第二桥接寡核苷酸的5’末端;iii)所述第二桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第二桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第一桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的),和/或,步骤(2)(i)获得的第一延伸产物不能起始延伸反应(例如3’端是封闭的)。
- 权利要求15的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为随机寡核苷酸序列。
- 权利要求15的方法,其中,步骤(2)(i)(a)中,所述引物A的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;优选地,所述引物A还含有共有序列A,以及任选的标签序列A,例如为随机寡核苷酸序列。
- 权利要求1或2的方法,其包括步骤(1)、步骤(2)(ii)和步骤(3);其中,步骤(2)(ii)(b)中,所述第一延伸产物从5’端至3’端包含:所述共有序列A,以所述引物A’为逆转录引物形成的与所述RNA互补的cDNA序列,所述3’末端悬突序列,任选的所述标签序列B的互补序列,所述共有序列B的互补序列;优选地,步骤(2)(ii)(c)中,所述延伸引物为所述引物B’或引物B”或随机引物,其中,所述引物B”能与所述共有序列B的互补序列或其部分序列退火,并且能起始延伸反应。
- 权利要求18的方法,其包括步骤(1)、步骤(2)(ii)和步骤(3)(i);其中,所述共有序列X2或其部分序列能与所述共有序列A的互补序列或其部分序列退火;步骤(3)(i)中获得的延伸产物即为标记的核酸分子,其包含:含有所述待标记的第一核酸分子序列的第一链,和/或,含有所述寡核苷酸探针序列的第二链。
- 权利要求19的方法,其中,所述共有序列X2或其部分序列能与所述共有序列A的互补序列或其部分序列退火;步骤(3)(i)中获得的延伸产物即为标记的核酸分子,其包含含有所述待标记的第一核酸分子序列的第一链;优选地,步骤(3)(i)中,所述寡核苷酸探针不能起始延伸反应(例如3’端是封闭的)。
- 权利要求20的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第一链具有不同的捕获序列A的互补序列作为UMI。
- 权利要求20的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;其中,所述引物A’还含有标签序列A,例如为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第一链具有不同的标签序列A的互补序列作为UMI。
- 权利要求19的方法,其中,所述共有序列X2或其部分序列能与所述共有序列A的互补序列或其部分序列退火;步骤(3)(i)中获得的延伸产物即为标记的核酸分子,其包含含有所述寡核苷酸探针序列的第二链;优选地,步骤(2)(ii)获得的第二延伸产物不能起始延伸反应(例如3’端是封闭的)。
- 权利要求23的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第二链具有不同的捕获序列A作为UMI。
- 权利要求23的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;其中,所述引物A’还含有标签序列A,例如为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第二链具有不同的标签序列A作为UMI。
- 权利要求18的方法,其包括步骤(1)、步骤(2)(ii)和步骤(3)(ii);其中,所述第二桥接寡核苷酸的第二区域能与步骤(2)(ii)获得的第二延伸产物的共有序列A的互补序列或其部分序列退火;步骤(3)(ii)中获得的反应产物即为标记的核酸分子,其包含:含有所述待标记的第一核酸分子序列的第一链,和/或,含有所述寡核苷酸探针序列的第二链。
- 权利要求20的方法,其中,所述第二桥接寡核苷酸的第二区域能与步骤(2)(ii)获得的第二延伸产物的所述共有序列A的互补序列或其部分序列退火,并且所述第一桥接寡核苷酸的第二区域具有3’自由端;其中,步骤(3)(ii)中获得的反应产物即为标记的核酸分子,其包含所述第一链;优选地,所述第一桥接寡核苷酸具备以下特征的一项或多项:i)所述第一桥接寡核苷酸的第二区域位于所述第一桥接寡核苷酸的3’末端;ii)所述第一桥接寡核苷酸的第一区域位于所述第一桥接寡核苷酸的5’末端;iii)所述第一桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第一桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第二桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的),和/或,所述寡核苷酸探针不能起始延伸反应(例如3’端是封闭的)。
- 权利要求27的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第一链具有不同的捕获序列A的互补序列作为UMI。
- 权利要求27的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;其中,,所述引物A’还含有标签序列A,例如为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第一链具有不同的标签序列A的互补序列作为UMI。
- 权利要求26的方法,其中,所述第二桥接寡核苷酸的第二区域能与步骤(2)(ii)获得的第二延伸产物的所述共有序列A的互补序列或其部分序列退火,并且所述第二桥接寡核苷酸的第二区域具有3’自由端;其中,步骤(3)(ii)中获得的反应产物即为标记的核酸分子,其包含所述第二链;优选地,所述第二桥接寡核苷酸具备以下特征的一项或多项:i)所述第二桥接寡核苷酸的第二区域位于所述第二桥接寡核苷酸的3’末端;ii)所述第二桥接寡核苷酸的第一区域位于所述第二桥接寡核苷酸的5’末端;iii)所述第二桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第二桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第一桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的),和/或,步骤(2)(ii)获得的第二延伸产物不能起始延伸反应(例如3’端是封闭的)。
- 权利要求30的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第二链具有不同的捕获序列A作为UMI。
- 权利要求30的方法,其中,步骤(2)(ii)(a)中,所述引物A’的捕获序列A为poly(T)序列或针对特定靶核酸的特异性序列;其中,所述引物A’还含有标签序列A,例如为随机寡核苷酸序列;优选地,步骤(3)中,源自同一种寡核苷酸探针的每个拷贝的所述第二链具有不同的标签序列A作为UMI。
- 权利要求1-17任一项的方法,其中,在步骤(2)(i)(b)中,所述cDNA链通过其3’末端悬突与所述引物B退火,并且,在核酸聚合酶(例如,DNA聚合酶或逆转录酶)的作用下,所述cDNA链以所述引物B为模板被延伸,生成所述第一延伸产物。
- 权利要求1-2、18-32任一项的方法,其中,在步骤(2)(ii)(b)中,所述cDNA链通过其3’末端悬突与所述引物B’退火,并且,在核酸聚合酶(例如,DNA聚合酶或逆转录酶)的作用下,所述cDNA链以所述引物B’为模板被延伸,生成所述第一延伸产物。
- 权利要求1-34任一项的方法,其中,所述3’末端悬突具有至少1个,至少2个,至少3个,至少4个,至少5个,至少6个,至少7个,至少8个,至少9个,至少10个或更多个核苷 酸的长度。
- 权利要求1-35任一项的方法,其中,步骤(2)中,在进行所述预处理之前,对所述生物样本进行透化处理。
- 权利要求1-36任一项的方法,其中,所述生物样本是组织样品;优选地,所述组织样品是组织切片。
- 权利要求1-37任一项的方法,其中,步骤(2)中所述进行逆转录包括使用逆转录酶;优选地,所述逆转录酶具有末端转移活性;优选地,所述逆转录酶能够以RNA(例如,mRNA)为模板,合成cDNA链,且在所述cDNA链的3’端添加悬突。
- 权利要求1-38任一项所述的方法,其中,步骤(2)和(3)具有选自以下的一项或多项特征:(1)所述引物A,引物A’,引物B,引物B’,第一桥接寡核苷酸,第二桥接寡核苷酸各自独立地包含或者由天然存在的核苷酸(例如脱氧核糖核苷酸或核糖核苷酸),经修饰的核苷酸,非天然的核苷酸,或其任何组合组成;优选地,所述引物A,引物A’能够起始延伸反应;(2)所述引物B包含修饰的核苷酸(例如锁核酸);优选地,所述引物B的3’末端包含一个或多个修饰的核苷酸(例如锁核酸);(3)所述引物B’包含修饰的核苷酸(例如锁核酸);优选地,所述引物B’的3’末端包含一个或多个修饰的核苷酸(例如锁核酸);(4)所述标签序列A,标签序列B各自独立地具有5-200(例如5-30nt,6-15nt)的长度;(5)所述共有序列A,共有序列B各自独立地具有10-200nt(例如10-100nt,20-100nt,25-100nt,5-10nt,10-15nt,15-20nt,20-50nt,20-30nt,30-40nt,40-50nt,50-100nt)的长度;(6)所述引物A,引物A’,引物B,引物B’各自独立地具有4-200nt(例如5-200nt,15-230nt,26-115nt,10-130nt,10-20nt,20-50nt,20-30nt,30-40nt,40-50nt,50-100nt,100-150nt,150-200nt)的长度;(7)所述第一桥接寡核苷酸的第一区域,第二区域各自独立地具有3-100nt(例如20-100nt,3-10nt,10-15nt,15-20nt,20-70nt,20-30nt,30-40nt,40-50nt,50-100nt)的长度;(8)所述第二桥接寡核苷酸的第一区域,第二区域各自独立地具有3-100nt(例如20-100nt,3-10nt,10-15nt,15-20nt,20-70nt,20-30nt,30-40nt,40-50nt,50-100nt)的长度;(9)所述第一桥接寡核苷酸的第三区域,所述第二桥接寡核苷酸的第三区域各自独立地具有0-50nt(例如0nt,0-10nt,10-15nt,15-20nt,20-30nt,30-40nt,40-50nt)的长度;(10)所述第一桥接寡核苷酸、第二桥接寡核苷酸各自独立地具有6-200nt(例如20-100nt,20-70nt,6-15nt,15-20nt,20-30nt,30-40nt,40-50nt,50-100nt,100-150nt,150-200nt)的长度;(11)所述poly(T)序列包括至少5个,或至少20个(例如6-100个,10-50个)脱氧胸腺嘧啶核苷残基;(12)所述随机寡核苷酸序列具有5-200(例如5nt,5-30nt,6-15nt)的长度。
- 权利要求1-39任一项所述的方法,其中,所述方法还包括:(4)回收和纯化所述第二核酸分子群。
- 权利要求1-40任一项所述的方法,其中,所获得的第二核酸分子群和/或其互补物用于构建转录组文库或用于转录组测序。
- 权利要求1-41任一项的方法,其中,步骤(1)中所述寡核苷酸探针具有选自下列的一个或多个特征:(1)所述共有序列X1,标签序列Y和共有序列X2各自独立地包含或者由天然存在的核苷酸(例如脱氧核糖核苷酸或核糖核苷酸),经修饰的核苷酸,非天然的核苷酸(例如肽核酸(PNA)或锁核酸),或其任何组合组成;(2)所述共有序列X1,标签序列Y和共有序列X2各自独立地具有2-200nt(例如10-200nt,25-100nt,10-30nt,10-100nt,5-10nt,10-15nt,15-20nt,20-30nt,30-40nt,40-50nt,50-100nt)的长度。
- 权利要求1-43任一项的方法,其中,步骤(1)所述核酸阵列具有选自下列的一个或多个特征:(1)偶联在同一固相支持物上的所述寡核苷酸探针具有相同的共有序列X1和/或相同的共有序列X2;(2)所述寡核苷酸探针的共有序列X1包含切割位点;优选地,所述切割位点可以通过选自切刻酶(nickingenzyme)酶切、USER酶切、光切除、化学切除或CRISPR切除的方式而被切割或断裂。
- 权利要求1-44任一项的方法,其中,步骤(1)所述核酸阵列由包含以下的步骤来提供:(1)提供多种载体序列,每种载体序列包含至少一个拷贝的载体序列,所述载体序列从5’到3’的方向上包含:共有序列X2的互补序列,标签序列Y的互补序列以及固定序列;其中,每种载体序列的标签序列Y的互补序列互不相同;(2)将所述多种载体序列连接于固相支持物(例如芯片)表面;(3)提供固定引物,并以所述载体序列为模板,进行引物延伸反应,生成延伸产物,所述延伸产物即为寡核苷酸探针;其中,所述固定引物包含共有序列X1的序列,并且,所述固定引物能与所述固定序列退火并起始延伸反应;优选地,所述延伸产物从5’到3’的方向上包含或者由:共有序列X1,标签序列Y和共有序列X2组成;(4)将所述固定引物与所述固相支持物表面连接;其中,步骤(3)与(4)以任意顺序进行;(5)任选地,所述载体序列的固定序列还包含切割位点,所述切割可以选自切刻酶(nicking enzyme)酶切、USER酶切、光切除、化学切除或CRISPR切除;对所述载体序列的固定序列所包含的切割位点进行切割,以消化所述载体序列,使得步骤(3)中的延伸产物与形成延伸产物的模板(即载体序列)分离,从而将所述寡核苷酸探针连接于固相支持物(例如芯片)表面;优选地,所述方法还包括通过高温变性使得步骤(3)中的延伸产物与形成延伸产物的模板(即载体序列)分离;优选地,每种载体序列是由多个拷贝的载体序列的多联体所形成的DNB;优选地,步骤(1)中通过以下步骤提供所述多种载体序列:(i)提供多种载体模板序列,所述载体模板序列包含所述载体序列的互补序列;(ii)以每种载体模板序列为模板,进行核酸扩增反应,以获得每种载体模板序列的扩增产物,所述扩增产物包含至少一个拷贝的载体序列;优选地,进行滚环复制,以获得由所述载体序列的多联体所形成的DNB。
- 权利要求1-45任一项的方法,其中,步骤(1)所述固相支持物具有选自下列的一个或多个特征:(1)所述固体支持物选自乳胶珠、葡聚糖珠、聚苯乙烯表面、聚丙烯表面、聚丙烯酰胺凝胶、金表面、玻璃表面、芯片、传感器、电极和硅晶片;优选地,所述固相支持物是芯片;(2)所述固体支持物为平面的、球形的或多孔的;(3)所述固相支持物能够用作测序平台,例如测序芯片;优选地,所述固相支持物是用于Illumina、MGI或Thermo Fisher测序平台的测序芯片;和(4)所述固相支持物能够自发地或在暴露于一种或多种刺激(例如,温度变化、pH变化、暴露于特定化学物质或相、暴露于光、还原剂等)时释放所述寡核苷酸探针。
- 一种构建核酸分子文库的方法,其包括,(a)根据权利要求1-46任一项的方法生成标记的核酸分子群;(b)将所述标记的核酸分子群中的核酸分子随机打断并添加接头;和(c)任选地,对步骤(b)的产物进行扩增和/或富集;从而获得核酸分子文库;优选地,所述核酸分子文库用于测序,例如转录组测序,例如单细胞转录组测序(例如5’端或3’端转录组测序)。
- 权利要求47的方法,其中,在进行步骤(b)之前,所述方法还包括步骤(pre-b):扩增和/或富集所述标记的核酸分子群;优选地,所述扩增反应使用至少引物C和/或引物D来进行,其中,所述引物C能够与所述共有序列X1的互补序列或其部分序列杂交或退火,并起始延伸反应;所述引物D能够与所述标记的核酸分子群中含有所述标签序列Y的核酸分子链杂交或退火,并起始延伸反应。
- 权利要求47或48所述的方法,其中,在步骤(b)中,用转座酶将前一步骤获得的核酸分子随机打断并在片段两端分别添加接头;优选地,在步骤(c)中,至少使用引物C’和/或引物D’对步骤(b)的产物进行扩增,其中, 片段两端的接头分别为第一接头和第二接头,所述引物C’能够与所述第一接头杂交或退火,并起始延伸反应,所述引物D’能够与所述第二接头杂交或退火,并起始延伸反应。
- 一种对样品中的细胞进行转录组测序的方法,其包括:(1)根据权利要求47-49任一项的方法构建核酸分子文库;和(2)对所述核酸分子文库进行测序。
- 试剂盒,其包含:(i)用于标记核酸的核酸阵列,其包括固相支持物,所述固相支持物偶联有多个寡核苷酸探针;每种寡核苷酸探针包含至少一个拷贝;并且,所述寡核苷酸探针从5’到3’的方向上包含或者由:共有序列X1,标签序列Y和共有序列X2组成,其中,不同种寡核苷酸探针具有不同的标签序列Y,所述标签序列Y具有与该种寡核苷酸探针在固相支持物的位置相对应的独一无二的核苷酸序列;(ii)包含引物A和引物B或者包含引物A’和引物B’的引物组,其中:所述引物A含有捕获序列A,所述捕获序列A能与待捕获的RNA(例如,mRNA)退火并起始延伸反应;所述引物B包含共有序列B,3’末端悬突互补序列,以及任选的标签序列B;其中,所述3’末端悬突互补序列位于所述引物B的3’末端,所述共有序列B位于所述3’末端悬突互补序列的上游(例如位于所述引物B的5’端);其中,所述3’末端悬突是指以所述引物A的捕获序列A所捕获的RNA为模板逆转录生成的cDNA链的3’末端所包含的一个或多个非模板核苷酸;所述引物A’含有共有序列A和捕获序列A;其中,所述捕获序列A位于所述引物A’的3’端,所述共有序列A位于所述捕获序列A的上游(例如位于所述引物A’的5’端);所述引物B’包含共有序列B,3’末端悬突互补序列,以及任选的标签序列B;其中,所述3’末端悬突互补序列位于所述引物B’的3’末端,所述共有序列B位于所述3’末端悬突互补序列的上游(例如位于所述引物B’的5’端);其中,所述3’末端悬突是指以所述引物A’的捕获序列A所捕获的RNA为模板逆转录生成的cDNA链的3’末端所包含的一个或多个非模板核苷酸。
- 权利要求51的试剂盒,其包含:如(i)中所述的用于标记核酸的核酸阵列,如(ii)中所述的引物A和引物B的引物组,以及,(iii)第一桥接寡核苷酸和第二桥接寡核苷酸;其中,所述第一桥接寡核苷酸和所述第二桥接寡核苷酸各自独立地包括:第一区域和第二区域,以及任选的位于第一区域和第二区域之间的第三区域,所述第一区域位于所述第二区域的上游(例如5’端);其中,所述第一桥接寡核苷酸的第一区域能与所述第二桥接寡核苷酸的第一区域退火;所述第一桥接寡核苷酸的第二区域能与所述寡核苷酸探针的共有序列X2或其部分序列退火;所述第二桥接寡核苷酸的第二区域能与所述引物B的共有序列B的互补序列或其部分序列退火;其中,所述引物A的捕获序列A是随机寡核苷酸序列;或者,所述引物A的捕获序列A是poly(T)序列或针对特定靶核酸的特异性序列,所述引物A优选地进一步包含共有序列A和任选的标签序列A,例如为随机寡核苷酸序列;其中,所述引物B含有所述共有序列B,3’末端悬突互补序列,以及标签序列B;优选地,所述引物B包含修饰的核苷酸(例如锁核酸);优选地,所述引物B的3’末端包含一个或多个修饰的核苷酸(例如锁核酸)。
- 权利要求52的试剂盒,其中,所述第二桥接寡核苷酸的第二区域能与所述引物B的共有序列B的互补序列或其部分序列退火;优选地,所述第一桥接寡核苷酸具备以下特征的一项或多项:i)所述第一桥接寡核苷酸的第二区域位于所述第一桥接寡核苷酸的3’末端;ii)所述第一桥接寡核苷酸的第一区域位于所述第一桥接寡核苷酸的5’末端;iii)所述第一桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第一桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第二桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的),和/或,所述寡核苷酸探针不能起始延伸反应(例如3’端是封闭的)。
- 权利要求52的试剂盒,其中,所述第二桥接寡核苷酸的第二区域能与所述引物B的共有序列B互补序列或其部分序列退火;优选地,所述第二桥接寡核苷酸具备以下特征的一项或多项:i)所述第二桥接寡核苷酸的第二区域位于所述第二桥接寡核苷酸的3’末端;ii)所述第二桥接寡核苷酸的第一区域位于所述第二桥接寡核苷酸的5’末端;iii)所述第二桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第二桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第一桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的)。
- 权利要求51的试剂盒,其包含:如(i)中所述的用于标记核酸的核酸阵列,以及,如(ii)中所述的引物A和引物B的引物组;其中,所述引物A的捕获序列A是随机寡核苷酸序列;或者,所述引物A的捕获序列A是poly(T)序列或针对特定靶核酸的特异性序列,所述引物A优选地进一步包含共有序列A和任选的标签序列A,例如为随机寡核苷酸序列;其中,所述引物B含有所述共有序列B,3’末端悬突互补序列,以及标签序列B;优选地,所述引物B包含修饰的核苷酸(例如锁核酸);优选地,所述引物B的3’末端包含一个或多个修饰的核苷酸(例如锁核酸)。
- 权利要求51的试剂盒,其包含:如(i)中所述的用于标记核酸的核酸阵列,如(ii)中所述的引物A’和引物B’的引物组,以及,(iii)第一桥接寡核苷酸和第二桥接寡核苷酸;其中,所述第一桥接寡核苷酸和所述第二桥接寡核苷酸各自独立地包括:第一区域和第二区域,以及任选的位于第一区域和第二区域之间的第三区域,所述第一区域位于所述第二区域的上游(例如5’端);其中,所述第一桥接寡核苷酸的第一区域能与所述第二桥接寡核苷酸的第一区域退火;所述第一桥接寡核苷酸的第二区域能与所述寡核苷酸探针的共有序列X2或其部分序列退火;所述第二桥接寡核苷酸的第二区域能与所述引物A’的共有序列A互补序列或其部分序列退火;其中,所述引物A’的捕获序列A是随机寡核苷酸序列;或者,所述引物A’的捕获序列A是poly(T)序列或针对特定靶核酸的特异性序列,所述引物A’进一步包含标签序列A,例如为随机寡 核苷酸序列;优选地,所述引物B’包含修饰的核苷酸(例如锁核酸);优选地,所述引物B’的3’末端包含一个或多个修饰的核苷酸(例如锁核酸);优选地,所述试剂盒进一步包含引物B”或随机引物,所述引物B”能与所述共有序列B的互补序列或其部分序列退火,并且能起始延伸反应。
- 权利要求56的试剂盒,其中,所述第二桥接寡核苷酸的第二区域能与所述引物A’的共有序列A的互补序列或其部分序列退火;优选地,所述第一桥接寡核苷酸具备以下特征的一项或多项:i)所述第一桥接寡核苷酸的第二区域位于所述第一桥接寡核苷酸的3’末端;ii)所述第一桥接寡核苷酸的第一区域位于所述第一桥接寡核苷酸的5’末端;iii)所述第一桥接寡核苷酸的5’末端含有磷酸化修饰;iv)所述第一桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第二桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的),和/或,所述寡核苷酸探针不能起始延伸反应(例如3’端是封闭的)。
- 权利要求56的试剂盒,其中,所述第二桥接寡核苷酸的第二区域能与所述引物A’的共有序列A的互补序列或其部分序列退火;优选地,所述第二桥接寡核苷酸具备以下特征的一项或多项:i)所述第二桥接寡核苷酸的第二区域位于所述第二桥接寡核苷酸的3’末端;ii)所述第二桥接寡核苷酸的第一区域位于所述第二桥接寡核苷酸的5’末端;iii)所述第二桥接寡核苷酸的5’末端含有磷酸化修饰;iii)所述第二桥接寡核苷酸的3’末端含有自由-OH;优选地,所述第一桥接寡核苷酸不能起始延伸反应(例如3’端是封闭的)。
- 权利要求51的试剂盒,其包含:如(i)中所述的用于标记核酸的核酸阵列,以及,如(ii)中所述的引物A’和引物B’的引物组;其中,所述引物A’的捕获序列A是随机寡核苷酸序列;或者,所述引物A’的捕获序列A是poly(T)序列或针对特定靶核酸的特异性序列,所述引物A’进一步包含标签序列A,例如为随机寡核苷酸序列;其中,所述引物B’含有所述共有序列B,3’末端悬突互补序列,以及标签序列B;优选地,所述引物B’包含修饰的核苷酸(例如锁核酸);优选地,所述引物B’的3’末端包含一个或多个修饰的核苷酸(例如锁核酸);优选地,所述试剂盒进一步包含引物B”或随机引物,所述引物B”能与所述共有序列B的互补序列或其部分序列退火,并且能起始延伸反应。
- 权利要求51-59任一项的试剂盒,其具有选自以下的一项或多项特征:(1)所述寡核苷酸探针,引物A,引物A’,引物B,引物B’,引物B”,随机引物,第一桥接寡核苷酸,第二桥接寡核苷酸各自独立地包含或者由天然存在的核苷酸(例如脱氧核糖核苷酸或核糖核苷酸),经修饰的核苷酸,非天然的核苷酸,或其任何组合组成;(2)所述寡核苷酸探针各自独立地具有15-300nt(例如15-200nt,15-20nt,20-30nt,30-40nt,40-50nt,50-100nt,100-150nt,150-200nt)的长度;(3)所述引物A,引物A’,引物B,引物B’,引物B”,随机引物各自独立地具有4-200nt(例如5-200nt,15-230nt,26-115nt,10-130nt,10-20nt,20-50nt,20-30nt,30-40nt,40-50nt,50-100nt,100-150nt,150-200nt)的长度;(4)所述第一桥接寡核苷酸和所述第二桥接寡核苷酸各自独立地具有6-200nt(例如20-100nt,20-70nt,6-15nt,15-20nt,20-30nt,30-40nt,40-50nt,50-100nt,100-150nt,150-200nt)的长度;(5)偶联在同一固相支持物上的所述寡核苷酸探针具有相同的共有序列X1和/或相同的共有序列X2;(6)所述寡核苷酸探针的共有序列X1包含切割位点;优选地,所述切割位点可以通过选自切刻酶(nicking enzyme)酶切、USER酶切、光切除、化学切除或CRISPR切除的方式而被切割或断裂。
- 权利要求51-60任一项的试剂盒,其进一步包含逆转录酶,核酸连接酶,核酸聚合酶和/或转座酶;优选地,所述逆转录酶具有末端转移活性;优选地,所述逆转录酶能够以RNA(例如,mRNA)为模板,合成cDNA链,且在所述cDNA链的3’端添加所述3’末端悬突。
- 权利要求51-61任一项的试剂盒,其进一步包含:用于进行核酸杂交的试剂、用于进行核酸延伸的试剂、用于进行核酸扩增的试剂、用于回收或纯化核酸的试剂、用于构建转录组测序文库的试剂、用于测序(例如二代测序或三代测序)的试剂、或其任何组合。
- 权利要求1-46任一项的方法或权利要求51-62任一项的试剂盒用于构建核酸分子文库或用于进行转录组测序的用途。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202280085230.5A CN118434882A (zh) | 2021-12-24 | 2022-11-30 | 一种生成标记的核酸分子群的方法及其试剂盒 |
| US18/722,541 US20250163492A1 (en) | 2021-12-24 | 2022-11-30 | Method for generating population of labeled nucleic acid molecules and kit for the method |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111602443.4 | 2021-12-24 | ||
| CN202111602443 | 2021-12-24 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023116373A1 true WO2023116373A1 (zh) | 2023-06-29 |
Family
ID=86901199
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2022/135363 Ceased WO2023116373A1 (zh) | 2021-12-24 | 2022-11-30 | 一种生成标记的核酸分子群的方法及其试剂盒 |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20250163492A1 (zh) |
| CN (1) | CN118434882A (zh) |
| WO (1) | WO2023116373A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025189364A1 (zh) * | 2024-03-13 | 2025-09-18 | 深圳华大智造科技股份有限公司 | 添加接头的方法及其在制备高通量测序文库中的应用 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103781918A (zh) * | 2011-04-13 | 2014-05-07 | 空间转录公司 | 用于组织样本中核酸的局部或空间检测的方法和产品 |
| CN105505755A (zh) * | 2015-12-23 | 2016-04-20 | 杭州谷禾信息技术有限公司 | 空间转录组建库测序方法及所用装置 |
| CN107636163A (zh) * | 2015-04-29 | 2018-01-26 | 加利福尼亚大学董事会 | 用于构建链特异性cDNA文库的组合物和方法 |
| CN108138225A (zh) * | 2015-07-27 | 2018-06-08 | 亿明达股份有限公司 | 核酸序列信息的空间定位 |
| US20190262831A1 (en) * | 2016-10-17 | 2019-08-29 | Lociomics Corporation | High resolution spatial genomic analysis of tissues and cell aggregates |
| WO2020228788A1 (zh) * | 2019-05-15 | 2020-11-19 | 深圳华大生命科学研究院 | 用于检测核酸空间信息的阵列及检测方法 |
-
2022
- 2022-11-30 WO PCT/CN2022/135363 patent/WO2023116373A1/zh not_active Ceased
- 2022-11-30 US US18/722,541 patent/US20250163492A1/en active Pending
- 2022-11-30 CN CN202280085230.5A patent/CN118434882A/zh active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103781918A (zh) * | 2011-04-13 | 2014-05-07 | 空间转录公司 | 用于组织样本中核酸的局部或空间检测的方法和产品 |
| CN107636163A (zh) * | 2015-04-29 | 2018-01-26 | 加利福尼亚大学董事会 | 用于构建链特异性cDNA文库的组合物和方法 |
| CN108138225A (zh) * | 2015-07-27 | 2018-06-08 | 亿明达股份有限公司 | 核酸序列信息的空间定位 |
| CN105505755A (zh) * | 2015-12-23 | 2016-04-20 | 杭州谷禾信息技术有限公司 | 空间转录组建库测序方法及所用装置 |
| US20190262831A1 (en) * | 2016-10-17 | 2019-08-29 | Lociomics Corporation | High resolution spatial genomic analysis of tissues and cell aggregates |
| WO2020228788A1 (zh) * | 2019-05-15 | 2020-11-19 | 深圳华大生命科学研究院 | 用于检测核酸空间信息的阵列及检测方法 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025189364A1 (zh) * | 2024-03-13 | 2025-09-18 | 深圳华大智造科技股份有限公司 | 添加接头的方法及其在制备高通量测序文库中的应用 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20250163492A1 (en) | 2025-05-22 |
| CN118434882A (zh) | 2024-08-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3790967B1 (en) | Single tube bead-based dna co-barcoding for accurate and cost-effective sequencing, haplotyping, and assembly | |
| US9745614B2 (en) | Reduced representation bisulfite sequencing with diversity adaptors | |
| EP3066114B1 (en) | Plurality of transposase adapters for dna manipulations | |
| CA2810931C (en) | Direct capture, amplification and sequencing of target dna using immobilized primers | |
| EP4090766B1 (en) | Methods of targeted sequencing | |
| US20120003657A1 (en) | Targeted sequencing library preparation by genomic dna circularization | |
| JP7641118B2 (ja) | プローブ及びこれをハイスループットシーケンシングに適用するターゲット領域の濃縮方法 | |
| CN114096678A (zh) | 多种核酸共标记支持物及其制作方法与应用 | |
| US20140336058A1 (en) | Method and kit for characterizing rna in a composition | |
| US20240271126A1 (en) | Oligo-modified nucleotide analogues for nucleic acid preparation | |
| WO2023116373A1 (zh) | 一种生成标记的核酸分子群的方法及其试剂盒 | |
| WO2023115536A1 (zh) | 一种生成标记的核酸分子群的方法及其试剂盒 | |
| JP7490071B2 (ja) | シーケンシングのための新規核酸鋳型構造 | |
| WO2023116376A1 (zh) | 单细胞核酸标记和分析方法 | |
| US20070148636A1 (en) | Method, compositions and kits for preparation of nucleic acids | |
| CN119913234A (zh) | 超高密度探针捕获的空间转录组芯片及其制备方法和应用 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22909688 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202280085230.5 Country of ref document: CN |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 22909688 Country of ref document: EP Kind code of ref document: A1 |
|
| WWP | Wipo information: published in national office |
Ref document number: 18722541 Country of ref document: US |