JP2002508664A

JP2002508664A - Method for detecting multiple single nucleotide polymorphisms in a single reaction

Info

Publication number: JP2002508664A
Application number: JP50498299A
Authority: JP
Inventors: マッキントッシュ，ティナ; ヘッド，スティーブン; ゴーレット，フィリップ; ボイス−ジャチノ，マイケル・ティ
Original assignee: オーキッド・バイオサイエンシーズ・インコーポレイテッド
Priority date: 1997-06-25
Filing date: 1998-06-24
Publication date: 2002-03-19
Also published as: WO1998059066A1; EP0994960A1; CA2294053A1; AU8162498A

Abstract

(57)【要約】植物または動物のゲノム中の複数の多型部位を同定するのに適した分子および方法。そのような部位の同定は、同一性、祖先、遺伝病への素因、所望の特質の存在または不存在などを決定するのに有用である。 (57) [Summary] Molecules and methods suitable for identifying multiple polymorphic sites in the genome of a plant or animal. Identification of such sites is useful for determining identity, ancestry, predisposition to a genetic disease, the presence or absence of a desired attribute, and the like.

Description

【発明の詳細な説明】発明の名称複数の単一ヌクレオチド多型を単一の反応で検出する方法発明の技術分野本発明は、組換えＤＮＡ技術の分野におけるものである。さらに詳しくは、本発明は、植物、動物または微生物のゲノムにおいて１またはそれ以上の単一ヌクレオチド多型を単一の反応にて同定し、かかる部位を同一性、祖先または遺伝的特質（traits）を分析するために用いるのに適した分子および方法に関する。関連出願の相互参照本願は、米国特許出願第０８／２１６,５３８号（１９９４年３月２３日出願）（該出願は、米国特許出願第０８／１４５,１４５号（１９９３年１１月３日出願）の一部継続出願である）の一部継続出願である。発明の背景動物、植物または微生物の遺伝子型を決定しうることは、法科学、医学および疫学および公衆衛生において、および動物の育種および出品（exhibition）において基本的に重要な事柄である。そのような決定しうることは、たとえば、感染性疾患の病因を同定するため、２人の個体が血縁であるか否かを決定するため、または生物のゲノム内に遺伝子をマッピングするために必要である。疾患を診断しうることとともに同一性および親であることの分析はまた、ヒト、動物および植物の遺伝的研究、とりわけ法的なまたは実父確定（paternity）評価において、および個体が遺伝病に罹るリスクを評価するうえで中心的な関心事でもある。そのような目標は、一人の個体のＤＮＡを他の個体のＤＮＡとは異なったものとするＤＮＡ配列の変異を分析することによって追跡されている。もしも、そのような変異が制限エンドヌクレアーゼ開裂により生成する断片の長さを変えるなら、そのような変異は制限断片長多型（「ＲＦＬＰ」）と称される。ＲＦＬＰはヒトおよび動物の遺伝子分析に広く用いられている（Grassberg ，J.、英国特許出願第２１３５７７４号；Skolnick，M.H.ら、Cytogen ．Cell Ge net ．32:58〜67(1982)；Bostein，D.ら、Ann ．J．Hum．Genet．32:314〜331（19 80）；Fischer，S．G.ら（ＰＣＴ出願ＷＯ９０／１３６６８号）；Uhlen， M.、ＰＣＴ出願ＷＯ９０／１１３６９号）。遺伝性の特質を特定のＲＦＬＰと関連付けることができれば、標的動物での該ＲＦＬＰの存在は該動物もまた該特質を示すであろう可能性の高いことを予測するのに用いることができる。複数のアレルに依存する複合的な特質をマッピングできるように、ＲＦＬＰの多遺伝子座（multilocus）分析を可能とする統計的方法が開発されている(Lander，S.ら、P roc ．Natl．Acad．Sci．(U.S.A) 83:7353〜7357(1986)；Lander，S.ら、Proc ．N atl．Acad．Sci.(U.S.A) 84:2363〜2367(1987);Donis-Keller，H.ら、Cell 51:3 19〜337(1987)；Lander，S.ら、Genetics 121:185〜199(1989)、すべて参照のため本明細書中に引用する）。そのような方法は、遺伝子地図を開発するため、並びに一層望ましい特質を有する植物または動物を開発するために用いることができる（Donis-Keller，H.ら、Cell 51:319〜337(1987);Lander，S.ら、Genetics 121 :185〜199(1989)）。ある場合には、ＤＮＡ配列の変異は、ヌクレオチドのタンデムなジヌクレオチドまたはトリヌクレオチド繰り返しモチーフを含む短タンデムリピート（shortt andem repeats;「ＳＴＲ」）を特徴とする領域に存在する。これらタンデムリピートはまた、「可変数タンデムリピート（variable number tandem repeat）（「ＶＮＴＲ」）とも称される。ＶＮＴＲは同定および実父確定分析に用いられてきており (Weber，J.L.、米国特許第５,０７５,２１７号；Armour，J.A.L.ら、FEBS Lett ．307:113〜115(1992)；Jones，L.ら、Eur ．J．Haematol．39:144〜14 7(1987)；Horn，G.T.ら、ＰＣＴ出願ＷＯ９１／１４００３号；Jeffreys，A.J. 、ヨーロッパ特許出願３７０,７１９号；Jeffreys，A.J.、米国特許第５,１７５ ,０８２号；Jeffreys，A.J.ら、Amer ．J．Hum．Genet．39:11〜24(1986)；Jeffr eys，A.J.ら、Nature 316:76〜79(1985)；Grey，I.C.ら、Proc ．R．Acad．Soc． Lond ．243:241〜253(1991)；Moore，S.S.ら、Genomics 10:654〜660(1991)；Jef freys，A.J.ら、Anim ．Genet．18:1〜15(1987)；Hillel，J.ら、Anim ．Genet．2 0 :145〜155(1989)；Hillel，J.ら、Genet．124:783〜789(1990))、現在では数多くの遺伝子マッピングの研究に用いられている。第三のクラスのＤＮＡ配列変異は、同じ種の個体間に存在する単一ヌクレオチド多型（「ＳＮＰ」）の結果生じるものである。そのような多型は、ＳＴＲやＶＮＴＲに比べてはるかに頻繁に起こる。幾つかの場合において、そのような多型は、遺伝病において決定的な特性である変異を含んでいる。実際、そのような変異は、疾患（すなわち、血友病、鎌形赤血球貧血など）を現実に引き起こすに充分な仕方でタンパク質コード遺伝子中の単一のヌクレオチドに影響を及ぼす。多くの場合、これらＳＮＰはゲノムの非コード領域に起こる。そのような多型の現代遺伝学における中心的な重要性にもかかわらず、個体からの１またはそれ以上の遺伝子座の分析を単一の反応フォーマットで可能とする実際的方法は開発されていない。本発明は、そのような改良法を提供するものである。実際、本発明は、複数の単一ヌクレオチド多型の変異を識別することにより同定および実父確定の遺伝子分析および疾患の診断を可能とする方法および遺伝子配列を提供する。発明の要約本発明は、あらゆる生命形態に存在する単一ヌクレオチド多型（ＳＮＰ）を含む分子に関する。本発明は、（i）１またはそれ以上の新規な単一ヌクレオチド多型を同定する方法、（ii）これらＳＮＰを異なる試料で繰返し分析および試験する方法および（iii）そのような部位の存在を動物、植物および微生物の遺伝子分析に活用する方法に関する。そのような部位の分析（遺伝子型決定）は、同一性、祖先、遺伝病への素因、所望の特質の存在または不在等の決定に有用である。詳細には、本発明は、いずれかの生物のゲノムＤＮＡセグメントの１またはそれ以上のヌクレオチド配列に相補的なポリヌクレオチド配列を有する１またはそれ以上のインテロゲーション（interrogation）核酸（または核酸アナログ）プライマー分子を提供するものであり、その際、該ゲノムセグメントは、哺乳動物の単一ヌクレオチド多型アレルの単一ヌクレオチド多型部位Ｘの直ぐ３'遠位に位置し、単一ヌクレオチド（またはヌクレオチドアナログ）による核酸（または核酸アナログ）プライマー分子の鋳型依存性の伸長は、該プライマー分子を単一ヌクレオチド（またはアナログ）により伸長させ、該単一ヌクレオチド（またはアナログ）は該単一ヌクレオチド多型アレルのヌクレオチドＸに相補的である。本発明は、プライマーの鋳型依存性の伸長を、ｄｄＡＴＰ、ｄｄＴＴＰ、ｄｄＣＴＰおよびｄｄＧＴＰ（または他の鎖停止塩基アナログ）よりなる群から選ばれた１またはそれ以上のジデオキシヌクレオチド三リン酸誘導体（またはアナログ）の存在下だが、ｄＡＴＰ、ｄＴＴＰ、ｄＣＴＰおよびｄＧＴＰの不在下で行う態様に関する。本発明はさらに、１またはそれ以上の単一ヌクレオチド多型を単一の反応において同定する方法であって、工程：（Ａ）１またはそれ以上の識別可能なインテロゲーションオリゴヌクレオチド（またはオリゴヌクレオチドアナログ）プライマーを１またはそれ以上の標的核酸分子にハイブリダイズさせ、その際、各オリゴヌクレオチドプライマーは、各プライマーの３'末端が興味のもたれる特定かつ独特の標的ヌクレオチドの直ぐ近位に位置するように、各標的核酸分子の特定かつ独特の領域に相補的であり；（Ｂ）各インテロゲーションオリゴヌクレオチド（またはアナログ）を鋳型依存性のポリメラーゼで伸長させ、その際、該伸長は、１またはそれ以上の非伸長性のヌクレオチド（またはヌクレオチドアナログ）の存在下で起こり；（Ｃ）使用した各インテロゲーションプライマーについて、そのようなプライマー中に導入された非伸長性のヌクレオチド（またはヌクレオチドアナログ）の同一性を決定することにより興味のもたれる各ヌクレオチド（またはアナログ）を同定し、その際、同定された該非伸長性のヌクレオチド（またはヌクレオチドアナログ）は該プライマーの標的ヌクレオチドに相補的である；ついで（Ｄ）適当なマトリックス上、または物理的または化学的分離の他の標準法または同定法により、該伸長したプライマーを分離するを含む。図面の簡単な説明図１は、ランダムゲノム断片をクローニングするための好ましい方法を示す。ゲノムＤＮＡをサイズ分画し、ついでランダムクローンを得るためにプラスミドベクター中に導入する。挿入されたゲノム配列の配列を決定するためにＰＣＲプライマーをデザインし、使用する。図２は、ランダムゲノム断片のサイクルシークエンシング（cycle sequencing ）である、新規な多型配列を同定するための好ましい方法により得られたデータを示す。図３は、多型配列のためのランダムクローンのスクリーニングのためのＲＦＬＰ法を示す。図４は、遺伝子マーカーの所定のパネルで２人の個体が同一の遺伝子型を有するであろう確率のグラフを示す。図５は、２０の遺伝子マーカーの所定のパネルが、母親が問題となっていない実父確定訴訟においてランダムな真偽の疑わしい父親を排除するであろう確率のグラフを示す。図６は、ＳＮＰの遺伝子型決定のための好ましい方法を示す。７つの工程は、生物学的試料から出発してＧＢＡを行う仕方を示している。好ましい態様の説明Ｉ．本発明の単一ヌクレオチド多型および遺伝子分析におけるその使用の利点Ａ．多型の属性本発明にとって興味のもたれる特定の遺伝子配列は、「単一ヌクレオチド多型」を含む。「多型」とは、ある種における幾つかの成員のＤＮＡ配列の変異である。動物および植物のゲノムは、その持続的な進化の過程で自然の突然変異を蒙る（Gusella，J.F.、Ann ．Rev．Biochem．55:831〜854(1986)）。そのような突然変異の大部分は多型を生み出す。変異した配列および元の配列は、その種の集団において共に存在する。幾つかの場合には、そのような共存は安定なまたは準安定な平衡にある。他の場合には、そのような変異はその種が生き残る上でまたは進化にとっての有利さをもたらすため、ついには（すなわち、進化的な時間において）、それがその種の全成員のゲノム中に組み込まれることになる。それゆえ、多型は、多型の存在のためにある種の幾つかの成員は変異していない配列（すなわち、元の「アレル」）を有する一方で他の成員は変異した配列（すなわち、変異した「アレル」）を有するという点で「アレリック（allelic）」と呼ばれる。最も単純な場合では、ただ一つの変異した配列が存在し、この多型はジアレリック（diallelic）と呼ばれる。別の変異が起こるとトリアレリック（triallelic）をもたらす、等。アレルは、該変異を含むヌクレオチドにより表される。本発明は、特定のクラスのアレル多型、および植物、動物、または微生物の遺伝子型を決定するためのその使用に関する。そのようなアレル多型は、本明細書において「単一ヌクレオチド多型」または「ＳＮＰ」と称する。「単一ヌクレオチド多型」は以下の属性により定義される。そのような多型の中心的な属性は、アレル配列間での変異の部位である多型部位「Ｘ」を含むことである。ＳＮＰの第二の特性は、その多型部位「Ｘ」がしばしばアレルの「非変異」配列に先行され後続されていることである。それゆえ、ＳＮＰの多型部位は、「５'近位」非変異配列の「直ぐ」３'側に位置し、「３'遠位」非変異配列の「直ぐ」５'側に位置するとされる。そのような配列は多型部位をフランキングしている。単一ヌクレオチド多型の「単一」なる語は、多型のヌクレオチドの数（すなわち、１つのヌクレオチド）をいう；これは標的ＤＮＡ中に存在する多型の数（１つから多数にわたる）とは関係ない。本明細書において、ある配列がその種の集団内で変化しないならばアレルの「非変異」配列といわれ、マッピングしている場合にはその種の集団の各成員のゲノム内の同じアレルの「対応」配列にマッピングされる。２またはそれ以上のＳＮＰが互いに極めて近接して位置してよいことに注意すべきである。２つの配列は、それらが互いに異なる採取源から得られたアナログである場合に「対応」配列といわれる。２人のヒトにおいてヘモグロビンをコードする遺伝子配列が「対応」アレル配列の例示である。本明細書における「対応アレル」の定義は、この語の意味を明確にするためであって、当業者に理解されている語の意味を変えることを意図するものではない。表１の各列は、「対応」ウマアレルの多型部位のヌクレオチドの同定並びに非変異５'近位配列および３'遠位配列（これらも該ＳＮＰの属性である）を示す。ヒトアレルに関する「対応アレル」は表２に示してある。表２の各列は、「対応」ヒトアレルの多型部位のヌクレオチドの同定並びに非変異５'近位配列および３'遠位配列（これらも該ＳＮＰの属性である）を示す。ゲノムＤＮＡは二本鎖であるから、各ＳＮＰはプラス鎖かまたはマイナス鎖のいずれかで定義できる。それゆえ、各ＳＮＰについて、一方の鎖は直ぐ５'の近位非変異配列を含み、他方の鎖は直ぐ３'の遠位非変異配列を含むであろう。各ＳＮＰ多型部位「Ｘ」が単一のヌクレオチドである好ましい態様において、ＳＮＰの二本鎖ＤＮＡの各鎖は直ぐ５'の近位非変異配列および直ぐ３'の遠位非変異配列の両者を含むであろう。本発明の好ましいＳＮＰはＳＮＰ多型での１つのヌクレオチドの他のヌクレオチドによる置換に関するものであるが、ＳＮＰはまたさらに複雑なものであってもよく、２つの対応配列の１つからのヌクレオチドの欠失または２つの対応配列の１つへの挿入を含んでいてよい。たとえば、ある動物において特定の遺伝子配列が特定の多型部位中にＡを含んでいるのに対して、他の動物では該部位に単一または複数の塩基置換が存在してよい。本発明の好ましいＳＮＰは非変異近位配列および非変異遠位配列の両者を有するが、ＳＮＰは非変異近位配列のみまたは非変異遠位配列のみを有していてもよい。ＳＮＰの直ぐ３'の遠位非変異配列に相補的な配列を有する核酸分子は、「鋳型依存的な」仕方で伸長された場合には、ＳＮＰの多型部位を含む伸長生成物を形成しうる。そのような核酸分子の好ましい例は、ＳＮＰの５'の近位非変異配列と同じ配列を有する核酸分子である。「鋳型依存的な」伸長とは、伸長された配列が核酸鋳型の配列に相補的となるようにポリメラーゼがプライマーの伸長を媒介しうることをいう。「プライマー」とは、「鋳型依存的な」伸長反応でのヌクレオチド（またはヌクレオチドアナログ）の共有結合付加により伸長されうる、一本鎖オリゴヌクレオチド（またはオリゴヌクレオチドアナログ）または一本鎖ポリヌクレオチド（またはポリヌクレオチドアナログ）をいう。そのような能力を有するためには、プライマーは３'ヒドロキシル（またはポリメラーゼ媒介伸長に適した他の化学基）末端を有していなければならず、第二の核酸分子（すなわち、「鋳型」）にハイブリダイズされなければならない。プライマーは、（１）該プライマーの３'末端が目的とする標的ヌクレオチドの直ぐ近位となるように、標的分子の特定の領域に相補的な８塩基またはそれ以上の長さの独特の配列、および（２）特定かつ独特の長さ、物理的または化学的特性の中性成分からなる５'テール、からなる。最も好ましくはプライマーの相補的な領域は約２０塩基であるが、一層短いまたは一層長いプライマーでもよい。典型的には、プライマーの相補的な領域は約１２塩基から約２０塩基である。５'テールの中性成分は、ポリＴ、非塩基性残基等の、あらゆる非特異的でハイブリダイズしないポリマーまたは化学的基である。「ポリメラーゼ」とは、ある核酸分子が適当な鋳型核酸分子にハイブリダイズした場合に、ヌクレオシド三リン酸（または適当なアナログ）を導入することによって該核酸分子の３'ヒドロキシル基を伸長しうる酵素である。ポリメラーゼ酵素は、Watson，J.D.，In:Molecular Biology o f the Gene 、第３版、ベンジャミン、メンロ・パーク、カリフォルニア（１９７７）（参照のため本明細書中に引用する）および同様の出典において論じられている。他のポリメラーゼ、たとえば、「クレノウ」ポリメラーゼとして一般に知られる大腸菌のＤＮＡポリメラーゼＩの大きなタンパク質加水分解断片、大腸菌ＤＮＡポリメラーゼＩ、およびバクテリオファージＴ７ＤＮＡポリメラーゼなどもまた、本明細書に記載した方法を行うのに用いることができる。ＳＮＰの直ぐ３'の遠位非変異配列と同じ配列を有する核酸は、直ぐ５'の近位配列と同じ配列を有し鋳型依存的な仕方で１のヌクレオチドを伸長したプライマーに鋳型依存的な仕方でライゲートすることができる。Ｂ．遺伝子分析にＳＮＰを用いることの利点本発明の単一ヌクレオチド多型部位は、植物、動物、または微生物のＤＮＡを分析するのに用いることができる。そのような部位は、ヒトを含む哺乳動物、非哺乳動物、家畜（イヌ、ネコなど）、農場動物（ウシ、ヒツジなど）および他の経済的に重要な動物のゲノムを分析するのに適している。しかしながら、本発明の単一ヌクレオチド多型部位は他の種類の動物、植物、および微生物に関して用いることができる。ＳＮＰはＳＴＲやＶＮＴＲに比べて遺伝子分析において幾つかの際立った利点を有する。第一に、ＳＮＰはＳＴＲやＶＮＴＲに比べて一層高い頻度で（約１０〜１００倍高い）かつ一層均一に生じる。ＳＮＰの頻度が一層高いことは、ＳＮＰが他のクラスの多型に比べて一層容易に同定できることを意味する。その分布が一層均一であることは、目的とする特定の特質に「一層近くで」ＳＮＰを同定することを可能にする。これら２つの属性の組み合わせはＳＮＰを極めて価値あるものにする。たとえば、ある特定の特質（たとえば、癌への素因）がある特定の遺伝子座での変異を反映しているならば、該遺伝子座と関連したすべての多型を用いて、ある個人が該特質を示すようになる確率を予測することができる。そのような予測の価値は、一部は、多型と遺伝子座との間の拒理によって決定される。それゆえ、もしも遺伝子座がいかなる繰返しタンデムヌクレオチド配列モチーフからも離れているならば、ＶＮＴＲ分析はごく限られた価値しか有しないであろう。同様に、もしも遺伝子座がいかなる検出可能なＲＦＬＰからも離れているならば、ＲＦＬＰ分析は正確ではなくなるであろう。しかしながら、本発明のＳＮＰは哺乳動物ゲノム中の約３００塩基毎に存在し、しかも分布の均一性を示すので、ＳＮＰは統計的には、ある特定の遺伝子障害または変異の１５０塩基内に認められうる。実際、特定の変異は、それ自体、ＳＮＰであってよい。それゆえ、そのような遺伝子座の配列が決定されている場合には、該遺伝子座のヌクレオチドにおける変異は、問題となっている特質を決定するものとなる。第二に、ＳＮＰは他のクラスの多型に比べて一層安定である。その自然突然変異率は約１０^-9であり、ＶＮＴＲよりも約１,０００倍頻度が低い。確かに、ＶＮＴＲ型の多型は高変異率を特徴とする。第三に、ＳＮＰはさらに、そのアレル頻度を比較的少数の代表的な例についての研究から推論できるという利点を有する。これらのＳＮＰの属性は、ＲＦＬＰかまたはＶＮＴＲ多型のいずれかで可能なものに比べて、同一性、実父確定排除（paternity exclusion）、および特定の遺伝的特質に対する動物の素因の分析のはるかに高度な遺伝的解析を可能にする。第四に、ＳＮＰは、遺伝情報（ヌクレオチド位置および塩基の同定）の最も可能性の高い描写（definition）を反映している。そのように高度の描写を与えるにもかかわらず、ＳＮＰはＲＦＬＰまたはＶＮＴＲに比べて一層容易に、しかも一層高い柔軟性にて検出することができる。実際、ＤＮＡは二本鎖であるので、アレルの相補鎖を分析してＳＮＰの存在および同一性を確認することができる。同定したＳＮＰを特徴付ける際の柔軟性はＳＮＰの際立った特徴である。たとえば、ＶＮＴＲ型の多型は、多数の繰返し中の変異を識別しうるサイズ分画法により容易に検出できる。ＲＦＬＰは、サイズ分画についで制限消化することにより最も容易に検出できる。対照的に、ＳＮＰは様々な方法のいずれを用いても特徴付けることができる。そのような方法としては、該部位の直接または間接シークエンシング、該部位の各アレルが制限部位を生成または破壊する場合の制限酵素の使用、アレル特異的なハイブリダイゼーションプローブの使用、多型の異なるアレルによりコードされるタンパク質に特異的な抗体の使用、または他の生化学的解釈が挙げられる。 Goelet，P.らにより記載され（ＷＯ９２／１５７１２号、参照のため本明細書中に引用する）以下において論じる「ジェネティックビット分析（Genetic Bit Analysis）」（「ＧＢＡ」）法は、単一ヌクレオチド多型部位に存在するヌクレオチドを同定するための方法である。ＧＢＡは、標的ＤＮＡ配列中の変異の部位の周辺のヌクレオチド配列情報を用いて該標的ＤＮＡ中の可変ヌクレオチドに直ぐ隣接するが該ヌクレオチドを含まない領域に相補的なオリゴヌクレオチドプライマーをデザインする、多型部位解明の方法である。標的ＤＮＡ鋳型を生物学的試料から選択し、インテロゲーションプライマーにハイブリダイズさせる。このプライマーを、１またはそれ以上の鎖停止ヌクレオシド三リン酸前駆体（または適当なアナログ）の存在下、ＤＮＡポリメラーゼを用いて単一の標識ジデオキシヌクレオチド（またはアナログ）により伸長させる。 Cohen，D.ら（ＰＣＴ出願ＷＯ９１／０２０８７号）は、遺伝子型決定のための他の関連法を記載しており、該方法によれば、所望の遺伝子座で配列を決定するためにジデオキシヌクレオチドを用いて単一のヌクレオチドにより単一のプライマーを伸長させる。Daleら（ＰＣＴ出願ＷＯ９０／０９４５５）は、プライマーを単一のジデオキシヌクレオチド種とともに用いて「可変部位」の配列を決定するための方法を開示している。Daleらの方法はさらに、複数のプライマーの使用および分離要素の使用を開示している。Ritterband，M．ら（ＰＣＴ出願ＷＯ９５／１７６７６号）は、そのような標的分子を液体試料中で分離、濃縮および検出するための装置を記載している。Cheeseman，P.（米国特許第５,３０２,５０９号）は、一本鎖ＤＮＡ分子の配列を決定する関連法を記載している。 Cheesemanの方法は、蛍光標識した３’保護ヌクレオチド三リン酸（各塩基は異なる蛍光標識を有する）を用いる。 Wallaceら（ＰＣＴ出願ＷＯ８９／１０４１４）は、アレル特異的なプライマーを用いることにより標的の複数の領域を同時に複製するのに用いることのできる複数のＰＣＲ法を記載している。アレル特異的なプライマーを用いることにより、特定のアレルが試料中に存在する場合にのみ増幅が起こる。ＤＮＡ中の多型部位をアッセイするための幾つかのプライマー先導ヌクレオチド導入法が記載されている（Komher，J.S．ら、Nucl ．Acids Res． 17:7779〜77 84(1989);Sokolov，B.P.、Nucl ．Acids Res． 18：3671(1990);Syvanen，A.-C. ら、Genomics 8:684〜692(1990);Kuppuswamy，M.N.ら、Proc ．Natl．Acad．Sci ．(USA) 88:1143〜1147(1991);Prezant，T.R.ら、Hum ．Mutat． 1:159〜164(199 2);Ugozzoli，L.ら、GATA 9:107〜112(1992);Nyren，P.ら、Anal ．Biochem． 20 8 ：171〜175(1993)）。これら方法はすべて、多型部位の塩基を識別するために標識デオキシヌクレオチドを導入することに拠っている点でＧＢＡとは異なる。かかるフォーマットにおいて、シグナルは導入されたデオキシヌクレオチドの数に比例するので、同じヌクレオチドのラン（runs）で生じる多型はランの長さに比例するシグナルという結果となりうる（Syvanen，A.-C.ら、Amer ．J．Hum．Ge net． 52:46〜59(1993)）。かかる所定範囲の遺伝子座特異的シグナルは、とりわけヘテロ接合体の場合、ＧＢＡ法によって生成される単純な３成分（２：０、１：１または０：２）クラスのシグナルに比べて解釈が一層困難でありうる。さらに、ある種の遺伝子座においては、正しいジデオキシヌクレオチドの存在下でさえも正しくないデオキシヌクレオチドの導入が起こりうる（Komher，J.S.ら、Nucl ．Acids Res． 17:7779〜7784(1989)）。かかるデオキシヌクレオチドの誤導入は、幾つかの配列の状況ではミスペアしたデオキシ−基質に対するＤＮＡポリメラーゼのＫｍが、正しく塩基対形成したジデオキシ−基質の比較的小さなＫｍに匹敵することによる（Kornberg，A.ら、In:DNA Replication、第２版(1992) 、フリーマン・アンド・カンパニー、ニューヨーク；Tabor，S.ら、Proc ．Natl ．Acad．Sci．(USA) 86：4076〜4080(1989)）。この効果は、多型部位の解明においてバックグラウンドノイズに貢献する。そのような方法すべてとは対照的に、本発明の方法は、複数のＳＮＰに存在するヌクレオチドの決定を可能とするかまたは極めて容易にする。 II．新規な多型部位を発見する方法多型部位を発見するための好ましい方法は、多数の半数体ゲノムからのゲノムＤＮＡ断片の比較的な配列決定を含む。好ましい態様において、図１に示すように、そのような配列決定は、ある種の１つの成員からのＤＮＡの０．５〜３Ｋｂ断片を含むランダムゲノムライブラリーを調製することにより行う。ついで、これら組換え体の配列を用いて、該種のランダムに選択した多数の個体の同じゲノム遺伝子座でのＰＣＲ配列決定を容易にする。そのようなゲノムライブラリー（典型的に約５０,０００クローン）から数百（２００〜５００）の個々のクローンを精製し、その挿入物の末端の配列を決定する。クローニングした領域のＰＣＲ増幅を可能とするには、ごく少量の末端配列情報（１００〜２００塩基）が得られればよい。この配列決定の目的は、該種の他の成員のゲノムＤＮＡ試料からの同等の断片の増幅を媒介するのに適したプライマーの合成を可能とするのに充分な配列情報を得ることである。好ましくは、そのような配列決定は、サイクルシークエンシング法を用いて行う。これらプライマーを用いて標的種のランダムに選択した成員のパネルからのＤＮＡを増幅させる。パネル中の成員数は、単離すべき多型の最低の頻度を決定する。それゆえ、もしも６の成員を評価するならば、たとえば０．０１の頻度で存在する多型は同定されないかもしれない。例示的だが単純化しすぎた数学的扱いにおいて、６の成員のサンプリングでは約０.０８（すなわち、１.０の全頻度割る６成員割るゲノム当たり２アレル）を超える頻度で起こる多型のみが同定されることが予測される。それゆえ、一層低い頻度の多型の同定を望むなら、より多数のパネル成員を評価しなければならない。サイクル配列分析（Mullis，K.ら、Cold Spring Harbor Symp ．Quant．Biol． 51 :263〜273(1986)；Erlich H.ら、ヨーロッパ特許出願第50,424号；ヨーロッパ特許出願第84,796号、ヨーロッパ特許出願第258,017号、ヨーロッパ特許出願第2 37,362号；Mullis，K.、ヨーロッパ特許出願第201,184号；Mullis，K.ら、米国特許第4,683,202号；Erlich，H.、米国特許第4,582,788号；Saiki，R.ら、米国特許第4,683,194号）は、自動ＤＮＡシークエンシング装置およびソフトウエア（Applied Biosystems，Inc.）を用いることにより容易になる。そのようにして、コンピュータースクリーン上のクロマトグラムの関連部分を調べることにより、異なる動物の配列間の差異を同定し、確認することができる。差異は、両鎖にtuいてデータが得られ、試験した動物の集団の中で１を超える半数体例で存在する場合にのみ、ＤＮＡ多型を反映していると解釈される。図２は、ランダムゲノム断片のサイクルシークエンシングである、新規な多型配列を同定するための好ましい方法を示す。動物からのＰＣＲ断片をアクリルアミドゲルから電気溶出し、ｄＮＴＰおよび蛍光標識または化学的に標識したｄｄＮＴＰの混合物の存在下での熱安定ＴａｑＤＮＡポリメラーゼの繰返しサイクルを用いて配列決定する。ついで、生成物を分離し、Applied Biosystems，Inc.の自動ＤＮＡシークエンシング装置を用いて分析する。得られたデータをＡＢＩソフトウエアを用いて分析する。異なる動物の配列間の差異は、該ソフトウエアにより同定され、コンピュータースクリーン上のクロマトグラムの関連部分を調べることにより確認される。差異は、両鎖についてデータが得られ、試験した５匹のウマで１を超える半数体例に存在する場合にのみ、「ＤＮＡ多型」として表される。最上段のパネルは「Ａ」ホモ接合体を示し、中段のパネルは「ＡＴ」ヘテロ接合体を示し、最下段のパネルは「Ｔ」ホモ接合体を示す。別のやり方として、多型部位の発見は図３に示す戦略を用いて行うことができる。この態様では、ＤＮＡ配列多型は、関連のない成員のゲノム鋳型からのｐＣＲ反応の生成物に対して幾つかの制限酵素のパネルを作用させて得られた制限エンドヌクレアーゼ開裂プロフィールを比較することにより同定する。最も好ましくは、使用する各制限エンドヌクレアーゼは４塩基認識配列を有しており、それゆえ、増幅生成物中で所望の数の切断が可能となるであろう。ゲノムＤＮＡから得られた制限消化パターンを、対応プラスミド鋳型を用いて得られたＰＣＲ生成物から得られたパターンと直接比較する。そのような比較は、ゲノムＤＮＡおよびプラスミドＤＮＡからの増幅配列が同等の遺伝子座からのものであることを示す内部対照を提供する。この対照はまた、繰返し配列またはマルチコピー遺伝子座を偶然に増幅させたプライマーの同定を可能にする。なぜなら、これらはプラスミド鋳型からよりもゲノムＤＮＡ鋳型からの方がより多くの断片を生成するであろうからである。 III．本発明の単一ヌクレオチド多型の遺伝子型を決定する方法本発明の単一ヌクレオチド多型の多型部位「Ｘ」を同定するために、様々な方法のいずれをも用いることができる。そのような同定の好ましい方法は、分析した各多型について多型の配列を直接確認することを含む。それゆえ、このアプローチは、多型の特定の配列よりもむしろバンドのパターンを分析するＲＦＬＰ法とは著しく異なる。Ａ．増幅ベースの分析ＤＮＡ試料中の多型部位の検出は、ＤＮＡ増幅法を用いることにより容易になる。そのような方法は、多型部位にまたがる配列、または多型部位と該部位の遠位かまたは近位に位置する配列とを含む配列の濃度を特異的に増加させる。そのようにして増幅した分子は、ゲル電気泳動その他の手段により容易に検出できる。そのような増幅を達成するための最も好ましい方法では、その二本鎖形態において多型を定める近位配列にハイブリダイズしうるプライマーペアを用いたＰＣＲを採用する。ＰＣＲの代わりに「リガーゼ連鎖反応」（「ＬＣＲ」）などの別法を用いることができる（Barany，F. Proc ．Natl．Acad．Sci．(U.S.A.) 88:189〜193(1991) ）。ＬＣＲでは、２対のオリゴヌクレオチドプローブを用いて特定の標的を指数関数的に増幅させる。各オリゴヌクレオチドペアの配列は、該ペアが標的の同鎖上の隣接する配列にハイブリダイズできるように選択する。そのようなハイブリダイゼーションは、鋳型依存性のリガーゼの基質を形成する。ＰＣＲと同様に、かくして得られた生成物はその後のサイクルの鋳型として働き、所望の配列の指数関数的な増幅が得られる。本発明に従って、ＬＣＲは多型部位の同鎖上の近位配列および遠位配列をそれぞれ有するオリゴヌクレオチドを用いて行うことができる。一つの態様において、いずれかのオリゴヌクレオチドが多型の実際の多型部位を含むようにデザインされるであろう。そのような態様において、反応条件は、標的分子が該オリゴヌクレオチド上に存在する多型部位に相補的な特定のヌクレオチドを含むかあるいは欠失している場合にのみオリゴヌクレオチドが一緒にライゲートしうるように選択する。別の態様において、オリゴヌクレオチドは多型部位を含まず、それらが標的分子にハイブリダイズしたときには「ギャップ」が生成されるであろう（Segev，D .ＰＣＴ出願ＷＯ９０／０１０６９参照）。ついで、このギャップは、相補的なｄＮＴＰにより（ＤＮＡポリメラーゼにより媒介される）、または別のオリゴヌクレオチドペアにより充填される。それゆえ、各サイクルの終わりには各一本鎖は次のサイクルにおいて標的として働きうる相補鎖を有し、所望の配列の指数関数的な増幅が得られる。「オリゴヌクレオチドライゲーションアッセイ（「ＯＬＡ」）」（Landegren ，U.ら、Science 241:1077〜1080(1988)）はＬＣＲとある種の類似点を有し、多型分析に使用すべく適合させることができる。ＯＬＡプロトコールでは、標的一本鎖の隣接配列にハイブリダイズしうるようにデザインした２つのオリゴヌクレオチドを用いる。ＬＣＲと同様にＯＬＡは点変異の検出に特に適している。しかしながら、ＬＣＲとは異なり、ＯＬＡでは標的配列の指数関数的な増幅ではなく「直線的な」増幅が得られる。 Nickerson，D.A.らは、ＰＣＲとＯＬＡとの属性を組み合わせた核酸検出アッセイを記載している（Nickerson，D.A.ら、Proc ．Natl．Acad．Sci．(U.S.A)87: 8923〜8927(1990)）。この方法では、標的ＤＮＡの指数関数的増幅を達成するためにＰＣＲを用い、ついで該標的ＤＮＡをＯＬＡを用いて検出する。複数かつ別々の処理工程が必要であることに加えて、このような組み合わせに付随する一つの問題は、それがＰＣＲおよびＯＬＡに付随する全ての問題を継承していることである。２つの（またはそれ以上の）オリゴヌクレオチドをその結果得られる「ジオリゴヌクレオチド」の配列を有する核酸の存在下でライゲーションさせ、それによって該ジオリゴヌクレオチドを増幅させる方式もまた知られており(Wu，D.Y.ら、Genomics 4:560(1989))、本発明の目的のために容易に適合させることができる。他の知られた核酸増幅法、たとえば、転写ベースの増幅系（Malek，L.T.ら、米国特許第5,130,238号；Davey，C.ら、ヨーロッパ特許出願第329,822号； Schusterら、米国特許第5,169,766号；Miller，H.I.ら、ＰＣＴ出願ＷＯ８９／０６７００；Kwoh，D.ら、Proc ．Natl．Acad．Sci．(U.S.A.) 86:1173(1989)；G ingeras，T.R.ら、ＰＣＴ出願ＷＯ８８／１０３１５）または等温増幅法（Walke r，G.T.ら、Proc ．Natl．Acad．Sic．(U.S.A.) 89:392〜396(1992)）もまた用いることができる。Ｂ．一本鎖ＤＮＡの調製本発明におけるＳＮＰの配列の直接分析は、「サンガー法」としても知られる「ジデオキシ媒介鎖停止法」（Sanger，F.ら、J ．Molec．Biol． 94：441(1975) ）かまたは「マクサム−ギルバート法」としても知られる「化学分解法」（Maxa m，A.M.ら、Proc ．Natl．Acad．Sci．(U.S.A.) 74:560(1977)）（両文献を参照のため本明細書中に引用する）のいずれかを用いて行うことができる。ジデオキシ媒介法かまたはマクサム−ギルバート法のいずれかを用いてＤＮＡの配列を決定する方法は当業者には広く知られている。そのような方法は、たとえば、Samb rook，J.ら、Molecular Cloning ，A Laboratory Manual、第２版、コールド・スプリング・ハーバー・プレス、コールド・スプリング・ハーバー、ニューヨーク（１９８９）およびZyskind，J.W.ら、Recombinant DNA Laboratory Manual、アカデミック・プレス、ニューヨーク（１９８８）（両文献を参照のため本明細書中に引用する）に開示されている。核酸試料が二本鎖ＤＮＡ（またはＲＮＡ）を含む場合、または二本鎖核酸増幅プロトコール（ＰＣＲなど）を用いる場合には、二本鎖のうちの一方の鎖に富む、好ましくは優先的に一方の鎖のみを含む調製物が得られるように二本鎖分子を処理した後にそのような配列分析を行うのが一般に望ましい。しかしながら、熱安定性のポリメラーゼを使用し、反応液を１回かまたはそれ以上加熱および冷却するならば、本発明では一本鎖ＤＮＡ鋳型を生成する必要はない。この処理により、二本鎖の鋳型がその相補鎖から分離され、その後の冷却工程でインテロゲーションプライマーとアニールさせることができる。他の鋳型鎖によるハイブリダイゼーションへの競合は、融解−冷却条件の繰返しサイクルにより補償できる。二本鎖ＤＮＡから一本鎖ＤＮＡ分子を生成するための最も簡単な方法は、熱かまたはアルカリ処理のいずれかを用いた変性である。一本鎖ＤＮＡ分子はまた、一本鎖ＤＮＡバクテリオファージＭ１３を用いて生成することができる（Messin g，J.ら、Meth ．Enzymol． 101:20(1983);Sambrook，J.らのMolecular Cloning ．A Laboratory Manual 、コールド・スプリング・ハーバー・ラボラトリー・プレス、コールド・スプリング・ハーバー、ニューヨーク（１９８９）をも参照）。一本鎖ＤＮＡ分子を生成するために幾つかの別法を用いることができる。 Gyllensten，U.ら(Proc ．Natl．Acad．Sci．(USA) 85：7652〜7656(1988))およびMihovilovic，M.ら(BioTechniques 7(1)：14(1989)）は、「非対称ＰＣＲ」と称する方法を記載しており、この方法では異なるモル濃度で存在するプライマーを用いて標準「ＰＣＲ」法を行う。Higuchi，R.G.ら（Nucl ．Acids Res．17:586 5(1985)）は、一本鎖増幅生成物を生成するための別の方法を例示している。この方法は、二本鎖増幅生成物の一方の鎖の５'末端をリン酸化し、ついで５'→３ 'エキソヌクレアーゼ（Ｔ７エキソヌクレアーゼなど）にリン酸化鎖を優先的に分解させることを内容とする。他の方法はまた、一本鎖ＤＮＡ分子を生成させるためにホスホロチオエート誘導体のヌクレアーゼ耐性特性を利用している（Benkovicら、米国特許第4,521,50 9号;Sayers，J.R.ら、Nucl ．Acids Res． 16:791〜802(1988);Eckstein，F.ら、Biochemistry 15:1685〜1691(1976);Ott，J.ら、Biochemistry 26:8237〜8241(1 987)）。そのような一本鎖分子を生成する方法の相対的な利点および不利な点に関する議論は、Nikiforov，T.（米国特許出願第08/005,061号（出願は１９９４年６月２４日に放棄）、参照のため本明細書中に引用する）により記載されている。最も好ましい態様では、ホスホロチオエート誘導体をプライマーに含ませる。ヌクレオチド誘導体はプライマーのいずれの位置にも導入することができるが、好ましくはプライマーの５'末端に、最も好ましくは互いに隣接して導入されるであろう。好ましくは、プライマー分子は長さが約２５ヌクレオチドの相補的領域を有し、（全残基と比較して）約４％〜約１００％、さらに好ましくは約４％〜約４０％、最も好ましくは約１６％のホスホロチオエート残基を含むであろう。これらヌクレオチドはプライマーのいずれの位置に導入してもよく、互いに隣接していてよく、またはプライマーの全体または一部に散在していてよい。一つの態様において、本発明は増幅プロトコール、たとえばＰＣＲとともに用いることができる。この態様では、非修飾プライマーの場合に確立されているＰＣＲプロトコールを変えることなく上記プライマーを用いることができるように、プライマーのホスホロチオエート結合の数は約１０（プライマーの長さの約半分）に制限するのが好ましい。プライマーがそれ以上のホスホロチオエート結合を含む場合には、反応条件を最適化するためにＰＣＲ条件はとりわけアニーリング温度に関して調整する必要がある。そのようなヌクレオチド誘導体のＤＮＡまたはＲＮＡ中への導入は、ＤＮＡポリメラーゼを用いて酵素的に行うことができる（Vosberg，H.P.ら、Biochemistr y 16:3633〜3640(1977)；Burgers，P.M.J.ら、Ｊ．Biol．Chem．254:6889〜6893 (1979)；Kunkel，T.A.ら、In:Nucleic Acids and Molecular Biology Vol．2,12 4〜135(Eckstein，F.ら編)、シュプリンガー−フェアラーク、ベルリン(1988)； Olsen，D.B.ら、Proc ．Natl．Acad．Sic．(U.S.A.) 87:，1451〜1455(1990)；Gr iep，M.A.ら、Biochemistry 29:9006〜9014(1990)；Sayers，J.R.ら、Nucl ．Aci ds Res． 16：791〜802(1988)）。別法として、ホスホロチオエートヌクレオチド誘導体は合成によりオリゴヌクレオチド中に導入することができる（Zon，G．Anti-Canc ．Drug Des． 6:539〜568(1991)）。プライマー分子は相補的な標的核酸分子にハイブリダイズし、ついで好ましくはポリメラーゼにより伸長して伸長生成物を形成することができる。プライマー中のホスホロチオエートヌクレオチドの存在は、伸長生成物をヌクレアーゼ攻撃に対して耐性にする。指摘されるように、ホスホロチオエートまたは他の適当なヌクレオチド誘導体を含む増幅生成物はＴ７エキソヌクレアーゼやエキソヌクレアーゼなどの「５'→３'」エキソヌクレアーゼによる「排除」（すなわち、分解）に対して実質的に耐性であり、それゆえ、５'→３'エキソヌクレアーゼがホスホロチオエート残基のところへきたときには該エキソヌクレアーゼは核酸分子をそれ以上、実質的に分解することができないであろう。標的分子はヌクレアーゼ耐性残基を欠くので、伸長生成物およびその鋳型（標的）を５'→３'エキソヌクレアーゼの存在下でインキュベートすると鋳型鎖の破壊という結果となり、それにより所望の一本鎖の優先的な生成が達成される。Ｃ．溶液中でのＤＮＡのハイブリダイゼーション多型の多型部位の同一性を決定するための好ましい方法は核酸ハイブリダイゼーションを含む。そのようなハイブリダイゼーションは固相上でも行うことができるが（Saiki，R.K.ら、Proc ．Natl．Acad．Sci．(U.S.A.) 86:6230〜6234(198 9)；Gilhamら、J ．Amer．Chem．Soc． 86:4982(1964)およびKrembskyら、Nucl ． Acids Res． 15:3131〜3139(1987)）、溶液中でハイブリダイズさせるのが好ましい（Berk，A.J.ら、Cell 12:721〜732(1977)；Hood，L.E.ら、In:Molecular B iology of Eukaryotic Cells:A Problems Approach 、メンロ・パーク、カリフォルニア：ベンジャミン−カミングス(1975)；Wetmer，J.G．Hybridization and R enaturation Kinetics of Nucleic Acids ． Ann ．Rev．Biophys．Bioeng． 5:33 7〜361(1976)；Itakura，K.ら、Ann ．Rev．Biochem．53:323〜356(1984)）。高容量試験に応用するには、非放射性の検出法を用いるのが望ましい。それゆえ、蛍光標識したまたはハプテン化した（haptenated）ジデオキシヌクレオチドを使用するのが好ましい。ビオチン化ｄｄＮＴＰの使用は、４つの各（３−アミノプロピン−１−イル）ヌクレオシド三リン酸をスルホスクシンイミジル６−（ビオチンアミド）ヘキサノエートと反応させることにより調製するのが好ましい。その際、（３−アミノプロピン−１−イル）ヌクレオシド５'−三リン酸は、H obbs，F.W.により（J ．Org．Chem． 54:3420〜3422(1989)）およびHobbs，F.W．らにより（米国特許第5,047,519号）記載されたようにして調製する。Ｄ．多型部位の分析１．ポリメラーゼ媒介分析本発明の多型部位のヌクレオチドの同一性は、たとえば、Goelet，P.らにより開示された核酸配列変異のオリゴヌクレオチドベース診断アッセイの変法（ＰＣＴ出願ＷＯ９２／１５７１２、参照のため本明細書中に引用する）を用いて決定できる。とりわけ、本発明は、複数のＳＮＰの同時または同時に近い分析を可能または容易にするという点で米国特許出願第０８／２１６,５３８号（参照のため本明細書中に引用する）に記載されたＳＮＰの分析法に対して改良をもたらすものである。そのような進歩を達成するため、本発明は好ましくは、標的分子に溶液中でハイブリダイズしうる所定の配列を有する１またはそれ以上の精製したインテロゲーションオリゴヌクレオチドを用いる。「インテロゲーションオリゴヌクレオチド」とは、一般に、１またはそれ以上の単一ヌクレオチド多型の直ぐ近位または遠位の配列に相補的な配列を有するオリゴヌクレオチドプライマーをいう。好ましい態様において、標的分子の特定の領域に相補的な配列を有する１またはそれ以上のインテロゲーションオリゴヌクレオチドプライマーを上記方法を用いて調製する。好ましくは、プライマーは標的分子の特定の領域に相補的な約１２〜２０塩基を有する。これらオリゴヌクレオチドプライマーは、各プライマーの３'末端が目的とする標的ヌクレオチド（ＳＮＰなど）の直ぐ近位となるように標的分子にハイブリダイズする。好ましくは、オリゴヌクレオチドプライマーは、中性成分（たとえば、ポリＡ、非塩基性残基、または他の非特異的でハイブリダイズしないポリマーまたは化学標識）からなる５'テールを含む。最も好ましい態様において、中性成分は特定の独特の長さが割り当てられる。プライマーは、プライマー特異的な標識を含んでいてもよいし、含んでいなくてもよい。しかしながら、最も好しい態様においてはオリゴヌクレオチドプライマーはプライマー特異的な標識を含む。ついで、インテロゲーションプライマーを、１またはそれ以上の単一ヌクレオチド多型を有する標的ＤＮＡ分子（好ましくはゲノムＤＮＡ分子）（各ＳＮＰについてその直ぐ３'の遠位配列はインテロゲーションプライマーと相補的である）、ＤＮＡポリメラーゼおよび鎖停止ヌクレオチド（またはヌクレオチドアナログ）三リン酸誘導体の存在下でインキュベートする。好ましくは、そのようなインキュベーションは、ｄＮＴＰ（すなわち、ｄＡＴＰ、ｄＧＴＰ、ｄＣＴＰ、ｄＴＴＰ）の完全な不在下だが１またはそれ以上の鎖停止ヌクレオチド三リン酸誘導体（またはアナログ）（たとえば、ｄｄＡＴＰ、ｄｄＧＴＰ、ｄｄＣＴＰ、ｄｄＴＴＰなど）のみの存在下、該プライマーの３'末端への該誘導体の単一塩基導入が可能な条件下にて行う。反応液中に未導入のヌクレオチド三リン酸が存在することは反応にとって重要ではないが、そのような未導入のヌクレオチドは幾つかの手段により分離することができる。導入したヌクレオチドの同一性は、多型の多型部位のヌクレオチドにより決定され、該ヌクレオチドに相補的である。本発明において非伸長性のヌクレオチドは、好ましくは³²Ｐまたは蛍光分子で標識してよい。本発明に適した他の標識としては、これらに限られるものではないが、ビオチン、イミノビオチン、ハプテン、抗原、補因子、ジニトロフェノール、リポ酸、オレフィン化合物、検出可能なポリペプチド、電子密度の高い（el ectron dense）分子、不溶性の反応生成物を沈積しうる酵素が挙げられる。本発明に適した蛍光分子としては、これらに限られないが、フルオレセイン、ローダミン、テキサスレッド、ＦＡＭ、ＪＯＥ、ＴＡＭＲＡ、ＲＯＸ、ＨＥＸ、ＴＥＴ、Ｃｙ３、Ｃｙ３.５、Ｃｙ５、Ｃｙ５.５、ＩＲＤ４０、ＩＲＤ４１およびＢＯＤＩＰＹが挙げられる。本発明に適した電子密度指示薬（electron dense indic ator）分子としては、これらに限られるものではないが、フェリチン、ヘモシアニンおよびコロイド状金が挙げられる。検出可能なポリペプチドは、検出可能なポリペプチドを、指示薬分子に共有結合した第二のポリペプチドと特異的に複合体を形成させることにより間接的に検出可能であってよい。そのような態様において、検出可能なポリペプチドはアビジンおよびストレプトアビジンよりなる群から選ばれるのが好ましく、第二のポリペプチドはビオチンおよびイミノビオチンよりなる群から選ばれるのが好ましい。好ましい態様において、得られた伸長したプライマーを適当なマトリックス上で分析するために分離する。伸長したプライマーを分析のために分離するために、多くの方法のいずれをも用いることができる。そのような方法としては、これらに限られるものではないが、マススペクトロメトリー、（オリゴヌクレオチドアレイハイブリダイゼーション）フローサイトメトリー、ＨＰＬＣ、ＦＰＬＣ、サイズ排除クロマトグラフィー、アフィニティークロマトグラフィー、ゲル電気泳動などが挙げられる。好ましくは、伸長したプライマーは変性条件下で分離される；しかしながら、変性条件は有効な分離のためには必要とされない。非伸長性のヌクレオチドとは、鋳型依存的な仕方で導入されうる、合成または天然に存在するヌクレオチドアナログをいう。本発明において使用するのに適した合成または天然に存在するヌクレオチドアナログとしては、これらに限られるものではないが、非環状リボースヌクレオチドアナログ、置換リボースヌクレオチドアナログ、および修飾リボースヌクレオチドアナログが挙げられる。合成ヌクレオチドアナログは、好ましくは、フルクトースベースのヌクレオチドアナログ、天然のヌクレオチドと特異的に塩基対形成する能力を保持した化学的修飾プリン、天然のヌクレオチドと特異的に塩基対形成する能力を保持した化学的修飾ピリミジン、および天然のヌクレオチドと特異的に塩基対形成する能力を保持したあらゆる化合物よりなる群から選ばれる。最も好ましい態様において、伸長したプライマーの分離は、適当なアクリルアミド濃度を有する標準シークエンシングゲルなどの変性サイズ分離マトリックス上で行う。この態様では、特定の独特の長さの５'テールを含むインテロゲーションプライマーを用いる。伸長したプライマーは、特定の独特の長さの５'テールに基づいて識別分離される。好ましい態様では標識した鎖停止ヌクレオチド（またはヌクレオチドアナログ）を用いるが、本態様はまた識別標識したインテロゲーションプライマーをも指向するものである。一つの亜態様では識別標識した鎖停止ヌクレオチド（すなわち、ジデオキシヌクレオチド）を用いる。別の亜態様では、単一の鎖停止ヌクレオチドのみを標識した、１またはそれ以上の鎖停止ヌクレオチドを用いる。他の好ましい態様では、固相上に配置した相補的な配列にハイブリダイズしうる独特かつ特定の配列を含むインテロゲーションプライマーを用いる。この態様では、インテロゲーションプライマーを配置された捕捉プライマーに暴露し、固相捕捉プライマーアレイ上での位置に基づいて標識塩基を検出することにより各単一ヌクレオチド多型を同定することで分離される。別の態様においては、得られた伸長したプライマーは適当なアフィニティー分離法を用いて分離される。そのようなアフィニティー分離法は、一般に、レセプター−リガンド法（たとえば、アビジン−ストレプトアビジンなど）、モノクローナル抗体法などと関係する。この態様では、各プライマーの５'末端を、対応の独特のレセプターが存在する独特のリガンドに結合させる。この態様では、レ七プターを固相表面（すなわち、ビーズ、カラム、ディップスティック、マイクロタイタープレートなど）に固定化するのが好ましい。リガンド標識したプライマーは、反応混合物を対応レセプターに暴露することにより分離される。ついで、上記方法を用いて各単一ヌクレオチド多型の同一性を決定することができる。他の態様では、独特のサイズの残基（たとえば、ＢＳＡ、リゾチーム、卵アルブミンなど）に（共有結合かまたは他の仕方で）結合させたプライマーを用いる。この態様では、独特のサイズのプライマーは、適当なサイズ排除クロマトグラフィーカラムに該プライマーを通し、上記方法を用いて各単一ヌクレオチド多型の同一性を決定することにより分離される。また、本発明において上記方法を組み合わせて用いることにより単一の反応で同定できるＳＮＰの数を増加させることもできる。本発明において使用するのに適した標識としては、これらに限られるものではないが、酵素（β−ガラクトシダーゼ、ルシフェラーゼなど）、放射性同位体（すなわち、³²Ｐ、¹³Ｃ、³Ｈなど）、蛍光残基（すなわち、フルオレセイン、ローダミンなど）、色原体が挙げられる。プライマーは直接標識することもできるし、または標識されているかまたは標識されていない別個のリガンドに結合させることもできる。リガンド分子はオリゴヌクレオチドプライマーに、共有結合、イオン性相互作用、非特異的吸着、または特異的だが非共有結合性のリガンド− レセプター相互作用により結合させることができる。リガンドとは、一般に、対応の別個のレセプターが存在する所定のタンパク質または化学物質をいう。本発明に使用するのに適したリガンドとしては、これらに限られるものではないが、ハプテン、抗原、補因子、ビオチン、イミノビオチン、ジニトロフェノール、リボ酸、オレフィン化合物、オリゴヌクレオチド、相補的なオリゴヌクレオチドに特異的にハイブリダイズするようにデザインしたタンパク質核酸（「ＰＮＡ」）配列、およびレセプターとして機能するＰＮＡ配列が挙げられる。本発明に使用するのに適した他のリガンドとしては、これらに限られるものではないが、抗体、酵素、ポリペプチド、ストレプトアビジンおよびアビジンが挙げられる。一つの態様において、リガンドは検出可能に標識したポリペプチドと結合することにより複合体を形成することができる。本発明に使用するのに適した検出可能な標識としては、これらに限られるものではないが、抗体、不溶性の反応生成物を沈積しうる酵素、ストレプトアビジンおよびアビジンが挙げられる。好ましくは、検出可能に標識したポリペプチドは、ランダムに生成したポリペプチドライブラリーから選択する。レセプターとは、一般に、対応のリガンドが存在する所定のタンパク質または化学物質をいう。本発明に使用するのに適したレセプターとしては、これらに限られるものではないが、抗原、補因子、ビオチン、イミノビオチン、ジニトロフェノール、リポ酸、オレフィン化合物、オリゴヌクレオチド、相補的なオリゴヌクレオチドに特異的にハイブリダイズするようにデザインしたＰＮＡ配列、およびリガンドとして機能するＰＮＡ配列が挙げられる。一つの態様において、レセプターは検出可能に標識したポリペプチドと結合することにより複合体を形成することができる。本発明に使用するのに適した検出可能な標識としては、これらに限られるものではないが、抗体、不溶性の反応生成物を沈積しうる酵素、ストレプトアビジンおよびアビジンが挙げられる。好ましくは、検出可能に標識したポリペプチドは、ランダムに生成したポリペプチドライブラリーから選択する。レセプターはマトリックスに結合させてよい。レセプターをマトリックスに結合させる適当な方法としては、これらに限られるものではないが、共有結合、イオン性相互作用、非特異的吸着、または特異的だが非共有結合性のリガンド−レセプター相互作用が挙げられる。本発明に使用するのに適したリガンド−レセプタ一としては、これらに限られるものではないが、相補的なハイブリダイズする核酸、相補的なハイブリダイズするＰＮＡ、および他の相補的な合成核酸アナログが挙げられる。好ましい態様では検出に先立って伸長したプライマーを分離するが、本発明はまたプライマーを分離することなくＳＮＰを同定するために用いることができる。この態様では、（オリゴヌクレオチドの標識によるか、またはオリゴヌクレオチドが標識されないことにより）ヌクレオチドのインテロゲーションプライマーへの付加を検出できるように、少なくとも一つの鎖停止ヌクレオチド三リン酸誘導体を独特に標識する。それゆえ、たとえば、３つの異なるＳＮＰを解明するために３つのプライマーを標識ｄｄＡＴＰの存在下で用いた場合には、そのような標識の導入はＳＮＰの１つがＴであることを示すものである。プライマー特異的標識および導入したヌクレオチドによるプライマーの同定は、標的分子の遺伝子型決定を可能にする。それゆえ、多型部位のヌクレオチドは、標識ヌクレオチドのセットのうちのいずれがプライマー依存性ポリメラーゼによりオリゴヌクレオチドの３'末端に導入されたかをアッセイすることにより決定される。非伸長性のヌクレオチドまたはヌクレオチドアナログは、多くの物理的または化学的方法のいずれによっても同定できる。しかしながら、好ましい物理的または化学的手段は、偏光分光分析、質量分光分析、赤外分光分析、紫外分光分析、可視光分光分析またはＮＭＲ分光分析よりなる群から選ばれる。本発明は複数のＳＮＰを単一の反応にて同定する方法を指向するものであるが、本発明はまた、複数のＳＮＰの同定を単一の反応にて確認することもできる。この態様では、標的核酸のプラス鎖およびマイナス鎖の両方について各ＳＮＰの同一性を上記のようにして決定する。その配列は、分析した各ＳＮＰについてプラス鎖およびマイナス鎖が相補的である場合に確認される。２．ポリメラーゼ／リガーゼ媒介分析別の態様において、多型部位のヌクレオチドはポリメラーゼ／リガーゼ媒介法を用いて同定される。上記態様と同様に、複数のＳＮＰを同じ反応にて検出するために複数のオリゴヌクレオチドプライマーを同時に用いる。上記態様と同様に、ＳＮＰの直ぐ３'の遠位非変異配列に相補的なオリゴヌクレオチドプライマーを用いる。分析すべき多型の５'近位配列に相補的だが該オリゴヌクレオチドプライマーにハイブリダイズできない第二のオリゴヌクレオチドを用いる。これらオリゴヌクレオチドを、分析すべき単一ヌクレオチド多型を含むＤＮＡ、および少なくとも１つの２',５'−デオキシヌクレオチド三リン酸の存在下でインキュベートする。このインキュベーション反応液はさらにＤＮＡポリメラーゼおよびＤＮＡリガーゼを含む。それゆえ、両オリゴヌクレオチドは、分析すべき単一ヌクレオチド多型の同じ鎖にハイブリダイズすることができる。配列の考慮により、これら２つのオリゴヌクレオチドは、多型の多型部位（Ｘ）をフランキングしているＳＮＰの近位および遠位配列にハイブリダイズするようになる；それゆえ、これらハイブリダイズしたオリゴヌクレオチドは、多型部位の正確な位置で単一ヌクレオチドの「ギャップ」により隔てられている。ポリメラーゼおよび（Ｘ）に相補的な２',５'−デオキシヌクレオチド三リン酸の存在により、該相補的な２'，５'−デオキシヌクレオチド三リン酸で伸長したプライマーと遠位配列に相補的なハイブリダイズしたオリゴヌクレオチドとのライゲーションが可能となり、多型のヌクレオチドに相補的な２',５'−デオキシヌクレオチド三リン酸はライゲートしうる基質を生成させる。ついで、「ギャップ」の向かい側にある多型部位の同一性は、幾つかの手段のいずれによっても決定することができる。好ましい態様では、反応の２',５'− デオキシヌクレオチド三リン酸を標識しておき、その検出により多型部位の相補的ヌクレオチドを同定する。幾つかの異なる２',５'−デオキシヌクレオチド三リン酸が存在していてよく、それぞれ異なって標識する。別法として、別々の反応を行い、各反応で異なる２',５'−デオキシヌクレオチド三リン酸を用いる。別の亜態様において、２',５'−デオキシヌクレオチド三リン酸は標識せず、第二の可溶性のオリゴヌクレオチドを標識する。別々の反応を行い、各反応で異なる２',５'−デオキシヌクレオチド三リン酸を用いる。上記態様は単一の多型部位の検出のためのポリメラーゼ／リガーゼ媒介法を詳述するものであるが、この方法は複数の多型部位の検出のための複数の独特のオリゴヌクレオチドプライマーの同時使用を採用できることが一般的に理解されるであろう。Ｅ．シグナル増幅核酸ハイブリダイゼーション検出アッセイの感度は、検出を観察者に報告ないしシグナル伝達する仕方を変えることにより高めることができる。それゆえ、たとえば、アッセイ感度は検出可能に標識した試薬を使用することにより高めることができる。この目的のため、広範囲のそのようなシグナル増幅法がデザインされている。Kourilskyら（米国特許第4,581,333号）は、酵素標識を用いて検出アッセイの感度を高めることを記載している。蛍光標識（Albarellaら、ＥＰ14491 4号）、化学標識(Sheldon IIIら、米国特許第4,582,789号；Albarellaら、米国特許第4,563,417号）、修飾塩基（Miyoshiら、ＥＰ119448号）なども、ハイブリダイゼーションを観察する効率を改良する努力の中で用いられている。導入されたヌクレオチドの同一性を適当な検出装置を用いて自動または半自動の仕方で決定しうるように、蛍光または色原体（とりわけ、酵素）標識を用いるのが好ましい。ＩＶ．遺伝子分析の方法におけるＳＮＰ遺伝子型決定の使用Ａ．遺伝子分析に単一ヌクレオチド多型を用いるに際しての一般的考察本発明の多型部位の有用性は、そのような部位が、２つの個体が所定の多型に対して同じアレルを有するであろう統計的な確率を予測するのに用いることができることに基づく。ＳＮＰの統計的分析は、様々な目的のいずれに対しても用いることができる。ある特定の個体が以前に試験されている場合には、そのような試験は「フィンガープリント」として用いることができ、これはある特定の個体の同一性を決定するのに用いることができる。ある特定の個体の推定の親あるいは両方の親を試験した場合には、本発明の方法は、ある特定の動物がそのような親あるいは両方の親の子孫であるかないかの見込みを決定するのに用いることができる。それゆえ、ＳＮＰの検出および分析は、ある特定の個体に対してあるオスの父性（paternity）（ある特定の子供の父親の実父確定など）を排除したり、またはある特定の個体がある選ばれたメス（ある特定の子供とある選ばれた母親など）の子孫である確率を評価するのに用いることができる。以下に示すように、本発明は標的種の遺伝子地図の構築を可能にする。それゆえ、そのような遺伝病、状態、または特質へのある特定の動物（または植物）の素因を予測するため、本発明の方法により同定した多型の特定のアレイを特定の特質と関連付けることができる。本明細書において「特質」とは、「遺伝病」、「状態」、または「特性」を包含することを意図するものである。「遺伝病」とは、検出しうるか否かまたは無症候性であるか否かにかかわらず、変異によって引き起こされる病理学的状態をいう。「状態」とは、特性（喘息、弱い骨、盲目、潰瘍、癌、心臓または心血管系の疾患、骨格−筋の欠陥など）への素因をいう。「特性」は、植物または動物に経済的価値を付与する属性である。特性の例としては、寿命、スピード、耐久性、加齢の速度、生殖能力などが挙げられる。Ｂ．同定および親であることの確認同定および父性試験システムの効力を決定する最も有用な測定値は、（i）「同一性の確率（probability of identity）（ｐ(ＩＤ)）」および（ii）「排除の確率（probability of exclusion）（ｐ(ｅｘｃ)）」である。ｐ(ＩＤ)は、２つのランダムな個体が所定の多型マーカーに関して同じ遺伝子型を有するであろう見込みを計算するものである。ｐ(ｅｘｃ)は、所定の多型マーカーに関して、ランダムなオスが母親の同一性が問題でない平均的な実父確定ケースにおいて父親であることと両立しない遺伝子型を有するであろう見込みを計算するものである。単一の遺伝子座（主要組織適合領域などの多数のアレルを有する遺伝子座を含む）が実父確定試験について適切な統計的信頼性のある試験を提供することは稀であるので、所望の試験は好ましくは複数の連関していない遺伝子座を平行して測定するであろう。同一性または非同一性の累積確率、および父性排除の累積確率は、各遺伝子座により与えられる確率を掛けることにより、これら複数遺伝子座試験について決定される。最も興味のもたれる統計的測定値は、（i）非同一性の累積確率（cump(nonID) ）、および（ii）父性排除の累積確率（cump(exc)）である。これらの確率の値を計算するのに用いる式は下記に示す。簡単のため、これらはまず２アレル遺伝子座に対して与えられ、一つのアレルはＡ型と称し、他方はＢ型と称する。そのようなモデルでは４つの遺伝子型が可能である：ＡＡ、ＡＢ、ＢＡおよびＢＢ（ＡＢ型およびＢＡ型は生化学的には区別することができない）。アレルの頻度は、半数体ゲノム中に見出されるＡの回数（f(A)、Ａの頻度は「ｐ」で表される）またはＢの回数（f(B)、Ｂの頻度は「ｑ」（ここで、ｑ＝１ −ｐである）で表される）によって与えられる。所定の遺伝子型座における所定の遺伝子型の確率は下記のように示される：ホモ接合体：p(AA)＝p² 単一のヘテロ接合体：p(AB)＝p(BA)＝pq＝p(1-p) 両方のヘテロ接合体：p(AB+BA)＝2pq＝2p(1-p) ホモ接合体：p(BB)＝q²＝(1-p)² ある遺伝子座における同一性の確率（すなわち、ある集団からランダムに取り出した２つの個体が所定の遺伝子座で同一の遺伝子型を有するであろう確率）は、等式： p(ID)＝(p²)²＋(2pq)²＋(q²)² で与えられる。それゆえ、ｎ個の遺伝子座に対する同一性の累積確率は、等式：で与えられる。ｎ個の遺伝子座に対する非同一性の累積確率（すなわち、ある集団からランダムに取り出した２つの個体が１またはそれ以上の遺伝子座で異なるであろう確率）は、等式： cum p(nonID)＝１−cum p(ID) で与えられる。父性排除の確率（ランダムなオスが所定の遺伝子座に関して母親の同一性が問題でない平均的な実父確定ケースにおいてオス親であることと両立しない遺伝子型を有するであろう確率を示す）は、等式： p(exc)＝pq(1−pq) で与えられる。非排除の確率（所定の遺伝子座でランダムなオスが平均的な実父確定ケースにおいてオス親として生化学的に排除されないであろう確率を表す）は、等式： p(non-exc)＝１−p(exc) で与えられる。それゆえ、非排除の累積確率（ｎ個の遺伝子座を用いたときに得られる値を表す）は、である。排除の累積確率（ｎ個の遺伝子座のパネルを用い、ランダムなオスが母親が問題とならない平均的な実父確定ケースにおいてオス親として生化学的に排除されるであろう確率を表す）は、等式： cum p(exc)＝１−cump(non-exc) で与えられる。これらの計算は、所定の遺伝子座においていかなる数のアレルに対しても拡張できる。たとえば、アレルがそれぞれ集団中の頻度ｐ、ｑおよびｒを有する３アレル系についての同一性の確率ｐ(ＩＤ)は、遺伝子型頻度の二乗の合計に等しい： p(ID)＝p⁴＋(2pq)²＋(2qr)²＋(2pr)²＋r⁴＋q⁴ 同様に、３アレル系についての排除の確率は、 p(exc)＝pq(1−pq)＋qr(1−qr)＋pr(1−pr)＋3pqr(1−pqr) によって与えられる。ｎ個のアレルの遺伝子座においては、適当な二項式的拡張を用いてｐ(ＩＤ)およびｐ(ｅｘｃ)を計算する。図３および図４は、使用した遺伝子座の数および型の両者にともなってcum p( nonID)およびcum p(exc)がどのようにして増大するかを示す。３アレル系を用いた場合には一層優れた識別力が一層少ないマーカーで達成されることがわかる。図３および図４において、三角は２つのアレルを有する遺伝子座の数が増加するにつれて増大する確率値を追ったものであり、共通のアレルはｐ＝０.７９の頻度で存在する。図３および図４における×は、ｐ＝０.５、ｑ＝０.３４およびｒ＝０.１５の３アレル遺伝子座の数の増加についての同様の分析を示す。しかしながら、２、３またはそれ以上のアレルを有する遺伝子座のいずれを選択するかは、上記生化学的な考慮により大きく影響される。多型分析試験は、所定の遺伝子座においていずれの数のアレルに対しても評点すべくデザインできる。アレルの評点をゲル電気泳動を用いて行うのであれば、各アレルはゲル電気泳動により容易に解析しうるものでなければならない。複数のアレルファミリーでの長さの変異はしばしば小さいので、複数のアレルファミリーを用いたヒトＤＮＡ試験にはアレルの誤同定のための統計的な補正が含まれる。さらに、複数のアレル系からの稀なアレルの出現は情報として非常に有益ではあるが、これらアレルが稀であることは集団中でのその頻度の正確な測定を極めて困難なものにしている。稀なアレルを用いた場合にこれら頻度評価における誤りを補正するために、このデータの統計分析は、これら頻度評価における不確実性の累積効果の測定を含んでいなければならない。これら複数のアレル系の使用はまた、大きな集団のスクリーニングの際に該集団中の新規なまたは稀なアレルが発見されるであろう見込みを増大させる。これまでに集めた遺伝子データの完全さは、新規なアレルの発見を反映して経験的に改定されるであろう。これらの点を考慮に入れて、多数のアレルを有する遺伝子座を用いることは潜在的に幾つかの短期的な利点をもたらしうるけれども（より少ない遺伝子座しかスクリーニングする必要がないので）、（i）一層頻繁に表され、および（ii）明白に測定するのが一層容易であるような一層少ないアレルを有する遺伝子座を用いて多型分析を行うのが好ましい。このタイプの試験では、より多型性の高い遺伝子座に基づく試験と同じ識別力を達成できるが、ただし、同じ全数のアレルを一連の連関していない遺伝子座から回収しなければならない。Ｇ．ＳＮＰを用いた遺伝子マッピングおよび遺伝特質分析同じ種（ヒト、ウマなど）かまたは密接に関連した種の個体のセットで検出された多型は、ある特定の多型の存在または不存在が特定の特質と相関関係があるか否かについて決定するために分析することができる。そのような多型分析を行うため、多型のセット（すなわち、「多型アレイ」）の存在または不存在を個体のセットについて決定する。これら個体のうちの幾つかは特定の特質を示し、また別の幾つかは相互に排他的な特質（たとえば、ウマに関しては、脆い骨と脆くない骨；成熟期に開始する盲目と盲目でないこと；喘息、心血管系疾患などの素因とそのような素因のないこと）を示す。ついで、このセットの各多型のアレルを調べて、ある特定のアレルの存在または不存在が興味のもたれる特定の特質と関係しているか否かを決定する。そのような相関関係は、個々の種の遺伝子地図を定める。特質に関してランダムには分離しないアレルは、ある特定の動物がその特性を発現するであろう確率を予測するのに用いることができる。たとえば、ある特定の多型アレルが心血管状態を呈する種の成員の２０％にしか存在しないときは、そのアレルを含むその種のある特定の成員は、そのような心血管状態を示す確率を２０％有することであろう。指摘するように、この分析の予測力は、ある特定の多型アレルとある特定の特性との間の連関の程度により増大する。同様に、この分析の予測力は、ある特定の特質の複数の多型遺伝子座のアレルを同時に分析することによって増大させることができる。上記例において、第二の多型アレルもまた心血管状態を呈する成員の２０％に存在することがわかれば、そのような心血管状態を呈する評価した全成員がこれら第一および第二の多型のアレルの特定の組み合わせを有し、それゆえ、そのようなアレルを両者ともに含む特定の成員は心血管状態を呈する非常に高い確率を有するであろう。複数の多型部位の検出は、そのような部位が集団中で独立に分離する頻度を定めることを可能にする。たとえば、２つの多型部位がランダムに分離するならば、それらは別々の染色体上に存在するかまたは同じ染色体上で互いに離れて存在する。逆に、有意の頻度で同時に遺伝される２つの多型部位は、同じ染色体上で互いに連関している。このようにして、分離の頻度の分析は、マーカーの遺伝子地図の確立を可能にする。それゆえ、本発明は植物および動物のゲノムをマッピングする手段を提供するものである。遺伝子地図の解像度は、それが含むマーカーの数に比例する。本発明の方法は多数の多型部位を単離するのに用いることができるので、あらゆる所望の程度の解像度を有する地図を作成するのに用いることができる。多型部位の配列決定は、遺伝子マッピングにおけるその有用性を大いに高める。そのような配列は染色体を「歩き」、それによって新たなマーカー部位を同定するのに用いることのできるオリゴヌクレオチドプライマーおよびプローブをデザインするのに用いることができる（Bender，W.ら、J ．Supra．Molec．Struc． 10(補遺) ：32(1979);Chinault，A.C.ら、Gene 5:111〜126(1979)；Clarke L．ら、Nature 287:504〜509(1980)）。地図の解像度は、多型分析を、そのゲノムをマッピングしようとする植物または動物の他の属性の表現型に関するデータと組み合わせることによりさらに高めることができる。それゆえ、ある特定の多型が褐色の髪の毛の色とともに分離すれば、その多型は髪の毛の色の遺伝子または遺伝子群の近くの遺伝子座にマッピングされる。同様に、生化学的データを用いて遺伝子地図の解像度を高めることができる。この態様では、生化学的な決定（血清型、イソ型など）は、それがいずれかの多型部位と一緒に分離するか否かを決定するために調べられる。そのような地図は、たとえば、新たな遺伝子配列を同定したり、疾患の原因となる変異を同定するのに用いることができる。実際、本発明のＳＮＰの同定は、該ＳＮＰのいずれかの側に位置する新規な遺伝子配列を単離あるいは配列決定するために、相補的なオリゴヌクレオチドをＰＣＲまたは他の反応においてプライマーとして用いることを可能にする。本発明は、そのような新規な遺伝子配列を含む。そのようなプライマーを用いることによってクローン的に単離できるゲノム配列は、ＲＮＡに転写され、ついでタンパク質として翻訳することができる。本発明はまた、そのようなタンパク質、並びにそのようなタンパク質に結合しうる抗体その他の結合分子をも含む。本発明を以下において、その態様の２つ、すなわち、ウマおよびヒトに関して説明する。しかしながら、遺伝学の基本的な教義は種にかかわりなく適用できるので、そのような説明は他のいずれの種に対しても同様に適用しうるものである。それゆえ、当業者は、上記本発明の方法を直接用いて他のいずれかの種においてＳＮＰを単離し、それによって本発明の遺伝子分析を行うだけでよいであろう。上記で指摘したように、ＬＯＤ評点法は、遺伝的特質の遺伝を追跡するためと、種の遺伝子地図を構築するためとの両方のためにＲＦＬＰを使用するのを可能とすべく開発された（Lander，S.ら、Proc ．Natl．Acad．Sci.(U.S.A) 83:7353 〜7357(1986);Lander，S.ら、Proc ．Natl．Acad．Sci.(U.S.A) 84:2363〜2367(1 987);Donis-Keller，H.ら、Cell 51:319〜337(1987);Lander，S.ら、Genetics 1 21 ：185〜199(1989)）。そのような方法は、本発明の多型に使用すべく容易に適合させることができる。実際、そのような多型は、この点でＲＦＬＰおよびＳＴＲよりも優れている。ＳＮＰの頻度のため、濃密な遺伝子地図を容易に作成することができる。さらに、上記で指摘したように、本発明の多型は典型的なＶＮＴＲ型の多型に比べて一層安定である。本発明の多型は直接的なゲノム配列情報を含むため、多くの方法によりタイプ分けできる。ＲＦＬＰまたはＳＴＲ依存性の地図では分析はゲルベースでなければならず、標的動物のＤＮＡの電気泳動プロフィルを得ることを必然的に伴う。ゲルベースの方法に加え、多型（ＳＮＰ）の分析は分光分析法を用いても行うことができ、多数の標的動物の分析を容易にするために容易に自動化することができる。以上、本発明を一般的に記載したので、ウマ多型の単離および分析である下記実施例を参照することにより本発明をより一層理解できるであろう。これら実施例は説明のためであって、本発明を限定することを意図するものではない。実施例１ポリアクリルアミドゲル電気泳動による複数のＳＮＰの分析複数の単一ヌクレオチド多型の検出のための本実施例では、一本鎖ＤＮＡ鋳型を３つのインテロゲーションプライマーでプローブした（probed）。プライマー＃１は５塩基Ｔ−テールを有し、プライマー＃２は１０塩基Ｔ−テールを有し、プライマー＃３は１５塩基Ｔ−テールを有する。一本鎖鋳型を得るため、２つの方法のうちのいずれかを用いた。第一に、Niki forov，T．（米国特許出願第08/005,061号（出願は１９９６年６月２４日に放棄）により教示されるように、４ホスホロチオエート−ヌクレオチド誘導体を含むプライマーを用いて増幅を媒介する。別法として、第二ラウンドのＰＣＲを「非対称」プライマー濃度を用いて行うことができる。第一の反応の生成物を第二の反応では１／１０００に希釈する。第二ラウンドのプライマーの一方は２Ｍの標準濃度で用い、他方は０.０８Ｍの濃度で用いる。これら条件下で一本鎖分子を反応の間に合成する。プライマー混合物を一本鎖標的鋳型にハイブリダイズさせ、ＤＮＡポリメラーゼおよび４つの修飾した非伸長性のヌクレオチドを用いた単一塩基伸長反応を起こさせる。各４つの反応管について１のみの修飾した非伸長性のヌクレオチドを好ましくは³²Ｐまたは蛍光分子で標識する。ついで、得られた伸長したプライマーを標準１２％シークエンシングゲル上での分析のため分離する。このようにして、各プライマーの電気泳動移動度および標識した非伸長性のヌクレオチドの同定に基づいて各プライマーに対応するＳＮＰの同一性を決定することが可能になる。実施例２サイズ排除クロマトグラフィーによる複数のＳＮＰの分析プライマー１、２、３および４をそれぞれＢＳＡ、卵アルブミン、リゾチームおよびＣＩＰに共有結合させる。一本鎖ＤＮＡ鋳型を４つのインテロゲーションプライマーでプローブし、ＤＮＡポリメラーゼおよび４つの修飾した非伸長性のヌクレオチドを用いた単一塩基伸長反応を起こさせる。好ましくは各非伸長性のヌクレオチドは、独特かつ別個に標識する。引き続き、得られた伸長したプライマーを適当なサイズ排除カラム（たとえば、セファデックス、セファロースなど）上で分離し、溶出液を（たとえば、シンチレーションカウンターにより）分析して導入されたヌクレオチドの同一性を決定する。実施例３アフィニティー法を用いた複数のＳＮＰの分析本実施例では、たとえば、Chuらにより開示された方法（Nucleic Acids Res ． 16 :3671〜3691(1988)、参照のため本明細書中に引用する）を用い、ペプチドまたはタンパク質アフィニティーリガンドをインテロゲーションオリゴヌクレオチドに共有結合させる。ついで、アフィニティーリガンド−インテロゲーションプライマー複合体を標的核酸分子にハイブリダイズさせ、上記単一塩基伸長を４っの異なって標識したジデオキシヌクレオチド種（ｄｄＡ、ｄｄＴ、ｄｄＣおよびｄｄＧ）の存在下で起こさせる。所望の場合は、より少ない種のジデオキシヌクレオチドを用いてもよい。上記ペプチドまたはタンパク質に対する対応モノクローナル抗体をマイクロタイタープレート（Nunc）に固定化する。各モノクローナル抗体を室温にて緩衝液中でマイクロタイタープレートに固定化する。ついで、プレートをＴＮＴｗ溶液で３回洗浄して過剰の未結合タンパク質を除去する。ついで、伸長したプライマーの溶液を各ウエルに約３０分間かけて加える。未結合プライマーをＴＮＴｗ溶液で充分に洗浄して除去する。表３はそのような実験から予測される結果を示す。実施例４オリゴヌクレオチドを分離しない複数のＳＮＰの分析オリゴヌクレオチドプローブを分離することなく複数の単一ヌクレオチド多型を同定することも可能である。そのような実験では、各ジデオキシヌクレオチド三リン酸を独特に標識するのが好ましい。それゆえ、たとえば、ｄｄＡＴＰは³²Ｐで標識し、ｄｄＧＴＰは³Ｈで標識し、ｄｄＣＴＰは³⁵Ｓで標識し、ｄｄＴＴＰは¹²⁵Ｉで標識することができる。表３は、興味のもたれる核酸を含む調製液に６つのオリゴヌクレオチドをハイブリダイズさせて得られる仮定上の結果、および単一の塩基伸長反応の結果を示す。そのような実験において、導入されなかったｄｄＮＴＰは種々の手段（たとえば、適当なスピンカラム（すなわち、CentriSepスピンカラム）など）のいずれかを用いて伸長したプローブから分離できる。導入された標識ジデオキシヌクレオチド三リン酸は、その後、シンチレーションカウンターを用いて検出する。各同位体は別個の発光スペクトルを有するので、シンチレーションカウンターは精製手順を必要とすることなく複数の単一ヌクレオチド多型の同一性を決定することができる。表４に示すように、単一ヌクレオチド多型に相補的な導入されたジデオキシヌクレオチドの同定は、プライマー１〜６に関してそれぞれＴ、Ｇ、Ａ、Ｔ、Ｃ、Ｇであることが明らかである。曖昧さは、プライマーの組み合わせを変えることによって決定できる。実施例５オリゴヌクレオチドアレイ分離による複数のＳＮＰの分析プライマー１、２、３および４は、鋳型ＤＮＡに相補的な配列に加えて固相表面上の捕捉オリゴヌクレオチドにその後にハイブリダイズさせるための独特の配列を含む。一本鎖ＤＮＡ鋳型を４つのインテロゲーションプライマーでプローブし、ＤＮＡポリメラーゼおよび４つの修飾した非伸長性のヌクレオチドを用いた単一塩基伸長反応を起こさせる。好ましくは各非伸長性のヌクレオチドは、独特かつ別個に標識する。得られた伸長したプライマーを、その後、オリゴヌクレオチドアレイの表面に適用する。このアレイは４つの別々かつ空間的に隔てられた別個の捕捉オリゴヌクレオチドからなり、各オリゴヌクレオチドはインテロゲーションプライマーの一つの上の独特の配列に相補的である。各インテロゲーションプライマーは、その対応表面に結合した捕捉プライマーへのハイブリダイゼーションにより有効に分離される。ついで、標識ヌクレオチドの同一性を適当な方法により決定する。本発明をその特定の態様に関連して記載したが、さらに修飾することが可能であり、この出願は、一般に本発明の原理に従って、および本発明が関与する技術分野内では公知で慣例的な実施であるとして、あるいは上記や添付の請求の範囲内の本質的特徴に適用しうるものとして、本発明の開示から逸脱するものを包含して、本発明の変更、使用、または適応を包含することを意図するものであることが理解されるであろう。DETAILED DESCRIPTION OF THE INVENTIONTitle of invention Method for detecting multiple single nucleotide polymorphisms in a single reaction TECHNICAL FIELD OF THE INVENTION The present invention is in the field of recombinant DNA technology. For more information, see the book The invention relates to one or more single nucleotides in the genome of a plant, animal or microorganism. Identify leotide polymorphisms in a single reaction and identify such sites with identity, ancestry or genetic Molecules and methods suitable for use in analyzing traits.Cross-reference of related applications This application claims priority with US Ser. No. 08 / 216,538, filed Mar. 23, 1994. (The application is filed in U.S. patent application Ser. No. 08 / 145,145 (November 3, 1993) Application) is a partial continuation application).Background of the Invention Being able to determine the genotype of an animal, plant or microorganism is forensic, medical and In epidemiology and public health, and in animal breeding and exhibition It is basically important. Such determinants include, for example, infection To identify the etiology of a sexual illness, to determine if two individuals are related, Or it is necessary to map a gene into the genome of an organism. Analyzes of identity and parenthood along with being able to diagnose the disease are also human , Animal and plant genetic studies, especially legal or paternity Central interest in assessment and in assessing an individual's risk of developing a genetic disease It is also a thing. Such a goal is to make the DNA of one individual different from the DNA of another individual. It has been pursued by analyzing the DNA sequence for mutations that have been deemed to have been. If such a mutation results in a fragment produced by restriction endonuclease cleavage If it changes length, such a mutation is called a restriction fragment length polymorphism ("RFLP"). You. RFLP is widely used in human and animal genetic analysis (Grassberg , J., UK Patent Application No. 2135774; Skolnick, M.H. et al.Cytogen . Cell Ge net .32: 58-67 (1982); Bostein, D. et al.Ann . J. Hum. Genet.32: 314-331 (19 80); Fischer, S .; G. et al. (PCT application WO 90/13668); Uhlen, M., PCT application WO 90/11369). Linking heritable attributes to specific RFLPs If linked, the presence of the RFLP in the target animal indicates that the animal Can be used to predict that it is likely to show Multiple a RFLP multiple loci to map complex traits dependent on rel (Multilocus) Statistical methods have been developed that allow analysis (Lander, S. et al.P roc . Natl. Acad. Sci. (USA) 83: 7353-7357 (1986); Lander, S. et al.Proc . N atl. Acad. Sci. (USA) 84: 2363-2367 (1987); Donis-Keller, H. et al.Cell 51: 3 19-337 (1987); Lander, S. et al.Genetics 121: 185-199 (1989), all referenced Cited herein). Such methods are commonly used to develop genetic maps. And can be used to develop plants or animals with more desirable attributes. Kirun (Donis-Keller, H. et al.Cell 51: 319-337 (1987); Lander, S. et al.Genetics 121 : 185-199 (1989)). In some cases, mutations in the DNA sequence are caused by tandem dinucleotides of nucleotides. Tandem repeats containing short or trinucleotide repeat motifs (shortt andem repeats (“STR”). These tandem lipi The note also states "variable number tandem repeat ( Also referred to as "VNTR"). VNTR is used for identification and paternity analysis (Weber, J.L., U.S. Pat. No. 5,075,217; Armour, J.A.L., et al.)FEBS Lett .307: 113-115 (1992); Jones, L. et al.Eur . J. Haematol.39: 144〜14 7 (1987); Horn, G.T., et al., PCT Application WO 91/14003; Jeffreys, A.J. Jeffreys, A.J., U.S. Patent No. 5,175. No. 082; Jeffreys, A.J. et al.Amer . J. Hum. Genet.39: 11-24 (1986); Jeffr eys, A.J. et al.Nature 316: 76-79 (1985); Gray, I.C., et al.Proc . R. Acad. Soc. Lond .243: 241-253 (1991); Moore, S.S., et al.Genomics Ten: 654-660 (1991); Jef freys, A.J. et al.Anim . Genet.18: 1-15 (1987); Hillel, J. et al.Anim . Genet.Two 0 : 145-155 (1989); Hillel, J. et al.Genet.124: 783-789 (1990)), currently many It has been used in many gene mapping studies. A third class of DNA sequence variation is the occurrence of single nucleotides present between individuals of the same species. De-polymorphism ("SNP"). Such polymorphisms are identified by STR or V It occurs much more frequently than NTR. In some cases, such polymorphisms Contain mutations that are a critical feature in genetic diseases. In fact, such a change The difference is sufficient to actually cause the disease (ie, hemophilia, sickle cell anemia, etc.). Affects a single nucleotide in a protein-encoding gene in a sensitive way. Many In many cases, these SNPs occur in non-coding regions of the genome. Despite the central importance of such polymorphisms in modern genetics, Enable analysis of one or more of these loci in a single reaction format No practical method has been developed. The present invention provides such an improved method. In fact, the present invention Genes identified and paternally identified by discriminating single nucleotide polymorphism variants Methods and gene sequences are provided that enable analysis and diagnosis of disease.Summary of the Invention The present invention includes single nucleotide polymorphisms (SNPs) present in all life forms. About molecules. The invention provides (i) one or more novel single nucleotides A method for identifying polymorphisms, (ii) repeated analysis and testing of these SNPs in different samples And (iii) determining the presence of such sites in animals, plants and microorganisms. How to use for child analysis. Analysis of such sites (genotyping) can include identity, ancestry, predisposition to genetic disease, Useful for determining the presence or absence of a desired attribute, and the like. Specifically, the present invention One or more nucleotide sequences of the genomic DNA segment of any organism One or more interrogations having complementary polynucleotide sequences (Interrogation) providing nucleic acid (or nucleic acid analog) primer molecules Wherein the genomic segment is a mammalian single nucleotide polymorphism array. Located just 3 ′ distal to the single nucleotide polymorphism site X of the Or nucleotide analog) for nucleic acid (or nucleic acid analog) primer The template-dependent extension of the primer allows the primer molecule to bind to a single nucleotide (or analog). And the single nucleotide (or analog) is replaced with the single nucleo Complementary to nucleotide X of the Tide polymorphism allele. The present invention provides for template-dependent extension of primers using ddATP, ddTTP, dd. Selected from the group consisting of CTP and ddGTP (or other chain terminating base analog) One or more dideoxynucleotide triphosphate derivatives (or analogs) G) but in the absence of dATP, dTTP, dCTP and dGTP It relates to an aspect. The present invention further provides for one or more single nucleotide polymorphisms in a single reaction. A method of identifying (A) one or more identifiable interrogation oligonucleotides ( Or oligonucleotide analog) primers to one or more target nucleic acids Hybridize to the molecule, where each oligonucleotide primer is The 3 'end of the primer is immediately adjacent to the specific and unique target nucleotide of interest Complementary to a specific and unique region of each target nucleic acid molecule, such as (B) Each interrogation oligonucleotide (or analog) is template dependent With one or more non-extendable polymerases Occurs in the presence of a nucleotide (or nucleotide analog) of (C) For each interrogation primer used, such a primer Of non-extendable nucleotides (or nucleotide analogs) introduced into Each nucleotide (or analog) of interest by determining the identity And identifying the non-extendable nucleotide (or nucleotide Nalog) is complementary to the target nucleotide of the primer; (D) on a suitable matrix or other standard method of physical or chemical separation or Separates the extended primer by an identification method including.BRIEF DESCRIPTION OF THE FIGURES FIG. 1 shows a preferred method for cloning random genomic fragments. Genomic DNA is subjected to size fractionation and then plasmids are used to obtain random clones. Introduce into the vector. PCR sequencing to determine the sequence of the inserted genomic sequence Design and use limers. Figure 2 shows the cycle sequencing of random genome fragments. ) Obtained by a preferred method for identifying a novel polymorphic sequence. Shows the data obtained. FIG. 3. RFL for screening random clones for polymorphic sequences. The P method is shown. FIG. 4 shows that for a given panel of genetic markers, two individuals have the same genotype 3 shows a graph of the probability that the probability will be high. FIG. 5 shows that a given panel of 20 genetic markers is not a problem for mothers Probability of rejecting random and suspicious fathers in paternity lawsuits The graph is shown. FIG. 6 shows a preferred method for genotyping SNPs. The seven steps are: Figure 4 shows how to perform GBA starting from a biological sample.Description of the preferred embodiment I. Advantages of the Single Nucleotide Polymorphism of the Invention and Its Use in Gene Analysis A. Polymorphic attributes Certain gene sequences of interest to the present invention are single nucleotide polymorphisms. "including. A “polymorphism” is a variation in the DNA sequence of some member of a species. You. Animal and plant genomes have undergone natural mutations during their sustained evolution. (Gusella, J.F.,Ann . Rev. Biochem.55: 831-854 (1986)). Such a spike Most mutations, however, produce polymorphisms. The mutated sequence and the original sequence are Co-existing in a group. In some cases, such coexistence is stable or near It is in stable equilibrium. In other cases, such mutations are not Will eventually provide an advantage to evolution (ie, ), It will be integrated into the genome of all members of that species. Therefore, a polymorphism is that some members of certain species are not mutated due to the existence of the polymorphism. Other members have a mutated sequence (ie, the original "allele") That is, "allelic" in that it has a mutated "allele"). ". In the simplest case, there is only one mutated sequence and this multiple The type is called dialelic. When another mutation occurs, (Triallelic), etc. The allele is defined by the nucleotide containing the mutation. expressed. The present invention relates to specific classes of allelic variants and to the inheritance of plants, animals, or microorganisms. It relates to its use for determining gene types. Such allelic variants are described herein. Are referred to as "single nucleotide polymorphisms" or "SNPs." "Single nucleo The "tide polymorphism" is defined by the following attributes. The central attribute of such polymorphisms is It is to include a polymorphic site “X” which is a site of mutation between allele sequences. SNP's The second property is that the polymorphic site "X" often precedes the "unmutated" sequence of the allele. It has been followed. Therefore, the polymorphic site of the SNP is a "5 'proximal" non- Located “immediately” 3 ′ to the mutated sequence and “immediately” 5 ′ to the “3 ′ distal” unmutated sequence It is assumed to be located. Such a sequence flanks the polymorphic site. Single The term "single" of a nucleotide polymorphism refers to the number of nucleotides in the polymorphism (ie, one Nucleotides); this is the number of polymorphisms (from one to many) present in the target DNA. Has no relation to numbers). As used herein, an allele is referred to as an "allele" if the sequence does not change within that species population. The sequence is referred to as the 'unmutated' sequence, and if mapped, the Maps to the "corresponding" sequence of the same allele in the nom. Two or more S Note that the NPs may be located very close to each other. Two arrays Are `` corresponding '' when they are analogs from different sources. It is called a row. The gene sequence encoding hemoglobin in two humans 9 is an example of an allele sequence. The definition of “corresponding allele” in this specification is To change the meaning of a word that is understood by those skilled in the art, to clarify the meaning of the word It is not intended. Each column in Table 1 shows the “corresponding” equine allele polymorphism site. Nucleotide identification and unmutated 5 'proximal and 3' distal sequences (also in the S NP). "Corresponding alleles" for human alleles are shown in Table 2. is there. Each column in Table 2 shows the identification sequence of the nucleotide at the polymorphic site of the “corresponding” human allele. Shows the unmutated 5 'proximal and 3' distal sequences, which are also attributes of the SNP. You. Since genomic DNA is double-stranded, each SNP has a positive or negative strand. Either can be defined. Therefore, for each SNP, one strand is immediately 5 ′ The other strand will contain the immediate 3 'distal non-mutated sequence. each In a preferred embodiment where the SNP polymorphism site "X" is a single nucleotide, Each strand of P double-stranded DNA has an immediate 5 ′ proximal unmutated sequence and an immediate 3 ′ distal unmutated Will contain both sequences. Preferred SNPs of the present invention are other nucleotides of one nucleotide at the SNP polymorphism. Regarding substitution with tides, SNPs are also more complex and Deletion of nucleotides from one of the two corresponding sequences or two corresponding sequences May include an insertion into one of the For example, in certain animals Rows contain A in a particular polymorphic site, whereas in other animals a single Or there may be multiple base substitutions. Preferred SNPs of the invention are non-mutated proximal SNPs have both non-mutated proximal sequences or both It may have only the non-mutated distal sequence. Nucleic acid molecules having a sequence complementary to the immediate non-mutated sequence immediately 3 ′ to the SNP are identified as When extended in a "type-dependent" manner, the extension product containing the polymorphic site of the SNP Can be formed. A preferred example of such a nucleic acid molecule is the 5 'proximal non-mutated A nucleic acid molecule having the same sequence as the sequence. "Template-dependent" extension refers to The polymerase extends the primer so that the sequence is complementary to the sequence of the nucleic acid template. Means that it can mediate. A “primer” is a nucleic acid in a “template-dependent” extension reaction. Can be extended by the covalent attachment of a nucleotide (or nucleotide analog) , Single-stranded oligonucleotide (or oligonucleotide analog) or single-stranded Refers to a strand polynucleotide (or polynucleotide analog). Such noh To be powerful, the primer must be 3 'hydroxyl (or polymerase mediated It must have a terminus for the other nucleic acid molecule (another chemical group suitable for extension). That is, it must be hybridized to a "template"). The primer is (1) The 3 'end of the primer is immediately adjacent to the target nucleotide of interest. As such, a unique 8 base or longer length complementary to a particular region of the target molecule Sequence and (2) specific and unique length, physical or chemical properties, neutral components 5 'tail. Most preferably, the complementary region of the primer is about 2 It has zero bases, but may be a shorter or longer primer. Typically, The complementary region of the primer is from about 12 to about 20 bases. 5 'tail neutral The components are non-specific and non-hybridizing, such as poly T, non-basic residues, etc. A polymer or a chemical group. "Polymerase" means that a nucleic acid molecule When hybridized to the template nucleic acid molecule, the nucleoside triphosphate (or To extend the 3 'hydroxyl group of the nucleic acid molecule by introducing Is an enzyme. The polymerase enzyme is available from Watson, J.D.,In: Molecular Biology o f the Gene Third Edition, Benjamin, Menlo Park, California (197 7) (discussed herein by reference) and similar sources I have. Commonly known as other polymerases, for example, "Klenow" polymerase Large Proteolytic Fragment of DNA Polymerase I from Escherichia coli DNA polymerase I, bacteriophage T7 DNA polymerase, etc. Can also be used to perform the methods described herein. Immediately after SNP A nucleic acid having the same sequence as the 3 ′ distal non-mutated sequence has the same sequence as the immediately 5 ′ proximal sequence. Template-dependent primer with one nucleotide extended in a template-dependent manner Can be ligated in any way. B. Benefits of using SNPs for gene analysis The single nucleotide polymorphism site of the present invention can be used for plant, animal, or microbial DNA. Can be used for analysis. Such sites include mammals, including humans, Mammals, livestock (dogs, cats, etc.), farm animals (cows, sheep, etc.) and other Suitable for analyzing genomes of animals of economic importance. However, the present invention Single nucleotide polymorphism sites are useful for other types of animals, plants, and microorganisms Can be. How many SNPs are in gene analysis compared to STR and VNTR? It has that distinct advantage. First, SNPs are more frequent (about 10 to 100) than STRs and VNTRs. Times higher) and more uniformly. The higher frequency of SNPs indicates that SNPs It means that it can be more easily identified than the class polymorphism. The distribution is more even One is to identify SNPs "closer" to a particular attribute of interest Enable. The combination of these two attributes makes SNPs extremely valuable I do. For example, certain genes with certain attributes (for example, predisposition to cancer) If it reflects a mutation at the locus, use all polymorphisms associated with the locus One can predict the probability that an individual will exhibit the trait. The value of such predictions is determined in part by the rejection between the polymorphism and the locus. Is done. Therefore, if the locus is any repetitive tandem nucleotide sequence VNTR analysis has only limited value if it is far from the motif Would be. Similarly, if the locus moves away from any detectable RFLP If so, the RFLP analysis will not be accurate. However, The light SNP is present at about every 300 bases in the mammalian genome, and has a uniform distribution. SNPs are statistically considered to be 150 salts of a particular genetic disorder or mutation. Can be found in the group. In fact, a particular mutation may itself be a SNP. So Therefore, if the sequence of such a locus has been determined, Mutations in the nucleotides determine the quality in question. Second, SNPs are more stable than other classes of polymorphisms. That natural sudden change The odds are about 10^-9And about 1,000 times less frequent than VNTR. Indeed, V The NTR polymorphism is characterized by a high mutation rate. Third, SNPs further reduce their allele frequency for relatively few representative examples. Has the advantage that it can be inferred from the study of The attributes of these SNPs are RFLP Identity, paternity elimination compared to that possible with either or VNTR polymorphisms (Patternity exclusion) and analysis of animal predisposition to certain genetic attributes Enables much more advanced genetic analysis. Fourth, SNPs are the best source of genetic information (nucleotide position and base identification). It reflects a highly efficient definition. Give such a high description Nevertheless, SNPs are much easier than RFLP or VNTR, and Detection can be performed with higher flexibility. In fact, since DNA is double-stranded, The complement of the allele can be analyzed to confirm the presence and identity of the SNP. Flexibility in characterizing identified SNPs is a salient feature of SNPs. And For example, VNTR-type polymorphisms can be used for size-fractionation methods that can discriminate between many repetitive mutations. It can be detected more easily. RFLP is obtained by restriction digestion following size fractionation. Most easily detectable. In contrast, SNPs can be characterized using any of a variety of methods. Such methods include direct or indirect sequencing of the site, Use of restriction enzymes when alleles create or destroy restriction sites, allele specific Use of different hybridization probes, encoded by different alleles of polymorphism Use of antibodies specific to the protein to be used, or other biochemical interpretations. (WO 92/15712, incorporated herein by reference). "Genetic Bit Analysis" (discussed below) Analysis) ”(“ GBA ”) method is based on the nucleotides present at single nucleotide polymorphism sites. This is a method for identifying otide. GBA is the site of mutation in the target DNA sequence. The variable nucleotide in the target DNA using the nucleotide sequence information around the Oligonucleotides complementary to the region immediately adjacent to but not containing the nucleotide This is a method for elucidating polymorphic sites by designing imers. Biological target DNA template Select from the sample and hybridize to the interrogation primer. this The primer can be used to bind one or more chain-terminating nucleoside triphosphate precursors (or Single labeled dideoxy using a DNA polymerase in the presence of the appropriate analog) Extend by nucleotides (or analogs). Cohen, D. et al. (PCT application WO 91/02087) provide for genotyping. Describes a related method by which to sequence at a desired locus. A single nucleotide with a single nucleotide using dideoxynucleotides Extend the immer. Dale et al. (PCT application WO 90/09455) disclose primers Used in conjunction with a single dideoxynucleotide species to sequence "variable sites" A method for doing so is disclosed. The method of Dale et al. Furthermore uses multiple primers. And the use of a separating element. Ritterband, M.R. (PCT application WO No. 95/17676) separates, concentrates and concentrates such target molecules in a liquid sample. An apparatus for detecting is described. Cheeseman, P. (US Pat. No. 5,302,5) No. 09) describes a related method for determining the sequence of single-stranded DNA molecules. Cheeseman's method uses a fluorescently labeled 3 'protected nucleotide triphosphate (each base is different). What With a fluorescent label). Wallace et al. (PCT application WO89 / 10414) describe allele-specific primers. Can be used to replicate multiple regions of the target simultaneously. Several PCR methods are described. By using allele-specific primers Amplification occurs only when a particular allele is present in the sample. Some primer-guided nucleotides for assaying polymorphic sites in DNA Introduced methods are described (Komher, J.S. et al.Nucl . Acids Res. 17: 7779-77 84 (1989); Sokolov, B.P.,Nucl . Acids Res. 18: 3671 (1990); Syvanen, A.-C. Et al.,Genomics 8: 684-692 (1990); Kuppuswamy, M.N., et al.Proc . Natl. Acad. Sci . (USA) 88: 1143-1147 (1991); Prezant, T.R., et al.Hum . Mutat. 1: 159〜164 (199 2); Ugozzoli, L. et al.GATA 9: 107-112 (1992); Nyren, P. et al.Anal . Biochem. 20 8 : 171-175 (1993)). All of these methods are used to identify bases at polymorphic sites. It differs from GBA in that it relies on the introduction of labeled deoxynucleotides. In such a format, the signal is the number of deoxynucleotides introduced. Polymorphism in runs of the same nucleotide is proportional to the length of the run. Can result in a proportional signal (Syvanen, A.-C. et al.Amer . J. Hum. Ge net. 52: 46-59 (1993)). Such a predetermined range of locus-specific signals is In the case of heterozygotes, for example, a simple three-component (2: 0, Interpretation can be more difficult compared to signals of the 1: 1 or 0: 2) class. Sa Furthermore, at certain loci, in the presence of the correct dideoxynucleotide, Even the introduction of incorrect deoxynucleotides can occur (Komher, J.S. et al.Nucl . Acids Res. 17: 7779-7784 (1989)). Such deoxynucleotide errors The introduction may, in some sequence contexts, be carried out on the DNA port to the mispaired deoxy-substrate. The Km of the lymerase is relatively small for the correctly base-paired dideoxy-substrate. m (Kornberg, A. et al., In: DNA Replication, 2nd ed. (1992) , Freeman and Company, New York; Tabor, S. et al.Proc . Natl . Acad. Sci. (USA) 86: 4076-4080 (1989)). This effect is useful for elucidating polymorphic sites. Contribute to background noise. In contrast to all such methods, the method of the present invention involves multiple SNPs. Or very easy to determine the nucleotides. II. How to discover new polymorphic sites A preferred method for finding polymorphic sites is to use genomes from multiple haploid genomes. Includes comparative sequencing of DNA fragments. In a preferred embodiment, as shown in FIG. In addition, such sequencing can result in 0.5 to 3 Kb of DNA from one member of a species. This is performed by preparing a random genomic library containing fragments. By the way, Using the sequences of these recombinants, the same genotype of a large number of randomly selected individuals of the species Facilitates PCR sequencing at the gene locus. Hundreds of such genomic libraries (typically about 50,000 clones) Purify (200-500) individual clones and sequence the ends of the insert I do. To enable PCR amplification of the cloned region, only a small amount of It suffices if column information (100 to 200 bases) can be obtained. The purpose of this sequencing is to Suitable for mediating the amplification of equivalent fragments from genomic DNA samples of other members The purpose is to obtain sufficient sequence information to enable the synthesis of the primer. Preferably Such sequencing is performed using the cycle sequencing method. D from a panel of randomly selected members of the target species using these primers Amplify NA. The number of members in the panel determines the lowest frequency of polymorphism to be isolated. You. Therefore, if 6 members are evaluated, for example, a frequency of 0.01 The existing polymorphism may not be identified. Illustrative but oversimplified mathematical treatment In the sampling of 6 members, about 0.08 (ie, a total frequency Only polymorphisms occurring more frequently than 6 members divided by 2 alleles per genome) were identified. It is expected that Therefore, if one wants to identify lower frequency polymorphisms, A number of panel members must be evaluated. Cycle sequence analysis (Mullis, K. et al.Cold Spring Harbor Symp . Quant. Biol. 51 : 263-273 (1986); Erlich H. et al., European Patent Application No. 50,424; Europe Patent Application No. 84,796, European Patent Application No. 258,017, European Patent Application No. 2 37,362; Mullis, K., European Patent Application No. 201,184; Mullis, K. et al., United States. U.S. Patent No. 4,683,202; Erlich, H .; U.S. Patent No. 4,582,788; Saiki, R. et al., US Patent No. 4,683,194) discloses an automatic DNA sequencing apparatus and software. (Applied Biosystems, Inc.). Like that By examining the relevant parts of the chromatogram on the computer screen. Thus, differences between the sequences of different animals can be identified and confirmed. The difference is that both chains Data are available and are present in more than one haploid case in the animal population tested Only when the expression is performed, it is interpreted as reflecting the DNA polymorphism. FIG. Cycle sequencing of nome fragments to identify novel polymorphic sequences Here is a preferred method. Electroelution of PCR fragments from animals from acrylamide gel And the presence of a mixture of dNTPs and fluorescently or chemically labeled ddNTPs Sequencing using repeated cycles of thermostable Taq DNA polymerase under . The product was then separated and analyzed by Applied Biosystems, Inc. automated DNA sequencing. Analyze using a singing device. The obtained data was analyzed using ABI software. Analyze. Differences between the sequences of different animals are identified by the software and Identified by examining the relevant parts of the chromatogram on the computer screen. You. The difference is that data were obtained for both chains and more than one half of the five horses tested. It is represented as a "DNA polymorphism" only if it is present in several instances. Top panel Indicates "A" homozygotes, the middle panel indicates "AT" heterozygotes, The tier panel shows "T" homozygotes. Alternatively, polymorphic site discovery can be performed using the strategy shown in FIG. You. In this embodiment, the DNA sequence polymorphism is determined by the pC from unrelated member genomic template. A restriction enzyme obtained by reacting a panel of several restriction enzymes on the product of the R reaction. Identified by comparing the nuclease cleavage profiles. Most preferred Alternatively, each restriction endonuclease used has a four base recognition sequence, Thus, the desired number of cleavages in the amplification product will be possible. Restriction digestion patterns obtained from genomic DNA can be analyzed using the corresponding plasmid template. Compare directly with the pattern obtained from the obtained PCR product. Such a comparison is , Amplified sequences from genomic DNA and plasmid DNA from equivalent loci Provide an internal control to indicate that This control can also be a repeat sequence or Allows identification of primers that have accidentally amplified a multicopy locus. why Then these are more from the genomic DNA template than from the plasmid template of Because it will generate fragments. III. Methods for genotyping single nucleotide polymorphisms of the invention In order to identify the polymorphic site "X" of the single nucleotide polymorphism of the present invention, various methods are used. Any of the methods can be used. A preferred method of such identification is to analyze and For each polymorphism that has been identified. Therefore, this appro Is an RFLP method that analyzes band patterns rather than specific sequences of polymorphisms. Significantly different from A. Amplification-based analysis Detection of a polymorphic site in a DNA sample is facilitated by using a DNA amplification method. You. Such methods may involve sequences that span the polymorphic site, or a distance between the polymorphic site and the site. Or the sequence located in the proximal or proximal position is specifically increased. That Amplified molecules can be easily detected by gel electrophoresis or other means . The most preferred way to achieve such amplification is in its double-stranded form. Using a primer pair capable of hybridizing to a proximal sequence that defines a polymorphism Adopt R. Alternative methods such as “ligase chain reaction” (“LCR”) can be used instead of PCR. (Barany, F.Proc . Natl. Acad. Sci. (USA) 88: 189〜193 (1991) ). LCR indexes specific targets using two pairs of oligonucleotide probes Amplify functionally. The sequence of each oligonucleotide pair is determined by the Select so that it can hybridize to the upper adjacent sequence. Such a hybrid The ligation forms a template-dependent ligase substrate. Like PCR, The product thus obtained serves as a template for the subsequent cycles and provides the desired sequence index. Numerical amplification is obtained. In accordance with the present invention, the LCR derives proximal and distal sequences on the same strand of the polymorphic site. It can be carried out by using oligonucleotides having each. In one embodiment Designed so that either oligonucleotide contains the actual polymorphic site of the polymorphism Will be done. In such embodiments, the reaction conditions are such that the target molecule is the oligonucleotide. Contain or contain specific nucleotides complementary to polymorphic sites present on nucleotides So that oligonucleotides can be ligated together only if deleted Selection Select. In another embodiment, the oligonucleotides do not contain polymorphic sites and they When hybridized to offspring, a "gap" will be created (Segev, D . See PCT application WO 90/01069). This gap then has complementary by dNTPs (mediated by DNA polymerase) or by another oligonucleotide Filled with nucleotide pairs. Therefore, at the end of each cycle each single strand Has a complementary strand that can serve as a target in the next cycle, and Numerical amplification is obtained. "Oligonucleotide Ligation Assay (" OLA ")" (Landegren) , U. et al.Science 2411077-1080 (1988)) have certain similarities to LCR. Can be adapted for use in type analysis. In the OLA protocol, the target Two oligonucleotides designed to hybridize to adjacent sequences of the main strand Use otide. OLA, like LCR, is particularly suitable for detecting point mutations. Only However, unlike LCR, OLA is not an exponential amplification of the target sequence, A "linear" amplification is obtained. Nickerson, D.A., et al. Have developed a nucleic acid detection system that combines the attributes of PCR and OLA. Say (Nickerson, D.A. et al.Proc . Natl. Acad. Sci. (USA) 87: 8923-8927 (1990)). In this way, exponential amplification of the target DNA is achieved. PCR is used to detect the target DNA, and the target DNA is detected using OLA. Multiple and separate In addition to the need for various processing steps, one of the Problem is that it inherits all the problems associated with PCR and OLA It is. Two (or more) oligonucleotides are combined into the resulting Ligation in the presence of a nucleic acid having the sequence Thus, a method for amplifying the di-oligonucleotide is also known (Wu, D.Y., et al.). ,Genomics Four: 560 (1989)) and can be easily adapted for the purposes of the present invention. You. Other known nucleic acid amplification methods, such as transcription-based amplification systems (Malek, L.T. et al. U.S. Patent No. 5,130,238; Davey, C. et al., European Patent Application No. 329,822; Schuster et al., U.S. Patent No. 5,169,766; Miller, H.I. et al., PCT application WO89 /. 06700; Kwoh, D. et al.Proc . Natl. Acad. Sci. (USA) 86: 1173 (1989); G ingeras, TR et al., PCT application WO 88/10315) or isothermal amplification (Walke). r, G.T.Proc . Natl. Acad. Sic. (USA) 89: 392-396 (1992)) Can be B. Preparation of single-stranded DNA Direct analysis of SNP sequences in the present invention is also known as "Sanger method". "Dideoxy-mediated chain termination method" (Sanger, F. et al.J . Molec. Biol. 94： 441 (1975) ) Or "chemical decomposition method" (Maxa-Gilbert method) m, A.M.Proc . Natl. Acad. Sci. (USA) 74: 560 (1977)) (see both references) Therefore, any of the methods described above can be used. Zideoki Determine the DNA sequence using either the mediation method or the Maxam-Gilbert method. The method of defining is well known to those skilled in the art. One such method is, for example, Samb rook, J. et al.Molecular Cloning , A Laboratory Manual, 2nd edition,Cold Su Pulling Harbor Press , Cold Spring Harbor, New York (1989) and Zyskind, J.W. et al.Recombinant DNA Laboratory Manual,A Cademic Press New York (1988) (both references are incorporated herein by reference. Which is incorporated herein by reference. When the nucleic acid sample contains double-stranded DNA (or RNA), or double-stranded nucleic acid amplification When a protocol (such as PCR) is used, one of the two strands is enriched. Double-stranded molecule, preferably so as to obtain a preparation containing only one strand preferentially. It is generally desirable to perform such sequence analysis after processing. However, heat Heat and cool the reaction one or more times using a stable polymerase If so, the present invention does not require the generation of a single-stranded DNA template. By this processing The double-stranded template is separated from its complementary strand, and the subsequent cooling step Can be annealed with the primer. Hybridization with other template strands Competition for the isomerization can be compensated for by repeated cycles of melting-cooling conditions. The simplest way to produce single-stranded DNA molecules from double-stranded DNA is to use heat Or a modification using either of the alkali treatments. Single-stranded DNA molecules also It can be produced using single-stranded DNA bacteriophage M13 (Messin g, J. et al.Meth . Enzymol. 101: 20 (1983); Sambrook, J. et al.Molecular Cloning . A Laboratory Manual , Cold Spring Harbor Laboratory (See also Les, Cold Spring Harbor, New York (1989).) . Several alternatives can be used to generate single-stranded DNA molecules. Gyllensten, U. et al.(Proc . Natl. Acad. Sci. (USA) 85: 7652-7656 (1988)) and And Mihovilovic, M. et al. (BioTechniques 7 (1): 14 (1989)) is an asymmetric PCR. The method describes a method in which primers are present at different molar concentrations. Perform a standard "PCR" procedure using. Higuchi, R.G. et al. (Nucl . Acids Res. 17: 586 5 (1985)) illustrates another method for producing single-stranded amplification products. This Method phosphorylates the 5 'end of one strand of the double-stranded amplification product and then 5' → 3 'Preferentially phosphorylated chains to exonucleases (such as T7 exonuclease) The content shall be disassembled. Other methods also involve phosphorothioate attraction to produce single-stranded DNA molecules. Utilizes the nuclease resistance properties of conductors (Benkovic et al., US Pat. No. 4,521,50) No. 9; Saysers, J.R. et al.Nucl . Acids Res. 16: 791-802 (1988); Eckstein, F. et al.Biochemistry Fifteen: 1685-1691 (1976); Ott, J. et al.Biochemistry 26: 8237〜8241 (1 987)). On the relative advantages and disadvantages of methods of producing such single-stranded molecules For a discussion, see Nikiforov, T. (U.S. patent application Ser. No. 08 / 005,061 (filed June 1994). (Discarded 24), cited herein for reference). In the most preferred embodiment, a phosphorothioate derivative is included in the primer. The nucleotide derivative can be introduced at any position of the primer, Preferably introduced at the 5 'end of the primer, most preferably adjacent to each other Will. Preferably, the primer molecule has a complementary region of about 25 nucleotides in length. About 4% to about 100% (compared to all residues), more preferably about 4% Will contain from about 40%, most preferably about 16%, of phosphorothioate residues. . These nucleotides may be introduced at any position of the primer and are adjacent to each other. Contact Or may be interspersed with all or part of the primer. In one embodiment, the invention is for use with an amplification protocol, such as PCR. Can be. In this embodiment, the established P The above primer can be used without changing the CR protocol The number of phosphorothioate linkages in the primer is about 10 (about half the length of the primer) Minutes). Phosphorothioate linkage beyond primer When PCR conditions are included, PCR conditions are especially It is necessary to adjust with respect to the temperature. The introduction of such nucleotide derivatives into DNA or RNA is accomplished by DNA It can be performed enzymatically using limerase (Vosberg, HP, et al.Biochemistr y 16: 3633-3640 (1977); Burgers, P.M.J. et al.J. Biol. Chem. 254: 6889〜6893 (1979); Kunkel, T.A., et al., In:Nucleic Acids and Molecular Biology Vol. 2,12 4-135 (Eckstein, F. et al. Eds.), Springer-Veerark, Berlin (1988); Olsen, D.B. et al.Proc . Natl. Acad. Sic. (USA) 87:, 1451-1455 (1990); Gr iep, M.A.Biochemistry 29: 9006-9014 (1990); Saysers, J.R., et al.Nucl . Aci ds Res. 16: 791-802 (1988)). Alternatively, phosphorothioate nucleoti Derivatives can be synthetically introduced into oligonucleotides (Zon, G .;Anti-Canc . Drug Des. 6: 539-568 (1991)). The primer molecule hybridizes to a complementary target nucleic acid molecule and then preferably Can be extended by a polymerase to form an extension product. Primer Of phosphorothioate nucleotides in the nuclease attack extension products Make it resistant to As noted, phosphorothioate or other suitable Amplification products containing nucleotide derivatives include T7 exonuclease and exonuclease. “Exclusion” (ie, degradation) by “5 ′ → 3 ′” exonuclease such as ) Is substantially resistant and therefore the 5 ′ → 3 ′ exonuclease When it comes to the holothioate residue, the exonuclease binds the nucleic acid molecule. No further decomposition would be possible. Since the target molecule lacks nuclease resistant residues, the extension product and its template (target When the target is incubated in the presence of 5 ′ → 3 ′ exonuclease, the template strand breaks. Breakage results, whereby preferential production of the desired single strand is achieved. C. Hybridization of DNA in solution A preferred method for determining the identity of a polymorphic polymorphic site is nucleic acid hybridization. Including Such hybridization can be performed on a solid phase. Kiruga (Saiki, R.K. et al.,Proc . Natl. Acad. Sci. (USA) 86: 6230〜6234 (198 9); Gilham et al.J . Amer. Chem. Soc. 86: 4982 (1964) and Krembsky et al.,Nucl . Acids Res. Fifteen: 3131-3139 (1987)), and preferably hybridized in a solution. Shin (Berk, A.J. et al.Cell 12: 721-732 (1977); Hood, L.E., et al., In:Molecular B iology of Eukaryotic Cells: A Problems Approach , Menlo Park, Califor Lunia: Benjamin-Cummings (1975); Wetmer, J.G.Hybridization and R enaturation Kinetics of Nucleic Acids . Ann . Rev. Biophys. Bioeng. Five: 33 7-361 (1976); Itakura, K. et al.Ann . Rev. Biochem. 53: 323-356 (1984)). For application to high capacity testing, it is desirable to use a non-radioactive detection method. Soy sauce Fluorescently labeled or haptenated dideoxynucleotides It is preferred to use The use of biotinylated ddNTPs is based on four of each (3-amino Nopropin-1-yl) nucleoside triphosphate is converted to sulfosuccinimidyl 6- ( It is preferably prepared by reacting (biotinamide) hexanoate. . At that time, the (3-aminopropyn-1-yl) nucleoside 5′-triphosphate is H obbs, by F.W. (J . Org. Chem. 54: 3420-3422 (1989)) and Hobbs, F.W. (US Pat. No. 5,047,519). D. Analysis of polymorphic sites 1. Polymerase-mediated analysis The identity of the nucleotides at the polymorphic site of the invention can be determined, for example, by Goelet, P. et al. Variants of the disclosed oligonucleotide-based diagnostic assays for nucleic acid sequence variations (PC T application WO 92/15712, incorporated herein by reference). it can. In particular, the present invention allows simultaneous or near-simultaneous analysis of multiple SNPs Or US Patent Application Ser. No. 08 / 216,538 for ease of reference. To the SNP assay described in US Pat. Things. To achieve such an advance, the present invention preferably provides the target molecule with a solution in solution. One or more purified interrogates having a predetermined sequence capable of hybridizing Use a solution oligonucleotide. "Interrogation Oligonucleotides "Is" generally refers to the immediate or proximal region of one or more single nucleotide polymorphisms. An oligonucleotide primer having a sequence complementary to a distal sequence. In a preferred embodiment, one or more sequences having a sequence complementary to a particular region of the target molecule. Use the above method for additional interrogation oligonucleotide primers And prepare it. Preferably, the primer is about 1 complementary to a particular region of the target molecule. It has 2 to 20 bases. These oligonucleotide primers So that the 3 'end of is immediately adjacent to the target nucleotide (eg, SNP) of interest Hybridizes to the target molecule. Preferably, an oligonucleotide primer Are neutral components (eg, poly A, non-basic residues, or other non-specific hybrids). A non-redundant polymer or chemical label). Most preferred In a preferred embodiment, the neutral components are assigned a specific unique length. Primer May or may not include a primer-specific label. I However, in the most preferred embodiment, the oligonucleotide primer is a primer. Includes a mer-specific label. The interrogation primer is then replaced with one or more single nucleotides. A target DNA molecule having a polymorphism (preferably a genomic DNA molecule) (for each SNP About its immediate 3 'distal sequence is complementary to the interrogation primer ), DNA polymerase and strand terminating nucleotides (or nucleotide analogs) G) Incubate in the presence of the triphosphate derivative. Preferably, such an Incubation is based on dNTPs (ie, dATP, dGTP, dCTP, d TTP) in the total absence of one or more chain-terminating nucleotide triphosphates Conductor (or analog) (eg, ddATP, ddGTP, ddCTP, d a single base of the derivative to the 3 'end of the primer in the presence of Perform under conditions that allow introduction. Unintroduced nucleotide triphosphate present in the reaction solution Is not critical to the reaction, but such unintroduced nucleotides It can be separated by several means. The identity of the introduced nucleotide is Is determined by the nucleotide at the polymorphic site of the polymorphism and is complementary to the nucleotide . In the present invention, the non-extendable nucleotide is preferably³²P or fluorescent molecule May be labeled. Other labels suitable for the present invention include, but are not limited to: But biotin, iminobiotin, hapten, antigen, cofactor, dinitrophenol Lipoic acid, olefin compounds, detectable polypeptides, high electron density (el ectron dense) molecules, enzymes that can deposit insoluble reaction products. Departure Fluoresceins, loaders include, but are not limited to, Min, Texas Red, FAM, JOE, TAMRA, ROX, HEX, TET , Cy3, Cy3.5, Cy5, Cy5.5, IRD40, IRD41 and BO DIPY. Electron dense indices suitable for the present invention ator) molecules include, but are not limited to, ferritin, Nin and colloidal gold. A detectable polypeptide is detectable Specifically conjugates a polypeptide to a second polypeptide covalently linked to an indicator molecule It may be indirectly detectable by forming a body. In such an aspect The detectable polypeptide is a group consisting of avidin and streptavidin Preferably, the second polypeptide is biotin and iminobioti Preferably, it is selected from the group consisting of In a preferred embodiment, the resulting extended primer is placed on a suitable matrix. Separate for analysis. To separate extended primers for analysis Any of a number of methods can be used. One such method is But not limited to mass spectrometry, (oligonucleotides Array hybridization) flow cytometry, HPLC, FPLC, Size exclusion chromatography, affinity chromatography, gel electricity Electrophoresis and the like. Preferably, the extended primer is separated under denaturing conditions. However, denaturing conditions are not required for effective separation. Non-extendable nucleotides are synthetic or can be introduced in a template-dependent manner. Refers to naturally occurring nucleotide analogs. Suitable for use in the present invention Synthetic or naturally occurring nucleotide analogs Non-cyclic ribose nucleotide analogs, substituted ribose nucleosides Tide analogs, and modified ribose nucleotide analogs. Synthetic The nucleotide analog is preferably a fructose-based nucleotide analog. Chemically modified probes that retain the ability to specifically base pair with natural nucleotides Phosphorus, a chemical modification that retains the ability to specifically base pair with natural nucleotides Retains the ability to base pair specifically with pyrimidines and natural nucleotides Selected from the group consisting of all compounds. In the most preferred embodiment, the separation of the extended primer Denaturing size separation matrices, such as standard sequencing gels with mid concentrations Do above. In this embodiment, an interrogation system comprising a specific unique length of 5 'tail is provided. Use primers. The extended primer is a specific unique length of 5 'tape. Is separated based on the file. In a preferred embodiment, the labeled chain terminating nucleotide ( Or nucleotide analogs), but this embodiment also uses It is also directed to the gating primer. In one sub-embodiment, A chain terminating nucleotide (ie, dideoxynucleotide) is used. Another subform In one embodiment, one or more strand terminators are labeled with only a single strand terminating nucleotide. Use nucleotides. In another preferred embodiment, it will hybridize to a complementary sequence located on the solid phase. Interrogation primers containing unique and specific sequences are used. This aspect Now, expose the interrogation primer to the placed capture primer and fix it. By detecting labeled bases based on their position on the phase capture primer array, Separated by identifying single nucleotide polymorphisms. In another embodiment, the resulting extended primer has an appropriate affinity Separated using separation method. Such affinity separation methods are generally Ter-ligand method (eg, avidin-streptavidin, etc.), monochrome Related to the local antibody method. In this embodiment, the 5 'end of each primer is Binds to the unique ligand present. In this embodiment, Put the seven puters on a solid surface (ie, beads, columns, dipsticks, microphones). It is preferably immobilized on a titer plate. Ligand labeled plies The mer is separated by exposing the reaction mixture to the corresponding receptor. Incidentally The identity of each single nucleotide polymorphism can be determined using the methods described above. In other embodiments, residues of unique sizes (eg, BSA, lysozyme, egg alga) Using primers (covalently or otherwise) conjugated to (eg, cumin) . In this embodiment, unique size primers are appropriately size exclusion chromatographic. Pass the primer through a feed column and use the above method to make each single nucleotide polymorphism. Are determined by determining the identity of In the present invention, a single reaction can be performed by using the above methods in combination. The number of SNPs that can be identified can be increased. Labels suitable for use in the present invention include, but are not limited to: Not available, but enzymes (β-galactosidase, luciferase, etc.), radioisotopes ( That is,³²P,¹³C,^ThreeH), fluorescent residues (ie, fluorescein, And chromogens. Primers can be labeled directly Or to a separate ligand, labeled or unlabeled You can also. The ligand molecule is covalently bound to the oligonucleotide primer, Ionic interactions, non-specific adsorption, or specific but non-covalent ligands It can be bound by receptor interaction. A ligand is generally a protein of interest for which there is a corresponding distinct receptor Or a chemical substance. Suitable ligands for use in the present invention include these Hapten, antigen, cofactor, biotin, iminobioti , Dinitrophenol, riboacid, olefin compound, oligonucleotide, phase Tags designed to specifically hybridize to complementary oligonucleotides Protein nucleic acid ("PNA") sequences and PNA sequences functioning as receptors Is mentioned. Other ligands suitable for use in the present invention include, but are not limited to: Antibodies, enzymes, polypeptides, streptavidin and Avidin is mentioned. In one embodiment, the ligand is a detectably labeled polypeptide. A complex can be formed by binding to the polypeptide. Used in the present invention Suitable detectable labels include, but are not limited to, , Enzymes capable of depositing insoluble reaction products, streptavidin and avidin Is mentioned. Preferably, the detectably labeled polypeptide is produced randomly. Select from the resulting polypeptide library. A receptor is generally a defined protein or protein on which the corresponding ligand is present. Refers to a chemical substance. Receptors suitable for use in the present invention include, but are not limited to: Antigens, cofactors, biotin, iminobiotin, dinitroph Enols, lipoic acids, olefinic compounds, oligonucleotides, complementary oligonucleotides A PNA sequence designed to specifically hybridize to a nucleotide; and And a PNA sequence that functions as a ligand. In one embodiment, the reception The putter forms a complex by binding to a detectably labeled polypeptide. Can be Suitable detectable labels for use in the present invention include: Antibodies, enzymes that can deposit insoluble reaction products, Leptoavidin and avidin are mentioned. Preferably, it is detectably labeled The polypeptide is selected from a randomly generated polypeptide library. The receptor may be bound to a matrix. Binding receptor to matrix Suitable methods for this include, but are not limited to, covalent bonding, Interaction, nonspecific adsorption, or specific but noncovalent ligand-receptor Putter interaction. Ligand-receptor suitable for use in the present invention One is, but not limited to, a complementary hybridizing nucleus. Acids, complementary hybridizing PNAs, and other complementary synthetic nucleic acid analogs Is mentioned. In a preferred embodiment, the extended primer is separated prior to detection. It can also be used to identify SNPs without separating primers . In this embodiment, (either by labeling the oligonucleotide or by the oligonucleotide Nucleotide interrogation primer (by not labeling the tide) At least one chain-terminating nucleotide triphosphate so that the addition to Label conductors uniquely. Therefore, for example, three different SNPs were When three primers were used in the presence of labeled ddATP for The introduction of the label indicates that one of the SNPs is T. Identification of primers by primer-specific labeling and introduced nucleotides , Enables genotyping of target molecules. Therefore, the nucleotide at the polymorphic site is Whichever of the set of labeled nucleotides is By assaying whether it was introduced at the 3 'end of the oligonucleotide Is done. Non-extendable nucleotides or nucleotide analogs can be Or it can be identified by any of the chemical methods. However, preferred physical Or chemical means, polarization spectroscopy, mass spectroscopy, infrared spectroscopy, ultraviolet spectroscopy It is selected from the group consisting of analysis, visible light spectroscopy or NMR spectroscopy. Although the present invention is directed to a method for identifying a plurality of SNPs in a single reaction, The present invention can also confirm the identity of multiple SNPs in a single reaction. In this embodiment, each SNP for both the plus and minus strands of the target nucleic acid Identity is determined as described above. The sequence is pro- cessed for each SNP analyzed. Identified when the ras and minus strands are complementary. 2. Polymerase / ligase-mediated analysis In another embodiment, the nucleotide at the polymorphic site is a polymerase / ligase mediated method Is identified using As in the above embodiment, multiple SNPs are detected by the same reaction To use multiple oligonucleotide primers simultaneously. Oligonucleotides complementary to the immediate non-mutated sequence immediately 3 'of the SNP, as in the above embodiment. Use a reotide primer. Complementary to the 5 'proximal sequence of the polymorphism to be analyzed but not Second oligonucleotide that cannot hybridize to the oligonucleotide primer Use These oligonucleotides are converted to DNA containing single nucleotide polymorphisms to be analyzed. And in the presence of at least one 2 ', 5'-deoxynucleotide triphosphate Incubate. This incubation reaction is further performed with DNA Polymerase. And DNA ligase. Therefore, both oligonucleotides have the same single nucleotide polymorphism to be analyzed. It can hybridize to a chain. Due to sequence considerations, these two oligos Nucleotides are located proximal to the SNP flanking the polymorphic site (X). And to the distal sequence; hence, these hybrids The modified oligonucleotide is a single nucleotide "gis" at the exact location of the polymorphic site "Caps". Polymerase and 2 ', 5'-deoxynucleotide triphosphate complementary to (X) The presence of an acid extends the complementary 2 ', 5'-deoxynucleotide triphosphate Primer and a hybridized oligonucleotide complementary to the distal sequence. Ligation is possible and 2 ', 5'-deoxy complementary to polymorphic nucleotides Synonucleotide triphosphates generate a ligatable substrate. Then, the identity of the polymorphic site opposite the "gap" can be determined by several means. It can be determined by any of them. In a preferred embodiment, the 2 ', 5'- The deoxynucleotide triphosphate is labeled, and its detection complements the polymorphic site. Target nucleotides. Several different 2 ', 5'-deoxynucleotide tris Phosphoric acid may be present and each is labeled differently. Alternatively, separate anti And a different 2 ', 5'-deoxynucleotide triphosphate is used in each reaction. In another sub-embodiment, the 2 ′, 5′-deoxynucleotide triphosphate is unlabeled, Label the two soluble oligonucleotides. Perform separate reactions, each with a different 2 ', 5'-deoxynucleotide triphosphate is used. The above embodiment details a polymerase / ligase mediated method for detection of a single polymorphic site. As mentioned, this method provides multiple unique options for the detection of multiple polymorphic sites. It is generally understood that simultaneous use of lignonucleotide primers can be employed Will. E. FIG. Signal amplification Nucleic acid hybridization detection assay sensitivity does not report detection to observer It can be enhanced by changing the way it signals. Therefore For example, assay sensitivity can be increased by using detectably labeled reagents. Can be. A wide range of such signal amplification methods have been designed for this purpose. Have been. Kourilsky et al. (U.S. Pat. No. 4,581,333) report detection using enzyme labels. It describes increasing the sensitivity of the assay. Fluorescent labels (Albarella et al., EP 14491 No. 4), chemical labels (Sheldon III et al., US Pat. No. 4,582,789; Albarella et al., US Patent No. 4,563,417), modified bases (Miyoshi et al., EP 119448), etc. It has been used in an effort to improve the efficiency of observing solicitation. Automatic or semi-automatic detection of the identity of the introduced nucleotide using an appropriate detector Using fluorescent or chromogenic (especially enzyme) labels as can be determined by Is preferred. IV. Use of SNP genotyping in methods of genetic analysis A. General considerations for using single nucleotide polymorphisms in gene analysis The usefulness of the polymorphic site of the present invention is that such a site can Can be used to predict the statistical probability of having the same allele Based on what you can do. Statistical analysis of SNPs can be used for any of a variety of purposes. If a particular individual has been previously tested, such a test may be -Prints, which determine the identity of a particular individual. Can be used to If the putative parent or both parents of a particular individual are tested, The law determines whether a particular animal is a descendant of such parent or both parents. Can be used to determine likelihood. Hence, the detection and analysis of SNPs Is the paternity of a male (a child of a particular child) Selected females, which excludes the father's paternity, etc.) or has certain individuals (E.g., to determine the probability of being a descendant of a particular child and a selected mother) Can be. As shown below, the present invention allows the construction of a genetic map of a target species. Soy sauce Of certain animals (or plants) to such genetic diseases, conditions, or attributes To predict a predisposition, a particular array of polymorphisms identified by the method of the invention is identified by a particular array. Can be associated with attributes. As used herein, “characteristics” refers to “genetic diseases”, It is intended to encompass "conditions" or "characteristics." "Genetic disease" Can be detected by mutation, whether detectable or asymptomatic. Refers to the pathological condition caused. "Condition" refers to a characteristic (asthma, weak bone, blindness) Ulcers, cancer, diseases of the heart or cardiovascular system, skeletal-muscular defects, etc.) . "Characteristics" are attributes that confer economic value on a plant or animal. Examples of characteristics and Examples include longevity, speed, durability, aging speed, and reproductive ability. B. Identification and parental confirmation The most useful measurements to determine the efficacy of the identification and paternity test system are (i) "Probability of identity (p (ID))" and (ii) "exclusion" (Probability of exclusion) (p (exc)). " p (ID) is 2 Two random individuals will have the same genotype for a given polymorphic marker To calculate the likelihood. p (exc) is, for a given polymorphic marker, A random male is a father in an average paternity case where mother identity is not an issue. Calculates the likelihood of having a genotype that is incompatible with being a parent You. A single locus (a locus with multiple alleles, such as a major histocompatibility region) Does not provide an appropriate statistically-reliable test for paternity testing Since rare, the desired test preferably parallels multiple unlinked loci. Will be measured. Cumulative probability of identity or non-identity, and cumulative paternity exclusion Probability is calculated by multiplying the probability given by each locus by Decided for the locator test. The most interesting statistical measures are (i) the cumulative probability of non-identity (cump (nonID)) ) And (ii) the cumulative probability of paternity exclusion (cump (exc)). The equations used to calculate these probability values are shown below. For simplicity, these Are first given to two allele loci, one allele is called type A, and the other is It is called B type. Four genotypes are possible in such a model: AA, AB , BA and BB (AB and BA types cannot be distinguished biochemically ). The frequency of alleles is the number of A's found in the haploid genome (f (A), The frequency of B (f (B), the frequency of B is “q” (where q = 1 −p)). Predetermined at a given genotype locus The genotype probabilities are shown below: Homozygote: p (AA) = p^Two Single heterozygote: p (AB) = p (BA) = pq = p (1-p) Both heterozygotes: p (AB + BA) = 2pq = 2p (1-p) Homozygote: p (BB) = q^Two= (1-p)^Two Probability of identity at a locus (ie, randomly picking from a population) The probability that the two individuals will have the same genotype at a given locus) , Equation: p (ID) = (p^Two)^Two+ (2pq)^Two+ (Q^Two)^Two Given by Therefore, the cumulative probability of identity for n loci is given by the equation: Given by Cumulative probability of non-identity for n loci (ie, random Probability that two individuals picked at different times will differ at one or more loci ) Is the equation: cum p (nonID) = 1-cum p (ID) Given by Probability of paternity elimination (random males question maternal identity for a given locus) Genes that are incompatible with being male parents in an unnamed average paternity case Indicating the probability of having a type) is given by the equation: p (exc) = pq (1-pq) Given by Probability of non-exclusion (random male at given locus is average paternity case Represents the probability of not being biochemically excluded as a male parent in the equation): p (non-exc) = 1-p (exc) Given by Therefore, the non-excluded cumulative probability (the value obtained when using n loci Is) It is. Cumulative probability of exclusion (using a panel of n loci, random male Biochemically excluded as a male parent in an untitled average paternity case Is the probability that cum p (exc) = 1-cump (non-exc) Given by These calculations extend to any number of alleles at a given locus it can. For example, three alleles each having a frequency p, q and r in the population The probability of identity p (ID) for the rel system is equal to the sum of the squares of the genotype frequencies : p (ID) = p^Four+ (2pq)^Two+ (2qr)^Two+ (2pr)^Two+ R^Four+ Q^Four Similarly, the probability of exclusion for the three allele system is p (exc) = pq (1-pq) + qr (1-qr) + pr (1-pr) + 3pqr (1-pqr) Given by At the n allele locus, p (ID) and And p (exc). FIG. 3 and FIG. 4 show that both the number and type of loci used It shows how nonID) and cum p (exc) increase. Using 3 allele system It can be seen that better discriminative power can be achieved with fewer markers. 3 and 4, triangles increase the number of loci with two alleles The common allele is the frequency of p = 0.79. Exists in degrees. X in FIGS. 3 and 4 means p = 0.5, q = 0.34 and r A similar analysis is shown for an increase in the number of triallelic loci at = 0.15. However, any loci with 2, 3 or more alleles should be selected. The choice is largely influenced by the biochemical considerations described above. Polymorphism analysis tests Can be designed to score any number of alleles at a given locus . If alleles are scored using gel electrophoresis, each allele should be It must be easy to analyze by movement. With multiple allele families Are often small, so human DNs using multiple allele families The A test includes a statistical correction for misidentification of the allele. In addition, multiple Although the appearance of rare alleles from the rel system is very informative, Rareness makes accurate measurement of their frequency in a population extremely difficult. I have. To correct errors in these frequency evaluations when using rare alleles , Statistical analysis of this data provides a measure of the cumulative effect of uncertainty in these frequency assessments. To Must include. The use of these multiple allelic systems also New or rare alleles in the population will be discovered during screening Increase prospects. The completeness of the genetic data collected so far is a novel allele It will be revised empirically to reflect the discovery of. With these considerations in mind, using loci with multiple alleles is Although it may provide some short-term benefits (fewer loci (I) because it does not need to be screened), (i) it is represented more frequently, and (ii) A locus with fewer alleles that is clearly easier to measure It is preferable to perform polymorphism analysis using the same. In this type of study, the more polymorphic Can achieve the same discrimination as a locus-based test, but with the same total number of alleles Must be recovered from a series of unlinked loci. G. FIG. Gene mapping and genetic trait analysis using SNP Detected in a set of individuals of the same species (human, horse, etc.) or closely related species Polymorphisms indicate that the presence or absence of a particular polymorphism correlates with a particular attribute Can be analyzed to determine whether or not. To perform such polymorphism analysis, a set of polymorphisms (ie, a “polymorphism array”) Is determined for a set of individuals. How many of these individuals Indicate specific attributes, and some are mutually exclusive attributes (eg, horses). In terms of brittle and non-brittle bones; blindness and non-blindness beginning in maturity; asthma Predisposition and absence of such predisposition). By the way, Examining the alleles of each polymorphism in the set to determine the presence or absence of a particular allele. Determine whether it is associated with a particular attribute to taste. Such a correlation Defines the genetic map of the individual species. Arrays that do not separate randomly Are used to predict the probability that a particular animal will develop the property be able to. For example, members of a species where a particular polymorphic allele exhibits a cardiovascular condition Is present in only 20% of the species, certain members of that species, including the allele, Would have a 20% probability of indicating such a cardiovascular condition. To point out In addition, the predictive power of this analysis depends on the association between a particular polymorphic allele and a particular property. Increases with the degree of Similarly, the predictive power of this analysis is Polymorphism It can be increased by analyzing alleles at the locus simultaneously. the above In an example, the second polymorphic allele is also present in 20% of members exhibiting a cardiovascular condition. All members evaluated to present such a cardiovascular condition should And a particular combination of alleles of the second polymorphism, and therefore Certain members, including both rels, have a very high probability of exhibiting a cardiovascular condition Will. Detection of multiple polymorphic sites determines the frequency with which such sites are independently segregated in the population. To be able to For example, if the two polymorphic sites separate randomly , They are on separate chromosomes or separated from each other on the same chromosome I do. Conversely, two polymorphic sites that are simultaneously inherited at a significant frequency are located on the same chromosome. Linked to each other. Thus, analysis of the frequency of segregation is Enable the establishment of a map. Therefore, the present invention maps the genomes of plants and animals. To provide a means for performing The resolution of a genetic map is proportional to the number of markers it contains. The method of the present invention Because any number of polymorphic sites can be used to isolate It can be used to create maps with resolution. Sequencing polymorphic sites greatly enhances their utility in gene mapping . Such sequences "walk" the chromosome, thereby identifying new marker sites Oligonucleotide primers and probes that can be used to (Bender, W. et al.,J . Supra. Molec. Struc. 10 (Addendum) : 32 (1979); Chinault, A.C., et al.Gene Five: 111-126 (1979); Clarke L .; La ,Nature 287: 504-509 (1980)). The resolution of the map can be determined by analyzing the polymorphism analysis in plants or Is further enhanced by combining with data on the phenotype of other attributes of the animal. Can be Therefore, certain polymorphisms segregate with brown hair color If the polymorphism maps to a locus near the hair color gene or genes, Is performed. Similarly, increasing the resolution of genetic maps using biochemical data Can be. In this embodiment, the biochemical determination (serotype, isotype, etc.) It is examined to determine if it segregates with any of the polymorphic sites. That's it Unmaps can be used, for example, to identify new gene sequences or to cause disease-causing mutations. Can be used to identify Indeed, the identification of the SNPs of the present invention has led to the discovery of a novel residue located on either side of the SNP. To isolate or sequence the gene sequence, the complementary oligonucleotide is Enables use as a primer in CR or other reactions. The present invention Include such novel gene sequences. Using such primers Thus, genomic sequences that can be clonally isolated are transcribed into RNA, Can be translated as quality. The present invention also relates to such proteins, And antibodies and other binding molecules capable of binding to such proteins. The present invention is described below with respect to two of its aspects, namely horses and humans. explain. However, the basic tenets of genetics can be applied regardless of species So such an explanation is equally applicable to any other species . Therefore, those skilled in the art will recognize that any other species may be Only need to isolate the SNPs and thereby perform the genetic analysis of the present invention. . As noted above, LOD scoring is useful for tracking the inheritance of genetic traits. Enables the use of RFLP both for constructing genetic maps of species and (Lander, S. et al.,Proc . Natl. Acad. Sci. (USA) 83: 7353 ~ 7357 (1986); Lander, S. et al.Proc . Natl. Acad. Sci. (USA) 84: 2363〜2367 (1 987); Donis-Keller, H. et al.Cell 51: 319-337 (1987); Lander, S. et al.Genetics 1 twenty one : 185-199 (1989)). Such methods are easily adapted for use with the polymorphisms of the invention. Can be combined. In fact, such polymorphisms are not Better than R. Easily create dense genetic maps for SNP frequencies be able to. In addition, as noted above, the polymorphisms of the present invention are typical VNT More stable than R-type polymorphism. Because the polymorphisms of the present invention contain direct genomic sequence information, Can be divided. Analysis must be gel-based for RFLP or STR dependence maps This entails obtaining an electrophoretic profile of the DNA of the target animal. In addition to gel-based methods, polymorphism (SNP) analysis can also be performed using spectroscopy. And can be easily automated to facilitate the analysis of large numbers of target animals. Wear. As described above, since the present invention has been generally described, the following is the isolation and analysis of equine polymorphism. The present invention may be better understood with reference to the examples. These practices The examples are illustrative and are not intended to limit the invention. Example 1 Analysis of multiple SNPs by polyacrylamide gel electrophoresis In this example for the detection of multiple single nucleotide polymorphisms, the single-stranded DNA template Was probed with three interrogation primers. Primer # 1 has a 5 base T-tail, primer # 2 has a 10 base T-tail, Primer # 3 has a 15 base T-tail. To obtain a single-stranded template, one of two methods was used. First, Niki forov, T .; (US Patent Application No. 08 / 005,061 (application abandoned June 24, 1996) ), Including 4 phosphorothioate-nucleotide derivatives, as taught by Primers are used to mediate amplification. Alternatively, the second round of PCR may be This can be done using "symmetric" primer concentrations. The product of the first reaction is transferred to the second In the reaction, the dilution is 1/1000. One of the primers in the second round has a 2M mark Use at a sub-concentration, the other at a concentration of 0.08M. Under these conditions, single-stranded molecules It is synthesized during the reaction. The primer mixture is hybridized to the single-stranded target template and the DNA polymerase Initiate a single base extension reaction using a modified and four modified non-extendable nucleotides. Rub. Only one modified non-extendable nucleotide for each four reaction tubes Preferably³²Label with P or fluorescent molecule. Then, the obtained extended primer Are separated for analysis on a standard 12% sequencing gel. Like this To determine the electrophoretic mobility of each primer and the identity of labeled non-extendable nucleotides. It is possible to determine the identity of the SNP corresponding to each primer based on the You. Example 2 Analysis of multiple SNPs by size exclusion chromatography Primers 1, 2, 3 and 4 were replaced with BSA, egg albumin and lysozyme, respectively. And covalently bound to CIP. Four interrogations of single-stranded DNA template Probe with primers, DNA polymerase and four modified non-extendable A single base extension reaction using nucleotides is caused. Preferably each non-extensible The nucleotides are uniquely and separately labeled. Subsequently, the resulting extended primer is applied to a suitable size exclusion column (eg, , Sephadex, Sepharose, etc.) and separate the eluate (eg, Analysis (by a titration counter) to determine the identity of the introduced nucleotides. Set. Example 3 Analysis of multiple SNPs using affinity method In this example, for example, the method disclosed by Chu et al.Nucleic Acids Res . 16 : 3671-3691 (1988), which is incorporated herein by reference). Alternatively, the protein affinity ligand can be interrogated with an oligonucleotide. Covalently bound to Then, the affinity ligand-interrogation probe The primer complex is hybridized to the target nucleic acid molecule and the single base extension is performed for four times. Of differently labeled dideoxynucleotide species (ddA, ddT, ddC and ddG). If desired, less species of dideoxynucleated Reotide may be used. The corresponding monoclonal antibody against the peptide or protein Immobilize on an interplate (Nunc). Buffer each monoclonal antibody at room temperature In a microtiter plate. Then, plate the TNTw solution Wash three times to remove excess unbound protein. The extended primer solution is then added to each well over about 30 minutes. Not yet The bound primer is removed by extensive washing with a TNTw solution. Table 3 shows such facts. Shows the results expected from the experiment. Example 4 Analysis of multiple SNPs without separating oligonucleotides Multiple single nucleotide polymorphisms without separating oligonucleotide probes Can also be identified. In such experiments, each dideoxynucleotide Preferably, the triphosphate is uniquely labeled. So, for example, ddATP³²Labeled with P, ddGTP^ThreeLabel with H , DdCTP is³⁵Labeled with S, ddTTP is¹²⁵I can be labeled. table No. 3 hybridizes six oligonucleotides to the preparation containing the nucleic acid of interest. The hypothetical results obtained from soybean and the results of a single base extension reaction are shown. In such experiments, unintroduced ddNTPs could be used in various ways (eg, Any of the appropriate spin columns (ie, CentriSep spin columns) Can be used to separate from the extended probe. The introduced labeled dideoxynucleotide triphosphate is then scintillated. Detection using a counter. Because each isotope has a separate emission spectrum The scintillation counter allows multiple single nuclei without the need for purification procedures. The identity of the leotide polymorphism can be determined. As shown in Table 4, the introduced dideoxynuclease complementary to the single nucleotide polymorphism was The identification of the nucleotides was determined by T, G, A, T, C, It is clear that G. Ambiguity is changing primer combinations Can be determined by Example 5 Analysis of multiple SNPs by oligonucleotide array separation Primers 1, 2, 3 and 4 were used in addition to the sequence complementary to the template DNA, A unique arrangement for subsequent hybridization to the capture oligonucleotide on the surface Contains columns. Probe single-stranded DNA template with 4 interrogation primers Using DNA polymerase and four modified non-extendable nucleotides. Initiate a single base extension reaction. Preferably each non-extendable nucleotide is unique And label separately. The obtained extended primer is then applied to the surface of the oligonucleotide array. Apply. This array comprises four separate and spatially separated distinct capture oligonucleotides. Consisting of nucleotides, each oligonucleotide is an interrogation primer Complementary to one and above unique sequence. Each interrogation primer is Effective by hybridization to capture primer bound to corresponding surface Separated. The identity of the labeled nucleotide is then determined by a suitable method. Although the invention has been described with reference to specific embodiments thereof, it is possible to further modify it. This application is generally based on the principles of the present invention, and As is known and practiced in the art, or as described above and in the appended claims. Include those that depart from the disclosure of the invention as applicable to essential features therein. And is intended to cover any modifications, uses, or adaptations of the invention. It will be understood.

───────────────────────────────────────────────────── フロントページの続き (81)指定国ＥＰ(ＡＴ，ＢＥ，ＣＨ，ＣＹ，ＤＥ，ＤＫ，ＥＳ，ＦＩ，ＦＲ，ＧＢ，ＧＲ，ＩＥ，ＩＴ，ＬＵ，ＭＣ，ＮＬ，ＰＴ，ＳＥ)，ＯＡ(ＢＦ，ＢＪ，ＣＦ，ＣＧ，ＣＩ，ＣＭ，ＧＡ，ＧＮ，ＭＬ，ＭＲ，ＮＥ，ＳＮ，ＴＤ，ＴＧ)，ＡＰ(ＧＨ，ＧＭ，ＫＥ，ＬＳ，ＭＷ，ＳＤ，ＳＺ，ＵＧ，ＺＷ)，ＥＡ(ＡＭ，ＡＺ，ＢＹ，ＫＧ，ＫＺ，ＭＤ，ＲＵ，ＴＪ，ＴＭ)，ＡＬ，ＡＭ，ＡＴ，ＡＵ，ＡＺ，ＢＡ，ＢＢ，ＢＧ，ＢＲ，ＢＹ，ＣＡ，ＣＨ，ＣＮ，ＣＵ，ＣＺ，ＤＥ，ＤＫ，ＥＥ，ＥＳ，ＦＩ，ＧＢ，ＧＥ，ＧＨ，ＧＭ，ＧＷ，ＨＵ，ＩＤ，ＩＬ，ＩＳ，ＪＰ，ＫＥ，ＫＧ，ＫＰ，ＫＲ，ＫＺ，ＬＣ，ＬＫ，ＬＲ，ＬＳ，ＬＴ，ＬＵ，ＬＶ，ＭＤ，ＭＧ，ＭＫ，ＭＮ，ＭＷ，ＭＸ，ＮＯ，ＮＺ，ＰＬ，ＰＴ，ＲＯ，ＲＵ，ＳＤ，ＳＥ，ＳＧ，ＳＩ，ＳＫ，ＳＬ，ＴＪ，ＴＭ，ＴＲ，ＴＴ，ＵＡ，ＵＧ，ＵＺ，ＶＮ，ＹＵ，ＺＷ (72)発明者ヘッド，スティーブンアメリカ合衆国21074メリーランド州ハンプステッド、サウス・メイン・ストリート 808番 (72)発明者ゴーレット，フィリップアメリカ合衆国21136メリーランド州リースタータウン、ミレンダー・ミル・ロード 4000番 (72)発明者ボイス−ジャチノ，マイケル・ティアメリカ合衆国21048メリーランド州フィンクスバーグ、ナイナー・ロード3811番────────────────────────────────────────────────── ─── Continuation of front page (81) Designated country EP (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, I T, LU, MC, NL, PT, SE), OA (BF, BJ , CF, CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG), AP (GH, GM, KE, L S, MW, SD, SZ, UG, ZW), EA (AM, AZ , BY, KG, KZ, MD, RU, TJ, TM), AL , AM, AT, AU, AZ, BA, BB, BG, BR, BY, CA, CH, CN, CU, CZ, DE, DK, E E, ES, FI, GB, GE, GH, GM, GW, HU , ID, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, M D, MG, MK, MN, MW, MX, NO, NZ, PL , PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, UA, UG, UZ, V N, YU, ZW (72) Inventor Head, Stephen Han, United States 21074 Maryland Pstead, South Main Street 808 (72) Inventor Gorlet, Philip United States 21136 Lee, Maryland Startown, Millender Mill Road 4000th (72) Inventor Voice-Jachino, Michael Tee United States 21048 Philly, Maryland Nksburg, Naina Road 3811

Claims

[Claims] 1.1. A method for identifying one or more single polymorphisms in a single reaction , Process: (A) one or more identifiable interrogation oligonucleotide probes The primer is hybridized to one or more target nucleic acid molecules, wherein each Oligonucleotide primers are of interest where the 3 'end of each primer is of interest. Of each target nucleic acid molecule so that it is immediately adjacent to a constant and unique target nucleotide Complementary to specific and unique regions; (B) Each interrogation oligonucleotide is treated with a template-dependent polymerase. Extension, wherein the extension comprises one or more non-extendable nucleotides or Occurs in the presence of a nucleotide analog species; (C) For each interrogation primer used, such a primer Of non-extendable nucleotides (or nucleotide analogs) introduced into By determining the identity, the identity of each nucleotide of interest is determined. The identified non-extendable nucleotide or nucleotide analog Complementary to the target nucleotide of the primer A method that includes 2. Each interrogation oligonucleotide primer contains a 5 'tail The 5 'tail is of a specific and unique length or each interrogation primer A neutral component having other properties used to identify or separate Item 1. The method according to Item 1. 3. 2. The method of claim 1, wherein said hybridization step occurs in a solution. . 4. Identifying the non-extendable nucleotide by a physical or chemical method The method of claim 1. 5. The physical method or the chemical method is used for polarization spectroscopy, mass spectroscopy, infrared spectroscopy. Select from the group consisting of optical analysis, ultraviolet spectroscopy, visible light spectroscopy or NMR spectroscopy. 5. The method according to claim 4, wherein the method is performed. 6. Further steps: (D) separating the extended primer on a suitable matrix The method of claim 1, comprising: 7. 7. The method of claim 6, wherein said matrix is a size separation matrix. . 8. The method of claim 7, wherein the size separation matrix is a sequencing gel. The method described. 9. The sequencing gel comprises about 4% to about 20% polyacrylamide The method of claim 8. 10. 8. The method of claim 7, wherein the size separation matrix is a size exclusion column. the method of. 11. A suitable receptor molecule is bound to the matrix and the receptor The appropriate ligand molecule corresponding to the oligonucleotide is bound to the oligonucleotide primer. 7. The method of claim 6, wherein the method is performed. 12. The matrix contains beads, columns, dipsticks, Claims selected from the group consisting of interplates and glass slides Item 12. The method according to Item 11. 13. 2. The method of claim 1, wherein said non-extendable nucleotide is ddNTP. . 14． 14. The ddNTP is fluorescently or chemically labeled. The method described in. 15. 14. The method according to claim 13, wherein said ddNTP is biotinylated. 16. 2. The method of claim 1, wherein said target molecule is a nucleic acid molecule. 17． 17. The method according to claim 16, wherein said nucleic acid molecule is a DNA molecule. 18. 18. The method according to claim 17, wherein said DNA molecule is genomic DNA. 19. 18. The method according to claim 17, wherein said DNA molecule is double-stranded DNA. 20. 18. The method according to claim 17, wherein said DNA molecule is single-stranded DNA. 21. 17. The method of claim 16, wherein said nucleic acid molecule is an RNA molecule. 22. A method for characterizing a target DNA, comprising: (A) one or more identifiable interrogation oligonucleotide probes The primer is hybridized to one or more target nucleic acid molecules, wherein each Oligonucleotide primers are of interest where the 3 'end of each primer is of interest. Of each target nucleic acid molecule so that it is immediately adjacent to a constant and unique target nucleotide Complementary to specific and unique regions; (B) Each interrogation oligonucleotide is treated with a template-dependent polymerase. Extension, wherein the extension occurs in the presence of more than one non-extendable nucleotide species. Happen; (C) separating the extended primers on a suitable matrix; (D) For each interrogation primer used, such a primer Of interest by determining the identity of the non-extendable nucleotides introduced into the Examining each nucleotide that leans, wherein the identified non-extendable nucleotides Is complementary to the target nucleotide of the primer; And (E) converting the interested nucleotides of interest to those of a reference nucleic acid molecule. Compared to the corresponding nucleotide, the nucleotide of interest at each site Determine if they contain the same single nucleotide A method that includes: 23. 3. The method of claim 2, wherein the characterization identifies a property of the target DNA molecule. 3. The method according to 2. 24. 24. The method of claim 23, wherein said characteristic is a genetic disease. 25. 24. The method of claim 23, wherein said attribute is a genetic condition. 26. The size separation matrix is from Sepharose and Sephadex The method of claim 7, wherein the method is selected from the group consisting of: 27. The ligand is a hapten, antigen, cofactor, biotin and iminobioti. 12. The method of claim 11, wherein the method is selected from the group consisting of: 28. The ligand is a dinitrophenol, a lipoic acid and an olefin compound. 12. The method of claim 11, wherein the method is selected from the group consisting of: 29. The ligand specifically hybridizes to a complementary oligonucleotide. Unique and specific oligonucleotides, complementary oligonucleotides PNA sequences and receptors designed to specifically hybridize to otide 2. The protein according to claim 1, which is selected from the group consisting of PNA sequences functioning as putters. 2. The method according to 1. 30. The ligand may be an antibody, enzyme, polypeptide, streptavidin and antigen. 12. The method of claim 11, wherein the method is selected from the group consisting of vidin. 31. The ligand is conjugated by binding to a detectable polypeptide. The method of claim 11, which can form 32. The detectable polypeptide may deposit antibodies, insoluble reaction products Enzymes, selected from the group consisting of streptavidin and avidin, 31. The method according to claim 30. 33. The detectable polypeptide is a polypeptide library randomly generated. 31. The method of claim 30, wherein the method is selected from a rally. 34. The receptor is a hapten, antigen, cofactor, biotin and iminobio 12. The method of claim 11, wherein the method is selected from the group consisting of chin. 35. The receptor is a dinitrophenol, a lipoic acid and an olefin compound The method of claim 11, wherein the method is selected from the group consisting of: 36. The receptor specifically hybridizes to a complementary oligonucleotide Unique and specific oligonucleotides, complementary oligonucleotides PNA sequences designed to specifically hybridize to Claims 1. It is selected from the group consisting of PNA sequences that function as sceptors. 12. The method according to 11. 37. The receptor is conjugated by binding to a detectable polypeptide. 12. The method of claim 11, wherein the method is capable of forming a body. 38. The detectable polypeptide may deposit antibodies, insoluble reaction products Enzymes, selected from the group consisting of streptavidin and avidin, 38. The method of claim 37. 39. The detectable polypeptide is a polypeptide library randomly generated. 38. The method of claim 37, wherein the method is selected from a rally. 40. The receptor molecule binds to the matrix via covalent bonds, ionic interactions, Non-specific adsorption and specific but non-covalent ligand-receptor interactions 12. The method of claim 11, wherein the methods are combined by a method selected from the group consisting of: 41. The ligand-receptor is selected from complementary hybridizing nucleic acids. 41. The method of claim 40, wherein 42. Hybridizing PNAs and the like wherein the ligand-receptor is complementary 41. The method according to claim 40, which is selected from the group consisting of: Law. 43. The ligand molecule is covalently attached to the oligonucleotide primer, Interaction, non-specific adsorption, and specific but non-covalent ligand-receptor 12. The method according to claim 11, wherein the binding is performed by a method selected from the group consisting of Putter interactions. The method described. 44. The ligand-receptor is selected from complementary hybridizing nucleic acids. 44. The method of claim 43. 45. Hybridizing PNAs and the like wherein the ligand-receptor is complementary 44. The method according to claim 43, wherein the compound is selected from the group consisting of: Law. 46. The non-extendable nucleotide is introduced by a template-dependent polymerase. 2. The method of claim 1, wherein the compound is a synthetic or naturally occurring nucleotide analog. The method described. 47. The synthetic or naturally-occurring nucleotide analog is an acyclic ribose. , Substituted nucleotide analogs, and modified ribose nucleotide analogs 47. The method of claim 46, wherein the method is selected from the group consisting of: 48. The synthetic nucleotide analog is a fructose-based nucleotide 47. The method of claim 46, wherein the method is selected from the group consisting of Nalog. 49. The synthetic nucleotide analog is specific for a naturally occurring nucleotide. From chemically modified purines or pyrimidines that retain the ability to base pair with 47. The method of claim 46, wherein the method is selected from the group consisting of: 50. The synthetic nucleotide analog is specific for a naturally occurring nucleotide. Selected from the group consisting of compounds that retain the ability to base pair with 47. The method of claim 46. 51. The non-extendable nucleotide is fluorescently labeled or chemically labeled The method of claim 1, wherein the method is performed. 52. The non-extendable nucleotide is labeled with biotin or iminobiotin The method of claim 1, wherein the method is performed. 53. The non-extendable nucleotide is labeled with a hapten, antigen or cofactor. The method of claim 1, wherein 54. The non-extendable nucleotide is dinitrophenol, lipoic acid or olefin. The method according to claim 1, wherein the method is labeled with a quinine compound. 55. The non-extendable nucleotide is labeled with a detectable polypeptide The method of claim 1, wherein 56. The non-extendable nucleotide is a molecule having a high electron density or an insoluble reaction product. The method of claim 1, wherein the method is labeled with an enzyme capable of depositing a product. 57. The non-extendable nucleotide is a molecule having a high electron density or an insoluble reaction product. The method of claim 1, wherein the method is labeled with an enzyme capable of depositing a product. 58. The fluorescent indicator molecule is fluorescein, rhodamine, Texas Red, FAM, JOE, TAMRA, ROX, HEX, TET, Cy3, Cy3.5, The group consisting of Cy5, Cy5.5, IRD40, IRD41 and BODIPY 49. The method of claim 48, wherein the method is selected from: 59. The electron-dense indicator molecule comprises ferritin, hemocyanin, and 58. The method of claim 57, wherein the method is selected from the group consisting of Lloyd gold. 60. The detectable polypeptide separates the detectable polypeptide from an indicator component. By specifically forming a complex with a second polypeptide covalently linked to the 56. The method of claim 55, which is detectable indirectly. 61. The detectable polypeptide is less than avidin and streptavidin. Selected from the group consisting of biotin and iminobi. 61. The method of claim 60, wherein the method is selected from the group consisting of otyne. 62. 17. The method of claim 16, wherein said nucleic acid molecule is from a plant. 63. 17. The method of claim 16, wherein said nucleic acid molecule is from a microorganism. 64. The microorganism is a bacterium, fungus, yeast, virus, viroid and other genetic 64. The method of claim 63, wherein the method is selected from the group consisting of possible genetic entities. .