[go: up one dir, main page]

CN118510911A - Methods and systems for determining a pregnancy-related status of a subject - Google Patents

Methods and systems for determining a pregnancy-related status of a subject Download PDF

Info

Publication number
CN118510911A
CN118510911A CN202280087768.XA CN202280087768A CN118510911A CN 118510911 A CN118510911 A CN 118510911A CN 202280087768 A CN202280087768 A CN 202280087768A CN 118510911 A CN118510911 A CN 118510911A
Authority
CN
China
Prior art keywords
pregnancy
weeks
genomic loci
cell
eclampsia
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280087768.XA
Other languages
Chinese (zh)
Inventor
马尼什·简恩
欧金尼·纳姆萨拉伊夫
莫滕·拉斯穆森
琼·卡穆纳斯索莱尔
法鲁克·西迪基
米特苏·雷迪
伊莱恩·吉
阿尔卡迪·霍杜尔斯基
罗里·诺兰
曼弗雷德·李
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Milvey Co
Original Assignee
Milvey Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Milvey Co filed Critical Milvey Co
Priority claimed from PCT/US2022/079235 external-priority patent/WO2023081768A1/en
Publication of CN118510911A publication Critical patent/CN118510911A/en
Pending legal-status Critical Current

Links

Landscapes

  • Investigating Or Analysing Biological Materials (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present disclosure provides methods and systems for cell-free identification and/or monitoring of pregnancy related conditions. Methods for identifying or monitoring a pregnant subject for the presence or increased risk of a pregnancy-related status may include assaying a cell-free biological sample derived from the pregnant subject to detect a biomarker set, and analyzing the biomarker set using a trained algorithm to determine the presence or increased risk of a pregnancy-related status.

Description

用于确定对象的妊娠相关状态的方法和系统Methods and systems for determining a pregnancy-related status of a subject

交叉引用Cross-references

本申请要求于2021年11月4日提交的美国申请第63/275,726号、2021年11月8日提交的美国申请第63/276,809号和2021年12月10日提交的美国申请第63/288,044号的权益,其各自均通过引用整体并入本文。This application claims the benefit of U.S. Application No. 63/275,726, filed on November 4, 2021, U.S. Application No. 63/276,809, filed on November 8, 2021, and U.S. Application No. 63/288,044, filed on December 10, 2021, each of which is incorporated herein by reference in its entirety.

背景技术Background Art

每年,全球报告约1500万例早产,超过30万女性死于妊娠相关并发症诸如出血和高血压病症如先兆子痫。早产可能影响多达约10%的妊娠,其中大多数是自发性早产。早产等妊娠相关并发症是新生儿死亡和以后生活中并发症的主要原因。此外,此类妊娠相关并发症会对母体健康造成负面的健康影响。Every year, approximately 15 million premature births are reported worldwide, and more than 300,000 women die from pregnancy-related complications such as bleeding and hypertensive conditions such as preeclampsia. Preterm births may affect up to approximately 10% of pregnancies, most of which are spontaneous preterm births. Pregnancy-related complications such as premature births are the leading cause of neonatal mortality and complications later in life. In addition, such pregnancy-related complications can have negative health effects on maternal health.

发明内容Summary of the invention

目前,可能缺乏可用于许多妊娠相关并发症(诸如早产)的有意义的、临床上可行的诊断筛查或测试。因此,为了使妊娠尽可能安全,需要快速、准确的方法来鉴定和监测妊娠相关的状态,这些方法是无创的并且有成本效益的,以改善母婴健康。Currently, there may be a lack of meaningful, clinically feasible diagnostic screening or testing available for many pregnancy-related complications, such as preterm birth. Therefore, in order to make pregnancy as safe as possible, rapid, accurate methods are needed to identify and monitor pregnancy-related states that are non-invasive and cost-effective to improve maternal and infant health.

本公开提供了通过处理获得自或来源于对象的无细胞生物样品来鉴定或监测妊娠相关状态的方法、系统和试剂盒。可以分析从对象获得的无细胞生物样品(例如,血浆样品)以鉴定妊娠相关状态(其中可能包括,例如,测量妊娠相关状态的存在、不存在或相对评估)。此类对象可以包括具有一种或多种妊娠相关状态的对象和没有妊娠相关状态的对象。妊娠相关状态可以包括,例如,早产、足月产、胎龄、预产期(例如,对象的未出生婴儿或胎儿的预产期)、分娩发作、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压疾病)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促),以及胎儿发育阶段或状态(例如,正常胎儿器官功能或发育和异常胎儿器官功能或发育)。例如,胎儿发育阶段或状态可能与胎儿器官的正常胎儿器官功能或发育和/或异常胎儿器官功能或发育有关,该胎儿器官选自心脏、大肠、小肠、视网膜、前额叶皮层、中脑、肾脏和食道。The present disclosure provides methods, systems and kits for identifying or monitoring pregnancy-related states by processing acellular biological samples obtained from or derived from a subject. Acellular biological samples (e.g., plasma samples) obtained from a subject can be analyzed to identify pregnancy-related states (which may include, for example, measuring the presence, absence or relative assessment of pregnancy-related states). Such subjects may include subjects with one or more pregnancy-related states and subjects without pregnancy-related states. Pregnancy-related states may include, for example, premature birth, term birth, gestational age, due date (e.g., due date of unborn baby or fetus of the subject), onset of labor, pregnancy-related hypertension disorders (e.g., pre-eclampsia), eclampsia, gestational diabetes, congenital disorders of the subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia and hypertensive disease), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine/fetal growth retardation, ... The invention also provides a method for determining the presence or absence of a fetal malformation, a fetal malformation, a fetal macrosomia (a fetus that is large for gestational age), a neonatal condition (e.g., anemia, apnea, bradycardia and other cardiac defects, bronchopulmonary dysplasia or chronic lung disease, diabetes, gastroschisis, hydrocephalus, hyperbilirubinemia, hypocalcemia, hypoglycemia, intraventricular hemorrhage, jaundice, necrotizing enterocolitis, patent ductus arteriosus, periventricular leukomalacia, persistent pulmonary hypertension, polycythemia, respiratory distress syndrome, retinopathy of prematurity, and transient tachypnea), and a fetal developmental stage or state (e.g., normal fetal organ function or development and abnormal fetal organ function or development). For example, the fetal developmental stage or state may be associated with normal fetal organ function or development and/or abnormal fetal organ function or development of a fetal organ selected from the group consisting of heart, large intestine, small intestine, retina, prefrontal cortex, midbrain, kidney, and esophagus.

在一方面,本公开提供了用于鉴定妊娠对象的妊娠相关状态的存在或升高的风险的方法,包括测定来源于妊娠对象的无细胞生物样品以检测生物标志物集,以及使用经训练算法或针对参考值处理该生物标志物集以确定至少三种不同的妊娠相关状态的集之中的妊娠相关状态的存在或升高的风险。In one aspect, the present disclosure provides a method for identifying the presence or increased risk of a pregnancy-associated state in a pregnant subject, comprising assaying a cell-free biological sample derived from the pregnant subject to detect a set of biomarkers, and processing the set of biomarkers using a trained algorithm or against reference values to determine the presence or increased risk of a pregnancy-associated state among a set of at least three different pregnancy-associated states.

在一些实施方案中,妊娠相关状态选自早产、足月产、胎龄、预产期、分娩发作、妊娠相关高血压病症、先兆子痫、子痫、妊娠期糖尿病、所述妊娠对象的胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症、妊娠剧吐、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘、宫内/胎儿生长受限、巨大儿、新生儿状况,以及胎儿发育阶段或状态。In some embodiments, the pregnancy-related condition is selected from preterm birth, term birth, gestational age, due date, onset of labor, pregnancy-related hypertensive disorders, pre-eclampsia, eclampsia, gestational diabetes, a congenital disorder of the fetus of the pregnant subject, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications, hyperemesis gravidarum, hemorrhage or excessive bleeding during labor, premature rupture of membranes, preterm premature rupture of membranes, placenta previa, intrauterine/fetal growth restriction, macrosomia, neonatal condition, and fetal development stage or state.

在一些实施方案中,妊娠相关状态是早产亚型,并且至少三种不同的妊娠相关状态包括至少两种不同的早产亚型。在一些实施方案中,早产亚型是早产分子亚型,并且至少两种不同的早产亚型包括至少两种不同的早产分子亚型。在一些实施方案中,早产分子亚型选自既往早产的病史、自发性早产的病史、种族特异性早产风险的病史,以及早产胎膜早破(PPROM)的病史。在一些实施方案中,早产分子亚型是自发性早产,并且生物标志物集包含与自发性早产相关联的基因组位点。在一些实施方案中,与自发性早产相关联的基因组位点选自表1中列出的基因、表2中列出的基因、表3中列出的基因、对应于表4中列出的途径的基因、表9中列出的基因、表10中列出的基因以及表11中列出的基因。在一些实施方案中,自发性早产包括在小于25周分娩、在小于26周分娩、在小于27周分娩、在小于28周分娩、在小于29周分娩、在小于30周分娩、在小于31周分娩、在小于32周分娩、在小于33周分娩、在小于34周分娩、在小于35周分娩、在小于36周分娩、在小于37周分娩或在小于38周分娩。在一些实施方案中,方法进一步包括至少部分地基于早产分子亚型的存在或升高的风险来鉴定妊娠对象的临床干预。在一些实施方案中,临床干预选自多个临床干预。在一些实施方案中,临床干预包括药物、补充剂、生活方式建议、宫颈环扎、宫颈子宫托或电收缩抑制。在一些实施方案中,药物选自黄体酮、红霉素、宫缩抑制药物、皮质类固醇、阴道菌群和抗氧化剂。In some embodiments, the pregnancy-related state is a preterm birth subtype, and at least three different pregnancy-related states include at least two different preterm birth subtypes. In some embodiments, the preterm birth subtype is a preterm birth molecular subtype, and at least two different preterm birth subtypes include at least two different preterm birth molecular subtypes. In some embodiments, the preterm birth molecular subtype is selected from the medical history of previous premature birth, the medical history of spontaneous premature birth, the medical history of ethnic-specific premature birth risk, and the medical history of premature premature rupture of membranes (PPROM). In some embodiments, the preterm birth molecular subtype is spontaneous premature birth, and the biomarker set includes genomic sites associated with spontaneous premature birth. In some embodiments, the genomic sites associated with spontaneous premature birth are selected from the genes listed in Table 1, the genes listed in Table 2, the genes listed in Table 3, the genes corresponding to the pathways listed in Table 4, the genes listed in Table 9, the genes listed in Table 10, and the genes listed in Table 11. In some embodiments, spontaneous preterm birth includes delivery at less than 25 weeks, delivery at less than 26 weeks, delivery at less than 27 weeks, delivery at less than 28 weeks, delivery at less than 29 weeks, delivery at less than 30 weeks, delivery at less than 31 weeks, delivery at less than 32 weeks, delivery at less than 33 weeks, delivery at less than 34 weeks, delivery at less than 35 weeks, delivery at less than 36 weeks, delivery at less than 37 weeks, or delivery at less than 38 weeks. In some embodiments, the method further includes identifying a clinical intervention for a pregnant subject based at least in part on the presence or increased risk of a molecular subtype of preterm birth. In some embodiments, the clinical intervention is selected from a plurality of clinical interventions. In some embodiments, the clinical intervention includes medications, supplements, lifestyle advice, cervical cerclage, cervical pessary, or electrical contraction inhibition. In some embodiments, the drug is selected from progesterone, erythromycin, tocolytic drugs, corticosteroids, vaginal flora, and antioxidants.

在一些实施方案中,妊娠相关状态是先兆子痫亚型,并且至少三种不同的妊娠相关状态包括至少两种不同的先兆子痫亚型。在一些实施方案中,先兆子痫亚型是先兆子痫分子亚型,并且其中至少两种不同的先兆子痫亚型包括至少两种不同的先兆子痫分子亚型。在一些实施方案中,先兆子痫分子亚型选自慢性或原有高血压的病史、妊娠高血压的存在或病史、轻度先兆子痫的存在或病史、重度先兆子痫的存在或病史、子痫的存在或病史,以及HELLP综合征的存在或病史。在一些实施方案中,先兆子痫分子亚型是早产先兆子痫,并且生物标志物集包含与早产先兆子痫相关联的基因组位点。在一些实施方案中,与早产先兆子痫相关联的基因组位点选自表5中列出的基因、表6中列出的基因、表7中列出的基因以及表8中列出的基因。在一些实施方案中,早产先兆子痫包括在小于25周分娩、在小于26周分娩、在小于27周分娩、在小于28周分娩、在小于29周分娩、在小于30周分娩、在小于31周分娩、在小于32周分娩、在小于33周分娩、在小于34周分娩、在小于35周分娩、在小于36周分娩、在小于37周分娩或在小于38周分娩。在一些实施方案中,方法进一步包括至少部分地基于先兆子痫分子亚型的存在或升高的风险来鉴定妊娠对象的临床干预。在一些实施方案中,临床干预选自多个临床干预。在一些实施方案中,临床干预包括药物、补充剂或生活方式建议。在一些实施方案中,药物选自阿司匹林、黄体酮、硫酸镁、胆固醇药物、胃灼热药物、血管紧张素II受体拮抗剂、钙通道阻滞剂、糖尿病药物和勃起功能障碍药物。In some embodiments, the pregnancy-related state is a pre-eclampsia subtype, and at least three different pregnancy-related states include at least two different pre-eclampsia subtypes. In some embodiments, the pre-eclampsia subtype is a pre-eclampsia molecular subtype, and at least two different pre-eclampsia subtypes include at least two different pre-eclampsia molecular subtypes. In some embodiments, the pre-eclampsia molecular subtype is selected from a history of chronic or pre-existing hypertension, the presence or history of gestational hypertension, the presence or history of mild pre-eclampsia, the presence or history of severe pre-eclampsia, the presence or history of eclampsia, and the presence or history of HELLP syndrome. In some embodiments, the pre-eclampsia molecular subtype is pre-eclampsia, and the biomarker set includes genomic sites associated with pre-eclampsia. In some embodiments, the genomic sites associated with pre-eclampsia are selected from the genes listed in Table 5, the genes listed in Table 6, the genes listed in Table 7, and the genes listed in Table 8. In some embodiments, premature pre-eclampsia is included in less than 25 weeks of delivery, less than 26 weeks of delivery, less than 27 weeks of delivery, less than 28 weeks of delivery, less than 29 weeks of delivery, less than 30 weeks of delivery, less than 31 weeks of delivery, less than 32 weeks of delivery, less than 33 weeks of delivery, less than 34 weeks of delivery, less than 35 weeks of delivery, less than 36 weeks of delivery, less than 37 weeks of delivery or less than 38 weeks of delivery. In some embodiments, the method further includes identifying the clinical intervention of the pregnancy object at least in part based on the presence of pre-eclampsia molecular subtypes or the risk of increase. In some embodiments, clinical intervention is selected from a plurality of clinical interventions. In some embodiments, clinical intervention includes medicine, supplements or lifestyle advice. In some embodiments, the medicine is selected from aspirin, progesterone, magnesium sulfate, cholesterol medicine, heartburn medicine, angiotensin II receptor antagonist, calcium channel blocker, diabetes medicine and erectile dysfunction medicine.

在一些实施方案中,妊娠相关状态是妊娠期糖尿病亚型,并且其中至少三种不同的妊娠相关状态包括至少两种不同的妊娠期糖尿病亚型。在一些实施方案中,妊娠期糖尿病亚型是妊娠期糖尿病分子亚型,并且其中至少两种不同的妊娠期糖尿病亚型包括至少两种不同的妊娠期糖尿病分子亚型。在一些实施方案中,生物标志物集包含与妊娠期糖尿病相关联的基因组位点。在一些实施方案中,与妊娠期糖尿病相关联的基因组位点选自PDK4、CSH1、PLAC4、TBCEL和FBXO7。在一些实施方案中,方法进一步包括至少部分地基于妊娠期糖尿病分子亚型的存在或升高的风险来鉴定妊娠对象的临床干预。在一些实施方案中,临床干预选自多个临床干预。在一些实施方案中,临床干预包括药物、补充剂、生活方式建议、宫颈环扎、宫颈子宫托或电收缩抑制。In some embodiments, the pregnancy-related state is a gestational diabetes subtype, and at least three different pregnancy-related states include at least two different gestational diabetes subtypes. In some embodiments, the gestational diabetes subtype is a gestational diabetes molecular subtype, and at least two different gestational diabetes subtypes include at least two different gestational diabetes molecular subtypes. In some embodiments, the biomarker set comprises genomic sites associated with gestational diabetes. In some embodiments, the genomic sites associated with gestational diabetes are selected from PDK4, CSH1, PLAC4, TBCEL and FBXO7. In some embodiments, the method further includes identifying a clinical intervention for a pregnant subject based at least in part on the presence or increased risk of a gestational diabetes molecular subtype. In some embodiments, the clinical intervention is selected from a plurality of clinical interventions. In some embodiments, the clinical intervention includes medications, supplements, lifestyle advice, cervical cerclage, cervical pessary or electrical contraction inhibition.

在一些实施方案中,生物标志物集包含至少5个不同的基因组位点、至少10个不同的基因组位点、至少15个不同的基因组位点、至少20个不同的基因组位点、至少25个不同的基因组位点、至少30个不同的基因组位点、至少35个不同的基因组位点、至少40个不同的基因组位点、至少45个不同的基因组位点、至少50个不同的基因组位点、至少100个不同的基因组位点、至少150个不同的基因组位点或至少200个不同的基因组位点。In some embodiments, the biomarker set comprises at least 5 different genomic loci, at least 10 different genomic loci, at least 15 different genomic loci, at least 20 different genomic loci, at least 25 different genomic loci, at least 30 different genomic loci, at least 35 different genomic loci, at least 40 different genomic loci, at least 45 different genomic loci, at least 50 different genomic loci, at least 100 different genomic loci, at least 150 different genomic loci, or at least 200 different genomic loci.

在一些实施方案中,测定包括使用来源于无细胞生物样品的无细胞核糖核酸(cfRNA)分子来生成转录组学数据、使用来源于无细胞生物样品的转录产物来生成转录产物数据、使用来源于无细胞生物样品的无细胞脱氧核糖核酸(cfDNA)分子来生成基因组数据和/或甲基化数据、使用来源于第一无细胞生物样品的蛋白质来生成蛋白质组学数据,或使用来源于第一无细胞生物样品的代谢物来生成代谢组学数据。In some embodiments, determining includes using cell-free ribonucleic acid (cfRNA) molecules derived from a cell-free biological sample to generate transcriptomic data, using transcription products derived from a cell-free biological sample to generate transcription product data, using cell-free deoxyribonucleic acid (cfDNA) molecules derived from a cell-free biological sample to generate genomic data and/or methylation data, using proteins derived from a first cell-free biological sample to generate proteomic data, or using metabolites derived from a first cell-free biological sample to generate metabolomic data.

在一些实施方案中,无细胞生物样品选自无细胞核糖核酸(cfRNA)、无细胞脱氧核糖核酸(cfDNA)、无细胞胎儿DNA(cffDNA)、血浆、血清、尿液、唾液、羊水及其衍生物。在一些实施方案中,无细胞生物样品是使用乙二胺四乙酸(EDTA)收集管、无细胞RNA收集管或无细胞脱氧核糖核酸(DNA)收集管获得自或来源于妊娠对象。在一些实施方案中,方法进一步包括对妊娠对象的全血样品进行分级分离以获得无细胞生物样品。在一些实施方案中,测定包括无细胞核糖核酸(cfRNA)测定或代谢组学测定。在一些实施方案中,代谢组学测定包括靶向质谱(MS)或免疫测定。在一些实施方案中,无细胞生物样品包括无细胞核糖核酸(cfRNA)或尿液。在一些实施方案中,测定包括定量聚合酶链反应(qPCR)。在一些实施方案中,测定包括被配置为在家庭环境中进行的家用测试。In some embodiments, the cell-free biological sample is selected from cell-free ribonucleic acid (cfRNA), cell-free deoxyribonucleic acid (cfDNA), cell-free fetal DNA (cffDNA), plasma, serum, urine, saliva, amniotic fluid and its derivatives. In some embodiments, the cell-free biological sample is obtained from or derived from a pregnant object using an ethylenediaminetetraacetic acid (EDTA) collection tube, a cell-free RNA collection tube or a cell-free deoxyribonucleic acid (DNA) collection tube. In some embodiments, the method further includes fractionating a whole blood sample of a pregnant object to obtain a cell-free biological sample. In some embodiments, determination includes determination of cell-free ribonucleic acid (cfRNA) or metabolomics determination. In some embodiments, metabolomics determination includes targeted mass spectrometry (MS) or immunoassay. In some embodiments, the cell-free biological sample includes cell-free ribonucleic acid (cfRNA) or urine. In some embodiments, determination includes quantitative polymerase chain reaction (qPCR). In some embodiments, determination includes a home test configured to be performed in a home environment.

在一些实施方案中,经训练算法以至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%或至少约95%的灵敏度确定妊娠对象的妊娠相关状态的存在或升高的风险。在一些实施方案中,经训练算法以至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%或至少约95%的阳性预测值(PPV)确定妊娠对象的妊娠相关状态的存在或升高的风险。在一些实施方案中,经训练算法以至少约0.60、至少约0.65、至少约0.70、至少约0.75、至少约0.80、至少约0.85、至少约0.90或至少约0.95的曲线下面积(AUC)确定妊娠对象的妊娠相关状态的存在或升高的风险。In some embodiments, the trained algorithm determines the presence or increased risk of a pregnancy-related state in a pregnant subject with a sensitivity of at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, or at least about 95%. In some embodiments, the trained algorithm determines the presence or increased risk of a pregnancy-related state in a pregnant subject with a positive predictive value (PPV) of at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, or at least about 95%. In some embodiments, the trained algorithm determines the presence or increased risk of a pregnancy-related state in a pregnant subject with an area under the curve (AUC) of at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.85, at least about 0.90, or at least about 0.95.

在一些实施方案中,妊娠对象针对妊娠相关状态是无症状的。In some embodiments, the pregnant subject is asymptomatic for the pregnancy-related condition.

在一些实施方案中,经训练算法使用与妊娠相关状态的存在或升高的风险相关联的第一独立训练样品集和与妊娠相关状态的不存在或无升高的风险相关联的第二独立训练样品集进行训练。In some embodiments, the trained algorithm is trained using a first independent set of training samples associated with the presence or increased risk of a pregnancy-related state and a second independent set of training samples associated with the absence or no increased risk of a pregnancy-related state.

在一些实施方案中,方法进一步包括使用经训练算法或另一经训练算法来处理妊娠对象的临床健康数据集,以确定妊娠相关状态的存在或升高的风险。In some embodiments, the method further comprises processing a clinical health dataset of the pregnant subject using the trained algorithm or another trained algorithm to determine the presence or increased risk of a pregnancy-related condition.

在一些实施方案中,方法进一步包括使无细胞生物样品经受足以分离、富集或提取核糖核酸(RNA)分子、脱氧核糖核酸(DNA)分子、蛋白质或代谢物的集的条件;并且其中测定包括分析RNA分子、DNA分子、蛋白质或代谢物的集。在一些实施方案中,方法进一步包括从无细胞生物样品提取核酸分子集,以及对核酸分子集进行测序以生成测序读数集。在一些实施方案中,测序是大规模并行测序。在一些实施方案中,测序包括核酸扩增。在一些实施方案中,核酸扩增包括聚合酶链反应(PCR)。在一些实施方案中,测序包括使用逆转录(RT)和聚合酶链反应(PCR)。在一些实施方案中,方法进一步包括使用探针,这些探针被配置为选择性地富集对应于一个或多个基因组位点的分组的核酸分子集。在一些实施方案中,探针是核酸引物。在一些实施方案中,探针与一个或多个基因组位点的分组的核酸序列具有序列互补性。在一些实施方案中,一个或多个基因组位点的分组包括与自发性早产相关联的基因组位点,其中该基因组位点选自表1中列出的基因、表2中列出的基因、表3中列出的基因、对应于表4中列出的途径的基因、表9中列出的基因、表10中列出的基因以及表11中列出的基因。在一些实施方案中,一个或多个基因组位点的分组包括与早产先兆子痫相关联的基因组位点,其中该基因组位点选自表5中列出的基因、表6中列出的基因、表7中列出的基因以及表8中列出的基因。在一些实施方案中,一个或多个基因组位点的分组包括与妊娠期糖尿病相关联的基因组位点,其中该基因组位点选自PDK4、CSH1、PLAC4、TBCEL和FBXO7。在一些实施方案中,一个或多个基因组位点的分组包含至少5个不同的基因组位点、至少10个不同的基因组位点、至少15个不同的基因组位点、至少20个不同的基因组位点、至少25个不同的基因组位点、至少30个不同的基因组位点、至少35个不同的基因组位点、至少40个不同的基因组位点、至少45个不同的基因组位点、至少50个不同的基因组位点、至少100个不同的基因组位点、至少150个不同的基因组位点或至少200个不同的基因组位点。In some embodiments, the method further includes subjecting the cell-free biological sample to conditions sufficient to separate, enrich or extract a set of ribonucleic acid (RNA) molecules, deoxyribonucleic acid (DNA) molecules, proteins or metabolites; and wherein the determination includes analyzing a set of RNA molecules, DNA molecules, proteins or metabolites. In some embodiments, the method further includes extracting a set of nucleic acid molecules from the cell-free biological sample, and sequencing the set of nucleic acid molecules to generate a set of sequencing reads. In some embodiments, sequencing is massively parallel sequencing. In some embodiments, sequencing includes nucleic acid amplification. In some embodiments, nucleic acid amplification includes polymerase chain reaction (PCR). In some embodiments, sequencing includes using reverse transcription (RT) and polymerase chain reaction (PCR). In some embodiments, the method further includes using probes, which are configured to selectively enrich a set of nucleic acid molecules corresponding to a grouping of one or more genomic sites. In some embodiments, the probe is a nucleic acid primer. In some embodiments, the probe has sequence complementarity with the nucleic acid sequence of the grouping of one or more genomic sites. In some embodiments, the grouping of one or more genomic loci includes a genomic locus associated with spontaneous preterm birth, wherein the genomic locus is selected from the group consisting of genes listed in Table 1, genes listed in Table 2, genes listed in Table 3, genes corresponding to pathways listed in Table 4, genes listed in Table 9, genes listed in Table 10, and genes listed in Table 11. In some embodiments, the grouping of one or more genomic loci includes a genomic locus associated with preterm preeclampsia, wherein the genomic locus is selected from the group consisting of genes listed in Table 5, genes listed in Table 6, genes listed in Table 7, and genes listed in Table 8. In some embodiments, the grouping of one or more genomic loci includes a genomic locus associated with gestational diabetes, wherein the genomic locus is selected from the group consisting of PDK4, CSH1, PLAC4, TBCEL, and FBXO7. In some embodiments, the grouping of one or more genomic loci comprises at least 5 different genomic loci, at least 10 different genomic loci, at least 15 different genomic loci, at least 20 different genomic loci, at least 25 different genomic loci, at least 30 different genomic loci, at least 35 different genomic loci, at least 40 different genomic loci, at least 45 different genomic loci, at least 50 different genomic loci, at least 100 different genomic loci, at least 150 different genomic loci, or at least 200 different genomic loci.

在一些实施方案中,无细胞生物样品在没有核酸分离、富集或提取的情况下进行处理。In some embodiments, the cell-free biological sample is processed without nucleic acid isolation, enrichment, or extraction.

在一些实施方案中,方法进一步包括生成包括指示妊娠相关状态的经确定的存在或升高的风险的电子报告。In some embodiments, the method further comprises generating an electronic report including an indication of the determined presence or increased risk of a pregnancy-related condition.

在一些实施方案中,方法进一步包括确定妊娠对象的妊娠相关状态的存在或升高的风险的确定的似然性。In some embodiments, the method further comprises determining the likelihood of the presence or increased risk of a pregnancy-related condition in the pregnant subject.

在一些实施方案中,经训练算法包括经训练机器学习算法。在一些实施方案中,经训练机器学习算法包括深度学习算法、支持向量机(SVM)、神经网络、随机森林、线性回归模型、逻辑回归模型或ANOVA模型。In some embodiments, the trained algorithm comprises a trained machine learning algorithm. In some embodiments, the trained machine learning algorithm comprises a deep learning algorithm, a support vector machine (SVM), a neural network, a random forest, a linear regression model, a logistic regression model, or an ANOVA model.

在一些实施方案中,方法进一步包括处理生物标志物集以减少系统变异。在一些实施方案中,减少系统变异包括使用来自多元线性回归的残差来校正数据残差、执行基于经验贝叶斯方法的ComBat方法以及执行替代变量分析(SVA)校正。在一些实施方案中,系统变异包括每个样品的测序深度、个体处理操作的批次效应、各种原材料的使用、样品收集的局部外部温度、对象的BMI、胎儿分数、样品收集时的胎儿胎龄或其组合。In some embodiments, the method further comprises processing the biomarker set to reduce systematic variation. In some embodiments, reducing systematic variation comprises using residuals from multiple linear regression to correct data residuals, performing ComBat method based on empirical Bayesian method, and performing surrogate variable analysis (SVA) correction. In some embodiments, systematic variation includes sequencing depth of each sample, batch effect of individual processing operations, use of various raw materials, local external temperature of sample collection, BMI of the subject, fetal score, fetal gestational age at sample collection, or a combination thereof.

在一些实施方案中,方法进一步包括监测妊娠相关状态的存在或升高的风险,其中该监测包括在多个时间点评估妊娠对象的妊娠相关状态的存在或升高的风险,其中该评估至少基于在该多个时间点中的每一个处确定的妊娠相关状态的存在或升高的风险。在一些实施方案中,多个时间点之间对妊娠对象的妊娠相关状态的存在或升高的风险的评估的差异指示选自以下的一个或多个临床指征:(i)妊娠对象的妊娠相关状态的存在或升高的风险的诊断,(ii)妊娠对象的妊娠相关状态的存在或升高的风险的预后,以及(iii)治疗妊娠对象的妊娠相关状态的存在或升高的风险的疗程的有效性或无效性。In some embodiments, the method further comprises monitoring the presence or increased risk of a pregnancy-related state, wherein the monitoring comprises assessing the presence or increased risk of a pregnancy-related state in the pregnant subject at a plurality of time points, wherein the assessment is based at least on the presence or increased risk of a pregnancy-related state determined at each of the plurality of time points. In some embodiments, a difference in the assessment of the presence or increased risk of a pregnancy-related state in the pregnant subject between the plurality of time points indicates one or more clinical indications selected from: (i) diagnosis of the presence or increased risk of a pregnancy-related state in the pregnant subject, (ii) prognosis of the presence or increased risk of a pregnancy-related state in the pregnant subject, and (iii) effectiveness or ineffectiveness of a course of treatment for the presence or increased risk of a pregnancy-related state in the pregnant subject.

在一些实施方案中,妊娠对象处在妊娠的首三个月、妊娠的中间三个月或妊娠的末三个月。In some embodiments, the pregnant subject is in the first trimester, the second trimester, or the third trimester.

在一些实施方案中,从妊娠对象和/或非妊娠对象确定参考值。在一些实施方案中,针对参考值处理生物标志物集包括确定生物标志物集与参考值之间的差异。In some embodiments, reference values are determined from pregnant subjects and/or non-pregnant subjects. In some embodiments, processing the biomarker set against the reference value comprises determining a difference between the biomarker set and the reference value.

在另一方面,本公开提供了一种用于鉴定对象的妊娠相关状态的存在或易感性的方法,包括测定来源于对象的无细胞生物样品中的转录物和/或代谢物以检测生物标志物集,以及使用经训练算法分析该生物标志物集以确定妊娠相关状态的存在或易感性。在一些实施方案中,该方法包括测定来源于对象的无细胞生物样品中的转录物以检测该生物标志物集。在一些实施方案中,用核酸测序测定转录物。在一些实施方案中,该方法包括测定来源于对象的无细胞生物样品中的代谢物以检测该生物标志物集。在一些实施方案中,用代谢组学测定来测定代谢物。In another aspect, the present disclosure provides a method for identifying the presence or susceptibility of a pregnancy-related state in a subject, comprising determining transcripts and/or metabolites in a cell-free biological sample derived from the subject to detect a biomarker set, and analyzing the biomarker set using a trained algorithm to determine the presence or susceptibility of a pregnancy-related state. In some embodiments, the method comprises determining transcripts in a cell-free biological sample derived from the subject to detect the biomarker set. In some embodiments, the transcripts are determined using nucleic acid sequencing. In some embodiments, the method comprises determining metabolites in a cell-free biological sample derived from the subject to detect the biomarker set. In some embodiments, metabolites are determined using a metabolomics assay.

在另一方面,本公开提供了一种用于鉴定对象的妊娠相关状态的存在或易感性的方法,包括测定来源于对象的无细胞生物样品以检测生物标志物集,以及使用经训练算法分析该生物标志物集,从而确定至少三种不同的妊娠相关状态的集之中的妊娠相关状态的存在或易感性(例如,以至少约80%的精度)。In another aspect, the present disclosure provides a method for identifying the presence of or susceptibility to a pregnancy-related state in a subject, comprising assaying a cell-free biological sample derived from the subject to detect a set of biomarkers, and analyzing the set of biomarkers using a trained algorithm to determine the presence of or susceptibility to a pregnancy-related state among a set of at least three different pregnancy-related states (e.g., with an accuracy of at least about 80%).

在一些实施方案中,妊娠相关状态选自早产、足月产、胎龄、预产期、分娩发作、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压病症)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促),以及胎儿发育阶段或状态(例如,正常胎儿器官功能或发育和异常胎儿器官功能或发育)。例如,胎儿发育阶段或状态可以与胎儿器官的正常胎儿器官功能或发育和/或异常胎儿器官功能或发育有关,该胎儿器官选自心脏、大肠、小肠、视网膜、前额叶皮层、中脑、肾脏和食道。In some embodiments, the pregnancy-related condition is selected from preterm birth, term birth, gestational age, due date, onset of labor, pregnancy-related hypertensive disorders (e.g., preeclampsia), eclampsia, gestational diabetes, congenital disorders of the subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia, and hypertensive disorders), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine/fetal growth restriction, macrosomia (for fetuses The invention also relates to fetal development stages or states (e.g., normal fetal organ function or development and abnormal fetal organ function or development). For example, the fetal development stage or state can be associated with normal fetal organ function or development and/or abnormal fetal organ function or development of a fetal organ selected from the group consisting of heart, large intestine, small intestine, retina, prefrontal cortex, midbrain, kidney, and esophagus.

在一些实施方案中,妊娠相关状态是早产亚型,并且至少三种不同的妊娠相关状态包括至少两种不同的早产亚型。在一些实施方案中,早产亚型是早产分子亚型,并且至少两种不同的早产亚型包括至少两种不同的早产分子亚型。在一些实施方案中,不同的早产分子亚型包括选自以下早产分子亚型:既往早产的存在或病史、自发性早产的存在或病史、晚期流产的存在或病史、接受宫颈手术的存在或病史、子宫异常的存在或病史、种族特异性早产风险(例如,在非洲裔美国人中)的存在或病史,以及早产胎膜早破(PPROM)的存在或病史。In some embodiments, the pregnancy-related state is a preterm birth subtype, and at least three different pregnancy-related states include at least two different preterm birth subtypes. In some embodiments, the preterm birth subtype is a preterm birth molecular subtype, and at least two different preterm birth subtypes include at least two different preterm birth molecular subtypes. In some embodiments, different preterm birth molecular subtypes include selected from the following preterm birth molecular subtypes: the presence or history of previous preterm birth, the presence or history of spontaneous preterm birth, the presence or history of late miscarriage, the presence or history of receiving cervical surgery, the presence or history of uterine abnormalities, the presence or history of ethnic-specific preterm birth risk (e.g., in African Americans), and the presence or history of preterm premature rupture of membranes (PPROM).

在一些实施方案中,妊娠相关状态是先兆子痫亚型,并且至少三种不同的妊娠相关状态包括至少两种不同的子痫亚型。在一些实施方案中,不同的先兆子痫分子亚型包括选自以下的先兆子痫分子亚型:慢性或原有高血压的存在或病史、妊娠高血压的存在或病史、轻度先兆子痫的存在或病史(例如,在大于34周胎龄分娩)、重度先兆子痫的存在或病史(例如,在小于34周胎龄分娩)、子痫的存在或病史,以及HELLP综合征的存在或病史。In some embodiments, the pregnancy-related state is a subtype of pre-eclampsia, and the at least three different pregnancy-related states include at least two different subtypes of eclampsia. In some embodiments, the different molecular subtypes of pre-eclampsia include molecular subtypes of pre-eclampsia selected from the following: the presence or history of chronic or pre-existing hypertension, the presence or history of gestational hypertension, the presence or history of mild pre-eclampsia (e.g., delivery at greater than 34 weeks gestational age), the presence or history of severe pre-eclampsia (e.g., delivery at less than 34 weeks gestational age), the presence or history of eclampsia, and the presence or history of HELLP syndrome.

在一些实施方案中,该方法进一步包括至少部分地基于妊娠相关状态的存在或易感性来鉴定对象的临床干预。在一些实施方案中,该临床干预选自多个临床干预。在一些实施方案中,该方法进一步包括确定对象的妊娠相关状态的易感性的似然性,之后可以向对象提供临床干预。在一些实施方案中,临床干预包括药理学、外科或规程治疗,以减轻对象的未来易感性妊娠相关状态的严重性、延迟或消除未来易感性妊娠相关状态(例如,用于先兆子痫的阿司匹林和用于早产的类固醇)。In some embodiments, the method further includes a clinical intervention to identify an object at least in part based on the presence or susceptibility of pregnancy-related states. In some embodiments, the clinical intervention is selected from a plurality of clinical interventions. In some embodiments, the method further includes determining the likelihood of the susceptibility of the pregnancy-related states of the object, after which the clinical intervention can be provided to the object. In some embodiments, the clinical intervention includes pharmacology, surgery or protocol treatment, to mitigate the severity of the future susceptibility pregnancy-related states of the object, delay or eliminate future susceptibility pregnancy-related states (e.g., aspirin for pre-eclampsia and steroids for premature delivery).

在另一方面,本公开提供了方法,该方法包括测定来源于对象的无细胞生物样品;鉴定对象患有先兆子痫或有患有先兆子痫的风险;以及在鉴定对象患有先兆子痫或有患有先兆子痫的风险后,向对象施用抗高血压药物。In another aspect, the present disclosure provides a method comprising assaying a cell-free biological sample derived from a subject; identifying the subject as having or at risk of having pre-eclampsia; and administering an antihypertensive drug to the subject after identifying the subject as having or at risk of having pre-eclampsia.

在一些实施方案中,无细胞生物样品是在给定胎龄间隔内收集自对象,用于检测妊娠相关状态。在一些实施方案中,该给定胎龄间隔为给定胎龄的约1天、约2天、约3天、约4天、约5天、约6天、约7天、约8天、约9天、约10天、约11天、约12天、约13天、约14天、约3周或约4周内。在一些实施方案中,该给定胎龄为约0周、约1周、约2周、约3周、约4周、约5周、约6周、约7周、约8周、约9周、约10周、约11周、约12周、约13周、约14周、约15周、约16周、约17周、约18周、约19周、约20周、约21周、约22周、约23周、约24周、约25周、约26周、约27周、约28周、约29周、约30周、约31周、约32周、约33周、约34周、约35周、约36周、约37周、约38周、约39周、约40周、约41周、约42周、约43周、约44周或约45周。在一些实施方案中,妊娠相关状态包括以下一种或多种:早产、分娩发作、妊娠相关高血压病症、先兆子痫、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况,以及异常胎儿发育阶段或状态。例如,胎儿发育阶段或状态可以与胎儿器官的正常胎儿器官功能或发育和/或异常胎儿器官功能或发育有关,该胎儿器官选自心脏、大肠、小肠、视网膜、前额叶皮层、中脑、肾脏和食道。In some embodiments, the cell-free biological sample is collected from a subject within a given gestational age interval for detecting a pregnancy-related condition. In some embodiments, the given gestational age interval is within about 1 day, about 2 days, about 3 days, about 4 days, about 5 days, about 6 days, about 7 days, about 8 days, about 9 days, about 10 days, about 11 days, about 12 days, about 13 days, about 14 days, about 3 weeks, or about 4 weeks of a given gestational age. In some embodiments, the given gestational age is about 0 weeks, about 1 week, about 2 weeks, about 3 weeks, about 4 weeks, about 5 weeks, about 6 weeks, about 7 weeks, about 8 weeks, about 9 weeks, about 10 weeks, about 11 weeks, about 12 weeks, about 13 weeks, about 14 weeks, about 15 weeks, about 16 weeks, about 17 weeks, about 18 weeks, about 19 weeks, about 20 weeks, about 21 weeks, about 22 weeks, about 23 weeks, about 24 weeks, about 25 weeks, about 26 weeks, about 27 weeks, about 28 weeks, about 29 weeks, about 30 weeks, about 31 weeks, about 32 weeks, about 33 weeks, about 34 weeks, about 35 weeks, about 36 weeks, about 37 weeks, about 38 weeks, about 39 weeks, about 40 weeks, about 41 weeks, about 42 weeks, about 43 weeks, about 44 weeks, or about 45 weeks. In some embodiments, pregnancy-related conditions include one or more of: premature birth, onset of labor, pregnancy-related hypertensive disorders, pre-eclampsia, eclampsia, gestational diabetes, congenital disorders of the subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications, hyperemesis gravidarum (morning sickness), bleeding or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine/fetal growth restriction, macrosomia (large fetus for gestational age), neonatal conditions, and abnormal fetal development stages or states. For example, the fetal development stage or state can be associated with normal fetal organ function or development and/or abnormal fetal organ function or development of a fetal organ selected from the group consisting of heart, large intestine, small intestine, retina, prefrontal cortex, midbrain, kidney, and esophagus.

在一些实施方案中,(a)包括(i)使无细胞生物样品经受足以分离、富集或提取核糖核酸(RNA)分子、脱氧核糖核酸(DNA)分子、转录产物(例如,信使RNA、转移RNA或核糖体RNA)、蛋白质(例如,对应于妊娠相关联基因组位点或基因的妊娠相关蛋白质)或代谢物的集的条件,以及(ii)使用第一测定分析RNA分子、DNA分子、蛋白质或代谢物的集,以生成第一数据集。在一些实施方案中,该方法进一步包括从无细胞生物样品提取核酸分子集,以及对核酸分子集进行测序以生成测序读数集,其中第一数据集包括测序读数集。在一些实施方案中,(b)包括(i)使阴道或宫颈生物样品经受于足以分离、富集或提取微生物群的条件,以及(ii)使用第二测定分析微生物群以生成第二数据集。In some embodiments, (a) includes (i) subjecting the cell-free biological sample to conditions sufficient to separate, enrich or extract a set of ribonucleic acid (RNA) molecules, deoxyribonucleic acid (DNA) molecules, transcripts (e.g., messenger RNA, transfer RNA or ribosomal RNA), proteins (e.g., pregnancy-associated proteins corresponding to pregnancy-associated genomic sites or genes), or metabolites, and (ii) analyzing the set of RNA molecules, DNA molecules, proteins or metabolites using a first assay to generate a first data set. In some embodiments, the method further includes extracting a set of nucleic acid molecules from the cell-free biological sample, and sequencing the set of nucleic acid molecules to generate a set of sequencing reads, wherein the first data set includes the set of sequencing reads. In some embodiments, (b) includes (i) subjecting the vaginal or cervical biological sample to conditions sufficient to separate, enrich or extract a microbiome, and (ii) analyzing the microbiome using a second assay to generate a second data set.

在一些实施方案中,测序是大规模并行测序。在一些实施方案中,测序包括核酸扩增。在一些实施方案中,核酸扩增包括聚合酶链反应(PCR)。在一些实施方案中,测序包括使用同时的逆转录(RT)和聚合酶链反应(PCR)。在一些实施方案中,该方法进一步包括使用探针,这些探针被配置为选择性地富集对应于一个或多个基因组位点的分组的核酸分子集。在一些实施方案中,探针是核酸引物。在一些实施方案中,探针与一个或多个基因组位点的分组的核酸序列具有序列互补性。In some embodiments, sequencing is large-scale parallel sequencing. In some embodiments, sequencing includes nucleic acid amplification. In some embodiments, nucleic acid amplification includes polymerase chain reaction (PCR). In some embodiments, sequencing includes using simultaneous reverse transcription (RT) and polymerase chain reaction (PCR). In some embodiments, the method further includes using probes, which are configured to selectively enrich for nucleic acid molecule sets corresponding to the groupings of one or more genomic sites. In some embodiments, the probe is a nucleic acid primer. In some embodiments, the probe has sequence complementarity with the nucleic acid sequence of the groupings of one or more genomic sites.

在一些实施方案中,一个或多个基因组位点的分组包括与预产期相关联的基因组位点。在一些实施方案中,一个或多个基因组位点的分组包括与胎龄相关联的基因组位点。在一些实施方案中,一个或多个基因组位点的分组包括与早产相关联的基因组位点。在一些实施方案中,一个或多个基因组位点的分组包括与先兆子痫相关联的基因组位点。在一些实施方案中,一个或多个基因组位点的分组包括与胎儿器官发育相关联的基因组位点。在一些实施方案中,一个或多个基因组位点的分组包括与妊娠期糖尿病相关联的基因组位点。In some embodiments, the grouping of one or more genomic sites includes genomic sites associated with due date. In some embodiments, the grouping of one or more genomic sites includes genomic sites associated with gestational age. In some embodiments, the grouping of one or more genomic sites includes genomic sites associated with premature birth. In some embodiments, the grouping of one or more genomic sites includes genomic sites associated with pre-eclampsia. In some embodiments, the grouping of one or more genomic sites includes genomic sites associated with fetal organ development. In some embodiments, the grouping of one or more genomic sites includes genomic sites associated with gestational diabetes.

在一些实施方案中,无细胞生物样品在没有核酸分离、富集或提取的情况下进行处理。In some embodiments, the cell-free biological sample is processed without nucleic acid isolation, enrichment, or extraction.

在一些实施方案中,报告呈现在用户的电子设备的图形用户界面上。在一些实施方案中,用户是对象。In some embodiments, the report is presented on a graphical user interface of the user's electronic device.In some embodiments, the user is the subject.

在一些实施方案中,该方法进一步包括确定对象的妊娠相关状态的存在或易感性的确定的似然性。In some embodiments, the method further comprises determining a likelihood of a determination of the presence or susceptibility of a pregnancy-related condition in the subject.

在一些实施方案中,经训练算法包括监督机器学习算法。在一些实施方案中,监督机器学习算法包括深度学习算法、支持向量机(SVM)、神经网络或随机森林。在一些实施方案中,经训练算法包括差异表达算法。在一些实施方案中,差异表达算法包括随机模型、广义泊松(GPseq)、混合泊松(TSPM)、泊松对数线性(PoissonSeq)、负二项式(edgeR、DESeq、baySeq、NBPSeq)、MAANOVA拟合的线性模型或其组合的使用比较。In some embodiments, the trained algorithm comprises a supervised machine learning algorithm. In some embodiments, the supervised machine learning algorithm comprises a deep learning algorithm, a support vector machine (SVM), a neural network, or a random forest. In some embodiments, the trained algorithm comprises a differential expression algorithm. In some embodiments, the differential expression algorithm comprises a random model, a generalized Poisson (GPseq), a mixed Poisson (TSPM), a Poisson log-linear (PoissonSeq), a negative binomial (edgeR, DESeq, baySeq, NBPSeq), a linear model fitted by MAANOVA, or a combination thereof.

在一些实施方案中,该方法进一步包括向对象提供针对妊娠相关状态的存在或易感性的治疗性干预。在一些实施方案中,治疗性干预包括己酸羟孕酮、阴道黄体酮、天然黄体酮IVR产物、前列腺素F2α受体拮抗剂或β2-肾上腺素能受体激动剂。In some embodiments, the method further comprises providing the subject with a therapeutic intervention for the presence or susceptibility to a pregnancy-related condition. In some embodiments, the therapeutic intervention comprises hydroxyprogesterone caproate, vaginal progesterone, a natural progesterone IVR product, a prostaglandin F2α receptor antagonist, or a β2-adrenergic receptor agonist.

在一些实施方案中,该方法进一步包括监测妊娠相关状态的存在或易感性,其中监测包括在多个时间点评估对象的妊娠相关状态的存在或易感性,其中评估至少基于在多个时间点中的每一个处在(d)中确定的妊娠相关状态的存在或易感性。In some embodiments, the method further comprises monitoring the presence of or susceptibility to a pregnancy-related condition, wherein the monitoring comprises assessing the subject's presence of or susceptibility to a pregnancy-related condition at a plurality of time points, wherein the assessing is based at least on the presence of or susceptibility to the pregnancy-related condition determined in (d) at each of the plurality of time points.

在一些实施方案中,多个时间点之间对对象的妊娠相关状态的存在或易感性的评估的差异指示选自以下的一个或多个临床指征:(i)对象的妊娠相关状态的存在或易感性的诊断,(ii)对象的妊娠相关状态的存在或易感性的预后,以及(iii)治疗对象的妊娠相关状态的存在或易感性的疗程的有效性或无效性。In some embodiments, the difference in the assessment of the presence or susceptibility of a pregnancy-related condition in a subject between multiple time points is indicative of one or more clinical indications selected from: (i) diagnosis of the presence or susceptibility of a pregnancy-related condition in a subject, (ii) prognosis of the presence or susceptibility of a pregnancy-related condition in a subject, and (iii) effectiveness or ineffectiveness of a course of treatment for the presence or susceptibility of a pregnancy-related condition in a subject.

在一些实施方案中,该方法进一步包括通过使用经训练算法从多个不同的早产分子亚型中确定早产分子亚型来对早产进行分层。在一些实施方案中,该多个不同的早产分子亚型包括选自以下的早产分子亚型:既往早产的存在或病史、自发性早产的存在或病史、晚期流产的存在或病史、接受宫颈手术的存在或病史、子宫异常的存在或病史、种族特异性早产风险(例如,在非洲裔美国人中)的存在或病史,以及早产胎膜早破(PPROM)的存在或病史。In some embodiments, the method further comprises stratifying preterm birth by determining a preterm birth molecular subtype from a plurality of different preterm birth molecular subtypes using a trained algorithm. In some embodiments, the plurality of different preterm birth molecular subtypes comprises a preterm birth molecular subtype selected from the following: the presence or history of a previous preterm birth, the presence or history of a spontaneous preterm birth, the presence or history of a late miscarriage, the presence or history of undergoing cervical surgery, the presence or history of a uterine abnormality, the presence or history of a race-specific risk of preterm birth (e.g., in African Americans), and the presence or history of preterm premature rupture of membranes (PPROM).

在一些实施方案中,该方法进一步包括通过使用经训练算法从多个不同的先兆子痫分子亚型中确定先兆子痫分子亚型来对先兆子痫进行分层,该多个不同的先兆子痫分子亚型包括选自以下的先兆子痫分子亚型:慢性/原有高血压、妊娠高血压、轻度先兆子痫(例如,在>34周分娩)、重度先兆子痫(例如,在<34周分娩)、子痫以及HELLP综合征的病史。In some embodiments, the method further comprises stratifying pre-eclampsia by determining a pre-eclampsia molecular subtype from a plurality of different pre-eclampsia molecular subtypes using a trained algorithm, the plurality of different pre-eclampsia molecular subtypes comprising a pre-eclampsia molecular subtype selected from the following: chronic/pre-existing hypertension, gestational hypertension, mild pre-eclampsia (e.g., delivery at >34 weeks), severe pre-eclampsia (e.g., delivery at <34 weeks), eclampsia, and a history of HELLP syndrome.

在一些实施方案中,该方法进一步包括至少部分基于指示早产风险的风险评分向对象提供治疗性干预。在一些实施方案中,治疗性干预包括己酸羟孕酮、阴道黄体酮、天然黄体酮IVR产物、前列腺素F2α受体拮抗剂、或β2-肾上腺素能受体激动剂。In some embodiments, the method further comprises providing a therapeutic intervention to the subject based at least in part on a risk score indicative of risk for preterm birth. In some embodiments, the therapeutic intervention comprises hydroxyprogesterone caproate, vaginal progesterone, a natural progesterone IVR product, a prostaglandin F2α receptor antagonist, or a β2-adrenergic receptor agonist.

在一些实施方案中,该方法进一步包括至少部分基于指示先兆子痫风险的风险评分向对象提供治疗性干预。在一些实施方案中,治疗性干预包括抗高血压药物治疗(诸如但不限于肼苯哒嗪、拉贝洛尔、硝苯地平和硝普钠)、管理或预防癫痫发作(诸如但不限于硫酸镁、苯妥英和地西泮),或通过低剂量阿司匹林治疗(例如,每天100g或更少)预防来先兆子痫的发病率。In some embodiments, the method further comprises providing therapeutic intervention to the subject based at least in part on a risk score indicative of risk of pre-eclampsia. In some embodiments, therapeutic intervention comprises antihypertensive drug therapy (such as, but not limited to, hydralazine, labetalol, nifedipine, and sodium nitroprusside), management or prevention of seizures (such as, but not limited to, magnesium sulfate, phenytoin, and diazepam), or prevention of the incidence of pre-eclampsia by low-dose aspirin therapy (e.g., 100 g or less per day).

在一些实施方案中,该方法进一步包括监测早产风险,其中监测包括在多个时间点评估对象的早产风险,其中评估至少基于在多个时间点中的每一个处在(b)中确定的指示早产风险的风险评分。In some embodiments, the method further comprises monitoring the risk of preterm birth, wherein the monitoring comprises assessing the subject's risk of preterm birth at a plurality of time points, wherein the assessing is based at least on a risk score indicative of the risk of preterm birth determined in (b) at each of the plurality of time points.

在一些实施方案中,该方法进一步包括监测先兆子痫风险,其中监测包括在多个时间点评估对象的先兆子痫风险,其中评估至少基于在多个时间点的每一个处在(b)中确定的指示先兆子痫风险的风险评分。In some embodiments, the method further comprises monitoring the risk of pre-eclampsia, wherein the monitoring comprises assessing the subject's risk of pre-eclampsia at a plurality of time points, wherein the assessing is based at least on the risk score indicative of the risk of pre-eclampsia determined in (b) at each of the plurality of time points.

在一些实施方案中,该方法进一步包括通过对对象进行一项或多项后续临床测试来改进指示对象的先兆子痫风险的风险评分,以及使用经训练算法处理来源于一项或多项后续临床测试的结果,以确定指示对象的先兆子痫风险的经更新风险评分。在一些实施方案中,一项或多项后续临床测试包括超声成像或血液测试。在一些实施方案中,风险评分包括对象在预定持续时间内患有先兆子痫的似然性。In some embodiments, the method further comprises improving the risk score indicating the risk of pre-eclampsia of the subject by performing one or more subsequent clinical tests on the subject, and processing the results derived from the one or more subsequent clinical tests using the trained algorithm to determine an updated risk score indicating the risk of pre-eclampsia of the subject. In some embodiments, the one or more subsequent clinical tests include ultrasound imaging or blood tests. In some embodiments, the risk score includes the likelihood that the subject will suffer from pre-eclampsia within a predetermined duration.

在一些实施方案中,该方法进一步包括通过对对象进行一项或多项后续临床测试来改进指示对象的先兆子痫的风险评分,以及使用经训练算法处理来源于一项或多项后续临床测试的结果,以确定指示对象的先兆子痫的经更新风险评分。在一些实施方案中,一项或多项后续临床测试包括超声成像或血液测试。在一些实施方案中,风险评分包括对象在预定持续时间内患有先兆子痫的似然性。In some embodiments, the method further comprises improving the risk score of pre-eclampsia of the indicated subject by performing one or more subsequent clinical tests on the subject, and processing the results derived from the one or more subsequent clinical tests using the trained algorithm to determine an updated risk score of pre-eclampsia of the indicated subject. In some embodiments, the one or more subsequent clinical tests comprise ultrasound imaging or blood tests. In some embodiments, the risk score comprises the likelihood that the subject suffers from pre-eclampsia within a predetermined duration.

在一些实施方案中,预定持续时间为约1小时、约2小时、约4小时、约6小时、约8小时、约10小时、约12小时、约14小时、约16小时、约18小时、约20小时、约22小时、约24小时、约1.5天、约2天、约2.5天、约3天、约3.5天、约4天、约4.5天、约5天、约5.5天、约6天、约6.5天、约7天、约8天、约9天、约10天、约12天、约14天、约3周、约4周、约5周、约6周、约7周、约8周、约9周、约10周、约11周、约12周、约13周或大于约13周。In some embodiments, the predetermined duration is about 1 hour, about 2 hours, about 4 hours, about 6 hours, about 8 hours, about 10 hours, about 12 hours, about 14 hours, about 16 hours, about 18 hours, about 20 hours, about 22 hours, about 24 hours, about 1.5 days, about 2 days, about 2.5 days, about 3 days, about 3.5 days, about 4 days, about 4.5 days, about 5 days, about 5.5 days, about 6 days, about 6.5 days, about 7 days, about 8 days, about 9 days, about 10 days, about 12 days, about 14 days, about 3 weeks, about 4 weeks, about 5 weeks, about 6 weeks, about 7 weeks, about 8 weeks, about 9 weeks, about 10 weeks, about 11 weeks, about 12 weeks, about 13 weeks, or more than about 13 weeks.

在一些实施方案中,该方法进一步包括向对象提供用于妊娠相关状态的存在或易感性的治疗性干预。在一些实施方案中,治疗性干预包括黄体酮治疗,诸如己酸羟孕酮(例如,17-α己酸羟孕酮(17-P)、来自Lipocine的LPCN 1107、来自AMAG制药的Makena)、阴道黄体酮或天然黄体酮IVR产品(例如,来自Juniper Pharma的DARE-FRT1(JNP-0301));前列腺素F2α受体拮抗剂(例如,来自ObsEva的OBE022);或β2-肾上腺素能受体激动剂(例如,来自MediciNova的硫酸贝多拉君(bedoradrine sulfate)(MN-221))。治疗干预可以由例如,“WHO Recommendations on Interventions to Improve Preterm Birth Outcomes”,ISBN9789241508988,世界卫生组织,2015描述,其通过引用整体并入本文。在一些实施方案中,该方法进一步包括监测妊娠相关状态的存在或易感性,其中监测包括在多个时间点评估对象的妊娠相关状态的存在或易感性,其中评估至少基于在多个时间点中的每一个处在(d)中确定的妊娠相关状态的存在或易感性。在一些实施方案中,多个时间点之间对对象的妊娠相关状态的存在或易感性的评估的差异指示选自以下的一个或多个临床指征:(i)对象的妊娠相关状态的存在或易感性的诊断,(ii)对象的妊娠相关状态的存在或易感性的预后,以及(iii)治疗对象的妊娠相关状态的存在或易感性的疗程的有效性或无效性。In some embodiments, the method further includes providing a therapeutic intervention for the presence or susceptibility of pregnancy-related states to the subject. In some embodiments, therapeutic intervention includes progesterone treatment, such as hydroxyprogesterone caproate (e.g., 17-α hydroxyprogesterone caproate (17-P), LPCN 1107 from Lipocine, Makena from AMAG Pharmaceuticals), vaginal progesterone or natural progesterone IVR products (e.g., DARE-FRT1 (JNP-0301) from Juniper Pharma); Prostaglandin F2α receptor antagonists (e.g., OBE022 from ObsEva); or β2-adrenergic receptor agonists (e.g., bedoradrine sulfate (MN-221) from MediciNova). Therapeutic intervention can be by, for example, "WHO Recommendations on Interventions to Improve Preterm Birth Outcomes", ISBN9789241508988, World Health Organization, 2015 description, which is incorporated herein by reference as a whole. In some embodiments, the method further comprises monitoring the presence or susceptibility to a pregnancy-related state, wherein the monitoring comprises assessing the presence or susceptibility to a pregnancy-related state in the subject at a plurality of time points, wherein the assessment is based at least on the presence or susceptibility to the pregnancy-related state determined in (d) at each of the plurality of time points. In some embodiments, a difference in the assessment of the presence or susceptibility to a pregnancy-related state in the subject between the plurality of time points indicates one or more clinical indications selected from: (i) diagnosis of the presence or susceptibility to a pregnancy-related state in the subject, (ii) prognosis of the presence or susceptibility to a pregnancy-related state in the subject, and (iii) effectiveness or ineffectiveness of a course of treatment for the presence or susceptibility to a pregnancy-related state in the subject.

在一些实施方案中,该方法进一步包括通过使用经训练算法从多个不同的早产分子亚型中确定早产分子亚型来对早产进行分层。在一些实施方案中,该多个不同的早产分子亚型包括选自以下的早产分子亚型:既往早产的存在或病史、自发性早产的存在或病史、晚期流产的存在或病史、接受宫颈手术的存在或病史、子宫异常的存在或病史、种族特异性早产风险(例如,在非洲裔美国人中)的存在或病史,以及早产胎膜早破(PPROM)的存在或病史。In some embodiments, the method further comprises stratifying preterm birth by determining a preterm birth molecular subtype from a plurality of different preterm birth molecular subtypes using a trained algorithm. In some embodiments, the plurality of different preterm birth molecular subtypes comprises a preterm birth molecular subtype selected from the following: the presence or history of a previous preterm birth, the presence or history of a spontaneous preterm birth, the presence or history of a late miscarriage, the presence or history of undergoing cervical surgery, the presence or history of a uterine abnormality, the presence or history of a race-specific risk of preterm birth (e.g., in African Americans), and the presence or history of preterm premature rupture of membranes (PPROM).

在一些实施方案中,该方法进一步包括通过使用经训练算法从多个不同的先兆子痫分子亚型中确定先兆子痫分子亚型来对先兆子痫进行分层。在一些实施方案中,该多个不同的先兆子痫分子亚型包括选自以下的先兆子痫分子亚型:慢性或原有高血压的存在或病史、妊娠高血压的存在或病史、轻度先兆子痫的存在或病史(例如,分娩大于34周胎龄)、重度先兆子痫的存在或病史(分娩小于34周胎龄)、子痫的存在或病史,以及HELLP综合征的存在或病史。In some embodiments, the method further comprises stratifying pre-eclampsia by determining a pre-eclampsia molecular subtype from a plurality of different pre-eclampsia molecular subtypes using a trained algorithm. In some embodiments, the plurality of different pre-eclampsia molecular subtypes include pre-eclampsia molecular subtypes selected from the following: the presence or history of chronic or pre-existing hypertension, the presence or history of gestational hypertension, the presence or history of mild pre-eclampsia (e.g., delivery greater than 34 weeks gestational age), the presence or history of severe pre-eclampsia (delivery less than 34 weeks gestational age), the presence or history of eclampsia, and the presence or history of HELLP syndrome.

在一些实施方案中,该方法进一步包括使用经训练算法分析生物标志物集。在一些实施方案中,健康或生理状况选自早产、足月产、胎龄、预产期、分娩发作、妊娠相关高血压病症、子痫、妊娠期糖尿病、对象的胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症、妊娠剧吐、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘、宫内/胎儿生长受限、巨大儿、新生儿状况,以及胎儿发育阶段或状态。在一些实施方案中,生物标志物集包括与预产期、胎龄、早产、先兆子痫、胎儿器官发育或妊娠期糖尿病相关联的基因组位点。在一些实施方案中,方法进一步包括至少部分地基于生物标志物集来选择妊娠对象的或妊娠对象的胎儿的健康或生理状况的治疗性干预。在一些实施方案中,治疗性干预选自多个治疗性干预之中。在一些实施方案中,至少部分地基于健康或生理状况(至少部分地基于生物标志物集所确定的)的分子亚型来选择治疗性干预。In some embodiments, the method further includes analyzing the biomarker set using a trained algorithm. In some embodiments, the health or physiological condition is selected from preterm birth, term birth, gestational age, expected date of delivery, onset of labor, pregnancy-related hypertension, eclampsia, gestational diabetes, congenital conditions of the fetus of the object, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications, hyperemesis gravidarum, bleeding or excessive bleeding during delivery, premature rupture of membranes, premature premature rupture of membranes, placenta previa, intrauterine/fetal growth restriction, macrosomia, neonatal conditions, and fetal development stage or state. In some embodiments, the biomarker set includes genomic sites associated with expected date of delivery, gestational age, premature birth, preeclampsia, fetal organ development or gestational diabetes. In some embodiments, the method further includes selecting a therapeutic intervention for the health or physiological condition of a pregnant object or a fetus of a pregnant object based at least in part on a biomarker set. In some embodiments, the therapeutic intervention is selected from a plurality of therapeutic interventions. In some embodiments, the therapeutic intervention is selected at least in part based on a molecular subtype of a health or physiological condition (determined at least in part based on a biomarker set).

在一些实施方案中,基于与含胶原的细胞外基质途径相关的早产分子亚型以及包含选自胶原调节治疗剂(如凝血酶、Y-27632、TNFα和吲哚美辛)的药物来选择治疗性干预。In some embodiments, the therapeutic intervention is selected based on the molecular subtype of preterm labor associated with collagen-containing extracellular matrix pathways and comprises a drug selected from the group consisting of collagen modulating therapeutics such as thrombin, Y-27632, TNFα, and indomethacin.

在一些实施方案中,基于与细胞外基质(ECM)途径相关的早产分子亚型以及包含治疗剂来选择治疗性干预,其中该治疗剂是癌胚纤连蛋白调节剂。在一些实施方案中,治疗剂是糖皮质激素抑制剂。在一些实施方案中,治疗剂是地塞米松。在一些实施方案中,治疗剂是环己酰亚胺。In some embodiments, therapeutic intervention is selected based on the molecular subtype of premature birth associated with the extracellular matrix (ECM) pathway and the inclusion of a therapeutic agent, wherein the therapeutic agent is an oncofetal fibronectin regulator. In some embodiments, the therapeutic agent is a glucocorticoid inhibitor. In some embodiments, the therapeutic agent is dexamethasone. In some embodiments, the therapeutic agent is cycloheximide.

在一些实施方案中,基于与内质网(ER)腔途径(与由蜕膜细胞中的氧化应激诱导的内质网应激相关联)相关的早产分子亚型以及包含选自ER应激抑制剂4-苯丁酸(4-PBA)和牛磺熊去氧胆酸(TUDCA)的药物来选择治疗性干预。In some embodiments, the therapeutic intervention is selected based on a molecular subtype of preterm birth associated with the endoplasmic reticulum (ER) luminal pathway (associated with ER stress induced by oxidative stress in decidual cells) and comprises a drug selected from the ER stress inhibitors 4-phenylbutyric acid (4-PBA) and tauroursodeoxycholic acid (TUDCA).

在一些实施方案中,基于炎症途径相关的早产分子亚型以及包含选自非特异性NF-κB抑制剂、TLR4拮抗剂、TNF-α生物制剂、CTHE(新型抗炎药物):p38 MAPK抑制剂(SKF-86002、SB202190和SB239063)、IKK复合物抑制剂(NBNI、小白菊内酯和TPCA-1)或TAK1抑制剂(5z-7-氧杂玉米烯醇(OxZnl))的药物来选择治疗性干预。In some embodiments, therapeutic intervention is selected based on molecular subtypes of preterm birth associated with inflammatory pathways and comprises a drug selected from a non-specific NF-κB inhibitor, a TLR4 antagonist, a TNF-α biologic, a CTHE (a novel anti-inflammatory drug): a p38 MAPK inhibitor (SKF-86002, SB202190, and SB239063), an IKK complex inhibitor (NBNI, parthenolide, and TPCA-1), or a TAK1 inhibitor (5z-7-oxa-zearalenol (OxZnl)).

在一些实施方案中,基于胰岛素生长因子转运途径相关的早产分子亚型以及包含选自二甲双胍、胰岛素样生长因子1(IGF-1)、胰岛素样生长因子结合蛋白-3(IGFBP-3)以及葡萄糖转运蛋白(GLUT3、GLUT8和GLUT9)调节剂的治疗剂来选择治疗性干预。在一些实施方案中,治疗剂是二甲双胍。在一些实施方案中,治疗剂是IGF-1。在一些实施方案中,治疗剂是IGFBP-3。在一些实施方案中,治疗剂是GLUT3的调节剂。在一些实施方案中,治疗剂是GLUT8的调节剂。在一些实施方案中,治疗剂是GLUT9的调节剂。In some embodiments, therapeutic intervention is selected based on the molecular subtype of premature birth associated with the insulin growth factor transport pathway and a therapeutic agent selected from metformin, insulin-like growth factor 1 (IGF-1), insulin-like growth factor binding protein-3 (IGFBP-3), and glucose transporter (GLUT3, GLUT8 and GLUT9) regulators. In some embodiments, the therapeutic agent is metformin. In some embodiments, the therapeutic agent is IGF-1. In some embodiments, the therapeutic agent is IGFBP-3. In some embodiments, the therapeutic agent is a regulator of GLUT3. In some embodiments, the therapeutic agent is a regulator of GLUT8. In some embodiments, the therapeutic agent is a regulator of GLUT9.

在一些实施方案中,基于氨基酸和衍生物的代谢相关的早产分子亚型以及包含治疗剂来选择治疗性干预,其中该治疗剂是内源性代谢调节剂(EMM)。In some embodiments, therapeutic intervention is selected based on the metabolism of amino acids and derivatives associated with preterm molecular subtypes and includes a therapeutic agent, wherein the therapeutic agent is an endogenous metabolic modulator (EMM).

在一些实施方案中,健康或生理状况包括先兆子痫。在一些实施方案中,用于先兆子痫的治疗性干预包括药物、补充剂或生活方式建议。在一些实施方案中,药物选自阿司匹林、黄体酮、硫酸镁、胆固醇药物(诸如普伐他汀)、胃灼热药物(诸如埃索美拉唑)、血管紧张素II受体拮抗剂(诸如氯沙坦)、钙通道阻滞剂(诸如硝苯地平)、糖尿病药物(诸如肌醇、二甲双胍、glucovance和利拉鲁肽)和勃起功能障碍药物(诸如枸橼酸西地那非)。在一些实施方案中,补充剂选自钙、维生素D、维生素B3和DHA。在一些实施方案中,生活方式建议选自运动、营养咨询、冥想、缓解压力、减肥或维持体重以及改善睡眠质量。在一些实施方案中,用于先兆子痫的治疗性干预选自如以下所公开的治疗性预防(例如,治疗或预防)“WHOrecommendations:Prevention and treatment of pre-eclampsia and eclampsia”,世界卫生组织,ISBN 9789241548335,世界卫生组织,2011,其通过引用整体并入本文。在一些实施方案中,用于先兆子痫的治疗性干预选自如以下所公开的治疗性预防(例如,治疗或预防)“Summary of recommendations:Prevention and treatment of pre-eclampsia andeclampsia”,世界卫生组织,WHO参考编号WHO/RHR/11.30,世界卫生组织,2011,其通过引用整体并入本文。在一些实施方案中,用于先兆子痫的治疗性干预选自如以下所公开的治疗性预防(例如,治疗或预防)“WHO recommendations:Drug treatment for severehypertension in pregnancy”,世界卫生组织,ISBN 9789241550437,世界卫生组织,2018,其通过引用整体并入本文。In some embodiments, health or physiological condition include pre-eclampsia. In some embodiments, therapeutic intervention for pre-eclampsia includes medicine, supplement or lifestyle advice. In some embodiments, medicine is selected from aspirin, progesterone, magnesium sulfate, cholesterol medicine (such as pravastatin), heartburn medicine (such as esomeprazole), angiotensin II receptor antagonist (such as losartan), calcium channel blocker (such as nifedipine), diabetes medicine (such as inositol, metformin, glucovance and liraglutide) and erectile dysfunction medicine (such as sildenafil citrate). In some embodiments, supplement is selected from calcium, vitamin D, vitamin B3 and DHA. In some embodiments, lifestyle advice is selected from exercise, nutritional counseling, meditation, stress relief, weight loss or weight maintenance and improving sleep quality. In some embodiments, the therapeutic intervention for pre-eclampsia is selected from the therapeutic prevention (e.g., treatment or prevention) "WHO recommendations: Prevention and treatment of pre-eclampsia and eclampsia" as disclosed below, World Health Organization, ISBN 9789241548335, World Health Organization, 2011, which is incorporated herein by reference in its entirety. In some embodiments, the therapeutic intervention for pre-eclampsia is selected from the therapeutic prevention (e.g., treatment or prevention) "Summary of recommendations: Prevention and treatment of pre-eclampsia and eclampsia" as disclosed below, World Health Organization, WHO reference number WHO/RHR/11.30, World Health Organization, 2011, which is incorporated herein by reference in its entirety. In some embodiments, the therapeutic intervention for pre-eclampsia is selected from the therapeutic prevention (e.g., treatment or prevention) as disclosed in “WHO recommendations: Drug treatment for severe hypertension in pregnancy”, World Health Organization, ISBN 9789241550437, World Health Organization, 2018, which is incorporated herein by reference in its entirety.

在一些实施方案中,基于与早期胎盘形成步骤相关的先兆子痫分子亚型以及包含药物来选择治疗性干预,所述早期胎盘形成步骤负责调节血管舒张介质和抑制血管重塑、血小板聚集和血小板粘附,所述药物选自直接作用的血管舒张剂(肼屈嗪、米诺地尔、硝酸盐、硝普盐);钙通道阻滞剂(维拉帕米、地尔硫硝苯地平、氨氯地平);肾素-血管紧张素-醛固酮系统的拮抗剂(血管紧张素受体阻滞剂、血管紧张素转化酶抑制剂);β-2受体激动剂(沙丁胺醇、特布他林);突触后α-1受体拮抗剂(哌唑嗪、苯苄胺、酚妥拉明);中枢作用α-2受体激动剂(可乐定、α-甲基多巴);中枢作用α-2受体激动剂(可乐定、α-甲基多巴);中枢作用α-2受体激动剂(可乐定、α-甲基多巴)。In some embodiments, therapeutic intervention is selected based on the molecular subtype of pre-eclampsia associated with early placentation steps responsible for modulation of vasodilatory mediators and inhibition of vascular remodeling, platelet aggregation and platelet adhesion, selected from direct-acting vasodilators (hydralazine, minoxidil, nitrates, nitroprusside); calcium channel blockers (verapamil, diltiazem); nifedipine, amlodipine); antagonists of the renin-angiotensin-aldosterone system (angiotensin receptor blockers, angiotensin converting enzyme inhibitors); beta-2 receptor agonists (salbutamol, terbutaline); postsynaptic alpha-1 receptor antagonists (prazosin, benzylamine, phentolamine); centrally acting alpha-2 receptor agonists (clonidine, alpha-methyldopa); centrally acting alpha-2 receptor agonists (clonidine, alpha-methyldopa); centrally acting alpha-2 receptor agonists (clonidine, alpha-methyldopa).

在一些实施方案中,基于先兆子痫分子亚型以及包含药物来选择治疗性干预,所述先兆子痫分子亚型与和角质形成细胞内皮途径相关联的PE分子亚型相关,所述药物选自质子泵抑制剂(PPI):奥美拉唑、埃索美拉唑、泮托拉唑、雷贝拉唑或兰索拉唑。In some embodiments, therapeutic intervention is selected based on a pre-eclampsia molecular subtype associated with a PE molecular subtype associated with a keratinocyte endothelial pathway and includes a drug selected from a proton pump inhibitor (PPI): omeprazole, esomeprazole, pantoprazole, rabeprazole, or lansoprazole.

在一些实施方案中,健康或生理状况包括早产。在一些实施方案中,早产的治疗性干预包括药物、补充剂、生活方式建议、宫颈环扎、宫颈子宫托或电收缩抑制。在一些实施方案中,药物选自黄体酮、红霉素、宫缩抑制药物(诸如吲哚美辛)、皮质类固醇、阴道菌群(诸如克林霉素和甲硝唑)和抗氧化剂(诸如N-乙酰半胱氨酸)。在一些实施方案中,补充剂选自钙、维生素D和益生菌(诸如乳酸杆菌)。在一些实施方案中,生活方式建议选自运动、营养咨询、冥想、缓解压力、减肥或维持体重以及改善睡眠质量。在一些实施方案中,用于早产的治疗性干预选自如下所公开的治疗性干预(例如,治疗或预防)“WHO Recommendations onInterventions to Improve Preterm Birth Outcomes”ISBN 9789241508988,世界卫生组织,2015,其通过引用整体并入本文。In some embodiments, health or physiological conditions include premature birth. In some embodiments, therapeutic interventions for premature birth include drugs, supplements, lifestyle advice, cervical cerclage, cervical pessary or electrical contraction inhibition. In some embodiments, the drug is selected from progesterone, erythromycin, contraction inhibition drugs (such as indomethacin), corticosteroids, vaginal flora (such as clindamycin and metronidazole) and antioxidants (such as N-acetylcysteine). In some embodiments, supplements are selected from calcium, vitamin D and probiotics (such as lactobacilli). In some embodiments, lifestyle advice is selected from exercise, nutritional counseling, meditation, stress relief, weight loss or weight maintenance and improving sleep quality. In some embodiments, therapeutic interventions for premature birth are selected from therapeutic interventions (e.g., treatment or prevention) "WHO Recommendations on Interventions to Improve Preterm Birth Outcomes" ISBN 9789241508988, World Health Organization, 2015, which is incorporated herein by reference in its entirety.

在一些实施方案中,健康或生理状况包括妊娠期糖尿病(GDM)。在一些实施方案中,用于GDM的治疗性干预包括药物、补充剂或生活方式建议。在一些实施方案中,药物选自胰岛素和糖尿病药物(诸如肌醇、二甲双胍、glucovance和利拉鲁肽)。在一些实施方案中,补充剂选自维生素D、胆碱、益生菌和DHA。在一些实施方案中,生活方式建议选自运动、营养咨询、冥想、缓解压力、减肥或维持体重以及改善睡眠质量。在一些实施方案中,用于妊娠期糖尿病(GDM)的治疗性干预选自如下所公开的治疗性干预(例如,治疗或预防)“Diagnosticcriteria and classification of hyperglycaemia first detected in pregnancy”WHO参考号WHO/NMH/MND/13.2,世界卫生组织,2013,其通过引用整体并入本文。In some embodiments, health or physiological conditions include gestational diabetes (GDM). In some embodiments, therapeutic interventions for GDM include drugs, supplements or lifestyle advice. In some embodiments, drugs are selected from insulin and diabetes drugs (such as inositol, metformin, glucovance and liraglutide). In some embodiments, supplements are selected from vitamin D, choline, probiotics and DHA. In some embodiments, lifestyle advice is selected from exercise, nutritional counseling, meditation, stress relief, weight loss or maintenance and improving sleep quality. In some embodiments, therapeutic interventions for gestational diabetes (GDM) are selected from therapeutic interventions (e.g., treatment or prevention) disclosed below "Diagnostic criteria and classification of hyperglycaemia first detected in pregnancy" WHO reference number WHO/NMH/MND/13.2, World Health Organization, 2013, which is incorporated herein by reference in its entirety.

在一些实施方案中,基于妊娠期糖尿病(GDM)分子亚型以及包含药物来选择治疗性干预,所述妊娠期糖尿病分子亚型与和胎盘退化、胎盘功能不足、胎盘衰竭、胎盘功能障碍、过早老化、钙化相关联的GDM分子亚型相关,所述药物选自枸橼酸西地那非、tempol(超氧化物歧化酶歧化酶)、白藜芦醇、褪黑素、索法酮、他汀类药物、二甲双胍或[Leu27]胰岛素样生长因子-II(IGF-II)。In some embodiments, the therapeutic intervention is selected based on a gestational diabetes mellitus (GDM) molecular subtype associated with a GDM molecular subtype associated with placental involution, placental insufficiency, placental failure, placental dysfunction, premature aging, calcification, and comprises a drug selected from sildenafil citrate, tempol (superoxide dismutase dismutase), resveratrol, melatonin, sofalcone, statins, metformin, or [Leu27] insulin-like growth factor-II (IGF-II).

在一些实施方案中,基于妊娠期糖尿病(GDM)分子亚型以及包含药物来选择治疗性干预,所述妊娠期糖尿病分子亚型与和介导的高血糖记忆相关联的GDM分子亚型相关,所述药物选自普伐他汀、氨基胍、罗格列酮、葡萄籽原花青素提取物(GSPE)、橙皮苷、依帕司他、吡多胺、替米沙坦、二甲双胍和吡格列酮。In some embodiments, the therapeutic intervention is selected based on a gestational diabetes mellitus (GDM) molecular subtype associated with a GDM molecular subtype associated with mediated hyperglycemic memory and comprises a drug selected from pravastatin, aminoguanidine, rosiglitazone, grape seed proanthocyanidin extract (GSPE), hesperidin, epalrestat, pyridoxine, telmisartan, metformin, and pioglitazone.

在一些实施方案中,基于妊娠期糖尿病(GDM)分子亚型以及包含药物来选择治疗性干预,所述妊娠期糖尿病分子亚型与和适应性免疫系统和抗原特异性途径相关联的GDM分子亚型相关,所述药物选自硫唑嘌呤、吗替麦考酚酯、奥昔组单抗(otelixizumab)、替利组单抗(teplizumab)、GAD65、DiaPep277、抗-CD20 mAb、雷帕霉素/IL-2、磺酰脲类、二甲双胍、TZD、二肽基肽酶-4抑制剂、钠-葡萄糖协同转运蛋白2抑制剂、双醋瑞因、双水杨酸或GLP-1RA。In some embodiments, therapeutic intervention is selected based on a gestational diabetes mellitus (GDM) molecular subtype associated with a GDM molecular subtype associated with the adaptive immune system and antigen-specific pathways and comprises a drug selected from azathioprine, mycophenolate mofetil, otelixizumab, teplizumab, GAD65, DiaPep277, anti-CD20 mAb, rapamycin/IL-2, sulfonylureas, metformin, TZD, dipeptidyl peptidase-4 inhibitor, sodium-glucose co-transporter 2 inhibitor, diacerein, salicylic acid, or GLP-1RA.

本公开的另一方面提供了一种非暂时性计算机可读介质,包括机器可执行代码,其在由一个或多个计算机处理器执行时,实施本文上述或其他地方的任何方法。Another aspect of the present disclosure provides a non-transitory computer-readable medium comprising machine-executable code that, when executed by one or more computer processors, implements any of the methods described above or elsewhere herein.

本公开的另一方面提供了一种系统,该系统包括一个或多个计算机处理器和与其耦联的计算机存储器。计算机存储器包括机器可执行代码,其在由一个或多个计算机处理器执行时,实施本文上述或其他地方的任何方法。Another aspect of the present disclosure provides a system comprising one or more computer processors and a computer memory coupled thereto. The computer memory comprises machine executable code which, when executed by the one or more computer processors, implements any method described above or elsewhere herein.

从下面的详细描述中,本公开的附加方面和优点对于本领域技术人员来说将变得显而易见,其中仅示出和描述了本公开的说明性实施方案。如将认识到的,本公开能够有其他和不同的实施方案,并且其若干细节能够在各种明显的方面进行修改,所有这些都不脱离本公开。因此,附图和描述应在本质上被视为说明性的,而不是限制性的。Additional aspects and advantages of the present disclosure will become apparent to those skilled in the art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be appreciated, the present disclosure is capable of other and different embodiments, and its several details are capable of modification in various obvious respects, all without departing from the present disclosure. Therefore, the drawings and description should be regarded as illustrative in nature, and not restrictive.

援引并入Incorporation by reference

本说明书中提及的所有出版物、专利和专利申请均通过引用并入本文,如同每个单独的出版物、专利或专利申请被明确且单独地指出为通过引用并入的程度相同。如果通过引用并入的出版物和专利或专利申请与说明书中包含的公开内容相矛盾,则该说明书旨在取代和/或优先于任何此类矛盾材料。All publications, patents, and patent applications mentioned in this specification are incorporated herein by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. If publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

在所附权利要求中具体阐述了本发明的新颖特征。通过参考以下阐述说明性实施方案(其中利用了本发明的原理)的具体实施方式,以及附图(本文中也称为“图”,将获得对本发明的特征和优点的更好理解,其中:The novel features of the present invention are particularly set forth in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description which sets forth illustrative embodiments in which the principles of the present invention are utilized, and to the accompanying drawings (also referred to herein as "Figures"), in which:

图1示出了经编程或以其他方式配置成实施本文提供的方法的计算机系统。FIG. 1 illustrates a computer system programmed or otherwise configured to implement the methods provided herein.

图2A-图2C显示了使用通过基因发现鉴定的差异表达基因的预测模型的受试者工作特征(ROC)曲线的示例(图2A),其中模型通过留一交叉验证(LOOCV)来验证,包括在位点1的交叉验证,ROC曲线的平均曲线下面积(AUC)为0.77(图2B)以及在位点2、3和4的测试,平均AUC为0.79(图2C)。Figures 2A-2C show examples of receiver operating characteristic (ROC) curves for a prediction model using differentially expressed genes identified by gene discovery (Figure 2A), where the model was validated by leave-one-out cross-validation (LOOCV), including cross-validation at site 1 with an average area under the curve (AUC) of 0.77 for the ROC curve (Figure 2B) and testing at sites 2, 3, and 4 with an average AUC of 0.79 (Figure 2C).

图2D显示了鉴定在对于基因本体论经由基因集富集分析的排名靠前的早产标志物基因中的途径以及在来自早产分娩的母亲的样品中含胶原细胞外基质的途径的示例。FIG. 2D shows examples of pathways identified in the top ranked preterm birth marker genes via gene set enrichment analysis for Gene Ontology and collagen extracellular matrix-containing pathways in samples from mothers who delivered preterm.

图3显示了鉴定在对于基因本体论经由基因集富集分析的晚期流产或早期早产中的途径以及在来自25周胎龄之前分娩的母亲的样品中与基膜和内质网腔相关的基因集的途径的示例。FIG3 shows examples of pathways identified in late miscarriage or early preterm birth via gene set enrichment analysis for Gene Ontology and gene sets associated with basement membrane and endoplasmic reticulum lumen in samples from mothers who delivered before 25 weeks gestational age.

图4显示了242个收集的样品在样品收集时的胎龄,以及在分娩时的胎龄的分布。FIG4 shows the distribution of gestational age at sample collection and gestational age at delivery for the 242 collected samples.

图5A显示了sPTB(分娩<35周的GA)的受试者工作特征(ROC)曲线。FIG5A shows the receiver operating characteristic (ROC) curve for sPTB (delivery <35 weeks GA).

图5B显示了通过sPTB模型(分娩<35周的GA)分配至每个样品的概率。0值对应于样品被分类为sPTB病例的概率为0%,并且1值对应于样品被分类为sPTB病例的概率为100%。样品由每个真标签分开(病例为橙色,对照为蓝色)。Figure 5B shows the probability assigned to each sample by the sPTB model (delivery <35 weeks GA). A value of 0 corresponds to a 0% probability that the sample is classified as an sPTB case, and a value of 1 corresponds to a 100% probability that the sample is classified as an sPTB case. Samples are separated by each true label (orange for cases and blue for controls).

图5C显示了sPTB(分娩<25周的GA)的受试者工作特征(ROC)曲线。FIG5C shows the receiver operating characteristic (ROC) curve for sPTB (delivery <25 weeks GA).

图5D显示了通过sPTB模型(分娩<25周的GA)分配至每个样品的概率。0值对应于样品被分类为sPTB病例的概率为0%,并且1值对应于样品被分类为sPTB病例的概率为100%。样品由每个真标签分开(病例为橙色,对照为蓝色)。Figure 5D shows the probability assigned to each sample by the sPTB model (delivery <25 weeks GA). A value of 0 corresponds to a 0% probability that the sample is classified as an sPTB case, and a value of 1 corresponds to a 100% probability that the sample is classified as an sPTB case. Samples are separated by each true label (orange for cases and blue for controls).

图6显示了先兆子痫观察队列的人口统计学和临床数据指标,包括2701名健康参与者和335名诊断为先兆子痫的参与者。Figure 6 shows the demographic and clinical data of the preeclampsia observational cohort, which included 2701 healthy participants and 335 participants diagnosed with preeclampsia.

图7显示了具有38周之前分娩的诊断为先兆子痫的对象的截止值的队列的人口统计学和临床数据,包括2690名健康参与者和199名被诊断为先兆子痫并在38周胎龄之前分娩的参与者。Figure 7 shows demographic and clinical data for the cohort with a cutoff value for subjects diagnosed with pre-eclampsia who delivered before 38 weeks of gestational age, including 2690 healthy participants and 199 participants who were diagnosed with pre-eclampsia and delivered before 38 weeks of gestational age.

图8显示了具有37周之前分娩的诊断为先兆子痫的对象的截止值的队列的人口统计学和临床数据,包括2780名健康参与者和109名被诊断为先兆子痫并在37周胎龄之前分娩的参与者。Figure 8 shows demographic and clinical data for the cohort with a cutoff value for subjects diagnosed with pre-eclampsia who delivered before 37 weeks of gestational age, including 2780 healthy participants and 109 participants who were diagnosed with pre-eclampsia and delivered before 37 weeks of gestational age.

图9A显示了无细胞RNA(cfRNA)下一代测序(NGS)数据的系统性方差的示例,该数据与在与免疫细胞基因相反的方向上运动的妊娠相关联基因和胎盘相关联基因的季节变化相关联。Figure 9A shows an example of systematic variance in cell-free RNA (cfRNA) next generation sequencing (NGS) data associated with seasonal changes in pregnancy-associated genes and placenta-associated genes moving in the opposite direction to immune cell genes.

图9B显示了由基因建模预测的局部外部温度与由靠近血液采集地点的气象站记录的实际温度之间的高相关性的示例。FIG9B shows an example of a high correlation between the local external temperature predicted by genetic modeling and the actual temperature recorded by a weather station close to the blood collection site.

图10A显示了与血液抽取/采集时间相关联的cfRNA NGS数据的系统性方差的示例。Figure 10A shows an example of systematic variance in cfRNA NGS data associated with blood draw/collection time.

图10B显示了与生物钟相关联的清晨组和下午组之间差异表达的分位数-分位数(QQ)图。FIG. 10B shows a quantile-quantile (QQ) plot of differential expression between the morning group and the afternoon group associated with the circadian clock.

图11A显示了通过确定独立性筛选(Sure Independence Screening)在重复交叉验证中以log2每百万计数(CPM)读数空间的基因发现率。FIG. 11A shows the gene discovery rate in log2 counts per million (CPM) read space in replicated cross validation by Sure Independence Screening.

图11B显示了通过确定独立性筛选在重复交叉验证中以log2原始计数读数空间的基因发现率。Figure 11B shows the gene discovery rate in log2 raw count read space in repeated cross validation by determining independence filtering.

图12A显示了对于来自具有临床因素的下采样计数的在小于38周分娩的早产先兆子痫病例,通过在多个空间中的校正和按照Akaike信息准则(AIC)的阈值发现的特征。FIG. 12A shows features found by correction in multiple spaces and thresholding according to the Akaike Information Criterion (AIC) for preterm pre-eclampsia cases delivered at less than 38 weeks from downsampled counts with clinical factors.

图12B显示了对于具有体重指数(BMI)和血压(BP)作为仅有的临床因素的在小于37周分娩的早产先兆子痫病例,通过在多个空间中的校正和按照AIC的阈值发现的特征的示例。12B shows an example of features found by correction in multiple spaces and by thresholding of AIC for preterm pre-eclampsia cases delivered at less than 37 weeks with body mass index (BMI) and blood pressure (BP) as the only clinical factors.

图13显示了对于预测在胎龄小于37周分娩的早产先兆子痫病例的模型,ROC曲线值平均为0.83的曲线下面积(AUC)的示例。FIG. 13 shows an example of an area under the curve (AUC) with an average ROC curve value of 0.83 for a model predicting cases of preterm preeclampsia who deliver at a gestational age of less than 37 weeks.

图14显示了在35周胎龄之前分娩的自发性早产病例中差异表达基因的Spearman和DESeq2差异基因表达分析的分位数-分位数(QQ)图。Spearman排位差异基因表达的QQ图位于左侧,且DESeq2差异基因表达的QQ图位于右侧。Figure 14 shows quantile-quantile (QQ) plots of Spearman and DESeq2 differential gene expression analysis of differentially expressed genes in spontaneous preterm birth cases delivered before gestational age of 35 weeks. The QQ plot of Spearman ranked differential gene expression is on the left, and the QQ plot of DESeq2 differential gene expression is on the right.

图15显示了在37周胎龄之前分娩的自发性早产病例中差异表达基因的DESeq2差异基因表达分析的分位数-分位数(QQ)图。Spearman排位差异基因表达的QQ图位于左侧,且DESeq2差异基因表达的QQ图位于右侧。Figure 15 shows a quantile-quantile (QQ) plot of the DESeq2 differential gene expression analysis of differentially expressed genes in spontaneous preterm birth cases delivered before gestational age of 37 weeks. The QQ plot of the Spearman ranked differential gene expression is on the left, and the QQ plot of the DESeq2 differential gene expression is on the right.

具体实施方式DETAILED DESCRIPTION

虽然已在本文显示和描述了本发明的各种实施方案,但对于本领域技术人员来说,显而易见的是,这些实施方案仅作为实例提供。在不脱离本发明的情况下,本领域技术人员可以想到许多变化、改变和替换。应当理解,可以采用本文描述的本发明实施方案的各种替代物。Although various embodiments of the present invention have been shown and described herein, it will be apparent to those skilled in the art that these embodiments are provided as examples only. Without departing from the present invention, those skilled in the art may conceive of many variations, changes and substitutions. It should be understood that various alternatives to the embodiments of the present invention described herein may be employed.

如本说明书和权利要求所用,单数形式“一个”、“一种”和“该”包括复数指代,除非上下文另有明确规定。例如,术语“核酸”包括多种核酸,包括其混合物。As used in this specification and claims, the singular forms "a", "an", and "the" include plural referents unless the context clearly dictates otherwise. For example, the term "nucleic acid" includes a plurality of nucleic acids, including mixtures thereof.

如本文所用,术语“对象”通常是指具有可测试或可检测遗传信息的实体或介质。对象可以是人、个体或患者。对象可以是脊椎动物,例如哺乳动物。哺乳动物的非限制性实例包括人类、猿类、农场动物、运动动物、啮齿动物和宠物。对象可以是怀孕的女性对象。对象可以是有胎儿(或多胎)或怀疑有胎儿(或多胎)的女性。对象可以是怀孕或怀疑怀孕的人。对象可以表现出表明对象的健康或生理状态或状况的症状,诸如对象的妊娠相关健康或生理状态或状况。替代性地,对象在此种健康或生理状态或病症方面可以是无症状的。As used herein, the term "subject" generally refers to an entity or medium with testable or detectable genetic information. The object can be a person, an individual, or a patient. The object can be a vertebrate, such as a mammal. Non-limiting examples of mammals include humans, apes, farm animals, sports animals, rodents, and pets. The object can be a pregnant female object. The object can be a woman with a fetus (or multiple births) or suspected of having a fetus (or multiple births). The object can be a person who is pregnant or suspected of being pregnant. The object can show symptoms indicating the health or physiological state or condition of the object, such as the pregnancy-related health or physiological state or condition of the object. Alternatively, the object can be asymptomatic in terms of such health or physiological state or illness.

如本文所用,术语“妊娠相关状态”通常是指怀孕或怀疑怀孕的对象或对象的胎儿(或多胎)的任何健康、生理和/或生化状态或状况。妊娠相关状态的实例包括但不限于:早产、足月产、胎龄、预产期、分娩发作、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压疾病)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促),以及胎儿发育阶段或状态(例如,正常胎儿器官功能或发育和异常胎儿器官功能或发育)。例如,胎儿发育阶段或状态可以与胎儿器官的正常胎儿器官功能或发育和/或异常胎儿器官功能或发育有关,该胎儿器官选自心脏、大肠、小肠、视网膜、前额叶皮层、中脑、肾脏和食道。在某些情况下,妊娠相关状态与对象的胎儿(或多胎)的健康或生理状态或状况无关。As used herein, the term "pregnancy-related state" generally refers to any health, physiological and/or biochemical state or condition of a subject who is pregnant or suspected of being pregnant, or of a fetus (or multiple fetuses) of the subject. Examples of pregnancy-related states include, but are not limited to, preterm birth, term birth, gestational age, due date, onset of labor, pregnancy-related hypertensive disorders (e.g., preeclampsia), eclampsia, gestational diabetes, congenital disorders of a subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia, and hypertensive disorders), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine/fetal growth restriction, macrosomia (for fetuses with The invention relates to a fetus that is large for its age, a neonatal condition (e.g., anemia, apnea, bradycardia and other cardiac defects, bronchopulmonary dysplasia or chronic lung disease, diabetes, gastroschisis, hydrocephalus, hyperbilirubinemia, hypocalcemia, hypoglycemia, intraventricular hemorrhage, jaundice, necrotizing enterocolitis, patent ductus arteriosus, periventricular leukomalacia, persistent pulmonary hypertension, polycythemia, respiratory distress syndrome, retinopathy of prematurity, and transient tachypnea), and a fetal developmental stage or state (e.g., normal fetal organ function or development and abnormal fetal organ function or development). For example, the fetal developmental stage or state can be associated with normal fetal organ function or development and/or abnormal fetal organ function or development of a fetal organ selected from the group consisting of heart, large intestine, small intestine, retina, prefrontal cortex, midbrain, kidney, and esophagus. In some cases, the pregnancy-related condition is not related to the health or physiological state or condition of the subject's fetus (or fetuses).

如本文所用,术语“样品”通常是指获得自或来源于一个或多个对象的生物样品。生物样品可以是无细胞生物样品或基本无细胞生物样品,或者可以被加工或分级分离以产生无细胞生物样品。例如,无细胞生物样品可以包括无细胞核糖核酸(cfRNA)、无细胞脱氧核糖核酸(cfDNA)、无细胞胎儿DNA(cffDNA)、血浆、血清、尿液、唾液、羊水及其衍生物。无细胞生物样品可以使用乙二胺四乙酸(EDTA)收集管、无细胞RNA收集管(例如,Streck)或无细胞DNA收集管(例如,Streck)获得自或来源于对象。无细胞生物样品可以通过分级分离来源于全血样品。生物样品或其衍生物可以含有细胞。例如,生物样品可以是血液样品或其衍生物(例如,通过收集管或滴血液收集的血液)、阴道样品(例如,阴道拭子)或宫颈样品(例如,宫颈拭子)。As used herein, the term "sample" generally refers to a biological sample obtained from or derived from one or more objects. A biological sample can be a cell-free biological sample or a substantially cell-free biological sample, or can be processed or fractionated to produce a cell-free biological sample. For example, a cell-free biological sample can include cell-free ribonucleic acid (cfRNA), cell-free deoxyribonucleic acid (cfDNA), cell-free fetal DNA (cffDNA), plasma, serum, urine, saliva, amniotic fluid and derivatives thereof. A cell-free biological sample can be obtained from or derived from an object using ethylenediaminetetraacetic acid (EDTA) collection tubes, cell-free RNA collection tubes (e.g., Streck) or cell-free DNA collection tubes (e.g., Streck). A cell-free biological sample can be derived from a whole blood sample by fractionation. A biological sample or its derivatives can contain cells. For example, a biological sample can be a blood sample or its derivatives (e.g., blood collected by a collection tube or dripping blood), a vaginal sample (e.g., a vaginal swab) or a cervical sample (e.g., a cervical swab).

如本文所用,术语“核酸”通常是指任何长度的核苷酸的聚合形式,脱氧核糖核苷酸(dNTP)或核糖核苷酸(rNTP),或其类似物。核酸可以具有任何三维结构,并且可以执行任何已知或未知的功能。核酸的非限制性实例包括脱氧核糖核酸(DNA)、核糖核酸(RNA)、基因或基因片段的编码或非编码区、由连锁分析定义的基因座、外显子、内含子、信使RNA(mRNA)、转移RNA、核糖体RNA、短干扰RNA(siRNA)、短发夹RNA(shRNA)、微RNA(miRNA)、核酶、cDNA、重组核酸、支链核酸、质粒、载体、任何序列的分离DNA、任何序列的分离RNA、核酸探针和引物。核酸可以包括一种或多种经修饰核苷酸,诸如甲基化的核苷酸和核苷酸类似物。如果存在,可以在核酸组装之前或之后对核苷酸结构进行修饰。核酸的核苷酸序列可以被非核苷酸组分中断。核酸在聚合后可以进一步被修饰,诸如通过与报告试剂(report agent)缀合或结合。As used herein, the term "nucleic acid" generally refers to a polymeric form of nucleotides of any length, deoxyribonucleotides (dNTPs) or ribonucleotides (rNTPs), or analogs thereof. Nucleic acids can have any three-dimensional structure and can perform any known or unknown function. Non-limiting examples of nucleic acids include deoxyribonucleic acid (DNA), ribonucleic acid (RNA), coding or non-coding regions of genes or gene fragments, loci defined by linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short hairpin RNA (shRNA), microRNA (miRNA), ribozymes, cDNA, recombinant nucleic acids, branched nucleic acids, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes and primers. Nucleic acids can include one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, the nucleotide structure can be modified before or after nucleic acid assembly. The nucleotide sequence of nucleic acids can be interrupted by non-nucleotide components. Nucleic acids can be further modified after polymerization, such as by conjugating or combining with a reporter agent.

如本文所用,术语“靶核酸”通常是指在具有核苷酸序列的起始核酸分子群中的核酸分子,其存在、量和/或序列,或这些中的一个或多个的变化,是希望确定的。靶核酸可以是任何类型的核酸,包括DNA、RNA及其类似物。如本文所用,“靶核糖核酸(RNA)”通常是指即RNA的靶核酸。如本文所用,“靶脱氧核糖核酸(DNA)”通常是指即DNA的靶核酸。As used herein, the term "target nucleic acid" generally refers to a nucleic acid molecule in a population of starting nucleic acid molecules having a nucleotide sequence, the presence, amount and/or sequence of which, or a change in one or more of these, is desired to be determined. The target nucleic acid can be any type of nucleic acid, including DNA, RNA and analogs thereof. As used herein, "target ribonucleic acid (RNA)" generally refers to a target nucleic acid, i.e., RNA. As used herein, "target deoxyribonucleic acid (DNA)" generally refers to a target nucleic acid, i.e., DNA.

如本文所用,术语“扩增”通常是指增加核酸分子的大小或数量。核酸分子可以是单链或双链的。扩增可以包括产生核酸分子的一个或多个拷贝或“扩增产物”。扩增可以例如通过延伸(例如,引物延伸)或连接进行。扩增可以包括进行引物延伸反应以产生与单链核酸分子互补的链,并且在某些情况下产生该链和/或单链核酸分子的一个或多个拷贝。术语“DNA扩增”通常是指产生DNA分子或“扩增的DNA产物”的一个或多个拷贝。术语“逆转录扩增”通常是指通过逆转录酶的作用从核糖核酸(RNA)模板中产生脱氧核糖核酸(DNA)。As used herein, the term "amplification" generally refers to increasing the size or quantity of a nucleic acid molecule. A nucleic acid molecule can be single-stranded or double-stranded. Amplification can include generating one or more copies or "amplification products" of a nucleic acid molecule. Amplification can be performed, for example, by extension (e.g., primer extension) or connection. Amplification can include performing a primer extension reaction to generate a strand complementary to a single-stranded nucleic acid molecule, and in some cases generating one or more copies of the strand and/or single-stranded nucleic acid molecule. The term "DNA amplification" generally refers to generating one or more copies of a DNA molecule or "amplified DNA product". The term "reverse transcription amplification" generally refers to generating deoxyribonucleic acid (DNA) from a ribonucleic acid (RNA) template by the action of a reverse transcriptase.

每年,全球报告约1500万例早产。早产可以影响多达约10%的妊娠,其中大多数是自发性早产。目前,可能缺乏可用于许多妊娠相关并发症(诸如早产)的有意义的、临床上可行的诊断筛查或测试。然而,早产等妊娠相关并发症是新生儿死亡和以后生活中并发症的主要原因。此外,此类妊娠相关并发症会对母体健康造成负面的健康影响。因此,为了使妊娠尽可能安全,需要快速、准确的方法来鉴定和监测妊娠相关的状态,这些方法是无创的并且有成本效益的,以改善母婴健康。Every year, about 15 million premature births are reported worldwide. Premature births can affect up to about 10% of pregnancies, most of which are spontaneous premature births. At present, there may be a lack of meaningful, clinically feasible diagnostic screening or testing that can be used for many pregnancy-related complications (such as premature birth). However, pregnancy-related complications such as premature birth are the main causes of neonatal death and complications in later life. In addition, such pregnancy-related complications can cause negative health effects on maternal health. Therefore, in order to make pregnancy as safe as possible, it is necessary to identify and monitor the state related to pregnancy quickly and accurately, and these methods are non-invasive and cost-effective, to improve maternal and child health.

目前的产前检查可能是难以达到且不完整的。对于妊娠进展无妊娠相关并发症的病例,妊娠对象可以使用有限的妊娠监测方法,诸如分子检测、超声成像以及使用末次月经期估计胎龄和/或预产期。但是,此类监测方法可能是复杂、昂贵且不可靠的。例如,分子检测无法预测胎龄,超声成像价格昂贵并且最好在妊娠早期进行,使用末次月经期估计胎龄和/或预产期可能是不可靠的。此外,对于妊娠进展并伴有妊娠相关并发症(诸如自发性早产风险)的病例,分子检测、超声成像和人口统计学因素的临床效用可能是受限的。例如,分子检测可能具有受限的BMI(体重指数)范围、受限的胎龄和/或预产期范围(约2周)以及低阳性预测值(PPV);超声成像可能是昂贵的并且具有低PPV和特异性;使用人口统计学因素来预测妊娠相关并发症的风险可能是不可靠的。因此,临床上迫切需要用于检测和监测妊娠相关状态(例如,估计胎龄、预产期和/或分娩发作,以及预测早产等妊娠相关并发症)的准确且可负担的非侵入性诊断方法,以实现临床上可行的结果。Current prenatal examinations may be difficult to reach and incomplete. For cases where pregnancy progresses without pregnancy-related complications, pregnant subjects can use limited pregnancy monitoring methods, such as molecular testing, ultrasound imaging, and using the last menstrual period to estimate gestational age and/or expected date of delivery. However, such monitoring methods may be complex, expensive, and unreliable. For example, molecular testing cannot predict gestational age, ultrasound imaging is expensive and is best performed in early pregnancy, and using the last menstrual period to estimate gestational age and/or expected date of delivery may be unreliable. In addition, for cases where pregnancy progresses and is accompanied by pregnancy-related complications (such as spontaneous preterm birth risk), the clinical utility of molecular testing, ultrasound imaging, and demographic factors may be limited. For example, molecular testing may have a limited BMI (body mass index) range, a limited gestational age and/or expected date of delivery range (about 2 weeks), and a low positive predictive value (PPV); ultrasound imaging may be expensive and have low PPV and specificity; using demographic factors to predict the risk of pregnancy-related complications may be unreliable. Therefore, there is an urgent need for accurate and affordable non-invasive diagnostic methods for detecting and monitoring pregnancy-related states (e.g., estimating gestational age, due date and/or onset of labor, and predicting pregnancy-related complications such as preterm birth) to achieve clinically feasible outcomes.

本公开提供了通过处理获得自或来源于对象(例如,妊娠女性对象)的无细胞生物样品来鉴定或监测妊娠相关状态的方法、系统和试剂盒。可以分析从对象获得的无细胞生物样品(例如,血浆样品)以鉴定妊娠相关状态(其中可以包括,例如,测量妊娠相关状态的存在、不存在或定量评估(例如,风险))。此类对象可以包括具有一种或多种妊娠相关状态的对象和没有妊娠相关状态的对象。妊娠相关状态可包括,例如,早产、足月产、胎龄、预产期、分娩发作、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压疾病)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限和巨大儿(对于胎龄而言较大的胎儿)。在一些实施方案中,妊娠相关状态与胎儿的健康无关。在一些实施方案中,妊娠相关状态包括新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促)以及胎儿发育阶段或状态(例如,正常的胎儿器官功能或发育和异常的胎儿器官功能或发育)。例如,胎儿发育阶段或状态可以与胎儿器官的正常胎儿器官功能或发育和/或异常胎儿器官功能或发育有关,该胎儿器官选自心脏、大肠、小肠、视网膜、前额叶皮层、中脑、肾脏和食道。The present disclosure provides methods, systems and kits for identifying or monitoring pregnancy-related states by processing acellular biological samples obtained from or derived from an object (e.g., a pregnant female object). Acellular biological samples (e.g., plasma samples) obtained from an object can be analyzed to identify pregnancy-related states (which can include, for example, measuring the presence, absence or quantitative assessment (e.g., risk) of pregnancy-related states). Such objects can include objects with one or more pregnancy-related states and objects without pregnancy-related states. Pregnancy-related states can include, for example, premature birth, full-term birth, gestational age, expected date of delivery, onset of labor, pregnancy-related hypertension disorders (e.g., pre-eclampsia), eclampsia, gestational diabetes, congenital disorders of the fetus of the object, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia and hypertensive disease), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during delivery, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine/fetal growth restriction and macrosomia (large fetus for gestational age). In some embodiments, pregnancy-related states are not related to the health of the fetus. In some embodiments, pregnancy-related states include neonatal conditions (e.g., anemia, apnea, bradycardia and other heart defects, bronchopulmonary dysplasia or chronic lung disease, diabetes, gastroschisis, hydrocephalus, hyperbilirubinemia, hypocalcemia, hypoglycemia, intraventricular hemorrhage, jaundice, necrotizing enterocolitis, patent ductus arteriosus, periventricular leukomalacia, persistent pulmonary hypertension, polycythemia, respiratory distress syndrome, retinopathy of prematurity, and transient tachypnea) and fetal development stage or state (e.g., normal fetal organ function or development and abnormal fetal organ function or development). For example, the fetal development stage or state can be related to normal fetal organ function or development and/or abnormal fetal organ function or development of a fetal organ, and the fetal organ is selected from the heart, large intestine, small intestine, retina, prefrontal cortex, midbrain, kidney, and esophagus.

测定无细胞生物样品Assays for cell-free biological samples

无细胞生物样品可以获得自或来源于人对象(例如,怀孕的女性对象)。无细胞生物样品在处理前可以储存在各种储存条件下,诸如不同的温度(例如,在室温下,在冷藏或冷冻条件下,在25℃、在4℃、在-18℃、-20℃或-80℃)或不同的悬浮液(例如,EDTA收集管、无细胞RNA收集管或无细胞DNA收集管)。The cell-free biological sample can be obtained from or derived from a human subject (e.g., a pregnant female subject). The cell-free biological sample can be stored under various storage conditions before processing, such as different temperatures (e.g., at room temperature, under refrigerated or frozen conditions, at 25°C, at 4°C, at -18°C, -20°C, or -80°C) or different suspensions (e.g., EDTA collection tubes, cell-free RNA collection tubes, or cell-free DNA collection tubes).

无细胞生物样品可以获得自具有妊娠相关状态(例如,妊娠相关并发症)的对象,疑似具有妊娠相关状态(例如,妊娠相关并发症)的对象、或不具有或非疑似具有妊娠相关状态(例如,妊娠相关并发症)的对象。妊娠相关状态可以包括妊娠相关并发症,诸如早产、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压疾病)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促),以及异常胎儿发育阶段或状态(例如,异常胎儿器官功能或发育)。妊娠相关状态可以包括:足月产、正常胎儿发育阶段或状态(例如,正常胎儿器官功能或发育)、或不存在妊娠相关并发症(例如,早产、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压疾病)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促),以及异常胎儿发育阶段或状态异常(例如,异常胎儿器官功能或发育))。妊娠相关状态可以包括妊娠的定量评估,诸如胎龄(例如,以天、周或月为单位测量)或预产期(例如,表示为预测或估计的日历日期或日历日期的范围)。妊娠相关状态可以包括妊娠相关并发症的定量评估,诸如妊娠相关并发症(例如,早产、分娩发作、妊娠相关高血压病症(例如,先兆子痫)、子痫、妊娠期糖尿病、对象胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症(例如,产后抑郁、出血或流血过多、肺栓塞、心肌病、糖尿病、贫血和高血压疾病)、妊娠剧吐(晨吐)、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘(胎盘覆盖子宫颈)、宫内/胎儿生长受限、巨大儿(对于胎龄而言较大的胎儿)、新生儿状况(例如,贫血、呼吸暂停、心动过缓和其他心脏缺陷、支气管肺发育不良或慢性肺病、糖尿病、腹裂、脑积水、高胆红素血症、低钙血症、低血糖、脑室内出血、黄疸、坏死性小肠结肠炎、动脉导管未闭、脑室周围白质软化、持续性肺动脉高压、红细胞增多症、呼吸窘迫综合征、早产儿视网膜病变和暂时性呼吸急促),以及异常胎儿发育阶段或状态(例如,异常胎儿器官功能或发育)的似然性、易感性或风险(例如,表示为概率、相对概率、比值比或风险评分或风险指数)。例如,妊娠相关状态可以包括未来分娩发动的似然性或易感性(例如,在约1小时、约2小时、约4小时、约6小时、约8小时、约10小时、约12小时、约14小时、约16小时、约18小时、约20小时、约22小时、约24小时、约1.5天、约2天、约2.5天、约3天、约3.5天、约4天、约4.5天、约5天、约5.5天、约6天、约6.5天、约7天、约8天、约9天、约10天、约12天、约14天、约3周、约4周、约5周、约6周、约7周、约8周、约9周、约10周、约11周、约12周、约13周或大于约13周)。例如,胎儿发育阶段或状态可以与胎儿器官的正常胎儿器官功能或发育和/或异常胎儿器官功能或发育有关,该胎儿器官选自心脏、大肠、小肠、视网膜、前额叶皮层、中脑、肾脏和食道。The cell-free biological sample can be obtained from a subject having a pregnancy-related condition (e.g., a pregnancy-related complication), a subject suspected of having a pregnancy-related condition (e.g., a pregnancy-related complication), or a subject not having or not suspected of having a pregnancy-related condition (e.g., a pregnancy-related complication). Pregnancy-related conditions can include pregnancy-related complications such as premature birth, pregnancy-related hypertensive disorders (e.g., preeclampsia), eclampsia, gestational diabetes, congenital disorders of the subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia, and hypertensive disorders), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine/fetal growth restriction, macrosomia (for gestational age), larger fetus), neonatal conditions (e.g., anemia, apnea, bradycardia and other cardiac defects, bronchopulmonary dysplasia or chronic lung disease, diabetes mellitus, gastroschisis, hydrocephalus, hyperbilirubinemia, hypocalcemia, hypoglycemia, intraventricular hemorrhage, jaundice, necrotizing enterocolitis, patent ductus arteriosus, periventricular leukomalacia, persistent pulmonary hypertension, polycythemia, respiratory distress syndrome, retinopathy of prematurity, and transient tachypnea), and abnormal fetal developmental stage or state (e.g., abnormal fetal organ function or development). Pregnancy-related states may include: full-term birth, normal fetal development stage or state (e.g., normal fetal organ function or development), or the absence of pregnancy-related complications (e.g., premature birth, pregnancy-related hypertensive disorders (e.g., preeclampsia), eclampsia, gestational diabetes, congenital disorders of the subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia, and hypertensive disorders), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), uterine congestion, Endonatal/fetal growth restriction, macrosomia (a fetus that is large for gestational age), neonatal conditions (e.g., anemia, apnea, bradycardia and other cardiac defects, bronchopulmonary dysplasia or chronic lung disease, diabetes, gastroschisis, hydrocephalus, hyperbilirubinemia, hypocalcemia, hypoglycemia, intraventricular hemorrhage, jaundice, necrotizing enterocolitis, patent ductus arteriosus, periventricular leukomalacia, persistent pulmonary hypertension, polycythemia, respiratory distress syndrome, retinopathy of prematurity, and transient tachypnea), and abnormal fetal developmental stage or state (e.g., abnormal fetal organ function or development). Pregnancy-related status can include a quantitative assessment of pregnancy, such as gestational age (e.g., measured in days, weeks, or months) or due date (e.g., expressed as a predicted or estimated calendar date or range of calendar dates). Pregnancy-related status can include quantitative assessments of pregnancy-related complications, such as pregnancy-related complications (e.g., preterm labor, onset of labor, pregnancy-related hypertensive disorders (e.g., preeclampsia), eclampsia, gestational diabetes, congenital disorders of the subject's fetus, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications (e.g., postpartum depression, hemorrhage or excessive bleeding, pulmonary embolism, cardiomyopathy, diabetes, anemia, and hypertensive disorders), hyperemesis gravidarum (morning sickness), hemorrhage or excessive bleeding during labor, premature rupture of membranes, premature premature rupture of membranes, placenta previa (placenta covering the cervix), intrauterine fetal growth restriction, macrosomia (a fetus that is large for gestational age), neonatal conditions (e.g., anemia, apnea, bradycardia and other cardiac defects, bronchopulmonary dysplasia or chronic lung disease, diabetes, gastroschisis, hydrocephalus, hyperbilirubinemia, hypocalcemia, hypoglycemia, intraventricular hemorrhage, jaundice, necrotizing enterocolitis, patent ductus arteriosus, periventricular leukomalacia, persistent pulmonary hypertension, polycythemia, respiratory distress syndrome, retinopathy of prematurity, and transient tachypnea), and abnormal fetal developmental stage or state (e.g., abnormal The likelihood, susceptibility, or risk (e.g., expressed as a probability, relative probability, odds ratio, or risk score or risk index) of a future labor onset (e.g., within about 1 hour, about 2 hours, about 4 hours, about 6 hours, about 8 hours, about 10 hours, about 12 hours, about 14 hours, about 16 hours, about 18 hours, about 20 hours, about 22 hours, about 24 hours, about 1.5 days, about 2 days, about 2.5 days, about 3 days, about 3.5 days, about 4 days, about 4 For example, the fetal development stage or state can be associated with normal fetal organ function or development and/or abnormal fetal organ function or development of a fetal organ selected from the group consisting of heart, large intestine, small intestine, retina, prefrontal cortex, midbrain, kidney, and esophagus.

可以在治疗具有妊娠相关并发症的对象之前和/或之后收集无细胞生物样品。可以在治疗或治疗方案期间从对象获得无细胞生物样品。可以从对象获得多个无细胞生物样品,以监测治疗随时间推移的效果。无细胞生物样品可以取自已知或疑似具有妊娠相关状态(例如,妊娠相关并发症)的对象,其通过临床试验无法获得明确的阳性或阴性诊断。样品可以取自疑似具有妊娠相关并发症的对象。无细胞生物样品可以取自经历不明症状(诸如疲劳、恶心、体重减轻、疼痛、虚弱或出血)的对象。无细胞生物样品可以取自患有已解释症状的对象。无细胞生物样品可以取自由于以下因素而有发生妊娠相关并发症风险的对象:诸如家族史、年龄、高血压或高血压前期、糖尿病或糖尿病前期、超重或肥胖、环境暴露、生活方式风险因素(例如,吸烟、饮酒或吸毒)或存在其他风险因素。Cell-free biological samples can be collected before and/or after treating an object with pregnancy-related complications. Cell-free biological samples can be obtained from an object during treatment or a treatment regimen. Multiple cell-free biological samples can be obtained from an object to monitor the effect of treatment over time. Cell-free biological samples can be taken from an object known or suspected to have a pregnancy-related state (e.g., pregnancy-related complications), which cannot obtain a clear positive or negative diagnosis through clinical trials. Samples can be taken from an object suspected of having pregnancy-related complications. Cell-free biological samples can be taken from an object experiencing unexplained symptoms (such as fatigue, nausea, weight loss, pain, weakness or bleeding). Cell-free biological samples can be taken from an object with explained symptoms. Cell-free biological samples can be taken from an object with a risk of pregnancy-related complications due to the following factors: such as family history, age, hypertension or prehypertension, diabetes or prediabetes, overweight or obesity, environmental exposure, lifestyle risk factors (e.g., smoking, drinking or drug abuse) or the presence of other risk factors.

无细胞生物样品可以含有一种或多种能够被测定的分析物,诸如适于测定以生成转录组学数据的无细胞核糖核酸(cfRNA)分子、使用来源于无细胞生物样品的转录产物(例如,信使RNA、转移RNA或核糖体RNA)以生成转录产物数据、适于测定以生成基因组数据和/或甲基化数据的无细胞脱氧核糖核酸(cfDNA)分子、适于测定以生成蛋白质组学数据的蛋白质(例如,对应于妊娠相关联基因组位点或基因的妊娠相关蛋白质)、适于测定以生成代谢组学数据的代谢物,或其混合物或组合。可以从对象的一个或多个无细胞生物样品中分离或提取一种或多种此类分析物(例如,cfRNA分子、cfDNA分子、蛋白质或代谢物),以使用一种或多种合适的测定进行下游测定。The cell-free biological sample may contain one or more analytes capable of being assayed, such as cell-free ribonucleic acid (cfRNA) molecules suitable for assaying to generate transcriptomic data, using transcripts derived from the cell-free biological sample (e.g., messenger RNA, transfer RNA, or ribosomal RNA) to generate transcript data, cell-free deoxyribonucleic acid (cfDNA) molecules suitable for assaying to generate genomic data and/or methylation data, proteins suitable for assaying to generate proteomic data (e.g., pregnancy-associated proteins corresponding to pregnancy-associated genomic sites or genes), metabolites suitable for assaying to generate metabolomic data, or mixtures or combinations thereof. One or more such analytes (e.g., cfRNA molecules, cfDNA molecules, proteins, or metabolites) may be isolated or extracted from one or more cell-free biological samples of a subject for downstream assaying using one or more suitable assays.

从对象获得无细胞生物样品后,可以处理无细胞生物样品以生成指示对象的妊娠相关状态的数据集。例如,处于妊娠相关状态相关联基因组位点的分组的无细胞生物样品的核酸分子的存在、不存在或定量评估(例如,处于妊娠相关状态相关联基因组位点的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质(例如,对应于妊娠相关联基因组位点或基因)的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据,可以指示妊娠相关状态。处理从对象获得的无细胞生物样品可以包括(i)使无细胞生物样品经受足以分离、富集或提取多个核酸分子、蛋白质(例如,对应于妊娠相关联基因组位点或基因的妊娠相关联蛋白质)和/或代谢物的条件,以及(ii)测定多个核酸分子、蛋白质和/或代谢物以生成数据集。After obtaining a cell-free biological sample from a subject, the cell-free biological sample can be processed to generate a data set indicating a pregnancy-related state of the subject. For example, the presence, absence or quantitative assessment of nucleic acid molecules in a cell-free biological sample grouped at a pregnancy-related state-associated genomic site (e.g., a quantitative measurement of RNA transcripts or DNA in a pregnancy-related state-associated genomic site), proteomic data including quantitative measurements of proteins in a data set of grouped proteins associated with a pregnancy-related state (e.g., corresponding to a pregnancy-associated genomic site or gene), and/or metabolomic data including quantitative measurements of grouped metabolites associated with a pregnancy-associated state can indicate a pregnancy-associated state. Processing the cell-free biological sample obtained from the subject can include (i) subjecting the cell-free biological sample to conditions sufficient to separate, enrich or extract a plurality of nucleic acid molecules, proteins (e.g., pregnancy-associated proteins corresponding to a pregnancy-associated genomic site or gene) and/or metabolites, and (ii) determining a plurality of nucleic acid molecules, proteins and/or metabolites to generate a data set.

在一些实施方案中,从无细胞生物样品提取多个核酸分子并进行测序以生成多个测序读数。核酸分子可以包括核糖核酸(RNA)或脱氧核糖核酸(DNA)。核酸分子(例如,RNA或DNA)可以通过多种方法从无细胞生物样品提取,诸如MP Biomedicals的FastDNA试剂盒方案、Qiagen的QIAamp DNA无细胞生物迷你试剂盒或Norgen Biotek的无细胞生物DNA分离试剂盒方案。提取方法可以从样品提取所有RNA或DNA分子。或者,提取方法可以选择性地从样品提取一部分RNA或DNA分子。从样品提取的RNA分子可以通过逆转录(RT)转化为DNA分子。In some embodiments, multiple nucleic acid molecules are extracted from acellular biological samples and sequenced to generate multiple sequencing readings. Nucleic acid molecules may include ribonucleic acid (RNA) or deoxyribonucleic acid (DNA). Nucleic acid molecules (e.g., RNA or DNA) can be extracted from acellular biological samples by a variety of methods, such as the FastDNA kit scheme of MP Biomedicals, the QIAamp DNA acellular biological mini kit of Qiagen or the acellular biological DNA separation kit scheme of Norgen Biotek. The extraction method can extract all RNA or DNA molecules from the sample. Alternatively, the extraction method can selectively extract a portion of RNA or DNA molecules from the sample. The RNA molecules extracted from the sample can be converted into DNA molecules by reverse transcription (RT).

测序可以通过任何合适的测序方法进行,诸如大规模并行测序(MPS)、配对末端测序、高通量测序、二代测序(NGS)、鸟枪法测序、单分子测序、纳米孔测序、半导体测序、焦磷酸测序、合成测序(SBS)、连接测序、杂交测序和RNA-Seq(Illumina)。Sequencing can be performed by any suitable sequencing method, such as massively parallel sequencing (MPS), paired-end sequencing, high-throughput sequencing, next-generation sequencing (NGS), shotgun sequencing, single-molecule sequencing, nanopore sequencing, semiconductor sequencing, pyrosequencing, synthesis sequencing (SBS), ligation sequencing, hybridization sequencing, and RNA-Seq (Illumina).

测序可以包括(例如,RNA或DNA分子的)核酸扩增。在一些实施方案中,核酸扩增是聚合酶链反应(PCR)。可以进行适当轮数的PCR(例如,PCR、qPCR、逆转录酶PCR、数字PCR等),以将初始量的核酸(例如,RNA或DNA)充分扩增至随后测序所需的输入量。在某些情况下,PCR可以用于靶核酸的全局扩增。这可以包括使用接头序列,该接头序列可以首先连接到不同的分子上,然后使用通用引物进行PCR扩增。可以使用许多商业试剂盒中的任何一种进行PCR,例如,由Life Technologies、Affymetrix、Promega、Qiagen等提供的。在其他情况下,只有核酸群中的某些靶核酸可以被扩增。特异性引物,可以与接头连接联合使用,可以用于选择性扩增一些靶标以进行下游测序。PCR可以包括一个或多个基因组位点的靶向扩增,诸如与妊娠相关状态相关联的基因组位点。测序可以包括使用同时的逆转录(RT)和聚合酶链反应(PCR),诸如Qiagen、NEB、Thermo Fisher Scientific或Bio-Rad的OneStep RT-PCR试剂盒方案。Sequencing can include nucleic acid amplification (e.g., of RNA or DNA molecules). In some embodiments, nucleic acid amplification is polymerase chain reaction (PCR). Appropriate rounds of PCR (e.g., PCR, qPCR, reverse transcriptase PCR, digital PCR, etc.) can be performed to fully amplify the initial amount of nucleic acid (e.g., RNA or DNA) to the input amount required for subsequent sequencing. In some cases, PCR can be used for the global amplification of target nucleic acids. This can include the use of a joint sequence, which can first be connected to different molecules, and then PCR amplification using universal primers. PCR can be performed using any of many commercial kits, for example, provided by Life Technologies, Affymetrix, Promega, Qiagen, etc. In other cases, only certain target nucleic acids in the nucleic acid group can be amplified. Specific primers, which can be used in conjunction with joint connections, can be used to selectively amplify some targets for downstream sequencing. PCR can include targeted amplification of one or more genomic sites, such as genomic sites associated with pregnancy-related states. Sequencing may include the use of simultaneous reverse transcription (RT) and polymerase chain reaction (PCR), such as the OneStep RT-PCR kit protocol from Qiagen, NEB, Thermo Fisher Scientific, or Bio-Rad.

从无细胞生物样品中分离或提取的RNA或DNA分子可以例如,用可鉴定的标签进行标记,以允许多个样品的多重复用。任何数量的RNA或DNA样品都可以进行多重复用。例如,多重反应可以含有来自至少约2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、25、30、35、40、45、50、55、60、65、70、75、80、85、90、95、100或大于100个初始无细胞生物样品的RNA或DNA。例如,可以用样品条形码标记多个无细胞生物样品,使得每个DNA分子可以追溯到DNA分子起源的样品(和对象)。此类标签可以通过连接或用引物进行PCR扩增而附接到RNA或DNA分子上。The RNA or DNA molecules separated or extracted from the cell-free biological sample can, for example, be marked with identifiable labels to allow multiple reuse of multiple samples. Any number of RNA or DNA samples can be multiple reused. For example, multiple reactions can contain RNA or DNA from at least about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more than 100 initial cell-free biological samples. For example, multiple cell-free biological samples can be marked with sample barcodes so that each DNA molecule can be traced back to the sample (and object) of the DNA molecule origin. Such labels can be attached to RNA or DNA molecules by connection or by carrying out pcr amplification with primers.

在对核酸分子进行测序后,可以对序列读数进行合适的生物信息学处理,以生成指示妊娠相关状态的存在、不存在或相对评估的数据。例如,可以将序列读数与一个或多个参考基因组(例如,一个或多个物种的基因组如人类基因组)进行比对。可以在一个或多个基因组位点上定量比对的序列读数,以生成指示妊娠相关状态的数据集。例如,定量对应于与妊娠相关状态相关联的多个基因组位点的序列可以生成指示妊娠相关状态的数据集。After sequencing the nucleic acid molecules, the sequence reads can be subjected to suitable bioinformatics processing to generate data indicating the presence, absence, or relative assessment of a pregnancy-related state. For example, the sequence reads can be compared to one or more reference genomes (e.g., genomes of one or more species such as the human genome). The sequence reads compared can be quantified at one or more genomic sites to generate a data set indicating a pregnancy-related state. For example, quantifying sequences corresponding to multiple genomic sites associated with a pregnancy-related state can generate a data set indicating a pregnancy-related state.

无细胞生物样品可以在没有任何核酸提取的情况下进行处理。例如,可以通过使用被配置为选择性富集对应于多个妊娠相关状态相关联基因组位点的核酸(例如,RNA或DNA)分子的探针在对象中鉴定或监测妊娠相关状态。探针可以是核酸引物。探针可以与来自多个妊娠相关状态相关联基因组位点或基因组区域中的一个或多个的核酸序列具有序列互补性。该多个妊娠相关状态相关联基因组位点或基因组区域可以包括至少2、至少3、至少4、至少5、至少6、至少7、至少8、至少9、至少10、至少11、至少12、至少13、至少14、至少15、至少16、至少17、至少18、至少19、至少20、至少约25、至少约30、至少约35、至少约40个、至少约45个、至少约50个、至少约55个、至少约60个、至少约65个、至少约70个、至少约75个、至少约80个、至少约85个、至少约90个、至少约95个、至少约100个或更多不同的妊娠相关状态相关联基因组位点或基因组区域。该多个妊娠相关状态相关联基因组位点或基因组区域可以包括一个或多个(例如,1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、约25、约30、约35、约40、约45、约50、约55、约60、约65、约70、约75、约80个或更多)成员,其选自ACTB、ADAM12、ALPP、ANXA3、APLF、ARG1、AVPR1A、CAMP、CAPN6、CD180、CGA、CGB、CLCN3、CPVL、CSH1、CSH2、CSHL1、CYP3A7、DAPP1、DCX、DEFA4、DGCR14、ELANE、ENAH、EPB42、FABP1、FAM212B-AS1、FGA、FGB、FRMD4B、FRZB、FSTL3、GH2、GNAZ、HAL、HSD17B1、HSD3B1、HSPB8、Immune、ITIH2、KLF9、KNG1、KRT8、LGALS14、LTF、LYPLAL1、MAP3K7CL、MEF2C、MMD、MMP8、MOB1B、NFATC2、OTC、P2RY12、PAPPA、PGLYRP1、PKHD1L1、PKHD1L1、PLAC1、PLAC4、POLE2、PPBP、PSG1、PSG4、PSG7、PTGER3、RAB11A、RAB27B、RAP1GAP、RGS18、RPL23AP7、S100A8、S100A9、S100P、SERPINA7、SLC2A2、SLC38A4、SLC4A1、TBC1D15、VCAN、VGLL1、B3GNT2、COL24A1、CXCL8和PTGS2。妊娠相关状态相关联基因组位点或基因组区域可以与胎龄、早产、预产期、分娩发作或其他妊娠相关状态或并发症相关联,诸如由例如Ngo等人(“Noninvasive blood tests for fetaldevelopment predict gestational age and preterm delivery”,Science,360(6393),第1133-1136页,2018年6月08日)描述的基因组位点,其在此通过引用整体并入。Cell-free biological samples can be processed without any nucleic acid extraction. For example, a pregnancy-related state can be identified or monitored in a subject by using a probe configured to selectively enrich for nucleic acid (e.g., RNA or DNA) molecules corresponding to a plurality of pregnancy-related state-associated genomic sites. The probe can be a nucleic acid primer. The probe can have sequence complementarity with one or more nucleic acid sequences from a plurality of pregnancy-related state-associated genomic sites or genomic regions. The plurality of pregnancy-related state-associated genomic loci or genomic regions may include at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least about 25, at least about 30, at least about 35, at least about 40, at least about 45, at least about 50, at least about 55, at least about 60, at least about 65, at least about 70, at least about 75, at least about 80, at least about 85, at least about 90, at least about 95, at least about 100 or more different pregnancy-related state-associated genomic loci or genomic regions. The plurality of pregnancy-related status-associated genomic loci or genomic regions may include one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80 or more) members selected from ACTB, ADAM12, ALPP, ANXA 3. APLF, ARG1, AVPR1A, CAMP, CAPN6, CD180, CGA, CGB, CLCN3, CPVL, CSH1, CSH2, CSHL1, CYP3A7, DAPP1, DCX, DEFA4, DGCR14, ELANE, ENAH, EPB42, FABP1, FAM212B-AS1, FGA, FGB, FRMD4B, FRZB, FSTL3, GH2 ,G NAZ, HAL, HSD17B1, HSD3B1, HSPB8, Immune, ITIH2, KLF9, KNG1, KRT8, LGALS14, LTF, LYPLAL1, MAP3K7CL, MEF2C, MMD, MMP8, MOB1B, NFATC2, OTC, P2RY12, PAPPA, PGLYRP1, PKHD1L1, PKHD1L1, PLAC1, PLAC4, POLE2, PPBP, PSG1, PSG4, PSG7, PTGER3, RAB11A, RAB27B, RAP1GAP, RGS18, RPL23AP7, S100A8, S100A9, S100P, SERPINA7, SLC2A2, SLC38A4, SLC4A1, TBC1D15, VCAN, VGLL1, B3GNT2, COL24A1, CXCL8, and PTGS2. Genomic loci or genomic regions associated with pregnancy-related states can be associated with gestational age, preterm birth, due date, onset of labor, or other pregnancy-related states or complications, such as the genomic loci described by, for example, Ngo et al. (“Noninvasive blood tests for fetal development predict gestational age and preterm delivery”, Science, 360(6393), pp. 1133-1136, June 8, 2018), which are hereby incorporated by reference in their entirety.

探针可以是与一个或多个基因组位点(例如,妊娠相关状态相关联基因组位点)的核酸序列(例如,RNA或DNA)具有序列互补性的核酸分子(例如,RNA或DNA)。这些核酸分子可以是引物或富集序列。使用对一个或多个基因组位点(例如,妊娠相关状态相关联基因组位点)具有选择性的探针对无细胞生物样品的测定可以包括使用阵列杂交(例如,基于微阵列)、聚合酶链反应(PCR)或核酸测序(例如,RNA测序或DNA测序)。在一些实施方案中,DNA或RNA可以通过以下一种或多种进行测定:等温DNA/RNA扩增方法(例如,环介导等温扩增(LAMP)、解旋酶依赖性扩增(HDA)、滚环扩增(RCA)、重组酶聚合酶扩增(RPA))、免疫测定、电化学测定、表面增强拉曼光谱(SERS)、基于量子点(QD)的测定、分子倒置探针、液滴数字PCR(ddPCR)、基于CRISPR/Cas的检测(例如,CRISPR分型PCR(ctPCR)、特异性高灵敏度酶报告基因解锁(SHERLOCK)、DNA核酸内切酶靶向CRISPR反式报告基因(DETECTR)和CRISPR介导的模拟多事件记录装置(CAMERA))以及激光透射光谱(LTS)。The probes can be nucleic acid molecules (e.g., RNA or DNA) having sequence complementarity with nucleic acid sequences (e.g., RNA or DNA) of one or more genomic loci (e.g., genomic loci associated with a pregnancy-related state). These nucleic acid molecules can be primers or enrichment sequences. Assays of cell-free biological samples using probes that are selective for one or more genomic loci (e.g., genomic loci associated with a pregnancy-related state) can include the use of array hybridization (e.g., microarray-based), polymerase chain reaction (PCR), or nucleic acid sequencing (e.g., RNA sequencing or DNA sequencing). In some embodiments, DNA or RNA can be assayed by one or more of isothermal DNA/RNA amplification methods (e.g., loop-mediated isothermal amplification (LAMP), helicase-dependent amplification (HDA), rolling circle amplification (RCA), recombinase polymerase amplification (RPA)), immunoassays, electrochemical assays, surface enhanced Raman spectroscopy (SERS), quantum dot (QD)-based assays, molecular inversion probes, droplet digital PCR (ddPCR), CRISPR/Cas-based detection (e.g., CRISPR typing PCR (ctPCR), specific high sensitivity enzymatic reporter gene unlocking (SHERLOCK), DNA endonuclease-targeted CRISPR trans-reporter (DETECTR), and CRISPR-mediated analog multiple event recording device (CAMERA)), and laser transmission spectroscopy (LTS).

测定读数可以在一个或多个基因组位点(例如,妊娠相关状态相关联基因组位点)上进行定量,以生成指示妊娠相关状态的数据。例如,阵列杂交或聚合酶链反应(PCR)对应于多个基因组位点(例如,妊娠相关状态相关联基因组位点)的定量可以生成指示妊娠相关状态的数据。测定读数可以包括定量PCR(qPCR)值、数字PCR(dPCR)值、数字微滴PCR(ddPCR)值、荧光值等,或其标准化值。该测定可以是配置为在家庭环境中进行的家用测试。The determination reading can be quantified on one or more genomic sites (e.g., genomic sites associated with pregnancy-related states) to generate data indicating pregnancy-related states. For example, array hybridization or polymerase chain reaction (PCR) corresponds to quantification of multiple genomic sites (e.g., genomic sites associated with pregnancy-related states) to generate data indicating pregnancy-related states. The determination reading can include quantitative PCR (qPCR) values, digital PCR (dPCR) values, digital droplet PCR (ddPCR) values, fluorescence values, etc., or their standardized values. The determination can be a home test configured to be performed in a home environment.

在一些实施方案中,多重测定用于处理对象的无细胞生物样品。例如,第一测定可以用于处理获得自或来源于对象的第一无细胞生物样品以生成第一数据集;并且至少部分地基于第一数据集,不同于第一测定的第二测定可以用于处理获得自或来源于对象的第二无细胞生物样品,以生成指示妊娠相关状态的第二数据集。第一测定可以用于筛选或处理对象集的无细胞生物样品,而第二或后续测定可以用于筛选或处理对象集的较小子集的无细胞生物样品。第一测定可以具有检测一种或多种妊娠相关状态(例如,妊娠相关并发症)的低成本和/或高灵敏度,其适于筛选或处理相对大的对象集的无细胞生物样品。第二测定可以具有检测一种或多种妊娠相关状态(例如,妊娠相关并发症)的更高成本和/或更高特异性,其适于筛选或处理相对小的对象集(例如,使用第一种测定筛选的对象子集)的无细胞生物样品。第二测定可以生成具有比使用第一测定生成的第一数据集更大的特异性(例如,对于一种或多种妊娠相关状态诸如妊娠相关并发症)的第二数据集。例如,可以在大的对象集上使用cfRNA测定处理一个或多个无细胞生物样品,然后在较小的对象子集上使用代谢组学测定,反之亦然。可以至少部分地基于第一次测定的结果选择较小的对象子集。In some embodiments, multiple assays are used to process acellular biological samples of an object. For example, a first assay can be used to process a first acellular biological sample obtained from or derived from an object to generate a first data set; and based at least in part on the first data set, a second assay different from the first assay can be used to process a second acellular biological sample obtained from or derived from an object to generate a second data set indicating a pregnancy-related state. The first assay can be used to screen or process acellular biological samples of a set of objects, while a second or subsequent assay can be used to screen or process acellular biological samples of a smaller subset of the set of objects. The first assay can have low cost and/or high sensitivity for detecting one or more pregnancy-related states (e.g., pregnancy-related complications), which is suitable for screening or processing acellular biological samples of a relatively large set of objects. The second assay can have a higher cost and/or higher specificity for detecting one or more pregnancy-related states (e.g., pregnancy-related complications), which is suitable for screening or processing acellular biological samples of a relatively small set of objects (e.g., a subset of objects screened using the first assay). The second assay can generate a second data set with greater specificity (e.g., for one or more pregnancy-related states such as pregnancy-related complications) than the first data set generated using the first assay. For example, one or more cell-free biological samples may be processed using a cfRNA assay on a large set of subjects, and then a metabolomics assay may be used on a smaller subset of subjects, or vice versa. The smaller subset of subjects may be selected based at least in part on the results of the first assay.

替代性地,可以使用多重测定同时处理对象的无细胞生物样品。例如,第一测定可以用于处理获得自或来源于对象的第一无细胞生物样品,以生成指示妊娠相关状态的第一数据集;并且不同于第一测定的第二测定可以用于处理获得自或来源于对象的第二无细胞生物样品,以生成指示妊娠相关状态的第二数据集。然后可以分析第一数据集和第二数据集的任何一个或全部以评估对象的妊娠相关状态。例如,可以基于第一个数据集和第二个数据集的组合生成单个诊断指标或诊断评分。作为另一个实例,可以基于第一数据集和第二数据集生成单独的诊断指标或诊断评分。Alternatively, a cell-free biological sample of a subject can be processed simultaneously using multiple assays. For example, a first assay can be used to process a first cell-free biological sample obtained from or derived from a subject to generate a first data set indicating a pregnancy-related state; and a second assay different from the first assay can be used to process a second cell-free biological sample obtained from or derived from a subject to generate a second data set indicating a pregnancy-related state. Any or all of the first and second data sets can then be analyzed to assess the pregnancy-related state of the subject. For example, a single diagnostic indicator or diagnostic score can be generated based on a combination of the first and second data sets. As another example, a separate diagnostic indicator or diagnostic score can be generated based on the first and second data sets.

可以处理无细胞生物样品以鉴定生物标志物RNA转录物集,这些生物标志物RNA转录物指示相应的生物标志物蛋白(例如,对应于妊娠相关联基因组位点或基因的妊娠相关联蛋白质)、途径和/或代谢物集。例如,可以预期给定的生物标志物RNA转录物翻译成相应的给定生物标志物蛋白或相应的给定生物标志物蛋白的基因调节因子。因此,鉴定生物样品中给定生物标志物RNA转录物的存在或不存在可以指示相应的生物标志物蛋白的存在或不存在。作为另一个实例,可以预期给定的生物标志物RNA转录物与相应的给定途径相关。因此,鉴定生物样品中给定生物标志物RNA转录物的存在或不存在可以指示相应途径活性的存在或不存在。作为另一个实例,可以预期给定的生物标志物RNA转录物与相应的给定生物标志物代谢物相关。因此,鉴定生物样品中给定生物标志物RNA转录物的存在或不存在可以指示相应的生物标志物代谢物的存在或不存在。在一些实施方案中,相应的生物标志物蛋白、途径和/或代谢物集包括妊娠相关状态相关联蛋白质(例如,对应于妊娠相关联基因组位点或基因)、途径和/或代谢物。在一些实施方案中,相应的生物标志物蛋白、途径和/或代谢物集包括胎盘蛋白、途径和/或代谢物。例如,鉴定PAPPA基因的存在或不存在可以指示PAPPA蛋白类似物的存在或不存在。Cell-free biological samples can be processed to identify sets of biomarker RNA transcripts that indicate corresponding biomarker proteins (e.g., pregnancy-associated proteins corresponding to pregnancy-associated genomic sites or genes), pathways, and/or metabolite sets. For example, a given biomarker RNA transcript can be expected to be translated into a corresponding given biomarker protein or a gene regulator of a corresponding given biomarker protein. Therefore, identifying the presence or absence of a given biomarker RNA transcript in a biological sample can indicate the presence or absence of a corresponding biomarker protein. As another example, a given biomarker RNA transcript can be expected to be associated with a corresponding given pathway. Therefore, identifying the presence or absence of a given biomarker RNA transcript in a biological sample can indicate the presence or absence of a corresponding pathway activity. As another example, a given biomarker RNA transcript can be expected to be associated with a corresponding given biomarker metabolite. Therefore, identifying the presence or absence of a given biomarker RNA transcript in a biological sample can indicate the presence or absence of a corresponding biomarker metabolite. In some embodiments, the corresponding biomarker proteins, pathways and/or metabolite sets include pregnancy-related state-associated proteins (e.g., corresponding to pregnancy-associated genomic sites or genes), pathways and/or metabolites. In some embodiments, the corresponding biomarker proteins, pathways and/or metabolite sets include placental proteins, pathways and/or metabolites. For example, identifying the presence or absence of a PAPPA gene can indicate the presence or absence of a PAPPA protein analog.

无细胞生物样品可以使用代谢组学测定进行处理。例如,代谢组学测定可以用于鉴定对象的无细胞生物样品中多个妊娠相关状态相关联代谢物中每一种的定量量度(例如,指示存在、不存在或相对量)。代谢组学测定可以被配置为处理对象的无细胞生物样品,诸如血液样品或尿液样品(或其衍生物)。无细胞生物样品中妊娠相关状态相关联代谢物的定量量度(例如,指示存在、不存在或相对量)可以指示一种或多种妊娠相关状态。无细胞生物样品中的代谢物可以作为对应于妊娠相关状态相关联基因的一个或多个代谢途径的结果产生(例如,作为终产物或副产物)。测定无细胞生物样品的一种或多种代谢物可以包括从无细胞生物样品中分离或提取代谢物。代谢组学测定可以用于生成数据集,其指示对象的无细胞生物样品中多个妊娠相关状态相关联代谢物中每种的定量量度(例如,指示存在、不存在或相对量)。Cell-free biological samples can be processed using metabolomics assays. For example, metabolomics assays can be used to identify each of the quantitative measurements (e.g., indicating the presence, absence, or relative amount) of multiple pregnancy-related state-associated metabolites in the cell-free biological samples of the object. Metabolomics assays can be configured to process the cell-free biological samples of the object, such as blood samples or urine samples (or derivatives thereof). The quantitative measurements (e.g., indicating the presence, absence, or relative amount) of pregnancy-related state-associated metabolites in the cell-free biological samples can indicate one or more pregnancy-related states. The metabolites in the cell-free biological samples can be produced (e.g., as an end product or by-product) as a result of one or more metabolic pathways corresponding to the genes associated with pregnancy-related states. Determining one or more metabolites of the cell-free biological samples can include separating or extracting metabolites from the cell-free biological samples. Metabolomics assays can be used to generate a data set, which indicates each of the quantitative measurements (e.g., indicating the presence, absence, or relative amount) of multiple pregnancy-related state-associated metabolites in the cell-free biological samples of the object.

代谢组学测定可以分析无细胞生物样品中的多种代谢物,诸如小分子、脂质、氨基酸、肽、核苷酸、激素和其他信号分子、细胞因子、矿物质和元素、多酚、脂肪酸、二羧酸、醇和多元醇、烷烃和烯烃、酮酸、糖脂、碳水化合物、羟基酸、嘌呤、前列腺素、儿茶酚胺、酰基磷酸酯、磷脂、环胺、氨基酮、核苷、甘油脂、芳香酸、类视黄醇、氨基醇、蝶呤、类固醇、肉碱、白三烯、吲哚、卟啉、磷酸糖类、辅酶A衍生物、葡糖苷酸、酮、磷酸糖类、无机离子和气体、鞘脂类、胆汁酸、磷酸醇、氨基酸磷酸盐、醛、醌、嘧啶、吡哆醛、三羧酸、酰基甘氨酸、钴胺素衍生物、脂酰胺、生物素和多胺。Metabolomics assays can analyze a wide variety of metabolites in cell-free biological samples, such as small molecules, lipids, amino acids, peptides, nucleotides, hormones and other signaling molecules, cytokines, minerals and elements, polyphenols, fatty acids, dicarboxylic acids, alcohols and polyols, alkanes and alkenes, keto acids, glycolipids, carbohydrates, hydroxy acids, purines, prostaglandins, catecholamines, acyl phosphates, phospholipids, cyclic amines, aminoketones, nucleosides, glycerolipids, aromatic acids, retinoids, amino alcohols, pterins, steroids, carnitines, leukotrienes, indoles, porphyrins, phosphate sugars, CoA derivatives, glucuronides, ketones, phosphate sugars, inorganic ions and gases, sphingolipids, bile acids, phosphoalcohols, amino acid phosphates, aldehydes, quinones, pyrimidines, pyridoxal, tricarboxylic acids, acylglycines, cobalamin derivatives, lipoamides, biotin, and polyamines.

代谢组学测定可以包括,例如,以下一种或多种:质谱(MS)、靶向MS、气相色谱(GC)、高效液相色谱(HPLC)、毛细管电泳(CE)、核磁共振(NMR)波谱、离子迁移谱、拉曼光谱、电化学测定或免疫测定。Metabolomic assays can include, for example, one or more of the following: mass spectrometry (MS), targeted MS, gas chromatography (GC), high performance liquid chromatography (HPLC), capillary electrophoresis (CE), nuclear magnetic resonance (NMR) spectroscopy, ion mobility spectrometry, Raman spectroscopy, electrochemical assays, or immunoassays.

无细胞生物样品可以使用甲基化特异性测定进行处理。例如,甲基化特异性测定可以用于鉴定对象的无细胞生物样品中多个妊娠相关状态相关联基因组位点中每一个的甲基化的定量量度(例如,指示存在、不存在或相对量)。甲基化特异性测定可以配置为处理对象的无细胞生物样品,诸如血液样品或尿液样品(或其衍生物)。无细胞生物样品中妊娠相关状态相关联基因组位点甲基化的定量量度(例如,指示存在、不存在或相对量)可以指示一种或多种妊娠相关状态。甲基化特异性测定可以用于生成数据集,其指示对象的无细胞生物样品中多个妊娠相关状态相关联基因组位点中每个的甲基化的定量量度(例如,指示存在、不存在或相对量)。Cell-free biological samples can be processed using methylation-specific assays. For example, methylation-specific assays can be used to identify the quantitative measurement of methylation of each of multiple pregnancy-related state-associated genomic sites in the cell-free biological sample of the object (e.g., indicating the presence, absence, or relative amount). Methylation-specific assays can be configured to process the cell-free biological sample of the object, such as a blood sample or a urine sample (or its derivatives). The quantitative measurement of methylation of the genomic sites associated with pregnancy-related states in the cell-free biological sample (e.g., indicating the presence, absence, or relative amount) can indicate one or more pregnancy-related states. Methylation-specific assays can be used to generate a data set, which indicates the quantitative measurement of methylation of each of multiple pregnancy-related state-associated genomic sites in the cell-free biological sample of the object (e.g., indicating the presence, absence, or relative amount).

甲基化特异性测定可以包括,例如,以下一种或多种:甲基化感知测序(例如,使用亚硫酸氢盐处理)、焦磷酸测序、甲基化敏感性单链构象分析(MS-SSCA)、高分辨熔解曲线分析(HRM)、甲基化敏感性单核苷酸引物延伸(MS-SnuPE)、碱基特异性切割/MALDI-TOF、基于微阵列的甲基化测定、甲基化特异性PCR、靶向亚硫酸氢盐测序、氧化亚硫酸氢盐测序、基于质谱的亚硫酸氢盐测序,或简化代表性重亚硫酸氢盐测序(RRBS)。Methylation-specific assays can include, for example, one or more of the following: methylation-aware sequencing (e.g., using bisulfite treatment), pyrosequencing, methylation-sensitive single-strand conformation analysis (MS-SSCA), high-resolution melting curve analysis (HRM), methylation-sensitive single nucleotide primer extension (MS-SnuPE), base-specific cleavage/MALDI-TOF, microarray-based methylation assays, methylation-specific PCR, targeted bisulfite sequencing, oxidative bisulfite sequencing, mass spectrometry-based bisulfite sequencing, or reduced representation bisulfite sequencing (RRBS).

无细胞生物样品可以使用蛋白质组学测定进行处理。例如,蛋白质组学测定可以用于鉴定对象的无细胞生物样品中多个妊娠相关状态相关联蛋白质(例如,对应于妊娠相关联基因组位点或基因)或多肽中每一种的定量量度(例如,指示存在、不存在或相对量)。蛋白质组学测定可以配置为处理对象的无细胞生物样品,诸如血液样品或尿液样品(或其衍生物)。无细胞生物样品中妊娠相关状态相关联蛋白质(例如,对应于妊娠相关联基因组位点或基因)或多肽的定量量度(例如,指示存在、不存在或相对量)可以指示一种或多种妊娠相关状态。无细胞生物样品中的蛋白质或多肽可以作为对应于妊娠相关状态相关联基因的一个或多个生化途径的结果产生(例如,作为终产物、中间产物或副产物)。测定无细胞生物样品的一种或多种蛋白质或多肽可以包括从无细胞生物样品中分离或提取蛋白质或多肽。蛋白质组学测定可以用于生成数据集,其指示对象的无细胞生物样品中多个妊娠相关状态相关联蛋白质或多肽中每一种的定量量度(例如,指示存在、不存在或相对量)。Cell-free biological samples can be processed using proteomic assays. For example, proteomic assays can be used to identify a quantitative measure (e.g., indicating presence, absence, or relative amount) of each of a plurality of pregnancy-related state-associated proteins (e.g., corresponding to pregnancy-associated genomic sites or genes) or polypeptides in a cell-free biological sample of an object. Proteomic assays can be configured to process a cell-free biological sample of an object, such as a blood sample or a urine sample (or a derivative thereof). A quantitative measure (e.g., indicating presence, absence, or relative amount) of a pregnancy-related state-associated protein (e.g., corresponding to a pregnancy-associated genomic site or gene) or polypeptide in a cell-free biological sample can indicate one or more pregnancy-related states. A protein or polypeptide in a cell-free biological sample can be produced as a result of one or more biochemical pathways corresponding to a pregnancy-associated state-associated gene (e.g., as an end product, intermediate, or by-product). Determining one or more proteins or polypeptides of a cell-free biological sample can include separating or extracting proteins or polypeptides from a cell-free biological sample. Proteomic assays can be used to generate a data set, which indicates a quantitative measure (e.g., indicating presence, absence, or relative amount) of each of a plurality of pregnancy-associated state-associated proteins or polypeptides in a cell-free biological sample of an object.

蛋白质组学测定可以分析无细胞生物样品中的多种蛋白质(例如,对应于妊娠相关联基因组位点或基因的妊娠相关联蛋白质)或多肽,诸如在不同细胞条件下(例如,发育、细胞分化或细胞周期)制造的蛋白质。蛋白质组学测定可以包括例如以下一种或多种:基于抗体的免疫测定、Edman降解测定、基于质谱的测定(例如,基质辅助激光解吸/电离(MALDI)和电喷雾电离(ESI))、自上而下的蛋白质组学测定、自下而上的蛋白质组学测定、质谱免疫测定(MSIA)、稳定同位素标准品-利用抗肽抗体捕获(SISCAPA)测定、二维差异荧光凝胶电泳(2-DDIGE)测定、定量蛋白质组学测定、蛋白质微阵列测定或反相蛋白质微阵列测定。蛋白质组学测定可以检测蛋白质或多肽的翻译后修饰(例如,磷酸化、泛素化、甲基化、乙酰化、糖基化、氧化和亚硝基化)。蛋白质组学测定可以从数据库(例如,Human ProteinAtlas、PeptideAtlas和UniProt)中鉴定或定量一种或多种蛋白质或多肽。Proteomic assays can analyze a variety of proteins (e.g., pregnancy-associated proteins corresponding to pregnancy-associated genomic sites or genes) or polypeptides in a cell-free biological sample, such as proteins produced under different cellular conditions (e.g., development, cell differentiation, or cell cycle). Proteomic assays can include, for example, one or more of the following: antibody-based immunoassays, Edman degradation assays, mass spectrometry-based assays (e.g., matrix-assisted laser desorption/ionization (MALDI) and electrospray ionization (ESI)), top-down proteomic assays, bottom-up proteomic assays, mass spectrometry immunoassays (MSIA), stable isotope standards-captured using anti-peptide antibodies (SISCAPA) assays, two-dimensional differential fluorescence gel electrophoresis (2-DDIGE) assays, quantitative proteomic assays, protein microarray assays, or reversed-phase protein microarray assays. Proteomic assays can detect post-translational modifications (e.g., phosphorylation, ubiquitination, methylation, acetylation, glycosylation, oxidation, and nitrosylation) of proteins or polypeptides. Proteomic assays can identify or quantify one or more proteins or polypeptides from databases (eg, Human Protein Atlas, Peptide Atlas, and UniProt).

试剂盒Reagent test kit

本公开提供了用于鉴定或监测对象的妊娠相关状态的试剂盒。试剂盒包括探针,其用于鉴定对象的无细胞生物样品中多个妊娠相关状态相关联基因组位点中每一个处的序列的定量量度(例如,指示存在、不存在或相对量)。无细胞生物样品中多个妊娠相关状态相关联基因组位点中的每一个处的序列的定量量度(例如,指示存在、不存在或相对量)可以指示一种或多种妊娠相关状态。探针可以对无细胞生物样品中多个妊娠相关状态相关联基因组位点处的序列具有选择性。试剂盒可以包括使用探针处理无细胞生物样品以生成数据集的说明书,数据集指示在对象的无细胞生物样品中多个妊娠相关状态相关联基因组位点的每一个处的序列的定量量度(例如,指示存在、不存在或相对量)。The present disclosure provides a kit for identifying or monitoring a pregnancy-related state of an object. The kit includes a probe for identifying a quantitative measure (e.g., indicating the presence, absence, or relative amount) of a sequence at each of a plurality of pregnancy-related state-associated genomic sites in a cell-free biological sample of an object. The quantitative measure (e.g., indicating the presence, absence, or relative amount) of a sequence at each of a plurality of pregnancy-related state-associated genomic sites in a cell-free biological sample can indicate one or more pregnancy-related states. The probe can be selective for a sequence at a plurality of pregnancy-related state-associated genomic sites in a cell-free biological sample. The kit can include instructions for processing a cell-free biological sample using a probe to generate a data set, and the data set indicates a quantitative measure (e.g., indicating the presence, absence, or relative amount) of a sequence at each of a plurality of pregnancy-related state-associated genomic sites in a cell-free biological sample of an object.

试剂盒中的探针可以对无细胞生物样品中多个妊娠相关状态相关联基因组位点处的序列具有选择性。试剂盒中的探针可以被配置为选择性富集对应于多个妊娠相关状态相关联基因组位点的核酸(例如,RNA或DNA)分子。试剂盒中的探针可以是核酸引物。试剂盒中的探针可以与来自多个妊娠相关状态相关联基因组位点或基因组区域中的一个或多个的核酸序列具有序列互补性。该多个妊娠相关状态相关联基因组位点或基因组区域可以包括至少2、至少3、至少4、至少5、至少6、至少7、至少8、至少9、至少10、至少11、至少12、至少13、至少14、至少15、至少16、至少17、至少18、至少19、至少20或更多不同的妊娠相关状态相关联基因组位点或基因组区域。The probe in the test kit can be selective for sequences at multiple pregnancy-related state-associated genomic sites in acellular biological samples. The probe in the test kit can be configured to selectively enrich nucleic acid (e.g., RNA or DNA) molecules corresponding to multiple pregnancy-related state-associated genomic sites. The probe in the test kit can be a nucleic acid primer. The probe in the test kit can have sequence complementarity with one or more nucleic acid sequences from multiple pregnancy-related state-associated genomic sites or genomic regions. The multiple pregnancy-related state-associated genomic sites or genomic regions can include at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20 or more different pregnancy-related state-associated genomic sites or genomic regions.

试剂盒中的说明书可以包括使用对无细胞生物样品中多个妊娠相关状态相关联基因组位点处的序列具有选择性的探针来测定无细胞生物样品的说明书。这些探针可以是具有与来自多个妊娠相关状态相关联基因组位点中的一个或多个的核酸序列(例如,RNA或DNA)具有序列互补性的核酸分子(例如,RNA或DNA)。这些核酸分子可以是引物或富集序列。测定无细胞生物样品的说明书可以包括进行阵列杂交、聚合酶链反应(PCR)或核酸测序(例如,DNA测序或RNA测序)以处理无细胞生物样品生成数据集的说明,数据集指示在无细胞生物样品中多个妊娠相关状态相关联基因组位点中的每一个处的序列的定量量度(例如,指示存在、不存在或相对量)。无细胞生物样品中多个妊娠相关状态相关联基因组位点中的每一个处的序列的定量量度(例如,指示存在、不存在或相对量)可以指示一种或多种妊娠相关状态。The instructions in the kit may include instructions for measuring acellular biological samples using probes that are selective for sequences at multiple pregnancy-related state-associated genomic sites in acellular biological samples. These probes may be nucleic acid molecules (e.g., RNA or DNA) having sequence complementarity with one or more nucleic acid sequences (e.g., RNA or DNA) from multiple pregnancy-related state-associated genomic sites. These nucleic acid molecules may be primers or enrichment sequences. The instructions for measuring acellular biological samples may include instructions for performing array hybridization, polymerase chain reaction (PCR) or nucleic acid sequencing (e.g., DNA sequencing or RNA sequencing) to process acellular biological samples to generate a data set, and the data set indicates a quantitative measure (e.g., indicating presence, absence or relative amount) of a sequence at each of a plurality of pregnancy-related state-associated genomic sites in acellular biological samples. The quantitative measure (e.g., indicating presence, absence or relative amount) of a sequence at each of a plurality of pregnancy-related state-associated genomic sites in acellular biological samples may indicate one or more pregnancy-related states.

试剂盒中的说明书可以包括测量和解释测定读数的说明,其可以在多个妊娠相关状态相关联基因组位点中的一个或多个处进行定量,以生成指示在无细胞生物样品中多个妊娠相关状态相关联基因组位点中每一个处的序列的定量量度(例如,指示存在、不存在或相对量)的数据集。例如,对应于多个妊娠相关状态相关联基因组位点的阵列杂交或聚合酶链反应(PCR)的定量可以生成指示在无细胞生物样品中多个妊娠相关状态相关联基因组位点中每一个处的序列的定量量度(例如,指示存在、不存在或相对量)的数据集。测定读数可以包括定量PCR(qPCR)值、数字PCR(dPCR)值、数字微滴PCR(ddPCR)值、荧光值等,或其标准化值。The instructions in the kit may include instructions for measuring and interpreting assay readings, which may be quantified at one or more of a plurality of pregnancy-associated state-associated genomic sites to generate a data set indicating a quantitative measure (e.g., indicating presence, absence, or relative amount) of a sequence at each of a plurality of pregnancy-associated state-associated genomic sites in a cell-free biological sample. For example, quantification of an array hybridization or polymerase chain reaction (PCR) corresponding to a plurality of pregnancy-associated state-associated genomic sites may generate a data set indicating a quantitative measure (e.g., indicating presence, absence, or relative amount) of a sequence at each of a plurality of pregnancy-associated state-associated genomic sites in a cell-free biological sample. Assay readings may include quantitative PCR (qPCR) values, digital PCR (dPCR) values, digital droplet PCR (ddPCR) values, fluorescence values, etc., or standardized values thereof.

试剂盒可以包括代谢组学测定,其用于鉴定对象的无细胞生物样品中多个妊娠相关状态相关联代谢物中每一种的定量量度(例如,指示存在、不存在或相对量)。无细胞生物样品中妊娠相关状态相关联代谢物的定量量度(例如,指示存在、不存在或相对量)可以指示一种或多种妊娠相关状态。无细胞生物样品中的代谢物可以作为对应于妊娠相关状态相关联基因的一个或多个代谢途径的结果产生(例如,作为终产物或副产物)。试剂盒可以包括用于从无细胞生物样品中分离或提取代谢物和/或用于使用代谢组学测定生成数据集的说明书,这些数据集指示对象的无细胞生物样品中多个妊娠相关状态相关联代谢物中每一种的定量量度(例如,指示存在、不存在或相对量)。The kit may include a metabolomics assay for identifying a quantitative measure (e.g., indicating presence, absence, or relative amount) of each of a plurality of pregnancy-related state-associated metabolites in a cell-free biological sample of the subject. A quantitative measure (e.g., indicating presence, absence, or relative amount) of a pregnancy-related state-associated metabolite in a cell-free biological sample may indicate one or more pregnancy-related states. Metabolites in a cell-free biological sample may be produced (e.g., as an end product or byproduct) as a result of one or more metabolic pathways corresponding to genes associated with a pregnancy-related state. The kit may include instructions for separating or extracting metabolites from a cell-free biological sample and/or for generating a data set using a metabolomics assay, which data sets indicate a quantitative measure (e.g., indicating presence, absence, or relative amount) of each of a plurality of pregnancy-related state-associated metabolites in a cell-free biological sample of the subject.

经训练算法Trained Algorithm

在使用一种或多种测定处理来源于对象的一个或多个无细胞生物样品以生成一个或多个指示妊娠相关状态或妊娠相关并发症的数据集之后,可以使用经训练算法处理数据集中的一个或多个(例如,在多个妊娠相关状态相关联基因组位点中的每一个处)以确定妊娠相关状态。例如,经训练算法可以用于确定在无细胞生物样品中多个妊娠相关状态相关联基因组位点中的每一个处的序列的定量量度。经训练算法可以被配置为以至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%、或高于99%的准确度鉴定至少约25、至少约50、至少约100、至少约150、至少约200、至少约250、至少约300、至少约350、至少约400、至少约450、至少约500个、或大于约500个独立样品的妊娠相关状态。After processing one or more cell-free biological samples derived from a subject using one or more assays to generate one or more data sets indicative of a pregnancy-related state or a pregnancy-related complication, one or more of the data sets (e.g., at each of a plurality of pregnancy-related state-associated genomic sites) can be processed using a trained algorithm to determine a pregnancy-associated state. For example, a trained algorithm can be used to determine a quantitative measure of a sequence at each of a plurality of pregnancy-associated state-associated genomic sites in the cell-free biological sample. The trained algorithm can be configured to identify a pregnancy-associated status for at least about 25, at least about 50, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, at least about 350, at least about 400, at least about 450, at least about 500, or more than about 500 independent samples with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than 99%.

经训练算法可以包括监督机器学习算法。经训练算法可以包括分类和回归树(CART)算法。监督机器学习算法可以包括例如随机森林、支持向量机(SVM)、神经网络或深度学习算法。经训练算法可以包括差异表达算法。该差异表达算法可以包括随机模型、广义泊松(GPseq)、混合泊松(TSPM)、泊松对数线性(PoissonSeq)、负二项式(edgeR、DESeq、baySeq、NBPSeq)、MAANOVA拟合的线性模型或其组合的使用比较。经训练算法可以包括无监督的机器学习算法。The trained algorithm may include a supervised machine learning algorithm. The trained algorithm may include a classification and regression tree (CART) algorithm. The supervised machine learning algorithm may include, for example, a random forest, a support vector machine (SVM), a neural network, or a deep learning algorithm. The trained algorithm may include a differential expression algorithm. The differential expression algorithm may include a random model, a generalized Poisson (GPseq), a mixed Poisson (TSPM), a Poisson log-linear (PoissonSeq), a negative binomial (edgeR, DESeq, baySeq, NBPSeq), a linear model of MAANOVA fitting, or a combination thereof. The trained algorithm may include an unsupervised machine learning algorithm.

经训练算法可以被配置为接受多个输入变量并基于该多个输入变量产生一个或多个输出值。该多个输入变量可以包括指示妊娠相关状态的一个或多个数据集。例如,输入变量可以包括与多个妊娠相关状态相关联基因组位点中的每一个相对应或对齐的若干序列。该多个输入变量还可以包括对象的临床健康数据。The trained algorithm can be configured to accept multiple input variables and generate one or more output values based on the multiple input variables. The multiple input variables can include one or more data sets indicating pregnancy-related states. For example, the input variables can include a number of sequences corresponding to or aligned with each of the genomic sites associated with the multiple pregnancy-related states. The multiple input variables can also include clinical health data of the subject.

经训练算法可以包括分类器,使得一个或多个输出值中的每一个包括固定数量的可能值(例如,线性分类器、逻辑回归分类器等)中的一个,其指示由分类器对无细胞生物样品的分类。经训练算法可以包括二进制分类器,使得一个或多个输出值中的每一个包括两个值(例如,{0,1},{阳性,阴性}或{高风险,低风险})中的一个,其指示由分类器对无细胞生物样品的分类。经训练算法可以是另一种类型的分类器,使得一个或多个输出值中的每一个包括多于两个值(例如,{0,1,2},{阳性,阴性或不确定},或{高风险,中风险或低风险})中的一个,其指示由分类器对无细胞生物样品的分类。输出值可以包括描述性标签、数值或其组合。一些输出值可以包括描述性标签。此类描述性标签可以提供对象的疾病或病症状态的鉴定或指示,并且可以包括例如阳性、阴性、高风险、中风险、低风险或不确定。此类描述性标签可以提供针对对象的妊娠相关状态的治疗的标识,并且可以包括例如治疗性干预、治疗性干预的持续时间和/或适于治疗妊娠相关状况的治疗性干预的剂量。此类描述性标签可以提供可以适于对对象进行的二级临床试验的鉴定,并且可以包括例如成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学检查、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。例如,此类描述性标签可以提供对象的妊娠相关状态的预后。作为另一实例,此类描述性标签可以提供对象的妊娠相关状态(例如,以天、周或月表示的估计胎龄)的相对评估。一些描述性标签可以映射到数值,例如,通过将“阳性”映射到1,将“阴性”映射到0。The trained algorithm may include a classifier so that each of one or more output values includes one of a fixed number of possible values (e.g., linear classifier, logistic regression classifier, etc.), indicating the classification of the acellular biological sample by the classifier. The trained algorithm may include a binary classifier so that each of one or more output values includes one of two values (e.g., {0,1}, {positive, negative} or {high risk, low risk}), indicating the classification of the acellular biological sample by the classifier. The trained algorithm may be another type of classifier so that each of one or more output values includes more than two values (e.g., {0,1,2}, {positive, negative or uncertain}, or {high risk, medium risk or low risk}), indicating the classification of the acellular biological sample by the classifier. The output value may include a descriptive label, a numerical value or a combination thereof. Some output values may include a descriptive label. Such a descriptive label may provide an identification or indication of a disease or condition state of an object, and may include, for example, positive, negative, high risk, medium risk, low risk or uncertainty. Such descriptive labels can provide identification of treatment for pregnancy-related states of the object, and can include, for example, therapeutic intervention, duration of therapeutic intervention and/or dosage of therapeutic intervention suitable for treating pregnancy-related conditions. Such descriptive labels can provide identification suitable for secondary clinical trials performed on the object, and can include, for example, imaging detection, blood test, computed tomography (CT) scan, magnetic resonance imaging (MRI) scan, ultrasound scan, chest X-ray, positron emission tomography (PET) scan, PET-CT scan, cell-free biological cytology, amniocentesis, non-invasive prenatal testing (NIPT) or any combination thereof. For example, such descriptive labels can provide the prognosis of pregnancy-related states of the object. As another example, such descriptive labels can provide a relative assessment of the pregnancy-related states of the object (e.g., estimated gestational age expressed in days, weeks or months). Some descriptive labels can be mapped to numerical values, for example, by mapping "positive" to 1 and "negative" to 0.

一些输出值可以包括数值,诸如二进制、整数或连续值。此类二进制输出值可以包括,例如,{0,1},{阳性,阴性}或{高风险,低风险}。此类整数输出值可以包括,例如,{0,1,2}。此类连续输出值可以包括,例如,至少为0且不超过1的概率值。此类连续输出值可以包括,例如,至少为0的非标准化概率值。此类连续的输出值可以指示对象的妊娠相关状态的预后。一些数值可以映射到描述性标签,例如,通过将1映射到“阳性”,将0映射到“阴性”。Some output values may include numerical values, such as binary, integer, or continuous values. Such binary output values may include, for example, {0, 1}, {positive, negative}, or {high risk, low risk}. Such integer output values may include, for example, {0, 1, 2}. Such continuous output values may include, for example, probability values that are at least 0 and not more than 1. Such continuous output values may include, for example, unstandardized probability values that are at least 0. Such continuous output values may indicate a prognosis of a pregnancy-related state of a subject. Some numerical values may be mapped to descriptive labels, for example, by mapping 1 to "positive" and 0 to "negative".

一些输出值可以基于一个或多个截止值进行分配。例如,如果样品表明对象具有妊娠相关状态(例如,妊娠相关并发症)的概率至少为50%,则样品的二进制分类可以分配“阳性”或1的输出值。例如,如果样品表明对象具有妊娠相关状态(例如,妊娠相关并发症)的概率低于50%,则样品的二进制分类可以分配“阴性”或0的输出值。在该情况下,使用单个截止值50%将样品分类为两个可能的二进制输出值中的一个。单个截止值的实例可以包括约1%、约2%、约5%、约10%、约15%、约20%、约25%、约30%、约35%、约40%、约45%、约50%、约55%、约60%、约65%、约70%、约75%、约80%、约85%、约90%、约91%、约92%、约93%、约94%、约95%、约96%、约97%、约98%和约99%。Some output values can be assigned based on one or more cutoff values. For example, if the sample indicates that the probability that the subject has a pregnancy-related condition (e.g., a pregnancy-related complication) is at least 50%, then the binary classification of the sample can assign an output value of "positive" or 1. For example, if the sample indicates that the probability that the subject has a pregnancy-related condition (e.g., a pregnancy-related complication) is less than 50%, then the binary classification of the sample can assign an output value of "negative" or 0. In this case, a single cutoff value of 50% is used to classify the sample into one of two possible binary output values. Examples of single cutoff values can include about 1%, about 2%, about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, and about 99%.

作为另一个实例,如果样品表明对象具有妊娠相关状态(例如,妊娠相关并发症)的概率为以下数值,则样品分类可以分配“阳性”或1的输出值:至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。如果样品表明对象具有妊娠相关状态(例如,妊娠相关并发症)的概率为以下数值,则样品分类可以分配“阳性”或1的输出值:大于约50%、大于约55%、大于约60%、大于约65%、大于约70%、大于约75%、大于约80%、大于约85%、大于约90%、大于约91%、大于约92%、大于约93%、大于约94%、大于约95%、大于约96%、大于约97%、大于约98%或大于约99%。As another example, a sample classification can be assigned an output value of "positive" or 1 if the sample indicates that the subject has a probability of having a pregnancy-related condition (e.g., a pregnancy-related complication) of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more. A sample classification can be assigned an output value of "positive" or 1 if the sample indicates that the subject has a probability of having a pregnancy-related condition (e.g., a pregnancy-related complication) of greater than about 50%, greater than about 55%, greater than about 60%, greater than about 65%, greater than about 70%, greater than about 75%, greater than about 80%, greater than about 85%, greater than about 90%, greater than about 91%, greater than about 92%, greater than about 93%, greater than about 94%, greater than about 95%, greater than about 96%, greater than about 97%, greater than about 98%, or greater than about 99%.

如果样品表明对象具有妊娠相关状态(例如,妊娠相关并发症)的概率为以下数值,则样品分类可以分配“阴性”或0的输出值:小于约50%、小于约45%、小于约40%、小于约35%、小于约30%、小于约25%、小于约20%、小于约15%、小于约10%、小于约9%、小于约8%、小于约7%、小于约6%、小于约5%、小于约4%、小于约3%、小于约2%或小于约1%。如果样品表明对象具有妊娠相关状态(例如,妊娠相关并发症)的概率为以下数值,则样品分类可以分配“阴性”或0的输出值:不大于约50%、不大于约45%、不大于约40%、不大于约35%、不大于约30%、不大于约25%、不大于约20%、不大于约15%、不大于约10%、不大于约9%、不大于约8%、不大于约7%、不大于约6%、不大于约5%、不大于约4%、不大于约3%、不大于约2%或不大于约1%。A sample classification can be assigned an output value of "negative" or 0 if the sample indicates that the subject has a pregnancy-related condition (e.g., a pregnancy-related complication) with a probability of less than about 50%, less than about 45%, less than about 40%, less than about 35%, less than about 30%, less than about 25%, less than about 20%, less than about 15%, less than about 10%, less than about 9%, less than about 8%, less than about 7%, less than about 6%, less than about 5%, less than about 4%, less than about 3%, less than about 2%, or less than about 1%. A sample classification can be assigned an output value of "negative" or 0 if the sample indicates that the subject has a pregnancy-related condition (e.g., a pregnancy-related complication) with a probability of no greater than about 50%, no greater than about 45%, no greater than about 40%, no greater than about 35%, no greater than about 30%, no greater than about 25%, no greater than about 20%, no greater than about 15%, no greater than about 10%, no greater than about 9%, no greater than about 8%, no greater than about 7%, no greater than about 6%, no greater than about 5%, no greater than about 4%, no greater than about 3%, no greater than about 2%, or no greater than about 1%.

如果样品未分类为“阳性”、“阴性”、1或0,则样品的分类可以分配“不确定”或2的输出值。在该情况下,使用两个截止值集将样品分类为三个可能的输出值中的一个。截止值集的实例可以包括{1%,99%}、{2%,98%}、{5%,95%}、{10%,90%}、{15%,85%}、{20%,80%}、{25%,75%}、{30%,70%}、{35%,65%}、{40%,60%}和{45%,55%}。类似地,可以使用n个截止值的集将样品分类为n+1个可能的输出值中的一个,其中n是任何正整数。If the sample is not classified as "positive", "negative", 1 or 0, the classification of the sample can be assigned an output value of "uncertain" or 2. In this case, two cutoff value sets are used to classify the sample into one of three possible output values. Examples of cutoff value sets can include {1%, 99%}, {2%, 98%}, {5%, 95%}, {10%, 90%}, {15%, 85%}, {20%, 80%}, {25%, 75%}, {30%, 70%}, {35%, 65%}, {40%, 60%} and {45%, 55%}. Similarly, a set of n cutoff values can be used to classify the sample into one of n+1 possible output values, where n is any positive integer.

经训练算法可以用多个独立训练样品进行训练。独立训练样品中的每一个可以包括来自对象的无细胞生物样品、通过测定无细胞生物样品获得的相关联数据集(如本文其他地方所述)以及对应于无细胞生物样品的一个或多个已知输出值(例如,对象的妊娠相关状态的临床诊断、预后、不存在或疗效)。独立训练样品可以包括获得自或来源于多个不同对象的无细胞生物样品和相关联数据集和输出。独立训练样品可以包括在多个不同时间点(例如,定期诸如每周、每两周或每月)获得自同一对象的无细胞生物样品和相关联数据集和输出。独立训练样品可以与妊娠相关状态的存在相关联(例如,训练样品包括获得自或来源于已知具有妊娠相关状态的多个对象的无细胞生物样品和相关联数据集和输出)。独立训练样品可以与妊娠相关状态的不存在相关联(例如,训练样品包括获得自或来源于已知先前没有妊娠相关状态诊断或已接收到妊娠相关状态阴性检测结果的多个对象的无细胞生物样品和相关联数据集和输出)。The trained algorithm can be trained with multiple independent training samples. Each of the independent training samples can include a cell-free biological sample from the object, an associated data set obtained by measuring the cell-free biological sample (as described elsewhere herein), and one or more known output values corresponding to the cell-free biological sample (e.g., a clinical diagnosis, prognosis, absence or efficacy of a pregnancy-related state of the object). Independent training samples can include cell-free biological samples and associated data sets and outputs obtained from or derived from multiple different objects. Independent training samples can include cell-free biological samples and associated data sets and outputs obtained from the same object at multiple different time points (e.g., regularly such as weekly, biweekly or monthly). Independent training samples can be associated with the presence of pregnancy-related states (e.g., training samples include cell-free biological samples and associated data sets and outputs obtained from or derived from multiple objects known to have pregnancy-related states). Independent training samples can be associated with the absence of pregnancy-related states (e.g., training samples include cell-free biological samples and associated data sets and outputs obtained from or derived from multiple objects known to have no pregnancy-related state diagnosis or have received pregnancy-related state negative test results).

经训练算法可以用至少约5个、至少约10个、至少约15个、至少约20个、至少约25个、至少约30个、至少约35个、至少约40个、至少约45个、至少约50个、至少约100个、至少约150个、至少约200个、至少约250个、至少约300个、至少约350个、至少约400个、至少约450个、或至少约500个独立训练样品进行训练。独立训练样品可以包括与妊娠相关状态的存在相关联的无细胞生物样品和/或与妊娠相关状态的不存在相关联的无细胞生物样品。经训练算法可以用以下数量的与妊娠相关状态的存在相关联的独立训练样品进行训练:不多于约500个、不多于约450个、不多于约400个、不多于约350个、不多于约300个、不多于约250个、不多于约200个、不多于约150个、不多于约100个或不多于约50个。在一些实施方案中,无细胞生物样品独立于用于训练经训练算法的样品。The trained algorithm can be trained with at least about 5, at least about 10, at least about 15, at least about 20, at least about 25, at least about 30, at least about 35, at least about 40, at least about 45, at least about 50, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, at least about 350, at least about 400, at least about 450, or at least about 500 independent training samples. The independent training samples can include cell-free biological samples associated with the presence of a pregnancy-associated state and/or cell-free biological samples associated with the absence of a pregnancy-associated state. The trained algorithm can be trained with no more than about 500, no more than about 450, no more than about 400, no more than about 350, no more than about 300, no more than about 250, no more than about 200, no more than about 150, no more than about 100, or no more than about 50 independent training samples associated with the presence of a pregnancy-related state. In some embodiments, the cell-free biological sample is independent of the sample used to train the trained algorithm.

经训练算法可以用与妊娠相关状态的存在相关联的第一数量的独立训练样品和与妊娠相关状态的不存在相关联的第二数量的独立训练样品进行训练。与妊娠相关状态的存在相关联的第一数量的独立训练样品可以不多于与妊娠相关状态的不存在相关联的第二数量的独立训练样品。与妊娠相关状态的存在相关联的第一数量的独立训练样品可以等于与妊娠相关状态的不存在相关联的第二数量的独立训练样品。与妊娠相关状态的存在相关联的第一数量的独立训练样品可以大于与妊娠相关状态的不存在相关联的第二数量的独立训练样品。The trained algorithm may be trained with a first number of independent training samples associated with the presence of a pregnancy-related state and a second number of independent training samples associated with the absence of the pregnancy-related state. The first number of independent training samples associated with the presence of the pregnancy-related state may be no more than the second number of independent training samples associated with the absence of the pregnancy-related state. The first number of independent training samples associated with the presence of the pregnancy-related state may be equal to the second number of independent training samples associated with the absence of the pregnancy-related state. The first number of independent training samples associated with the presence of the pregnancy-related state may be greater than the second number of independent training samples associated with the absence of the pregnancy-related state.

经训练算法可以配置为以至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高的准确度鉴定至少约5、至少约10、至少约15、至少约20、至少约25、至少约30、至少约35、至少约40、至少约45、至少约50、至少约100、至少约150、至少约200、至少约250、至少约300、至少约350、至少约400、至少约450或至少约500个独立训练样品的妊娠相关状态。通过经训练算法鉴定妊娠相关状态的准确度可以计算为独立检测样品(例如,已知具有妊娠相关状态的对象或妊娠相关状态为阴性临床试验结果的对象)的百分比,这些样品经正确鉴定或分类为具有或不具有妊娠相关状态。The trained algorithm can be configured to generate a training image with at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about The accuracy of identifying a pregnancy-associated state by the trained algorithm can be calculated as the percentage of independent test samples (e.g., subjects known to have a pregnancy-associated state or subjects with a negative clinical trial result for a pregnancy-associated state) that are correctly identified or classified as having or not having a pregnancy-associated state.

经训练算法可以被配置为以以下阳性预测值(PPV)鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。使用经训练算法鉴定妊娠相关状态的PPV可以计算为鉴定或分类为具有与真正具有妊娠相关状态的对象相对应的妊娠相关状态的无细胞生物样品的百分比。The trained algorithm can be configured to identify a pregnancy-related state with a positive predictive value (PPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more. The PPV for identifying a pregnancy-associated state using a trained algorithm can be calculated as the percentage of cell-free biological samples identified or classified as having a pregnancy-associated state that corresponds to subjects who actually have the pregnancy-associated state.

经训练算法可以被配置为以以下阴性预测值(NPV)鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。使用经训练算法鉴定妊娠相关状态的NPV可以计算为经鉴定或分类为不具有与真正不具有妊娠相关状态的对象相对应的妊娠相关状态的无细胞生物样品的百分比。The trained algorithm can be configured to identify a pregnancy-related state with a negative predictive value (NPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more. The NPV for identifying a pregnancy-associated state using a trained algorithm can be calculated as the percentage of cell-free biological samples identified or classified as not having the pregnancy-associated state corresponding to subjects who truly do not have the pregnancy-associated state.

经训练算法可以被配置为以以下临床灵敏度鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%、至少约99.1%、至少约99.2%、至少约99.3%、至少约99.4%、至少约99.5%、至少约99.6%、至少约99.7%、至少约99.8%、至少约99.9%、至少约99.99%、至少约99.999%或更高。使用经训练算法鉴定妊娠相关状态的临床灵敏度可以计算为与妊娠相关状态(例如,已知具有妊娠相关状态的对象)的存在相关联的独立检测样品的百分比,这些样品经正确鉴定或分类为具有妊娠相关状态。The trained algorithm can be configured to identify pregnancy-related states with a clinical sensitivity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, at least about 100%, at least about 101%, at least about 102%, at least about 103%, at least about 104%, at least about 105%, at least about 106%, at least about 107%, at least about 108%, at least about 109%, at least about 110%, at least about 111%, at least about 112%, at least about 113%, at least about 114%, at least about 115%, at least about 116%, at least about 117%, at least about 118%, at least about 119%, The clinical sensitivity of using the trained algorithm to identify a pregnancy-associated state can be calculated as the percentage of independently tested samples associated with the presence of a pregnancy-associated state (e.g., a subject known to have a pregnancy-associated state) that are correctly identified or classified as having the pregnancy-associated state.

经训练算法可以被配置为以以下临床特异性鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%、至少约99.1%、至少约99.2%、至少约99.3%、至少约99.4%、至少约99.5%、至少约99.6%、至少约99.7%、至少约99.8%、至少约99.9%、至少约99.99%、至少约99.999%或更高。使用经训练算法鉴定妊娠相关状态的临床特异性可以计算为与妊娠相关状态的不存在相关联的独立检测样品(例如,妊娠相关状态为阴性临床试验结果的对象)的百分比,这些样品经正确鉴定或分类为不具有妊娠相关状态。The trained algorithm can be configured to identify pregnancy-related states with a clinical specificity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, at least about 100%, at least about 101%, at least about 102%, at least about 103%, at least about 104%, at least about 105%, at least about 106%, at least about 107%, at least about 108%, at least about 109%, at least about 110%, at least about 111%, at least about 112%, at least about 113%, at least about 114%, at least about 115%, at least about 116%, at least about 117%, at least about 118%, at least about 119%, %, at least about 99%, at least about 99.1%, at least about 99.2%, at least about 99.3%, at least about 99.4%, at least about 99.5%, at least about 99.6%, at least about 99.7%, at least about 99.8%, at least about 99.9%, at least about 99.99%, at least about 99.999%, or more. The clinical specificity of identifying a pregnancy-associated state using a trained algorithm can be calculated as the percentage of independent test samples (e.g., subjects with a negative clinical trial result for a pregnancy-associated state) associated with the absence of a pregnancy-associated state that are correctly identified or classified as not having a pregnancy-associated state.

经训练算法被配置为以以下曲线下面积(AUC)鉴定妊娠相关状态:至少约0.50、至少约0.55、至少约0.60、至少约0.65、至少约0.70、至少约0.75、至少约0.80、至少约0.81、至少约0.82、至少约0.83、至少约0.84、至少约0.85、至少约0.86、至少约0.87、至少约0.88、至少约0.89、至少约0.90、至少约0.91、至少约0.92、至少约0.93、至少约0.94、至少约0.95、至少约0.96、至少约0.97、至少约0.98、至少约0.99或更大。AUC可以计算为与经训练算法相关联的受试者工作特征(ROC)曲线(例如,ROC曲线下的面积)的积分,该算法将无细胞生物样品分类为具有或不具有妊娠相关状态。The trained algorithm is configured to identify a pregnancy-related state with an area under the curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.81, at least about 0.82, at least about 0.83, at least about 0.84, at least about 0.85, at least about 0.86, at least about 0.87, at least about 0.88, at least about 0.89, at least about 0.90, at least about 0.91, at least about 0.92, at least about 0.93, at least about 0.94, at least about 0.95, at least about 0.96, at least about 0.97, at least about 0.98, at least about 0.99, or more. The AUC can be calculated as the integral of a receiver operating characteristic (ROC) curve (eg, the area under the ROC curve) associated with a trained algorithm that classifies a cell-free biological sample as having or not having a pregnancy-related state.

可以调整或调节经训练算法,以改善鉴定妊娠相关状态的性能、准确度、PPV、NPV、临床灵敏度、临床特异性或AUC中的一个或多个。可以通过调整经训练算法的参数(例如,用于对本文其他地方描述的无细胞生物样品进行分类的截止值集,或神经网络的权重)调整或调节经训练算法。可以在训练过程中或在训练过程完成后连续调整或调节经训练算法。The trained algorithm can be adjusted or tuned to improve one or more of performance, accuracy, PPV, NPV, clinical sensitivity, clinical specificity, or AUC in identifying pregnancy-related states. The trained algorithm can be adjusted or tuned by adjusting parameters of the trained algorithm (e.g., a set of cutoff values for classifying acellular biological samples described elsewhere herein, or weights of a neural network). The trained algorithm can be adjusted or tuned continuously during the training process or after the training process is completed.

在最初训练经训练算法后,可以将输入的子集鉴定为最有影响力或最重要的,以被包括在内用于进行高质量的分类。例如,多个妊娠相关状态相关联基因组位点的子集可以被鉴定为最有影响力或最重要的,以被包括在内,用于对妊娠相关状态(或妊娠相关状态的亚型)进行高质量分类或鉴定。多个妊娠相关状态相关联基因组位点或其子集可以基于分类指标进行排序,这些指标指示每个基因组位点对妊娠相关状态(或妊娠相关状态的亚型)进行高质量分类或鉴定的影响力或重要性。在某些情况下,此类指标可以用于显著减少可以用于将经训练算法训练到所需性能水平的输入变量(例如,预测因子变量)的数量(例如,基于所需的最小精度、PPV、NPV、临床灵敏度、临床特异性、AUC或其组合)。例如,如果利用包括经训练算法中几十个或几百个输入变量的多个变量训练经训练算法导致分类准确度大于99%,则仅用以下数量的选定子集训练经训练算法:不多于约5、不多于约10、不多于15、不多于20、不多于25、不多于30、不多于35、不多于40、不多于45、不多于50或不多于约100个,该多个输入变量中的此类最有影响力或最重要的输入变量可以产生降低的但仍可接受的分类准确度(例如,至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、或至少约99%)。可以通过对整个多个输入变量进行排序并选择预定数量(例如,不多于约5、不多于约10、不多于约15、不多于约20、不多于约25、不多于约30、不多于约35、不多于约40、不多于约45、不多于约50或不多于约100个)的具有最佳分类指标的输入变量来选择子集。After initially training the trained algorithm, a subset of the inputs can be identified as the most influential or important to be included for high-quality classification. For example, a subset of multiple pregnancy-related state-associated genomic sites can be identified as the most influential or important to be included for high-quality classification or identification of pregnancy-related states (or subtypes of pregnancy-related states). Multiple pregnancy-related state-associated genomic sites or subsets thereof can be ranked based on classification metrics that indicate the influence or importance of each genomic site for high-quality classification or identification of pregnancy-related states (or subtypes of pregnancy-related states). In some cases, such metrics can be used to significantly reduce the number of input variables (e.g., predictor variables) that can be used to train the trained algorithm to a desired performance level (e.g., based on a desired minimum accuracy, PPV, NPV, clinical sensitivity, clinical specificity, AUC, or a combination thereof). For example, if training a trained algorithm with a plurality of variables including dozens or hundreds of input variables in the trained algorithm results in a classification accuracy greater than 99%, then training the trained algorithm with only a selected subset of no more than about 5, no more than about 10, no more than 15, no more than 20, no more than 25, no more than 30, no more than 35, no more than 40, no more than 45, no more than 50, or no more than about 100 of such most influential or important input variables of the plurality of input variables may produce a reduced but still acceptable classification accuracy (e.g., For example, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%). The subset can be selected by sorting the entire plurality of input variables and selecting a predetermined number (e.g., no more than about 5, no more than about 10, no more than about 15, no more than about 20, no more than about 25, no more than about 30, no more than about 35, no more than about 40, no more than about 45, no more than about 50, or no more than about 100) of input variables with the best classification indicators.

鉴定或监测妊娠相关状态Identify or monitor pregnancy-related conditions

在使用经训练算法处理数据集后,可以在对象中鉴定或监测妊娠相关状态或与妊娠相关并发症。鉴定可以至少部分基于处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,在妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据。After processing the dataset using the trained algorithm, a pregnancy-associated state or pregnancy-associated complication can be identified or monitored in a subject. The identification can be based at least in part on quantitative measures of sequence reads of a dataset of grouped genomic sites associated with a pregnancy-associated state (e.g., quantitative measures of RNA transcripts or DNA at genomic sites associated with a pregnancy-associated state), proteomic data including quantitative measures of proteins of a dataset of grouped proteins associated with a pregnancy-associated state, and/or metabolomic data including quantitative measures of grouped metabolites associated with a pregnancy-associated state.

可以在对象中以以下准确度鉴定妊娠相关状态:至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。通过经训练算法鉴定妊娠相关状态的准确度可以计算为独立检测样品(例如,已知具有妊娠相关状态的对象或妊娠相关状态为阴性临床试验结果的对象)的百分比,这些样品经正确鉴定或分类为具有或不具有妊娠相关状态。Pregnancy-associated states can be identified in subjects with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or more. The accuracy of identifying pregnancy-associated states by a trained algorithm can be calculated as the percentage of independently tested samples (e.g., subjects known to have a pregnancy-associated state or subjects with a negative clinical trial result for a pregnancy-associated state) that are correctly identified or classified as having or not having a pregnancy-associated state.

可以在对象中以以下阳性预测值(PPV)鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。使用经训练算法鉴定妊娠相关状态的PPV可以计算为鉴定或分类为具有与真正具有妊娠相关状态的对象相对应的妊娠相关状态的无细胞生物样品的百分比。A pregnancy-related state can be identified in a subject with a positive predictive value (PPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more. The PPV for identifying a pregnancy-associated state using a trained algorithm can be calculated as the percentage of cell-free biological samples identified or classified as having a pregnancy-associated state that corresponds to subjects who actually have the pregnancy-associated state.

可以在对象中以以下阴性预测值(NPV)鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。使用经训练算法鉴定妊娠相关状态的NPV可以计算为经鉴定或分类为不具有与真正不具有妊娠相关状态的对象相对应的妊娠相关状态的无细胞生物样品的百分比。A pregnancy-related state can be identified in a subject with a negative predictive value (NPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more. The NPV for identifying a pregnancy-associated state using a trained algorithm can be calculated as the percentage of cell-free biological samples identified or classified as not having the pregnancy-associated state corresponding to subjects who truly do not have the pregnancy-associated state.

可以在对象中以以下临床灵敏度鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%、至少约99.1%、至少约99.2%、至少约99.3%、至少约99.4%、至少约99.5%、至少约99.6%、至少约99.7%、至少约99.8%、至少约99.9%、至少约99.99%、至少约99.999%或更高。使用经训练算法鉴定妊娠相关状态的临床灵敏度可以计算为与妊娠相关状态(例如,已知具有妊娠相关状态的对象)的存在相关联的独立检测样品的百分比,这些样品经正确鉴定或分类为具有妊娠相关状态。A pregnancy-related state can be identified in a subject with a clinical sensitivity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%. %, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, at least about 99.1%, at least about 99.2%, at least about 99.3%, at least about 99.4%, at least about 99.5%, at least about 99.6%, at least about 99.7%, at least about 99.8%, at least about 99.9%, at least about 99.99%, at least about 99.999%, or more. The clinical sensitivity of using the trained algorithm to identify a pregnancy-associated state can be calculated as the percentage of independently tested samples associated with the presence of a pregnancy-associated state (e.g., a subject known to have a pregnancy-associated state) that are correctly identified or classified as having the pregnancy-associated state.

可以在对象中以以下临床特异性鉴定妊娠相关状态:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%、至少约99.1%、至少约99.2%、至少约99.3%、至少约99.4%、至少约99.5%、至少约99.6%、至少约99.7%、至少约99.8%、至少约99.9%、至少约99.99%、至少约99.999%或更高。使用经训练算法鉴定妊娠相关状态的临床特异性可以计算为与妊娠相关状态的不存在相关联的独立检测样品(例如,妊娠相关状态为阴性临床试验结果的对象)的百分比,这些样品经正确鉴定或分类为不具有妊娠相关状态。A pregnancy-related state can be identified in a subject with a clinical specificity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%. %, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, at least about 99.1%, at least about 99.2%, at least about 99.3%, at least about 99.4%, at least about 99.5%, at least about 99.6%, at least about 99.7%, at least about 99.8%, at least about 99.9%, at least about 99.99%, at least about 99.999%, or more. The clinical specificity of identifying a pregnancy-associated state using a trained algorithm can be calculated as the percentage of independent test samples associated with the absence of a pregnancy-associated state (e.g., subjects with a negative clinical trial result for a pregnancy-associated state) that are correctly identified or classified as not having a pregnancy-associated state.

在一个方面,本公开提供了一种用于确定对象处于早产风险中的方法,包括测定来源于对象的无细胞生物样品,从而以至少80%的特异性生成指示早产风险的数据集,以及使用在独立于无细胞生物样品的样品上训练的经训练算法,从而以以下准确度确定对象处于早产风险:至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更高。In one aspect, the present disclosure provides a method for determining that a subject is at risk for preterm birth, comprising assaying a cell-free biological sample derived from the subject to generate a data set indicative of risk for preterm birth with a specificity of at least 80%, and using a trained algorithm trained on a sample independent of the cell-free biological sample to determine that the subject is at risk for preterm birth with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% or more.

在对象中鉴定妊娠相关状态之后,可以进一步鉴定妊娠相关状态的亚型(例如,选自妊娠相关状态的多个亚型)。可以至少部分基于以下确定妊娠相关状态的亚型:处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,在妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据。例如,对象可以鉴定为处于早产亚型的风险中(例如,选自多个早产亚型)。在将对象鉴定为处于早产亚型的风险中之后,可以至少部分基于对象被鉴定为所处的早产亚型风险来选择对对象的临床干预。在一些实施方案中,临床干预选自多种临床干预(例如,临床上指征用于早产的不同亚型的)。After identifying the pregnancy-related state in the object, the subtype of the pregnancy-related state can be further identified (e.g., selected from a plurality of subtypes of the pregnancy-related state). The subtype of the pregnancy-related state can be determined at least in part based on the following: the quantitative measurement of the sequence readings of the data set of the grouping of the pregnancy-related state associated genomic sites (e.g., the quantitative measurement of the RNA transcript or DNA at the pregnancy-related state associated genomic sites), the proteomic data including the quantitative measurement of the protein of the data set of the grouping of the pregnancy-related state associated protein, and/or the metabolomic data including the quantitative measurement of the grouping of the pregnancy-related state associated metabolites. For example, the object can be identified as being at risk of a preterm birth subtype (e.g., selected from a plurality of preterm birth subtypes). After the object is identified as being at risk of a preterm birth subtype, the clinical intervention of the object can be selected based at least in part on the risk of the preterm birth subtype in which the object is identified. In some embodiments, the clinical intervention is selected from a variety of clinical interventions (e.g., clinically indicated for different subtypes of preterm birth).

在一些实施方案中,经训练算法可以确定对象处于以下的早产风险:至少约5%、至少约10%、至少约15%、至少约20%、至少约25%、至少约30%、至少约35%、至少约40%、至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更大。In some embodiments, the trained algorithm can determine that a subject is at a risk of preterm birth of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more.

经训练算法可以以下准确度确定对象处于早产风险:至少约50%、至少约55%、至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约81%、至少约82%、至少约83%、至少约84%、至少约85%、至少约86%、至少约87%、至少约88%、至少约89%、至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%、至少约99.1%、至少约99.2%、至少约99.3%、至少约99.4%、至少约99.5%、至少约99.6%、至少约99.7%、至少约99.8%、至少约99.9%、至少约99.99%、至少约99.999%或更高。The trained algorithm can determine that a subject is at risk for preterm birth with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%. , at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, at least about 99.1%, at least about 99.2%, at least about 99.3%, at least about 99.4%, at least about 99.5%, at least about 99.6%, at least about 99.7%, at least about 99.8%, at least about 99.9%, at least about 99.99%, at least about 99.999%, or more.

在鉴定对象具有妊娠相关状态时,可任选地向对象提供治疗性干预(例如,开具适当的治疗方案以治疗对象的妊娠相关状态)。治疗性干预可以包括有效剂量的药物处方、妊娠相关状态的进一步检测或评估、妊娠相关状态的进一步监测、引产或抑制分娩,或其组合。如果对象目前正在接受一个疗程的妊娠相关状态的治疗,则治疗性干预可包括随后的不同疗程(例如,由于当前疗程的无效而增加疗效)。When the subject is identified as having a pregnancy-related condition, therapeutic intervention may be optionally provided to the subject (e.g., an appropriate treatment regimen is prescribed to treat the subject's pregnancy-related condition). Therapeutic intervention may include prescription of an effective dose of a drug, further testing or evaluation of the pregnancy-related condition, further monitoring of the pregnancy-related condition, induction of labor or inhibition of labor, or a combination thereof. If the subject is currently receiving a course of treatment for a pregnancy-related condition, therapeutic intervention may include a subsequent different course of treatment (e.g., to increase efficacy due to the ineffectiveness of the current course of treatment).

治疗性干预可包括推荐对象进行二次临床试验,以确认妊娠相关状态的诊断。该二级临床试验可包括成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。The therapeutic intervention may include recommending that the subject undergo a secondary clinical trial to confirm the diagnosis of the pregnancy-related condition. The secondary clinical trial may include imaging tests, blood tests, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, ultrasound scans, chest x-rays, positron emission tomography (PET) scans, PET-CT scans, cell-free biocytology, amniocentesis, non-invasive prenatal testing (NIPT), or any combination thereof.

可在一段时间内评估以下项:妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,处于妊娠相关状态相关联基因组位点的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据,以监测患者(例如,具有妊娠相关状态或正在接受妊娠相关状态治疗的对象)。在此类情况下,患者数据集的定量量度在治疗过程中可以改变。例如,由于有效治疗而具有降低的妊娠相关状态风险的患者的数据集的定量量度可以向健康对象(例如,没有妊娠相关并发症的对象)的概况或分布转移。相反,例如,由于无效治疗而具有增加的妊娠相关状态风险的患者的数据集的定量量度可以向具有更高的妊娠相关状态风险或更晚期的妊娠相关状态的对象的概况或分布转移。The following items can be evaluated over a period of time: quantitative measures of sequence reads of a data set of grouped genomic sites associated with pregnancy-related states (e.g., quantitative measures of RNA transcripts or DNA at genomic sites associated with pregnancy-related states), proteomic data including quantitative measures of proteins in a data set of grouped proteins associated with pregnancy-related states, and/or metabolomic data including quantitative measures of grouped metabolites associated with pregnancy-related states, to monitor patients (e.g., subjects with pregnancy-related states or being treated for pregnancy-related states). In such cases, the quantitative measures of the patient data set can change during treatment. For example, the quantitative measures of the data set of patients with a reduced risk of pregnancy-related states due to effective treatment can be shifted toward the profile or distribution of healthy subjects (e.g., subjects without pregnancy-related complications). Conversely, for example, the quantitative measures of the data set of patients with an increased risk of pregnancy-related states due to ineffective treatment can be shifted toward the profile or distribution of subjects with a higher risk of pregnancy-related states or a more advanced pregnancy-related state.

可以通过监测用于治疗对象的妊娠相关状态的疗程来监测对象的妊娠相关状态。监测可包括在两个或多个时间点评估对象的妊娠相关状态。评估可以至少基于处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,在妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括处于妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据。The pregnancy-related state of the subject can be monitored by monitoring the course of treatment for the pregnancy-related state of the subject. Monitoring can include evaluating the pregnancy-related state of the subject at two or more time points. The evaluation can be based at least on quantitative measurements of sequence reads of a data set of grouped genomic sites associated with the pregnancy-related state (e.g., quantitative measurements of RNA transcripts or DNA at genomic sites associated with the pregnancy-related state), proteomic data including quantitative measurements of proteins of a data set of grouped proteins associated with the pregnancy-related state, and/or metabolomic data including quantitative measurements of grouped metabolites associated with the pregnancy-related state.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示一个或多个临床指征,诸如(i)对象的妊娠相关状态的诊断,(ii)对象的妊娠相关状态的预后,(iii)对象的妊娠相关状态的风险增加,(iv)对象的妊娠相关状态的风险降低,(v)用于治疗对象的妊娠相关状态的疗程的有效性,以及(vi)用于治疗对象的妊娠相关状态的疗程的无效性。In some embodiments, a quantitative measure of sequence reads in a data set of grouped genomic loci associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic loci associated with a pregnancy-associated state), proteomic data comprising quantitative measures of proteins in a data set of grouped proteins associated with a pregnancy-associated state, and/or a difference in metabolomic data comprising quantitative measures of grouped metabolites associated with a pregnancy-associated state determined between two or more time points can indicate one or more clinical indications, such as (i) diagnosis of a pregnancy-associated state in a subject, (ii) prognosis of a pregnancy-associated state in a subject, (iii) increased risk of a pregnancy-associated state in a subject, (iv) decreased risk of a pregnancy-associated state in a subject, (v) effectiveness of a course of treatment for treating a pregnancy-associated state in a subject, and (vi) ineffectiveness of a course of treatment for treating a pregnancy-associated state in a subject.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示对象的妊娠相关状态的诊断。例如,如果在较早的时间点未在对象中检测到妊娠相关状态,但在较晚的时间点在对象中检测到,则差异指示对对象的妊娠相关状态的诊断。可以基于对象的妊娠相关状态的诊断这一指示做出临床行动或决定,例如,为对象开具新的治疗性干预处方。临床行动或决定可包括推荐对象进行二次临床试验,以确认妊娠相关状态的诊断。该二级临床试验可包括成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。In some embodiments, a quantitative measure of sequence reads of a data set of grouped genomic sites associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic site associated with a pregnancy-associated state), a proteomic data comprising a quantitative measure of proteins of a data set of grouped proteins associated with a pregnancy-associated state, and/or a difference in metabolomic data comprising a quantitative measure of grouped metabolites associated with a pregnancy-associated state determined between two or more time points can indicate a diagnosis of a pregnancy-associated state in a subject. For example, if a pregnancy-associated state is not detected in a subject at an earlier time point, but is detected in a subject at a later time point, the difference indicates a diagnosis of a pregnancy-associated state in the subject. A clinical action or decision can be made based on this indication of a diagnosis of a pregnancy-associated state in a subject, for example, prescribing a new therapeutic intervention for a subject. A clinical action or decision can include recommending a subject to undergo a secondary clinical trial to confirm the diagnosis of a pregnancy-associated state. The secondary clinical trial may include imaging tests, blood tests, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, ultrasound scans, chest X-rays, positron emission tomography (PET) scans, PET-CT scans, cell-free biocytology, amniocentesis, non-invasive prenatal testing (NIPT), or any combination thereof.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示对象的妊娠相关状态的预后。In some embodiments, a quantitative measure of sequence reads in a data set of grouped genomic loci associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a pregnancy-associated state genomic loci), a proteomic data comprising quantitative measures of proteins in a data set of grouped proteins associated with a pregnancy-associated state, and/or a difference in metabolomic data comprising quantitative measures of grouped metabolites associated with a pregnancy-associated state determined between two or more time points can indicate a prognosis for a pregnancy-associated state in a subject.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示对象具有增加的妊娠相关状态的风险。例如,如果在较早的时间点和较晚的时间点在对象中检测到妊娠相关状态,并且差异为负差异(例如,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据从较早时间点到较晚时间点增加),则该差异可以指示对象具有增加的妊娠相关状态的风险。可以基于妊娠相关状态风险增加的这一指示做出临床行动或决定,例如,为对象开具新的治疗性干预或切换治疗性干预处方(例如,结束当前的治疗并开具新的治疗处方)。临床行动或决定可包括推荐对象进行二次临床试验,以确认妊娠相关状态的风险增加。该二级临床试验可包括成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。In some embodiments, a difference in a quantitative measure of sequence reads of a data set of grouped genomic sites associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic site associated with a pregnancy-associated state), a proteomic data comprising a quantitative measure of a protein of a data set of grouped proteins associated with a pregnancy-associated state, and/or a metabolomic data comprising a quantitative measure of a grouped metabolite associated with a pregnancy-associated state determined between two or more time points can indicate that the subject has an increased risk of a pregnancy-associated state. For example, if a pregnancy-associated state is detected in the subject at an earlier time point and a later time point, and the difference is a negative difference (e.g., a quantitative measure of sequence reads of a data set of grouped genomic sites associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic site associated with a pregnancy-associated state), a proteomic data comprising a quantitative measure of a protein of a data set of grouped proteins associated with a pregnancy-associated state, and/or a metabolomic data comprising a quantitative measure of a grouped metabolite associated with a pregnancy-associated state increases from an earlier time point to a later time point), then the difference can indicate that the subject has an increased risk of a pregnancy-associated state. A clinical action or decision can be made based on this indication of an increased risk of a pregnancy-related condition, such as prescribing a new therapeutic intervention or switching a therapeutic intervention prescription for the subject (e.g., ending the current treatment and prescribing a new treatment prescription). The clinical action or decision may include recommending that the subject undergo a secondary clinical trial to confirm the increased risk of a pregnancy-related condition. The secondary clinical trial may include imaging tests, blood tests, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, ultrasound scans, chest x-rays, positron emission tomography (PET) scans, PET-CT scans, cell-free biocytology, amniocentesis, non-invasive prenatal testing (NIPT), or any combination thereof.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示对象具有降低的妊娠相关状态的风险。例如,如果在较早的时间点和较晚的时间点在对象中检测到妊娠相关状态,并且差异为正差异(例如,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据从较早时间点到较晚时间点降低),则该差异可以指示对象具有降低的妊娠相关状态的风险。可以基于对象的妊娠相关状态的风险降低的这一指示做出临床行动或决定(例如,继续或结束当前的治疗性干预)。临床行动或决定可包括推荐对象进行二次临床试验,以确认妊娠相关状态的风险降低。该二级临床试验可包括成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。In some embodiments, a difference in a quantitative measure of sequence reads of a data set of grouped genomic sites associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic site associated with a pregnancy-associated state), a proteomic data comprising a quantitative measure of a protein of a data set of grouped proteins associated with a pregnancy-associated state, and/or a metabolomic data comprising a quantitative measure of a grouped metabolite associated with a pregnancy-associated state determined between two or more time points can indicate that the subject has a reduced risk of a pregnancy-associated state. For example, if a pregnancy-associated state is detected in the subject at an earlier time point and a later time point, and the difference is a positive difference (e.g., a quantitative measure of sequence reads of a data set of grouped genomic sites associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic site associated with a pregnancy-associated state), a proteomic data comprising a quantitative measure of a protein of a data set of grouped proteins associated with a pregnancy-associated state, and/or a metabolomic data comprising a quantitative measure of a grouped metabolite associated with a pregnancy-associated state decreases from an earlier time point to a later time point), then the difference can indicate that the subject has a reduced risk of a pregnancy-associated state. A clinical action or decision (e.g., to continue or terminate a current therapeutic intervention) can be made based on this indication of a reduced risk of a pregnancy-related condition in the subject. The clinical action or decision can include recommending that the subject undergo a secondary clinical trial to confirm the reduced risk of a pregnancy-related condition. The secondary clinical trial can include imaging tests, blood tests, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, ultrasound scans, chest x-rays, positron emission tomography (PET) scans, PET-CT scans, cell-free biocytology, amniocentesis, non-invasive prenatal testing (NIPT), or any combination thereof.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示用于治疗对象的妊娠相关状态的疗程的有效性。例如,如果在较早的时间点在对象中检测到妊娠相关状态,但在较晚的时间点在对象中未检测到,则该差异可以指示该疗程对治疗对象的妊娠相关状态的有效性。可以基于该疗程对治疗对象的妊娠相关状态的有效性的这一指示做出临床行动或决定,例如,继续或结束对对象的当前治疗性干预。临床行动或决定可包括推荐对象进行二次临床试验,以确认该疗程对治疗妊娠相关状态的有效性。该二级临床试验可包括成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。In some embodiments, a quantitative measure of sequence reads in a data set of grouped genomic sites associated with a pregnancy-associated state (e.g., a quantitative measure of RNA transcripts or DNA at a genomic site associated with a pregnancy-associated state), a proteomic data comprising a quantitative measure of proteins in a data set of grouped proteins associated with a pregnancy-associated state, and/or a difference in metabolomic data comprising a quantitative measure of grouped metabolites associated with a pregnancy-associated state determined between two or more time points can indicate the effectiveness of a course of treatment for treating a pregnancy-associated state in a subject. For example, if a pregnancy-associated state is detected in a subject at an earlier time point, but not detected in a subject at a later time point, the difference can indicate the effectiveness of the course of treatment for treating the pregnancy-associated state in the subject. A clinical action or decision can be made based on this indication of the effectiveness of the course of treatment for treating the pregnancy-associated state in a subject, for example, to continue or terminate a current therapeutic intervention for the subject. The clinical action or decision can include recommending that the subject undergo a secondary clinical trial to confirm the effectiveness of the course of treatment for treating the pregnancy-associated state. The secondary clinical trial may include imaging tests, blood tests, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, ultrasound scans, chest X-rays, positron emission tomography (PET) scans, PET-CT scans, cell-free biocytology, amniocentesis, non-invasive prenatal testing (NIPT), or any combination thereof.

在一些实施方案中,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括在两个或多个时间点之间确定的妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据的差异可以指示用于治疗对象的妊娠相关状态的疗程无效。例如,如果在较早的时间点和较晚的时间点在对象中检测到妊娠相关状态,并且如果该差异为负差异或零差异(例如,处于妊娠相关状态相关联基因组位点的分组的数据集的序列读数的定量量度(例如,妊娠相关状态相关联基因组位点处的RNA转录物或DNA的定量量度)、包括处于妊娠相关状态相关联蛋白质的分组的数据集的蛋白质的定量量度的蛋白质组学数据,和/或包括妊娠相关状态相关联代谢物的分组的定量量度的代谢组数据从较早时间点到较晚时间点增加或保持在恒定水平),并且如果在较早的时间点指示有效的治疗,则该差异可以指示该疗程对治疗对象的妊娠相关状态的无效性。可以基于该疗程对治疗对象的妊娠相关状态的无效性的这一指示做出临床行动或决定,例如,结束当前的治疗性干预和/或切换到(例如,开具处方)用于对象的不同的新治疗性干预。临床行动或决定可包括推荐对象进行二次临床试验,以确认该疗程对治疗妊娠相关状态的无效性。该二级临床试验可包括成像检测、血液测试、计算机断层(CT)扫描、磁共振成像(MRI)扫描、超声扫描、胸部X射线、正电子发射断层(PET)扫描、PET-CT扫描、无细胞生物细胞学、羊膜腔穿刺术、无创产前检查(NIPT)或其任何组合。In some embodiments, a difference in quantitative measures of sequence reads in a data set of grouped genomic loci associated with a pregnancy-associated state (e.g., quantitative measures of RNA transcripts or DNA at genomic loci associated with a pregnancy-associated state), proteomic data comprising quantitative measures of proteins in a data set of grouped proteins associated with a pregnancy-associated state, and/or metabolomic data comprising quantitative measures of grouped metabolites associated with a pregnancy-associated state determined between two or more time points may indicate that a course of therapy for treating a subject's pregnancy-associated condition is ineffective. For example, if a pregnancy-associated state is detected in a subject at an earlier time point and a later time point, and if the difference is a negative difference or a zero difference (e.g., a quantitative measure of sequence reads (e.g., a quantitative measure of RNA transcripts or DNA at a pregnancy-associated state-associated genomic site) of a data set of grouped genomic sites associated with a pregnancy-associated state, a proteomic data comprising quantitative measures of proteins of a data set of grouped proteins associated with a pregnancy-associated state, and/or a metabolomic data comprising quantitative measures of grouped metabolites associated with a pregnancy-associated state increases or remains at a constant level from the earlier time point to the later time point), and if an effective treatment was indicated at the earlier time point, then the difference can indicate ineffectiveness of the course of treatment for treating the pregnancy-associated state in the subject. A clinical action or decision can be made based on this indication of ineffectiveness of the course of treatment for treating the pregnancy-associated state in the subject, such as, for example, ending the current therapeutic intervention and/or switching to (e.g., prescribing) a different new therapeutic intervention for the subject. The clinical action or decision can include recommending that the subject undergo a secondary clinical trial to confirm the ineffectiveness of the course of treatment for treating the pregnancy-associated state. The secondary clinical trial may include imaging tests, blood tests, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, ultrasound scans, chest X-rays, positron emission tomography (PET) scans, PET-CT scans, cell-free biocytology, amniocentesis, non-invasive prenatal testing (NIPT), or any combination thereof.

在另一方面,本公开提供了一种用于预测对象的早产风险的计算机实施的方法,包括:(a)接收对象的临床健康数据,其中该临床健康数据包括该对象的多个定量或分类量度;(b)使用经训练算法处理对象的临床健康数据,以确定指示对象的早产风险的风险评分;以及(c)以电子方式输出指示对象的早产风险的风险评分的报告。In another aspect, the present disclosure provides a computer-implemented method for predicting a subject's risk of preterm birth, comprising: (a) receiving clinical health data of the subject, wherein the clinical health data comprises a plurality of quantitative or categorical measurements of the subject; (b) processing the subject's clinical health data using a trained algorithm to determine a risk score indicating the subject's risk of preterm birth; and (c) electronically outputting a report of the risk score indicating the subject's risk of preterm birth.

在一些实施方案中,例如,临床健康数据包括对象的一个或多个定量量度,诸如年龄、体重、身高、体重指数(BMI)、血压、心率、血糖水平、既往妊娠次数和既往生育次数。作为另一个实例,临床健康数据可以包括一个或多个分类量度,诸如人种、种族、药物或其他临床治疗史、吸烟史、饮酒史、日常活动或健康水平、基因检测结果、血液测试结果、成像结果和胎儿筛查结果。In some embodiments, for example, clinical health data includes one or more quantitative measurements of a subject, such as age, weight, height, body mass index (BMI), blood pressure, heart rate, blood glucose level, number of previous pregnancies, and number of previous births. As another example, clinical health data may include one or more categorical measurements, such as race, ethnicity, medication or other clinical treatment history, smoking history, drinking history, daily activities or fitness levels, genetic test results, blood test results, imaging results, and fetal screening results.

在一些实施方案中,使用计算机或移动设备应用程序执行用于预测对象的早产风险的计算机实施的方法。例如,对象可以使用计算机或移动设备应用程序输入自己的临床健康数据,包括定量和/或分类量度。然后,计算机或移动设备应用程序可以使用经训练算法来处理临床健康数据,以确定指示对象的早产风险的风险评分。然后,计算机或移动设备应用程序可以显示指示对象的早产风险的风险评分的报告。In some embodiments, a computer or mobile device application is used to perform a computer-implemented method for predicting the risk of premature birth of an object. For example, an object can use a computer or mobile device application to input its own clinical health data, including quantitative and/or categorical measurements. Then, the computer or mobile device application can use a trained algorithm to process the clinical health data to determine a risk score for the risk of premature birth of an object. Then, the computer or mobile device application can display a report on the risk score for the risk of premature birth of an object.

在一些实施方案中,指示对象的早产风险的风险评分可以通过对对象执行一次或多次后续临床测试来细化。例如,医生可以基于初始风险评分将对象转诊进行一项或多项后续临床测试(例如,超声成像或血液测试)。接下来,计算机或移动设备应用程序可以使用经训练算法处理来自一个或多个后续临床测试的结果,以确定指示对象的早产风险的更新风险评分。In some embodiments, the risk score indicating the risk of premature birth for the subject can be refined by performing one or more subsequent clinical tests on the subject. For example, a physician can refer the subject for one or more subsequent clinical tests (e.g., ultrasound imaging or blood tests) based on the initial risk score. Next, the computer or mobile device application can process the results from the one or more subsequent clinical tests using the trained algorithm to determine an updated risk score indicating the risk of premature birth for the subject.

在一些实施方案中,风险评分包括对象在预定持续时间内早产的似然性。例如,预定持续时间可以为约1小时、约2小时、约4小时、约6小时、约8小时、约10小时、约12小时、约14小时、约16小时、约18小时、约20小时、约22小时、约24小时、约1.5天、约2天、约2.5天、约3天、约3.5天、约4天、约4.5天、约5天、约5.5天、约6天、约6.5天、约7天、约8天、约9天、约10天、约12天、约14天、约3周、约4周、约5周、约6周、约7周、约8周、约9周、约10周、约11周、约12周、约13周或大于约13周。In some embodiments, the risk score includes the likelihood of a subject having a preterm birth within a predetermined duration. For example, the predetermined duration can be about 1 hour, about 2 hours, about 4 hours, about 6 hours, about 8 hours, about 10 hours, about 12 hours, about 14 hours, about 16 hours, about 18 hours, about 20 hours, about 22 hours, about 24 hours, about 1.5 days, about 2 days, about 2.5 days, about 3 days, about 3.5 days, about 4 days, about 4.5 days, about 5 days, about 5.5 days, about 6 days, about 6.5 days, about 7 days, about 8 days, about 9 days, about 10 days, about 12 days, about 14 days, about 3 weeks, about 4 weeks, about 5 weeks, about 6 weeks, about 7 weeks, about 8 weeks, about 9 weeks, about 10 weeks, about 11 weeks, about 12 weeks, about 13 weeks, or more than about 13 weeks.

输出妊娠相关状态的报告Output reports on pregnancy-related status

在鉴定对象的妊娠相关状态或监测妊娠相关状态的风险增加之后,可以以电子方式输出指示(例如,鉴定或提供指示)对象的妊娠相关状态的报告。对象可以没有表现出妊娠相关状态(例如,没有妊娠相关状态的症状,诸如妊娠相关并发症)。报告可以呈现在用户的电子设备的图形用户界面(GUI)上。用户可以是对象、护理人员、医生、护士或其他卫生保健工作者。After identifying the pregnancy-related state of the subject or monitoring the increased risk of the pregnancy-related state, a report indicating (e.g., identifying or providing an indication) the pregnancy-related state of the subject can be output electronically. The subject may not exhibit a pregnancy-related state (e.g., no symptoms of a pregnancy-related state, such as pregnancy-related complications). The report can be presented on a graphical user interface (GUI) of the user's electronic device. The user can be a subject, a caregiver, a doctor, a nurse, or other health care worker.

报告可以包括一种或多种临床指征,诸如(i)对象的妊娠相关状态的诊断,(ii)对象的妊娠相关状态的预后,(iii)对象的妊娠相关状态的风险增加,(iv)对象的妊娠相关状态的风险降低,(v)疗程对对象的妊娠相关状态的有效性,和(vi)疗程对治疗对象的妊娠相关状态的无效性。报告可以包括基于这些一种或多种临床指征做出的一个或多个临床行动或决定。此类临床行动或决定可以针对治疗性干预、引产或抑制分娩,或对对象的妊娠相关状态进行进一步的临床评估或检测。The report may include one or more clinical indications, such as (i) diagnosis of a subject's pregnancy-related condition, (ii) prognosis of a subject's pregnancy-related condition, (iii) increased risk of a subject's pregnancy-related condition, (iv) reduced risk of a subject's pregnancy-related condition, (v) effectiveness of a course of treatment for a subject's pregnancy-related condition, and (vi) ineffectiveness of a course of treatment for a subject's pregnancy-related condition. The report may include one or more clinical actions or decisions made based on these one or more clinical indications. Such clinical actions or decisions may be directed to therapeutic intervention, induction of labor or inhibition of labor, or further clinical evaluation or testing of a subject's pregnancy-related condition.

例如,对象的妊娠相关状态的诊断的临床指征可以伴随着为对象开具新的治疗性干预处方的临床行动。作为另一实例,对象的妊娠相关状态的风险增加的临床指征可以伴随着为对象开具新的治疗性干预处方或切换治疗性干预(例如,结束当前的治疗并开具新的治疗处方)的临床行动。作为另一实例,对象的妊娠相关状态的风险降低的临床指征可以伴随着继续或结束对对象的当前治疗性干预的临床行动。作为另一实例,疗程对治疗对象的妊娠相关状态的疗程的有效性的临床指征可以伴随着继续或结束对对象的当前治疗性干预的临床行动。作为另一实例,疗程对治疗对象的妊娠相关状态的无效性的临床指征可以伴随着结束当前的治疗性干预和/或切换到(例如,开具处方)用于对象的不同的新治疗性干预。For example, the clinical indication of the diagnosis of the pregnancy-related state of the object can be accompanied by the clinical action of prescribing a new therapeutic intervention prescription for the object. As another example, the clinical indication of the increased risk of the pregnancy-related state of the object can be accompanied by the clinical action of prescribing a new therapeutic intervention prescription or switching therapeutic intervention (e.g., ending the current treatment and prescribing a new treatment prescription) for the object. As another example, the clinical indication of the reduced risk of the pregnancy-related state of the object can be accompanied by the clinical action of continuing or ending the current therapeutic intervention of the object. As another example, the clinical indication of the effectiveness of the course of treatment for the pregnancy-related state of the object can be accompanied by the clinical action of continuing or ending the current therapeutic intervention of the object. As another example, the clinical indication of the invalidity of the pregnancy-related state of the object of the treatment course can be accompanied by ending the current therapeutic intervention and/or switching to (e.g., prescribing) different new therapeutic interventions for the object.

计算机系统Computer Systems

本公开提供了编程为实施本公开的方法的计算机系统。图1示出了计算机系统101,其被编程或以其他方式配置为例如,(i)训练和测试经训练算法,(ii)使用经训练算法来处理数据以确定对象的妊娠相关状态,(iii)确定指示对象的妊娠相关状态的定量量度,(iv)鉴定或监测对象的妊娠相关状态,以及(v)以电子方式输出指示对象的妊娠相关状态的报告。The present disclosure provides computer systems programmed to implement the methods of the present disclosure. Figure 1 shows a computer system 101 that is programmed or otherwise configured to, for example, (i) train and test a trained algorithm, (ii) use the trained algorithm to process data to determine a subject's pregnancy-related state, (iii) determine a quantitative measure indicative of a subject's pregnancy-related state, (iv) identify or monitor a subject's pregnancy-related state, and (v) electronically output a report indicative of a subject's pregnancy-related state.

计算机系统101可以调节本公开的分析、计算和生成的各个方面,例如,(i)训练和测试经训练算法,(ii)使用经训练算法来处理数据以确定对象的妊娠相关状态,(iii)确定指示对象的妊娠相关状态的定量量度,(iv)鉴定或监测对象的妊娠相关状态,以及(v)以电子方式输出指示对象的妊娠相关状态的报告。计算机系统101可以是用户的电子设备或相对于该电子设备远程定位的计算机系统。电子设备可以是移动电子设备。Computer system 101 can coordinate various aspects of the analysis, calculation, and generation of the present disclosure, such as (i) training and testing a trained algorithm, (ii) using a trained algorithm to process data to determine a subject's pregnancy-related state, (iii) determining a quantitative measure indicative of a subject's pregnancy-related state, (iv) identifying or monitoring a subject's pregnancy-related state, and (v) electronically outputting a report indicative of a subject's pregnancy-related state. Computer system 101 can be a user's electronic device or a computer system remotely located relative to the electronic device. The electronic device can be a mobile electronic device.

计算机系统101包括中央处理单元(CPU,本文中也称为“处理器”和“计算机处理器”)105,其可以是单核或多核处理器,也可以是用于并行处理的多个处理器。计算机系统101还包括存储器或存储单元110(例如,随机存取存储器、只读存储器、闪存)、电子存储单元115(例如,硬盘)、用于与一个或多个其他系统通信的通信接口120(例如,网络适配器),以及外围设备125,诸如缓存、其他存储器、数据存储装置和/或电子显示适配器。存储器110、存储单元115、接口120和外围设备125通过诸如主板的通信总线(实线)与CPU 105通信。存储单元115可以是用于存储数据的数据存储单元(或数据存储库)。借助于通信接口120,计算机系统101可操作地耦联到计算机网络(“网络”)130。网络130可以是因特网、互联网和/或外联网,或与因特网通信的内联网和/或外联网。Computer system 101 includes a central processing unit (CPU, also referred to herein as a "processor" and "computer processor") 105, which may be a single-core or multi-core processor, or may be multiple processors for parallel processing. Computer system 101 also includes a memory or storage unit 110 (e.g., random access memory, read-only memory, flash memory), an electronic storage unit 115 (e.g., a hard disk), a communication interface 120 (e.g., a network adapter) for communicating with one or more other systems, and peripherals 125, such as cache, other memory, data storage devices, and/or electronic display adapters. Memory 110, storage unit 115, interface 120, and peripherals 125 communicate with CPU 105 via a communication bus (solid lines) such as a motherboard. Storage unit 115 may be a data storage unit (or data repository) for storing data. By means of communication interface 120, computer system 101 may be operatively coupled to a computer network ("network") 130. The network 130 may be the Internet, the internet and/or an extranet, or an intranet and/or an extranet in communication with the Internet.

在某些情况下,网络130是电信和/或数据网络。网络130可以包括一个或多个计算机服务器,其能够启用分布式计算,诸如云计算。例如,一个或多个计算机服务器可以启用网络130(“云”)上的云计算来执行分析、计算并生成本公开的各个方面,例如,(i)训练和测试经训练算法,(ii)使用经训练算法来处理数据以确定对象的妊娠相关状态,(iii)确定指示对象的妊娠相关状态的定量量度,(iv)鉴定或监测对象的妊娠相关状态,以及(v)以电子方式输出指示对象的妊娠相关状态的报告。此种云计算可以由云计算平台提供,例如,Amazon Web Services(AWS)、Microsoft Azure、Google Cloud Platform和IBM云。在某些情况下,借助于计算机系统101,网络130可以实现对等网络,其可以启用耦联到计算机系统101的设备充当客户端或服务器。In some cases, network 130 is a telecommunications and/or data network. Network 130 may include one or more computer servers that can enable distributed computing, such as cloud computing. For example, one or more computer servers may enable cloud computing on network 130 (“cloud”) to perform analysis, calculations, and generate various aspects of the present disclosure, such as (i) training and testing trained algorithms, (ii) using trained algorithms to process data to determine a pregnancy-related state of a subject, (iii) determining a quantitative measure indicating a pregnancy-related state of a subject, (iv) identifying or monitoring a pregnancy-related state of a subject, and (v) electronically outputting a report indicating a pregnancy-related state of a subject. Such cloud computing may be provided by cloud computing platforms, such as Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform, and IBM Cloud. In some cases, with the aid of computer system 101, network 130 may implement a peer-to-peer network that may enable devices coupled to computer system 101 to act as clients or servers.

CPU 105可以包括一个或多个计算机处理器和/或一个或多个图形处理器(GPU)。CPU 105可以执行一系列机器可读指令,这些指令可以体现在程序或软件中。指令可以存储在存储位置,诸如存储器110。指令可以定向到CPU 105,其随后可以对CPU 105进行编程或以其他方式进行配置以实施本公开的方法。由CPU 105执行的操作的实例可以包括获取、解码、执行和写回。The CPU 105 may include one or more computer processors and/or one or more graphics processing units (GPUs). The CPU 105 may execute a series of machine-readable instructions, which may be embodied in a program or software. The instructions may be stored in a storage location, such as a memory 110. The instructions may be directed to the CPU 105, which may then program or otherwise configure the CPU 105 to implement the methods of the present disclosure. Examples of operations performed by the CPU 105 may include fetching, decoding, executing, and writing back.

CPU 105可以是电路(诸如集成电路)的一部分。系统101的一个或多个其他组件可以包括在电路中。在某些情况下,电路是专用集成电路(ASIC)。CPU 105 may be part of a circuit, such as an integrated circuit. One or more other components of system 101 may be included in the circuit. In some cases, the circuit is an application specific integrated circuit (ASIC).

存储单元115可以存储文件,诸如驱动程序、库和保存的程序。存储单元115可以存储用户数据,例如用户偏好和用户程序。在某些情况下,计算机系统101可以包括一个或多个在计算机系统101外部的附加数据存储单元,诸如位于通过内联网或因特网与计算机系统101通信的远程服务器上。Storage unit 115 may store files, such as drivers, libraries, and saved programs. Storage unit 115 may store user data, such as user preferences and user programs. In some cases, computer system 101 may include one or more additional data storage units external to computer system 101, such as located on a remote server that communicates with computer system 101 via an intranet or the Internet.

计算机系统101可以通过网络130与一个或多个远程计算机系统通信。例如,计算机系统101可以与用户的远程计算机系统通信。远程计算机系统的实例包括个人电脑(例如,便携式PC)、平板个人计算机或平板电脑(例如,iPad,GalaxyTab)、电话、智能手机(例如,iPhone、支持Android的设备、)或个人数字助理。用户可以通过网络130访问计算机系统101。Computer system 101 can communicate with one or more remote computer systems via network 130. For example, computer system 101 can communicate with a user's remote computer system. Examples of remote computer systems include personal computers (e.g., portable PCs), tablet personal computers or tablet computers (e.g., iPad, GalaxyTab), phones, smartphones (e.g. iPhone, Android-enabled devices, ) or a personal digital assistant. Users can access the computer system 101 through the network 130.

本文描述的方法可以通过存储在计算机系统101(例如,在存储器110或电子存储单元115上)的电子存储位置上的机器(例如,计算机处理器)可执行代码来实施。机器可执行或机器可读代码可以以软件的形式提供。在使用期间,代码可以由处理器105执行。在某些情况下,代码可以从存储单元115检索并存储在存储器110上,以便处理器105随时访问。在某些情况下,可以排除电子存储单元115,并且机器可执行指令被存储在存储器110上。The methods described herein may be implemented by machine (e.g., computer processor) executable code stored in an electronic storage location of computer system 101 (e.g., on memory 110 or electronic storage unit 115). Machine executable or machine readable code may be provided in the form of software. During use, the code may be executed by processor 105. In some cases, the code may be retrieved from storage unit 115 and stored on memory 110 so that it is accessible to processor 105 at any time. In some cases, electronic storage unit 115 may be excluded and machine executable instructions are stored on memory 110.

代码可以预先编译和配置,以便与具有适于执行代码的处理器的机器一起使用,或可以在运行时编译。代码可以以编程语言提供,可以选择该编程语言以使代码能够以预编译或按编译的方式执行。The code may be pre-compiled and configured for use with a machine having a processor suitable for executing the code, or may be compiled at run time. The code may be provided in a programming language which may be selected to enable the code to be executed in a pre-compiled or compiled manner.

本文提供的系统和方法的各个方面,诸如计算机系统101,可以体现在编程中。技术的各个方面可以被认为是“产品”或“制品”,通常以机器(或处理器)可执行代码和/或相关数据的形式进行或体现在一种机器可读介质中。机器可执行代码可以存储在电子存储单元上,诸如存储器(例如,只读存储器、随机存取存储器、闪存)或硬盘。“存储”型介质可以包括计算机、处理器等的任何或所有有形存储器,或其相关模块,诸如各种半导体存储器、磁带驱动器、磁盘驱动器等,它们可以随时为软件编程提供非暂时性存储。软件的全部或部分有时可以通过互联网或各种其他电信网络进行通信。例如,此类通信例如可以使软件能够从一台计算机或处理器加载到另一台计算机或处理器中,例如,从管理服务器或主机加载到应用程序服务器的计算机平台中。因此,可以承载软件元素的另一种类型的介质包括光、电和电磁波,诸如通过有线和光学陆线网络以及通过各种空中链路在本地设备之间的物理接口上使用的。承载此类波的物理元件,诸如有线或无线链路、光链路等,也可以被视为承载软件的介质。如本文所用,除非限于非暂时的、有形的“存储”介质,诸如计算机或机器“可读介质”的术语是指参与向处理器提供指令以供执行的任何介质。Various aspects of the systems and methods provided herein, such as computer system 101, can be embodied in programming. Various aspects of the technology can be considered as "products" or "articles", usually in the form of machine (or processor) executable code and/or related data or embodied in a machine-readable medium. The machine executable code can be stored on an electronic storage unit, such as a memory (e.g., read-only memory, random access memory, flash memory) or a hard disk. "Storage" type media can include any or all tangible memories of a computer, processor, etc., or its related modules, such as various semiconductor memories, tape drives, disk drives, etc., which can provide non-temporary storage for software programming at any time. All or part of the software can sometimes communicate through the Internet or various other telecommunications networks. For example, such communications can, for example, enable software to be loaded from one computer or processor to another computer or processor, for example, from a management server or host to a computer platform of an application server. Therefore, another type of medium that can carry software elements includes light, electricity, and electromagnetic waves, such as those used on physical interfaces between local devices through wired and optical landline networks and through various air links. Physical elements that carry such waves, such as wired or wireless links, optical links, etc., can also be considered media that carry the software. As used herein, unless limited to non-transitory, tangible "storage" media, terms such as computer or machine "readable media" refer to any medium that participates in providing instructions to a processor for execution.

因此,机器可读介质,诸如计算机可执行代码,可以采取多种形式,包括但不限于有形存储介质、载波介质或物理传输介质。非易失性存储介质包括,例如,光盘或磁盘(诸如任何计算机中的任何存储设备等),诸如可以用于实施数据库等,如附图所示。易失性存储介质包括动态存储器,诸如此种计算机平台的主存储器。有形传输介质包括同轴电缆;铜线和光纤,包括构成计算机系统内总线的电线。载波传输介质可以采用电信号或电磁信号的形式,或者声波或光波的形式,诸如射频(RF)和红外(IR)数据通信期间产生的声波或光波。因此,计算机可读介质的常见形式包括例如:软盘、软磁盘、硬盘、磁带、任何其他磁性介质、CD-ROM、DVD或DVD-ROM、任何其他光学介质、穿孔卡纸带、任何其他具有孔图案的物理存储介质、RAM、ROM、PROM和EPROM、FLASH-EPROM、任何其他存储芯片或盒、传输数据或指令的载波、传输此类载波的电缆或链路、或计算机可以从中读取编程代码和/或数据的任何其他介质。这些形式的计算机可读介质中的许多计算机可读介质可以参与将一个或多个指令的一个或多个序列传送到处理器供执行。Thus, machine-readable media, such as computer executable code, may take a variety of forms, including but not limited to tangible storage media, carrier media, or physical transmission media. Non-volatile storage media include, for example, optical or magnetic disks (such as any storage device in any computer, etc.), such as may be used to implement a database, etc., as shown in the accompanying drawings. Volatile storage media include dynamic memory, such as the main memory of such a computer platform. Tangible transmission media include coaxial cables; copper wire and optical fiber, including the wires that make up a bus within a computer system. Carrier transmission media may take the form of an electrical or electromagnetic signal, or an acoustic or light wave, such as those generated during radio frequency (RF) and infrared (IR) data communications. Thus, common forms of computer-readable media include, for example: a floppy disk, a diskette, a hard disk, magnetic tape, any other magnetic medium, a CD-ROM, a DVD or DVD-ROM, any other optical medium, a punched card tape, any other physical storage medium having a pattern of holes, a RAM, a ROM, a PROM and an EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave that transmits data or instructions, a cable or link that transmits such a carrier wave, or any other medium from which a computer can read programming code and/or data. Many of these forms of computer-readable media can participate in conveying one or more sequences of one or more instructions to a processor for execution.

计算机系统101可以包括或与电子显示器135通信,该电子显示器包括用户界面(UI)140,用于提供例如(i)指示经训练算法的训练和测试的视觉显示,(ii)指示对象的妊娠相关状态的数据的视觉显示,(iii)对象的妊娠相关状态的定量量度,(iv)将对象鉴定为具有妊娠相关状态的对象,或(v)指示对象的妊娠相关状态的电子报告。UI的实例包括但不限于图形用户界面(GUI)和基于Web的用户界面。Computer system 101 may include or communicate with an electronic display 135 that includes a user interface (UI) 140 for providing, for example, (i) a visual display indicating training and testing of a trained algorithm, (ii) a visual display of data indicating a pregnancy-related state of a subject, (iii) a quantitative measure of a pregnancy-related state of a subject, (iv) an identification of a subject as having a pregnancy-related state, or (v) an electronic report indicating a pregnancy-related state of a subject. Examples of UIs include, but are not limited to, graphical user interfaces (GUIs) and web-based user interfaces.

本公开的方法和系统可以通过一种或多种算法来实施。算法可以在由中央处理单元205执行时通过软件的方式实施。算法可以例如,(i)训练和测试经训练算法,(ii)使用经训练算法来处理数据以确定对象的妊娠相关状态,(iii)确定指示对象的妊娠相关状态的定量量度,(iv)鉴定或监测对象的妊娠相关状态,以及(v)以电子方式输出指示对象的妊娠相关状态的报告。The methods and systems of the present disclosure may be implemented by one or more algorithms. The algorithms may be implemented by way of software when executed by the central processing unit 205. The algorithms may, for example, (i) train and test the trained algorithms, (ii) use the trained algorithms to process data to determine a subject's pregnancy-related state, (iii) determine a quantitative measure indicative of a subject's pregnancy-related state, (iv) identify or monitor a subject's pregnancy-related state, and (v) electronically output a report indicative of a subject's pregnancy-related state.

实施例Example

实施例1:使用分子亚分型和cfRNA谱分析对高风险群体中自发性早产和晚期流产的早期预测Example 1: Early prediction of spontaneous preterm birth and late miscarriage in a high-risk population using molecular subtyping and cfRNA profiling

使用本公开的系统和方法,在来自由于临床病史而早产(PTB)风险增加的群体的母体血液样品中鉴定PTB和死产的早期分子标志物。Using the systems and methods of the present disclosure, early molecular markers for preterm birth (PTB) and stillbirth are identified in maternal blood samples from a population at increased risk for PTB due to clinical history.

研究设计进行如下。在妊娠12-24周(IQR 18.9-20.9)之间并在临产前收集来自229名女性的血液样品。在英国,跨越4个独立地点收集样品,且队列的51%被认为处于高风险,高风险由以下中的至少一项定义:既往sPTB(16-36)、既往子宫颈手术或小于25mm的子宫颈长度。80%(n=183)是足月产,且20%(n=46)具有sPTB(GA小于35周)。30%的sPTB在小于25周的GA分娩。80%(n=183)是足月产,且20%(n=46)具有sPTB(GA小于35周)。30%的sPTB在小于25周的GA分娩。使用统一的实验和计算的NGS管道处理所有样品用于无细胞(cfRNA)测序。将双胎妊娠和先兆子痫病例排除。The study design was conducted as follows. Blood samples from 229 women were collected between 12-24 weeks of gestation (IQR 18.9-20.9) and before delivery. In the UK, samples were collected across 4 independent sites, and 51% of the cohort was considered to be at high risk, defined by at least one of the following: previous sPTB (16-36), previous cervical surgery, or cervical length less than 25 mm. 80% (n = 183) were term births, and 20% (n = 46) had sPTB (GA less than 35 weeks). 30% of sPTB were delivered at a GA of less than 25 weeks. 80% (n = 183) were term births, and 20% (n = 46) had sPTB (GA less than 35 weeks). 30% of sPTB were delivered at a GA of less than 25 weeks. All samples were processed using a unified experimental and computational NGS pipeline for cell-free (cfRNA) sequencing. Twin pregnancy and preeclampsia cases were excluded.

对获得自所有229个母体血液样品的血浆进行cfRNA谱分析。41个基因(COL3A1、HSD17B1、GPC4、CDR1-AS、COL5A1、COL6A1、COL1A1、EFHD1、LENG8、DCN、CYB5R2、ELN、ANTXR1、CH507-24F1.2、EIF4A1、ABI3BP、LAPTM5、SLC38A10、CCDC80、C1S、VPS45、COL1A2、C7、FN1、COL14A1、ARPC4-TTLL3、LINC01002、PSG4、TMTC2、STAG3L5P、AMT、TAP1、CSH1、MMP2、PLXNA3、LGMNP1、LUM、MYO18B、DAPK2、GCM1L、GALS14、FNDC 10、FBN2、CAPN13、TNFRSF25、MYH11、POGLUT1、GH2、DNAH1、DES、NUP210)被鉴定为与sPTB的风险增加相关联(FDR<0.1)。基于这些转录物,开发逻辑回归分类器模型以预测PTB,获得AUC=0.72,如图2A-图2C所示。通过留一交叉验证(LOOCV)验证该模型。PTB在病理生理学中的其他见解呈现在图2D中。针对与sPTB相关联的转录物的途径分析揭示,最终在小于35周的GA具有sPTB的那些个体中与含胶原和细胞外基质相关的基因的富集。Plasma obtained from all 229 maternal blood samples was subjected to cfRNA profiling. 41 genes (COL3A1, HSD17B1, GPC4, CDR1-AS, COL5A1, COL6A1, COL1A1, EFHD1, LENG8, DCN, CYB5R2, ELN, ANTXR1, CH507-24F1.2, EIF4A1, ABI3BP, LAPTM5, SLC38A10, CCDC80, C1S, VPS45, COL1A2, C7, FN1, COL14A1, ARPC4-TTLL3, LINC01002, PSG4, TMTC2, STAG3L5P, AMT, TAP1, CSH1, MMP2, PLXNA3, LGMNP1, LUM, MYO18B, DAPK2, GCM1L, GALS14, FNDC 10, FBN2, CAPN13, TNFRSF25, MYH11, POGLUT1, GH2, DNAH1, DES, NUP210) were identified as being associated with an increased risk of sPTB (FDR<0.1). Based on these transcripts, a logistic regression classifier model was developed to predict PTB, obtaining AUC=0.72, as shown in Figures 2A-2C. The model was validated by leave-one-out cross validation (LOOCV). Other insights into PTB in pathophysiology are presented in Figure 2D. Pathway analysis of transcripts associated with sPTB revealed an enrichment of genes related to collagen and extracellular matrix in those individuals who ultimately had sPTB at less than 35 weeks of GA.

针对极晚期流产/早期早产(分娩时GA小于25周)对14个病例和166个足月对照进行了附加的分析。基于11个基因(AC011043.1、IGFBP2、SH3GL3、AMT、GTF2IP4、GYPB、PAPPA、CH17-472G23.2、OMA1、ACADSB、ACER3)开发逻辑回归分类器以预测极晚期流产/早期早产的风险,在LOOCV中具有AUC=0.76的增强性能。同时富集与GA小于25周的sPTB相关联。Additional analysis was performed on 14 cases and 166 term controls for very late miscarriage/early preterm birth (GA at delivery less than 25 weeks). A logistic regression classifier was developed based on 11 genes (AC011043.1, IGFBP2, SH3GL3, AMT, GTF2IP4, GYPB, PAPPA, CH17-472G23.2, OMA1, ACADSB, ACER3) to predict the risk of very late miscarriage/early preterm birth with enhanced performance in LOOCV with AUC=0.76. Simultaneous enrichment was associated with sPTB with GA less than 25 weeks.

6种转录物集的途径分析揭示了在来自在25周GA之前分娩的母体对象的样品中与基膜和内质网腔相关的基因集的富集,以及胰岛素样生长因子转运和摄取以及氨基酸代谢途径中的基因的富集,从而在极晚期流产/早期早产的病理生理学中提供了另外的见解,如图3所示。Pathway analysis of the six transcript sets revealed enrichment of gene sets associated with the basement membrane and endoplasmic reticulum lumen in samples from maternal subjects who delivered before 25 weeks GA, as well as enrichment of genes in the insulin-like growth factor transport and uptake and amino acid metabolism pathways, providing additional insights in the pathophysiology of very late miscarriage/early preterm birth, as shown in Figure 3.

在妊娠并发症风险增加的妊娠中,母体血浆中cfRNA的分析提供了对母体胎儿健康的非侵入性窗口。我们的数据显示,转录物子集的升高的表达潜在地涉及早产的至少两种分子亚型和两种不同的下划线机制。在第一分子亚型中,含胶原的细胞外基质途径可以与子宫颈重塑相关联,而子宫颈重塑与较短的子宫颈相关联,并且子宫颈不足可以形成对于高风险PTB分娩的潜在生物学的基础。在第二分子亚型中,对于内质网腔途径,在蜕膜细胞中由氧化应激诱导的内质网应激可能与早期妊娠丢失的可能机制相关。此外,基膜途径可以指示在流产期间从子宫的胎膜早剥。In pregnancies at increased risk for pregnancy complications, analysis of cfRNA in maternal plasma provides a non-invasive window into maternal-fetal health. Our data show that elevated expression of a subset of transcripts is potentially involved in at least two molecular subtypes and two different underlining mechanisms of preterm birth. In the first molecular subtype, the collagen-containing extracellular matrix pathway can be associated with cervical remodeling, which is associated with a shorter cervix, and an inadequate cervix can form the basis of the underlying biology for high-risk PTB deliveries. In the second molecular subtype, for the endoplasmic reticulum luminal pathway, endoplasmic reticulum stress induced by oxidative stress in decidual cells may be associated with a possible mechanism of early pregnancy loss. In addition, the basement membrane pathway may indicate abruption of membranes from the uterus during miscarriage.

基于PTB的分子亚分型的这些结果,可以基于具体的分子亚型选择特定治疗并将其施用于母体对象,以调节早产的结果。例如,可以应用胶原调节治疗剂或宫颈环扎以使宫颈稳定化。可以通过施用治疗剂以减少氧化应激并且/或者调节与内质网应激相关的蛋白质的表达水平来防止晚期流产病例。Based on these results of molecular subtyping of PTB, specific treatments can be selected based on the specific molecular subtype and administered to the maternal subject to modulate the outcome of preterm birth. For example, collagen modulating therapeutics or cervical cerclage can be applied to stabilize the cervix. Late miscarriage cases can be prevented by administering therapeutics to reduce oxidative stress and/or modulate the expression levels of proteins associated with endoplasmic reticulum stress.

实施例2:使用分子亚分型和cfRNA谱分析对先兆子痫的早期预测Example 2: Early prediction of preeclampsia using molecular subtyping and cfRNA profiling

使用本公开的系统和方法,在来自由于临床病史而先兆子痫(PE)风险增加的群体的母体血液样品中鉴定先兆子痫(PE)的早期分子标志物。Using the systems and methods of the present disclosure, early molecular markers of pre-eclampsia (PE) are identified in maternal blood samples from a population at increased risk for pre-eclampsia (PE) due to clinical history.

此外,对象的队列包括在胎龄37周后分娩的对照对象集。一些对照对象被分类为健康对照,并且一些对照对象具有慢性高血压病史,无先兆子痫。病例对象集被诊断为先兆子痫,在胎龄37周前分娩。病例对象集被诊断为新发(de novo)先兆子痫,并且病例对象集患有先兆子痫并有慢性高血压病史。In addition, the cohort of subjects includes a control subject set that delivered after gestational age 37 weeks. Some control subjects were classified as healthy controls, and some control subjects had a history of chronic hypertension without pre-eclampsia. The case subject set was diagnosed with pre-eclampsia and delivered before gestational age 37 weeks. The case subject set was diagnosed with de novo pre-eclampsia, and the case subject set suffered from pre-eclampsia and had a history of chronic hypertension.

队列数据集的差异表达分析如下。使用无细胞RNA进行生物标志物发现以鉴定先兆子痫的早期诊断标志物。为了估计慢性高血压的影响,进行两次单独的差异表达分析来估计慢性高血压的影响。对先兆子痫病例集和健康对照集进行第一项分析;此外,进行第二项分析,其中添加患有慢性高血压的对照对象集,由此总共有更多的对照对象。Differential expression analysis of the cohort data set is as follows. Biomarker discovery was performed using cell-free RNA to identify early diagnostic markers for pre-eclampsia. To estimate the impact of chronic hypertension, two separate differential expression analyses were performed to estimate the impact of chronic hypertension. The first analysis was performed on the pre-eclampsia case set and the healthy control set; in addition, a second analysis was performed in which a control subject set with chronic hypertension was added, thereby having more control subjects in total.

鉴定队列中PE的排名靠前的差异表达基因集用于包括慢性高血压和不包括慢性高血压的比较。来自两项分析的排名靠前的基因被观察到重叠,这表明与先兆子痫相关的信号,而不是慢性高血压。The top-ranked differentially expressed gene sets for PE in the identified cohort were used for comparisons including and excluding chronic hypertension. Overlap was observed in the top-ranked genes from both analyses, suggesting a signal associated with pre-eclampsia rather than chronic hypertension.

与PE的更高风险相关联的高度显著基因的附加分析表明具有不同潜在生物学的至少两个单独的途径。在第一分子亚型中,低表达的胎盘特异性基因(如PAPA2和FABP1)的富集表明早期胎盘的显著变化,这可以与先兆子痫相关。而且,在13周之前的早期妊娠施用的高剂量阿司匹林可以降低PE极早发作的风险(<32周),但可以不减少稍后发展的PE。在第二分子亚型中,与角质形成细胞内皮相关的途径可以与血管炎症、内皮功能障碍和动脉性高血压相关。此外,皮肤保持复杂的毛细血管逆流系统,该系统控制体温、皮肤灌注以及明显全身的血压。因此,先兆子痫的一次的病例可与母亲的皮肤、毛细血管和/或动脉功能障碍的潜在生物学相关。Additional analysis of highly significant genes associated with a higher risk of PE suggests at least two separate pathways with different underlying biology. In the first molecular subtype, enrichment of low-expressed placenta-specific genes (such as PAPA2 and FABP1) indicates significant changes in the early placenta, which may be associated with pre-eclampsia. Moreover, high-dose aspirin administered in early pregnancy before 13 weeks can reduce the risk of very early onset of PE (<32 weeks), but may not reduce PE that develops later. In the second molecular subtype, pathways associated with keratinocyte endothelium may be associated with vascular inflammation, endothelial dysfunction, and arterial hypertension. In addition, the skin maintains a complex capillary countercurrent system that controls body temperature, skin perfusion, and significant systemic blood pressure. Therefore, a single case of pre-eclampsia may be associated with the underlying biology of the mother's skin, capillary, and/or arterial dysfunction.

基于PE的分子亚分型的这些结果,可以选择特定治疗并其施用于母体对象以调节发展先兆子痫的结果或风险。例如,可以用类似于阿司匹林的化合物治疗具有胎盘形成的分子亚型的对象,其与环氧合酶途径的调控或负责调节血管舒张介质和抑制血管重塑、血小板聚集和血小板粘附的途径相关。作为另一个示例,可以用血压管理化合物或靶向质子泵抑制剂(PPI)机制的化合物治疗患有与角质形成细胞内皮途径相关联的PE的分子亚型的对象。阻滞PPI可以导致sFlt-1和可溶性内皮糖蛋白(sENG)分泌减少和内皮功能障碍、血管扩张、BP减少以及抗氧化剂和抗炎特性。埃索美拉唑(另一种质子泵抑制剂,也用于胃回流)的使用可以在II期临床研究中评估以治疗母体对象中的早发PE(PIE踪迹)。Based on these results of the molecular subtyping of PE, specific treatment can be selected and applied to maternal subjects to adjust the results or risks of developing pre-eclampsia. For example, a compound similar to aspirin can be used to treat an object with a molecular subtype of placenta formation, which is related to the regulation of the cyclooxygenase pathway or the pathway responsible for regulating vasodilatory mediators and inhibiting vascular remodeling, platelet aggregation and platelet adhesion. As another example, a compound of blood pressure management compounds or a compound targeting the proton pump inhibitor (PPI) mechanism can be used to treat an object of a molecular subtype of PE associated with the keratinocyte endothelial pathway. Blocking PPI can lead to reduced secretion of sFlt-1 and soluble endoglin (sENG) and endothelial dysfunction, vasodilation, BP reduction, and antioxidant and anti-inflammatory properties. The use of esomeprazole (another proton pump inhibitor, also used for gastric reflux) can be evaluated in Phase II clinical studies to treat premature PE (PIE trace) in maternal subjects.

实施例3:使用分子亚分型和cfRNA谱分析对妊娠期糖尿病(GDM)的早期预测Example 3: Early prediction of gestational diabetes mellitus (GDM) using molecular subtyping and cfRNA profiling

使用本公开的系统和方法,在来自由于临床病史而糖尿病(GDM)风险增加的群体的母体血液样品中鉴定GDM的早期分子标志物。Using the systems and methods of the present disclosure, early molecular markers of diabetes mellitus (GDM) are identified in maternal blood samples from a population at increased risk for GDM due to clinical history.

此外,对象的队列包括对照对象集。一些对照对象被分类为具有口服葡萄糖耐量试验(OGTT)试验阴性的健康对照。基于OGTT测试,第一病例对象集被诊断患有妊娠GDM,第二病例对象集被诊断患有慢性2型糖尿病,并且第三病例对象集具有受损的葡萄糖状态。In addition, the cohort of subjects includes a control subject set. Some control subjects are classified as healthy controls with negative oral glucose tolerance test (OGTT) tests. Based on the OGTT test, the first case subject set is diagnosed with gestational GDM, the second case subject set is diagnosed with chronic type 2 diabetes, and the third case subject set has an impaired glucose state.

队列数据集的差异性表达分析进行如下。使用无细胞RNA进行生物标志物发现以鉴定GDM的早期诊断标志物。为了估计慢性2型糖尿病的效果,进行两种单独的差异表达分析以估计效果。对妊娠GDM病例集和健康对照集进行第一分析;此外,进行第二分析,其中加入患有慢性2型糖尿病的对照对象集,由此总计有更大数量的对照对象。Differential expression analysis of the cohort data set was performed as follows. Biomarker discovery was performed using cell-free RNA to identify early diagnostic markers for GDM. To estimate the effect of chronic type 2 diabetes, two separate differential expression analyses were performed to estimate the effect. The first analysis was performed on the set of pregnant GDM cases and the set of healthy controls; in addition, a second analysis was performed in which a set of control subjects with chronic type 2 diabetes was added, thereby totaling a larger number of control subjects.

针对包括慢性2型糖尿病和排除慢性2型糖尿病的两个比较,鉴定队列中针对GDM的排名靠前的差异表达基因。观察到来自两种分析的排名靠前的基因重叠,这指示与GDM相关联的信号,而不是慢性2型糖尿病。The top differentially expressed genes for GDM in the cohort were identified for both comparisons including chronic type 2 diabetes and excluding chronic type 2 diabetes. An overlap of the top genes from both analyses was observed, indicating a signal associated with GDM, rather than chronic type 2 diabetes.

与GDM更高风险相关联的高度显著基因的附加分析表明具有不同潜在生物学的至少三种单独的途径。在第一分子亚型中,低表达的胎盘特异性基因(PDK4、CSH1和PLAC4)的富集表明在妊娠期糖尿病下呈胎盘退化、胎盘功能不足、胎盘衰竭、胎盘功能障碍、过早老化、钙化以及胎盘功能受损形式的显著变化。在第二分子亚型中,一个基因TBCEL(微管蛋白特异性伴侣辅因子E样)可以与介导的高血糖记忆相关联,这可以是针对1型和2型糖尿病常见的。在第三分子亚型中,FBXO7基因涉及适应性免疫系统和抗原特异性免疫应答有效途径。GDM不仅通过增加的胰岛素抗性和葡萄糖不耐受来表征,而且还通过轻度全身炎症的状态和诱导1型和2型辅助性T细胞之间的失衡的免疫系统调节异常来表征。Additional analysis of highly significant genes associated with a higher risk of GDM indicated at least three separate pathways with different underlying biology. In the first molecular subtype, enrichment of low-expressed placenta-specific genes (PDK4, CSH1, and PLAC4) indicated significant changes in the form of placental degeneration, placental insufficiency, placental failure, placental dysfunction, premature aging, calcification, and impaired placental function under gestational diabetes. In the second molecular subtype, one gene, TBCEL (tubulin-specific chaperone cofactor E-like), can be associated with mediated hyperglycemic memory, which can be common to type 1 and type 2 diabetes. In the third molecular subtype, the FBXO7 gene is involved in the adaptive immune system and antigen-specific immune response effective pathways. GDM is characterized not only by increased insulin resistance and glucose intolerance, but also by a state of mild systemic inflammation and immune system dysregulation that induces an imbalance between type 1 and type 2 helper T cells.

基于对于GDM的分子亚分型的这些结果,可以选择并向母体对象施用特定治疗以调节发展GDM的结果或风险。例如,具有指示胎盘功能障碍或退化的显著变化的分子亚型的对象可以用潜在的候选药物靶效应物治疗,以改善子宫胎盘的血液流动、抗氧化剂、血红素加氧酶诱导、HIF抑制、胆固醇合成途径的诱导、增加胰岛素样生长因子II的可用性。作为另一个示例,对于具有与介导的高血糖记忆相关联的第二分子亚型的对象,这种葡萄糖失衡的早期侵袭性治疗可以施用于患有糖尿病的对象。高血糖可以伴随晚期糖基化终末产物(AGE)的形成。另一种治疗方法可以是尝试减少AGE形成、AGE受体(RAGE)表达和氧化应激产生。可以施用不同的药物(诸如二甲双胍和吡格列酮)以阻滞AGE形成。ACE抑制剂和AT-1阻滞剂是用于控制血压的化合物;然而,它们还能够减少AGE的形成。替米沙坦下调RAGE mRNA水平并随后抑制超氧化物生成,而格列齐特可以用于消除“记忆”。此外,可以施用GLP1受体激动剂以减少炎症、餐后高脂血症和凝固,从而导致对动脉粥样硬化血栓形成的有益效果。可以施用醛糖还原酶抑制剂如依帕司他,以通过减轻氧化应激和抑制多元醇途径保护免受糖尿病周围神经病变。作为另一个示例,对于具有涉及适应性免疫系统和抗原特异性途径的第三分子亚型的对象,妊娠是显著的代谢和免疫挑战;此外,GDM叠加了增强程度的低级的全身炎症和无补偿的胰岛素抵抗,这可以进一步与潜在免疫应答的失调相关,这对于1型和2型可以是常见的。可以施用免疫调节剂来治疗糖尿病,如:硫唑嘌呤、吗替麦考酚酯、奥昔组单抗、替利组单抗(其可以使细胞因子释放最小化并且防止β-细胞的进行性破坏)。Based on these results for the molecular subtyping of GDM, specific treatment can be selected and applied to the maternal object to adjust the results or risks of developing GDM. For example, the object of the molecular subtype with significant changes indicating placental dysfunction or degradation can be treated with potential candidate drug target effectors to improve the blood flow of the uterus and placenta, antioxidants, heme oxygenase induction, HIF inhibition, induction of cholesterol synthesis pathways, and increase the availability of insulin-like growth factor II. As another example, for the object of the second molecular subtype associated with the mediated hyperglycemic memory, the early aggressive treatment of this glucose imbalance can be applied to the object with diabetes. Hyperglycemia can be accompanied by the formation of advanced glycation end products (AGE). Another treatment method can be to try to reduce AGE formation, AGE receptor (RAGE) expression and oxidative stress generation. Different drugs (such as metformin and pioglitazone) can be applied to block AGE formation. ACE inhibitors and AT-1 blockers are compounds used to control blood pressure; however, they can also reduce the formation of AGE. Telmisartan downregulates RAGE mRNA levels and subsequently inhibits superoxide generation, while gliclazide can be used to eliminate "memory". In addition, GLP1 receptor agonists can be administered to reduce inflammation, postprandial hyperlipidemia and coagulation, resulting in beneficial effects on atherosclerotic thrombosis. Aldose reductase inhibitors such as epalrestat can be administered to protect against diabetic peripheral neuropathy by alleviating oxidative stress and inhibiting polyol pathways. As another example, pregnancy is a significant metabolic and immune challenge for subjects with a third molecular subtype involving adaptive immune systems and antigen-specific pathways; in addition, GDM superimposes enhanced levels of low-grade systemic inflammation and uncompensated insulin resistance, which can be further associated with the disorder of potential immune responses, which can be common for type 1 and type 2. Immunomodulators can be administered to treat diabetes, such as: azathioprine, mycophenolate mofetil, oxazolidinone, tilizumab (which can minimize cytokine release and prevent progressive destruction of β-cells).

实施例4:用于高风险群体中的早期和非常早期自发性早产的预测RNA谱和使用反应组(Reactome)数据库的途径分析Example 4: Predictive RNA profiles for early and very early spontaneous preterm birth in high-risk groups and pathway analysis using the Reactome database

使用本公开的系统和方法,在来自高风险临床病史的群体的母体血液样品中,鉴定早产(PTB)和非常早期自发性早产(sPTB)的早期分子标志物。Using the systems and methods of the present disclosure, early molecular markers of preterm birth (PTB) and very early spontaneous preterm birth (sPTB) were identified in maternal blood samples from a population with a high risk clinical history.

通过以下中的至少一项定义高风险妊娠:既往sPTB或晚期流产(妊娠12至37周)、既往破坏性宫颈手术或经阴道超声扫描偶然发现宫颈长度<25mm。从常规产前或超声检查诊所招募没有sPTB风险因素且在入组时其他方面情况良好的女性作为低风险对照。High-risk pregnancies were defined by at least one of the following: previous sPTB or late miscarriage (12 to 37 weeks' gestation), previous destructive cervical surgery, or incidental cervical length <25 mm on transvaginal ultrasound scan. Low-risk controls were recruited from routine antenatal or ultrasound clinics without sPTB risk factors and who were otherwise well at enrollment.

在妊娠12至24周收集来自募集自四家三级产前诊所的单胎妊娠的女性的血液样品(242个血液样品,每次妊娠一个样品)。对于sPTB病例,在分娩前平均9.4周收集样品。具有在收集血液样品时的胎龄和在分娩时的胎龄的242个所收集样品的分布示于图4中。在242例妊娠中,足月产(≥37 0/7GA)有194例,以及在妊娠35周前自发分娩的早产(早期早产,<35 0/7)有48例。16例妊娠的亚组在妊娠25周前分娩(非常早期早产,<25 0/7)。Blood samples from women with singleton pregnancies recruited from four tertiary antenatal clinics were collected at 12 to 24 weeks of gestation (242 blood samples, one sample per pregnancy). For sPTB cases, samples were collected an average of 9.4 weeks before delivery. The distribution of the 242 collected samples with gestational age at the time of blood sample collection and gestational age at delivery is shown in Figure 4. Of the 242 pregnancies, 194 were term births (≥37 0/7GA), and 48 were premature births (early preterm births, <35 0/7) that were spontaneously delivered before 35 weeks of gestation. A subgroup of 16 pregnancies delivered before 25 weeks of gestation (very early preterm births, <25 0/7).

为鉴定可预测早期sPTB(<35 0/7)风险的候选基因,在所有早期分娩(<35 0/7)和对照(≥37 0/7)之间进行差异表达分析。使用留一交叉验证(LOOCV)验证结果,得到表1中列出的25种差异表达基因的列表,这些基因用于建立逻辑回归分类器以预测早产的风险。To identify candidate genes that can predict the risk of early sPTB (<35 0/7), differential expression analysis was performed between all early deliveries (<35 0/7) and controls (≥37 0/7). The results were validated using leave-one-out cross validation (LOOCV), resulting in a list of 25 differentially expressed genes listed in Table 1, which were used to build a logistic regression classifier to predict the risk of preterm birth.

表1:通过LOOCV鉴定的早期sPTB差异表达基因以及相应的跨折(fold)鉴定频率Table 1: Differentially expressed genes in early sPTB identified by LOOCV and their corresponding cross-fold identification frequencies

该模型实现了曲线下面积(AUC)为0.80(95% CI 0.72-0.87)的经验证的LOOCV性能,灵敏度=0.76以及特异性=0.72(N=46例早期sPTB病例和N=183例足月对照),示于图5A中。该模型还以图5B所示的早产分娩的风险概率对每个样品进行评分。The model achieved validated LOOCV performance with an area under the curve (AUC) of 0.80 (95% CI 0.72-0.87), sensitivity = 0.76 and specificity = 0.72 (N = 46 early sPTB cases and N = 183 term controls), as shown in Figure 5A. The model also scored each sample with a risk probability of preterm delivery as shown in Figure 5B.

使用相同方法来鉴定对非常早期sPTB(<25周)具有特异性的分子标志物。在16个非常早期PTB病例和226个对照之间进行差异表达分析,并使用LOOCV验证这些结果。产生65种差异表达基因的列表(表2),其用于建立正则化的逻辑回归分类器来以交叉验证预测非常早期sPTB。The same approach was used to identify molecular markers specific for very early sPTB (<25 weeks). Differential expression analysis was performed between 16 very early PTB cases and 226 controls, and these results were validated using LOOCV. A list of 65 differentially expressed genes was generated (Table 2), which was used to build a regularized logistic regression classifier to predict very early sPTB with cross-validation.

表2:通过LOOCV鉴定的非常早期sPTB差异表达基因以及相应的跨折鉴定频率Table 2: Very early sPTB differentially expressed genes identified by LOOCV and the corresponding cross-fold identification frequencies

该模型实现了AUC=0.74(95%CI 0.64-0.83)的经验证的LOOCV性能。若干种基因被发现是涉及先兆子痫毒血症(PET)的相关基因。为了减少cfRNA特征在两种并发症上的串扰,通过排除所有PET样品和具有低质量测序指标的样品进行建模,导致降低至14个非常早期早产(<25 0/7)的病例。该模型展现出改善的LOOCV性能,对于14个非常早期sPTB病例(<25 0/7)和在25周或在25周后分娩的193个样品(≥25 0/7),AUC=0.76(95%CI 0.63-0.87)[灵敏度=0.64,特异性=0.80],如图5C所示。该模型基于表3中列出的39个差异表达基因的集,从其中在>95%的交叉验证折中鉴定三种基因(AC011043.1、IGFBP2和SH3GL3)的核心集,并且当用PET样品训练时,13个基因与发现的差异表达基因重叠。模型概率显示病例和对照之间的显著差异,尽管对于对照样品的子集观察到高sPTB概率的较长尾部,如图5D所示。The model achieved a validated LOOCV performance of AUC=0.74 (95%CI 0.64-0.83). Several genes were found to be related genes involved in preeclampsia toxemia (PET). In order to reduce the crosstalk of cfRNA features on the two complications, modeling was performed by excluding all PET samples and samples with low-quality sequencing indicators, resulting in a reduction to 14 cases of very early preterm birth (<25 0/7). The model showed improved LOOCV performance, AUC=0.76 (95%CI 0.63-0.87) [sensitivity=0.64, specificity=0.80] for 14 very early sPTB cases (<25 0/7) and 193 samples (≥25 0/7) delivered at 25 weeks or after 25 weeks, as shown in Figure 5C. The model was based on the set of 39 differentially expressed genes listed in Table 3, from a core set where three genes (AC011043.1, IGFBP2, and SH3GL3) were identified at >95% cross-validation compromise, and 13 genes overlapped with the differentially expressed genes found when trained with PET samples. The model probabilities showed significant differences between cases and controls, although a longer tail of high sPTB probabilities was observed for a subset of control samples, as shown in Figure 5D.

表3:排除PET样品后通过LOOCV鉴定的非常早期sPTB差异表达基因以及相应的跨折鉴定频率Table 3: Very early sPTB differentially expressed genes identified by LOOCV after excluding PET samples and the corresponding cross-fold identification frequencies

驱动sPTB预测基因的生物学途径的分析使用反应组数据库和对于表4中列出的每个早产模型的排名靠前的两种途径进行。Analysis of biological pathways driving sPTB predictive genes was performed using the Reactome database and the top two pathways for each preterm birth model listed in Table 4 .

表4.sPTB差异表达基因的途径分析。对于在早期sPTB预测因子(<35 0/7)和非常早期sPTB预测因子(<25 0/7)中发现的基因的途径分析。Table 4. Pathway analysis of differentially expressed genes in sPTB. Pathway analysis for genes found in early sPTB predictors (<35 0/7) and very early sPTB predictors (<25 0/7).

早期早产模型(<35 0/7)富含涉及细胞外基质(ECM)降解和重塑的基因。数据与以下观察一致:细胞与细胞粘附的介质(如粗纤维调节素(COL14A1)和弹性蛋白(ELN))在该模型中排名靠前的基因之中。来自血液抽取的ECM途径早期检测可以用作鉴别处于过早宫颈重塑风险中的个体。这样的筛查测试可以在与超声类似的窗口处实施以测量处于高风险的女性的宫颈长度。The early preterm birth model (<35 0/7) is enriched for genes involved in extracellular matrix (ECM) degradation and remodeling. The data are consistent with the observation that mediators of cell-cell adhesion such as crude fibromodulin (COL14A1) and elastin (ELN) are among the top genes in this model. Early detection of ECM pathways from a blood draw could be used to identify individuals at risk for premature cervical remodeling. Such a screening test could be implemented at a similar window to ultrasound to measure cervical length in women at high risk.

相比之下,在非常早期早产模型(<25 0/7周)中获得的基因上进行的类似分析揭示,观察到与胰岛素样生长因子转运相关的途径和氨基酸代谢途径在非常早期sPTB中差异表达。胰岛素结合蛋白高度表达于胎儿和底蜕膜中,并且是IGF-1的生物利用度的关键调节剂,因此是胎儿生长的关键调节剂。例如,IGFBP1与子宫内生长限制和胎盘形成受损相关联,并且在来自极早产儿的脐带血中升高。胰岛素生长因子cfRNA的检测作为导致非常早期sPTB的妊娠中的主导信号是合理的,并且就下游事件而言具有潜在的提示意义。In contrast, similar analyses performed on genes obtained in a very early preterm birth model (<25 0/7 weeks) revealed that pathways related to insulin-like growth factor transport and amino acid metabolism pathways were observed to be differentially expressed in very early sPTB. Insulin binding protein is highly expressed in the fetus and decidua basalis and is a key regulator of the bioavailability of IGF-1 and, therefore, of fetal growth. For example, IGFBP1 has been associated with intrauterine growth restriction and impaired placentation and is elevated in cord blood from extremely premature infants. Detection of insulin growth factor cfRNA is reasonable as a dominant signal in pregnancies leading to very early sPTB and has potential suggestive significance with respect to downstream events.

实施例5:用于先兆子痫病例研究和cfRNA NGS数据校正的前瞻性队列对象Example 5: Prospective Cohort Subjects for Preeclampsia Case Study and cfRNA NGS Data Correction

使用本公开的系统和方法,从2020年7月至2022年4月,对经由目标社交媒体利用参与者直接(direct-to-participant)募集的无细胞RNA平台进行前瞻性的观察研究。IRB批准的研究对在美国的18至45岁的具有单胎妊娠的对象开放。参与者签署知情同意书,提供记录发布表格,完成简短的问卷,并且通过经由基于网络的平台安排的移动的静脉切开术提交血液样品。Using the systems and methods of the present disclosure, a prospective observational study of a cell-free RNA platform using direct-to-participant recruitment via targeted social media was conducted from July 2020 to April 2022. The IRB-approved study was open to subjects aged 18 to 45 with singleton pregnancies in the United States. Participants signed informed consent, provided a record release form, completed a brief questionnaire, and submitted a blood sample via mobile phlebotomy scheduled via a web-based platform.

参与者提交17和22周胎龄之间的血液样品。针对超过85%的参与者接收医疗记录。该队列是地理上和民族上多样的,代表跨30个州的1220个邮政编码。所有样品使用统一实验和计算的NGS管道处理以用于无细胞RNA(cfRNA)测序。Participants submitted blood samples between 17 and 22 weeks of gestational age. Medical records were received for more than 85% of participants. The cohort was geographically and ethnically diverse, representing 1,220 zip codes across 30 states. All samples were processed using a unified experimental and computational NGS pipeline for cell-free RNA (cfRNA) sequencing.

3036个样品具有完整的医疗记录并且通过了cfRNA测定质量指标。图6显示先兆子痫观察研究的人口统计学和临床数据指标,包括2701名健康参与者和335名被诊断为先兆子痫的参与者。3036 samples had complete medical records and passed the cfRNA assay quality indicators. Figure 6 shows demographic and clinical data indicators for the pre-eclampsia observational study, including 2701 healthy participants and 335 participants diagnosed with pre-eclampsia.

图6显示与发展先兆子痫的风险相关联的该队列中的人口统计学和临床因素的分布,这些因素是基于美国预防服务工作组(USPSTF)指南和推荐收集的。记录该队列的各种先兆子痫风险因素,包括:生育次数(parity);种族;慢性高血压(chtn);不包括妊娠期糖尿病的糖尿病状态(diabetic_not_gdm);母亲年龄;体重指数(bmi);既往先兆子痫诊断(pm_pe);美国预防服务工作组(USPSTF)风险水平(www.uspreventiveservicestaskforce.org/uspstf/recommendation/preecl ampsia-screening);和人工体外受精(IVF)。Figure 6 shows the distribution of demographic and clinical factors in the cohort associated with the risk of developing preeclampsia, which were collected based on the U.S. Preventive Services Task Force (USPSTF) guidelines and recommendations. Various preeclampsia risk factors were recorded for the cohort, including: parity; race; chronic hypertension (chtn); diabetes status excluding gestational diabetes (diabetic_not_gdm); maternal age; body mass index (bmi); previous diagnosis of preeclampsia (pm_pe); U.S. Preventive Services Task Force (USPSTF) risk level (www.uspreventiveservicestaskforce.org/uspstf/recommendation/preeclampsia-screening); and artificial in vitro fertilization (IVF).

为定义先兆子痫的不同亚型,使用了针对诊断为先兆子痫的患者的医学上规定的分娩(分娩_胎龄)的不同截止值。图7显示在同一队列的2889个样品针对小于38周时分娩的先兆子痫的人口统计学,包括2690名健康参与者和199名诊断为先兆子痫并在38周胎龄前分娩的参与者。For defining the different subtypes of pre-eclampsia, the different cut-off values of the medically prescribed childbirth (delivery-gestational age) for the patient diagnosed as pre-eclampsia were used. Fig. 7 is presented in 2889 samples of the same queue for the demography of the pre-eclampsia of childbirth less than 38 weeks, including 2690 healthy participants and 199 participants diagnosed as pre-eclampsia and giving birth before 38 weeks of gestational age.

图8显示在同一队列的2889个样品针对小于37周时分娩的先兆子痫的人口统计学,包括2780名健康参与者和109名诊断为先兆子痫并在37周胎龄前分娩的参与者。Figure 8 shows the demographics for pre-eclampsia delivered at less than 37 weeks of gestation for 2889 samples from the same cohort, including 2780 healthy participants and 109 participants diagnosed with pre-eclampsia and delivered before 37 weeks of gestational age.

通过NGS技术定量的血浆中的无细胞RNA(cfRNA)水平测量可以受到由于样品加工技术的系统变异的影响,这可以损害测量过程的准确度并且有助于使研究中的关联的性估计偏倚。在以成百上千个特征为特征的数据集中,将变异的系统性来源的贡献量化具有挑战性。Measurements of cell-free RNA (cfRNA) levels in plasma quantified by NGS techniques can be affected by systematic variation due to sample processing techniques, which can compromise the accuracy of the measurement process and contribute to biasing the sex estimates of associations under study. In datasets characterized by hundreds to thousands of features, quantifying the contribution of systematic sources of variation is challenging.

在3036个样品的cfRNA数据集中鉴定了若干种系统变异来源,并且应用了若干统计校正方法。这些校正技术包括若干方法:1)使用来自多元线性回归的残差来校正数据残差,2)基于当时仅可以校正一个协变量的经验贝叶斯方法进行ComBat方法;以及3)开发替代变量分析(SVA)以去除预先鉴定的变异性来源以及未知的变异性来源。与其他两种技术相比,使用来自多元线性回归的残差来校正不想要的协变量的影响的数据的校正方法证明了对完整数据集更好的校正。Several sources of systematic variation were identified in the cfRNA dataset of 3036 samples, and several statistical correction methods were applied. These correction techniques included several methods: 1) using residuals from multiple linear regression to correct data residuals, 2) performing the ComBat method based on an empirical Bayesian approach that can only correct one covariate at the time; and 3) developing a surrogate variable analysis (SVA) to remove pre-identified sources of variability as well as unknown sources of variability. Compared with the other two techniques, the correction method of the data using residuals from multiple linear regression to correct the effects of unwanted covariates demonstrated better correction for the full dataset.

通过对3036个样品的cfDNA数据集的分析鉴定了不同变异来源,并且通过执行不同方法成功地进行了校正。首先,将归因于NGS特异性方法的技术变异(如:每个样品的测序深度;个体处理操作的批次效应;或各种原材料)鉴定并校正。Different sources of variation were identified through analysis of a cfDNA dataset of 3036 samples and successfully corrected by implementing different approaches. First, technical variations attributable to NGS-specific methods (e.g., sequencing depth of each sample; batch effects of individual processing operations; or various raw materials) were identified and corrected.

此外,鉴定了与血液采集的时间相关的两个外部变异来源,并且通过多元线性回归将其进行校正以校正数据残差。图9A显示NGS数据的系统变异的示例,该NGS数据与在与免疫细胞基因相反的方向上运动的妊娠相关联基因和胎盘相关联基因的季节变化相关联。分析该基因集并确定其与在血液采集的当天时刻离血液采集地点最近的气象站记录的外界温度高度相关。图9B显示由基因建模预测的局部外部温度与由靠近血液采集地点的气象站记录的实际温度之间的高相关性的示例。In addition, two external sources of variation associated with the time of blood collection were identified and corrected by multiple linear regression to correct for data residuals. FIG. 9A shows an example of systematic variation in NGS data associated with seasonal changes in pregnancy-associated genes and placenta-associated genes moving in the opposite direction of immune cell genes. This gene set was analyzed and determined to be highly correlated with the outside temperature recorded by the weather station closest to the blood collection site at the time of the day of blood collection. FIG. 9B shows an example of a high correlation between the local outside temperature predicted by genetic modeling and the actual temperature recorded by a weather station close to the blood collection site.

进行附加的分析以鉴定与生物钟相关联的血液抽取/采集时间上的cfRNA变异,如图10A所示。按血液抽取的时间将样品分组成上午(6am至10am,n=121)、中午(10am至2pm,n=303)和下午(2pm至6pm,n=113)。进行差异基因表达(DGE)分析以阐明当天时刻的影响。使用edgeR包(v.3.38.1)拟合准似然负二项式广义对数线性模型来计数数据,并且用经验贝叶斯准似然F-检验(edgeR)在所有3个可能的成对比较中发现DGE。当在上午组和下午组之间寻找DGE时,确定5729个基因(即所有分析的基因的43%)是显著差异表达的(图10B)。对于上午组对比中午组,确定4278个基因(所有分析的基因的32%)为DEG;并且对于中午组对比下午组,确定15个基因(所有分析的基因的0.1%)为DGE。这三个集具有非常高的重叠,范围从73%至87%(p<10-30)。此外,在所报告的具有最高分离的基因之中与昼夜节律相关(例如,PER1、PER3、DDIT4、FKBP5、RBM3、SOCS1、BTG1和ARHGEF10L)。Additional analyses were performed to identify cfRNA variation in blood draw/collection time associated with the circadian clock, as shown in FIG10A . Samples were grouped into morning (6am to 10am, n=121), noon (10am to 2pm, n=303), and afternoon (2pm to 6pm, n=113) by the time of blood draw. Differential gene expression (DGE) analysis was performed to elucidate the effect of the time of day. Quasi-likelihood negative binomial generalized log-linear models were fitted to count data using the edgeR package (v.3.38.1), and DGEs were found in all 3 possible pairwise comparisons using the empirical Bayesian quasi-likelihood F-test (edgeR). When DGEs were sought between the morning and afternoon groups, 5729 genes (i.e., 43% of all genes analyzed) were determined to be significantly differentially expressed ( FIG10B ). For the morning group vs. the noon group, 4278 genes (32% of all analyzed genes) were determined to be DEGs; and for the noon group vs. the afternoon group, 15 genes (0.1% of all analyzed genes) were determined to be DGEs. These three sets have very high overlap, ranging from 73% to 87% (p < 10-30). In addition, among the genes reported to have the highest separation, they are related to circadian rhythms (e.g., PER1, PER3, DDIT4, FKBP5, RBM3, SOCS1, BTG1, and ARHGEF10L).

此外,使用类似的技术有效地回归与对象BMI、胎儿分数以及血液采集时的胎龄相关联的生物学变异以增加基因发现的能力。In addition, similar techniques were used to efficiently regress biological variation associated with subject BMI, fetal fraction, and gestational age at blood collection to increase the power of gene discovery.

实施例6:使用cfRNA谱分析对来自3036人前瞻性试验的先兆子痫和先兆子痫分子亚型的早期预测Example 6: Early prediction of preeclampsia and preeclampsia molecular subtypes from a 3036-person prospective trial using cfRNA profiling

使用本公开的系统和方法,使用实施例5中所述的3036例的前瞻性队列,在母体血液样品中鉴定先兆子痫的早期差异表达分子标志物。使用统一实验和计算的NGS管道处理所有样品以用于无细胞(cfRNA)测序。将双胎妊娠、自发性早产和/或基于非先兆子痫诊断的早产排除。Using the systems and methods of the present disclosure, a prospective cohort of 3036 cases described in Example 5 was used to identify early differentially expressed molecular markers of pre-eclampsia in maternal blood samples. All samples were processed for cell-free (cfRNA) sequencing using a unified experimental and computational NGS pipeline. Twin pregnancies, spontaneous preterm births, and/or preterm births based on non-pre-eclampsia diagnoses were excluded.

研究设计进行如下。使用了两种方法。在第一种方法中,根据先兆子痫诊断的严重性将具有先兆子痫病例的队列分组,采用医学诱导的分娩以降低女性复合不良母体结果的风险。来自队列的严重早产先兆子痫病例按照在小于38周或37周胎龄分娩来分组,分别如图8和图9所示。在第二种方法中,在进行早产先兆子痫病例的特征发现时,添加观察到的基于USPSTF的主要临床因素作为候选特征,如图7所示。The study design was carried out as follows. Two methods were used. In the first method, the cohort with pre-eclampsia cases was grouped according to the severity of the pre-eclampsia diagnosis, and medically induced delivery was used to reduce the risk of composite adverse maternal outcomes in women. Severe pre-term pre-eclampsia cases from the cohort were grouped according to delivery at less than 38 weeks or 37 weeks of gestational age, as shown in Figures 8 and 9, respectively. In the second method, when performing feature discovery of pre-term pre-eclampsia cases, the main clinical factors based on USPSTF observed were added as candidate features, as shown in Figure 7.

特征发现的一种方法是确定独立性筛选,其确保特征彼此正交以捕获数据中的最大变异。对于整个数据集使用该方法,对于在小于38周胎龄分娩,分析早产先兆子痫信号。图11A和图11B显示跨重复交叉验证的与发展早产先兆子痫的高风险相关联的基因的发现率的示例。One approach to feature discovery is to determine independence screening, which ensures that features are orthogonal to each other to capture the maximum variation in the data. Using this approach for the entire data set, the preterm pre-eclampsia signal was analyzed for deliveries at less than 38 weeks gestational age. Figures 11A and 11B show examples of the discovery rate of genes associated with a high risk of developing preterm pre-eclampsia across repeated cross validation.

表5提供了通过确定独立性筛选发现的差异表达基因的列表,作为预测具有在小于38周的医学规定分娩的早产先兆子痫的分子亚型。类似地,表6提供了通过确定独立性筛选发现的差异表达基因的列表,作为指示具有在小于37周胎龄分娩的医学规定分娩的早产先兆子痫的分子亚型。Table 5 provides a list of differentially expressed genes found by determining independence screening as predicting molecular subtypes of preterm pre-eclampsia with medically prescribed delivery at less than 38 weeks. Similarly, Table 6 provides a list of differentially expressed genes found by determining independence screening as indicating molecular subtypes of preterm pre-eclampsia with medically prescribed delivery at less than 37 weeks gestational age.

表5.通过确定独立性筛选发现的预测在小于38周分娩的早产先兆子痫差异表达基因Table 5. Differentially expressed genes predicted for preterm preeclampsia at delivery less than 38 weeks found by ascertainment independence screening

表6.通过确定独立性筛选发现的预测在小于37周分娩的早产先兆子痫差异表达基因Table 6. Differentially expressed genes predicted for preterm preeclampsia at delivery less than 37 weeks found by ascertainment independence screening

PAPPA2PAPPA2 KRT18KRT18 SELENOPSELENOP TOMM5TOMM5 PAPPAPAPPA ATP5EATP5E APOBAPOB KRT8KRT8 GID4GID4 VAT1LVAT1L KISS1KISS1

用于特征发现的可替代方法是在针对多种变量校正的计数空间中筛选差异表达基因。示例通过BMI或总计数校正,或通过个体基因(诸如KRT7或SVEP1)校正。如果特征通过多个校正的p值阈值,则它们被保留,并且可以被显示为使用Aikake信息准则(AIC<-2)向模型添加值。图12A-图12B描绘了对于分别在小于38周或小于37周分娩的早产先兆子痫病例所发现的所有基因标志物和临床因素进行这种类型的建模的示例性结果。表7和表8提供了通过这种方法对于分别在小于38周或小于37周分娩的早产先兆子痫病例所发现的所有基因标志物的列表。The alternative method for feature discovery is to screen differentially expressed genes in the count space corrected for multiple variables.Example is corrected by BMI or total count, or corrected by individual genes (such as KRT7 or SVEP1).If feature passes through the p-value threshold of multiple corrections, then they are retained, and can be displayed as using Aikake information criterion (AIC<-2) to add value to the model.Figure 12 A-Figure 12 B depicts the exemplary results of this type of modeling for all gene markers and clinical factors found for the premature pre-eclampsia cases less than 38 weeks or less than 37 weeks of childbirth respectively.Table 7 and Table 8 provide the list of all gene markers found for the premature pre-eclampsia cases less than 38 weeks or less than 37 weeks of childbirth respectively by this method.

表7.通过多空间校正鉴定的在小于38周分娩的早产先兆子痫差异表达基因Table 7. Differentially expressed genes in preterm preeclampsia born at less than 38 weeks identified by multi-space calibration

PAPPA2PAPPA2 SVEP1SVEP1 KRT7KRT7 VSIG4VSIG4 APOBAPOB FGBFGB FCGBPFCGBP CD163CD163 TCHHTCH AXLAXL ETV5ETV5 LYVE1LYVE1 IGSF21IGSF21 MACROD2MACROD2 FGGFGG C1QCQ1C PPP1R14APPP1R14A APOEAPOE THRBTHRB APOHAPOH EFHD1EFHD1 TMEM176BTMEM176B FABP1FABP1 FPR3FPR3 KISS1KISS1 ALBALB NEK11NEK11 FGBFGB BCORBCOR AP1B1AP1B1 FGAFGA

表8.通过多空间校正鉴定的在小于37周分娩的早产先兆子痫基因Table 8. Preterm preeclampsia genes identified by multi-spatial calibration for delivery at less than 37 weeks

基于这些特征的建模使得能够早期预测早产先兆子痫,产生高至0.86的AUC(对于在小于36周分娩的早产先兆子痫病例)。图13显示了预测在小于37周分娩的PE病例的模型具有ROC平均为0.83的ROC曲线值的曲线下面积(AUC)的示例。Modeling based on these features enabled early prediction of preterm pre-eclampsia, yielding AUCs as high as 0.86 (for preterm pre-eclampsia cases delivered at less than 36 weeks). Figure 13 shows an example of an area under the curve (AUC) for a model predicting PE cases delivered at less than 37 weeks with an ROC mean of 0.83.

实施例7:使用分子亚分型和cfRNA谱分析对3036人的前瞻性队列中的自发性早产的早期预测Example 7: Early prediction of spontaneous preterm birth in a prospective cohort of 3036 people using molecular subtyping and cfRNA profiling

使用本公开的系统和方法,在来自实施例5中所述的3036名参与者的前瞻性队列的母体血液样品中鉴定早产(PTB)的早期差异表达分子标志物。使用统一的实验和计算NGS管道处理所有样品以用于无细胞(cfRNA)测序。将双胎妊娠、基于先兆子痫诊断的医学诱导分娩的早产病例和/或其他医学诱导分娩排除。Using the systems and methods of the present disclosure, early differentially expressed molecular markers of preterm birth (PTB) were identified in maternal blood samples from a prospective cohort of 3036 participants described in Example 5. All samples were processed for cell-free (cfRNA) sequencing using a unified experimental and computational NGS pipeline. Twin pregnancies, cases of preterm birth with medically induced labor based on a diagnosis of preeclampsia, and/or other medically induced labor were excluded.

为鉴定自发性早产的差异表达基因标志物,针对自发性早产的两种不同分子亚型进行两种类型的差异基因表达分析,这两种不同分子亚型定义为在小于35周分娩或在小于37周分娩。To identify differentially expressed gene markers for spontaneous preterm birth, two types of differential gene expression analyses were performed for two different molecular subtypes of spontaneous preterm birth, defined as birth at less than 35 weeks or birth at less than 37 weeks.

首先,针对在35周胎龄之前分娩的55个自发性早产病例和在37周胎龄之后分娩的2899个足月产对照案例进行Spearman和DESeq2差异基因表达分析。图14显示了在35周胎龄之前分娩的早产病例中差异表达基因的差异基因表达信号的分位数-分位数(QQ)图。表9显示了通过Spearman排位分析的前5个差异表达基因的集,用于预测早于35周妊娠分娩的自发性早产病例。First, Spearman and DESeq2 differential gene expression analysis was performed for 55 spontaneous preterm birth cases delivered before 35 weeks of gestational age and 2899 term birth control cases delivered after 37 weeks of gestational age. Figure 14 shows the quantile-quantile (QQ) graph of differential gene expression signals of differentially expressed genes in premature birth cases delivered before 35 weeks of gestational age. Table 9 shows the set of the top 5 differentially expressed genes analyzed by Spearman ranking for predicting spontaneous preterm birth cases delivered before 35 weeks of gestation.

表9.使用Spearman排位分析预测早于35周分娩的自发性早产病例的前5个差异表达基因的集Table 9. Set of top 5 differentially expressed genes predicting spontaneous preterm birth before 35 weeks of delivery using Spearman rank analysis

指数index 基因Gene P值P-value 11 ZNF812PZNF812P 0.014181080.01418108 22 FPR3FPR3 0.0208416430.020841643 33 LILRB5LILRB5 0.0312395040.031239504 44 CLEC9ACLEC9A 0.0391390740.039139074 55 ETV5ETV5 0.0437370940.043737094

表10显示了通过DESeq2差异基因表达分析来预测早于35周妊娠分娩的自发性早产病例的前43个差异表达基因的集。Table 10 shows the set of top 43 differentially expressed genes predicting spontaneous preterm birth cases before 35 weeks of gestation by DESeq2 differential gene expression analysis.

表10.使用DESeq2差异基因表达预测早于35周分娩的自发性早产病例的前43个差异表达基因的集Table 10. Set of top 43 differentially expressed genes predicting spontaneous preterm birth cases before 35 weeks of delivery using DESeq2 differential gene expression

指数index 基因Gene P值P-value 11 LYVE1LYVE1 1.66E-121.66E-12 22 TCHHTCH 3.24E-083.24E-08 33 CD163CD163 1.68E-071.68E-07 44 DUSP27DUSP27 1.91E-071.91E-07 55 VSIG4VSIG4 4.21E-074.21E-07 66 MRC1MRC1 2.41E-062.41E-06 77 GPR82GPR82 4.81E-064.81E-06 88 GPR34GPR34 5.36E-055.36E-05 99 MSR1MSR1 5.74E-055.74E-05 1010 RNASE1RNASE1 7.74E-057.74E-05 1111 TREM2TREM2 7.86E-057.86E-05 1212 FPR3FPR3 1.31E-041.31E-04 1313 FCGBPFCGBP 1.44E-041.44E-04 1414 MERTKMERTK 1.71E-041.71E-04 1515 STAB1STAB1 2.13E-042.13E-04 1616 LILRB5LILRB5 2.29E-042.29E-04 1717 CADM1CADM1 3.60E-043.60E-04 1818 SELENOPSELENOP 4.75E-044.75E-04 1919 NCKAP5NCKAP5 0.001552760.00155276 2020 C1QCQ1C 0.001605020.00160502 21twenty one C1QBQ1Q 0.001827010.00182701 22twenty two DBNDD2DBNDD2 0.001873670.00187367 23twenty three MMP2MMP2 0.002874750.00287475 24twenty four AXLAXL 0.003178780.00317878 2525 GATMGATM 0.003872930.00387293 2626 FOLR2FOLR2 0.004122760.00412276 2727 ZNF812PZNF812P 0.005309280.00530928 2828 ETV5ETV5 0.005467250.00546725 2929 MTCO1P12MTCO1P12 0.006371430.00637143 3030 CLEC9ACLEC9A 0.006962170.00696217 3131 MMP14MMP14 0.007593310.00759331 3232 CD209CD209 0.00803830.0080383 3333 ZFHX3ZFHX3 0.008680130.00868013 3434 EPS8EPS8 0.009331790.00933179 3535 SLCO2B1SLCO2B1 0.010866370.01086637 3636 OLFML3OLFML3 0.015623140.01562314 3737 CD276CD276 0.01706610.0170661 3838 CABLES1CABLES1 0.019589570.01958957 3939 SDC3SDC3 0.019770680.01977068 4040 MAFMAF 0.021089060.02108906 4141 RGL1RGL1 0.032715220.03271522 4242 SPRED1SPRED1 0.043269070.04326907 4343 UNC13CUNC13C 0.049522780.04952278

对在37周胎龄之前自发性出生的135个自发性早产病例和2899个在37周胎龄之后分娩的足月产对照案例进行用DESeq2差异基因表达的类似分析。图15显示了在37周胎龄之前分娩的早产病例中差异表达基因的差异基因表达信号的分位数-分位数(QQ)图。A similar analysis of differential gene expression using DESeq2 was performed on 135 spontaneous preterm birth cases that were spontaneously born before 37 weeks of gestational age and 2899 term control cases that were delivered after 37 weeks of gestational age. Figure 15 shows a quantile-quantile (QQ) plot of differential gene expression signals for differentially expressed genes in preterm birth cases delivered before 37 weeks of gestational age.

表11显示了通过DESeq2差异基因表达分析来预测早于37周妊娠分娩的自发性早产病例的前12个差异表达基因的集。Table 11 shows the set of top 12 differentially expressed genes predicting spontaneous preterm birth cases before 37 weeks of gestation by DESeq2 differential gene expression analysis.

表11.使用DESeq2差异基因表达预测早于37周分娩的自发性早产病例的前12个差异表达基因的集Table 11. Set of top 12 differentially expressed genes predicting spontaneous preterm birth cases before 37 weeks of delivery using DESeq2 differential gene expression

指数index 基因Gene P值P-value 11 TCHHTCH 2.15E-062.15E-06 22 LYVE1LYVE1 3.15E-063.15E-06 33 GPR82GPR82 2.65E-052.65E-05 44 CD163CD163 1.45E-041.45E-04 55 SAA2SAA2 0.0011620.001162 66 DUSP27DUSP27 0.0013760.001376 77 CSRP3CSRP3 0.001410.00141 88 VSIG4VSIG4 0.0021360.002136 99 GPR34GPR34 0.0094530.009453 1010 MRC1MRC1 0.0152190.015219 1111 FPR3FPR3 0.0257910.025791 1212 ADGRG7ADGRG7 0.0360840.036084

虽然已在本文显示和描述了本发明的优选实施方案,但对于本领域技术人员来说显而易见的是,此类实施方案仅作为实例提供。这并不意味着本发明受说明书中提供的具体实例的限制。虽然已经参考前述说明书描述了本发明,但本文的实施方案的描述和图示并不意味着以限制的意义来解释。在不脱离本发明的情况下,本领域技术人员会构思到许多变化、改变和替换。此外,应当理解,本发明的所有方面不限于本文所阐述的特定描述、配置或相对比例,其取决于各种条件和变量。应当理解,在实施本发明时,可以采用本文描述的本发明实施方案的各种替代物。因此,预期本发明还应涵盖任何此类替代物、修改、变化或等同物。以下权利要求旨在限定本发明的范围,并且由此覆盖这些权利要求及其等同物范围内的方法和结构。Although preferred embodiments of the present invention have been shown and described herein, it is obvious to those skilled in the art that such embodiments are provided only as examples. This does not mean that the present invention is limited by the specific examples provided in the specification. Although the present invention has been described with reference to the foregoing description, the description and illustration of the embodiments herein are not meant to be interpreted in a limiting sense. Without departing from the present invention, those skilled in the art will conceive of many changes, modifications and substitutions. In addition, it should be understood that all aspects of the present invention are not limited to the specific description, configuration or relative proportions set forth herein, which depend on various conditions and variables. It should be understood that in the implementation of the present invention, various alternatives of the embodiments of the present invention described herein may be adopted. Therefore, it is contemplated that the present invention should also cover any such alternatives, modifications, variations or equivalents. The following claims are intended to define the scope of the present invention, and thus cover methods and structures within the scope of these claims and their equivalents.

Claims (72)

1.一种用于鉴定妊娠对象的妊娠相关状态的存在或升高的风险的方法,所述方法包括测定来源于所述妊娠对象的无细胞生物样品以检测生物标志物集,以及使用经训练算法处理所述生物标志物集或针对参考值处理所述生物标志物集以确定至少三种不同的妊娠相关状态的集之中的所述妊娠相关状态的所述存在或升高的风险。1. A method for identifying the presence or increased risk of a pregnancy-related state in a pregnant subject, the method comprising assaying a cell-free biological sample derived from the pregnant subject to detect a set of biomarkers, and processing the set of biomarkers using a trained algorithm or processing the set of biomarkers against reference values to determine the presence or increased risk of the pregnancy-related state in a set of at least three different pregnancy-related states. 2.如权利要求1所述的方法,其中所述妊娠相关状态选自早产、足月产、胎龄、预产期、分娩发作、妊娠相关高血压病症、先兆子痫、子痫、妊娠期糖尿病、所述妊娠对象的胎儿的先天性病症、异位妊娠、自然流产、死产、产后并发症、妊娠剧吐、分娩时出血或流血过多、胎膜早破、早产胎膜早破、前置胎盘、宫内/胎儿生长受限、巨大儿、新生儿状况,以及胎儿发育阶段或状态。2. The method of claim 1, wherein the pregnancy-related condition is selected from preterm birth, term birth, gestational age, due date, onset of labor, pregnancy-related hypertensive disorders, pre-eclampsia, eclampsia, gestational diabetes, congenital disorders of the fetus of the pregnant subject, ectopic pregnancy, spontaneous abortion, stillbirth, postpartum complications, hyperemesis gravidarum, hemorrhage or excessive bleeding during labor, premature rupture of membranes, preterm premature rupture of membranes, placenta previa, intrauterine/fetal growth restriction, macrosomia, neonatal condition, and fetal development stage or state. 3.如权利要求1所述的方法,其中所述妊娠相关状态是早产亚型,并且其中所述至少三种不同的妊娠相关状态包括至少两种不同的早产亚型。3. The method of claim 1, wherein the pregnancy-related condition is a subtype of preterm birth, and wherein the at least three different pregnancy-related conditions include at least two different subtypes of preterm birth. 4.如权利要求3所述的方法,其中所述早产亚型是早产分子亚型,并且其中所述至少两种不同的早产亚型包括至少两种不同的早产分子亚型。4. The method of claim 3, wherein the preterm birth subtype is a preterm birth molecular subtype, and wherein the at least two different preterm birth subtypes comprise at least two different preterm birth molecular subtypes. 5.如权利要求4所述的方法,其中所述早产分子亚型选自既往的早产、自发性早产、种族特异性早产风险和早产胎膜早破(PPROM)的病史。5. The method of claim 4, wherein the molecular subtype of preterm birth is selected from a history of previous preterm birth, spontaneous preterm birth, ethnic-specific risk of preterm birth, and preterm premature rupture of membranes (PPROM). 6.如权利要求5所述的方法,其中所述早产分子亚型是自发性早产,并且其中所述生物标志物集包含与自发性早产相关联的基因组位点。6. The method of claim 5, wherein the preterm birth molecular subtype is spontaneous preterm birth, and wherein the biomarker set comprises genomic loci associated with spontaneous preterm birth. 7.如权利要求6所述的方法,其中与自发性早产相关联的所述基因组位点选自表1中列出的基因、表2中列出的基因、表3中列出的基因、对应于表4中列出的途径的基因、表9中列出的基因、表10中列出的基因以及表11中列出的基因。7. The method of claim 6, wherein the genomic loci associated with spontaneous preterm birth are selected from the group consisting of genes listed in Table 1, genes listed in Table 2, genes listed in Table 3, genes corresponding to pathways listed in Table 4, genes listed in Table 9, genes listed in Table 10, and genes listed in Table 11. 8.如权利要求6所述的方法,其中所述自发性早产包括在小于25周分娩、在小于26周分娩、在小于27周分娩、在小于28周分娩、在小于29周分娩、在小于30周分娩、在小于31周分娩、在小于32周分娩、在小于33周分娩、在小于34周分娩、在小于35周分娩、在小于36周分娩、在小于37周分娩或在小于38周分娩。8. The method of claim 6, wherein the spontaneous preterm birth comprises delivering at less than 25 weeks, delivering at less than 26 weeks, delivering at less than 27 weeks, delivering at less than 28 weeks, delivering at less than 29 weeks, delivering at less than 30 weeks, delivering at less than 31 weeks, delivering at less than 32 weeks, delivering at less than 33 weeks, delivering at less than 34 weeks, delivering at less than 35 weeks, delivering at less than 36 weeks, delivering at less than 37 weeks, or delivering at less than 38 weeks. 9.如权利要求4所述的方法,进一步包括至少部分地基于所述早产分子亚型的所述存在或升高的风险来鉴定所述妊娠对象的临床干预。9. The method of claim 4, further comprising identifying a clinical intervention for the pregnant subject based at least in part on the presence or increased risk of the preterm birth molecular subtype. 10.如权利要求9所述的方法,其中所述临床干预选自多个临床干预。10. The method of claim 9, wherein the clinical intervention is selected from a plurality of clinical interventions. 11.如权利要求9所述的方法,其中所述临床干预包括药物、补充剂、生活方式建议、宫颈环扎、宫颈子宫托或电收缩抑制。11. The method of claim 9, wherein the clinical intervention comprises medication, supplementation, lifestyle advice, cervical cerclage, cervical pessary, or electrical contraction inhibition. 12.如权利要求11所述的方法,其中所述药物选自黄体酮、红霉素、宫缩抑制药物、皮质类固醇、阴道菌群和抗氧化剂。12. The method of claim 11, wherein the drug is selected from the group consisting of progesterone, erythromycin, tocolytic drugs, corticosteroids, vaginal flora, and antioxidants. 13.如权利要求1所述的方法,其中所述妊娠相关状态是先兆子痫亚型,并且其中所述至少三种不同的妊娠相关状态包括至少两种不同的先兆子痫亚型。13. The method of claim 1, wherein the pregnancy-related condition is a subtype of pre-eclampsia, and wherein the at least three different pregnancy-related conditions include at least two different subtypes of pre-eclampsia. 14.如权利要求13所述的方法,其中所述先兆子痫亚型是先兆子痫分子亚型,并且其中所述至少两种不同的先兆子痫亚型包括至少两种不同的先兆子痫分子亚型。14. The method of claim 13, wherein the pre-eclampsia subtype is a pre-eclampsia molecular subtype, and wherein the at least two different pre-eclampsia subtypes comprise at least two different pre-eclampsia molecular subtypes. 15.如权利要求14所述的方法,其中所述先兆子痫分子亚型选自慢性或原有高血压的病史、妊娠高血压的存在或病史、轻度先兆子痫的存在或病史、重度先兆子痫的存在或病史、子痫的存在或病史,以及HELLP综合征的存在或病史。15. The method of claim 14, wherein the pre-eclampsia molecular subtype is selected from a history of chronic or pre-existing hypertension, the presence or history of gestational hypertension, the presence or history of mild pre-eclampsia, the presence or history of severe pre-eclampsia, the presence or history of eclampsia, and the presence or history of HELLP syndrome. 16.如权利要求15所述的方法,其中所述先兆子痫分子亚型是早产先兆子痫,并且其中所述生物标志物集包含与早产先兆子痫相关联的基因组位点。16. The method of claim 15, wherein the pre-eclampsia molecular subtype is pre-term pre-eclampsia, and wherein the biomarker set comprises genomic loci associated with pre-term pre-eclampsia. 17.如权利要求16所述的方法,其中与早产先兆子痫相关联的所述基因组位点选自表5中列出的基因、表6中列出的基因、表7中列出的基因以及表8中列出的基因。17. The method of claim 16, wherein the genomic loci associated with preterm pre-eclampsia are selected from the genes listed in Table 5, the genes listed in Table 6, the genes listed in Table 7, and the genes listed in Table 8. 18.如权利要求16所述的方法,其中所述早产先兆子痫包括在小于25周分娩、在小于26周分娩、在小于27周分娩、在小于28周分娩、在小于29周分娩、在小于30周分娩、在小于31周分娩、在小于32周分娩、在小于33周分娩、在小于34周分娩、在小于35周分娩、在小于36周分娩、在小于37周分娩或在小于38周分娩。18. The method of claim 16, wherein the preterm pre-eclampsia comprises delivery at less than 25 weeks, delivery at less than 26 weeks, delivery at less than 27 weeks, delivery at less than 28 weeks, delivery at less than 29 weeks, delivery at less than 30 weeks, delivery at less than 31 weeks, delivery at less than 32 weeks, delivery at less than 33 weeks, delivery at less than 34 weeks, delivery at less than 35 weeks, delivery at less than 36 weeks, delivery at less than 37 weeks, or delivery at less than 38 weeks. 19.如权利要求14所述的方法,进一步包括至少部分地基于所述先兆子痫分子亚型的所述存在或升高的风险来鉴定所述妊娠对象的临床干预。19. The method of claim 14, further comprising identifying a clinical intervention for the pregnant subject based at least in part on the presence or increased risk of the pre-eclampsia molecular subtype. 20.如权利要求19所述的方法,其中所述临床干预选自多个临床干预。20. The method of claim 19, wherein the clinical intervention is selected from a plurality of clinical interventions. 21.如权利要求19所述的方法,其中所述临床干预包括药物、补充剂或生活方式建议。21. The method of claim 19, wherein the clinical intervention comprises medication, supplementation, or lifestyle advice. 22.如权利要求21所述的方法,其中所述药物选自阿司匹林、黄体酮、硫酸镁、胆固醇药物、胃灼热药物、血管紧张素II受体拮抗剂、钙通道阻滞剂、糖尿病药物和勃起功能障碍药物。22. The method of claim 21, wherein the drug is selected from the group consisting of aspirin, progesterone, magnesium sulfate, cholesterol medications, heartburn medications, angiotensin II receptor antagonists, calcium channel blockers, diabetes medications, and erectile dysfunction medications. 23.如权利要求1所述的方法,其中所述妊娠相关状态是妊娠期糖尿病亚型,并且其中所述至少三种不同的妊娠相关状态包括至少两种不同的妊娠期糖尿病亚型。23. The method of claim 1, wherein the pregnancy-related condition is a subtype of gestational diabetes, and wherein the at least three different pregnancy-related conditions include at least two different subtypes of gestational diabetes. 24.如权利要求23所述的方法,其中所述妊娠期糖尿病亚型是妊娠期糖尿病分子亚型,并且其中所述至少两种不同的妊娠期糖尿病亚型包括至少两种不同的妊娠期糖尿病分子亚型。24. The method of claim 23, wherein the gestational diabetes subtype is a gestational diabetes molecular subtype, and wherein the at least two different gestational diabetes subtypes comprise at least two different gestational diabetes molecular subtypes. 25.如权利要求23所述的方法,其中所述生物标志物集包含与妊娠期糖尿病相关联的基因组位点。25. The method of claim 23, wherein the biomarker set comprises genomic loci associated with gestational diabetes. 26.如权利要求25所述的方法,其中与妊娠期糖尿病相关联的所述基因组位点选自PDK4、CSH1、PLAC4、TBCEL和FBXO7。26. The method of claim 25, wherein the genomic loci associated with gestational diabetes are selected from PDK4, CSH1, PLAC4, TBCEL, and FBXO7. 27.如权利要求24所述的方法,进一步包括至少部分地基于所述妊娠期糖尿病分子亚型的所述存在或升高的风险来鉴定所述妊娠对象的临床干预。27. The method of claim 24, further comprising identifying a clinical intervention for the pregnant subject based at least in part on the presence or increased risk of the gestational diabetes molecular subtype. 28.如权利要求27所述的方法,其中所述临床干预选自多个临床干预。28. The method of claim 27, wherein the clinical intervention is selected from a plurality of clinical interventions. 29.如权利要求27所述的方法,其中所述临床干预包括药物、补充剂、生活方式建议、宫颈环扎、宫颈子宫托或电收缩抑制。29. The method of claim 27, wherein the clinical intervention comprises medication, supplementation, lifestyle advice, cervical cerclage, cervical pessary, or electrical contraction inhibition. 30.如权利要求1所述的方法,其中所述生物标志物集包含至少5个不同的基因组位点、至少10个不同的基因组位点、至少15个不同的基因组位点、至少20个不同的基因组位点、至少25个不同的基因组位点、至少30个不同的基因组位点、至少35个不同的基因组位点、至少40个不同的基因组位点、至少45个不同的基因组位点、至少50个不同的基因组位点、至少100个不同的基因组位点、至少150个不同的基因组位点或至少200个不同的基因组位点。30. The method of claim 1, wherein the biomarker set comprises at least 5 different genomic loci, at least 10 different genomic loci, at least 15 different genomic loci, at least 20 different genomic loci, at least 25 different genomic loci, at least 30 different genomic loci, at least 35 different genomic loci, at least 40 different genomic loci, at least 45 different genomic loci, at least 50 different genomic loci, at least 100 different genomic loci, at least 150 different genomic loci, or at least 200 different genomic loci. 31.如权利要求1所述的方法,其中所述测定包括使用来源于所述无细胞生物样品的无细胞核糖核酸(cfRNA)分子来生成转录组学数据、使用来源于所述无细胞生物样品的转录产物来生成转录产物数据、使用来源于所述无细胞生物样品的无细胞脱氧核糖核酸(cfDNA)分子来生成基因组数据和/或甲基化数据、使用来源于所述第一无细胞生物样品的蛋白质来生成蛋白质组学数据,或使用来源于所述第一无细胞生物样品的代谢物来生成代谢组学数据。31. The method of claim 1, wherein the determining comprises generating transcriptomic data using cell-free ribonucleic acid (cfRNA) molecules derived from the cell-free biological sample, generating transcript data using transcription products derived from the cell-free biological sample, generating genomic data and/or methylation data using cell-free deoxyribonucleic acid (cfDNA) molecules derived from the cell-free biological sample, generating proteomic data using proteins derived from the first cell-free biological sample, or generating metabolomics data using metabolites derived from the first cell-free biological sample. 32.如权利要求1所述的方法,其中所述无细胞生物样品选自无细胞核糖核酸(cfRNA)、无细胞脱氧核糖核酸(cfDNA)、无细胞胎儿DNA(cffDNA)、血浆、血清、尿液、唾液、羊水及其衍生物。32. The method of claim 1, wherein the cell-free biological sample is selected from the group consisting of cell-free ribonucleic acid (cfRNA), cell-free deoxyribonucleic acid (cfDNA), cell-free fetal DNA (cffDNA), plasma, serum, urine, saliva, amniotic fluid, and derivatives thereof. 33.如权利要求1所述的方法,其中所述无细胞生物样品是使用乙二胺四乙酸(EDTA)收集管、无细胞RNA收集管或无细胞脱氧核糖核酸(DNA)收集管而获得自或来源于所述妊娠对象。33. The method of claim 1, wherein the cell-free biological sample is obtained or derived from the pregnant subject using an ethylenediaminetetraacetic acid (EDTA) collection tube, a cell-free RNA collection tube, or a cell-free deoxyribonucleic acid (DNA) collection tube. 34.如权利要求1所述的方法,进一步包括对所述妊娠对象的全血样品进行分级分离以获得所述无细胞生物样品。34. The method of claim 1, further comprising fractionating a whole blood sample from the pregnant subject to obtain the cell-free biological sample. 35.如权利要求1所述的方法,其中所述测定包括无细胞核糖核酸(cfRNA)测定或代谢组学测定。35. The method of claim 1, wherein the assay comprises a cell-free RNA (cfRNA) assay or a metabolomics assay. 36.如权利要求35所述的方法,其中所述代谢组学测定包括靶向质谱(MS)或免疫测定。36. The method of claim 35, wherein the metabolomics assay comprises targeted mass spectrometry (MS) or immunoassay. 37.如权利要求1所述的方法,其中所述无细胞生物样品包括无细胞核糖核酸(cfRNA)或尿液。37. The method of claim 1, wherein the cell-free biological sample comprises cell-free RNA (cfRNA) or urine. 38.如权利要求1所述的方法,其中所述测定包括定量聚合酶链反应(qPCR)。38. The method of claim 1, wherein the assay comprises quantitative polymerase chain reaction (qPCR). 39.如权利要求1所述的方法,其中所述测定包括被配置为在家庭环境中进行的家用测试。39. The method of claim 1, wherein the assay comprises a home test configured to be performed in a home environment. 40.如权利要求1所述的方法,其中所述经训练算法以至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%或至少约95%的灵敏度确定所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险。40. The method of claim 1, wherein the trained algorithm determines the presence or increased risk of the pregnancy-related condition in the pregnant subject with a sensitivity of at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, or at least about 95%. 41.如权利要求1所述的方法,其中所述经训练算法以至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%或至少约95%的阳性预测值(PPV)确定所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险。41. The method of claim 1, wherein the trained algorithm determines the presence or increased risk of the pregnancy-related condition in the pregnant subject with a positive predictive value (PPV) of at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, or at least about 95%. 42.如权利要求1所述的方法,其中所述经训练算法以至少约0.60、至少约0.65、至少约0.70、至少约0.75、至少约0.80、至少约0.85、至少约0.90或至少约0.95的曲线下面积(AUC)确定所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险。42. The method of claim 1, wherein the trained algorithm determines the presence or increased risk of the pregnancy-related condition in the pregnant subject with an area under the curve (AUC) of at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.85, at least about 0.90, or at least about 0.95. 43.如权利要求1所述的方法,其中所述妊娠对象针对所述妊娠相关状态是无症状的。43. The method of claim 1, wherein the pregnant subject is asymptomatic for the pregnancy-related condition. 44.如权利要求1所述的方法,其中所述经训练算法使用与所述妊娠相关状态的存在或升高的风险相关联的第一独立训练样品集和与所述妊娠相关状态的不存在或无升高的风险相关联的第二独立训练样品集进行训练。44. The method of claim 1, wherein the trained algorithm is trained using a first independent set of training samples associated with the presence or increased risk of the pregnancy-related state and a second independent set of training samples associated with the absence or no increased risk of the pregnancy-related state. 45.如权利要求1所述的方法,进一步包括使用所述经训练算法或另一经训练算法来处理所述妊娠对象的临床健康数据集,以确定所述妊娠相关状态的所述存在或升高的风险。45. The method of claim 1, further comprising processing a clinical health dataset of the pregnant subject using the trained algorithm or another trained algorithm to determine the presence or increased risk of the pregnancy-related condition. 46.如权利要求1所述的方法,进一步包括使所述无细胞生物样品经受足以分离、富集或提取核糖核酸(RNA)分子、脱氧核糖核酸(DNA)分子、蛋白质或代谢物的集的条件;并且其中所述测定包括分析RNA分子、DNA分子、蛋白质或代谢物的所述集。46. The method of claim 1, further comprising subjecting the cell-free biological sample to conditions sufficient to separate, enrich or extract a collection of ribonucleic acid (RNA) molecules, deoxyribonucleic acid (DNA) molecules, proteins or metabolites; and wherein the assay comprises analyzing the collection of RNA molecules, DNA molecules, proteins or metabolites. 47.如权利要求46所述的方法,进一步包括从所述无细胞生物样品提取核酸分子集,以及对所述核酸分子集进行测序以生成测序读数集。47. The method of claim 46, further comprising extracting a set of nucleic acid molecules from the cell-free biological sample, and sequencing the set of nucleic acid molecules to generate a set of sequencing reads. 48.如权利要求47所述的方法,其中所述测序包括大规模并行测序。48. The method of claim 47, wherein the sequencing comprises massively parallel sequencing. 49.如权利要求47所述的方法,其中所述测序包括核酸扩增。49. The method of claim 47, wherein the sequencing comprises nucleic acid amplification. 50.如权利要求49所述的方法,其中所述核酸扩增包括聚合酶链反应(PCR)。50. The method of claim 49, wherein the nucleic acid amplification comprises polymerase chain reaction (PCR). 51.如权利要求50所述的方法,其中所述测序包括使用逆转录(RT)和聚合酶链反应(PCR)。51. The method of claim 50, wherein the sequencing comprises using reverse transcription (RT) and polymerase chain reaction (PCR). 52.如权利要求46所述的方法,进一步包括使用探针,所述探针被配置为选择性地富集对应于一个或多个基因组位点的分组的所述核酸分子集。52. The method of claim 46, further comprising using a probe configured to selectively enrich the set of nucleic acid molecules corresponding to a grouping of one or more genomic sites. 53.如权利要求52所述的方法,其中所述探针是核酸引物。53. The method of claim 52, wherein the probe is a nucleic acid primer. 54.如权利要求52所述的方法,其中所述探针与所述一个或多个基因组位点的所述分组的核酸序列具有序列互补性。54. The method of claim 52, wherein the probes have sequence complementarity with nucleic acid sequences of the groupings of the one or more genomic loci. 55.如权利要求52所述的方法,其中所述一个或多个基因组位点的所述分组包括与自发性早产相关联的基因组位点,其中所述基因组位点选自表1中列出的基因、表2中列出的基因、表3中列出的基因、对应于表4中列出的途径的基因、表9中列出的基因、表10中列出的基因以及表11中列出的基因。55. The method of claim 52, wherein the grouping of the one or more genomic loci comprises genomic loci associated with spontaneous preterm birth, wherein the genomic loci are selected from the group consisting of genes listed in Table 1, genes listed in Table 2, genes listed in Table 3, genes corresponding to pathways listed in Table 4, genes listed in Table 9, genes listed in Table 10, and genes listed in Table 11. 56.如权利要求52所述的方法,其中所述一个或多个基因组位点的所述分组包括与早产先兆子痫相关联的基因组位点,其中所述基因组位点选自表5中列出的基因、表6中列出的基因、表7中列出的基因以及表8中列出的基因。56. A method as claimed in claim 52, wherein the grouping of the one or more genomic loci includes genomic loci associated with preterm pre-eclampsia, wherein the genomic loci are selected from the genes listed in Table 5, the genes listed in Table 6, the genes listed in Table 7, and the genes listed in Table 8. 57.如权利要求52所述的方法,其中所述一个或多个基因组位点的所述分组包括与妊娠期糖尿病相关联的基因组位点,其中所述基因组位点选自PDK4、CSH1、PLAC4、TBCEL和FBXO7。57. The method of claim 52, wherein the grouping of the one or more genomic loci comprises genomic loci associated with gestational diabetes, wherein the genomic loci are selected from PDK4, CSH1, PLAC4, TBCEL, and FBXO7. 58.如权利要求52所述的方法,其中所述一个或多个基因组位点的所述分组包含至少5个不同的基因组位点、至少10个不同的基因组位点、至少15个不同的基因组位点、至少20个不同的基因组位点、至少25个不同的基因组位点、至少30个不同的基因组位点、至少35个不同的基因组位点、至少40个不同的基因组位点、至少45个不同的基因组位点、至少50个不同的基因组位点、至少100个不同的基因组位点、至少150个不同的基因组位点或至少200个不同的基因组位点。58. The method of claim 52, wherein the grouping of the one or more genomic loci comprises at least 5 different genomic loci, at least 10 different genomic loci, at least 15 different genomic loci, at least 20 different genomic loci, at least 25 different genomic loci, at least 30 different genomic loci, at least 35 different genomic loci, at least 40 different genomic loci, at least 45 different genomic loci, at least 50 different genomic loci, at least 100 different genomic loci, at least 150 different genomic loci, or at least 200 different genomic loci. 59.如权利要求1所述的方法,其中所述无细胞生物样品在没有核酸分离、富集或提取的情况下进行处理。59. The method of claim 1, wherein the cell-free biological sample is processed without nucleic acid isolation, enrichment or extraction. 60.如权利要求1所述的方法,进一步包括生成包括指示所述妊娠相关状态的所述经确定的存在或升高的风险的电子报告。60. The method of claim 1, further comprising generating an electronic report including an indication of the determined presence or increased risk of the pregnancy-related condition. 61.如权利要求1所述的方法,进一步包括确定所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险的所述确定的似然性。61. The method of claim 1, further comprising determining the determined likelihood of the presence or increased risk of the pregnancy-related condition in the pregnant subject. 62.如权利要求1所述的方法,其中所述经训练算法包括经训练机器学习算法。62. The method of claim 1, wherein the trained algorithm comprises a trained machine learning algorithm. 63.如权利要求62所述的方法,其中所述经训练机器学习算法包括深度学习算法、支持向量机(SVM)、神经网络、随机森林、线性回归模型、逻辑回归模型或ANOVA模型。63. The method of claim 62, wherein the trained machine learning algorithm comprises a deep learning algorithm, a support vector machine (SVM), a neural network, a random forest, a linear regression model, a logistic regression model, or an ANOVA model. 64.如权利要求1所述的方法,进一步包括处理所述生物标志物集以减少系统变异。64. The method of claim 1, further comprising processing the biomarker set to reduce systematic variation. 65.如权利要求64所述的方法,其中减少所述系统变异包括使用来自多元线性回归的残差来校正数据残差、执行基于经验贝叶斯方法的ComBat方法以及执行替代变量分析(SVA)校正。65. The method of claim 64, wherein reducing the systematic variation comprises correcting data residuals using residuals from multiple linear regression, performing a ComBat method based on an empirical Bayesian approach, and performing a surrogate variable analysis (SVA) correction. 66.如权利要求64所述的方法,其中所述系统变异包括每个样品的测序深度、个体处理操作的批次效应、各种原材料的使用、样品收集的局部外部温度、所述对象的BMI、胎儿分数、样品收集时的胎儿胎龄或其组合。66. A method as claimed in claim 64, wherein the systematic variation includes the sequencing depth of each sample, batch effects of individual processing operations, the use of various raw materials, the local external temperature at sample collection, the BMI of the subject, fetal fraction, fetal gestational age at sample collection, or a combination thereof. 67.如权利要求1所述的方法,进一步包括监测所述妊娠相关状态的所述存在或升高的风险,其中所述监测包括在多个时间点评估所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险,其中所述评估至少基于在所述多个时间点中的每一个处确定的所述妊娠相关状态的所述存在或升高的风险。67. The method of claim 1, further comprising monitoring the presence or increased risk of the pregnancy-related condition, wherein the monitoring comprises assessing the pregnant subject for the presence or increased risk of the pregnancy-related condition at a plurality of time points, wherein the assessing is based at least on the presence or increased risk of the pregnancy-related condition determined at each of the plurality of time points. 68.如权利要求67所述的方法,其中所述多个时间点之间对所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险的所述评估的差异指示选自以下的一个或多个临床指征:(i)所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险的诊断,(ii)所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险的预后,以及(iii)治疗所述妊娠对象的所述妊娠相关状态的所述存在或升高的风险的疗程的有效性或无效性。68. The method of claim 67, wherein the difference in the assessment of the presence or increased risk of the pregnancy-associated condition for the pregnant subject between the multiple time points is indicative of one or more clinical indications selected from: (i) diagnosis of the presence or increased risk of the pregnancy-associated condition for the pregnant subject, (ii) prognosis of the presence or increased risk of the pregnancy-associated condition for the pregnant subject, and (iii) effectiveness or ineffectiveness of a course of treatment for the presence or increased risk of the pregnancy-associated condition for the pregnant subject. 69.如权利要求1所述的方法,其中所述妊娠对象处在妊娠的首三个月、妊娠的中间三个月或妊娠的末三个月。69. The method of claim 1, wherein the pregnant subject is in the first trimester, the second trimester, or the third trimester. 70.如权利要求1所述的方法,其中从妊娠对象和/或非妊娠对象确定所述参考值。70. The method of claim 1, wherein the reference value is determined from a pregnant subject and/or a non-pregnant subject. 71.如权利要求1所述的方法,其中针对所述参考值处理所述生物标志物集包括确定所述生物标志物集与所述参考值之间的差异。71. The method of claim 1, wherein processing the biomarker set against the reference value comprises determining a difference between the biomarker set and the reference value. 72.如权利要求1所述的方法,其中所述妊娠对象处在妊娠的首三个月、妊娠的中间三个月或妊娠的末三个月。72. The method of claim 1, wherein the pregnant subject is in the first trimester, the second trimester, or the third trimester.
CN202280087768.XA 2021-11-04 2022-11-03 Methods and systems for determining a pregnancy-related status of a subject Pending CN118510911A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US63/275,726 2021-11-04
US63/276,809 2021-11-08
US202163288044P 2021-12-10 2021-12-10
US63/288,044 2021-12-10
PCT/US2022/079235 WO2023081768A1 (en) 2021-11-04 2022-11-03 Methods and systems for determining a pregnancy-related state of a subject

Publications (1)

Publication Number Publication Date
CN118510911A true CN118510911A (en) 2024-08-16

Family

ID=92245562

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280087768.XA Pending CN118510911A (en) 2021-11-04 2022-11-03 Methods and systems for determining a pregnancy-related status of a subject

Country Status (1)

Country Link
CN (1) CN118510911A (en)

Similar Documents

Publication Publication Date Title
US11851706B2 (en) Methods and systems for determining a pregnancy-related state of a subject
US20230332229A1 (en) Methods and systems for determining a pregnancy-related state of a subject
Mirzakhani et al. Early pregnancy vitamin D status and risk of preeclampsia
EP3701043B1 (en) A noninvasive molecular clock for fetal development predicts gestational age and preterm delivery
Tarca et al. Maternal whole blood mRNA signatures identify women at risk of early preeclampsia: a longitudinal study
Devesa-Peiro et al. Breaking the ageing paradigm in endometrium: endometrial gene expression related to cilia and ageing hallmarks in women over 35 years
US9458495B2 (en) Methods and systems for assessment of clinical infertility
US20240352527A1 (en) Methods and systems for determining a pregnancy-related state of a subject
US20170351806A1 (en) Method for assessing fertility based on male and female genetic and phenotypic data
Kim et al. Maternal plasma miRNAs as potential biomarkers for detecting risk of small-for-gestational-age births
US20230135486A1 (en) Circulating rna signatures specific to preeclampsia
CN113348367A (en) Methods, systems and kits for predicting preterm labor status
JPWO2022036053A5 (en)
CN118510911A (en) Methods and systems for determining a pregnancy-related status of a subject
WO2024229244A2 (en) Methods and systems for determining a pregnancy-related state of a subject
Zhang et al. Preliminary RNA-microarray analysis of long non-coding RNA expression in abnormally invasive placenta
US20230410957A1 (en) Methods and systems for conducting pregnancy-related clinical trials
US20240150837A1 (en) Methods and systems for methylation profiling of pregnancy-related states
HK40064776A (en) Methods and systems for determining a pregnancy-related state of a subject
Strunk et al. Early pregnancy vitamin D status and risk of preeclampsia

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination