CN105512508B - Automatically generate the method and device of genetic test report - Google Patents
Automatically generate the method and device of genetic test report Download PDFInfo
- Publication number
- CN105512508B CN105512508B CN201410487501.7A CN201410487501A CN105512508B CN 105512508 B CN105512508 B CN 105512508B CN 201410487501 A CN201410487501 A CN 201410487501A CN 105512508 B CN105512508 B CN 105512508B
- Authority
- CN
- China
- Prior art keywords
- information
- genetic
- report
- gene
- detection report
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Apparatus Associated With Microorganisms And Enzymes (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
技术领域technical field
本发明涉及一种自动生成基因检测报告的方法及装置。The invention relates to a method and device for automatically generating a gene detection report.
背景技术Background technique
基因检测是从染色体结构、DNA序列、DNA变异位点或基因表达程度,提供给受检者与医疗研究人员评估一些与基因遗传有关的疾病、体质或个人特质的依据。Genetic testing is based on chromosome structure, DNA sequence, DNA variation site or gene expression level, which provides the basis for the subjects and medical researchers to evaluate some diseases, physical constitution or personal characteristics related to genetic inheritance.
基因检测报告是以可供受检者与临床研究人员看得懂的方式来展现基因检测结果,它不仅包括基因检测的直接结果,还包括了对结果的解读。其中,对结果的解读可以提供对患病风险的预测和预防建议,可供受检者和临床医生参考,具有很重要的实用价值。目前,许多基因检测机构出具的基因检测报告几乎都是依靠手工录入的方式。The genetic test report presents the genetic test results in a way that can be understood by the subjects and clinical researchers. It includes not only the direct results of the genetic test, but also the interpretation of the results. Among them, the interpretation of the results can provide the prediction of disease risk and preventive suggestions, which can be used as a reference for examinees and clinicians, and has very important practical value. At present, the genetic test reports issued by many genetic testing institutions are almost all manually entered.
但是,手工摘录需要耗费大量的时间,准确性不能得到保证。另外,如果有新的研究发现,不能及时地更新报告。However, manual excerpts are time-consuming and accuracy cannot be guaranteed. In addition, if there are new research findings, the report cannot be updated in a timely manner.
发明内容Contents of the invention
本发明主要解决的技术问题是提供一种自动生成基因检测报告的方法及装置,能够节约大量时间,保证准确性,及时使基因检测报告得到更新。The technical problem mainly solved by the present invention is to provide a method and device for automatically generating a gene detection report, which can save a lot of time, ensure accuracy, and update the gene detection report in time.
为解决上述技术问题,本发明采用的一个技术方案是:提供一种自动生成基因检测报告的方法,包括:获取受检对象的遗传信息和样本信息;通过所述遗传信息查询基因数据库,获取与所述遗传信息对应的表征信息,所述基因数据库中保存有所述遗传信息与表征信息之间的对应关系;调用基因检测报告的标准化模板,并将所述样本信息和与所述遗传信息对应的表征信息导入所述标准化模板中,自动生成基因检测报告。In order to solve the above technical problems, a technical solution adopted by the present invention is to provide a method for automatically generating a gene detection report, including: obtaining the genetic information and sample information of the subject to be tested; querying the genetic database through the genetic information, obtaining and The characterization information corresponding to the genetic information, the gene database stores the correspondence between the genetic information and the characterization information; calls the standardized template of the genetic detection report, and matches the sample information with the genetic information Import the characterization information into the standardized template to automatically generate a gene detection report.
其中,所述调用基因检测报告的标准化模板,并将所述样本信息和与所述遗传信息对应的表征信息导入所述标准化模板中,自动生成基因检测报告的步骤之前,包括:确定所述基因检测报告的标准化模板的格式;采用拉泰赫LaTeX源代码编写所述基因检测报告的标准化模板。Wherein, calling the standardized template of the gene detection report, and importing the sample information and the characterization information corresponding to the genetic information into the standardized template, before the step of automatically generating the gene detection report, includes: determining the gene The format of the standardized template of the test report; the standardized template of the genetic test report is written using LaTeX source code.
其中,所述遗传信息是基因分型结果,所述表征信息是疾病信息,所述基因数据库是疾病基因数据库。Wherein, the genetic information is a genotyping result, the characterization information is disease information, and the gene database is a disease gene database.
其中,所述调用基因检测报告的标准化模板,并将所述样本信息和与所述遗传信息对应的表征信息导入所述标准化模板中,自动生成基因检测报告的步骤,包括:调用所述基因检测报告的标准化模板;将所述样本信息和与所述基因分型结果对应的疾病信息导入所述标准化模板中;设置与疾病相关的筛选条件;根据所述与疾病相关的筛选条件,运行所述生成基因检测报告的纯文本文件,获得基因检测报告的纯文本文件;将所述基因检测报告的纯文本文件生成便携式文件格式PDF的基因检测报告。Wherein, the step of calling the standardized template of the gene detection report, importing the sample information and the characterization information corresponding to the genetic information into the standardized template, and automatically generating the gene detection report includes: calling the gene detection report A standardized template for reporting; importing the sample information and disease information corresponding to the genotyping result into the standardized template; setting disease-related screening conditions; according to the disease-related screening conditions, running the Generating a plain text file of the genetic testing report to obtain the plain text file of the genetic testing report; generating a genetic testing report in a portable file format PDF from the plain text file of the genetic testing report.
其中,所述通过所述遗传信息查询基因数据库,获取与所述遗传信息对应的表征信息的步骤之前,包括:从文献资料或数据库中提取所需的与所述遗传信息相关的数据信息;确定所述数据信息的数据格式;根据所述数据信息和所述数据格式建立所述基因数据库。Wherein, before the step of querying the gene database through the genetic information and obtaining the characterization information corresponding to the genetic information, it includes: extracting the required data information related to the genetic information from literature or databases; determining The data format of the data information; the gene database is established according to the data information and the data format.
为解决上述技术问题,本发明采用的另一个技术方案是:提供一种自动生成基因检测报告的装置,所述装置包括:第一获取模块,用于获取受检对象的遗传信息和样本信息;第二获取模块,用于通过所述遗传信息查询基因数据库,获取与所述遗传信息对应的表征信息,所述基因数据库中保存有所述遗传信息与表征信息之间的对应关系;报告生成模块,用于调用基因检测报告的标准化模板,并将所述样本信息和与所述遗传信息对应的表征信息导入所述标准化模板中,自动生成基因检测报告。In order to solve the above-mentioned technical problems, another technical solution adopted by the present invention is to provide a device for automatically generating a genetic test report, the device comprising: a first acquisition module, used to acquire the genetic information and sample information of the subject; The second acquisition module is used to query the gene database through the genetic information to obtain the characterization information corresponding to the genetic information, and the genetic database stores the correspondence between the genetic information and the characterization information; a report generation module , for invoking a standardized template of a gene detection report, and importing the sample information and characterization information corresponding to the genetic information into the standardized template to automatically generate a gene detection report.
其中,所述装置还包括:第一确定模块,用于确定所述基因检测报告的标准化模板的格式;编写模块,用于采用拉泰赫LaTeX源代码编写所述基因检测报告的标准化模板。Wherein, the device further includes: a first determining module, used to determine the format of the standardized template of the genetic testing report; a writing module, used to write the standardized template of the genetic testing report using LaTeX source code.
其中,所述遗传信息是基因分型结果,所述表征信息是疾病信息,所述基因数据库是疾病基因数据库。Wherein, the genetic information is a genotyping result, the characterization information is disease information, and the gene database is a disease gene database.
其中,所述报告生成模块包括:调用单元,用于调用所述基因检测报告的标准化模板;导入单元,用于将所述样本信息和与所述基因分型结果对应的疾病信息导入所述标准化模板中;设置单元,用于设置与疾病相关的筛选条件;运行单元,用于根据所述与疾病相关的筛选条件,运行所述生成基因检测报告的纯文本文件,获得基因检测报告的纯文本文件;生成单元,用于将所述基因检测报告的纯文本文件生成便携式文件格式PDF的基因检测报告。Wherein, the report generation module includes: a calling unit, used to call the standardized template of the gene detection report; an import unit, used to import the sample information and the disease information corresponding to the genotyping result into the standardized In the template; the setting unit is used to set the screening conditions related to the disease; the operation unit is used to run the plain text file of the gene detection report generated according to the disease-related screening conditions to obtain the plain text of the genetic detection report A file; a generating unit, configured to generate a genetic detection report in a portable file format PDF from the plain text file of the genetic detection report.
其中,所述装置还包括:提取模块,用于从文献资料或数据库中提取所需的与所述遗传信息相关的数据信息;第二确定模块,用于确定所述数据信息的数据格式;建立模块,用于根据所述数据信息和所述数据格式建立所述基因数据库。Wherein, the device also includes: an extraction module, used to extract the required data information related to the genetic information from literature or databases; a second determination module, used to determine the data format of the data information; A module for establishing the gene database according to the data information and the data format.
本发明的有益效果是:区别于现有技术的情况,本发明获取受检对象的遗传信息和样本信息;通过遗传信息查询基因数据库,获取与遗传信息对应的表征信息;调用基因检测报告的标准化模板,并将样本信息和与遗传信息对应的表征信息导入标准化模板中,自动生成基因检测报告。由于通过遗传信息获取到表征信息后,调用标准模板,将样本信息和表征信息导入标准模板后,即可自动生成基因检测报告,因而不用手工录入,能够节约大量时间,避免人为误差,可以保证准确性,而且基因数据库可以根据新的研究发现及时更新相应信息,从而能够及时使基因检测报告得到更新。The beneficial effects of the present invention are: different from the situation of the prior art, the present invention obtains the genetic information and sample information of the subject to be tested; queries the genetic database through the genetic information to obtain the characterization information corresponding to the genetic information; calls the standardization of the genetic detection report template, and import sample information and characterization information corresponding to genetic information into the standardized template to automatically generate a genetic test report. After the characterization information is obtained through the genetic information, the standard template is called, and after the sample information and characterization information are imported into the standard template, the genetic test report can be automatically generated, so there is no need for manual entry, which can save a lot of time, avoid human errors, and ensure accuracy. Moreover, the gene database can update the corresponding information in time according to new research findings, so that the gene test report can be updated in time.
附图说明Description of drawings
图1是本发明自动生成基因检测报告的方法一实施方式的流程图;Fig. 1 is a flowchart of an embodiment of the method for automatically generating a gene detection report in the present invention;
图2是本发明自动生成基因检测报告的方法另一实施方式的流程图;Fig. 2 is a flowchart of another embodiment of the method for automatically generating a gene detection report of the present invention;
图3是本发明自动生成基因检测报告的方法又一实施方式的流程图;Fig. 3 is a flowchart of another embodiment of the method for automatically generating a gene detection report in the present invention;
图4是本发明自动生成基因检测报告的方法又一实施方式的流程图;Fig. 4 is a flowchart of another embodiment of the method for automatically generating a gene detection report in the present invention;
图5是本发明自动生成基因检测报告的装置一实施方式的结构示意图;Fig. 5 is a structural schematic diagram of an embodiment of the device for automatically generating a gene detection report according to the present invention;
图6是本发明自动生成基因检测报告的装置另一实施方式的结构示意图;Fig. 6 is a structural schematic diagram of another embodiment of the device for automatically generating a gene detection report according to the present invention;
图7是本发明自动生成基因检测报告的装置又一实施方式的结构示意图;Fig. 7 is a structural schematic diagram of another embodiment of the device for automatically generating a gene detection report according to the present invention;
图8是本发明自动生成基因检测报告的装置又一实施方式的结构示意图。Fig. 8 is a structural schematic diagram of another embodiment of the device for automatically generating a gene detection report according to the present invention.
具体实施方式Detailed ways
参阅图1,图1是本发明自动生成基因检测报告的方法一实施方式的流程图,包括:Referring to Fig. 1, Fig. 1 is a flowchart of an embodiment of the method for automatically generating a gene detection report of the present invention, including:
步骤S101:获取受检对象的遗传信息和样本信息。Step S101: Obtain the genetic information and sample information of the subject.
遗传信息是指生物为复制与自己相同的东西,由亲代传递给子代、或各细胞每次分裂时由细胞传递给细胞的信息,即碱基对的排列顺序(或指DNA分子的脱氧核苷酸的排列顺序)。此处的遗传信息可以是代表个体差异性的遗传信息、DNA突变的遗传信息、或者与基因遗传疾病相关的遗传信息等等。样本信息主要是受检对象的基本情况信息,如姓名、性别、年龄、样本类型、检测日期、检测单位等等。Genetic information refers to the information that organisms pass from parent to offspring, or from cell to cell each time each cell divides, in order to copy the same thing as itself, that is, the sequence of base pairs (or the deoxygenated nucleus of the DNA molecule sequence of nucleotides). The genetic information here may be genetic information representing individual differences, genetic information of DNA mutations, or genetic information related to genetic diseases, etc. The sample information is mainly the basic situation information of the tested object, such as name, gender, age, sample type, testing date, testing unit, and so on.
步骤S102:通过遗传信息查询基因数据库,获取与遗传信息对应的表征信息,基因数据库中保存有遗传信息与表征信息之间的对应关系。Step S102: Query the gene database through the genetic information to obtain the representation information corresponding to the genetic information, and the gene database stores the correspondence between the genetic information and the representation information.
表征信息是指根据遗传信息可以表征在生物体表面、可以观察到的个体差异性的性状表现,包括但不限于长相、身高、体重、肤色、性格等。Representation information refers to the performance of individual differences that can be represented on the surface of organisms and can be observed based on genetic information, including but not limited to appearance, height, weight, skin color, personality, etc.
由于基因数据库中保存有遗传信息与表征信息之间的对应关系,因此,通过遗传信息查询基因数据库,可以获取与遗传信息对应的表征信息。Since the gene database stores the correspondence between the genetic information and the characterization information, the characterization information corresponding to the genetic information can be obtained by querying the gene database through the genetic information.
步骤S103:调用基因检测报告的标准化模板,并将样本信息和与遗传信息对应的表征信息导入标准化模板中,自动生成基因检测报告。Step S103: call the standardized template of the genetic testing report, import the sample information and the characterization information corresponding to the genetic information into the standardized template, and automatically generate the genetic testing report.
基因检测报告的标准化模板是按照预设要求创建完毕的,在获得样本信息和表征信息后,调用标准化模板,即可自动生成基因检测报告。The standardized template of the genetic testing report is created according to the preset requirements. After obtaining the sample information and characterization information, the standardized template can be called to automatically generate the genetic testing report.
本发明实施方式获取受检对象的遗传信息和样本信息;通过遗传信息查询基因数据库,获取与遗传信息对应的表征信息;调用基因检测报告的标准化模板,并将样本信息和与遗传信息对应的表征信息导入标准化模板中,自动生成基因检测报告。由于通过遗传信息获取到表征信息后,调用标准模板,将样本信息和表征信息导入标准模板后,即可自动生成基因检测报告,因而不用手工录入,能够节约大量时间,避免人为误差,可以保证准确性,而且基因数据库可以根据新的研究发现及时更新相应信息,从而能够及时使基因检测报告得到更新。The embodiment of the present invention obtains the genetic information and sample information of the subject to be tested; queries the gene database through the genetic information to obtain the characterization information corresponding to the genetic information; calls the standardized template of the gene detection report, and compares the sample information and the characterization information corresponding to the genetic information The information is imported into the standardized template, and the genetic testing report is automatically generated. After the characterization information is obtained through the genetic information, the standard template is called, and after the sample information and characterization information are imported into the standard template, the genetic test report can be automatically generated, so there is no need for manual entry, which can save a lot of time, avoid human errors, and ensure accuracy. Moreover, the gene database can update the corresponding information in time according to new research findings, so that the gene test report can be updated in time.
参阅图2,如果基因检测报告的标准化模板事先没有创建好,那么在步骤S103之前,还包括如下内容:Referring to Figure 2, if the standardized template of the genetic testing report has not been created in advance, then before step S103, the following content is also included:
步骤S201:确定基因检测报告的标准化模板的格式。Step S201: Determine the format of the standardized template of the genetic testing report.
步骤S202:采用拉泰赫LaTeX源代码编写基因检测报告的标准化模板。Step S202: Using the LaTeX source code of LaTeX to write the standardized template of the gene detection report.
LaTeX(音译“拉泰赫”)是一种基于ΤΕΧ的排版系统,由美国计算机学家莱斯利·兰伯特(Leslie Lamport)在20世纪80年代初期开发,利用这种格式,即使使用者没有排版和程序设计的知识也可以充分发挥由TeX所提供的强大功能,能在几天、甚至几小时内生成很多具有书籍质量的印刷品。对于生成复杂表格和数学公式,这一点表现得尤为突出。因此它非常适用于生成高印刷质量的科技和数学类文档,这个系统同样适用于生成从简单的信件到完整书籍的所有其他种类的文档。LaTeX (transliteration "La Taihe") is a TEX-based typesetting system developed by American computer scientist Leslie Lamport in the early 1980s. Using this format, even users No knowledge of typesetting and programming can give full play to the powerful functions provided by TeX, and can generate many book-quality prints in a few days, or even a few hours. This is especially true for generating complex tables and mathematical formulas. It is therefore well suited for producing high print quality technical and mathematical documents, and the system is equally suitable for producing all other kinds of documents, from simple letters to complete books.
本实施方式通过使用LaTeX源代码来编写标准化模板,可以进行批量处理,实现自动化生成基因检测报告。In this embodiment, by using LaTeX source code to write standardized templates, batch processing can be performed to realize automatic generation of genetic testing reports.
在实际应用中,除了使用LaTeX源代码来编写标准化模板外,也可以采用其他纯文本的源代码来编写标准化模板,例如:Markdown,troff,TeX等。In practical applications, in addition to using LaTeX source code to write standardized templates, other plain text source codes can also be used to write standardized templates, such as: Markdown, troff, TeX, etc.
参阅图3,当遗传信息是基因分型结果,表征信息是疾病信息,基因数据库是疾病基因数据库时,步骤S103具体可以包括如下内容:Referring to Figure 3, when the genetic information is the genotyping result, the characterization information is disease information, and the gene database is a disease gene database, step S103 may specifically include the following:
步骤S301:调用基因检测报告的标准化模板。Step S301: Invoking the standardized template of the gene detection report.
步骤S302:将样本信息和与基因分型结果对应的疾病信息导入标准化模板中。Step S302: Import sample information and disease information corresponding to genotyping results into a standardized template.
基因分型(Genotyping)是利用生物学检测方法测定个体基因型(Genotype)的技术。一些常用的基因分型技术有:限制性片段长度多态性(Restriction Fragment LengthPolymorphism,RFLP)、末端限制性长度多态性(Terminal Restriction Fragment LengthPolymorphism,t-RFLP)、扩增片段长度多态性(Amplified fragment lengthpolymorphisms,AFLP)、多重连接探针扩增(multiplex ligation-dependent probeamplification,MLPA)、单核苷酸多态性(Single Nucleotide Polymorphisms,SNP)等。基因分型分析能够诊断疾病导致的遗传变异,如微卫星序列不稳定(MicrosatelliteInstability,MSI)、三体型(Trisomy)、非整倍体(Aneuploidy)、杂合性缺失(loss ofheterozygosity,LOH)等;微卫星序列不稳定和杂合性缺失与结肠癌、乳腺癌、子宫颈癌的肿瘤细胞基因型有关;第21号染色体的三体型是种常见的非整倍体,临床表现为先天愚型(Down Syndrome)。Genotyping (Genotyping) is the use of biological detection methods to determine the individual genotype (Genotype) technology. Some commonly used genotyping techniques are: restriction fragment length polymorphism (Restriction Fragment Length Polymorphism, RFLP), terminal restriction length polymorphism (Terminal Restriction Fragment Length Polymorphism, t-RFLP), amplified fragment length polymorphism ( Amplified fragment length polymorphisms (AFLP), multiplex ligation-dependent probe amplification (multiplex ligation-dependent probe amplification, MLPA), single nucleotide polymorphisms (Single Nucleotide Polymorphisms, SNP), etc. Genotyping analysis can diagnose genetic variation caused by diseases, such as microsatellite instability (Microsatellite Instability, MSI), trisomy (Trisomy), aneuploidy (Aneuploidy), loss of heterozygosity (loss of heterozygosity, LOH), etc; Microsatellite instability and loss of heterozygosity are related to the tumor cell genotypes of colon cancer, breast cancer, and cervical cancer; trisomy of chromosome 21 is a common aneuploidy, and the clinical manifestation is congenital stupidity ( Down Syndrome).
步骤S303:设置与疾病相关的筛选条件。Step S303: Set the filtering conditions related to the disease.
与疾病相关的筛选条件可以是一些参数的临界阈值,例如:综合风险系数的最小值minRisk,设置综合风险系数的最小值后,当大于该值且值越大时,风险就越大。Disease-related screening conditions can be critical thresholds of some parameters, for example: the minimum value of the comprehensive risk coefficient minRisk, after setting the minimum value of the comprehensive risk coefficient, when it is greater than this value and the value is larger, the risk is greater.
步骤S304:根据与疾病相关的筛选条件,运行生成基因检测报告的纯文本文件,获得基因检测报告的纯文本文件。Step S304: According to the screening conditions related to the disease, run and generate the plain text file of the gene detection report to obtain the plain text file of the gene detection report.
步骤S305:将基因检测报告的纯文本文件生成便携式文件格式PDF的基因检测报告。Step S305: Generating the genetic testing report in the portable file format PDF from the plain text file of the genetic testing report.
基因检测报告可以包括与基因分型结果对应的疾病信息,以及高风险、中风险、低风险疾病的统计。The genetic testing report may include disease information corresponding to genotyping results, as well as statistics of high-risk, medium-risk, and low-risk diseases.
通过本实施方式,可以获得详细的疾病信息及相关的统计文档的PDF格式的基因检测报告。Through this embodiment, a gene detection report in PDF format with detailed disease information and related statistical documents can be obtained.
参阅图4,当基因数据库还没有建立的时候,步骤S102之前,还包括:Referring to Figure 4, when the gene database has not been established, before step S102, it also includes:
步骤S401:从文献资料或数据库中提取所需的与遗传信息相关的数据信息。Step S401: Extracting required data information related to genetic information from literature or databases.
步骤S402:确定数据信息的数据格式。Step S402: Determine the data format of the data information.
步骤S403:根据数据信息和数据格式建立基因数据库。Step S403: Establish a gene database according to the data information and data format.
通过本实施方式,可以构建基因数据库。According to this embodiment, a gene database can be constructed.
下面以一个具体的应用实例来说明本发明自动生成基因检测报告的方法,具体以基因分型技术中的SNP为例。A specific application example is used below to illustrate the method of the present invention for automatically generating a gene detection report, specifically taking SNP in genotyping technology as an example.
(1)基因数据库构建。选取103种常见的人类疾病对应的SNP位点,然后通过SNP位点跟SNP数据库(dbSNP)、hg19数据库以及HapMap数据库进行比较,得到对应的基因型,还可以补充其他的SNP信息和相关的疾病信息,形成表格形式的基因数据库。疾病信息包括疾病中英文名称、好发病年龄、人群发病率、性别风险系数、遗传类型、体检建议、疾病简介、总结与建议等。SNP信息包括阳性SNP编码、致病基因、正常碱基型、人群SNP阳性率、阳性风险系数、致病机理、基因图谱等。(1) Gene database construction. Select the SNP loci corresponding to 103 common human diseases, and then compare the SNP loci with the SNP database (dbSNP), hg19 database and HapMap database to obtain the corresponding genotype, and can also supplement other SNP information and related diseases Information, forming a gene database in tabular form. Disease information includes disease name in Chinese and English, best age of onset, population incidence rate, gender risk coefficient, genetic type, medical examination suggestion, disease profile, summary and suggestion, etc. SNP information includes positive SNP codes, pathogenic genes, normal base types, population SNP positive rates, positive risk coefficients, pathogenic mechanisms, gene maps, etc.
(2)标准化模板构建。确定基因检测报告的模板,以LaTeX源代码的形式来编写模板。(2) Standardized template construction. Determine the template of the genetic testing report, and write the template in the form of LaTeX source code.
(3)准备输入文件,如样本的信息(patientInfo.txt),下机数据编号(ncId2SampleID.txt)和基因分型结果(report.xls)。样本信息包括样品名称、样品编号、性别、年龄、样本类型、收样日期、送检单位。基因分型结果包括GC_Score等,并结合基因数据库获取阳性基因型的信息。(3) Prepare input files, such as sample information (patientInfo.txt), off-machine data number (ncId2SampleID.txt) and genotyping results (report.xls). Sample information includes sample name, sample number, gender, age, sample type, date of sample receipt, and inspection unit. Genotyping results include GC_Score, etc., combined with the gene database to obtain positive genotype information.
(4)运行脚本,查询基因数据库中的疾病信息和SNP信息,并计算阳性发病率、综合风险系数和加权阳性发病率(计算公式如下),然后调用LaTeX模板文件。(4) Run the script to query the disease information and SNP information in the gene database, and calculate the positive incidence rate, comprehensive risk coefficient and weighted positive incidence rate (the calculation formula is as follows), and then call the LaTeX template file.
2)阳性发病率的计算:2) Calculation of positive incidence rate:
发病率与性别相关:Incidence is sex-related:
男性:阳性发病率={(阳性风险系数-1)×性别风险系数+1}×人群发病率Male: positive incidence rate = {(positive risk coefficient - 1) × sex risk coefficient + 1} × population incidence rate
发病率与性别无关:阳性发病率=阳性风险系数×人群发病率The incidence rate has nothing to do with gender: positive incidence rate = positive risk coefficient × population incidence rate
3)若疾病有多突变位点:3) If the disease has multiple mutation sites:
(5)设置过滤条件,包括综合风险系数的最小值minRisk(默认为1,若需要设置不能小于1),GCScore的最小值minGCScore(默认为0,若需要设置不能小于0),人群SNP阳性率的最大值maxMutRate(默认为1,若需要设置不能大于1),检测到SNP频率的最大值maxFreq(默认为1,若需要设置不能大于1)。(5) Set filter conditions, including the minimum value of the comprehensive risk coefficient minRisk (the default is 1, if necessary, the setting cannot be less than 1), the minimum value of GCScore minGCScore (the default is 0, if necessary, the setting cannot be less than 0), the population SNP positive rate The maximum value of maxMutRate (the default is 1, if you need to set it cannot be greater than 1), the maximum value of the detected SNP frequency maxFreq (the default is 1, if you need to set it cannot be greater than 1).
(6)运行程序,输出RST000000000.report.tex文档和一些统计文件,包括对疾病和SNP的统计信息,以及高风险、中风险、低风险疾病的统计。(6) Run the program to output the RST000000000.report.tex file and some statistical files, including statistical information on diseases and SNPs, and statistics on high-risk, medium-risk, and low-risk diseases.
(7)利用LaTeX编译运行RST000000000.report.tex文档,生成PDF基因检测报告RST000000000.report.pdf。(7) Use LaTeX to compile and run the RST000000000.report.tex document to generate a PDF gene detection report RST000000000.report.pdf.
参阅图5,图5是本发明自动生成基因检测报告的装置一实施方式的结构示意图,该装置包括:第一获取模块101、第二获取模块102以及报告生成模块103。Referring to FIG. 5 , FIG. 5 is a structural diagram of an embodiment of the device for automatically generating a genetic test report according to the present invention. The device includes: a first acquisition module 101 , a second acquisition module 102 and a report generation module 103 .
第一获取模块101用于获取受检对象的遗传信息和样本信息。The first acquisition module 101 is used to acquire the genetic information and sample information of the subject.
遗传信息是指生物为复制与自己相同的东西,由亲代传递给子代、或各细胞每次分裂时由细胞传递给细胞的信息,即碱基对的排列顺序。此处的遗传信息可以是代表个体差异性的遗传信息、DNA突变的遗传信息、或者与基因遗传疾病相关的遗传信息等等。样本信息主要是受检对象的基本情况信息,如姓名、性别、年龄、样本类型、检测日期、检测单位等等。Genetic information refers to the information that organisms pass from parent to offspring, or from cell to cell each time each cell divides, in order to copy the same thing as itself, that is, the sequence of base pairs. The genetic information here may be genetic information representing individual differences, genetic information of DNA mutations, or genetic information related to genetic diseases, etc. The sample information is mainly the basic situation information of the tested object, such as name, gender, age, sample type, testing date, testing unit, and so on.
第二获取模块102用于通过遗传信息查询基因数据库,获取与遗传信息对应的表征信息,基因数据库中保存有遗传信息与表征信息之间的对应关系。The second acquiring module 102 is used to query the gene database through the genetic information, and acquire the characteristic information corresponding to the genetic information, and the gene database stores the correspondence between the genetic information and the characteristic information.
表征信息是指根据遗传信息可以表征在生物体表面、可以观察到的个体差异性的性状表现,包括但不限于长相、身高、体重、肤色、性格等。Representation information refers to the performance of individual differences that can be represented on the surface of organisms and can be observed based on genetic information, including but not limited to appearance, height, weight, skin color, personality, etc.
由于基因数据库中保存有遗传信息与表征信息之间的对应关系,因此,通过遗传信息查询基因数据库,可以获取与遗传信息对应的表征信息。Since the gene database stores the correspondence between the genetic information and the characterization information, the characterization information corresponding to the genetic information can be obtained by querying the gene database through the genetic information.
报告生成模块103用于调用基因检测报告的标准化模板,并将样本信息和与遗传信息对应的表征信息导入标准化模板中,自动生成基因检测报告。The report generation module 103 is used to call the standardized template of the gene detection report, import the sample information and the characterization information corresponding to the genetic information into the standardized template, and automatically generate the gene detection report.
基因检测报告的标准化模板是按照预设要求创建完毕的,在获得样本信息和表征信息后,调用标准化模板,即可自动生成基因检测报告。The standardized template of the genetic testing report is created according to the preset requirements. After obtaining the sample information and characterization information, the standardized template can be called to automatically generate the genetic testing report.
本发明实施方式获取受检对象的遗传信息和样本信息;通过遗传信息查询基因数据库,获取与遗传信息对应的表征信息;调用基因检测报告的标准化模板,并将样本信息和与遗传信息对应的表征信息导入标准化模板中,自动生成基因检测报告。由于通过遗传信息获取到表征信息后,调用标准模板,将样本信息和表征信息导入标准模板后,即可自动生成基因检测报告,因而不用手工录入,能够节约大量时间,避免人为误差,可以保证准确性,而且基因数据库可以根据新的研究发现及时更新相应信息,从而能够及时使基因检测报告得到更新。The embodiment of the present invention obtains the genetic information and sample information of the subject to be tested; queries the gene database through the genetic information to obtain the characterization information corresponding to the genetic information; calls the standardized template of the gene detection report, and compares the sample information and the characterization information corresponding to the genetic information The information is imported into the standardized template, and the genetic testing report is automatically generated. After the characterization information is obtained through the genetic information, the standard template is called, and after the sample information and characterization information are imported into the standard template, the genetic test report can be automatically generated, so there is no need for manual entry, which can save a lot of time, avoid human errors, and ensure accuracy. Moreover, the gene database can update the corresponding information in time according to new research findings, so that the gene test report can be updated in time.
参阅图6,如果基因检测报告的标准化模板事先没有创建好,该装置还包括:第一确定模块201和编写模块202。Referring to FIG. 6 , if the standardized template of the gene detection report has not been created in advance, the device further includes: a first determination module 201 and a writing module 202 .
第一确定模块201用于确定基因检测报告的标准化模板的格式。The first determination module 201 is used to determine the format of the standardized template of the gene detection report.
编写模块202用于采用拉泰赫LaTeX源代码编写基因检测报告的标准化模板。The writing module 202 is used to write the standardized template of the gene detection report by using LaTeX source code.
LaTeX(音译“拉泰赫”)是一种基于ΤΕΧ的排版系统,利用这种格式,即使使用者没有排版和程序设计的知识也可以充分发挥由TeX所提供的强大功能,能在几天、甚至几小时内生成很多具有书籍质量的印刷品。因此它非常适用于生成高印刷质量的科技和数学类文档,这个系统同样适用于生成从简单的信件到完整书籍的所有其他种类的文档。LaTeX (transliteration "La Taihe") is a typesetting system based on TEX. Using this format, users can give full play to the powerful functions provided by TeX even if they have no knowledge of typesetting and programming. Even produce many book-quality prints in a few hours. It is therefore well suited for producing high print quality technical and mathematical documents, and the system is equally suitable for producing all other kinds of documents, from simple letters to complete books.
本实施方式通过使用LaTeX源代码来编写标准化模板,可以进行批量处理,实现自动化生成基因检测报告。In this embodiment, by using LaTeX source code to write standardized templates, batch processing can be performed to realize automatic generation of genetic testing reports.
在实际应用中,除了使用LaTeX源代码来编写标准化模板外,也可以采用其他纯文本的源代码来编写标准化模板。In practical applications, in addition to using LaTeX source code to write standardized templates, other plain text source codes can also be used to write standardized templates.
参阅图7,当遗传信息是基因分型结果,表征信息是疾病信息,基因数据库是疾病基因数据库时,报告生成模块103包括:调用单元1031、导入单元1032、设置单元1033、运行单元1034以及生成单元1035。Referring to Fig. 7, when the genetic information is the genotyping result, the characterization information is the disease information, and the gene database is the disease gene database, the report generation module 103 includes: a call unit 1031, an import unit 1032, a setting unit 1033, an operation unit 1034 and a generation Unit 1035.
调用单元1031用于调用基因检测报告的标准化模板。The calling unit 1031 is used to call the standardized template of the gene detection report.
导入单元1032用于将样本信息和与基因分型结果对应的疾病信息导入标准化模板中。The import unit 1032 is used for importing the sample information and the disease information corresponding to the genotyping result into the standardized template.
基因分型是利用生物学检测方法测定个体基因型的技术。一些常用的基因分型技术有:限制性片段长度多态性(RFLP)、末端限制性长度多态性(t-RFLP)、扩增片段长度多态性(AFLP)、多重连接探针扩增(MLPA)、单核苷酸多态性(SNP)等。基因分型分析能够诊断疾病导致的遗传变异,如微卫星序列不稳定(MSI)、三体型(Trisomy)、非整倍体(Aneuploidy)、杂合性缺失(LOH)等;微卫星序列不稳定和杂合性缺失与结肠癌、乳腺癌、子宫颈癌的肿瘤细胞基因型有关;第21号染色体的三体型是种常见的非整倍体,临床表现为先天愚型(Down Syndrome)。Genotyping is the technique of determining an individual's genotype using biological assays. Some of the commonly used genotyping techniques are: restriction fragment length polymorphism (RFLP), terminal restriction length polymorphism (t-RFLP), amplified fragment length polymorphism (AFLP), multiplex ligation probe amplification (MLPA), single nucleotide polymorphism (SNP), etc. Genotyping analysis can diagnose genetic variation caused by diseases, such as microsatellite sequence instability (MSI), trisomy (Trisomy), aneuploidy (Aneuploidy), loss of heterozygosity (LOH), etc.; microsatellite sequence instability And loss of heterozygosity is related to the tumor cell genotype of colon cancer, breast cancer, and cervical cancer; the trisomy of chromosome 21 is a common aneuploidy, and the clinical manifestation is Down Syndrome.
设置单元1033用于设置与疾病相关的筛选条件。The setting unit 1033 is used to set the screening conditions related to the disease.
运行单元1034用于根据与疾病相关的筛选条件,运行生成基因检测报告的纯文本文件,获得基因检测报告的纯文本文件。The running unit 1034 is used to run and generate the plain text file of the gene detection report according to the screening conditions related to the disease, and obtain the plain text file of the gene detection report.
生成单元1035用于将基因检测报告的纯文本文件生成便携式文件格式PDF的基因检测报告。The generation unit 1035 is used to generate a genetic detection report in a portable file format PDF from the plain text file of the genetic detection report.
基因检测报告可以包括与基因分型结果对应的疾病信息,以及高风险、中风险、低风险疾病的统计。The genetic testing report may include disease information corresponding to genotyping results, as well as statistics of high-risk, medium-risk, and low-risk diseases.
通过本实施方式,可以获得详细的疾病信息及相关的统计文档的PDF格式的基因检测报告。Through this embodiment, a gene detection report in PDF format with detailed disease information and related statistical documents can be obtained.
参阅图8,当基因数据库还没有建立的时候,装置还包括:提取模块301、第二确定模块302以及建立模块303。Referring to FIG. 8 , when the gene database has not been established, the device further includes: an extraction module 301 , a second determination module 302 and an establishment module 303 .
提取模块301用于从文献资料或数据库中提取所需的与遗传信息相关的数据信息。The extraction module 301 is used to extract the required data information related to the genetic information from literature or databases.
第二确定模块302用于确定数据信息的数据格式;The second determination module 302 is used to determine the data format of the data information;
建立模块303用于根据数据信息和数据格式建立基因数据库。The establishment module 303 is used to establish the gene database according to the data information and data format.
通过本实施方式,可以构建基因数据库。According to this embodiment, a gene database can be constructed.
以上所述仅为本发明的实施方式,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above is only the embodiment of the present invention, and does not limit the patent scope of the present invention. Any equivalent structure or equivalent process conversion made by using the description of the present invention and the contents of the accompanying drawings, or directly or indirectly used in other related technologies fields, all of which are equally included in the scope of patent protection of the present invention.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410487501.7A CN105512508B (en) | 2014-09-22 | 2014-09-22 | Automatically generate the method and device of genetic test report |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410487501.7A CN105512508B (en) | 2014-09-22 | 2014-09-22 | Automatically generate the method and device of genetic test report |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105512508A CN105512508A (en) | 2016-04-20 |
CN105512508B true CN105512508B (en) | 2018-05-15 |
Family
ID=55720485
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410487501.7A Expired - Fee Related CN105512508B (en) | 2014-09-22 | 2014-09-22 | Automatically generate the method and device of genetic test report |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105512508B (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106777911A (en) * | 2016-11-28 | 2017-05-31 | 墨宝股份有限公司 | A kind of ob gene check and evaluation system and data processing method |
CN106778083A (en) * | 2016-11-28 | 2017-05-31 | 墨宝股份有限公司 | A kind of method and device for automatically generating genetic test report |
CN107066836A (en) * | 2017-06-15 | 2017-08-18 | 上海思路迪生物医学科技有限公司 | Genetic test management method and system |
CN108388775A (en) * | 2018-02-05 | 2018-08-10 | 上海海云生物科技有限公司 | Genetic analysis guidance system and its method |
CN108776748A (en) * | 2018-05-16 | 2018-11-09 | 成都奇恩生物科技有限公司 | A kind of gene detection system and its detection method |
CN109378044A (en) * | 2018-10-18 | 2019-02-22 | 东莞博奥木华基因科技有限公司 | Individual administration report generation method and system |
CN109754856B (en) * | 2018-12-07 | 2021-06-22 | 荣联科技集团股份有限公司 | Method and device for automatically generating gene detection report and electronic equipment |
CN109817299A (en) * | 2019-02-14 | 2019-05-28 | 北京安智因生物技术有限公司 | A kind of relevant genetic test report automatic generating method of disease and system |
CN110335638B (en) * | 2019-05-22 | 2021-11-23 | 北京安智因生物技术有限公司 | Automatic generation method and system for statin drug gene detection report |
CN111161824A (en) * | 2019-12-20 | 2020-05-15 | 苏州赛美科基因科技有限公司 | Automatic report interpretation method and system |
CN111145855A (en) * | 2019-12-30 | 2020-05-12 | 南京求臻基因科技有限公司 | Automatic generation method and system for clinical PDF report |
CN111243661A (en) * | 2020-01-13 | 2020-06-05 | 北京奇云诺德信息科技有限公司 | Gene physical examination system based on gene data |
CN111564178B (en) * | 2020-04-15 | 2023-07-21 | 圣湘生物科技股份有限公司 | Method, device, equipment and storage medium for generating gene polymorphism analysis report |
CN111627509A (en) * | 2020-05-07 | 2020-09-04 | 圣湘生物科技股份有限公司 | Method, device, equipment and storage medium for generating virus gene detection report |
CN111681706A (en) * | 2020-06-11 | 2020-09-18 | 江苏萨芳纳健康科技有限公司 | Genetic testing method for chronic disease risk |
CN112768005A (en) * | 2020-12-23 | 2021-05-07 | 阔然生物医药科技(上海)有限公司 | Automatic report generation method for tumor gene detection |
CN117275656B (en) * | 2023-11-22 | 2024-04-09 | 北斗生命科学(广州)有限公司 | Method and system for automatically generating standardized report of clinical test record |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1385702A (en) * | 2001-03-20 | 2002-12-18 | 奥索临床诊断有限公司 | Method for supply clinical diagnosis |
CN1547721A (en) * | 2001-08-28 | 2004-11-17 | System, method, and apparatus for storing, retrieving, and integrating clinical, diagnostic, genomic, and therapeutic data | |
CN1889105A (en) * | 2005-06-28 | 2007-01-03 | 联合基因科技有限公司 | Gene medical health system based on network structure and method thereof |
CN102509250A (en) * | 2011-11-03 | 2012-06-20 | 上海大学 | Automatic generation method for medical report |
CN103424541A (en) * | 2006-05-18 | 2013-12-04 | 分子压型学会股份有限公司 | System and method for determining individualized medical intervention for a disease state |
-
2014
- 2014-09-22 CN CN201410487501.7A patent/CN105512508B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1385702A (en) * | 2001-03-20 | 2002-12-18 | 奥索临床诊断有限公司 | Method for supply clinical diagnosis |
CN1547721A (en) * | 2001-08-28 | 2004-11-17 | System, method, and apparatus for storing, retrieving, and integrating clinical, diagnostic, genomic, and therapeutic data | |
CN1889105A (en) * | 2005-06-28 | 2007-01-03 | 联合基因科技有限公司 | Gene medical health system based on network structure and method thereof |
CN103424541A (en) * | 2006-05-18 | 2013-12-04 | 分子压型学会股份有限公司 | System and method for determining individualized medical intervention for a disease state |
CN102509250A (en) * | 2011-11-03 | 2012-06-20 | 上海大学 | Automatic generation method for medical report |
Also Published As
Publication number | Publication date |
---|---|
CN105512508A (en) | 2016-04-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105512508B (en) | Automatically generate the method and device of genetic test report | |
Zook et al. | A robust benchmark for detection of germline large deletions and insertions | |
Hoyt et al. | From telomere to telomere: The transcriptional and epigenetic state of human repeat elements | |
Zhao et al. | Accuracy and efficiency of germline variant calling pipelines for human genome data | |
Mao et al. | Pathway-level information extractor (PLIER) for gene expression data | |
Knowles et al. | Allele-specific expression reveals interactions between genetic variation and environment | |
Bonder et al. | Identification of rare and common regulatory variants in pluripotent cells using population-scale transcriptomics | |
Tankard et al. | Detecting expansions of tandem repeats in cohorts sequenced with short-read sequencing data | |
Goldmann et al. | Germline de novo mutation clusters arise during oocyte aging in genomic regions with high double-strand-break incidence | |
Forgetta et al. | Cohort profile: genomic data for 26 622 individuals from the Canadian Longitudinal Study on Aging (CLSA) | |
Hughes et al. | Identification of multiple independent susceptibility loci in the HLA region in Behcet's disease | |
Grundberg et al. | Mapping cis-and trans-regulatory effects across multiple tissues in twins | |
Dixon et al. | A genome-wide association study of global gene expression | |
Marchini et al. | Genotype imputation for genome-wide association studies | |
Patterson et al. | Genetic evidence for complex speciation of humans and chimpanzees | |
Bamshad et al. | Deconstructing the relationship between genetics and race | |
Reich et al. | Reconstructing Indian population history | |
Korn et al. | Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs | |
De et al. | Bioinformatics challenges in genome-wide association studies (GWAS) | |
Feng et al. | GWAPower: a statistical power calculation software for genome-wide association studies with quantitative traits | |
JP2019054812A (en) | Methods and processes for non-invasive assessment of genetic variations | |
Young | Discovering missing heritability in whole-genome sequencing data | |
Zeng et al. | Shared genetics and couple-associated environment are major contributors to the risk of both clinical and self-declared depression | |
Yuan et al. | Analysis of genome-wide RNA-sequencing data suggests age of the CEPH/Utah (CEU) lymphoblastoid cell lines systematically biases gene expression profiles | |
Werner et al. | Population variability in X-chromosome inactivation across 10 mammalian species |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180515 Termination date: 20190922 |
|
CF01 | Termination of patent right due to non-payment of annual fee |