[go: up one dir, main page]

CN114577967B - Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum - Google Patents

Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum Download PDF

Info

Publication number
CN114577967B
CN114577967B CN202011385656.1A CN202011385656A CN114577967B CN 114577967 B CN114577967 B CN 114577967B CN 202011385656 A CN202011385656 A CN 202011385656A CN 114577967 B CN114577967 B CN 114577967B
Authority
CN
China
Prior art keywords
artificial neural
neural network
compound
chinese medicine
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011385656.1A
Other languages
Chinese (zh)
Other versions
CN114577967A (en
Inventor
梁鑫淼
刘喆
刘艳芳
薛兴亚
沈爱金
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian Institute of Chemical Physics of CAS
Original Assignee
Dalian Institute of Chemical Physics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian Institute of Chemical Physics of CAS filed Critical Dalian Institute of Chemical Physics of CAS
Priority to CN202011385656.1A priority Critical patent/CN114577967B/en
Publication of CN114577967A publication Critical patent/CN114577967A/en
Application granted granted Critical
Publication of CN114577967B publication Critical patent/CN114577967B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8675Evaluation, i.e. decoding of the signal into analytical information
    • G01N30/8686Fingerprinting, e.g. without prior knowledge of the sample components
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/33Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using ultraviolet light
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8665Signal analysis for calibrating the measuring apparatus
    • G01N30/8668Signal analysis for calibrating the measuring apparatus using retention times

Landscapes

  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

The invention relates to a chromatographic fingerprint spectrum analysis method of a traditional Chinese medicine compound. The invention aims to provide a traditional Chinese medicine compound fingerprint analysis method based on a linear artificial neural network model and a chromatographic difference spectrum algorithm, aiming at the limitations of the existing traditional Chinese medicine chromatographic fingerprint processing method. The single medicine composition of the traditional Chinese medicine compound is analyzed by a method of performing difference spectrum analysis finally through a chromatographic peak matching, chromatogram correction and chromatogram reconstruction flow of a linear artificial neural network model, and the method has a good visualization effect. Lays an important foundation for the improvement of the quality control technology of the traditional Chinese medicine compound fingerprint.

Description

基于人工神经网络和差谱的中药复方样品色谱分析方法Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and difference spectrum

技术领域Technical Field

本发明属于中药复方谱图解析方法领域,具体为一种基于人工神经网络模型LSM算法和差谱算法的中药复方色谱指纹图谱解析方法。The invention belongs to the field of Chinese medicine compound spectrum analysis methods, and specifically is a Chinese medicine compound chromatographic fingerprint analysis method based on an artificial neural network model LSM algorithm and a difference spectrum algorithm.

背景技术Background Art

随着中药产业与中医文化的不断发展,近年来中药的生产规模不断扩大,中药厂家数量不断增多,所显现的复方药材造假问题愈加明显,如偷减药材、饮片替换、药材过期问题层出不穷,这些都给中药安全与中药信誉造成了重大损失,严重危害了中药行业的持续发展。对于中药的各种复方药材进行有效解析,能有效的建立中药质量标准,具有重大的意义。如果能建立一个系统、全面、可视化的中药色谱分析方法,就可以对中药成分进行有效分析,对中药复方产品质量进行有效监管,保证市场健康发展。With the continuous development of the Chinese medicine industry and Chinese medicine culture, the production scale of Chinese medicine has been expanding in recent years, and the number of Chinese medicine manufacturers has been increasing. The problem of compound medicinal materials counterfeiting has become more and more obvious, such as stealing and reducing medicinal materials, replacing decoction pieces, and expiration of medicinal materials. These have caused significant losses to the safety and reputation of Chinese medicine, and seriously endangered the sustainable development of the Chinese medicine industry. It is of great significance to effectively analyze various compound medicinal materials of Chinese medicine and effectively establish the quality standards of Chinese medicine. If a systematic, comprehensive, and visual Chinese medicine chromatographic analysis method can be established, it is possible to effectively analyze the components of Chinese medicine, effectively supervise the quality of Chinese medicine compound products, and ensure the healthy development of the market.

目前国内对于中药复方指纹图谱的解析还处于初级阶段,如根据保留时间进行人工的峰匹配和特征峰归属,这种方法在大规模处理复方样品时耗时长,精度较低,可视化程度不明显限制了其应用。At present, the analysis of fingerprint spectra of traditional Chinese medicine compounds is still in its early stages in China, such as manual peak matching and characteristic peak attribution based on retention time. This method is time-consuming, has low accuracy, and has a low degree of visualization when processing compound samples on a large scale, which limits its application.

发明内容Summary of the invention

本发明的目的是针对现有的中药色谱指纹图谱处理方法所存在的局限性,提供一种基于线性人工神经网络模型和色谱差谱算法的中药复方解析方法,通过色谱峰匹配,色谱图校正,线性人工神经网络模型重构色谱的流程,最终得到色谱差谱并进行分析的方法,以准确、自动的解析复方中的单味药组成并直观呈现。The purpose of the present invention is to provide a method for analyzing Chinese medicine compound prescriptions based on a linear artificial neural network model and a chromatographic difference spectrum algorithm in view of the limitations of existing Chinese medicine chromatographic fingerprint processing methods. The method finally obtains a chromatographic difference spectrum and performs analysis through chromatographic peak matching, chromatogram correction, and a chromatographic process reconstructed by a linear artificial neural network model, so as to accurately and automatically analyze the composition of single herbs in the compound prescription and present it intuitively.

本发明为实现上述目的所采用的技术方案是:基于人工神经网络和差谱的中药复方样品色谱分析方法,包括以下步骤:The technical solution adopted by the present invention to achieve the above-mentioned purpose is: a chromatographic analysis method of a Chinese medicine compound sample based on an artificial neural network and a difference spectrum, comprising the following steps:

对于不同的复方或单味中药样品,将每一个样品通过二极管阵列检测器设定固定波长采集紫外光谱数据,提取不同样品的色谱并积分,并采用光谱匹配的方式对任意两种样品进行色谱积分结果的峰匹配;For different compound or single Chinese medicine samples, each sample is set to a fixed wavelength through a diode array detector to collect ultraviolet spectrum data, the chromatograms of different samples are extracted and integrated, and the peaks of the chromatogram integration results of any two samples are matched by spectral matching;

在通过光谱匹配的方式中相同时间窗口T内,将相似度值最大且大于阈值的峰归为统一峰Fi,作为色谱峰匹配结果,其中Fi为峰类编号;In the same time window T through spectrum matching, the peak with the largest similarity value and greater than the threshold is classified as a unified peak F i as the chromatographic peak matching result, where F i is the peak class number;

对于每一个色谱峰匹配结果,以色谱峰顶点作为基准,使用傅里叶变换法对色谱进行重采样,并依据时间间隔进行色谱时间校正,得到校正后的复方中药谱图和单味中药谱图;For each chromatographic peak matching result, the chromatographic peak apex is used as a reference, the chromatogram is resampled using the Fourier transform method, and the chromatographic time is corrected according to the time interval to obtain the corrected compound Chinese medicine spectrum and single Chinese medicine spectrum;

根据校正色谱,分别拟合未知复方样品输入线性人工神经网络模型和已知复方样品输入线性人工神经网络模型,两个模型的输出均为样品系数向量

Figure BDA0002809609590000013
对于未知的复方中药样品,将校正后的复方中药谱图通过使用未知复方样品输入线性人工神经网络模型得到结果
Figure BDA0002809609590000011
对于已知的复方或单味中药样品,将校正后的复方中药谱图或单味中药谱图使用已知复方样品输入线性人工神经网络模型得到结果
Figure BDA0002809609590000012
与单味中药谱图进行运算得到解析谱图;According to the calibration chromatogram, the unknown compound sample input linear artificial neural network model and the known compound sample input linear artificial neural network model are fitted respectively. The outputs of both models are sample coefficient vectors.
Figure BDA0002809609590000013
For unknown compound Chinese medicine samples, the corrected compound Chinese medicine spectrum is input into the linear artificial neural network model by using the unknown compound sample to obtain the result.
Figure BDA0002809609590000011
For known compound or single Chinese medicine samples, the corrected compound Chinese medicine spectrum or single Chinese medicine spectrum is input into the linear artificial neural network model using the known compound sample to obtain the result.
Figure BDA0002809609590000012
Calculate with the spectrum of single Chinese medicine to obtain the analytical spectrum;

将解析谱图与已知样品的单味中药谱图对比,解析出复方样品中缺失的饮片药材。The analyzed spectrum is compared with the single-herb spectrum of known samples to parse out the missing medicinal materials in the compound sample.

所述采用光谱匹配的方式对任意两种样品进行色谱积分结果的峰匹配,包括以下步骤:The method of performing peak matching of chromatographic integration results of any two samples by using spectrum matching comprises the following steps:

使用在时间窗口T内的峰提取顶点光谱图,存为峰-光谱向量Xij,其中i为峰编号,j为所属复方或单味饮片样品编号,为无关正整数。The peaks in the time window T are used to extract the vertex spectra and stored as peak-spectrum vectors Xij , where i is the peak number and j is the sample number of the compound or single-flavor herbal medicine piece to which it belongs, which are irrelevant positive integers.

对X11,X22……Xij中任意两个进行余弦相似度计算,所得相似度结果相应为The cosine similarity calculation is performed on any two of X 11 , X 22 ……X ij , and the similarity result is

Figure BDA0002809609590000021
k为峰编号,m为所属复方或单味饮片样品编号,其相似度公式如下:
Figure BDA0002809609590000021
k is the peak number, m is the sample number of the compound or single herbal medicine, and the similarity formula is as follows:

Figure BDA0002809609590000022
Figure BDA0002809609590000022

通过选取时间窗口T值,大小表示参与匹配的峰的时间范围,取值范围在(0~10)分钟之间。By selecting the time window T value, the size represents the time range of the peaks involved in the matching, and the value range is between (0 and 10) minutes.

所述使用傅里叶变换法对色谱进行重采样,并依据时间间隔进行色谱时间校正,通过下式实现:The chromatogram is resampled using the Fourier transform method, and the chromatogram time correction is performed according to the time interval, which is achieved by the following formula:

Figure BDA0002809609590000023
Figure BDA0002809609590000023

其中t为时间,f(t)为色谱信号函数,p为抽样速率,k为正整数。Where t is time, f(t) is the chromatographic signal function, p is the sampling rate, and k is a positive integer.

所述拟合未知复方样品输入线性人工神经网络模型或标准复方样品输入线性人工神经网络模型,包括以下步骤:The fitting of the unknown compound sample input linear artificial neural network model or the standard compound sample input linear artificial neural network model comprises the following steps:

校正色谱为校正后的复方中药谱图或单味中药谱图,校正色谱为向量

Figure BDA0002809609590000024
单味饮片谱图为
Figure BDA0002809609590000025
j为单味饮片样品编号,将
Figure BDA0002809609590000026
组合构成单味饮片谱图矩阵D,定义系数向量为
Figure BDA0002809609590000027
在迭代次数为n时系数向量为
Figure BDA0002809609590000028
学习率为α,残差为‖Δ‖则有:The calibrated chromatogram is the calibrated compound Chinese medicine spectrum or single Chinese medicine spectrum. The calibrated chromatogram is a vector
Figure BDA0002809609590000024
The spectrum of single herbal medicine pieces is
Figure BDA0002809609590000025
j is the sample number of the single-ingredient decoction piece.
Figure BDA0002809609590000026
The single-ingredient herbal medicine spectrum matrix D is composed, and the coefficient vector is defined as
Figure BDA0002809609590000027
When the number of iterations is n, the coefficient vector is
Figure BDA0002809609590000028
If the learning rate is α and the residual is ‖Δ‖, then:

Figure BDA0002809609590000029
Figure BDA0002809609590000029

对Θ求偏导则有:Taking partial derivatives of Θ, we have:

Figure BDA00028096095900000210
Figure BDA00028096095900000210

此时有迭代公式:

Figure BDA00028096095900000211
At this time, there is an iterative formula:
Figure BDA00028096095900000211

迭代至当‖Δ‖小于设定值时停止学习,得到

Figure BDA00028096095900000212
Iterate until ‖Δ‖ is less than the set value and stop learning.
Figure BDA00028096095900000212

其中,对于未知复方样品输入线性人工神经网络模型和标准复方样品输入线性人工神经网络模型的输入为向量

Figure BDA00028096095900000213
模型输出为样品系数向量
Figure BDA00028096095900000214
The input of the linear artificial neural network model for unknown compound samples and the linear artificial neural network model for standard compound samples is the vector
Figure BDA00028096095900000213
The model output is a sample coefficient vector
Figure BDA00028096095900000214

所述解析谱图通过下式得到:The analytical spectrum is obtained by the following formula:

Figure BDA00028096095900000215
Figure BDA00028096095900000215

其中P为差谱结果,D为单味饮片谱图,

Figure BDA00028096095900000216
为标准复方样品输入线性人工神经网络模型输出参数,
Figure BDA0002809609590000031
为未知复方样品输入线性人工神经网络模型输出参数。Where P is the difference spectrum result, D is the spectrum of a single herbal medicine piece,
Figure BDA00028096095900000216
Input the output parameters of the linear artificial neural network model for the standard compound sample,
Figure BDA0002809609590000031
Input the output parameters of the linear artificial neural network model for unknown compound samples.

ω值大小表示不同样品的样品间不同峰的光谱相似程度,取值范围在(0~1)之间。The ω value indicates the degree of spectral similarity between different peaks of different samples, and its value range is between (0 and 1).

二极管阵列检测器可以替换为高效液相色谱-DAD检测器、高效液相色谱-质谱检测器、气相色谱-DAD检测器、气相色谱-质谱检测器、高效毛细管电泳色谱-DAD检测器或高效毛细管电泳色谱-质谱检测器中的一种。The diode array detector can be replaced by one of a high performance liquid chromatography-DAD detector, a high performance liquid chromatography-mass spectrometry detector, a gas chromatography-DAD detector, a gas chromatography-mass spectrometry detector, a high performance capillary electrophoresis chromatography-DAD detector or a high performance capillary electrophoresis chromatography-mass spectrometry detector.

未知复方样品输入线性人工神经网络模型或标准复方样品输入线性人工神经网络模型的构建,通过LSM、梯度下降和最小二乘法中的一种最小化残差。The unknown compound sample input linear artificial neural network model or the standard compound sample input linear artificial neural network model is constructed, and the residual is minimized by one of LSM, gradient descent and least square method.

所述色谱为高效液相色谱、气相色谱或高效毛细管电泳色谱。The chromatography is high performance liquid chromatography, gas chromatography or high performance capillary electrophoresis chromatography.

本发明具有以下有益效果及优点:The present invention has the following beneficial effects and advantages:

本发明从复方样品和单味样品饮片的关联性进行考虑,充分发挥线性人工神经网络算法优势,利用差谱解析的方式将复杂图谱解析结果可视化。The present invention considers the correlation between compound samples and single-flavor sample pieces, gives full play to the advantages of linear artificial neural network algorithm, and visualizes the complex spectrum analysis results by using the difference spectrum analysis method.

本发明方法简单易行,便于实现,算法适用性广,自动化程度高,适合大批量检测使用;采用傅里叶变换的重采样进行时间校正提高了谱图一致性,降低了操作条件要求,采用线性人工神经网络算法建立模型从而大大提高了测量的准确度,采用差谱方法解析复方谱图从而提高了结果可视化程度。The method of the present invention is simple and easy to implement, and has wide algorithm applicability and high degree of automation, so it is suitable for large-scale detection. The time correction by resampling of Fourier transform improves the consistency of spectrum and reduces the requirements of operating conditions. The linear artificial neural network algorithm is used to establish the model, thereby greatly improving the accuracy of measurement. The difference spectrum method is used to analyze the compound spectrum, thereby improving the visualization of the results.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1本发明的方法流程图;Fig. 1 is a flow chart of the method of the present invention;

图2本发明的示例-清骨散复方样品谱图;Fig. 2 is an example of the present invention - a spectrum of a Qinggusan compound recipe sample;

图3本发明的示例-清骨散校正谱图;Fig. 3 is an example of the present invention - a calibrated spectrum of Qinggusan;

图4本发明的线性神经网络结构图;Fig. 4 is a diagram of the linear neural network structure of the present invention;

图5本发明的示例-基于线性神经网络重建的清骨散色谱图;FIG5 is an example of the present invention - a chromatogram of Qinggusan reconstructed based on a linear neural network;

图6本发明的示例-缺失胡黄连清骨散、胡黄连及差谱对比图;FIG6 is an example of the present invention - missing Hu Huang Lian Qing Gu San, Hu Huang Lian and difference spectrum comparison diagram;

图7本发明的示例-缺失甘草清骨散、胡黄连及差谱对比图;FIG. 7 is an example of the present invention - missing Gancao Qinggu San, Picrorhizoma Coptidis and difference spectrum comparison diagram;

图8本发明的示例-缺失秦艽清骨散、胡黄连及差谱对比图。FIG8 is an example of the present invention - missing Qinji Qinggu San, Picrorhizoma Coptidis and difference spectrum comparison diagram.

具体实施方式DETAILED DESCRIPTION

下面结合附图及实施例对本发明做进一步的详细说明。The present invention is further described in detail below in conjunction with the accompanying drawings and embodiments.

基于人工神经网络模型和差谱方法的中药复方样品色谱分析方法,基于液相色谱技术,采用光谱匹配的方式进行峰匹配,采用重采样的方法进行色谱时间校正,使用人工神经网络模型计算单味样品权重并重构谱图,使用差谱方法解析中药复方的色谱指纹图谱,包括以下步骤:The chromatographic analysis method of a Chinese medicine compound sample based on an artificial neural network model and a difference spectrum method is based on liquid chromatography technology, uses a spectrum matching method to perform peak matching, uses a resampling method to perform chromatographic time correction, uses an artificial neural network model to calculate the weight of a single flavor sample and reconstruct the spectrum, and uses a difference spectrum method to analyze the chromatographic fingerprint of the Chinese medicine compound, including the following steps:

(1)根据不同样品的二极管阵列检测器采集二极管阵列紫外光谱(DAD)数据,根据复方样品特点选定统一波长提取色谱并积分,采用光谱匹配的方式进行色谱积分结果的峰匹配;(1) Collecting diode array ultraviolet spectrum (DAD) data according to the diode array detector of different samples, selecting a unified wavelength to extract and integrate the chromatogram according to the characteristics of the compound sample, and performing peak matching of the chromatogram integration results by spectrum matching;

(2)在步骤(1)中所使用的匹配方法,使用在时间窗口T内的峰提取顶点光谱图,存为峰-光谱向量Xij,其中i为峰编号,j为所属复方或单味饮片样品编号,为无关正整数,对X11,X22……Xij中任意两个进行余弦相似度计算,所得结果为

Figure BDA0002809609590000041
其相似度公式如下:(2) The matching method used in step (1) uses the peaks in the time window T to extract the vertex spectra and stores them as peak-spectrum vectors Xij , where i is the peak number and j is the sample number of the compound or single-flavor herbal medicine piece to which it belongs, which are irrelevant positive integers. The cosine similarity calculation is performed on any two of Xi11 , Xi22 ... Xij , and the result is
Figure BDA0002809609590000041
The similarity formula is as follows:

Figure BDA0002809609590000042
Figure BDA0002809609590000042

(3)将相同时间窗口T内计算得到的ω最大,且大于阈值的峰归为统一峰Fi,得到匹配结果,其中Fi为峰类编号。(3) The peaks with the largest ω calculated within the same time window T and greater than the threshold are classified as unified peaks F i to obtain matching results, where F i is the peak class number.

(4)根据在步骤(3)中所得到的色谱峰匹配结果,以色谱峰顶点作为基准,使用傅里叶变换法对色谱进行重采样,并依据时间间隔进行色谱时间校正,其算法核心为:(4) Based on the chromatographic peak matching result obtained in step (3), the chromatographic peak apex is used as a reference, the chromatogram is resampled using the Fourier transform method, and the chromatographic time correction is performed according to the time interval. The core of the algorithm is:

Figure BDA0002809609590000043
Figure BDA0002809609590000043

其中t为时间,f(t)为色谱信号函数,p为抽样速率,k为正整数。Where t is time, f(t) is the chromatographic signal function, p is the sampling rate, and k is a positive integer.

(5)根据在步骤(4)中所得到的校正色谱,使用Least mean square算法(LMS)分别拟合线性人工神经网络模型,其LMS学习算法及线性人工神经网络公式如下:(5) According to the calibrated chromatogram obtained in step (4), a least mean square algorithm (LMS) is used to fit the linear artificial neural network model. The LMS learning algorithm and the linear artificial neural network formula are as follows:

设复方图谱(标准复方样品图谱或待分析复方样品谱)为向量

Figure BDA0002809609590000044
单味饮片谱图为
Figure BDA0002809609590000045
(j为所属单味饮片样品编号)。将
Figure BDA0002809609590000046
组合构成单味饮片谱图矩阵D,定义系数向量为
Figure BDA0002809609590000047
在迭代次数为n时系数向量为
Figure BDA0002809609590000048
学习率为α,残差为‖Δ‖则有:Let the compound spectrum (standard compound sample spectrum or compound sample spectrum to be analyzed) be vector
Figure BDA0002809609590000044
The spectrum of single herbal medicine pieces is
Figure BDA0002809609590000045
(j is the sample number of the single herbal medicine piece).
Figure BDA0002809609590000046
The single-ingredient herbal medicine spectrum matrix D is composed, and the coefficient vector is defined as
Figure BDA0002809609590000047
When the number of iterations is n, the coefficient vector is
Figure BDA0002809609590000048
If the learning rate is α and the residual is ‖Δ‖, then:

Figure BDA0002809609590000049
Figure BDA0002809609590000049

对Θ求偏导则有:Taking partial derivatives of Θ, we have:

Figure BDA00028096095900000410
Figure BDA00028096095900000410

此时有迭代公式:At this time, there is an iterative formula:

Figure BDA00028096095900000411
Figure BDA00028096095900000411

迭代至当‖Δ‖小于特定值时停止学习,得到

Figure BDA00028096095900000412
Iterate until ‖Δ‖ is less than a certain value and stop learning, and get
Figure BDA00028096095900000412

其中,对于每次分析中具体的复方样品模型输入为复方谱图向量

Figure BDA00028096095900000413
模型输出为样品系数向量
Figure BDA00028096095900000414
Among them, the input of the specific compound sample model in each analysis is the compound spectrum vector
Figure BDA00028096095900000413
The model output is a sample coefficient vector
Figure BDA00028096095900000414

(6)使用未知复方样品输入线性人工神经网络模型得到结果

Figure BDA00028096095900000415
使用标准复方样品输入线性人工神经网络模型得到结果
Figure BDA00028096095900000416
与单味饮片谱图矩阵计算得到解析谱图,其公式为:(6) Use unknown compound samples to input the linear artificial neural network model to obtain the results
Figure BDA00028096095900000415
The results were obtained by inputting the standard compound samples into the linear artificial neural network model.
Figure BDA00028096095900000416
The analytical spectrum is calculated with the spectrum matrix of single-flavor decoction pieces, and the formula is:

Figure BDA00028096095900000417
Figure BDA00028096095900000417

其中P为差谱结果,D为单味饮片谱图,

Figure BDA00028096095900000418
为标准谱人工神经网络模型参数,
Figure BDA00028096095900000419
为未知样品谱人工神经网络模型参数。Where P is the difference spectrum result, D is the spectrum of a single herbal medicine piece,
Figure BDA00028096095900000418
are the standard spectral artificial neural network model parameters,
Figure BDA00028096095900000419
are the artificial neural network model parameters of the unknown sample spectrum.

(7)对于差谱结果P,与相应单味饮片谱图对照,可进行直观对比与分析,根据差谱特点,可轻易解析复方样品中缺失的饮片药材。(7) The difference spectrum result P can be compared with the corresponding single-ingredient herbal medicine spectrum for intuitive comparison and analysis. Based on the characteristics of the difference spectrum, the missing herbal medicine in the compound sample can be easily analyzed.

通过选取时间窗口T值,大小表示参与匹配的峰的时间范围,取值范围在(0~10)分钟之间。ω值大小表示不同样品的样品间不同峰的光谱相似程度,取值范围在(0~1)之间。在相同时间口内,选取ω取值范围在(0.2~∞)之间的值,并取最大值作为匹配峰。By selecting the time window T value, the size indicates the time range of the peaks involved in the matching, and the value range is between (0 and 10) minutes. The ω value indicates the spectral similarity between different peaks of different samples, and the value range is between (0 and 1). Within the same time window, select the value of ω in the range of (0.2 to ∞), and take the maximum value as the matching peak.

通过二极管阵列三维数据或三维质谱数据进行色谱峰峰相似度计算,包括其特征是所述的二维检测技术,包括但不限于高效液相色谱-DAD检测器、高效液相色谱-质谱检测器、气相色谱-DAD检测器、气相色谱-质谱检测器、高效毛细管电泳色谱-DAD检测器或高效毛细管电泳色谱-质谱检测器的时间-强度-波长/荷质比。The chromatographic peak similarity calculation is performed through diode array three-dimensional data or three-dimensional mass spectrometry data, including the two-dimensional detection technology, including but not limited to the time-intensity-wavelength/charge-to-mass ratio of a high performance liquid chromatography-DAD detector, a high performance liquid chromatography-mass spectrometry detector, a gas chromatography-DAD detector, a gas chromatography-mass spectrometry detector, a high performance capillary electrophoresis chromatography-DAD detector or a high performance capillary electrophoresis chromatography-mass spectrometry detector.

根据匹配峰保留时间,将相邻峰顶点间的数据进行重采样使数据点数目一致,使用相同的时间间隔校正色谱。Based on the matching peak retention times, the data between adjacent peak apexes were resampled to make the number of data points consistent, and the chromatograms were corrected using the same time interval.

使用复方色谱与单味饮片色谱进行线性模型构建,算法包括但不限于LSM、梯度下降和最小二乘法最小化残差。以模型与目标残差‖Δ‖为终止条件,当其满足取值范围时终止,‖Δ‖取值范围为(-∞~0.5)。The linear model was constructed using compound chromatograms and single herbal medicine chromatograms, and the algorithms included but were not limited to LSM, gradient descent, and least squares method to minimize the residual. The model and target residual ‖Δ‖ was used as the termination condition, and the termination was terminated when it met the value range, and the value range of ‖Δ‖ was (-∞~0.5).

将进行时间校正后的色谱或又数据模型重构色谱进行差减得到色谱差谱。The chromatogram after time correction or the chromatogram reconstructed by the data model is subtracted to obtain the chromatogram difference spectrum.

通过谱中药色谱差谱进行物质成分分析,包括其特征是所述的色谱方法包括高效液相色谱、气相色谱或高效毛细管电泳色谱。The material components are analyzed by using the chromatographic difference spectrum of traditional Chinese medicine, and its characteristic is that the chromatographic method includes high performance liquid chromatography, gas chromatography or high performance capillary electrophoresis chromatography.

如图1所示,本发明根据不同样品的二极管阵列检测器采集DAD数据图,根据药材选定波长提取色谱并积分,采用光谱匹配的方式进行色谱积分结果的峰匹配,使用匹配结果,采用重采样方法进行色谱峰校正,进而使用线性人工神经网络模型,得到色谱模型,重构色谱后进行差减,得到差谱数据。As shown in Figure 1, the present invention collects DAD data graphs according to the diode array detector of different samples, extracts and integrates the chromatogram according to the wavelength selected by the medicinal material, uses spectrum matching to perform peak matching of the chromatogram integration result, uses the matching result, adopts a resampling method to perform chromatogram peak correction, and then uses a linear artificial neural network model to obtain a chromatogram model, reconstructs the chromatogram, and performs subtraction to obtain differential spectrum data.

以清骨散复方及单味样品为例,使用清骨散样品,其色谱图如图2所示,使用在时间窗口内的峰提取顶点光谱图,存为向量存为峰-光谱向量Xij,其中i为峰编号,j为所属样品编号,为无关正整数,选取时间窗口为1分钟,在时间窗口T内进行余弦相似度计算,对X11,X22……Xij中任意两个进行余弦相似度计算,所得结果为

Figure BDA0002809609590000051
控制相似度阈值为0.9,取最大值ω后得到匹配结果如表1所示。Taking the Qinggusan compound and single-flavor samples as examples, the Qinggusan sample is used, and its chromatogram is shown in Figure 2. The peak in the time window is used to extract the vertex spectrum, which is stored as a vector stored as a peak-spectrum vector Xij , where i is the peak number, j is the sample number to which it belongs, and is an irrelevant positive integer. The time window is selected as 1 minute, and the cosine similarity calculation is performed within the time window T. The cosine similarity calculation is performed on any two of X11 , X22 ... Xij , and the result is
Figure BDA0002809609590000051
The similarity threshold is controlled to 0.9, and the matching results are obtained after taking the maximum value ω as shown in Table 1.

依据表1内容作为内标峰,进行清骨散色谱图校正,首先对色谱图傅里叶变化得到F(w),对其进行傅里叶逆变换进行重采样,所得校正色谱图如图3所示。The chromatogram of Qinggu Powder was calibrated based on the contents of Table 1 as the internal standard peak. First, the chromatogram was Fourier transformed to obtain F(w), and then the inverse Fourier transform was performed to resample it. The obtained calibrated chromatogram is shown in Figure 3.

使用如图4所示的线性人工神经网络模型对清骨散进行建模,所得利用模型重构清骨散色谱图,所的模型如图5所示。The linear artificial neural network model shown in FIG4 was used to model Qinggu Powder, and the chromatogram of Qinggu Powder was reconstructed using the model. The obtained model is shown in FIG5 .

将清骨散标准谱图与缺失不同药材的清骨散色图谱进行差减,得到差谱,如图6至图8所示,其中,图6为清骨散标准谱图,缺失胡黄连清骨散的谱图,胡黄连色谱图和差谱谱图,图7为清骨散标准谱图,缺失甘草清骨散的谱图,甘草色谱图和差谱谱图,图8为清骨散标准谱图,缺失秦艽清骨散的谱图,秦艽色谱图和差谱谱图。The standard spectrum of Qinggu San was subtracted from the spectrum of Qinggu San with different medicinal materials missing to obtain the difference spectrum, as shown in Figures 6 to 8, wherein Figure 6 is the standard spectrum of Qinggu San, the spectrum of Huhuanglian Qinggu San missing, the Huhuanglian chromatogram and the difference spectrum, Figure 7 is the standard spectrum of Qinggu San, the spectrum of Gancao Qinggu San missing, the chromatogram and the difference spectrum of Gancao, and Figure 8 is the standard spectrum of Qinggu San, the spectrum of Qinjiao Qinggu San missing, the chromatogram and the difference spectrum of Qinjiao.

与单味药材饮片色谱对比,可轻易分析出复方样品中缺失的单味饮片药材。By comparing with the chromatogram of single-ingredient medicinal materials, the missing single-ingredient medicinal materials in the compound sample can be easily analyzed.

表1.清骨散与单味饮片样品样品峰匹配结果表Table 1. Peak matching results of Qinggusan and single-ingredient decoction pieces

Figure BDA0002809609590000061
Figure BDA0002809609590000061

本发明首先根据复方样品色谱图及其DAD谱图,对样品峰进行匹配,根据样品峰的实际分布情况,选择匹配的时间窗口与相似度阈值,通常时间窗口控制在0.1~10分钟间,相似度阈值在(0~1)之间,如使用8分钟为时间窗,0.2为相似度,进行色谱峰匹配。然后根据峰匹配结果作为内标物,使用傅里叶变换对色谱数据进行重采样并进行时间校正。在充分考虑复方样品中的峰来源同时,扩校正范围。最后使用线性人工神经网络模型建立复方样品与单味饮片样品的解析模型,使用模型重构色谱图后进行差谱运算,进行解析。The present invention first matches the sample peaks according to the compound sample chromatogram and its DAD spectrum, and selects the matching time window and similarity threshold according to the actual distribution of the sample peaks. Usually, the time window is controlled between 0.1 and 10 minutes, and the similarity threshold is between (0 and 1), such as using 8 minutes as the time window and 0.2 as the similarity to match the chromatographic peaks. Then, according to the peak matching result as the internal standard, the chromatographic data is resampled and time-corrected using Fourier transform. While fully considering the peak source in the compound sample, the correction range is expanded. Finally, a linear artificial neural network model is used to establish an analytical model for the compound sample and the single-flavor decoction piece sample, and a differential spectrum operation is performed after the chromatogram is reconstructed using the model for analysis.

Claims (9)

1. The chromatographic analysis method of the traditional Chinese medicine compound sample based on the artificial neural network and the difference spectrum is characterized by comprising the following steps of:
for different compound or single traditional Chinese medicine samples, setting fixed wavelength for each sample through a diode array detector to acquire ultraviolet spectrum data, extracting chromatograms and integrating different samples, and performing peak matching of chromatogram integration results on any two samples in a spectrum matching mode;
in the same time window T in the mode of spectrum matching, classifying the peak with the maximum similarity value omega and larger than the threshold value as a uniform peak F i As a result of chromatographic peak matching, wherein F i Numbering peak classes;
for each chromatographic peak matching result, taking the chromatographic peak top as a reference, resampling the chromatographic peak by using a Fourier transform method, and carrying out chromatographic time correction according to time intervals to obtain a corrected compound traditional Chinese medicine spectrogram and a corrected single traditional Chinese medicine spectrogram;
respectively fitting an unknown compound sample input linear artificial neural network model and a known compound sample input linear artificial neural network model according to the correction chromatogram, wherein the outputs of the two models are sample coefficient vectors
Figure FDA0003969507080000011
For unknown compound Chinese medicine sample, the corrected compound Chinese medicine spectrogram is input into linear artificial neural network model by using the unknown compound Chinese medicine sample to obtain the result
Figure FDA0003969507080000012
Inputting the corrected compound Chinese medicinal spectrogram or single Chinese medicinal spectrogram into linear artificial neural network model to obtain the result
Figure FDA0003969507080000013
Operating with the spectrogram of a single Chinese medicine to obtain an analytic spectrogram;
the fitting of the unknown compound sample input linear artificial neural network model or the standard compound sample input linear artificial neural network model comprises the following steps:
the corrected chromatogram is a compound Chinese medicinal spectrogram or a single Chinese medicinal spectrogramCorrecting the chromatogram to vector
Figure FDA0003969507080000014
The spectrogram of the single-ingredient decoction piece is
Figure FDA0003969507080000015
j is the sample number of the single-ingredient decoction pieces, and
Figure FDA0003969507080000016
combining to form single-ingredient decoction piece spectrogram matrix D with defined coefficient vector of
Figure FDA0003969507080000017
The coefficient vector is n when the number of iterations is n
Figure FDA0003969507080000018
The learning rate is α, the residual error is | Δ | then:
Figure FDA0003969507080000019
the partial derivatives for Θ are:
Figure FDA00039695070800000110
there is an iterative formula at this time:
Figure FDA00039695070800000111
iterating to stop learning when | is less than the set value, to obtain
Figure FDA00039695070800000112
Wherein, the input of the unknown compound sample input linear artificial neural network model and the standard compound sample input linear artificial neural network model is vector
Figure FDA00039695070800000113
The model output is the sample coefficient vector
Figure FDA00039695070800000114
And comparing the analysis spectrogram with the spectrogram of a single traditional Chinese medicine of a known sample to analyze the missing medicinal materials of the decoction pieces in the compound sample.
2. The chromatographic analysis method for the traditional Chinese medicine compound sample based on the artificial neural network and the difference spectrum as claimed in claim 1, wherein the peak matching of the chromatographic integration result is performed on any two samples in a spectral matching manner, comprising the following steps:
extracting a vertex spectrogram using peaks within a time window T, stored as a peak-to-spectrum vector X ij Wherein i is a peak number, and j is a number of the compound or single decoction piece sample and is an irrelevant positive integer;
to X 11 ,X 22 ……X ij Any two of the above two are subjected to cosine similarity calculation, and the obtained similarity result is correspondingly:
Figure FDA0003969507080000021
wherein k is a peak number, m is a sample number of the compound or single decoction piece, j is not equal to m, and the similarity formula is as follows:
Figure FDA0003969507080000022
wherein j ≠ m.
3. The chromatographic analysis method for the traditional Chinese medicine compound sample based on the artificial neural network and the difference spectrum as claimed in claim 1, characterized in that: the time window T value is selected, the size of the time window T value represents the time range of the peak participating in matching, and the value range is between 0 and 10 minutes.
4. The method for chromatographic analysis of compound Chinese medicine samples based on artificial neural networks and difference spectra as claimed in claim 1, wherein the re-sampling of the chromatogram using fourier transform method and the time correction of the chromatogram according to the time interval are achieved by the following formula:
Figure FDA0003969507080000023
where t is time, f (t) is the chromatographic signal function, p is the sampling rate, and k is a positive integer.
5. The method for chromatographic analysis of Chinese herbal compound sample based on artificial neural network and difference spectrum as claimed in claim 1, wherein said analytical spectrogram is obtained by the following formula:
Figure FDA0003969507080000024
wherein P is the difference spectrum result, D is the single-flavor decoction piece spectrogram,
Figure FDA0003969507080000025
inputting output parameters of a linear artificial neural network model for a standard compound sample,
Figure FDA0003969507080000026
and inputting output parameters of the linear artificial neural network model for the unknown compound sample.
6. The chromatographic analysis method for the traditional Chinese medicine compound sample based on the artificial neural network and the difference spectrum as claimed in claim 1, wherein the similarity value ω represents the spectral similarity degree of different peaks among samples of different samples, and the value range is between 0 and 1.
7. The chromatographic analysis method for compound traditional Chinese medicine samples based on the artificial neural network and the difference spectrum as claimed in claim 1, wherein the diode array detector can be replaced by one of a high performance liquid chromatography-DAD detector, a high performance liquid chromatography-mass spectrometry detector, a gas chromatography-DAD detector, a gas chromatography-mass spectrometry detector, a high performance capillary electrophoresis chromatography-DAD detector or a high performance capillary electrophoresis-mass spectrometry detector.
8. The method for chromatographic analysis of traditional Chinese medicine compound samples based on artificial neural networks and difference spectra as claimed in claim 1, wherein the construction of the unknown compound sample input linear artificial neural network model or the standard compound sample input linear artificial neural network model minimizes the residual error by one of LSM, gradient descent and least squares.
9. The method for chromatographic analysis of Chinese herbal compound sample based on artificial neural network and difference spectrum as claimed in claim 1, wherein the chromatogram is high performance liquid chromatography, gas chromatography or high performance capillary electrophoresis chromatography.
CN202011385656.1A 2020-12-01 2020-12-01 Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum Active CN114577967B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011385656.1A CN114577967B (en) 2020-12-01 2020-12-01 Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011385656.1A CN114577967B (en) 2020-12-01 2020-12-01 Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum

Publications (2)

Publication Number Publication Date
CN114577967A CN114577967A (en) 2022-06-03
CN114577967B true CN114577967B (en) 2023-03-24

Family

ID=81766925

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011385656.1A Active CN114577967B (en) 2020-12-01 2020-12-01 Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum

Country Status (1)

Country Link
CN (1) CN114577967B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116973478B (en) * 2023-07-31 2024-01-19 中国人民解放军联勤保障部队第九六四医院 Quality standard analysis method of bletilla striata compound spleen-tonifying traditional Chinese medicine
CN116973495B (en) * 2023-09-21 2023-12-15 山东鲁地源天然药物有限公司 Analysis and management system for detection data of traditional Chinese medicine decoction pieces based on gas chromatograph

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005057208A1 (en) * 2003-12-03 2005-06-23 Prolexys Pharmaceuticals, Inc. Methods of identifying peptides and proteins
WO2006110848A2 (en) * 2005-04-11 2006-10-19 Cerno Bioscience Llc Chromatographic and mass spectral data analysis
WO2011154259A1 (en) * 2010-06-09 2011-12-15 Galderma Research & Development Personal Identification Based on Sebum Composition
CN109507315A (en) * 2018-11-15 2019-03-22 宁夏医科大学 Complex samples GC-MS, which is automatically parsed, realizes that compound accurately identifies and the method for otherness component screening

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005057208A1 (en) * 2003-12-03 2005-06-23 Prolexys Pharmaceuticals, Inc. Methods of identifying peptides and proteins
WO2006110848A2 (en) * 2005-04-11 2006-10-19 Cerno Bioscience Llc Chromatographic and mass spectral data analysis
WO2011154259A1 (en) * 2010-06-09 2011-12-15 Galderma Research & Development Personal Identification Based on Sebum Composition
CN109507315A (en) * 2018-11-15 2019-03-22 宁夏医科大学 Complex samples GC-MS, which is automatically parsed, realizes that compound accurately identifies and the method for otherness component screening

Non-Patent Citations (10)

* Cited by examiner, † Cited by third party
Title
Simultaneous ultra-trace quantitative colorimetric determination of antidiabetic drugs based on gold nanoparticles aggregation using multivariate calibration and neural network methods;Maryam Moradi 等;《Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy》;20200312;第1-9页 *
一种评价中药色谱指纹谱相似性的新方法:向量夹角法;王龙星 等;《药学学报》;20021231;第37卷(第09期);第713-717页 *
中药及其复方指纹图谱的研究新进展;聂燕 等;《时珍国医国药》;20121231;第23卷(第12期);第3110-3112页 *
中药复方质量控制方法研究进展;黄瑛;《药学实践杂志》;20081231;第26卷(第01期);第11-13,37页 *
中药色谱指纹图谱相似性计算方法的研究;陈闽军 等;《中成药》;20021231;第24卷(第12期);第905-908页 *
人工神经元网络方法在毛细管电泳和色谱分析中的应用;张雅雄 等;《分析化学》;20040531;第32卷(第05期);第673-678页 *
基于化学模式识别的栀子UPLC定量指纹图谱研究;徐鑫 等;《中国中药杂志》;20200930;第45卷(第18期);第4416-4422页 *
复方有效部位与组方药材指纹图谱色谱峰相关性研究;胡柳 等;《中国药学杂志》;20041231;第39卷(第12期);第895-898页 *
谱效关系分析在中药组方研究中的应用进展;冯鑫 等;《中国中医基础医学杂志》;20180331;第24卷(第03期);第422-427页 *
质谱差谱方法及其在中药复方研究中的应用;冉小蓉 等;《高等学校化学学报》;20070228;第28卷(第02期);第250-253页 *

Also Published As

Publication number Publication date
CN114577967A (en) 2022-06-03

Similar Documents

Publication Publication Date Title
CN103487395B (en) A kind of Paris polyphylla medicinal material multiple index quick detecting method
CN104792652B (en) A multi-indicator rapid detection method for Radix Astragali
CN114577967B (en) Chromatographic analysis method of traditional Chinese medicine compound samples based on artificial neural network and differential spectrum
Zhu et al. Simultaneous measurement of contents of liquirtin and glycyrrhizic acid in liquorice based on near infrared spectroscopy
Dong et al. Deep learning for geographical discrimination of Panax notoginseng with directly near-infrared spectra image
CN105574474A (en) Mass spectrometry information-based biological characteristic image identification method
CN103411906B (en) The near infrared spectrum qualitative identification method of pearl powder and oyster shell whiting
CN104990895B (en) A kind of near infrared spectrum signal standards normal state bearing calibration based on regional area
CN103344572A (en) Method for evaluating blending homogeneity of cut rolled stems and regenerated cut tobaccos in cigarettes
CN104569263A (en) Method for quickly and accurately evaluating quality stability of cigarette product
Fu et al. Simple automatic strategy for background drift correction in chromatographic data analysis
CN105352913B (en) A kind of method of near infrared spectrum detection Ganodenna Lucidum P.E polyoses content
CN109001181B (en) A rapid identification method of edible oil species based on Raman spectroscopy canonical correlation analysis
CN105548079A (en) Method for determining cut tobacco composition based on near infrared spectrum
CN114088661A (en) Online prediction method for chemical components in tobacco leaf curing process based on transfer learning and near infrared spectrum
CN107449753A (en) The method of rutin content near infrared spectrum quick test sophora flower processed product
CN106404711B (en) Method for identifying adulteration of Chinese yam wall-broken decoction pieces
Zhang et al. Improving the geographical origin classification of Radix glycyrrhizae (licorice) through hyperspectral imaging assisted by U-Net fine structure recognition
Ma et al. Maintaining the predictive abilities of near-infrared spectroscopy models for the determination of multi-parameters in White Paeony Root
Zhong et al. Spatial and temporal distribution characteristics of Paris polyphylla var. yunnanensis and the prediction of steroidal saponins content
WO2020248961A1 (en) Method for selecting spectral wavenumber without reference value
CN102485249A (en) Detection method for quality evaluation of saffron
CN105787518B (en) A near-infrared spectral preprocessing method based on zero-space projection
CN114295751A (en) Method for evaluating quality of radix linderae based on multi-wavelength fingerprint spectrum
CN117958469A (en) A method and device for predicting water-soluble sugar content in tobacco leaf baking process

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant