CN108898160A - Breast cancer tissue's Pathologic Grading method based on CNN and image group Fusion Features - Google Patents
Breast cancer tissue's Pathologic Grading method based on CNN and image group Fusion Features Download PDFInfo
- Publication number
- CN108898160A CN108898160A CN201810555876.0A CN201810555876A CN108898160A CN 108898160 A CN108898160 A CN 108898160A CN 201810555876 A CN201810555876 A CN 201810555876A CN 108898160 A CN108898160 A CN 108898160A
- Authority
- CN
- China
- Prior art keywords
- cnn
- model
- breast cancer
- features
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
Description
技术领域technical field
本发明涉及CNN及图像分类识别技术领域,尤其涉及基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法。The invention relates to the technical field of CNN and image classification and recognition, in particular to a histopathological grading method for breast cancer based on fusion of CNN and radiomics features.
背景技术Background technique
乳腺癌作为女性常见的癌症,是第二个最容易引起女性死亡的疾病。全球乳腺癌发病率自20世纪70年代末开始一直呈上升趋势,而因乳腺癌死亡的病人不在少数。乳腺钼靶X线摄影检查技术是目前判别乳腺疾病的首选和最简便、最可靠的无创性检测手段,并且分辨率高,有助于早期发现乳腺癌。Breast cancer is the most common cancer among women and the second most common cause of death among women. The global incidence of breast cancer has been on the rise since the late 1970s, and many patients died of breast cancer. Mammography mammography is the first choice, the easiest and most reliable non-invasive detection method for judging breast diseases at present, and its high resolution is helpful for early detection of breast cancer.
近年来,随着大数据和高性能计算的发展,CNN(卷积神经网络)在计算机视觉领域取得了显著性的成绩,在自然图像分类上的识别率已超过人类识别水平。CNN通过多层卷积和池化提取图像特征,然后通过反向传播算法进行参数更新,改变了以往人工设计特征受人们经验的局限性。In recent years, with the development of big data and high-performance computing, CNN (Convolutional Neural Network) has achieved remarkable results in the field of computer vision, and the recognition rate in natural image classification has exceeded the human recognition level. CNN extracts image features through multi-layer convolution and pooling, and then updates parameters through the backpropagation algorithm, which changes the limitations of human experience in previous manual design features.
乳腺癌的组织病理学分级(SBR分级)主要是通过癌细胞有丝分裂的指数、乳腺腺管的差异以及癌细胞核的异型性三个方面的图像形态学特性联合进行评估,乳腺癌的组织病理学分级和患者的预后具有重要的关系,在同一临床分期内,患者的5年生存率随着组织病理学分级的提高而下降。而对乳腺癌SBR分级进行判别主要是通过在显微镜下观察患者病理切片的癌细胞分化情况,目前医生尚不能直接从常规的钼靶图像进行分级判别。The histopathological grading of breast cancer (SBR grading) is mainly based on the combined evaluation of image morphological characteristics in three aspects: the index of cancer cell mitosis, the difference of mammary ducts, and the atypia of cancer cell nuclei. The histopathological grading of breast cancer It has an important relationship with the prognosis of patients. In the same clinical stage, the 5-year survival rate of patients decreases with the increase of histopathological grade. The classification of SBR of breast cancer is mainly based on observing the differentiation of cancer cells in pathological sections of patients under a microscope. At present, doctors cannot directly judge the classification from conventional mammography images.
发明内容Contents of the invention
针对上述问题,本发明提出了基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法,可以通过直接对患者的乳腺钼靶图像进行分析,通过将人工设计的影像组学特征和CNN自动提取的图像高层语义特征在新添加的全连接层上进行特征融合,训练特征融合后的CNN模型从而得出患者所处的乳腺癌组织病理学等级,为进一步的疾病判别和预后分析提供依据。In view of the above problems, the present invention proposes a breast cancer histopathological grading method based on the fusion of CNN and radiomics features, which can directly analyze the mammogram images of patients, and automatically combine the artificially designed radiomics features with CNN The extracted high-level semantic features of the image are fused on the newly added fully connected layer, and the CNN model after the feature fusion is trained to obtain the histopathological grade of breast cancer of the patient, which provides a basis for further disease discrimination and prognosis analysis.
为了实现上述目的,本发明采用以下技术方案:In order to achieve the above object, the present invention adopts the following technical solutions:
基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法,包括以下步骤:A breast cancer histopathological grading method based on the fusion of CNN and radiomics features, including the following steps:
步骤1:对乳腺钼靶图像肿瘤区域进行提取,在提取的钼靶肿瘤区域上进行灰度、纹理和小波特征的计算,通过上述计算共提取180维影像组学特征向量;将提取的乳腺钼靶图像肿瘤区域制作成相同大小的乳腺肿瘤区域钼靶图像样本,将图像样本划分为训练集、验证集和测试集;Step 1: Extract the tumor area of the mammography image, calculate the grayscale, texture and wavelet features on the extracted mammography tumor area, and extract a total of 180-dimensional radiomics feature vectors through the above calculation; the extracted mammography The target image tumor area is made into breast tumor area mammography image samples of the same size, and the image samples are divided into training set, verification set and test set;
步骤2:对提取的180维影像组学特征向量,采用LASSO logistic回归模型进行特征筛选,利用筛选后的影像组学特征以进行特征融合;Step 2: For the extracted 180-dimensional radiomics feature vector, use the LASSO logistic regression model to perform feature screening, and use the screened radiomics features to perform feature fusion;
步骤3:采用预训练的CNN模型进行迁移学习,训练CNN分级模型,在CNN分级模型的原有基础上添加新的全连接层,在新的全连接层上将CNN分级模型全连接层之前的输出和筛选后的影像组学特征进行特征融合,并在CNN分级模型参数的基础上进行再训练,更新融合后的CNN分级模型参数,根据模型在验证集上的分级效果对融合后的CNN分级模型参数进行调整,得到特征融合的CNN模型,用于对乳腺钼靶图像进行乳腺癌组织病理学分级。Step 3: Use the pre-trained CNN model for migration learning, train the CNN classification model, add a new fully connected layer on the basis of the original CNN classification model, and transfer the CNN classification model before the fully connected layer to the new fully connected layer. The output and screened radiomics features are fused, and retrained on the basis of the CNN classification model parameters, the fused CNN classification model parameters are updated, and the fused CNN classification is performed according to the classification effect of the model on the verification set The model parameters were adjusted to obtain a feature-fused CNN model, which was used for breast cancer histopathological grading on mammography images.
进一步地,在所述步骤3之后还包括:Further, after the step 3, it also includes:
利用测试集对得到特征融合的CNN模型验证模型分级准确率。Use the test set to verify the classification accuracy of the CNN model obtained by feature fusion.
进一步地,所述步骤1包括:Further, said step 1 includes:
步骤1.1:对乳腺钼靶图像肿瘤区域进行ROI提取,得到ROI图像,计算ROI图像的14个灰度特征、22个纹理特征和144个小波特征,共提取180维影像组学特征向量;Step 1.1: Extract the ROI of the tumor area of the mammography image to obtain the ROI image, calculate 14 gray features, 22 texture features and 144 wavelet features of the ROI image, and extract a total of 180-dimensional radiomics feature vectors;
步骤1.2:通过数据增强方法扩充ROI图像的规模;Step 1.2: Expand the scale of the ROI image through data enhancement methods;
步骤1.3:将数据规模扩充后的ROI图像统一缩放到相同大小以适应CNN模型的输入要求。Step 1.3: Scale the ROI images after data scale expansion to the same size to meet the input requirements of the CNN model.
进一步地,所述步骤3包括:Further, said step 3 includes:
步骤3.1:将训练集中乳腺肿瘤区域钼靶图像样本作为CNN模型的输入,在ImageNet自然图像数据集上预训练的CNN模型上进行迁移学习,训练CNN分级模型;Step 3.1: Take the mammography image samples of the breast tumor area in the training set as the input of the CNN model, perform transfer learning on the CNN model pre-trained on the ImageNet natural image dataset, and train the CNN classification model;
步骤3.2:在CNN分级模型的原有基础上添加一个新的全连接层,在新的全连接层上将CNN分级模型全连接层之前的乳腺肿瘤区域钼靶图像高层语义特征输出与采用LASSOlogistic回归模型筛选的影像组学特征进行特征融合,并在CNN分级模型参数的基础上进行再训练,更新融合后的CNN分级模型参数,根据模型在验证集上的分级效果对融合后的CNN分级模型参数进行调整,得到特征融合的CNN模型。Step 3.2: Add a new fully-connected layer on the basis of the original CNN grading model. On the new fully-connected layer, output the high-level semantic features of the breast tumor area mammography image before the fully-connected layer of the CNN grading model and use LASSOlogistic regression The radiomics features screened by the model are fused, and retrained on the basis of the CNN classification model parameters, the fused CNN classification model parameters are updated, and the fused CNN classification model parameters are updated according to the classification effect of the model on the verification set. Make adjustments to obtain the CNN model of feature fusion.
与现有技术相比,本发明具有的有益效果:Compared with the prior art, the present invention has the beneficial effects:
本发明提出通过构建特征融合的CNN模型判断钼靶影像的乳腺癌组织病理学等级,利用钼靶肿瘤区域提取的灰度特征、纹理特征和小波特征,通过LASSO logistic回归模型进行特征筛选,选出与乳腺癌组织病理学等级相关性大的特征,再通过将CNN提取的高层语义特征和筛选出的影像组学特征在网络新添加的全连接层进行特征融合,而拟合得到特征融合的CNN模型用来识别乳腺癌组织病理学等级。本发明能够直接对患者扫描的乳腺钼靶图像进行分析判断患者所处的乳腺癌组织病理学等级,在保证判别精度的同时进一步缩短了判别时间。The present invention proposes to judge the histopathological grade of breast cancer in mammography images by constructing a feature-fused CNN model, using the grayscale features, texture features and wavelet features extracted from the mammography tumor area, and performing feature screening through the LASSO logistic regression model to select For features that are highly correlated with the histopathological grade of breast cancer, the high-level semantic features extracted by CNN and the screened radiomics features are fused in the newly added fully connected layer of the network, and the feature-fused CNN is obtained by fitting. The model was used to identify breast cancer histopathological grades. The invention can directly analyze the mammography target image scanned by the patient to determine the histopathological grade of breast cancer of the patient, and further shorten the discrimination time while ensuring the discrimination accuracy.
附图说明Description of drawings
图1为本发明实施例的基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法的基本流程图。FIG. 1 is a basic flow chart of a breast cancer histopathological grading method based on fusion of CNN and radiomics features according to an embodiment of the present invention.
图2为本发明另一实施例的基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法的基本流程图。FIG. 2 is a basic flowchart of a breast cancer histopathological grading method based on fusion of CNN and radiomics features according to another embodiment of the present invention.
图3为本发明实施例的基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法的不同投照体位的钼靶图像。Fig. 3 is a mammography image of different irradiation positions in the histopathological grading method of breast cancer based on fusion of CNN and radiomics features according to an embodiment of the present invention.
具体实施方式Detailed ways
下面结合附图和具体的实施例对本发明做进一步的解释说明:The present invention will be further explained below in conjunction with accompanying drawing and specific embodiment:
实施例一:Embodiment one:
如图1所示,本发明的一种基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法,包括以下步骤:As shown in Figure 1, a kind of breast cancer histopathological grading method based on CNN and radiomics feature fusion of the present invention comprises the following steps:
步骤S101:对乳腺钼靶图像肿瘤区域进行提取,在提取的钼靶肿瘤区域上进行灰度、纹理和小波特征的计算,通过上述计算共提取180维影像组学特征向量;将提取的乳腺钼靶图像肿瘤区域制作成相同大小的乳腺肿瘤区域钼靶图像样本,将图像样本划分为训练集、验证集和测试集。Step S101: Extract the tumor area of the mammography image, and calculate grayscale, texture and wavelet features on the extracted mammography tumor area, and extract a total of 180-dimensional radiomics feature vectors through the above calculation; the extracted mammography The target image tumor area is made into mammary tumor image samples of the same size, and the image samples are divided into training set, verification set and test set.
步骤S102:对提取的180维影像组学特征向量,采用LASSO logistic回归模型进行特征筛选,利用筛选后的影像组学特征以进行特征融合。Step S102: For the extracted 180-dimensional radiomics feature vector, the LASSO logistic regression model is used for feature screening, and the screened radiomics feature is used for feature fusion.
步骤S103:采用预训练的CNN模型进行迁移学习,训练CNN分级模型,在CNN分级模型的原有基础上添加新的全连接层,在新的全连接层上将CNN分级模型全连接层之前的输出和筛选后的影像组学特征进行特征融合,并在CNN分级模型参数的基础上进行再训练,得到融合后的CNN模型,根据模型在验证集上的分级效果对融合后的CNN分级模型参数进行调整,得到特征融合的CNN模型,用于对乳腺钼靶图像进行乳腺癌组织病理学分级。Step S103: Use the pre-trained CNN model for transfer learning, train the CNN classification model, add a new fully connected layer on the basis of the original CNN classification model, and transfer the CNN classification model before the fully connected layer to the new fully connected layer. The output and screened radiomics features are fused, and retrained on the basis of the CNN classification model parameters to obtain the fused CNN model, and the fused CNN classification model parameters are adjusted according to the classification effect of the model on the verification set. Adjustments were made to obtain a feature-fused CNN model for breast cancer histopathological grading on mammography images.
实施例二:Embodiment two:
如图2所示,本发明的另一种基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法,包括以下步骤:As shown in Figure 2, another breast cancer histopathological grading method based on CNN and radiomics feature fusion of the present invention includes the following steps:
步骤S201:对乳腺钼靶图像肿瘤区域进行提取,在提取的钼靶肿瘤区域上进行灰度、纹理和小波特征的计算,通过上述计算共提取180维影像组学特征向量;将提取的乳腺钼靶图像肿瘤区域制作成相同大小的乳腺肿瘤区域钼靶图像样本,将图像样本划分为训练集、验证集和测试集。Step S201: Extract the tumor area of the mammography image, calculate the grayscale, texture and wavelet features on the extracted mammography tumor area, and extract a total of 180-dimensional radiomics feature vectors through the above calculation; the extracted mammography The target image tumor area is made into mammary tumor image samples of the same size, and the image samples are divided into training set, verification set and test set.
所述步骤S201包括:The step S201 includes:
步骤S2011:对乳腺钼靶图像肿瘤区域进行ROI提取,得到ROI图像,计算ROI图像的14个灰度特征、22个纹理特征和144个小波特征,共提取180维影像组学特征向量;Step S2011: extract the ROI from the tumor area of the mammography image to obtain the ROI image, calculate 14 grayscale features, 22 texture features and 144 wavelet features of the ROI image, and extract a total of 180-dimensional radiomics feature vectors;
所述灰度特征为灰度最大值、最小值、均值、中值、方差、峰态、能量、熵、绝对方差均值、歪斜度、标准差、均匀度、灰度值域、均方根共14个特征(参见Aerts H J W L,Velazquez E R,Leijenaar R T H,et al.Decoding tumour phenotype by noninvasiveimaging using a quantitative radiomics approach[J].Nature communications,2014,5:4006);The grayscale features are grayscale maximum value, minimum value, mean value, median value, variance, kurtosis, energy, entropy, mean absolute variance, skewness, standard deviation, uniformity, grayscale range, root mean square total 14 features (see Aerts H J W L, Velazquez E R, Leijenaar R T H, et al. Decoding tumor phenotype by noninvasive imaging using a quantitative radiomics approach [J]. Nature communications, 2014, 5: 4006);
所述纹理特征为基于灰度共生矩阵衍生出的9维特征,即能量、对比度、熵、同质性、相关性、方差、和平均、差异性、自相关(参见Weszka J S,Dyer C R,Rosenfeld A.Acomparative study of texture measures for terrain classification[J].IEEEtransactions on Systems,Man,and Cybernetics,1976(4):269-285)和基于灰度游程矩阵衍生出的13维特征,即Short Run Emphasis、Long Run Emphasis、Gray-LevelNonuniformity、Run-Length Nonuniformity、Run Percentage、Low Gray-Level RunEmphasis、High Gray-Level Run Emphasis、Short Run Low Gray-Level Emphasis、ShortRun High Gray-Level Emphasis、Long Run Low Gray-Level Emphasis、Long Run HighGray-Level Emphasis、Gray-Level Variance、Run-Length Variance(参见Galloway MM.Texture analysis using grey level run lengths[J].NASA STI/Recon TechnicalReport N,1974,75;Chu A,Sehgal C M,Greenleaf J F.Use of gray valuedistribution of run lengths for texture analysis[J].Pattern RecognitionLetters,1990,11(6):415-419)的共22维特征;The texture feature is a 9-dimensional feature derived based on the gray level co-occurrence matrix, namely energy, contrast, entropy, homogeneity, correlation, variance, and average, difference, autocorrelation (see Weszka J S, Dyer CR, Rosenfeld A. A comparative study of texture measures for terrain classification [J]. IEEE transactions on Systems, Man, and Cybernetics, 1976(4): 269-285) and 13-dimensional features derived from the gray run matrix, namely Short Run Emphasis, Long Run Emphasis, Gray-LevelNonuniformity, Run-Length Nonuniformity, Run Percentage, Low Gray-Level RunEmphasis, High Gray-Level Run Emphasis, Short Run Low Gray-Level Emphasis, ShortRun High Gray-Level Emphasis, Long Run Low Gray-Level Emphasis, Long Run High Gray-Level Emphasis, Gray-Level Variance, Run-Length Variance (see Galloway MM. Texture analysis using gray level run lengths [J]. NASA STI/Recon Technical Report N, 1974, 75; Chu A, Sehgal C M ,Greenleaf J F.Use of gray value distribution of run lengths for texture analysis[J].Pattern RecognitionLetters,1990,11(6):415-419) a total of 22-dimensional features;
所述小波特征为在4个小波分量上分别计算灰度特征和纹理特征,共144个特征。The wavelet features are grayscale features and texture features calculated on 4 wavelet components, 144 features in total.
步骤S2012:通过数据增强方法扩充ROI图像的规模;作为一种可实施方式,可以通过随机平移、旋转、翻转以及多尺度缩放的数据增强方法扩充ROI图像的规模;Step S2012: Enlarge the scale of the ROI image through a data enhancement method; as an implementable manner, the scale of the ROI image can be expanded through a data enhancement method of random translation, rotation, flip, and multi-scale scaling;
步骤S2013:将数据规模扩充后的ROI图像统一缩放到相同大小以适应CNN模型的输入要求。Step S2013: uniformly scale the ROI images after data scale expansion to the same size to meet the input requirements of the CNN model.
步骤S202:对提取的180维影像组学特征向量,采用LASSO logistic回归模型进行特征筛选,选出与乳腺癌组织病理学等级相关性大的特征,利用筛选后的影像组学特征以进行特征融合。Step S202: For the extracted 180-dimensional radiomics feature vector, use the LASSO logistic regression model to perform feature screening, select features that are highly correlated with the histopathological grade of breast cancer, and use the screened radiomics features to perform feature fusion .
LASSO回归是在最小二乘拟合的基础上加入L1正则化项来提高线性回归模型的精度,它的惩罚函数是回归系数的绝对值,这可使一些参数估计结果等于零,因此有助于特征选择。组织病理学分级是一个二元的分类问题,Logistic回归分析是二元分类或者一对多分类常用的广义线性模型,它将简单线性回归的响应归一化到0和1,因此可将LASSO回归模型中的线性回归替代为logistic回归来挑选二元分类的特征。LASSO logistic回归优化的目标函数如下:LASSO regression is to add L1 regularization term on the basis of least squares fitting to improve the accuracy of linear regression model. Its penalty function is the absolute value of regression coefficient, which can make some parameter estimation results equal to zero, so it is helpful for characteristics choose. Histopathological grading is a binary classification problem. Logistic regression analysis is a generalized linear model commonly used for binary classification or one-to-many classification. It normalizes the response of simple linear regression to 0 and 1, so LASSO can be regressed Linear regression in the model is replaced by logistic regression to select features for binary classification. The objective function of LASSO logistic regression optimization is as follows:
其中,n是样本的个数,Xi是一个m×n大小的原始数据,即每个样本有m个特征,yi是每个样本对应的响应值,ω是线性回归系数,b是线性回归的截断值,λ是用来控制回归系数稀疏度的非负正则化参数。将提取的影像组学特征输入LASSO logistic回归模型可以进行影像组学特征筛选。Among them, n is the number of samples, Xi is an m× n size of the original data, that is, each sample has m features, y i is the corresponding response value of each sample, ω is the linear regression coefficient, b is the linear The cutoff value for the regression, λ is a non-negative regularization parameter used to control the sparsity of the regression coefficients. Inputting the extracted radiomics features into the LASSO logistic regression model can perform radiomics feature screening.
步骤S203:采用预训练的CNN模型进行迁移学习,训练CNN分级模型,在CNN分级模型的原有基础上添加新的全连接层,在新的全连接层上将CNN分级模型全连接层之前的输出和筛选后的影像组学特征进行特征融合,并在CNN分级模型参数的基础上进行再训练,更新融合后的CNN分级模型参数,根据模型在验证集上的分级效果对融合后的CNN分级模型参数进行调整,得到特征融合的CNN模型,用于对乳腺钼靶图像进行乳腺癌组织病理学分级。Step S203: Use the pre-trained CNN model for migration learning, train the CNN classification model, add a new fully connected layer on the basis of the original CNN classification model, and transfer the CNN classification model before the fully connected layer to the new fully connected layer. The output and screened radiomics features are fused, and retrained on the basis of the CNN classification model parameters, the fused CNN classification model parameters are updated, and the fused CNN classification is performed according to the classification effect of the model on the verification set The model parameters were adjusted to obtain a feature-fused CNN model, which was used for breast cancer histopathological grading on mammography images.
所述步骤S203包括:The step S203 includes:
步骤S2031:将训练集中乳腺肿瘤区域钼靶图像样本作为CNN模型的输入,在ImageNet自然图像数据集上预训练的CNN模型上进行迁移学习,训练CNN分级模型;Step S2031: using the mammography image samples of the breast tumor area in the training set as the input of the CNN model, performing transfer learning on the CNN model pre-trained on the ImageNet natural image data set, and training the CNN classification model;
步骤S2032:在CNN分级模型的原有基础上添加一个新的全连接层,在新的全连接层上将CNN分级模型全连接层之前的乳腺肿瘤区域钼靶图像高层语义特征输出与采用LASSO logistic回归模型筛选的影像组学特征进行特征融合,并在CNN分级模型参数的基础上进行再训练,更新融合后的CNN分级模型参数,根据模型在验证集上的分级效果对融合后的CNN分级模型参数进行调整,得到特征融合的CNN模型。Step S2032: Add a new fully-connected layer on the basis of the original CNN grading model, and output the high-level semantic features of the breast tumor region mammography image before the fully-connected layer of the CNN grading model on the new fully-connected layer with LASSO logistic The radiomics features screened by the regression model are fused, and retrained on the basis of the parameters of the CNN classification model, and the parameters of the fused CNN classification model are updated. According to the classification effect of the model on the verification set, the fused CNN classification model is The parameters are adjusted to obtain the CNN model of feature fusion.
步骤S204:利用测试集对得到的特征融合的CNN模型验证模型分级准确率。Step S204: Use the test set to verify the classification accuracy of the obtained feature-fused CNN model.
作为一种可实施方式,使用的乳腺钼靶图像数据集共有204个病例,每个病例包含轴位(craniocaudal,CC)图像和侧斜位(mediolateral oblique,MLO)图像,如图3所示,图3中(a)部分为轴位钼靶图像,图3中(b)部分为测斜位钼靶图像。钼靶图像存储采用标准的DICOM格式,其分辨率(宽×高)有3328×4096和2560×3328两种。所有钼靶图像中的肿瘤区域都是经医院专业的放射科医生勾画,并且所有的病例都配有医院病理科的准确诊断结果以确定其病理学等级。通过对不同的乳腺癌钼靶影像组织病理学分级算法在收集的乳腺钼靶图像数据集进行试验,采用分类准确率和AUC值对分类性能进行定量评价,结果如表1所示。本发明实施例的基于CNN和影像组学特征融合的乳腺癌组织病理学分级方法相比GoogLeNet(参见Szegedy C,Liu W,Jia Y,et al.Going deeper with convolutions[C].IEEE Conference on Computer Vision and Pattern Recognition,2015:1-9)和传统的随机森林分类器的分类效果有了显著提升,分类准确率达到0.7500,AUC值达到0.8051。As a possible implementation, the mammography image data set used has a total of 204 cases, and each case contains an axial (craniocaudal, CC) image and a lateral oblique (mediolateral oblique, MLO) image, as shown in Figure 3, Part (a) in Figure 3 is the axial mammography image, and part (b) in Figure 3 is the inclinometric mammography image. The standard DICOM format is used for mammography image storage, and its resolution (width×height) is 3328×4096 and 2560×3328. The tumor areas in all mammography images are delineated by the hospital's professional radiologists, and all cases are accompanied by accurate diagnosis results from the hospital's pathology department to determine their pathological grades. Different mammography imaging histopathological grading algorithms were tested on the collected mammography image data sets, and the classification accuracy and AUC value were used to quantitatively evaluate the classification performance. The results are shown in Table 1. Compared with GoogLeNet (see Szegedy C, Liu W, Jia Y, et al.Going deeper with convolutions[C]. IEEE Conference on Computer Vision and Pattern Recognition, 2015:1-9) and the classification effect of the traditional random forest classifier have been significantly improved, the classification accuracy rate reached 0.7500, and the AUC value reached 0.8051.
表1乳腺癌钼靶影像病理学分级算法分类性能Table 1 Classification performance of breast cancer mammography imaging pathology grading algorithm
以上所示仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。What is shown above is only a preferred embodiment of the present invention. It should be pointed out that for those of ordinary skill in the art, some improvements and modifications can also be made without departing from the principles of the present invention. It should be regarded as the protection scope of the present invention.
Claims (4)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810555876.0A CN108898160B (en) | 2018-06-01 | 2018-06-01 | Breast cancer histopathological grading method based on fusion of CNN and radiomics features |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810555876.0A CN108898160B (en) | 2018-06-01 | 2018-06-01 | Breast cancer histopathological grading method based on fusion of CNN and radiomics features |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN108898160A true CN108898160A (en) | 2018-11-27 |
| CN108898160B CN108898160B (en) | 2022-04-08 |
Family
ID=64343881
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201810555876.0A Active CN108898160B (en) | 2018-06-01 | 2018-06-01 | Breast cancer histopathological grading method based on fusion of CNN and radiomics features |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN108898160B (en) |
Cited By (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109583447A (en) * | 2018-12-11 | 2019-04-05 | 山东省肿瘤防治研究院(山东省肿瘤医院) | A kind of image group credit analysis model building method and analysis method |
| CN109816632A (en) * | 2018-12-25 | 2019-05-28 | 东软集团股份有限公司 | Brain image processing method, device, readable storage medium storing program for executing and electronic equipment |
| CN109919187A (en) * | 2019-01-28 | 2019-06-21 | 浙江工商大学 | A method for fine-tuning a CNN with bagging to classify thyroid follicular pictures |
| CN109948667A (en) * | 2019-03-01 | 2019-06-28 | 桂林电子科技大学 | Image classification method and device for prediction of distant metastasis of head and neck cancer |
| CN109993740A (en) * | 2019-04-02 | 2019-07-09 | 广州中医药大学(广州中医药研究院) | A kind of cell carcinoma image grading method and device |
| CN110175978A (en) * | 2019-04-02 | 2019-08-27 | 南方医科大学南方医院 | A kind of liver cancer image group data processing method, system, device and storage medium |
| CN110533683A (en) * | 2019-08-30 | 2019-12-03 | 东南大学 | A kind of image group analysis method merging traditional characteristic and depth characteristic |
| CN110689960A (en) * | 2019-09-24 | 2020-01-14 | 广州大学 | An intelligent discrimination method, device, equipment and storage medium for breast cancer |
| CN111242174A (en) * | 2019-12-31 | 2020-06-05 | 浙江大学 | A method and device for image feature extraction and pathological classification of liver cancer based on radiomics |
| CN111415741A (en) * | 2020-03-05 | 2020-07-14 | 北京深睿博联科技有限责任公司 | Breast X-ray image classification model training method based on implicit appearance learning |
| CN111583320A (en) * | 2020-03-17 | 2020-08-25 | 哈尔滨医科大学 | Breast cancer ultrasonic image typing method and system fusing deep convolutional network and image omics characteristics and storage medium |
| CN111739033A (en) * | 2020-06-22 | 2020-10-02 | 中山大学肿瘤防治中心(中山大学附属肿瘤医院、中山大学肿瘤研究所) | A method for establishing a radiomics model for mammography and MR imaging based on machine learning |
| CN111914952A (en) * | 2020-08-21 | 2020-11-10 | 山东省医学影像学研究所 | AD characteristic parameter screening method and system based on deep neural network |
| CN112365440A (en) * | 2020-10-10 | 2021-02-12 | 天津大学 | Yoov 3-based mastopathy image mitosis detection device and method |
| CN113035360A (en) * | 2019-12-09 | 2021-06-25 | 浙江普罗亭健康科技有限公司 | Cell classification model learning method |
| CN113436158A (en) * | 2021-06-21 | 2021-09-24 | 遂宁市中心医院 | Liver lump auxiliary identification method based on deep learning |
| CN114170183A (en) * | 2021-12-07 | 2022-03-11 | 南昌大学 | A method and system for predicting chemotherapy sensitivity of ovarian cancer patients based on pathological images |
| CN114171197A (en) * | 2021-11-12 | 2022-03-11 | 东莞市人民医院 | A method for predicting HER2 status of breast cancer and related equipment |
| CN114360718A (en) * | 2022-03-10 | 2022-04-15 | 之江实验室 | Feature fitting-based PET/CT automatic lung cancer diagnosis and classification system and construction method |
| CN114463455A (en) * | 2021-12-21 | 2022-05-10 | 深圳先进技术研究院 | Expert knowledge constraint deep learning method for brain glioma classification |
| CN114511564A (en) * | 2022-04-19 | 2022-05-17 | 天津市肿瘤医院(天津医科大学肿瘤医院) | Image analysis method for breast cancer residual tumor load based on DCE-MRI |
| CN114783517A (en) * | 2022-05-13 | 2022-07-22 | 中国医科大学 | Prediction of RAS gene status of CRLM patients based on imagery omics and semantic features |
| WO2023115333A1 (en) * | 2021-12-21 | 2023-06-29 | 深圳先进技术研究院 | Expert knowledge constraint-based deep learning method for brain glioma grading |
| CN116883995A (en) * | 2023-07-07 | 2023-10-13 | 广东食品药品职业学院 | A system for identifying molecular subtypes of breast cancer |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106096654A (en) * | 2016-06-13 | 2016-11-09 | 南京信息工程大学 | A kind of cell atypia automatic grading method tactful based on degree of depth study and combination |
| CN106326931A (en) * | 2016-08-25 | 2017-01-11 | 南京信息工程大学 | Mammary gland molybdenum target image automatic classification method based on deep learning |
| CN106599883A (en) * | 2017-03-08 | 2017-04-26 | 王华锋 | Face recognition method capable of extracting multi-level image semantics based on CNN (convolutional neural network) |
| CN106683081A (en) * | 2016-12-17 | 2017-05-17 | 复旦大学 | Brain glioma molecular marker nondestructive prediction method and prediction system based on radiomics |
| CN106682435A (en) * | 2016-12-31 | 2017-05-17 | 西安百利信息科技有限公司 | System and method for automatically detecting lesions in medical image through multi-model fusion |
| US20170140253A1 (en) * | 2015-11-12 | 2017-05-18 | Xerox Corporation | Multi-layer fusion in a convolutional neural network for image classification |
| CN106780448A (en) * | 2016-12-05 | 2017-05-31 | 清华大学 | A kind of pernicious sorting technique of ultrasonic Benign Thyroid Nodules based on transfer learning Yu Fusion Features |
| CN107330263A (en) * | 2017-06-26 | 2017-11-07 | 成都知识视觉科技有限公司 | A kind of method of area of computer aided breast invasive ductal carcinoma histological grading |
| CN108062753A (en) * | 2017-12-29 | 2018-05-22 | 重庆理工大学 | The adaptive brain tumor semantic segmentation method in unsupervised domain based on depth confrontation study |
-
2018
- 2018-06-01 CN CN201810555876.0A patent/CN108898160B/en active Active
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170140253A1 (en) * | 2015-11-12 | 2017-05-18 | Xerox Corporation | Multi-layer fusion in a convolutional neural network for image classification |
| CN106096654A (en) * | 2016-06-13 | 2016-11-09 | 南京信息工程大学 | A kind of cell atypia automatic grading method tactful based on degree of depth study and combination |
| CN106326931A (en) * | 2016-08-25 | 2017-01-11 | 南京信息工程大学 | Mammary gland molybdenum target image automatic classification method based on deep learning |
| CN106780448A (en) * | 2016-12-05 | 2017-05-31 | 清华大学 | A kind of pernicious sorting technique of ultrasonic Benign Thyroid Nodules based on transfer learning Yu Fusion Features |
| CN106683081A (en) * | 2016-12-17 | 2017-05-17 | 复旦大学 | Brain glioma molecular marker nondestructive prediction method and prediction system based on radiomics |
| CN106682435A (en) * | 2016-12-31 | 2017-05-17 | 西安百利信息科技有限公司 | System and method for automatically detecting lesions in medical image through multi-model fusion |
| CN106599883A (en) * | 2017-03-08 | 2017-04-26 | 王华锋 | Face recognition method capable of extracting multi-level image semantics based on CNN (convolutional neural network) |
| CN107330263A (en) * | 2017-06-26 | 2017-11-07 | 成都知识视觉科技有限公司 | A kind of method of area of computer aided breast invasive ductal carcinoma histological grading |
| CN108062753A (en) * | 2017-12-29 | 2018-05-22 | 重庆理工大学 | The adaptive brain tumor semantic segmentation method in unsupervised domain based on depth confrontation study |
Non-Patent Citations (5)
| Title |
|---|
| FABIO ALEXANDRE SPANHOL 等: "Breast cancer histopathological image classification using Convolutional Neural Networks", 《2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS》 * |
| HAIBO WANG 等: "Mitosis detection in breast cancer pathology images by combining handcrafted and convolutional neural network features", 《JOURNAL OF MEDICAL IMAGING》 * |
| YAN-QI HUANG 等: "Development and Validation of a Radiomics Nomogram for Preoperative Prediction of Lymph Node Metastasis in Colorectal Cancer", 《JOURNAL OF CLINICAL ONCOLOGY》 * |
| 周超 等: "基于深度卷积网络和结合策略的乳腺组织病理图像细胞核异型性自动评分", 《中国生物医学工程学报》 * |
| 龚磊 等: "基于多特征描述的乳腺癌肿瘤病理自动分级", 《计算机应用》 * |
Cited By (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109583447A (en) * | 2018-12-11 | 2019-04-05 | 山东省肿瘤防治研究院(山东省肿瘤医院) | A kind of image group credit analysis model building method and analysis method |
| CN109816632A (en) * | 2018-12-25 | 2019-05-28 | 东软集团股份有限公司 | Brain image processing method, device, readable storage medium storing program for executing and electronic equipment |
| CN109919187B (en) * | 2019-01-28 | 2021-02-12 | 浙江工商大学 | Method for classifying thyroid follicular picture by using bagging fine tuning CNN |
| CN109919187A (en) * | 2019-01-28 | 2019-06-21 | 浙江工商大学 | A method for fine-tuning a CNN with bagging to classify thyroid follicular pictures |
| CN109948667A (en) * | 2019-03-01 | 2019-06-28 | 桂林电子科技大学 | Image classification method and device for prediction of distant metastasis of head and neck cancer |
| CN109993740A (en) * | 2019-04-02 | 2019-07-09 | 广州中医药大学(广州中医药研究院) | A kind of cell carcinoma image grading method and device |
| CN110175978A (en) * | 2019-04-02 | 2019-08-27 | 南方医科大学南方医院 | A kind of liver cancer image group data processing method, system, device and storage medium |
| CN110533683A (en) * | 2019-08-30 | 2019-12-03 | 东南大学 | A kind of image group analysis method merging traditional characteristic and depth characteristic |
| CN110689960A (en) * | 2019-09-24 | 2020-01-14 | 广州大学 | An intelligent discrimination method, device, equipment and storage medium for breast cancer |
| CN113035360A (en) * | 2019-12-09 | 2021-06-25 | 浙江普罗亭健康科技有限公司 | Cell classification model learning method |
| CN111242174A (en) * | 2019-12-31 | 2020-06-05 | 浙江大学 | A method and device for image feature extraction and pathological classification of liver cancer based on radiomics |
| CN111415741A (en) * | 2020-03-05 | 2020-07-14 | 北京深睿博联科技有限责任公司 | Breast X-ray image classification model training method based on implicit appearance learning |
| CN111415741B (en) * | 2020-03-05 | 2023-09-26 | 北京深睿博联科技有限责任公司 | Mammary gland X-ray image classification model training method based on implicit apparent learning |
| CN111583320A (en) * | 2020-03-17 | 2020-08-25 | 哈尔滨医科大学 | Breast cancer ultrasonic image typing method and system fusing deep convolutional network and image omics characteristics and storage medium |
| CN111739033A (en) * | 2020-06-22 | 2020-10-02 | 中山大学肿瘤防治中心(中山大学附属肿瘤医院、中山大学肿瘤研究所) | A method for establishing a radiomics model for mammography and MR imaging based on machine learning |
| CN111914952A (en) * | 2020-08-21 | 2020-11-10 | 山东省医学影像学研究所 | AD characteristic parameter screening method and system based on deep neural network |
| CN111914952B (en) * | 2020-08-21 | 2024-03-08 | 山东第一医科大学附属省立医院(山东省立医院) | AD characteristic parameter screening method and system based on deep neural network |
| CN112365440A (en) * | 2020-10-10 | 2021-02-12 | 天津大学 | Yoov 3-based mastopathy image mitosis detection device and method |
| CN113436158A (en) * | 2021-06-21 | 2021-09-24 | 遂宁市中心医院 | Liver lump auxiliary identification method based on deep learning |
| CN114171197A (en) * | 2021-11-12 | 2022-03-11 | 东莞市人民医院 | A method for predicting HER2 status of breast cancer and related equipment |
| CN114170183A (en) * | 2021-12-07 | 2022-03-11 | 南昌大学 | A method and system for predicting chemotherapy sensitivity of ovarian cancer patients based on pathological images |
| CN114463455A (en) * | 2021-12-21 | 2022-05-10 | 深圳先进技术研究院 | Expert knowledge constraint deep learning method for brain glioma classification |
| WO2023115333A1 (en) * | 2021-12-21 | 2023-06-29 | 深圳先进技术研究院 | Expert knowledge constraint-based deep learning method for brain glioma grading |
| CN114463455B (en) * | 2021-12-21 | 2025-09-05 | 深圳先进技术研究院 | Expert knowledge-constrained deep learning method for glioma grading |
| CN114360718A (en) * | 2022-03-10 | 2022-04-15 | 之江实验室 | Feature fitting-based PET/CT automatic lung cancer diagnosis and classification system and construction method |
| CN114360718B (en) * | 2022-03-10 | 2022-07-12 | 之江实验室 | Feature fitting-based PET/CT automatic lung cancer diagnosis and classification system and construction method |
| CN114511564A (en) * | 2022-04-19 | 2022-05-17 | 天津市肿瘤医院(天津医科大学肿瘤医院) | Image analysis method for breast cancer residual tumor load based on DCE-MRI |
| CN114783517A (en) * | 2022-05-13 | 2022-07-22 | 中国医科大学 | Prediction of RAS gene status of CRLM patients based on imagery omics and semantic features |
| CN114783517B (en) * | 2022-05-13 | 2025-08-22 | 中国医科大学 | Prediction of RAS gene status in CRLM patients based on radiomics and semantic features |
| CN116883995A (en) * | 2023-07-07 | 2023-10-13 | 广东食品药品职业学院 | A system for identifying molecular subtypes of breast cancer |
Also Published As
| Publication number | Publication date |
|---|---|
| CN108898160B (en) | 2022-04-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN108898160B (en) | Breast cancer histopathological grading method based on fusion of CNN and radiomics features | |
| Shah et al. | A robust approach for brain tumor detection in magnetic resonance images using finetuned efficientnet | |
| Mohamed et al. | A deep learning method for classifying mammographic breast density categories | |
| Kallenberg et al. | Unsupervised deep learning applied to breast density segmentation and mammographic risk scoring | |
| Zhang et al. | NODULe: Combining constrained multi-scale LoG filters with densely dilated 3D deep convolutional neural network for pulmonary nodule detection | |
| CN104751178B (en) | Lung neoplasm detection means and method based on shape template matching combining classification device | |
| Ye et al. | Medical image diagnosis of prostate tumor based on PSP-Net+ VGG16 deep learning network | |
| CN107507197B (en) | A method for extracting lung parenchyma based on clustering algorithm and convolutional neural network | |
| CN111767952B (en) | Interpretable lung nodule benign and malignant classification method | |
| Li et al. | Transfer learning based classification of cervical cancer immunohistochemistry images | |
| CN106096654A (en) | A kind of cell atypia automatic grading method tactful based on degree of depth study and combination | |
| Soleymanifard et al. | Multi-stage glioma segmentation for tumour grade classification based on multiscale fuzzy C-means | |
| CN113658151B (en) | Breast lesion magnetic resonance image classification method, equipment and readable storage medium | |
| CN116188423A (en) | Superpixel sparse unmixing detection method based on hyperspectral images of pathological slices | |
| CN113269799A (en) | A cervical cell segmentation method based on deep learning | |
| Song et al. | Segmenting overlapping cervical cell in pap smear images | |
| CN108765427A (en) | A kind of prostate image partition method | |
| Cao et al. | An automatic breast cancer grading method in histopathological images based on pixel-, object-, and semantic-level features | |
| Mokni et al. | CADNet157 model: fine-tuned ResNet152 model for breast cancer diagnosis from mammography images | |
| CN116542937A (en) | Method for judging lung tumor infiltration based on clinical medicine, deep learning neural network and image histology | |
| CN115147600A (en) | GBM Multimodal MR Image Segmentation Method Based on Classifier Weight Converter | |
| Yari et al. | Accuracy improvement in binary and multi-class classification of breast histopathology images | |
| Kriti et al. | A Review of Segmentation Algorithms Applied to B-Mode Breast Ultrasound Images: A Characterization Approach: Kriti et al. | |
| Jaffar et al. | Ensemble classification of pulmonary nodules using gradient intensity feature descriptor and differential evolution | |
| Jose et al. | Liver tumor classification using optimal opposition-based grey wolf optimization |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |