Summary of the invention
The purpose of the present invention is to provide a kind of systems for carrying out parting to the patient with colorectal cancer, use
System of the invention carries out parting to the patient with colorectal cancer, is the trouble with high risk of recurrence by the patient classification
The patient of person and low risk of recurrence, the patient subgroups of high risk of recurrence are the patients that should be treated after surgery.
To achieve the above object, the technical solution taken: a kind of for dividing the patient with colorectal cancer
The system of type, including
Data input module, the first biomarker in the tissue samples of the patient for colorectal cancer will to be suffered from
Rna expression level with 5 years in have described in the tissue samples of the patient of palindromia and low cell cycle events the first biology
The tissue samples of first similarity value and the patient with colorectal cancer between the rna expression level of marker
Described in the first biomarker rna expression level with 5 years in first described in the tissue samples of patient that do not recur of disease
Second similarity value input model computing module between the rna expression level of biomarker, first biomarker
Including at least three gene listed in table 1;For biological by the tissue samples of the patient with colorectal cancer second
The rna expression level of marker with 5 years in have institute in the tissue samples of the patient of palindromia and non-low cell cycle events
State the third similarity value between the rna expression level of the second biomarker and the patient with colorectal cancer
Tissue samples described in the second biomarker rna expression level with 5 years in the tissue samples of patient that do not recur of disease
Described in the second biomarker rna expression level between the 4th similarity input model computing module, described second
Biomarker includes at least three gene listed in table 2;
Model computation module, including parting model, for similar according to first similarity value, second similarity value, third
Property value, the 4th similarity and parting model calculate the patient with colorectal cancer the first parting score value and second point
Type score value;The first parting score value=first similarity value-second similarity value, the second parting score value=the
Three the-the four similarities of similarity;
As a result output module, for being obtained according to the first parting score value and the second parting of the patient with colorectal cancer
Score value determines patient with poor prognosis or has good prognosis, when the first parting score of the patient with colorectal cancer
Value >=first similarity threshold value, or/and with colorectal cancer patient the second parting score value >=second similarity threshold value
When, patient classification is high risk of recurrence, has poor prognosis;As the first parting score value < of the patient with colorectal cancer
First similarity threshold value, and when the second parting score value < second similarity threshold value of the patient with colorectal cancer, patient
It is classified as low risk of recurrence, there is good prognosis.
Preferably, first similarity threshold value=0.155, second similarity threshold value=0.076.
The present invention provides a kind of methods for carrying out parting to the patient with colorectal cancer, including following step
It is rapid:
(1) the RNA sample extracted from the tumor sample of the patient is provided, the tumor sample includes that colorectal cancer is thin
Born of the same parents;
Determine that the rna expression of biomarker in the RNA sample is horizontal, wherein the biomarker includes in table 1
At least three gene listed;
Determine the rna expression level of the biomarker with 5 years in have palindromia and low cell cycle events
Similarity a between the rna expression level of the biomarker of patient;
It determines the rna expression level of the biomarker and the biology of the patient of palindromia is not marked in 5 years
Similarity b between the rna expression level of will object;
Determine the difference e of above-mentioned similarity a and similarity b;
(2) determine that the rna expression of biomarker in the RNA sample is horizontal, wherein the biomarker includes table
At least three gene listed in 2;
It determines the rna expression level of the biomarker and has palindromia and non-low cell cycle events in 5 years
Patient the biomarker rna expression level between similarity c;
It determines the rna expression level of the biomarker and the biology of the patient of palindromia is not marked in 5 years
Similarity d between the rna expression level of will object;
Determine the difference f of above-mentioned similarity c and similarity d;
(3) if the difference e is higher than similarity threshold or/and the difference f higher than similarity threshold, by the trouble
Person is classified as with poor prognosis;If the difference e is lower than the similarity threshold, and the difference f is lower than the phase
It is with good prognosis by the patient classification like property threshold value.
The present invention provides a kind of systems for carrying out parting to the patient with colorectal cancer, including
Data input module, third biomarker in the tissue samples of the patient for colorectal cancer will to be suffered from
Rna expression level with 5 years in have third described in the tissue samples of the patient of palindromia and low cell cycle events biological
The tissue samples of the 5th similarity and the patient with colorectal cancer between the rna expression level of marker
Described in third biomarker rna expression level with 5 years in third described in the tissue samples of patient that do not recur of disease
The 6th similarity input model computing module between the rna expression level of biomarker, the third biomarker
Including at least three gene listed in table 3;For biological by the in the tissue samples of the patient with colorectal cancer the 4th
The rna expression level of marker with 5 years in have institute in the tissue samples of the patient of palindromia and non-low cell cycle events
State the 7th similarity between the rna expression level of the 4th biomarker and the patient with colorectal cancer
Tissue samples described in the 4th biomarker rna expression level with 5 years in the tissue samples of patient that do not recur of disease
Described in the 4th biomarker rna expression level between the 8th similarity input model computing module, the described 4th
Biomarker includes at least three gene listed in table 4;
Model computation module, including parting model, for according to the 5th similarity, the 6th similarity, the 7th similar
Property value, the 8th similarity and parting model calculate the patient with colorectal cancer third parting score value and the 4th point
Type score value;The third parting score value=the 5th the-the six similarity of similarity, tetrad segregation types score value=the
Seven the-the eight similarities of similarity;
As a result output module, for being obtained according to the third parting score value and tetrad segregation types of the patient with colorectal cancer
Score value determines patient with poor prognosis or has good prognosis, when the third parting score of the patient with colorectal cancer
Value >=third similarity threshold value, or/and with colorectal cancer patient tetrad segregation types score value >=the 4th similarity threshold
When, patient classification is high risk of recurrence, has poor prognosis;As the third parting score value < of the patient with colorectal cancer
Third similarity threshold value, and when tetrad segregation types score value four similarity threshold of < of the patient with colorectal cancer, patient
It is classified as low risk of recurrence, there is good prognosis.
Preferably, third similarity threshold value=0.198, the 4th similarity threshold=- 0.003.
The present invention provides a kind of methods for carrying out parting to the patient with colorectal cancer, using stablizing cell
Core gene spectrum, comprising the following steps:
(1) the RNA sample extracted from the tumor sample of the patient is provided, the tumor sample includes that colorectal cancer is thin
Born of the same parents;
Determine that the rna expression of biomarker in the RNA sample is horizontal;Wherein, the biomarker includes in table 3
At least three gene listed;
Determine the rna expression level of the biomarker with 5 years in have palindromia and low cell cycle events
Similarity a between the rna expression level of the biomarker of patient;
It determines the rna expression level of the biomarker and the biology of the patient of palindromia is not marked in 5 years
Similarity b between the rna expression level of will object;
Determine the difference e of above-mentioned similarity a and similarity b;
(2) determine that the rna expression of biomarker in the RNA sample is horizontal;Wherein, the biomarker includes table
At least three gene listed in 4;
It determines the rna expression level of the biomarker and has palindromia and non-low cell cycle events in 5 years
Patient the biomarker rna expression level between similarity c;
It determines the rna expression level of the biomarker and the biology of the patient of palindromia is not marked in 5 years
Similarity d between the rna expression level of will object;
Determine the difference f of above-mentioned similarity c and similarity d;
(3) if the difference e is higher than similarity threshold or/and the difference f higher than similarity threshold, by the trouble
Person is classified as with poor prognosis;If the difference e is lower than the similarity threshold, and the difference f is lower than the phase
It is with good prognosis by the patient classification like property threshold value.
Preferably, the colorectal cancer includes the TNM II phase cancer according to TNM stage system.
Preferably, the tissue samples include the expression of colorectal cancer cell or the nucleic acid from colorectal cancer cell
The clinically relevant sample of product.
Preferably, the measuring method of the rna expression level includes blot hybridization, quantitative PCR, RNAseq sequencing and micro- battle array
Column analysis;Measure the first similarity value, second similarity value, third similarity value, the 4th similarity, the 5th similitude
Value, the 6th similarity, the 7th similarity or the 8th similarity method include but be not limited to Euclidean distance, it is graceful
Hatton's distance, Chebyshev's distance, Minkowski Distance standardize Euclidean distance, mahalanobis distance, included angle cosine, Hamming distance
From, Jie Kade distance, related coefficient or comentropy.Preferably, the first similarity value, second similarity value, are measured
Three similarities, the 4th similarity, the 5th similarity, the 6th similarity, the 7th similarity or the 8th similitude
The method of value is Pearson correlation coefficient.
The method of the system deviation at least three genes that normalization table 1 is listed includes but is not limited to, gene
CPM method is sequenced in chip fRMA method, genetic chip RMA method, RNAseq, and FPKM method is sequenced in RNAseq.
The method of the system deviation at least three genes that normalization table 2 is listed includes but is not limited to, gene
CPM method is sequenced in chip fRMA method, genetic chip RMA method, RNAseq, and FPKM method is sequenced in RNAseq.
The method of the system deviation at least three genes that normalization table 3 is listed includes but is not limited to, gene
CPM method is sequenced in chip fRMA method, genetic chip RMA method, RNAseq, and FPKM method is sequenced in RNAseq.
The method of the system deviation at least three genes that normalization table 4 is listed includes but is not limited to, gene
CPM method is sequenced in chip fRMA method, genetic chip RMA method, RNAseq, and FPKM method is sequenced in RNAseq.
Preferably, having the patient of palindromia and low cell cycle events in described 5 years is to have disease in follow-up 5 years
It recurs and the patient of low cell cycle events, the patient that disease does not recur in described 5 years is that disease does not recur in follow-up 5 years
Patient.
Threshold value is can to distinguish the RNA sample of the patient with high cancer relapse risk and with low cancer relapse risk
Any number of the RNA sample of patient.
Preferably, first biomarker includes at least four gene listed in table 1;It is highly preferred that described first
Biomarker includes at least five gene listed in table 1;It is highly preferred that first biomarker includes listing in table 1
At least six gene;It is highly preferred that first biomarker includes at least seven gene listed in table 1;It is highly preferred that
First biomarker includes at least eight gene listed in table 1;It is highly preferred that first biomarker includes
At least nine gene listed in table 1;It is highly preferred that first biomarker includes at least ten base listed in table 1
Cause;It is highly preferred that first biomarker includes at least 15 genes listed in table 1;It is highly preferred that described first is raw
Object marker includes at least 20 genes listed in table 1;It is highly preferred that first biomarker includes listing in table 1
At least 25 genes;It is highly preferred that first biomarker includes at least 30 genes listed in table 1;More preferably
Ground, first biomarker include at least 40 genes listed in table 1;It is highly preferred that first biomarker
Including at least 50 genes listed in table 1;It is highly preferred that first biomarker includes at least 60 listed in table 1
A gene;It is highly preferred that first biomarker includes at least 70 genes listed in table 1;It is highly preferred that described
One biomarker includes at least 80 genes listed in table 1;It is highly preferred that first biomarker includes in table 1
At least 90 genes listed;It is highly preferred that first biomarker includes at least 100 genes listed in table 1;More
Preferably, first biomarker includes at least 150 genes listed in table 1;It is highly preferred that the first biology mark
Will object includes at least 200 genes listed in table 1;Most preferably, first biomarker includes the institute listed in table 1
There is gene;
Second biomarker includes at least four gene listed in table 2;It is highly preferred that the second biology mark
Will object includes at least five gene listed in table 2;It is highly preferred that second biomarker includes listing at least in table 2
6 genes;It is highly preferred that second biomarker includes at least seven gene listed in table 2;It is highly preferred that described
Two biomarkers include at least eight gene listed in table 2;It is highly preferred that second biomarker includes arranging in table 2
At least nine gene out;It is highly preferred that second biomarker includes at least ten gene listed in table 2;More preferably
Ground, second biomarker include at least 15 genes listed in table 2;It is highly preferred that second biomarker
Including at least 20 genes listed in table 2;It is highly preferred that second biomarker includes at least 25 listed in table 2
A gene;It is highly preferred that second biomarker includes at least 30 genes listed in table 2;It is highly preferred that described
Two biomarkers include at least 40 genes listed in table 2;It is highly preferred that second biomarker includes in table 2
At least 50 genes listed;It is highly preferred that second biomarker includes at least 60 genes listed in table 2;More
Preferably, second biomarker includes at least 70 genes listed in table 2;It is highly preferred that the second biology mark
Will object includes at least 80 genes listed in table 2;It is highly preferred that second biomarker include listed in table 2 to
Few 90 genes;Most preferably, second biomarker includes all genes listed in table 2.
Preferably, the third biomarker includes at least four gene listed in table 3;It is highly preferred that the third
Biomarker includes at least five gene listed in table 3;It is highly preferred that the third biomarker includes listing in table 3
At least six gene;It is highly preferred that the third biomarker includes at least seven gene listed in table 3;It is highly preferred that
The third biomarker includes at least eight gene listed in table 3;It is highly preferred that the third biomarker includes
At least nine gene listed in table 3;It is highly preferred that the third biomarker includes at least ten base listed in table 3
Cause;It is highly preferred that the third biomarker includes at least 15 genes listed in table 3;It is highly preferred that the third is raw
Object marker includes at least 20 genes listed in table 3;It is highly preferred that the third biomarker includes listing in table 3
At least 25 genes;It is highly preferred that the third biomarker includes at least 30 genes listed in table 3;More preferably
Ground, the third biomarker include at least 40 genes listed in table 3;It is highly preferred that the third biomarker
Including at least 50 genes listed in table 3;It is highly preferred that the third biomarker includes at least 60 listed in table 3
A gene;It is highly preferred that the third biomarker includes at least 70 genes listed in table 3;It is highly preferred that described
Three biomarkers include at least 80 genes listed in table 3;It is highly preferred that the third biomarker includes in table 3
At least 90 genes listed;It is highly preferred that the third biomarker includes at least 100 genes listed in table 3;Most
Preferably, the third biomarker includes all genes listed in table 3;
4th biomarker includes at least four gene listed in table 4;It is highly preferred that the 4th biology mark
Will object includes at least five gene listed in table 4;It is highly preferred that the 4th biomarker includes listing at least in table 4
6 genes;It is highly preferred that the 4th biomarker includes at least seven gene listed in table 4;It is highly preferred that described
Four biomarkers include at least eight gene listed in table 4;It is highly preferred that the 4th biomarker includes arranging in table 4
At least nine gene out;It is highly preferred that the 4th biomarker includes at least ten gene listed in table 4;More preferably
Ground, the 4th biomarker include at least 15 genes listed in table 4;It is highly preferred that the 4th biomarker
Including at least 20 genes listed in table 4;It is highly preferred that the 4th biomarker includes at least 25 listed in table 4
A gene;It is highly preferred that the 4th biomarker includes at least 30 genes listed in table 4;Most preferably, described
Four biomarkers include all genes listed in table 4.
Preferably, increase the stability of final parting score value using model superposition.Tables 1 and 2 gene is according to gene function
8 groups can be divided into: (1) cell division gene (1 gene function of table, cell cycle), (2) DNA-repair gene (1 gene function of table
Can, dnarepair), (3) epithelium mesenchyma transformed gene (1 gene function of table, emt), (4) cell metastatic gene (1 gene of table
Function, movement), (5) T cell has correlation gene (1 gene function of table, tcell), and (6) have recurrence and low cell cycle events
Most significant preceding 60 genes (1 gene function of table, top) are counted, (7) Wnt Information Conduction (2 gene function of table, wnt) (8) has
Recurrence and non-low cell cycle cycle events count most significant preceding 60 genes (2 gene function of table, top).This 8 groups of genes,
Every group of gene is all used alone nearest mass center classification method and calculates separately parting score value, finally uses K- neighbour homing method
(Hechenbichler and Schliep, 2004) this 8 parting score value of Model Fusion become 2 final parting score value.
The method of model superposition includes but is not limited to, K- neighbour's Return Law, nearest mass center classification method, neural network.For each
Sample, if the final parting score value of the 1st of sample is more than that first similarity threshold value or the 2nd final parting score value are super
Second similarity threshold value is crossed, is considered as high risk of recurrence sample, otherwise is low risk of recurrence sample.
Present invention provides the methods to patient's designated treatment with colorectal cancer, including system according to the present invention
The patient classification is the patient of patient and low risk of recurrence with high risk of recurrence by system, if the patient is classified as
With high risk of recurrence, receive adjuvant treatment.Chemotherapy compound include but be not limited to oxaliplatin, tetrahydrofolate formate,
5FU, capecitabine replace anti-, 1 inhibitor of topoisomerase according to vertical.Antybody therapy includes but is not limited to bevacizumab, western appropriate
Former times monoclonal antibody, PD-1 inhibitor Ou Diwo, TGFBeta acceptor inhibitor.
The present invention provides the RNA at least three genes listed at least three genes listed in table 1, detection table 1
The RNA table at least three genes listed at least three genes or detection table 2 listed in the reagent of expression, table 2
Up to horizontal reagent in preparation for the application in the reagent or kit to patient's progress parting with colorectal cancer.
The present invention provides the combination of at least three genes listed in table 1 and at least three genes listed in table 2 or
At least three genes listed in the reagent and detection table 2 of the rna expression level at least three genes listed in person's detection table 1
Rna expression level reagent combination preparation for colorectal cancer patient carry out parting reagent or reagent
Application in box.
The present invention provides the RNA at least three genes listed at least three genes listed in table 3, detection table 3
The RNA table at least three genes listed at least three genes or detection table 4 listed in the reagent of expression, table 4
Up to horizontal reagent in preparation for the application in the reagent or kit to patient's progress parting with colorectal cancer.
The present invention provides the combination of at least three genes listed in table 3 and at least three genes listed in table 4 or
At least three genes listed in the reagent and detection table 4 of the rna expression level at least three genes listed in person's detection table 3
Rna expression level reagent combination preparation for colorectal cancer patient carry out parting reagent or reagent
Application in box.
The utility model has the advantages that
The present invention provides a kind of systems for carrying out parting to the patient with colorectal cancer, are using the present invention
It unites and parting is carried out to the patient with colorectal cancer can show that significant performance, parting is high risk of recurrence type and low multiple
There is dramatically different recurrence-free survival rate in hair state of risk 5 years, and genotyping result is better than medical handbook recommendation used at present
Risk assessment clinical parameter.