CN106169020A - Data processing method and tumor companion diagnosis system based on genotyping - Google Patents
Data processing method and tumor companion diagnosis system based on genotyping Download PDFInfo
- Publication number
- CN106169020A CN106169020A CN201610480400.6A CN201610480400A CN106169020A CN 106169020 A CN106169020 A CN 106169020A CN 201610480400 A CN201610480400 A CN 201610480400A CN 106169020 A CN106169020 A CN 106169020A
- Authority
- CN
- China
- Prior art keywords
- data
- tumor
- index file
- diagnosis
- processing method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 73
- 238000003745 diagnosis Methods 0.000 title claims abstract description 38
- 238000003672 processing method Methods 0.000 title claims abstract description 38
- 238000003205 genotyping method Methods 0.000 title abstract 3
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 72
- 238000012163 sequencing technique Methods 0.000 claims abstract description 30
- 230000005540 biological transmission Effects 0.000 claims abstract description 22
- 238000001514 detection method Methods 0.000 claims abstract description 14
- 239000003814 drug Substances 0.000 claims abstract description 11
- 230000011218 segmentation Effects 0.000 claims abstract description 9
- 230000006835 compression Effects 0.000 claims abstract description 7
- 238000007906 compression Methods 0.000 claims abstract description 7
- 229940079593 drug Drugs 0.000 claims abstract description 7
- 201000011510 cancer Diseases 0.000 claims description 6
- 238000012856 packing Methods 0.000 claims description 6
- 230000008034 disappearance Effects 0.000 claims description 5
- 210000000352 storage cell Anatomy 0.000 claims description 5
- 210000003677 hemocyte Anatomy 0.000 claims description 4
- 229940000351 hemocyte Drugs 0.000 claims description 4
- 210000002381 plasma Anatomy 0.000 claims description 4
- 210000003296 saliva Anatomy 0.000 claims description 4
- 210000002700 urine Anatomy 0.000 claims description 4
- 108700020796 Oncogene Proteins 0.000 claims description 3
- 230000035772 mutation Effects 0.000 abstract description 12
- 238000003860 storage Methods 0.000 abstract description 8
- 238000005516 engineering process Methods 0.000 abstract description 7
- 238000004422 calculation algorithm Methods 0.000 abstract description 2
- 230000006837 decompression Effects 0.000 abstract description 2
- 238000007481 next generation sequencing Methods 0.000 abstract 2
- 238000010276 construction Methods 0.000 abstract 1
- 238000000034 method Methods 0.000 description 25
- 230000008569 process Effects 0.000 description 16
- 238000010586 diagram Methods 0.000 description 14
- 230000000875 corresponding effect Effects 0.000 description 8
- 238000001712 DNA sequencing Methods 0.000 description 7
- 230000008859 change Effects 0.000 description 6
- 238000004590 computer program Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 238000005192 partition Methods 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 241000208340 Araliaceae Species 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 102000006479 Heterogeneous-Nuclear Ribonucleoproteins Human genes 0.000 description 1
- 108010019372 Heterogeneous-Nuclear Ribonucleoproteins Proteins 0.000 description 1
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 235000008434 ginseng Nutrition 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Public Health (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
The invention provides a data processing method and a tumor companion diagnosis system based on genotyping. The data processing method comprises the steps of firstly, aiming at a compression and decompression algorithm of next generation sequencing data; segmentation, index construction and online high-fidelity transmission of sequencing data; thirdly, the sequencing data and the related diagnosis and treatment data are stored separately; fourthly, mutation detection based on next generation sequencing data; fifthly, annotation and personalized medication suggestion of mutation data; sixthly, the diagnosis is accompanied by the evolution progress of the tumor. A data transmission and storage unit of the tumor accompanying diagnosis system based on genotyping transmits and stores gene sequence data and diagnosis data by using the data processing method; and the tumor accompanying diagnosis unit judges the tumor evolution stage according to the gene sequence data and the diagnosis data. The system combines the internet technology, integrates mutation related information and diagnosis and treatment information of doctors, provides a tumor accompanying diagnosis reading report, and effectively interacts with system users.
Description
Technical field
The present invention relates to field of computer technology, be specifically related to a kind of data processing method and tumor based on gene type
With diagnostic system.
Background technology
Through the joint efforts of six state scientist several years, and the input of 3,000,000,000 dollars, the mankind obtained relatively in calendar year 2001
For complete human genome sketch.And the developing rapidly of sequencing technologies of future generation so that 1000 dollars, fortnight, complete
The gene order-checking of one people is possibly realized.2010, complete thousand human genome plans.
Human genome excessively bulky complex, current data understand the artificial participation needing professional.Meanwhile, based under
The sequence data that generation sequencing technologies obtains, data volume is bigger;The medical information of doctor, data are relative complex;Order-checking department, sequence
Column information understands department, medical science understands department, medical department etc. relatively far apart.
Sequencing data, data volume is the biggest, the most also to distinguish and come from who, the information such as which kind of tissue samples.And cure
Treat diagnosis and treatment information, relatively complicated, have that format differences is big, to relate to the factor more.Meanwhile, sequencing data and Medical treatment
The problem that information exists asynchronous transmission.It is big that such data have data volume, relates to the features such as individual privacy.Therefore in storage,
Except the stability of data to be ensured, the follow-up deciphering of data to be ensured and confidentiality.
Online efficient, the high-fidelity transfer that how to realize data become problem demanding prompt solution.
Summary of the invention
The technical problem to be solved is: how to provide a kind of data processing method to realize the high-fidelity of data
Transmission.
For solving above-mentioned technical problem, one aspect of the present invention proposes a kind of data processing method, this data processing method
Including:
Transmitting terminal is to transmission data carry out segmentation compression packing, and builds the index file of packet;
Described index file is sent to receiving terminal by transmitting terminal;
After receiving terminal receives described index file, then send index file reception successful information to terminate rope to transmitting terminal
Draw the transmission of bag, and initial transmitting terminal packet sends;
Transmitting terminal sends packet to described receiving terminal;
Receiving terminal is according to the integrity of described index file detection received data bag;
If receiving terminal receives complete packet, then carry out described packet decompressing integrating.
Alternatively, receiving terminal includes to transmitting terminal transmission index file reception successful information:
The receiving terminal index file to receiving carries out integrity detection, if receiving complete index file, then to sending out
Sending end sends index file and receives successful information.
Alternatively, described index file is sent to receiving terminal by described transmitting terminal, including:
Described index file is sent to receiving terminal by described transmitting terminal with pattern the most repeatedly.
Alternatively, this data processing method also includes:
If the packet that receiving terminal receives is imperfect, then building feedback index file, described feedback index file includes
The information of disappearance file.
Alternatively, this data processing method also includes:
Transmitting terminal sends packet according to feedback index file to receiving terminal.
Alternatively, if it is complete to receive data, then sequencing data of future generation is carried out decompression and contracts integration.
Alternatively, this data processing method also includes: be encrypted different types of packet, and preserves to different
Memory node.
On the other hand, the invention allows for a kind of tumor based on gene type with diagnostic system, should be based on gene
The tumor of typing includes with diagnostic system:
Data transport storage cell, is used for utilizing above-mentioned data processing method to transmit and store gene sequence data and diagnosis
Data;
Tumor, with diagnosis unit, judges swollen for the abrupt information according to described acquisition, annotation information and diagnosis data
The tumor evolutionary phase.
Alternatively, described gene sequence data includes: oncogene sequencing data, cancer beside organism's gene sequencing data, urine
Liquid-based is because of sequencing data, saliva gene sequencing data, hemocyte gene sequencing data and blood plasma gene sequencing data.
Alternatively, described tumor is additionally operable to provide medication according to described tumor evolutionary phase and genotype with diagnosis unit
Advisory information.
Alternatively, should tumor based on gene type also include with diagnostic system:
Diagnostic result administrative unit, for obtaining diagnostic result according to the described tumor evolutionary phase, and stores described diagnosis
Result.
The data processing method of present invention offer and tumor based on gene type are with diagnostic system.This data process side
Method achieves the transmission of online data high efficiency, high-fidelity, and storage system based on protection medical information privacy.Based on base
Because the tumor of typing utilizes above-mentioned data processing method transmit and store gene with the data transport storage cell of diagnostic system
Sequence data and diagnosis data;According to described gene sequence data and diagnosis data, tumor judges that tumor develops with diagnosis unit
Stage.This system combines Internet technology, comprehensive sudden change relevant information and the medical information of doctor, provides tumor and solves with diagnosis
Read the newspaper announcement, and carry out effective interaction with system user.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing
In having technology to describe, the required accompanying drawing used is briefly described, it should be apparent that, the accompanying drawing in describing below is the present invention
Some embodiments, for those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to according to
These accompanying drawings obtain other accompanying drawing.
Fig. 1 is the schematic flow sheet of the data processing method of one embodiment of the invention;
Fig. 2 is the transmission principle schematic diagram of the data processing method of another embodiment of the present invention;
Fig. 3 is the preservation principle schematic of the data processing method of another embodiment of the present invention;
Fig. 4 is the tumor based on the gene type structural representation with diagnostic system of one embodiment of the invention;
Fig. 5 is that the tumor based on gene type of one embodiment of the invention solves automatically with the sequence data of diagnostic system
The schematic diagram read;
Fig. 6 is that the tumor based on gene type of one embodiment of the invention annotates with the gene test of diagnostic system
Schematic diagram;
Fig. 7 is that the tumor based on gene type of one embodiment of the invention exports single with the diagnosis report of diagnostic system
The schematic diagram of unit.
Detailed description of the invention
For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with the embodiment of the present invention
In accompanying drawing, the technical scheme in the embodiment of the present invention is carried out clear, complete description, it is clear that described embodiment is
The a part of embodiment of the present invention rather than whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art
The every other embodiment obtained under not making creative work premise, broadly falls into the scope of protection of the invention.
Fig. 1 is the schematic flow sheet of the data processing method of one embodiment of the invention.As it is shown in figure 1, the present embodiment
Data processing method includes:
Data to be transmitted is carried out segmentation compression packing by S11: transmitting terminal, and builds the index file of packet;
Described index file is sent to receiving terminal by S12: transmitting terminal;
After S13: receiving terminal receives described index file, then send index file reception successful information with end to transmitting terminal
The only transmission of index bag, and initial transmitting terminal packet sends;
S14: transmitting terminal sends packet to described receiving terminal;
S15: receiving terminal is according to the integrity of described index file detection received data bag;
S16: if receiving terminal receives complete packet, then carry out described packet decompressing integrating.
The data processing method of the present embodiment, data to be transmitted is carried out segmentation packing, and builds packet by transmitting terminal
Index file;Transmitting terminal sends packet to described receiving terminal;Receiving terminal detects received data according to described index file
The integrity of bag;If receiving terminal receives complete packet, then described packet is integrated, it is achieved that data online
Efficiently, high-fidelity transfer.
In the optional embodiment of one, receiving terminal sends index file reception successful information to transmitting terminal and includes:
The receiving terminal index file to receiving carries out integrity detection, if receiving complete index file, then to sending out
Sending end sends index file and receives successful information.The integrity detection of the receiving terminal index file to receiving can ensure that reception
Termination harvests whole index file.
Further, for making receiving terminal receive complete index file, described index file is sent by described transmitting terminal
To receiving terminal, including:
Described index file is sent to receiving terminal by described transmitting terminal with pattern the most repeatedly.
This data processing method also includes:
If the packet that receiving terminal receives is imperfect, then building feedback index file, described feedback index file includes
The information of disappearance file.Transmitting terminal sends packet according to feedback index file to receiving terminal.
Receiving terminal, by sending feedback index file to transmitting terminal, can make transmitting terminal according to feedback index file to receiving terminal
Send corresponding packet, to ensure the high-fidelity transfer of online data.
In order to ensure that the safety of data stores, this data processing method also includes: add different types of packet
Close, and preserve to different memory nodes.
Fig. 2 is the transmission principle schematic diagram of the data processing method of another embodiment of the present invention.As in figure 2 it is shown, this reality
The transmission of the data processing method executing example comprises the steps:
(1) data, according to the usefulness of network transmission, are carried out segmentation compression packing by assessment initial data size, and will
The message identifications such as ID, sample type, client ip address are to corresponding packet;
(2) gene sequencing data, data volume is the biggest, is usually and exports with " fastq " form, including read sequence title,
Read sequence base " ATCG " and the quality score of corresponding base;The character types limited amount that reading sequence and quality score relate to, and read
Sequence title typically has certain rule.Therefore, during segmentation compression, differentiation is read sequence title, reads sequence base and read sequence quality, adopt
With different compact models and algorithm, pack, reduce volume of transmitted data;
(2) for the information of segmentation, index building file;
(3) index file will use timing repeatedly sending mode, it is ensured that receiving node is able to receive that complete index literary composition
Part;
(4), after reception service end receives index file, index file will be carried out integrity detection, if complete, to client
Hold to send and receive signal, to terminate client continuation transmission index file, and the packet transmission that initial is follow-up;If index literary composition
Part receives and has terminated, and index file is imperfect, then delete this index file, waits new index file to be received;
(5), after transmitting terminal determines the feedback information of receiving terminal, initial, one by one by the transmission of partition data bag, uses disposable
Send information pattern;
(6) receiving terminal is after the reception terminating packet, according to the information of index file, to the integrity receiving packet
Detect;If complete, then the information of all of partition data bag is integrated, recover to be reduced into the original number of client
According to;
(7) if imperfect, then build feedback index bag, the information of disappearance file used fixed time interval pattern repeatedly,
Index is sent to client;
(8), after client receives index, receiving terminal is sent a signal to terminate the transmission of index bag;Then, for lacking
Lose packet, the index package informatin sent based on receiving terminal, missing data bag is averaged and is divided into two or more numbers
Use two way classification according to the pattern of bag, carry out new cutting, and index building file, carry out new transmission and receive circulation.
For completing the transmission of Medical treatment information, the present invention provides expansible web page form pattern, based on the unique ID of user,
The pattern of artificial defeated record, and original document is submitted to the pattern of adnexa.
Fig. 3 is the preservation principle schematic of the data processing method of another embodiment of the present invention.As it is shown on figure 3, this reality
The data processing method executing example uses partial node the most uniquely to name, retain the memory module of backup, and data file name annotates
Form independently stores with data, to ensure the stability of data, the easily property understood and confidentiality.To obtain index file and
Recovering the original document of reduction, the storage first carrying out data processes.Sequence data, goes to special sequence memory node, and
Building corresponding backup memory node, file designation uses stochastic generation and unique pattern;All kinds of medical informations, Ge Renyin
The data such as private, store, other relevant informations in the encrypted mode, then build file storage relevant for the unique ID of patient;Sequence literary composition
Part name information, Medical treatment file name information, and other effective informations of correspondence, will be automatically stored to the data base of correspondence
Among form, database table information, will back up the most completely.Medical diagnostic information, sequence information, database table are believed
Breath, and follow-up relevant destination file, use different memory nodes to store, to ensure that the separation of associated data maintains secrecy.
The data processing method of the present embodiment sequencing data of future generation to receiving, existing has delivered document by utilizing
Software, for the particularity of sequencing data based on queue, adjust parameter, to carry out abrupt climatic change, such as mononucleotide difference
(SNV), little insertion and deletion (indel), gene fusion (fusion), gene structure change (CNV);And the monokaryon in chemotherapy site
Nucleotide polymorphism (SNP) detects.For said mutation information, carry out annotation of gene function, and the personalized medicine of correspondence is built
View annotation.Change based on abrupt information, calculates tumor load and changes.Fig. 4 is dividing based on gene of one embodiment of the invention
The tumor of type is with the structural representation of diagnostic system.As shown in Figure 4, the tumor based on gene type of the present embodiment is with examining
Disconnected system includes: data transport storage cell 41 and tumor are with diagnosis unit 42;Specifically:
Data transport storage cell 41, is used for utilizing above-mentioned data processing method transmit and store gene sequence data and examine
Disconnected data;
Tumor is with diagnosis unit 42, for judging tumor evolution rank according to described gene sequence data and diagnosis data
Section.
The tumor based on gene type of the present embodiment is with diagnostic system, by utilizing above-mentioned data processing method to pass
Defeated and store gene sequence data and diagnosis data, it is achieved that gene sequence data and diagnosis data online fidelity transmission, and
Utilize tumor to judge the tumor evolutionary phase automatically and accurately with diagnosis unit, improve accuracy and the speed of diagnosis.
Specifically, described gene sequence data includes: oncogene sequencing data, cancer beside organism's gene sequencing data, urine
Liquid-based is because of sequencing data, saliva gene sequencing data, hemocyte gene sequencing data and blood plasma gene sequencing data.
The diagnosing tumors such as in actual applications, user is uploading data terminal, the image provide sequencing data and doctor
Information etc. are transferred to data receiving terminal.Sequencing data is that capture probe based on tumor-related gene cohort design is enriched to
Tumour DNA sequencing data, cancer beside organism's DNA sequencing data, urine DNA sequencing data, saliva DNA sequencing data, hemocyte DNA
Sequencing data, plasma dna sequencing data etc.,.
Further, tumor is additionally operable to obtain medication recommendation letter according to the described tumor evolutionary phase with diagnosis unit 42
Breath.
Should tumor based on gene type also include with diagnostic system:
Diagnostic result administrative unit, for obtaining diagnostic result according to the described tumor evolutionary phase, and stores described diagnosis
Result.
The tumor based on gene type of the present embodiment can use unified flow process to complete relatively with diagnostic system to join
Examine the Difference test of genome.Fig. 5 is the tumor based on the gene type sequence with diagnostic system of one embodiment of the invention
The schematic diagram that column data is understood automatically.After the storage completing sequence data, will start sequencing data deciphering:
Sequence quality detection and screening, if sequence quality is defective, then send early warning information to related personnel;Qualified sequence
Location, call academic circles at present, sequence positioning software that industrial quarters is all approved, such as bwa, bowtie etc., for the ginseng of the mankind
Examine genome, such as hg19, carry out reading sequence location;The detection of mutation type, and for order-checking region design, retrieve each position
Put and whether have enough reading sequence coverings, identify conservative interval, the absent region etc. of coverage rate deficiency.Inspection for mutation type
Survey, mainly include SNV, indel, CNV and fusion, i.e. relative to reference to the mononucleotide difference of genome, the insertion of small fragment
With disappearance, the quantity variance of large fragment and gene fusion.In the context of detection of mutation type, except with reference to the most conventional science
Outside software, also optimize corresponding software by developing further, complete work.
The tumor based on gene type of the present embodiment can realize detecting the abrupt information in region certainly with diagnostic system
Move and dissolve reading.Human genome is huge and complicated, relies on the gene information that resolved, obtain more quickly difference site for
The impact of protein coding gene.Fig. 6 is the tumor based on the gene type base with diagnostic system of one embodiment of the invention
Schematic diagram because of detection annotation.At present conventional Academic Software, such as annovar, snpEff etc., can assist annotation SNV,
The impact on the amino acid residue of protein sequence such as indel and the impact of gene structure.Fractional mutations, it is also possible to further
Obtain the occurrence rate in different crowd;In conjunction with corresponding tumor probability of happening in crowd, gene can be determined whether
Difference whether tumor is had an impact;The most thus can be determined which comprises and the reference discrepant gene of genome can
Using as wild type.After completing substantially to annotate, one of them suggestion being to provide rationality medication, as partial nucleotide is many
State property (SNP) has difference reaction to chemotherapeutics, therefore can use certain medicine when certain SNP, when another SNP,
Then can not use certain medicine;Followed by for the choosing of targeted drug of specific gene or genic mutation type;Another is operated in
In the judgement of tumor evolution process, as some cancer evolution in early days, often there is more eurypalynous gene mutation type, along with
The evolution of cancer, the mutation type being suitable for human body can be enriched with, thus declined in terms of mutation type, but cancerous cell is disliked
Property but can increase, and subsequently, mutation type may increase further, causes aggravation etc..
Should tumor based on gene type can also include with diagnostic system:
Diagnosis report output unit, according to tumor with the result of calculation of diagnosis unit, and the diagnosis of the doctor in charge and
Treatment information, and the genotype of pharmaceutical relevant gene, it is provided that medication accurately is advised;The gene of gene is driven based on tumor
Type, it is provided that the evolutionary phase diagnostic message of tumor.Fig. 7 is that the tumor based on gene type of one embodiment of the invention is with examining
The schematic diagram of the diagnosis report output unit of disconnected system.First, based on microsoft office word increased income, artificial constructed template,
Wherein need the part of amendment, be then identified with spcial character and numeral etc.;Then, the tumor meter with diagnostic system is read
Calculate result, and medical record information, generate the report of preliminary webpage based on template style, and be sent to be correlated with lettergram mode by link
Report auditor;Then, report auditor browses report, and online modification needs the part of amendment, completes examination & verification, and raw
Become Final Report.Amendment information and corresponding report, will encrypt storage automatically to corresponding data base.And Final Report, will send
To related personnel, in case follow-up use.
The DNA sequencing data type that the tumor based on gene type of this embodiment provides according to user with diagnostic system
And batch, and the diagnostic message etc. that doctor provides, provide a user with the most real-time, medicine suggestion and tumor are drilled accurately
The diagnostic message in change stage.
The data processing method of present invention offer and tumor based on gene type are with diagnostic system, and transmitting terminal is by be passed
Transmission of data carries out segmentation compression packing, and builds the index file of packet;Transmitting terminal sends packet to described receiving terminal;Connect
Receiving end is according to the integrity of described index file detection received data bag;If receiving terminal receives complete packet, then
Described packet is integrated, it is achieved that the online high-fidelity transfer of data.
Those skilled in the art are it should be appreciated that embodiments herein can be provided as method, system or computer journey
Sequence product.Therefore, in terms of the application can use complete hardware embodiment, complete software implementation or combine software and hardware
The form of embodiment.And, the application can use in one or more calculating wherein including computer usable program code
The upper computer program implemented of machine usable storage medium (including but not limited to disk memory, CD-ROM, optical memory etc.)
The form of product.
The application is with reference to method, equipment (system) and the flow process of computer program according to the embodiment of the present application
Figure and/or block diagram describe.It should be understood that can the most first-class by computer program instructions flowchart and/or block diagram
Flow process in journey and/or square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided
Instruction arrives the processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device to produce
A raw machine so that the instruction performed by the processor of computer or other programmable data processing device is produced for real
The device of the function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame now.
These computer program instructions may be alternatively stored in and computer or other programmable data processing device can be guided with spy
Determine in the computer-readable memory that mode works so that the instruction being stored in this computer-readable memory produces and includes referring to
Make the manufacture of device, this command device realize at one flow process of flow chart or multiple flow process and/or one square frame of block diagram or
The function specified in multiple square frames.
These computer program instructions also can be loaded in computer or other programmable data processing device so that at meter
Perform sequence of operations step on calculation machine or other programmable devices to produce computer implemented process, thus at computer or
The instruction performed on other programmable devices provides for realizing at one flow process of flow chart or multiple flow process and/or block diagram one
The step of the function specified in individual square frame or multiple square frame.
It should be noted that term " includes ", " comprising " or its any other variant are intended to the bag of nonexcludability
Contain, so that include that the process of a series of key element, method, article or equipment not only include those key elements, but also include
Other key elements being not expressly set out, or also include the key element intrinsic for this process, method, article or equipment.
In the case of there is no more restriction, statement " including ... " key element limited, it is not excluded that including described key element
Process, method, article or equipment in there is also other identical element.
In the description of the present invention, illustrate a large amount of detail.Although it is understood that, embodiments of the invention can
To put into practice in the case of there is no these details.In some instances, it is not shown specifically known method, structure and skill
Art, in order to do not obscure the understanding of this description.Similarly, it will be appreciated that disclose to simplify the present invention and help to understand respectively
One or more in individual inventive aspect, above in the description of the exemplary embodiment of the present invention, each of the present invention is special
Levy and be sometimes grouped together in single embodiment, figure or descriptions thereof.But, should be by the method solution of the disclosure
Release in reflecting an intention that i.e. the present invention for required protection requires than the feature being expressly recited in each claim more
Many features.More precisely, as the following claims reflect, inventive aspect is less than single reality disclosed above
Execute all features of example.Therefore, it then follows claims of detailed description of the invention are thus expressly incorporated in this detailed description of the invention,
The most each claim itself is as the independent embodiment of the present invention.
Above example is merely to illustrate technical scheme, is not intended to limit;Although with reference to previous embodiment
The present invention is described in detail, it will be understood by those within the art that: it still can be to aforementioned each enforcement
Technical scheme described in example is modified, or wherein portion of techniques feature is carried out equivalent;And these are revised or replace
Change, do not make the essence of appropriate technical solution depart from the spirit and scope of various embodiments of the present invention technical scheme.
Claims (10)
1. a data processing method, it is characterised in that including:
Data to be transmitted is carried out segmentation compression packing by transmitting terminal, and builds the index file of packet;
Described index file is sent to receiving terminal by transmitting terminal;
After receiving terminal receives described index file, then send index file reception successful information to terminate index bag to transmitting terminal
Transmission, and initial transmitting terminal packet send;
Transmitting terminal sends packet to described receiving terminal;
Receiving terminal is according to the integrity of described index file detection received data bag;
If receiving terminal receives complete packet, then carry out described packet decompressing integrating.
Data processing method the most according to claim 1, it is characterised in that receiving terminal sends index file to transmitting terminal and connects
Receipts successful information includes:
The receiving terminal index file to receiving carries out integrity detection, if receiving complete index file, then to transmitting terminal
Send index file and receive successful information.
Data processing method the most according to claim 1, it is characterised in that described index file is sent by described transmitting terminal
To receiving terminal, including:
Described index file is sent to receiving terminal by described transmitting terminal with pattern the most repeatedly.
Data processing method the most according to claim 1, it is characterised in that also include:
If the packet that receiving terminal receives is imperfect, then building feedback index file, described feedback index file includes disappearance
The information of file.
Data processing method the most according to claim 4, it is characterised in that also include:
Transmitting terminal sends packet according to feedback index file to receiving terminal.
Data processing method the most according to claim 1, it is characterised in that also include: different types of packet is entered
Row encryption, and preserve to different memory nodes.
7. a tumor based on gene type is with diagnostic system, it is characterised in that including:
Data transport storage cell, for utilizing the data processing method of claim 1-6 any one transmit and store gene
Sequence data and diagnosis data;
Tumor is with diagnosis unit, for judging the tumor evolutionary phase according to described gene sequence data and diagnosis data.
Tumor based on gene type the most according to claim 7 is with diagnostic system, it is characterised in that described gene sequence
Column data includes: oncogene sequencing data, cancer beside organism's gene sequencing data, urine gene sequencing data, saliva gene are surveyed
Ordinal number evidence, hemocyte gene sequencing data and blood plasma gene sequencing data.
Tumor based on gene type the most according to claim 7 is with diagnostic system, it is characterised in that described tumor is accompanied
It is additionally operable to according to described tumor evolutionary phase and genotype with diagnosis unit, it is provided that medication advisory information.
Tumor based on gene type the most according to claim 7 is with diagnostic system, it is characterised in that also include:
Diagnostic result administrative unit, for obtaining diagnostic result according to the described tumor evolutionary phase, and stores described diagnostic result.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610480400.6A CN106169020A (en) | 2016-06-27 | 2016-06-27 | Data processing method and tumor companion diagnosis system based on genotyping |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610480400.6A CN106169020A (en) | 2016-06-27 | 2016-06-27 | Data processing method and tumor companion diagnosis system based on genotyping |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN106169020A true CN106169020A (en) | 2016-11-30 |
Family
ID=58064397
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201610480400.6A Pending CN106169020A (en) | 2016-06-27 | 2016-06-27 | Data processing method and tumor companion diagnosis system based on genotyping |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN106169020A (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107346372A (en) * | 2017-06-19 | 2017-11-14 | 苏州班凯基因科技有限公司 | A kind of database and its construction method understood applied to gene mutation |
| CN109698703A (en) * | 2017-10-20 | 2019-04-30 | 人和未来生物科技(长沙)有限公司 | Gene sequencing data decompression method, system and computer-readable medium |
| WO2019080670A1 (en) * | 2017-10-24 | 2019-05-02 | 人和未来生物科技(长沙)有限公司 | Gene sequencing data compression method and decompression method, system, and computer readable medium |
| CN111091914A (en) * | 2018-10-23 | 2020-05-01 | 百度在线网络技术(北京)有限公司 | Cancer typing staging method and device based on medical record |
| CN114267445A (en) * | 2021-12-23 | 2022-04-01 | 山东众阳健康科技集团有限公司 | Diagnostic consistency checking method, system, equipment and medium |
Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020078062A1 (en) * | 1999-08-13 | 2002-06-20 | Fujitsu Limited | File processing method, data processing apparatus and storage medium |
| CN1379363A (en) * | 2001-04-02 | 2002-11-13 | 华邦电子股份有限公司 | Method for digital image processing and method for multi-purpose data processing |
| CN1610270A (en) * | 2003-10-24 | 2005-04-27 | 上海华虹计通智能卡系统有限公司 | Multi-message compression transmitting method in transit card |
| CN1758587A (en) * | 2005-09-06 | 2006-04-12 | 宏碁股份有限公司 | Data processing method |
| CN101441639A (en) * | 2007-11-21 | 2009-05-27 | 英业达股份有限公司 | A method for generating an image file |
| CN101478684A (en) * | 2008-12-31 | 2009-07-08 | 杭州华三通信技术有限公司 | Method and system for detecting integrity of stored video data |
| CN101996227A (en) * | 2009-08-13 | 2011-03-30 | 鸿富锦精密工业(深圳)有限公司 | Document compression system and method |
| CN102820982A (en) * | 2011-09-21 | 2012-12-12 | 金蝶软件(中国)有限公司 | Data transmission method and device |
| CN104822063A (en) * | 2015-04-16 | 2015-08-05 | 长沙理工大学 | Compressed sensing video reconstruction method based on dictionary learning residual-error reconstruction |
| CN105243298A (en) * | 2015-11-06 | 2016-01-13 | 吴志宏 | Pancreatic cancer related cancer gene mutation information collection and analysis system and analysis method |
| CN105447300A (en) * | 2015-09-28 | 2016-03-30 | 董永华 | Medical information sharing method as well as system and terminal thereof |
-
2016
- 2016-06-27 CN CN201610480400.6A patent/CN106169020A/en active Pending
Patent Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020078062A1 (en) * | 1999-08-13 | 2002-06-20 | Fujitsu Limited | File processing method, data processing apparatus and storage medium |
| CN1379363A (en) * | 2001-04-02 | 2002-11-13 | 华邦电子股份有限公司 | Method for digital image processing and method for multi-purpose data processing |
| CN1610270A (en) * | 2003-10-24 | 2005-04-27 | 上海华虹计通智能卡系统有限公司 | Multi-message compression transmitting method in transit card |
| CN1758587A (en) * | 2005-09-06 | 2006-04-12 | 宏碁股份有限公司 | Data processing method |
| CN101441639A (en) * | 2007-11-21 | 2009-05-27 | 英业达股份有限公司 | A method for generating an image file |
| CN101478684A (en) * | 2008-12-31 | 2009-07-08 | 杭州华三通信技术有限公司 | Method and system for detecting integrity of stored video data |
| CN101996227A (en) * | 2009-08-13 | 2011-03-30 | 鸿富锦精密工业(深圳)有限公司 | Document compression system and method |
| CN102820982A (en) * | 2011-09-21 | 2012-12-12 | 金蝶软件(中国)有限公司 | Data transmission method and device |
| CN104822063A (en) * | 2015-04-16 | 2015-08-05 | 长沙理工大学 | Compressed sensing video reconstruction method based on dictionary learning residual-error reconstruction |
| CN105447300A (en) * | 2015-09-28 | 2016-03-30 | 董永华 | Medical information sharing method as well as system and terminal thereof |
| CN105243298A (en) * | 2015-11-06 | 2016-01-13 | 吴志宏 | Pancreatic cancer related cancer gene mutation information collection and analysis system and analysis method |
Non-Patent Citations (2)
| Title |
|---|
| 向重伦等: "《多用户微机系统及其使用》", 30 October 1990, 四川科学技术出版社 * |
| 陈冠铭等: "《计算机网络原理与应用》", 31 January 2003, 中国铁道出版社 * |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107346372A (en) * | 2017-06-19 | 2017-11-14 | 苏州班凯基因科技有限公司 | A kind of database and its construction method understood applied to gene mutation |
| CN109698703A (en) * | 2017-10-20 | 2019-04-30 | 人和未来生物科技(长沙)有限公司 | Gene sequencing data decompression method, system and computer-readable medium |
| CN109698703B (en) * | 2017-10-20 | 2020-10-20 | 人和未来生物科技(长沙)有限公司 | Gene sequencing data decompression method, system and computer readable medium |
| WO2019080670A1 (en) * | 2017-10-24 | 2019-05-02 | 人和未来生物科技(长沙)有限公司 | Gene sequencing data compression method and decompression method, system, and computer readable medium |
| CN111091914A (en) * | 2018-10-23 | 2020-05-01 | 百度在线网络技术(北京)有限公司 | Cancer typing staging method and device based on medical record |
| CN111091914B (en) * | 2018-10-23 | 2023-11-21 | 百度在线网络技术(北京)有限公司 | Cancer classification and staging method and device based on medical records |
| CN114267445A (en) * | 2021-12-23 | 2022-04-01 | 山东众阳健康科技集团有限公司 | Diagnostic consistency checking method, system, equipment and medium |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Ahmed et al. | Human gene and disease associations for clinical‐genomics and precision medicine research | |
| Cleemput et al. | Genome Detective Coronavirus Typing Tool for rapid identification and characterization of novel coronavirus genomes | |
| Pérez-Cobas et al. | Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses | |
| Malone et al. | Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs | |
| TWI229807B (en) | Method and apparatus for deriving the genome of an individual | |
| Arora et al. | Challenges, barriers, and facilitators in telemedicine implementation in India: a scoping review | |
| Rorie et al. | Electronic case report forms and electronic data capture within clinical trials and pharmacoepidemiology | |
| US20210233664A1 (en) | Data Based Cancer Research and Treatment Systems and Methods | |
| CN106169020A (en) | Data processing method and tumor companion diagnosis system based on genotyping | |
| Zhang et al. | Analysis of genomic characteristics and transmission routes of patients with confirmed SARS-CoV-2 in Southern California during the early stage of the US COVID-19 pandemic | |
| EP3826021A1 (en) | Method for preserving and using genome and genomic data | |
| Johnston et al. | Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome | |
| Hatherell et al. | Declaring a tuberculosis outbreak over with genomic epidemiology | |
| Douglas et al. | Tracing the international arrivals of SARS-CoV-2 Omicron variants after Aotearoa New Zealand reopened its border | |
| CA3116712A1 (en) | Data based cancer research and treatment systems and methods | |
| Borges et al. | SARS-CoV-2 introductions and early dynamics of the epidemic in Portugal | |
| CN110178184A (en) | Carcinogenic splice variant determines | |
| US20160070881A1 (en) | System, method and graphical user interface for creating modular, patient transportable genomic analytic data | |
| JP2025156469A (en) | Information processing method, information processing system, and computer program | |
| Perez et al. | The early SARS-CoV-2 epidemic in Senegal was driven by the local emergence of B. 1.416 and the introduction of B. 1.1. 420 from Europe | |
| CN118762745A (en) | Method and device for determining organ change information based on images and genes | |
| Zeghbib et al. | The importance of equally accessible genomic surveillance in the age of pandemics | |
| Brown et al. | Pilot evaluation of a fully automated bioinformatics system for analysis of methicillin-resistant Staphylococcus aureus genomes and detection of outbreaks | |
| Shahamatdar et al. | Deceptive learning in histopathology | |
| JP2015225356A (en) | Gene information providing method, gene information providing program, and gene information providing system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20161130 |