CO2019009922A2 - Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads
- Publication number
- CO2019009922A2 (application number CONC2019/0009922A)
- Authority
- CO
- Colombia
- Prior art keywords
- genomic sequences
- syntax elements
- reference genome
- compressed
- aligned
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2282—Tablespace storage structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
- G06F16/2365—Ensuring data consistency and integrity
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/602—Providing cryptographic facilities or services
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/10—Ploidy or copy number detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/20—Sequence assembly
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/10—Signal processing, e.g. from mass spectrometry [MS] or from PCR
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B45/00—ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/10—Ontologies; Annotations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/40—Encryption of genetic data
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/50—Compression of genetic data
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B99/00—Subject matter not provided for in other groups of this subclass
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/3084—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
- H03M7/3086—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing a sliding window, e.g. LZ77
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/70—Type of the data to be coded, other than image and sound
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Biophysics (AREA)
- Databases & Information Systems (AREA)
- Bioethics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical & Material Sciences (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Computer Security & Cryptography (AREA)
- Computer Hardware Design (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Public Health (AREA)
- Artificial Intelligence (AREA)
- Epidemiology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Television Signal Processing For Recording (AREA)
- Labeling Devices (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
The method and apparatus described in this disclosure include representing a reference genome in terms of syntax elements that describe the differences between said reference genome and previously aligned genomic sequences. Each aligned genomic sequence is described by a subset of syntax elements. The syntax elements describing all the genomic sequences are partitioned into blocks according to their statistical properties. Each block of syntax elements is entropy coded. The entropy-coded blocks are then concatenated to form a compressed bitstream. The differences between the reference genome and the aligned sequences are expressed in terms of syntax elements, which are embedded in the bitstream of coded blocks of syntax elements describing aligned reads. The described method enables reconstruction of the reference genome used for alignment when the compressed genomic sequences are decoded, while preserving different random-access options on the compressed data and allowing efficient compression.
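The pipeline in the abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not the patented format: reads are aligned without insertions or deletions, they cover the region contiguously, the block layout and the names `encode` and `decode` are hypothetical, and `zlib` stands in for the entropy coder. The sketch shows syntax elements grouped into per-type blocks, each block compressed separately, the blocks concatenated into one stream, and a final block of reference differences that lets the decoder rebuild the reference without receiving it separately.

```python
import struct
import zlib


def encode(reference, reads):
    """Encode aligned reads (pos, seq) plus reference differences into bytes."""
    known = {}  # bases a decoder can infer from the reads seen so far
    pos_blk, len_blk, lit_blk, mm_blk = [], [], [], []
    for pos, seq in sorted(reads):
        pos_blk.append(str(pos))
        len_blk.append(str(len(seq)))
        for i, base in enumerate(seq):
            p = pos + i
            if p not in known:
                known[p] = base  # first base seen here: sent as a literal
                lit_blk.append(base)
            elif known[p] != base:
                # read index : offset in read : base observed in the read
                mm_blk.append(f"{len(pos_blk) - 1}:{i}:{base}")
    # Differences between the read-derived sequence and the true reference.
    ref_blk = [f"{p}:{reference[p]}" for p in sorted(known)
               if known[p] != reference[p]]
    blocks = [",".join(pos_blk), ",".join(len_blk), "".join(lit_blk),
              ",".join(mm_blk), ",".join(ref_blk)]
    out = bytearray()
    for text in blocks:
        payload = zlib.compress(text.encode("ascii"))  # stand-in entropy coder
        out += struct.pack(">I", len(payload)) + payload  # length-prefixed block
    return bytes(out)


def decode(stream):
    """Return (reads, reference) recovered from the concatenated blocks."""
    blocks, off = [], 0
    while off < len(stream):
        (n,) = struct.unpack_from(">I", stream, off)
        off += 4
        blocks.append(zlib.decompress(stream[off:off + n]).decode("ascii"))
        off += n
    pos_s, len_s, lits, mm_s, ref_s = blocks
    positions = [int(x) for x in pos_s.split(",")] if pos_s else []
    lengths = [int(x) for x in len_s.split(",")] if len_s else []
    mismatches = {}
    for item in filter(None, mm_s.split(",")):
        r, i, base = item.split(":")
        mismatches[(int(r), int(i))] = base
    known, literals, reads = {}, iter(lits), []
    for r, (pos, n) in enumerate(zip(positions, lengths)):
        seq = []
        for i in range(n):
            p = pos + i
            if p not in known:
                known[p] = next(literals)
                seq.append(known[p])
            else:
                seq.append(mismatches.get((r, i), known[p]))
        reads.append((pos, "".join(seq)))
    # Apply the embedded reference differences to recover the reference.
    for item in filter(None, ref_s.split(",")):
        p, base = item.split(":")
        known[int(p)] = base
    reference = "".join(known[p] for p in sorted(known))
    return reads, reference


reference = "ACGTACGTAC"
reads = [(0, "ACCTAC"), (2, "GTACGT"), (4, "ACGTAC")]
stream = encode(reference, reads)
decoded_reads, decoded_reference = decode(stream)
```

Keeping each element type in its own block mirrors the abstract's point that elements with similar statistics are coded together. In this toy the length prefixes also let a decoder skip a block it does not need, which only gestures at the random-access options the abstract mentions.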
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2016/074311 WO2018068830A1 (en) | 2016-10-11 | 2016-10-11 | Method and system for the transmission of bioinformatics data |
| PCT/EP2016/074307 WO2018068829A1 (en) | 2016-10-11 | 2016-10-11 | Method and apparatus for compact representation of bioinformatics data |
| PCT/EP2016/074297 WO2018068827A1 (en) | 2016-10-11 | 2016-10-11 | Efficient data structures for bioinformatics information representation |
| PCT/EP2016/074301 WO2018068828A1 (en) | 2016-10-11 | 2016-10-11 | Method and system for storing and accessing bioinformatics data |
| PCT/US2017/017842 WO2018071055A1 (en) | 2016-10-11 | 2017-02-14 | Method and apparatus for the compact representation of bioinformatics data |
| PCT/US2017/041579 WO2018071078A1 (en) | 2016-10-11 | 2017-07-11 | Method and apparatus for the access to bioinformatics data structured in access units |
| PCT/US2017/066458 WO2018151786A1 (en) | 2016-10-11 | 2017-12-14 | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CO2019009922A2 (es) | 2020-01-17 |
Family
ID=61905752
Family Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CONC2019/0003639A CO2019003639A2 (es) | 2016-10-11 | 2019-04-11 | Method and systems for the indexing of bioinformatics data |
| CONC2019/0003638A CO2019003638A2 (es) | 2016-10-11 | 2019-04-11 | Method and apparatus for the access to bioinformatics data structured in access units |
| CONC2019/0003595A CO2019003595A2 (es) | 2016-10-11 | 2019-04-11 | Method and systems for the representation and processing of bioinformatics data using reference sequences |
| CONC2019/0003842A CO2019003842A2 (es) | 2016-10-11 | 2019-04-15 | Method and system for the selective access of stored or transmitted bioinformatics data |
| CONC2019/0009920A CO2019009920A2 (es) | 2016-10-11 | 2019-09-12 | Method and apparatus for the compact representation of bioinformatics data using multiple genomic descriptors |
| CONC2019/0009922A CO2019009922A2 (es) | 2016-10-11 | 2019-09-12 | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads |
Country Status (17)
| Country | Link |
|---|---|
| US (6) | US20200042735A1 (es) |
| EP (3) | EP3526694A4 (es) |
| JP (4) | JP2020505702A (es) |
| KR (4) | KR20190073426A (es) |
| CN (6) | CN110168651A (es) |
| AU (3) | AU2017342688A1 (es) |
| BR (7) | BR112019007359A2 (es) |
| CA (3) | CA3040138A1 (es) |
| CL (6) | CL2019000972A1 (es) |
| CO (6) | CO2019003639A2 (es) |
| EA (2) | EA201990916A1 (es) |
| IL (3) | IL265879B2 (es) |
| MX (2) | MX2019004130A (es) |
| PE (7) | PE20191058A1 (es) |
| PH (6) | PH12019550060A1 (es) |
| SG (3) | SG11201903270RA (es) |
| WO (4) | WO2018071054A1 (es) |
Families Citing this family (46)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB2526598B (en) * | 2014-05-29 | 2018-11-28 | Imagination Tech Ltd | Allocation of primitives to primitive blocks |
| US11574287B2 (en) | 2017-10-10 | 2023-02-07 | Text IQ, Inc. | Automatic document classification |
| US11030324B2 (en) * | 2017-11-30 | 2021-06-08 | Koninklijke Philips N.V. | Proactive resistance to re-identification of genomic data |
| WO2019191083A1 (en) * | 2018-03-26 | 2019-10-03 | Colorado State University Research Foundation | Apparatuses, systems and methods for generating and tracking molecular digital signatures to ensure authenticity and integrity of synthetic dna molecules |
| MX2020012672A (es) * | 2018-05-31 | 2021-02-09 | Koninklijke Philips Nv | System and method for allele interpretation using a graph-based reference genome |
| CN108753765B (zh) * | 2018-06-08 | 2020-12-08 | 中国科学院遗传与发育生物学研究所 | A genome assembly method for constructing ultra-long continuous DNA sequences |
| US12210904B2 (en) * | 2018-06-29 | 2025-01-28 | International Business Machines Corporation | Hybridized storage optimization for genomic workloads |
| US11474978B2 (en) * | 2018-07-06 | 2022-10-18 | Capital One Services, Llc | Systems and methods for a data search engine based on data profiles |
| US12300358B2 (en) * | 2018-08-20 | 2025-05-13 | The Board Of Trustees Of The Leland Stanford Junior University | Systems and methods for compressing genetic sequencing data and uses thereof |
| GB2585816A (en) * | 2018-12-12 | 2021-01-27 | Univ York | Proof-of-work for blockchain applications |
| US20210074381A1 (en) * | 2019-09-11 | 2021-03-11 | Enancio | Method for the compression of genome sequence data |
| WO2021063904A1 (en) * | 2019-10-01 | 2021-04-08 | Koninklijke Philips N.V. | System and methods for the efficient identification and extraction of sequence paths in genome graphs |
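| CN110797087B (zh) * | 2019-10-17 | 2020-11-03 | 南京医基云医疗数据研究院有限公司 | Sequencing sequence processing method and apparatus, storage medium, and electronic device |
| JP7631330B2 (ja) * | 2019-10-18 | 2025-02-18 | コーニンクレッカ フィリップス エヌ ヴェ | System and method for effective compression, representation, and decompression of diverse tabulated data |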
| US12322477B1 (en) * | 2019-12-04 | 2025-06-03 | John Hayward | Methods of efficiently transforming and comparing recombinable DNA information |
| US12445148B2 (en) | 2020-01-03 | 2025-10-14 | Koninklijke Philips N.V. | System and method for effective compression representation and decompression of diverse tabulated data |
| BR112022015328A2 (pt) * | 2020-02-07 | 2022-09-27 | Koninklijke Philips Nv | Method and system for information compression |
| CN111243663B (zh) * | 2020-02-26 | 2022-06-07 | 西安交通大学 | A gene variation detection method based on a pattern growth algorithm |
| CN111370070B (zh) * | 2020-02-27 | 2023-10-27 | 中国科学院计算技术研究所 | A compression processing method for big-data gene sequencing files |
| US12014802B2 (en) | 2020-03-17 | 2024-06-18 | Western Digital Technologies, Inc. | Devices and methods for locating a sample read in a reference genome |
| US12006539B2 (en) | 2020-03-17 | 2024-06-11 | Western Digital Technologies, Inc. | Reference-guided genome sequencing |
| US11837330B2 (en) | 2020-03-18 | 2023-12-05 | Western Digital Technologies, Inc. | Reference-guided genome sequencing |
| US12347528B2 (en) | 2020-04-07 | 2025-07-01 | Koninklijke Philips N.V. | System and method for storing and transporting diverse genomic data |
| EP3896698A1 (en) * | 2020-04-15 | 2021-10-20 | Genomsys SA | Method and system for the efficient data compression in mpeg-g |
| CN111459208A (zh) * | 2020-04-17 | 2020-07-28 | 南京铁道职业技术学院 | Control system and method for the electric energy of a subway power supply system |
| US12224042B2 (en) | 2020-06-22 | 2025-02-11 | SanDisk Technologies, Inc. | Devices and methods for genome sequencing |
| US12093803B2 (en) * | 2020-07-01 | 2024-09-17 | International Business Machines Corporation | Downsampling genomic sequence data |
| JP2023541341A (ja) * | 2020-09-14 | 2023-10-02 | イルミナ インコーポレイテッド | Custom data files for personalized medicine |
| WO2022073810A1 (en) * | 2020-10-06 | 2022-04-14 | Koninklijke Philips N.V. | Methods and systems for storing genomic data in a file structure comprising protection metadata |
| AU2021357587A1 (en) * | 2020-10-06 | 2023-06-08 | Koninklijke Philips N.V. | Methods and systems for storing genomic data in a file structure comprising an information metadata structure |
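| CN112836355B (zh) * | 2021-01-14 | 2023-04-18 | 西安科技大学 | A method for predicting the probability of roof weighting at a coal mining face |
| JP7118199B1 (ja) * | 2021-03-26 | 2022-08-15 | エヌ・ティ・ティ・コミュニケーションズ株式会社 | Processing system, processing method, and processing program |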
| US12406413B2 (en) | 2021-05-10 | 2025-09-02 | Optum Services (Ireland) Limited | Predictive data analysis using image representations of genomic data |
| ES2930699A1 (es) * | 2021-06-10 | 2022-12-20 | Veritas Intercontinental S L | Method of genomic analysis on a bioinformatics platform |
| CN113670643B (zh) * | 2021-08-30 | 2023-05-12 | 四川虹美智能科技有限公司 | Intelligent air conditioner testing method and system |
| CN113643761B (zh) * | 2021-10-13 | 2022-01-18 | 苏州赛美科基因科技有限公司 | A method for extracting the data required to interpret next-generation sequencing results |
| US20230187020A1 (en) * | 2021-12-15 | 2023-06-15 | Illumina Software, Inc. | Systems and methods for iterative and scalable population-scale variant analysis |
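| JP2025512716A (ja) | 2022-03-08 | 2025-04-22 | イルミナ インコーポレイテッド | Multi-pass software-accelerated genomic read mapping engine |
| CN115458050B (zh) * | 2022-08-05 | 2026-01-06 | 武汉大学 | Multi-gene discovery network construction method, apparatus, device, and storage medium |
| CN115391284B (zh) * | 2022-10-31 | 2023-02-03 | 四川大学华西医院 | Method and system for rapid identification of gene data files, and computer-readable storage medium |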
| WO2024114597A1 (en) * | 2022-12-02 | 2024-06-06 | City University Of Hong Kong | Reinforcement-learning-based network transmission of compressed genome sequence |
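| CN116541348B (zh) * | 2023-03-22 | 2023-09-26 | 河北热点科技股份有限公司 | Intelligent data storage method and all-in-one terminal query machine |
| CN116739646B (zh) * | 2023-08-15 | 2023-11-24 | 南京易联阳光信息技术股份有限公司 | Network transaction big data analysis method and analysis system |
| CN117153270B (zh) * | 2023-10-30 | 2024-02-02 | 吉林华瑞基因科技有限公司 | A method for processing next-generation gene sequencing data |
| CN117708755B (zh) * | 2023-12-17 | 2024-06-21 | 重庆文理学院 | Data processing method and apparatus based on the ecological environment |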
| WO2025137316A1 (en) * | 2023-12-21 | 2025-06-26 | Illumina, Inc. | Sequence data processing, retention, and recovery |
Family Cites Families (54)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6303297B1 (en) * | 1992-07-17 | 2001-10-16 | Incyte Pharmaceuticals, Inc. | Database for storage and analysis of full-length sequences |
| JP3429674B2 (ja) | 1998-04-28 | 2003-07-22 | 沖電気工業株式会社 | Multiplex communication system |
| EP1410301A4 (en) * | 2000-04-12 | 2008-01-23 | Cleveland Clinic Foundation | SYSTEM FOR IDENTIFYING AND ANALYZING GENE EXPRESSION CONTAINING ELEMENTS RICH IN ADENYLATE URIDYLATE (ARE) |
| FR2820563B1 (fr) * | 2001-02-02 | 2003-05-16 | Expway | Method for compressing/decompressing a structured document |
| US20040153255A1 (en) * | 2003-02-03 | 2004-08-05 | Ahn Tae-Jin | Apparatus and method for encoding DNA sequence, and computer readable medium |
| DE10320711A1 (de) * | 2003-05-08 | 2004-12-16 | Siemens Ag | Method and arrangement for setting up and updating a user interface for accessing information pages in a data network |
| US8280640B2 (en) * | 2003-08-11 | 2012-10-02 | Eloret Corporation | System and method for pattern recognition in sequential data |
| WO2005094363A2 (en) * | 2004-03-30 | 2005-10-13 | New York University | System, method and software arrangement for bi-allele haplotype phasing |
| WO2006052242A1 (en) * | 2004-11-08 | 2006-05-18 | Seirad, Inc. | Methods and systems for compressing and comparing genomic data |
| WO2007132461A2 (en) * | 2006-05-11 | 2007-11-22 | Ramot At Tel Aviv University Ltd. | Classification of protein sequences and uses of classified proteins |
| SE531398C2 (sv) | 2007-02-16 | 2009-03-24 | Scalado Ab | Generation of a data stream and identification of positions within a data stream |
| KR101369745B1 (ko) * | 2007-04-11 | 2014-03-07 | 삼성전자주식회사 | Method and apparatus for multiplexing and demultiplexing asynchronous bitstreams |
| US8832112B2 (en) * | 2008-06-17 | 2014-09-09 | International Business Machines Corporation | Encoded matrix index |
| GB2477703A (en) * | 2008-11-14 | 2011-08-10 | Real Time Genomics Inc | A method and system for analysing data sequences |
| US20100217532A1 (en) * | 2009-02-25 | 2010-08-26 | University Of Delaware | Systems and methods for identifying structurally or functionally significant amino acid sequences |
| AU2010313247A1 (en) * | 2009-10-30 | 2012-05-24 | Synthetic Genomics, Inc. | Encoding text into nucleic acid sequences |
| EP2362657B1 (en) * | 2010-02-18 | 2013-04-24 | Research In Motion Limited | Parallel entropy coding and decoding methods and devices |
| WO2011143231A2 (en) * | 2010-05-10 | 2011-11-17 | The Broad Institute | High throughput paired-end sequencing of large-insert clone libraries |
| KR101952965B1 (ko) * | 2010-05-25 | 2019-02-27 | 더 리젠츠 오브 더 유니버시티 오브 캘리포니아 | BamBam: parallel comparative analysis of high-throughput sequencing data |
| US20140229495A1 (en) * | 2011-01-19 | 2014-08-14 | Koninklijke Philips N.V. | Method for processing genomic data |
| US9215162B2 (en) * | 2011-03-09 | 2015-12-15 | Annai Systems Inc. | Biological data networks and methods therefor |
| WO2012168815A2 (en) * | 2011-06-06 | 2012-12-13 | Koninklijke Philips Electronics N.V. | Method for assembly of nucleic acid sequence data |
| BR112013032332B1 (pt) * | 2011-06-16 | 2022-08-16 | Ge Video Compression, Llc | Context initialization in entropy coding |
| US8707289B2 (en) * | 2011-07-20 | 2014-04-22 | Google Inc. | Multiple application versions |
| CN104081772B (zh) * | 2011-10-06 | 2018-04-10 | 弗劳恩霍夫应用研究促进协会 | Entropy coding buffer arrangement |
| CA2854832C (en) * | 2011-11-07 | 2023-05-23 | Ingenuity Systems, Inc. | Methods and systems for identification of causal genomic variants |
| KR101922129B1 (ko) * | 2011-12-05 | 2018-11-26 | 삼성전자주식회사 | Method and apparatus for compressing and decompressing genetic information obtained using next-generation sequencing |
| US10140683B2 (en) * | 2011-12-08 | 2018-11-27 | Five3 Genomics, Llc | Distributed system providing dynamic indexing and visualization of genomic data |
| EP2608096B1 (en) * | 2011-12-24 | 2020-08-05 | Tata Consultancy Services Ltd. | Compression of genomic data file |
| US9600625B2 (en) * | 2012-04-23 | 2017-03-21 | Bina Technologies, Inc. | Systems and methods for processing nucleic acid sequence data |
| CN103049680B (zh) * | 2012-12-29 | 2016-09-07 | 深圳先进技术研究院 | Gene sequencing data reading method and system |
| US9679104B2 (en) * | 2013-01-17 | 2017-06-13 | Edico Genome, Corp. | Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform |
| WO2014145503A2 (en) * | 2013-03-15 | 2014-09-18 | Lieber Institute For Brain Development | Sequence alignment using divide and conquer maximum oligonucleotide mapping (dcmom), apparatus, system and method related thereto |
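| JP6054790B2 (ja) * | 2013-03-28 | 2016-12-27 | 三菱スペース・ソフトウエア株式会社 | Genetic information storage device, genetic information search device, genetic information storage program, genetic information search program, genetic information storage method, genetic information search method, and genetic information search system |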
| GB2512829B (en) * | 2013-04-05 | 2015-05-27 | Canon Kk | Method and apparatus for encoding or decoding an image with inter layer motion information prediction according to motion information compression scheme |
| WO2014186604A1 (en) * | 2013-05-15 | 2014-11-20 | Edico Genome Corp. | Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform |
| KR101522087B1 (ko) * | 2013-06-19 | 2015-05-28 | 삼성에스디에스 주식회사 | System and method for aligning base sequences considering mismatches |
| CN103336916B (zh) * | 2013-07-05 | 2016-04-06 | 中国科学院数学与系统科学研究院 | A sequencing sequence mapping method and system |
| US20150032711A1 (en) * | 2013-07-06 | 2015-01-29 | Victor Kunin | Methods for identification of organisms, assigning reads to organisms, and identification of genes in metagenomic sequences |
| KR101493982B1 (ko) * | 2013-09-26 | 2015-02-23 | 대한민국 | Variety recognition coding system and coding method using the same |
| CN104699998A (zh) * | 2013-12-06 | 2015-06-10 | 国际商业机器公司 | Method and apparatus for compressing and decompressing a genome |
| US10902937B2 (en) * | 2014-02-12 | 2021-01-26 | International Business Machines Corporation | Lossless compression of DNA sequences |
| US9639542B2 (en) * | 2014-02-14 | 2017-05-02 | Sap Se | Dynamic mapping of extensible datasets to relational database schemas |
| US9886561B2 (en) * | 2014-02-19 | 2018-02-06 | The Regents Of The University Of California | Efficient encoding and storage and retrieval of genomic data |
| US9354922B2 (en) | 2014-04-02 | 2016-05-31 | International Business Machines Corporation | Metadata-driven workflows and integration with genomic data processing systems and techniques |
| US20150379195A1 (en) * | 2014-06-25 | 2015-12-31 | The Board Of Trustees Of The Leland Stanford Junior University | Software haplotying of hla loci |
| GB2527588B (en) * | 2014-06-27 | 2016-05-18 | Gurulogic Microsystems Oy | Encoder and decoder |
| US20160019339A1 (en) * | 2014-07-06 | 2016-01-21 | Mercator BioLogic Incorporated | Bioinformatics tools, systems and methods for sequence assembly |
| US10230390B2 (en) * | 2014-08-29 | 2019-03-12 | Bonnie Berger Leighton | Compressively-accelerated read mapping framework for next-generation sequencing |
| US10116632B2 (en) * | 2014-09-12 | 2018-10-30 | New York University | System, method and computer-accessible medium for secure and compressed transmission of genomic data |
| US20160125130A1 (en) * | 2014-11-05 | 2016-05-05 | Agilent Technologies, Inc. | Method for assigning target-enriched sequence reads to a genomic location |
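| CN107851137A (zh) * | 2015-06-16 | 2018-03-27 | 汉诺威戈特弗里德威廉莱布尼茨大学 | Method for compressing genomic data |
| CN105956417A (zh) * | 2016-05-04 | 2016-09-21 | 西安电子科技大学 | Method for querying similar base sequences based on edit distance in a cloud environment |
| CN105975811B (zh) * | 2016-05-09 | 2019-03-15 | 管仁初 | A gene sequence analysis device with intelligent alignment |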
-
2017
- 2017-02-14 MX MX2019004130A patent/MX2019004130A/es unknown
- 2017-02-14 EP EP17859972.6A patent/EP3526694A4/en not_active Withdrawn
- 2017-02-14 JP JP2019540510A patent/JP2020505702A/ja not_active Withdrawn
- 2017-02-14 CN CN201780062919.5A patent/CN110168651A/zh active Pending
- 2017-02-14 PE PE2019000804A patent/PE20191058A1/es unknown
- 2017-02-14 CA CA3040138A patent/CA3040138A1/en not_active Abandoned
- 2017-02-14 US US16/341,426 patent/US20200042735A1/en not_active Abandoned
- 2017-02-14 BR BR112019007359A patent/BR112019007359A2/pt not_active IP Right Cessation
- 2017-02-14 AU AU2017342688A patent/AU2017342688A1/en not_active Abandoned
- 2017-02-14 WO PCT/US2017/017841 patent/WO2018071054A1/en not_active Ceased
- 2017-02-14 KR KR1020197013567A patent/KR20190073426A/ko not_active Withdrawn
- 2017-02-14 WO PCT/US2017/017842 patent/WO2018071055A1/en not_active Ceased
- 2017-02-14 SG SG11201903270RA patent/SG11201903270RA/en unknown
- 2017-07-11 BR BR112019007363A patent/BR112019007363A2/pt not_active Application Discontinuation
- 2017-07-11 PE PE2019000805A patent/PE20191227A1/es unknown
- 2017-07-11 MX MX2019004128A patent/MX2019004128A/es unknown
- 2017-07-11 JP JP2019540511A patent/JP7079786B2/ja active Active
- 2017-07-11 SG SG11201903272XA patent/SG11201903272XA/en unknown
- 2017-07-11 JP JP2019540513A patent/JP2020500383A/ja not_active Withdrawn
- 2017-07-11 EP EP17860868.3A patent/EP3526707A4/en not_active Withdrawn
- 2017-07-11 US US16/337,639 patent/US20190214111A1/en not_active Abandoned
- 2017-07-11 AU AU2017341685A patent/AU2017341685A1/en not_active Abandoned
- 2017-07-11 KR KR1020197013419A patent/KR20190069469A/ko not_active Withdrawn
- 2017-07-11 CN CN201780062885.XA patent/CN110114830B/zh active Active
- 2017-07-11 CN CN201780063013.5A patent/CN110506272B/zh active Active
- 2017-07-11 PE PE2019000803A patent/PE20191057A1/es unknown
- 2017-07-11 PE PE2019000802A patent/PE20191056A1/es unknown
- 2017-07-11 EA EA201990916A patent/EA201990916A1/ru unknown
- 2017-07-11 EA EA201990917A patent/EA201990917A1/ru unknown
- 2017-07-11 EP EP17860980.6A patent/EP3526657A4/en active Pending
- 2017-07-11 AU AU2017341684A patent/AU2017341684A1/en not_active Abandoned
- 2017-07-11 KR KR1020197013418A patent/KR20190062541A/ko not_active Withdrawn
- 2017-07-11 JP JP2019540512A patent/JP2019537172A/ja not_active Withdrawn
- 2017-07-11 BR BR112019007357A patent/BR112019007357A2/pt not_active Application Discontinuation
- 2017-07-11 CA CA3040147A patent/CA3040147A1/en not_active Abandoned
- 2017-07-11 WO PCT/US2017/041591 patent/WO2018071080A2/en not_active Ceased
- 2017-07-11 WO PCT/US2017/041585 patent/WO2018071079A1/en not_active Ceased
- 2017-07-11 IL IL265879A patent/IL265879B2/en unknown
- 2017-07-11 CA CA3040145A patent/CA3040145A1/en not_active Abandoned
- 2017-07-11 CN CN201780063014.XA patent/CN110121577B/zh active Active
- 2017-07-11 BR BR112019007360A patent/BR112019007360A2/pt not_active Application Discontinuation
- 2017-07-11 US US16/337,642 patent/US11404143B2/en active Active
- 2017-07-11 SG SG11201903271UA patent/SG11201903271UA/en unknown
- 2017-12-14 KR KR1020197026863A patent/KR20190117652A/ko not_active Withdrawn
- 2017-12-14 CN CN201780086529.1A patent/CN110603595B/zh active Active
- 2017-12-14 US US16/485,623 patent/US20190385702A1/en active Pending
- 2017-12-14 PE PE2019001667A patent/PE20200323A1/es unknown
- 2017-12-14 BR BR112019016230A patent/BR112019016230A2/pt not_active Application Discontinuation
- 2017-12-15 US US16/485,649 patent/US20200051667A1/en active Pending
- 2017-12-15 CN CN201780086770.4A patent/CN110678929B/zh active Active
- 2017-12-15 PE PE2019001669A patent/PE20200226A1/es unknown
- 2017-12-15 BR BR112019016232A patent/BR112019016232A2/pt not_active Application Discontinuation
-
2018
- 2018-02-14 PE PE2019001668A patent/PE20200227A1/es unknown
- 2018-02-14 BR BR112019016236A patent/BR112019016236A2/pt unknown
- 2018-02-14 US US16/485,670 patent/US20200051665A1/en active Pending
-
2019
- 2019-04-08 IL IL265928A patent/IL265928B/en active IP Right Grant
- 2019-04-10 CL CL2019000972A patent/CL2019000972A1/es unknown
- 2019-04-10 CL CL2019000968A patent/CL2019000968A1/es unknown
- 2019-04-10 CL CL2019000973A patent/CL2019000973A1/es unknown
- 2019-04-11 CO CONC2019/0003639A patent/CO2019003639A2/es unknown
- 2019-04-11 PH PH12019550060A patent/PH12019550060A1/en unknown
- 2019-04-11 CO CONC2019/0003638A patent/CO2019003638A2/es unknown
- 2019-04-11 IL IL265972A patent/IL265972A/en unknown
- 2019-04-11 PH PH12019550057A patent/PH12019550057A1/en unknown
- 2019-04-11 CO CONC2019/0003595A patent/CO2019003595A2/es unknown
- 2019-04-11 PH PH12019550059A patent/PH12019550059A1/en unknown
- 2019-04-11 PH PH12019550058A patent/PH12019550058A1/en unknown
- 2019-04-15 CO CONC2019/0003842A patent/CO2019003842A2/es unknown
- 2019-08-12 CL CL2019002277A patent/CL2019002277A1/es unknown
- 2019-08-12 CL CL2019002276A patent/CL2019002276A1/es unknown
- 2019-08-12 CL CL2019002275A patent/CL2019002275A1/es unknown
- 2019-08-13 PH PH12019501879A patent/PH12019501879A1/en unknown
- 2019-08-13 PH PH12019501881A patent/PH12019501881A1/en unknown
- 2019-09-12 CO CONC2019/0009920A patent/CO2019009920A2/es unknown
- 2019-09-12 CO CONC2019/0009922A patent/CO2019009922A2/es unknown
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CO2019009922A2 (es) | | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads |
| MX2019009682A (es) | | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads |
| CL2018001895A1 (es) | | Multi-type tree structure for video coding |
| MX2018004250A (es) | | Method and device for encoding or decoding an image |
| CL2016002184A1 (es) | | Adaptive switching of color spaces, color sampling rates and/or bit depths |
| MX2023006112A (es) | | Video encoding method and device, video decoding method and device |
| MX2019010795A (es) | | Video coding method using binary tree block partitioning |
| PH12014501531A1 (en) | | Sub-streams for wavefront parallel processing in video coding |
| EP4307592A3 (en) | | Bit allocation for encoding and decoding |
| CL2017002423A1 (es) | | Motion information derivation mode determination in video coding |
| MX336566B (es) | | Image processing device and image processing method |
| CO2017003348A2 (es) | | A device configured to decode a bitstream representative of a higher-order ambisonic audio signal, a method of decoding said bitstream, a device configured to encode a higher-order ambisonic audio signal to generate a bitstream, and a method of encoding said bitstream |
| AR111014A2 (es) | | Encoding method performed by a media encoder, media encoders and decoders, and wireless devices comprising said encoders and decoders |
| BR112021017453A2 (pt) | | Probability initialization for video coding |
| MX2016014691A (es) | | Pumps-off annular pressure while drilling system |
| PH12019500294A1 (en) | | Method and apparatus for coding and decoding polar codes |
| EA201991906A1 (ru) | | Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads |
| AR101058A1 (es) | | Video encoding device, video encoding method, and program |
| MX2022000965A (es) | | Video decoding method and device, and video encoding method and device therefor |
| MX2018008126A (es) | | Transmission apparatus, transmission method, reception apparatus, and reception method |
| AR110436A1 (es) | | Video encoding method, video decoding method, video encoding device, and video decoding device |
| TW201612895A (en) | | Method and apparatus for coding or decoding subband configuration data for subband groups |
| MX2018010460A (es) | | Transmission device, transmission method, reception device, and reception method |
| MX2024004357A (es) | | Video decoding method and apparatus, and video encoding method and apparatus |
| AR118200A1 (es) | | Sub-block coding by generalized intra prediction in video coding |