[go: up one dir, main page]

PE20191056A1 - Metodo y aparato para el acceso a datos bioinformaticos estructurados en unidades de acceso - Google Patents

Metodo y aparato para el acceso a datos bioinformaticos estructurados en unidades de acceso

Info

Publication number
PE20191056A1
PE20191056A1 PE2019000802A PE2019000802A PE20191056A1 PE 20191056 A1 PE20191056 A1 PE 20191056A1 PE 2019000802 A PE2019000802 A PE 2019000802A PE 2019000802 A PE2019000802 A PE 2019000802A PE 20191056 A1 PE20191056 A1 PE 20191056A1
Authority
PE
Peru
Prior art keywords
access
genomic
sequence reads
descriptor sets
access units
Prior art date
Application number
PE2019000802A
Other languages
English (en)
Inventor
Daniele Renzi
Giorgio Zoia
Claudio Alberti
Mohamed Khoso Baluch
Original Assignee
Genomsys Sa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/EP2016/074311 external-priority patent/WO2018068830A1/en
Priority claimed from PCT/EP2016/074307 external-priority patent/WO2018068829A1/en
Priority claimed from PCT/EP2016/074297 external-priority patent/WO2018068827A1/en
Priority claimed from PCT/EP2016/074301 external-priority patent/WO2018068828A1/en
Application filed by Genomsys Sa filed Critical Genomsys Sa
Priority claimed from PCT/US2017/041579 external-priority patent/WO2018071078A1/en
Publication of PE20191056A1 publication Critical patent/PE20191056A1/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2365Ensuring data consistency and integrity
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/602Providing cryptographic facilities or services
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/10Ploidy or copy number detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20Sequence assembly
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/10Signal processing, e.g. from mass spectrometry [MS] or from PCR
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/10Ontologies; Annotations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/40Encryption of genetic data
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/50Compression of genetic data
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B99/00Subject matter not provided for in other groups of this subclass
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3086Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing a sliding window, e.g. LZ77
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/70Type of the data to be coded, other than image and sound

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Biophysics (AREA)
  • Databases & Information Systems (AREA)
  • Bioethics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Chemical & Material Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Public Health (AREA)
  • Artificial Intelligence (AREA)
  • Epidemiology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Television Signal Processing For Recording (AREA)
  • Labeling Devices (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

Metodo y aparato para la codificacion y acceso selectivo de datos comprimidos de secuencia genomica producidos por maquinas de secuenciacion genomica. El proceso de codificacion se basa en alinear lecturas de secuencia con respecto a secuencias de referencia preexistentes o construidas, en clasificar y codificar las lecturas de secuencia por medio de conjuntos de descriptores, y ademas dividir los conjuntos de descriptores en unidades de acceso de diferentes tipos. El acceso selectivo eficiente a regiones genomicas especificas con la garantia de recuperar todas las lecturas de secuencia mapeadas a esas regiones se proporciona al: senalizar del tipo de configuracion de mapeo de datos utilizada para almacenar o transmitir los conjuntos de descriptores, determinar el numero minimo de unidades de acceso que necesitan recuperarse y decodificarse para acceder a una region genomica, proporcionar una tabla maestra de indice que contiene toda la informacion para optimizar el proceso de acceso a datos.
PE2019000802A 2016-10-11 2017-07-11 Metodo y aparato para el acceso a datos bioinformaticos estructurados en unidades de acceso PE20191056A1 (es)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
PCT/EP2016/074311 WO2018068830A1 (en) 2016-10-11 2016-10-11 Method and system for the transmission of bioinformatics data
PCT/EP2016/074307 WO2018068829A1 (en) 2016-10-11 2016-10-11 Method and apparatus for compact representation of bioinformatics data
PCT/EP2016/074297 WO2018068827A1 (en) 2016-10-11 2016-10-11 Efficient data structures for bioinformatics information representation
PCT/EP2016/074301 WO2018068828A1 (en) 2016-10-11 2016-10-11 Method and system for storing and accessing bioinformatics data
PCT/US2017/017842 WO2018071055A1 (en) 2016-10-11 2017-02-14 Method and apparatus for the compact representation of bioinformatics data
PCT/US2017/017841 WO2018071054A1 (en) 2016-10-11 2017-02-14 Method and system for selective access of stored or transmitted bioinformatics data
PCT/US2017/041579 WO2018071078A1 (en) 2016-10-11 2017-07-11 Method and apparatus for the access to bioinformatics data structured in access units

Publications (1)

Publication Number Publication Date
PE20191056A1 true PE20191056A1 (es) 2019-08-06

Family

ID=61905752

Family Applications (7)

Application Number Title Priority Date Filing Date
PE2019000804A PE20191058A1 (es) 2016-10-11 2017-02-14 Metodo y sistema para el acceso selectivo de datos bioinformaticos almacenados o transmitidos
PE2019000805A PE20191227A1 (es) 2016-10-11 2017-07-11 Metodo y sistemas para la representacion y procesamiento de datos de bioinformatica mediante el uso de secuencias de referencia
PE2019000803A PE20191057A1 (es) 2016-10-11 2017-07-11 Metodo y sistemas para la indexacion de datos bioinformaticos
PE2019000802A PE20191056A1 (es) 2016-10-11 2017-07-11 Metodo y aparato para el acceso a datos bioinformaticos estructurados en unidades de acceso
PE2019001667A PE20200323A1 (es) 2016-10-11 2017-12-14 Metodo y sistemas para la reconstruccion de secuencias genomicas de referencia a partir de lecturas de secuencias genomicas comprimidas
PE2019001669A PE20200226A1 (es) 2016-10-11 2017-12-15 Metodo y sistemas para la compresion eficiente de lecturas de secuencias genomicas
PE2019001668A PE20200227A1 (es) 2016-10-11 2018-02-14 Metodo y aparato para la presentacion compacta de datos de bioinformatica mediante el uso de multiples descriptores genomicos

Family Applications Before (3)

Application Number Title Priority Date Filing Date
PE2019000804A PE20191058A1 (es) 2016-10-11 2017-02-14 Metodo y sistema para el acceso selectivo de datos bioinformaticos almacenados o transmitidos
PE2019000805A PE20191227A1 (es) 2016-10-11 2017-07-11 Metodo y sistemas para la representacion y procesamiento de datos de bioinformatica mediante el uso de secuencias de referencia
PE2019000803A PE20191057A1 (es) 2016-10-11 2017-07-11 Metodo y sistemas para la indexacion de datos bioinformaticos

Family Applications After (3)

Application Number Title Priority Date Filing Date
PE2019001667A PE20200323A1 (es) 2016-10-11 2017-12-14 Metodo y sistemas para la reconstruccion de secuencias genomicas de referencia a partir de lecturas de secuencias genomicas comprimidas
PE2019001669A PE20200226A1 (es) 2016-10-11 2017-12-15 Metodo y sistemas para la compresion eficiente de lecturas de secuencias genomicas
PE2019001668A PE20200227A1 (es) 2016-10-11 2018-02-14 Metodo y aparato para la presentacion compacta de datos de bioinformatica mediante el uso de multiples descriptores genomicos

Country Status (17)

Country Link
US (6) US20200042735A1 (es)
EP (3) EP3526694A4 (es)
JP (4) JP2020505702A (es)
KR (4) KR20190073426A (es)
CN (6) CN110168651A (es)
AU (3) AU2017342688A1 (es)
BR (7) BR112019007359A2 (es)
CA (3) CA3040138A1 (es)
CL (6) CL2019000972A1 (es)
CO (6) CO2019003639A2 (es)
EA (2) EA201990916A1 (es)
IL (3) IL265879B2 (es)
MX (2) MX2019004130A (es)
PE (7) PE20191058A1 (es)
PH (6) PH12019550060A1 (es)
SG (3) SG11201903270RA (es)
WO (4) WO2018071054A1 (es)

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2526598B (en) * 2014-05-29 2018-11-28 Imagination Tech Ltd Allocation of primitives to primitive blocks
US11574287B2 (en) 2017-10-10 2023-02-07 Text IQ, Inc. Automatic document classification
US11030324B2 (en) * 2017-11-30 2021-06-08 Koninklijke Philips N.V. Proactive resistance to re-identification of genomic data
WO2019191083A1 (en) * 2018-03-26 2019-10-03 Colorado State University Research Foundation Apparatuses, systems and methods for generating and tracking molecular digital signatures to ensure authenticity and integrity of synthetic dna molecules
MX2020012672A (es) * 2018-05-31 2021-02-09 Koninklijke Philips Nv Sistema y metodo para interpretacion de alelos usando un genoma de referencia basado en graficos.
CN108753765B (zh) * 2018-06-08 2020-12-08 中国科学院遗传与发育生物学研究所 一种构建超长连续dna序列的基因组组装方法
US12210904B2 (en) * 2018-06-29 2025-01-28 International Business Machines Corporation Hybridized storage optimization for genomic workloads
US11474978B2 (en) * 2018-07-06 2022-10-18 Capital One Services, Llc Systems and methods for a data search engine based on data profiles
US12300358B2 (en) * 2018-08-20 2025-05-13 The Board Of Trustees Of The Leland Stanford Junior University Systems and methods for compressing genetic sequencing data and uses thereof
GB2585816A (en) * 2018-12-12 2021-01-27 Univ York Proof-of-work for blockchain applications
US20210074381A1 (en) * 2019-09-11 2021-03-11 Enancio Method for the compression of genome sequence data
WO2021063904A1 (en) * 2019-10-01 2021-04-08 Koninklijke Philips N.V. System and methods for the efficient identification and extraction of sequence paths in genome graphs
CN110797087B (zh) * 2019-10-17 2020-11-03 南京医基云医疗数据研究院有限公司 测序序列处理方法及装置、存储介质、电子设备
JP7631330B2 (ja) * 2019-10-18 2025-02-18 コーニンクレッカ フィリップス エヌ ヴェ 多様な表形式データの効果的な圧縮、表現、および展開のためのシステムおよび方法
US12322477B1 (en) * 2019-12-04 2025-06-03 John Hayward Methods of efficiently transforming and comparing recombinable DNA information
US12445148B2 (en) 2020-01-03 2025-10-14 Koninklijke Philips N.V. System and method for effective compression representation and decompression of diverse tabulated data
BR112022015328A2 (pt) * 2020-02-07 2022-09-27 Koninklijke Philips Nv Método e sistema para compressão de informações
CN111243663B (zh) * 2020-02-26 2022-06-07 西安交通大学 一种基于模式增长算法的基因变异检测方法
CN111370070B (zh) * 2020-02-27 2023-10-27 中国科学院计算技术研究所 一种针对大数据基因测序文件的压缩处理方法
US12014802B2 (en) 2020-03-17 2024-06-18 Western Digital Technologies, Inc. Devices and methods for locating a sample read in a reference genome
US12006539B2 (en) 2020-03-17 2024-06-11 Western Digital Technologies, Inc. Reference-guided genome sequencing
US11837330B2 (en) 2020-03-18 2023-12-05 Western Digital Technologies, Inc. Reference-guided genome sequencing
US12347528B2 (en) 2020-04-07 2025-07-01 Koninklijke Philips N.V. System and method for storing and transporting diverse genomic data
EP3896698A1 (en) * 2020-04-15 2021-10-20 Genomsys SA Method and system for the efficient data compression in mpeg-g
CN111459208A (zh) * 2020-04-17 2020-07-28 南京铁道职业技术学院 针对地铁供电系统电能的操纵系统及其方法
US12224042B2 (en) 2020-06-22 2025-02-11 SanDisk Technologies, Inc. Devices and methods for genome sequencing
US12093803B2 (en) * 2020-07-01 2024-09-17 International Business Machines Corporation Downsampling genomic sequence data
JP2023541341A (ja) * 2020-09-14 2023-10-02 イルミナ インコーポレイテッド 個別化医療のためのカスタムデータファイル
WO2022073810A1 (en) * 2020-10-06 2022-04-14 Koninklijke Philips N.V. Methods and systems for storing genomic data in a file structure comprising protection metadata
AU2021357587A1 (en) * 2020-10-06 2023-06-08 Koninklijke Philips N.V. Methods and systems for storing genomic data in a file structure comprising an information metadata structure
CN112836355B (zh) * 2021-01-14 2023-04-18 西安科技大学 一种预测采煤工作面顶板来压概率的方法
JP7118199B1 (ja) * 2021-03-26 2022-08-15 エヌ・ティ・ティ・コミュニケーションズ株式会社 処理システム、処理方法及び処理プログラム
US12406413B2 (en) 2021-05-10 2025-09-02 Optum Services (Ireland) Limited Predictive data analysis using image representations of genomic data
ES2930699A1 (es) * 2021-06-10 2022-12-20 Veritas Intercontinental S L Metodo de analisis genomico en una plataforma bioinformatica
CN113670643B (zh) * 2021-08-30 2023-05-12 四川虹美智能科技有限公司 智能空调测试方法及系统
CN113643761B (zh) * 2021-10-13 2022-01-18 苏州赛美科基因科技有限公司 一种用于解读二代测序结果所需数据的提取方法
US20230187020A1 (en) * 2021-12-15 2023-06-15 Illumina Software, Inc. Systems and methods for iterative and scalable population-scale variant analysis
JP2025512716A (ja) 2022-03-08 2025-04-22 イルミナ インコーポレイテッド マルチパスソフトウェアで加速されたゲノムリードマッピングエンジン
CN115458050B (zh) * 2022-08-05 2026-01-06 武汉大学 多基因发现网络构造方法、装置、设备及存储介质
CN115391284B (zh) * 2022-10-31 2023-02-03 四川大学华西医院 基因数据文件快速识别方法、系统和计算机可读存储介质
WO2024114597A1 (en) * 2022-12-02 2024-06-06 City University Of Hong Kong Reinforcement-learning-based network transmission of compressed genome sequence
CN116541348B (zh) * 2023-03-22 2023-09-26 河北热点科技股份有限公司 数据智能存储方法及终端查询一体机
CN116739646B (zh) * 2023-08-15 2023-11-24 南京易联阳光信息技术股份有限公司 网络交易大数据分析方法及分析系统
CN117153270B (zh) * 2023-10-30 2024-02-02 吉林华瑞基因科技有限公司 一种基因二代测序数据处理方法
CN117708755B (zh) * 2023-12-17 2024-06-21 重庆文理学院 基于生态环境的数据处理方法及装置
WO2025137316A1 (en) * 2023-12-21 2025-06-26 Illumina, Inc. Sequence data processing, retention, and recovery

Family Cites Families (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6303297B1 (en) * 1992-07-17 2001-10-16 Incyte Pharmaceuticals, Inc. Database for storage and analysis of full-length sequences
JP3429674B2 (ja) 1998-04-28 2003-07-22 沖電気工業株式会社 多重通信システム
EP1410301A4 (en) * 2000-04-12 2008-01-23 Cleveland Clinic Foundation SYSTEM FOR IDENTIFYING AND ANALYZING GENE EXPRESSION CONTAINING ELEMENTS RICH IN ADENYLATE URIDYLATE (ARE)
FR2820563B1 (fr) * 2001-02-02 2003-05-16 Expway Procede de compression/decompression d'un document structure
US20040153255A1 (en) * 2003-02-03 2004-08-05 Ahn Tae-Jin Apparatus and method for encoding DNA sequence, and computer readable medium
DE10320711A1 (de) * 2003-05-08 2004-12-16 Siemens Ag Verfahren und Anordnung zur Einrichtung und Aktualisierung einer Benutzeroberfläche zum Zugriff auf Informationsseiten in einem Datennetz
US8280640B2 (en) * 2003-08-11 2012-10-02 Eloret Corporation System and method for pattern recognition in sequential data
WO2005094363A2 (en) * 2004-03-30 2005-10-13 New York University System, method and software arrangement for bi-allele haplotype phasing
WO2006052242A1 (en) * 2004-11-08 2006-05-18 Seirad, Inc. Methods and systems for compressing and comparing genomic data
WO2007132461A2 (en) * 2006-05-11 2007-11-22 Ramot At Tel Aviv University Ltd. Classification of protein sequences and uses of classified proteins
SE531398C2 (sv) 2007-02-16 2009-03-24 Scalado Ab Generering av en dataström och identifiering av positioner inuti en dataström
KR101369745B1 (ko) * 2007-04-11 2014-03-07 삼성전자주식회사 비동기화된 비트스트림들의 다중화 및 역다중화 방법 및장치
US8832112B2 (en) * 2008-06-17 2014-09-09 International Business Machines Corporation Encoded matrix index
GB2477703A (en) * 2008-11-14 2011-08-10 Real Time Genomics Inc A method and system for analysing data sequences
US20100217532A1 (en) * 2009-02-25 2010-08-26 University Of Delaware Systems and methods for identifying structurally or functionally significant amino acid sequences
AU2010313247A1 (en) * 2009-10-30 2012-05-24 Synthetic Genomics, Inc. Encoding text into nucleic acid sequences
EP2362657B1 (en) * 2010-02-18 2013-04-24 Research In Motion Limited Parallel entropy coding and decoding methods and devices
WO2011143231A2 (en) * 2010-05-10 2011-11-17 The Broad Institute High throughput paired-end sequencing of large-insert clone libraries
KR101952965B1 (ko) * 2010-05-25 2019-02-27 더 리젠츠 오브 더 유니버시티 오브 캘리포니아 Bambam:고처리율 서열분석 데이터의 병렬 비교 분석
US20140229495A1 (en) * 2011-01-19 2014-08-14 Koninklijke Philips N.V. Method for processing genomic data
US9215162B2 (en) * 2011-03-09 2015-12-15 Annai Systems Inc. Biological data networks and methods therefor
WO2012168815A2 (en) * 2011-06-06 2012-12-13 Koninklijke Philips Electronics N.V. Method for assembly of nucleic acid sequence data
BR112013032332B1 (pt) * 2011-06-16 2022-08-16 Ge Video Compression, Llc Inicialização de contexto em codificação de entropia
US8707289B2 (en) * 2011-07-20 2014-04-22 Google Inc. Multiple application versions
CN104081772B (zh) * 2011-10-06 2018-04-10 弗劳恩霍夫应用研究促进协会 熵编码缓冲器配置
CA2854832C (en) * 2011-11-07 2023-05-23 Ingenuity Systems, Inc. Methods and systems for identification of causal genomic variants
KR101922129B1 (ko) * 2011-12-05 2018-11-26 삼성전자주식회사 차세대 시퀀싱을 이용하여 획득된 유전 정보를 압축 및 압축해제하는 방법 및 장치
US10140683B2 (en) * 2011-12-08 2018-11-27 Five3 Genomics, Llc Distributed system providing dynamic indexing and visualization of genomic data
EP2608096B1 (en) * 2011-12-24 2020-08-05 Tata Consultancy Services Ltd. Compression of genomic data file
US9600625B2 (en) * 2012-04-23 2017-03-21 Bina Technologies, Inc. Systems and methods for processing nucleic acid sequence data
CN103049680B (zh) * 2012-12-29 2016-09-07 深圳先进技术研究院 基因测序数据读取方法及系统
US9679104B2 (en) * 2013-01-17 2017-06-13 Edico Genome, Corp. Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform
WO2014145503A2 (en) * 2013-03-15 2014-09-18 Lieber Institute For Brain Development Sequence alignment using divide and conquer maximum oligonucleotide mapping (dcmom), apparatus, system and method related thereto
JP6054790B2 (ja) * 2013-03-28 2016-12-27 三菱スペース・ソフトウエア株式会社 遺伝子情報記憶装置、遺伝子情報検索装置、遺伝子情報記憶プログラム、遺伝子情報検索プログラム、遺伝子情報記憶方法、遺伝子情報検索方法及び遺伝子情報検索システム
GB2512829B (en) * 2013-04-05 2015-05-27 Canon Kk Method and apparatus for encoding or decoding an image with inter layer motion information prediction according to motion information compression scheme
WO2014186604A1 (en) * 2013-05-15 2014-11-20 Edico Genome Corp. Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform
KR101522087B1 (ko) * 2013-06-19 2015-05-28 삼성에스디에스 주식회사 미스매치를 고려한 염기 서열 정렬 시스템 및 방법
CN103336916B (zh) * 2013-07-05 2016-04-06 中国科学院数学与系统科学研究院 一种测序序列映射方法及系统
US20150032711A1 (en) * 2013-07-06 2015-01-29 Victor Kunin Methods for identification of organisms, assigning reads to organisms, and identification of genes in metagenomic sequences
KR101493982B1 (ko) * 2013-09-26 2015-02-23 대한민국 품종인식 코드화 시스템 및 이를 이용한 코드화 방법
CN104699998A (zh) * 2013-12-06 2015-06-10 国际商业机器公司 用于对基因组进行压缩和解压缩的方法和装置
US10902937B2 (en) * 2014-02-12 2021-01-26 International Business Machines Corporation Lossless compression of DNA sequences
US9639542B2 (en) * 2014-02-14 2017-05-02 Sap Se Dynamic mapping of extensible datasets to relational database schemas
US9886561B2 (en) * 2014-02-19 2018-02-06 The Regents Of The University Of California Efficient encoding and storage and retrieval of genomic data
US9354922B2 (en) 2014-04-02 2016-05-31 International Business Machines Corporation Metadata-driven workflows and integration with genomic data processing systems and techniques
US20150379195A1 (en) * 2014-06-25 2015-12-31 The Board Of Trustees Of The Leland Stanford Junior University Software haplotying of hla loci
GB2527588B (en) * 2014-06-27 2016-05-18 Gurulogic Microsystems Oy Encoder and decoder
US20160019339A1 (en) * 2014-07-06 2016-01-21 Mercator BioLogic Incorporated Bioinformatics tools, systems and methods for sequence assembly
US10230390B2 (en) * 2014-08-29 2019-03-12 Bonnie Berger Leighton Compressively-accelerated read mapping framework for next-generation sequencing
US10116632B2 (en) * 2014-09-12 2018-10-30 New York University System, method and computer-accessible medium for secure and compressed transmission of genomic data
US20160125130A1 (en) * 2014-11-05 2016-05-05 Agilent Technologies, Inc. Method for assigning target-enriched sequence reads to a genomic location
CN107851137A (zh) * 2015-06-16 2018-03-27 汉诺威戈特弗里德威廉莱布尼茨大学 用于压缩基因组数据的方法
CN105956417A (zh) * 2016-05-04 2016-09-21 西安电子科技大学 云环境下基于编辑距离的相似碱基序列查询方法
CN105975811B (zh) * 2016-05-09 2019-03-15 管仁初 一种智能比对的基因序列分析装置

Also Published As

Publication number Publication date
IL265928B (en) 2020-10-29
CL2019000973A1 (es) 2019-08-23
CN110114830A (zh) 2019-08-09
AU2017341684A1 (en) 2019-05-02
JP7079786B2 (ja) 2022-06-02
CN110678929B (zh) 2024-04-16
US11404143B2 (en) 2022-08-02
WO2018071080A3 (en) 2018-06-28
US20190214111A1 (en) 2019-07-11
EP3526707A2 (en) 2019-08-21
PH12019550060A1 (en) 2019-12-16
BR112019007360A2 (pt) 2019-07-09
PE20191227A1 (es) 2019-09-11
AU2017342688A1 (en) 2019-05-02
CL2019000972A1 (es) 2019-08-23
PH12019550057A1 (en) 2020-01-20
PH12019501879A1 (en) 2020-06-29
IL265928A (en) 2019-05-30
US20200051665A1 (en) 2020-02-13
MX2019004128A (es) 2019-08-21
CN110506272A (zh) 2019-11-26
AU2017341685A1 (en) 2019-05-02
JP2020500383A (ja) 2020-01-09
WO2018071080A2 (en) 2018-04-19
KR20190062541A (ko) 2019-06-05
BR112019016236A2 (pt) 2020-04-07
CO2019003639A2 (es) 2020-02-28
CL2019002275A1 (es) 2019-11-22
KR20190117652A (ko) 2019-10-16
EA201990917A1 (ru) 2019-08-30
PE20191057A1 (es) 2019-08-06
CN110603595B (zh) 2023-08-08
WO2018071054A1 (en) 2018-04-19
CO2019003595A2 (es) 2019-08-30
JP2020505702A (ja) 2020-02-20
EP3526707A4 (en) 2020-06-17
CO2019009920A2 (es) 2020-01-17
CN110121577A (zh) 2019-08-13
CL2019002277A1 (es) 2019-11-22
CN110114830B (zh) 2023-10-13
SG11201903272XA (en) 2019-05-30
JP2019537172A (ja) 2019-12-19
PH12019550058A1 (en) 2019-12-16
WO2018071079A1 (en) 2018-04-19
IL265879B2 (en) 2024-01-01
IL265879A (en) 2019-06-30
US20200051667A1 (en) 2020-02-13
CL2019002276A1 (es) 2019-11-29
PE20200323A1 (es) 2020-02-13
BR112019007363A2 (pt) 2019-07-09
EP3526657A4 (en) 2020-07-01
IL265879B1 (en) 2023-09-01
CO2019009922A2 (es) 2020-01-17
CO2019003638A2 (es) 2019-08-30
BR112019007359A2 (pt) 2019-07-16
CA3040138A1 (en) 2018-04-19
EP3526694A4 (en) 2020-08-12
PE20200226A1 (es) 2020-01-29
PE20200227A1 (es) 2020-01-29
MX2019004130A (es) 2020-01-30
JP2020500382A (ja) 2020-01-09
CN110168651A (zh) 2019-08-23
PH12019501881A1 (en) 2020-06-29
EA201990916A1 (ru) 2019-10-31
BR112019016230A2 (pt) 2020-04-07
EP3526657A1 (en) 2019-08-21
KR20190073426A (ko) 2019-06-26
BR112019007357A2 (pt) 2019-07-16
US20200035328A1 (en) 2020-01-30
IL265972A (en) 2019-06-30
CO2019003842A2 (es) 2019-08-30
WO2018071055A1 (en) 2018-04-19
CN110121577B (zh) 2023-09-19
PH12019550059A1 (en) 2019-12-16
US20200042735A1 (en) 2020-02-06
SG11201903270RA (en) 2019-05-30
PE20191058A1 (es) 2019-08-06
CA3040147A1 (en) 2018-04-19
CL2019000968A1 (es) 2019-08-23
KR20190069469A (ko) 2019-06-19
SG11201903271UA (en) 2019-05-30
BR112019016232A2 (pt) 2020-04-07
US20190385702A1 (en) 2019-12-19
CN110506272B (zh) 2023-08-01
CA3040145A1 (en) 2018-04-19
CN110678929A (zh) 2020-01-10
CN110603595A (zh) 2019-12-20
EP3526694A1 (en) 2019-08-21

Similar Documents

Publication Publication Date Title
PE20191056A1 (es) Metodo y aparato para el acceso a datos bioinformaticos estructurados en unidades de acceso
MY204138A (en) Three-dimensional data encoding method, three- dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device
MX2024009494A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo de codificacion de datos tridimensionales y dispositivo de decodificacion de datos tridimensionales.
CO2017009676A2 (es) Derivación de información de movimiento a subbloques en la codificación de video
MX2018000651A (es) Sistemas y metodos para dividir indices de busqueda para una mayor eficiencia en la identificacion de segmentos de medios.
MX2020012935A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo codificador de datos tridimensionales y dispositivo decodificador de datos tridimensionales.
MX2021010964A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo codificador de datos tridimensionales y dispositivo decodificador de datos tridimensionales.
MX2021003384A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo de codificacion de datos tridimensionales y dispositivo de decodificacion de datos tridimensionales.
JP2020500383A5 (es)
CL2017000821A1 (es) Capas de señalizacion para codificación escalable de datos de audio ambisónicos de orden superior
MX2024009279A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo codificador de datos tridimensionales y dispositivo decodificador de datos tridimensionales.
MX2019004131A (es) Metodo y aparato para el acceso a datos bioinformaticos estructurados en unidades de acceso.
BR112013031624A8 (pt) Método de decodificação de imagem, método de codificação de imagem, dispositivo de decodificação de imagem, dispositivo de codificação de imagem, e dispositivo de codificação/decodificação de imagem
MX2021006574A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo codificador de datos tridimensionales y dispositivo decodificador de datos tridimensionales.
PH12019500791A1 (en) Efficient data structures for bioinformatics information presentation
MX2024010444A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo de codificacion de datos tridimensionales y dispositivo de decodificacion de datos tridimensionales.
MX2023011745A (es) Metodo y dispositivo de codificacion, y metodo y dispositivo de decodificacion.
MX2019009680A (es) Metodo y aparato para la representacion compacta de datos de bioinformatica mediante el uso de multiples descriptores genomicos.
CO2019003587A2 (es) Método y aparato para representación compacta de datos bioinformáticos
BR112017021865A2 (pt) método e dispositivos para a codificação de múltiplos sinais de áudio, e método e dispositivo para a decodificação de múltiplos sinais de áudio contendo separação aperfeiçoada
MX2019007609A (es) Tecnologias para escalar grupos de terminales de parte de servidor de interfaz de usuario para aplicaciones vinculadas a bases de datos.
MX2024006694A (es) Metodo de codificacion de datos tridimensionales, metodo de decodificacion de datos tridimensionales, dispositivo de codificacion de datos tridimensionales y dispositivo de decodificacion de datos tridimensionales.
MX2017010995A (es) Aparato de intercalado de paridad para codificar informacion de señalizacion con longitud fija, y metodo de intercalado de paridad que utiliza el mismo.
SG11202000106WA (en) Method and device for comparing media features
AR110436A1 (es) Método de codificación de vídeo, método de decodificación de vídeo, dispositivo de codificación de vídeo y dispositivo de decodificación de vídeo