[go: up one dir, main page]

MX2022001419A - Clusterización de segmentos emparejados para determinar el enlace del conjunto de datos en una base de datos. - Google Patents

Clusterización de segmentos emparejados para determinar el enlace del conjunto de datos en una base de datos.

Info

Publication number
MX2022001419A
MX2022001419A MX2022001419A MX2022001419A MX2022001419A MX 2022001419 A MX2022001419 A MX 2022001419A MX 2022001419 A MX2022001419 A MX 2022001419A MX 2022001419 A MX2022001419 A MX 2022001419A MX 2022001419 A MX2022001419 A MX 2022001419A
Authority
MX
Mexico
Prior art keywords
individual data
data sets
parent
data set
cluster
Prior art date
Application number
MX2022001419A
Other languages
English (en)
Inventor
Keith D Noto
Thi Hong Luong Nguyen
Jingwen Pei
Harendra Guturu
Original Assignee
Ancestry Com Dna Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ancestry Com Dna Llc filed Critical Ancestry Com Dna Llc
Publication of MX2022001419A publication Critical patent/MX2022001419A/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/20ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/40ICT specially adapted for the handling or processing of patient-related medical or healthcare data for data related to laboratory analysis, e.g. patient specimen analysis
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Primary Health Care (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Bioethics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Collating Specific Patterns (AREA)

Abstract

Un método implementado por computadora para enlazar conjuntos de datos de individuos en una base de datos puede incluir recibir un conjunto de datos de individuo objetivo de un individuo objetivo y una pluralidad de conjuntos de datos de individuos adicionales. Un servidor de computación puede generar una pluralidad de pares de sub-clúster de primeros grupos parentales y segundos grupos parentales. Al menos uno de pares de sub-clúster incluye un primer grupo parental de segmentos emparejados y un segundo grupo parental de segmentos emparejados. Un servidor de computación puede vincular los primeros grupos parentales y los segundos grupos parentales a través de la pluralidad de pares de sub-clúster para generar al menos un súper-clúster de un lado parental. Un servidor de computación puede asignar metadatos a uno o más conjuntos de datos de individuos adicionales de la pluralidad de conjuntos de datos de individuos adicionales. Los metadatos pueden especificar que uno o más conjuntos de datos de individuos adicionales están conectados al conjunto de datos de individuo objetivo mediante el lado parental del súper-clúster.
MX2022001419A 2019-08-02 2020-07-23 Clusterización de segmentos emparejados para determinar el enlace del conjunto de datos en una base de datos. MX2022001419A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201962882188P 2019-08-02 2019-08-02
PCT/IB2020/056937 WO2021024074A1 (en) 2019-08-02 2020-07-23 Clustering of matched segments to determine linkage of dataset in a database

Publications (1)

Publication Number Publication Date
MX2022001419A true MX2022001419A (es) 2022-06-08

Family

ID=74259440

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2022001419A MX2022001419A (es) 2019-08-02 2020-07-23 Clusterización de segmentos emparejados para determinar el enlace del conjunto de datos en una base de datos.

Country Status (6)

Country Link
US (1) US20210034647A1 (es)
EP (2) EP4401081B1 (es)
AU (1) AU2020326389B2 (es)
CA (1) CA3149354C (es)
MX (1) MX2022001419A (es)
WO (1) WO2021024074A1 (es)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016061568A1 (en) * 2014-10-17 2016-04-21 Ancestry.Com Dna, Llc Haplotype phasing models
US12050629B1 (en) * 2019-08-02 2024-07-30 Ancestry.Com Dna, Llc Determining data inheritance of data segments
AU2022308670A1 (en) * 2021-07-07 2024-01-25 Mars, Incorporated System, method, and apparatus for predicting genetic ancestry
US12045219B2 (en) * 2021-11-24 2024-07-23 Ancestry.Com Dna, Llc Scoring method for matches based on age probability
CN116628187B (zh) * 2022-02-10 2025-02-07 腾讯科技(深圳)有限公司 一种文本分类方法、装置、电子设备和存储介质
US20250032199A1 (en) 2022-02-10 2025-01-30 Digital Surgery Systems, Inc. Microphone directionality control based on surgeon's command
WO2023175516A1 (en) 2022-03-15 2023-09-21 Ancestry.Com Operations Inc. Machine-learning based automated document integration into genealogical trees
US12461970B2 (en) * 2022-08-19 2025-11-04 Ancestry.Com Dna, Llc Catalog-based data inheritance determination
US12353674B2 (en) * 2023-01-24 2025-07-08 Ancestry.Com Operations Inc. Artificial reality family history experience
WO2025049155A1 (en) 2023-08-25 2025-03-06 Ancestry.Com Dna, Llc Determining data inheritance of genomic data segments
US20250386803A1 (en) * 2024-06-24 2025-12-25 Ancestry.Com Dna, Llc Systems and methods for pairing domestic companion animals
US20260017284A1 (en) 2024-07-09 2026-01-15 Ancestry.Com Dna, Llc Determining labels of inheritance datasets using simulated data instances
CN119673275A (zh) * 2024-11-29 2025-03-21 中国农业大学 一种家禽大规模群体分子系谱自动化构建方法

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8700334B2 (en) * 2006-07-31 2014-04-15 International Business Machines Corporation Methods and systems for reconstructing genomic common ancestors
US9330418B2 (en) * 2012-09-14 2016-05-03 Ancestry.Com Operations Inc. System and method for creating a family tree data structure
US9977708B1 (en) * 2012-11-08 2018-05-22 23Andme, Inc. Error correction in ancestry classification
CA2942106C (en) * 2013-04-17 2021-06-29 Andrew Ka-Ching Wong Aligning and clustering sequence patterns to reveal classificatory functionality of sequences
DK3207481T3 (da) * 2014-10-14 2020-02-03 Ancestry Com Dna Llc Reduktion af fejl ved forudsigelse af genetiske slægtskab
WO2016061568A1 (en) * 2014-10-17 2016-04-21 Ancestry.Com Dna, Llc Haplotype phasing models
US20170213127A1 (en) * 2016-01-24 2017-07-27 Matthew Charles Duncan Method and System for Discovering Ancestors using Genomic and Genealogic Data
US10347365B2 (en) * 2017-02-08 2019-07-09 10X Genomics, Inc. Systems and methods for visualizing a pattern in a dataset

Also Published As

Publication number Publication date
EP4401081A2 (en) 2024-07-17
EP4401081B1 (en) 2025-09-03
WO2021024074A1 (en) 2021-02-11
EP4008007A4 (en) 2023-04-26
EP4401081C0 (en) 2025-09-03
EP4008007C0 (en) 2024-05-15
EP4008007A1 (en) 2022-06-08
CA3149354A1 (en) 2021-02-11
AU2020326389B2 (en) 2023-10-05
CA3149354C (en) 2025-03-25
EP4008007B1 (en) 2024-05-15
US20210034647A1 (en) 2021-02-04
AU2020326389A1 (en) 2022-03-24
EP4401081A3 (en) 2024-07-31

Similar Documents

Publication Publication Date Title
MX2022001419A (es) Clusterización de segmentos emparejados para determinar el enlace del conjunto de datos en una base de datos.
MX2020014293A (es) Generación de metadatos de secuenciación basada en inteligencia artificial.
CL2018000982A1 (es) Procedimiento y dispositivo para identificar autentificando mediante la fusión de múltiples características biológicas
SV2018005732A (es) Sistema y metodo para la verificacion de la autenticidad de la informacion de documentos
MX2017008086A (es) Deteccion de objetos con red neuronal.
BR112017008380A2 (pt) módulo, sistema e dispositivo de planejamento de produção, e, método de planejamento de fabricação de um produto intermediário ou produto final.
CL2019000972A1 (es) Método y sistemas para la representación y procesamiento de datos de bioinformática mediante el uso de secuencias de referencia.
BR112016020457A8 (pt) Método de busca por impressões digitais de áudio armazenadas em um banco de dados dentro de um sistema de detecção de impressões digitais de áudio
MX2020006803A (es) Estrategias de descodificacion para identificacion de proteinas.
BR112019006689A2 (pt) métodos e sistemas para análise de dados de cromatografia
BR112014011056A2 (pt) sistema e método de usar subconjuntos de dados espacialmente independentes para determinar a incerteza de não influenciamento de dados incertos de distribuições de propriedade de dados de reservatório espacialmente correlacionados
BR112019003144A2 (pt) sistema e método de classificação de partículas biológicas
BR112017000635A2 (pt) sistema e método de remoção de ruído para dados de detecção acústica distribuída.
MX2015000588A (es) Analiticas de series temporales.
MX389247B (es) Sistema y metodo para la verificacion de direccion automatizada
BR112014010471A2 (pt) método de uso de subconjuntos de dados espacialmente independentes para calcular incerteza de curva com tendência vertical de dados do reservatório espacialmente correlacionados
BR112015015904A2 (pt) renderização de linguagem natural de consultas de busca estruturadas
JP2017188137A5 (es)
MX344125B (es) Modificacion de consultas de busqueda estructuradas en redes sociales en linea.
BR112019004964A2 (pt) gerador de caso de teste construído em editor de fluxo de trabalho de integração de dados
MX2021000543A (es) Sistema y metodo para la resolucion de entidades genealogicas.
CL2025003119A1 (es) Decodificación especulativa en modelos de inteligencia artificial generativa autorregresiva.
MX2019008257A (es) Metodo y sistema para la deteccion automotizada de criterios de inclusion o exclusion.
BR112018003254A2 (pt) sistemas e métodos para atribuir restrições de grupo em um layout de circuito integrado
BR112014022060A2 (pt) Método, e sistema de computação