
MX2019002701A - Similarity search using polysemous codes - Google Patents

Similarity search using polysemous codes

Info

Publication number
MX2019002701A
Authority
MX
Mexico
Prior art keywords
query
polysemic
hamming distance
vector
quantizer
Prior art date
Application number
MX2019002701A
Other languages
English (en)
Inventor
Douze Matthys
Jegou Hervé
Perronnin Florent
Original Assignee
Facebook Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Facebook Inc
Publication of MX2019002701A

Links

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9536 Search customisation based on social or collaborative filtering
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/04 Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06Q10/40

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Software Systems (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Operations Research (AREA)
  • Medical Informatics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

In one embodiment, a method includes receiving a query, where the query is represented by an n-dimensional vector in an n-dimensional vector space; quantizing the vector representing the query using a quantizer, where the quantized vector corresponds to a polysemous code, and where the quantizer has been trained by machine learning to determine polysemous codes such that the Hamming distance approximates the inter-centroid distance using an objective function; calculating, for each of a plurality of content objects, a Hamming distance between the polysemous code corresponding to the vector representing the query and a polysemous code corresponding to a quantized vector representing the content object; and determining that a content object of the plurality of content objects is an approximate nearest neighbor to the query based on determining that the calculated Hamming distance is less than a threshold amount.
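The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the toy 2-bit codes, and the hand-assigned centroid labels are all assumptions made for illustration. In the abstract, the assignment of codes to centroids is learned with an objective function so that Hamming distance tracks inter-centroid distance; here a Gray-code-like labeling of four 2-D centroids stands in for that learned assignment.

```python
from typing import Dict, List, Sequence


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two integer-encoded codes differ."""
    return bin(a ^ b).count("1")


def quantize(vector: Sequence[float], centroids: Dict[int, Sequence[float]]) -> int:
    """Return the code of the centroid nearest to `vector` (squared Euclidean)."""
    def sq_dist(centroid: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(vector, centroid))

    return min(centroids, key=lambda code: sq_dist(centroids[code]))


def approximate_neighbors(
    query_code: int, object_codes: Dict[str, int], threshold: int
) -> List[str]:
    """Keep objects whose code is strictly closer than `threshold` in Hamming distance."""
    return [
        obj_id
        for obj_id, code in object_codes.items()
        if hamming_distance(query_code, code) < threshold
    ]


# Hypothetical codebook: codes are hand-assigned so that adjacent centroids
# differ in one bit. This stands in for a learned assignment.
CENTROIDS = {
    0b00: (0.0, 0.0),
    0b01: (1.0, 0.0),
    0b11: (1.0, 1.0),
    0b10: (0.0, 1.0),
}

# Hypothetical content objects, quantized once at indexing time.
OBJECT_VECTORS = {"a": (1.1, -0.1), "b": (0.1, 0.9), "c": (0.9, 1.2)}
OBJECT_CODES = {k: quantize(v, CENTROIDS) for k, v in OBJECT_VECTORS.items()}

query_code = quantize((0.9, 0.1), CENTROIDS)
candidates = approximate_neighbors(query_code, OBJECT_CODES, threshold=2)
```

With a threshold of 2, the objects whose codes differ from the query code in at most one bit are kept as approximate nearest neighbors. Because the comparison is an XOR followed by a bit count, it is cheap enough to run over every stored code before any more expensive distance estimate.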
MX2019002701A 2016-09-07 2017-09-06 Similarity search using polysemous codes MX2019002701A (es)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201662384421P 2016-09-07 2016-09-07
US15/393,926 US20180068023A1 (en) 2016-09-07 2016-12-29 Similarity Search Using Polysemous Codes
PCT/US2017/050211 WO2018048853A1 (en) 2016-09-07 2017-09-06 Similarity search using polysemous codes

Publications (1)

Publication Number Publication Date
MX2019002701A (es) 2019-06-06

Family

ID=61280896

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2019002701A MX2019002701A (es) 2016-09-07 2017-09-06 Similarity search using polysemous codes

Country Status (9)

Country Link
US (1) US20180068023A1 (es)
JP (1) JP2019532445A (es)
KR (1) KR20190043604A (es)
CN (1) CN109906451A (es)
AU (1) AU2017324850A1 (es)
BR (1) BR112019004335A2 (es)
CA (1) CA3034323A1 (es)
MX (1) MX2019002701A (es)
WO (1) WO2018048853A1 (es)

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11347751B2 (en) * 2016-12-07 2022-05-31 MyFitnessPal, Inc. System and method for associating user-entered text to database entries
US10817774B2 (en) * 2016-12-30 2020-10-27 Facebook, Inc. Systems and methods for providing content
US10489468B2 (en) * 2017-08-22 2019-11-26 Facebook, Inc. Similarity search using progressive inner products and bounds
US10191921B1 (en) * 2018-04-03 2019-01-29 Sas Institute Inc. System for expanding image search using attributes and associations
US10824592B2 (en) * 2018-06-14 2020-11-03 Microsoft Technology Licensing, Llc Database management using hyperloglog sketches
US12164510B2 (en) * 2018-07-11 2024-12-10 Home Depot Product Authority, Llc Presentation of related and corrected queries for a search engine
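CN109635084B (zh) * 2018-11-30 2020-11-24 宁波深擎信息科技有限公司 Method and system for real-time rapid deduplication of multi-source data documents
CN109740660A (zh) * 2018-12-27 2019-05-10 深圳云天励飞技术有限公司 Image processing method and device
CN109992716B (zh) * 2019-03-29 2023-01-17 电子科技大学 Indonesian similar-news recommendation method based on the ITQ algorithm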
US10990424B2 (en) * 2019-05-07 2021-04-27 Bank Of America Corporation Computer architecture for emulating a node in conjunction with stimulus conditions in a correlithm object processing system
KR102276728B1 (ko) * 2019-06-18 2021-07-13 빅펄 주식회사 Multimodal content analysis system and method thereof
CN112446483B (zh) * 2019-08-30 2024-04-23 阿里巴巴集团控股有限公司 Machine-learning-based computing method and computing unit
CN112445943B (zh) * 2019-09-05 2025-03-14 阿里巴巴集团控股有限公司 Data processing method, device, and system
US11494734B2 (en) * 2019-09-11 2022-11-08 Ila Design Group Llc Automatically determining inventory items that meet selection criteria in a high-dimensionality inventory dataset
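KR102448061B1 (ko) 2019-12-11 2022-09-27 네이버 주식회사 Method and system for detecting duplicate documents using a deep-learning-based document similarity measurement model
KR102432600B1 (ko) 2019-12-17 2022-08-16 네이버 주식회사 Method and system for detecting duplicate documents using vector quantization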
US11354293B2 (en) 2020-01-28 2022-06-07 Here Global B.V. Method and apparatus for indexing multi-dimensional records based upon similarity of the records
CN111522975B (zh) * 2020-03-10 2022-04-08 浙江工业大学 Nonlinear hashing image retrieval method using binary discrete optimization with equivalent continuous variation
US11645292B2 (en) * 2020-03-17 2023-05-09 Gsi Technology Inc. Efficient similarity search
US20210321165A1 (en) * 2020-04-09 2021-10-14 Rovi Guides, Inc. Methods and systems for generating and presenting content recommendations for new users
CN112487256B (zh) * 2020-12-10 2024-05-24 中国移动通信集团江苏有限公司 Object query method, apparatus, device, and storage medium
KR102491915B1 (ko) * 2021-03-19 2023-01-26 (주)데이터코리아 Lawyer smart matching system
CN113032427B (zh) * 2021-04-12 2023-12-08 中国人民大学 Vectorized query processing method for CPU and GPU platforms
US11860876B1 (en) * 2021-05-05 2024-01-02 Change Healthcare Holdings, Llc Systems and methods for integrating datasets
CN113177130B (zh) * 2021-06-09 2022-04-08 山东科技大学 Image retrieval and recognition method and device based on binary semantic embedding
US11886445B2 (en) * 2021-06-29 2024-01-30 United States Of America As Represented By The Secretary Of The Army Classification engineering using regional locality-sensitive hashing (LSH) searches
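CN114329006B (zh) * 2021-09-24 2024-08-09 腾讯科技(深圳)有限公司 Image retrieval method, apparatus, device, and computer-readable storage medium
CN113821622B (zh) * 2021-09-29 2023-09-15 平安银行股份有限公司 Artificial-intelligence-based answer retrieval method, apparatus, electronic device, and medium
CN116051917B (zh) * 2021-10-28 2024-10-18 腾讯科技(深圳)有限公司 Method for training an image quantization model, and method and device for retrieving images
KR102772554B1 (ko) * 2021-12-28 2025-02-24 성균관대학교산학협력단 Apparatus and method for cooperative optimization of inverted index structure and vector quantization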
US12183056B2 (en) 2022-01-11 2024-12-31 Adobe Inc. Adversarially robust visual fingerprinting and image provenance models
US12314347B2 (en) * 2022-03-24 2025-05-27 Microsoft Technology Licensing, Llc Method and system of retrieving multimodal assets
US12242491B2 (en) 2022-04-08 2025-03-04 Microsoft Technology Licensing, Llc Method and system of retrieving assets from personalized asset libraries
US12505565B2 (en) 2022-05-27 2025-12-23 Adobe Inc. Identifying and localizing editorial changes to images utilizing deep learning
CN115169489B (zh) * 2022-07-25 2023-06-09 北京百度网讯科技有限公司 Data retrieval method, apparatus, device, and storage medium
US12081827B2 (en) * 2022-08-26 2024-09-03 Adobe Inc. Determining video provenance utilizing deep learning
US12326867B2 (en) 2023-01-23 2025-06-10 Microsoft Technology Licensing, Llc Method and system of using domain specific knowledge in retrieving multimodal assets
CN116610840B (zh) * 2023-05-19 2026-01-23 山东云海国创云计算装备产业创新中心有限公司 Similar data search method, system, and electronic device
US12373209B1 (en) * 2024-01-30 2025-07-29 Zilliz Inc. Vector dataset index parameter determination
US12373451B1 (en) * 2024-01-30 2025-07-29 Zilliz Inc. Vector dataset index parameter determination

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8429173B1 (en) * 2009-04-20 2013-04-23 Google Inc. Method, system, and computer readable medium for identifying result images based on an image query
US8761512B1 (en) * 2009-12-03 2014-06-24 Google Inc. Query by image
US8239364B2 (en) * 2009-12-08 2012-08-07 Facebook, Inc. Search and retrieval of objects in a social networking system
WO2012121728A1 (en) * 2011-03-10 2012-09-13 Textwise Llc Method and system for unified information representation and applications thereof
US9054876B1 (en) * 2011-11-04 2015-06-09 Google Inc. Fast efficient vocabulary computation with hashed vocabularies applying hash functions to cluster centroids that determines most frequently used cluster centroid IDs
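JP2013206187A (ja) * 2012-03-28 2013-10-07 Fujitsu Ltd Information conversion device, information search device, information conversion method, information search method, information conversion program, and information search program
JP5563016B2 (ja) * 2012-05-30 2014-07-30 株式会社デンソーアイティーラボラトリ Information search device, information search method, and program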
US8935271B2 (en) * 2012-12-21 2015-01-13 Facebook, Inc. Extract operator
US20150169644A1 (en) * 2013-01-03 2015-06-18 Google Inc. Shape-Gain Sketches for Fast Image Similarity Search
US9336312B2 (en) * 2013-04-08 2016-05-10 Facebook, Inc. Vertical-based query optionalizing
IL226219A (en) * 2013-05-07 2016-10-31 Picscout (Israel) Ltd Efficient comparison of images for large groups of images
JP6208898B2 (ja) * 2014-02-10 2017-10-04 ジーニー ゲゼルシャフト ミット ベシュレンクテル ハフツング Systems and methods for image-feature-based recognition
CN104123375B (zh) * 2014-07-28 2018-01-23 清华大学 Data search method and system
US9754037B2 (en) * 2014-08-27 2017-09-05 Facebook, Inc. Blending by query classification on online social networks

Also Published As

Publication number Publication date
WO2018048853A1 (en) 2018-03-15
AU2017324850A1 (en) 2019-04-18
JP2019532445A (ja) 2019-11-07
CA3034323A1 (en) 2018-03-15
CN109906451A (zh) 2019-06-18
KR20190043604A (ko) 2019-04-26
BR112019004335A2 (pt) 2019-05-28
US20180068023A1 (en) 2018-03-08

Similar Documents

Publication Publication Date Title
MX2019002701A (es) Similarity search using polysemous codes
CO2017009675A2 (es) Motion vector derivation in video coding
CL2019000968A1 (es) Method and system for selective access to stored or transmitted bioinformatics data
CO2018013672A2 (es) Relational representation of holographic objects
MX2018008104A (es) Identifying entities using a deep learning model
MX2017011793A (es) Detecting segments of a video program
BR112016014226A2 (pt) Systems, methods, and apparatus for encoding object formations
BR112016022268A2 (pt) Training, recognition, and generation in a spiking deep belief network (DBN)
MX2023005933A (es) Image encoding/decoding method and device for the same
BR112017016159A2 (pt) Contexts for large coding tree units
MX2014002541A (es) Encoding device, decoding device, encoding method, and decoding method
IN2014MU00919A (es)
SG11201900261QA (en) Method and system of mining information, electronic device and readable storage medium
BR112016015988A2 (pt) Non-HEVC base layer support in HEVC multi-layer extensions
MX2016005489A (es) Method and apparatus for determining similarity, and terminal
AR106791A1 (es) Wellsite equipment tracking systems and methods
CL2020003251A1 (es) Efficiently weighted probability estimation for binary arithmetic coding
CL2023001903A1 (es) Global motion estimation using road and ground object labels for geometry-based point cloud compression
MX2018006677A (es) Pyramid vector quantizer shape search
MY174218A (en) Search processing method and device
BR112016023955A2 (pt) System and method for Lagrangian parameter calculation for display stream compression (DSC)
MX2019003648A (es) Method for evaluating the conformity of a tracking system with a set of requirements, and associated devices
BR112017014399A2 (pt) Multi-part encryption cube processing apparatuses, methods, and systems
ES2722109T3 (es) Modified thymine polynucleotide oligomers, and methods
MX373039B (es) Transmitter and segmentation method thereof