MX2019002701A - Busqueda de similitud utilizando codigos polisemicos. - Google Patents
Busqueda de similitud utilizando codigos polisemicos.Info
- Publication number
- MX2019002701A MX2019002701A MX2019002701A MX2019002701A MX2019002701A MX 2019002701 A MX2019002701 A MX 2019002701A MX 2019002701 A MX2019002701 A MX 2019002701A MX 2019002701 A MX2019002701 A MX 2019002701A MX 2019002701 A MX2019002701 A MX 2019002701A
- Authority
- MX
- Mexico
- Prior art keywords
- query
- polysemic
- hamming distance
- vector
- quantizer
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G06Q10/40—
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Business, Economics & Management (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Software Systems (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Development Economics (AREA)
- Game Theory and Decision Science (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Operations Research (AREA)
- Medical Informatics (AREA)
- Entrepreneurship & Innovation (AREA)
- Quality & Reliability (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
En una modalidad, un método incluye recibir una consulta, donde la consulta se representa por un vector n-dimensional en un espacio vectorial n-dimensional; cuantizar el vector que representa la consulta utilizando un cuantizador, donde el vector cuantizado corresponde a un código polisémico, y donde el cuantizador se ha entrenado por aprendizaje automático para determinar los códigos polisémicos de manera que la distancia de Hamming se aproxime a la distancia entre centroides utilizando una función objetivo; calcular, para cada uno de una pluralidad de objetos de contenido, una distancia de Hamming entre el código polisémico correspondiente al vector que representa la consulta y un código polisémico correspondiente a un vector cuantizado que representa el objeto de contenido; y determinar que un objeto de contenido de la pluralidad de objetos de contenido es un vecino más cercano aproximado a la consulta con base en determinar que la distancia de Hamming calculada es menor que una cantidad umbral.
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201662384421P | 2016-09-07 | 2016-09-07 | |
| US15/393,926 US20180068023A1 (en) | 2016-09-07 | 2016-12-29 | Similarity Search Using Polysemous Codes |
| PCT/US2017/050211 WO2018048853A1 (en) | 2016-09-07 | 2017-09-06 | Similarity search using polysemous codes |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MX2019002701A true MX2019002701A (es) | 2019-06-06 |
Family
ID=61280896
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| MX2019002701A MX2019002701A (es) | 2016-09-07 | 2017-09-06 | Busqueda de similitud utilizando codigos polisemicos. |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US20180068023A1 (es) |
| JP (1) | JP2019532445A (es) |
| KR (1) | KR20190043604A (es) |
| CN (1) | CN109906451A (es) |
| AU (1) | AU2017324850A1 (es) |
| BR (1) | BR112019004335A2 (es) |
| CA (1) | CA3034323A1 (es) |
| MX (1) | MX2019002701A (es) |
| WO (1) | WO2018048853A1 (es) |
Families Citing this family (40)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11347751B2 (en) * | 2016-12-07 | 2022-05-31 | MyFitnessPal, Inc. | System and method for associating user-entered text to database entries |
| US10817774B2 (en) * | 2016-12-30 | 2020-10-27 | Facebook, Inc. | Systems and methods for providing content |
| US10489468B2 (en) * | 2017-08-22 | 2019-11-26 | Facebook, Inc. | Similarity search using progressive inner products and bounds |
| US10191921B1 (en) * | 2018-04-03 | 2019-01-29 | Sas Institute Inc. | System for expanding image search using attributes and associations |
| US10824592B2 (en) * | 2018-06-14 | 2020-11-03 | Microsoft Technology Licensing, Llc | Database management using hyperloglog sketches |
| US12164510B2 (en) * | 2018-07-11 | 2024-12-10 | Home Depot Product Authority, Llc | Presentation of related and corrected queries for a search engine |
| CN109635084B (zh) * | 2018-11-30 | 2020-11-24 | 宁波深擎信息科技有限公司 | 一种多源数据文档实时快速去重方法及系统 |
| CN109740660A (zh) * | 2018-12-27 | 2019-05-10 | 深圳云天励飞技术有限公司 | 图像处理方法及装置 |
| CN109992716B (zh) * | 2019-03-29 | 2023-01-17 | 电子科技大学 | 一种基于itq算法的印尼语相似新闻推荐方法 |
| US10990424B2 (en) * | 2019-05-07 | 2021-04-27 | Bank Of America Corporation | Computer architecture for emulating a node in conjunction with stimulus conditions in a correlithm object processing system |
| KR102276728B1 (ko) * | 2019-06-18 | 2021-07-13 | 빅펄 주식회사 | 멀티모달 콘텐츠 분석 시스템 및 그 방법 |
| CN112446483B (zh) * | 2019-08-30 | 2024-04-23 | 阿里巴巴集团控股有限公司 | 一种基于机器学习的计算方法和计算单元 |
| CN112445943B (zh) * | 2019-09-05 | 2025-03-14 | 阿里巴巴集团控股有限公司 | 数据处理的方法、装置和系统 |
| US11494734B2 (en) * | 2019-09-11 | 2022-11-08 | Ila Design Group Llc | Automatically determining inventory items that meet selection criteria in a high-dimensionality inventory dataset |
| KR102448061B1 (ko) | 2019-12-11 | 2022-09-27 | 네이버 주식회사 | 딥러닝 기반의 문서 유사도 측정 모델을 이용한 중복 문서 탐지 방법 및 시스템 |
| KR102432600B1 (ko) | 2019-12-17 | 2022-08-16 | 네이버 주식회사 | 벡터 양자화를 이용한 중복 문서 탐지 방법 및 시스템 |
| US11354293B2 (en) | 2020-01-28 | 2022-06-07 | Here Global B.V. | Method and apparatus for indexing multi-dimensional records based upon similarity of the records |
| CN111522975B (zh) * | 2020-03-10 | 2022-04-08 | 浙江工业大学 | 等价连续变化的二值离散优化的非线性哈希图像检索方法 |
| US11645292B2 (en) * | 2020-03-17 | 2023-05-09 | Gsi Technology Inc. | Efficient similarity search |
| US20210321165A1 (en) * | 2020-04-09 | 2021-10-14 | Rovi Guides, Inc. | Methods and systems for generating and presenting content recommendations for new users |
| CN112487256B (zh) * | 2020-12-10 | 2024-05-24 | 中国移动通信集团江苏有限公司 | 对象查询方法、装置、设备及存储介质 |
| KR102491915B1 (ko) * | 2021-03-19 | 2023-01-26 | (주)데이터코리아 | 변호사 스마트 매칭 시스템 |
| CN113032427B (zh) * | 2021-04-12 | 2023-12-08 | 中国人民大学 | 一种用于cpu和gpu平台的向量化查询处理方法 |
| US11860876B1 (en) * | 2021-05-05 | 2024-01-02 | Change Healthcare Holdings, Llc | Systems and methods for integrating datasets |
| CN113177130B (zh) * | 2021-06-09 | 2022-04-08 | 山东科技大学 | 基于二值语义嵌入的图像检索和识别方法和装置 |
| US11886445B2 (en) * | 2021-06-29 | 2024-01-30 | United States Of America As Represented By The Secretary Of The Army | Classification engineering using regional locality-sensitive hashing (LSH) searches |
| CN114329006B (zh) * | 2021-09-24 | 2024-08-09 | 腾讯科技(深圳)有限公司 | 图像检索方法、装置、设备、计算机可读存储介质 |
| CN113821622B (zh) * | 2021-09-29 | 2023-09-15 | 平安银行股份有限公司 | 基于人工智能的答案检索方法、装置、电子设备及介质 |
| CN116051917B (zh) * | 2021-10-28 | 2024-10-18 | 腾讯科技(深圳)有限公司 | 一种训练图像量化模型的方法、检索图像的方法及装置 |
| KR102772554B1 (ko) * | 2021-12-28 | 2025-02-24 | 성균관대학교산학협력단 | 역색인 구조 및 벡터 양자화의 협력적 최적화 장치 및 방법 |
| US12183056B2 (en) | 2022-01-11 | 2024-12-31 | Adobe Inc. | Adversarially robust visual fingerprinting and image provenance models |
| US12314347B2 (en) * | 2022-03-24 | 2025-05-27 | Microsoft Technology Licensing, Llc | Method and system of retrieving multimodal assets |
| US12242491B2 (en) | 2022-04-08 | 2025-03-04 | Microsoft Technology Licensing, Llc | Method and system of retrieving assets from personalized asset libraries |
| US12505565B2 (en) | 2022-05-27 | 2025-12-23 | Adobe Inc. | Identifying and localizing editorial changes to images utilizing deep learning |
| CN115169489B (zh) * | 2022-07-25 | 2023-06-09 | 北京百度网讯科技有限公司 | 数据检索方法、装置、设备以及存储介质 |
| US12081827B2 (en) * | 2022-08-26 | 2024-09-03 | Adobe Inc. | Determining video provenance utilizing deep learning |
| US12326867B2 (en) | 2023-01-23 | 2025-06-10 | Microsoft Technology Licensing, Llc | Method and system of using domain specific knowledge in retrieving multimodal assets |
| CN116610840B (zh) * | 2023-05-19 | 2026-01-23 | 山东云海国创云计算装备产业创新中心有限公司 | 一种相似数据搜索方法、系统及电子设备 |
| US12373209B1 (en) * | 2024-01-30 | 2025-07-29 | Zilliz Inc. | Vector dataset index parameter determination |
| US12373451B1 (en) * | 2024-01-30 | 2025-07-29 | Zilliz Inc. | Vector dataset index parameter determination |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8429173B1 (en) * | 2009-04-20 | 2013-04-23 | Google Inc. | Method, system, and computer readable medium for identifying result images based on an image query |
| US8761512B1 (en) * | 2009-12-03 | 2014-06-24 | Google Inc. | Query by image |
| US8239364B2 (en) * | 2009-12-08 | 2012-08-07 | Facebook, Inc. | Search and retrieval of objects in a social networking system |
| WO2012121728A1 (en) * | 2011-03-10 | 2012-09-13 | Textwise Llc | Method and system for unified information representation and applications thereof |
| US9054876B1 (en) * | 2011-11-04 | 2015-06-09 | Google Inc. | Fast efficient vocabulary computation with hashed vocabularies applying hash functions to cluster centroids that determines most frequently used cluster centroid IDs |
| JP2013206187A (ja) * | 2012-03-28 | 2013-10-07 | Fujitsu Ltd | 情報変換装置、情報検索装置、情報変換方法、情報検索方法、情報変換プログラム、情報検索プログラム |
| JP5563016B2 (ja) * | 2012-05-30 | 2014-07-30 | 株式会社デンソーアイティーラボラトリ | 情報検索装置、情報検索方法及びプログラム |
| US8935271B2 (en) * | 2012-12-21 | 2015-01-13 | Facebook, Inc. | Extract operator |
| US20150169644A1 (en) * | 2013-01-03 | 2015-06-18 | Google Inc. | Shape-Gain Sketches for Fast Image Similarity Search |
| US9336312B2 (en) * | 2013-04-08 | 2016-05-10 | Facebook, Inc. | Vertical-based query optionalizing |
| IL226219A (en) * | 2013-05-07 | 2016-10-31 | Picscout (Israel) Ltd | Efficient comparison of images for large groups of images |
| JP6208898B2 (ja) * | 2014-02-10 | 2017-10-04 | ジーニー ゲゼルシャフト ミット ベシュレンクテル ハフツング | 画像特徴式認識のためのシステムおよび方法 |
| CN104123375B (zh) * | 2014-07-28 | 2018-01-23 | 清华大学 | 数据搜索方法及系统 |
| US9754037B2 (en) * | 2014-08-27 | 2017-09-05 | Facebook, Inc. | Blending by query classification on online social networks |
-
2016
- 2016-12-29 US US15/393,926 patent/US20180068023A1/en not_active Abandoned
-
2017
- 2017-09-06 AU AU2017324850A patent/AU2017324850A1/en not_active Abandoned
- 2017-09-06 BR BR112019004335-7A patent/BR112019004335A2/pt not_active Application Discontinuation
- 2017-09-06 CA CA3034323A patent/CA3034323A1/en not_active Abandoned
- 2017-09-06 KR KR1020197009570A patent/KR20190043604A/ko not_active Ceased
- 2017-09-06 CN CN201780066910.1A patent/CN109906451A/zh active Pending
- 2017-09-06 JP JP2019533301A patent/JP2019532445A/ja active Pending
- 2017-09-06 MX MX2019002701A patent/MX2019002701A/es unknown
- 2017-09-06 WO PCT/US2017/050211 patent/WO2018048853A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| WO2018048853A1 (en) | 2018-03-15 |
| AU2017324850A1 (en) | 2019-04-18 |
| JP2019532445A (ja) | 2019-11-07 |
| CA3034323A1 (en) | 2018-03-15 |
| CN109906451A (zh) | 2019-06-18 |
| KR20190043604A (ko) | 2019-04-26 |
| BR112019004335A2 (pt) | 2019-05-28 |
| US20180068023A1 (en) | 2018-03-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| MX2019002701A (es) | Busqueda de similitud utilizando codigos polisemicos. | |
| CO2017009675A2 (es) | Derivación del vector de movimiento en la codificación de video | |
| CL2019000968A1 (es) | Método y sistema para el acceso selectivo de datos bioinformáticos almacenados o transmitidos. | |
| CO2018013672A2 (es) | Representación relacional de objetos holográficos | |
| MX2018008104A (es) | Identificacion de entidades utilizando un modelo de aprendizaje profundo. | |
| MX2017011793A (es) | Deteccion de segmentos de un programa de video. | |
| BR112016014226A2 (pt) | Sistemas, métodos e aparelho para codificar formações de objeto | |
| BR112016022268A2 (pt) | Treinamento, reconhecimento e geração em uma rede de extrema convicção de pico (dbn) | |
| MX2023005933A (es) | Metodo de codificacion/decodificacion de imagenes y dispositivo para el mismo. | |
| BR112017016159A2 (pt) | contextos para unidades de árvore de codificação grandes | |
| MX2014002541A (es) | Dispositivo de codificacion, dispositivo de decodificacion, metodo de codificacion, y metodo de decodificacion. | |
| IN2014MU00919A (es) | ||
| SG11201900261QA (en) | Method and system of mining information, electronic device and readable storage medium | |
| BR112016015988A2 (pt) | Suporte de camada base não-hevc em extensões de multi-camada hevc | |
| MX2016005489A (es) | Metodo y aparato para determinar similitud y terminal. | |
| AR106791A1 (es) | Sistemas y métodos de rastreo de equipo de emplazamiento de pozo | |
| CL2020003251A1 (es) | Estimación de probabilidad eficientemente ponderada para codificación aritmética binaria | |
| CL2023001903A1 (es) | Estimación de movimiento global usando etiquetas de objetos de carretera y suelo para compresión de nube de puntos basada en geometría. | |
| MX2018006677A (es) | Busqueda de configuracion de cuantificador de vector de la piramide. | |
| MY174218A (en) | Search processing method and device | |
| BR112016023955A2 (pt) | sistema e método para cálculo do parâmetro de lagrange para compressão do fluxo de exibição (dsc) | |
| MX2019003648A (es) | Metodo para evaluar la conformidad de un sistema de rastreo con un grupo de requerimentos y dispositivos asociados. | |
| BR112017014399A2 (pt) | aparelhos, métodos e sistemas de processamento de cubo de criptografia de múltiplas partes | |
| ES2722109T3 (es) | Oligómeros polinucleotídicos de timina modificados, y métodos | |
| MX373039B (es) | Transmisor y metodo de segmentacion del mismo. |