MX2015008119A - Etiquetado de video o audio para deteccion de orador activo. - Google Patents
Etiquetado de video o audio para deteccion de orador activo.Info
- Publication number
- MX2015008119A MX2015008119A MX2015008119A MX2015008119A MX2015008119A MX 2015008119 A MX2015008119 A MX 2015008119A MX 2015008119 A MX2015008119 A MX 2015008119A MX 2015008119 A MX2015008119 A MX 2015008119A MX 2015008119 A MX2015008119 A MX 2015008119A
- Authority
- MX
- Mexico
- Prior art keywords
- video
- signal
- camera
- tag
- audio
- Prior art date
Links
- 238000001514 detection method Methods 0.000 title 1
- 230000005236 sound signal Effects 0.000 abstract 4
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
- H04N7/155—Conference systems involving storage of or access to video conference sessions
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Closed-Circuit Television Systems (AREA)
- Telephonic Communication Services (AREA)
- Burglar Alarm Systems (AREA)
Abstract
Se describe un sistema de videoconferencia que está configurado para seleccionar un orador activo mientras se evita seleccionar erróneamente un micrófono o cámara que está captando audio o video de una señal remota conectada. Se hace una determinación si una señal de audio está por encima de un nivel de umbral. Si es así, entonces se hace una determinación en cuanto a si una etiqueta está presente en esa señal de audio. Si es así, se ignora esa señal. Si no es así, una cámara es dirigida hacia la fuente de sonido identificada por la señal de audio. Se hace una determinación si una etiqueta está presente la señal de video desde esa cámara. Si es así, la cámara es redirigida. Si no es así, se inserta una etiqueta(s) local en la señal de audio y/o la señal de video. La señal(s) etiquetada es transmitida. De esa forma, el sistema ignorará el sonido o video que tiene una etiqueta incorporada de otro sistema de videoconferencia.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/719,314 US9065971B2 (en) | 2012-12-19 | 2012-12-19 | Video and audio tagging for active speaker detection |
| PCT/US2013/076671 WO2014100466A2 (en) | 2012-12-19 | 2013-12-19 | Video and audio tagging for active speaker detection |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| MX2015008119A true MX2015008119A (es) | 2016-04-25 |
| MX352445B MX352445B (es) | 2017-11-24 |
Family
ID=49943568
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| MX2015008119A MX352445B (es) | 2012-12-19 | 2013-12-19 | Etiquetado de video o audio para deteccion de orador activo. |
Country Status (11)
| Country | Link |
|---|---|
| US (1) | US9065971B2 (es) |
| EP (1) | EP2912841B1 (es) |
| JP (1) | JP6321033B2 (es) |
| KR (1) | KR102110632B1 (es) |
| CN (1) | CN104937926B (es) |
| AU (1) | AU2013361258B2 (es) |
| BR (1) | BR112015011758B1 (es) |
| CA (1) | CA2889706C (es) |
| MX (1) | MX352445B (es) |
| RU (1) | RU2632469C2 (es) |
| WO (1) | WO2014100466A2 (es) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9065971B2 (en) * | 2012-12-19 | 2015-06-23 | Microsoft Technology Licensing, Llc | Video and audio tagging for active speaker detection |
| US20150281832A1 (en) * | 2014-03-28 | 2015-10-01 | Panasonic Intellectual Property Management Co., Ltd. | Sound processing apparatus, sound processing system and sound processing method |
| US9681097B1 (en) | 2016-01-20 | 2017-06-13 | Global Tel*Link Corporation | Secure video visitation system |
| US10296994B2 (en) | 2016-02-11 | 2019-05-21 | Global Tel*Link Corporation | System and method for visitation management in a controlled environment |
| US9558523B1 (en) | 2016-03-23 | 2017-01-31 | Global Tel* Link Corp. | Secure nonscheduled video visitation system |
| US10311219B2 (en) * | 2016-06-07 | 2019-06-04 | Vocalzoom Systems Ltd. | Device, system, and method of user authentication utilizing an optical microphone |
| JP6520878B2 (ja) * | 2016-09-21 | 2019-05-29 | トヨタ自動車株式会社 | 音声取得システムおよび音声取得方法 |
| KR102717784B1 (ko) | 2017-02-14 | 2024-10-16 | 한국전자통신연구원 | 스테레오 오디오 신호에 대한 태그 삽입 장치 및 태그 삽입 방법, 그리고, 태그 추출 장치 및 태그 추출 방법 |
| US11282537B2 (en) | 2017-06-09 | 2022-03-22 | International Business Machines Corporation | Active speaker detection in electronic meetings for providing video from one device to plurality of other devices |
| KR102827290B1 (ko) * | 2022-01-13 | 2025-06-27 | 최종성 | 사용자 추적이 가능한 ai 거치대 |
Family Cites Families (41)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5099319A (en) * | 1989-10-23 | 1992-03-24 | Esch Arthur G | Video information delivery method and apparatus |
| US5689641A (en) | 1993-10-01 | 1997-11-18 | Vicor, Inc. | Multimedia collaboration system arrangement for routing compressed AV signal through a participant site without decompressing the AV signal |
| AUPP392498A0 (en) * | 1998-06-04 | 1998-07-02 | Innes Corporation Pty Ltd | Traffic verification system |
| US7081915B1 (en) | 1998-06-17 | 2006-07-25 | Intel Corporation | Control of video conferencing using activity detection |
| US7062039B1 (en) * | 1999-05-27 | 2006-06-13 | Telefonaktiebolaget Lm Ericsson | Methods and apparatus for improving adaptive filter performance by inclusion of inaudible information |
| US6594629B1 (en) * | 1999-08-06 | 2003-07-15 | International Business Machines Corporation | Methods and apparatus for audio-visual speech detection and recognition |
| JP2002223422A (ja) * | 2001-01-29 | 2002-08-09 | Nec Corp | 多地点テレビ会議制御装置およびビデオパケット送信方法 |
| US7161939B2 (en) * | 2001-06-29 | 2007-01-09 | Ip Unity | Method and system for switching among independent packetized audio streams |
| KR100552468B1 (ko) * | 2001-07-19 | 2006-02-15 | 삼성전자주식회사 | 음성인식에 따른 오동작을 방지 및 음성인식율을 향상 할수 있는 전자기기 및 방법 |
| US6749512B2 (en) * | 2002-03-15 | 2004-06-15 | Macgregor Brian | Computer network implemented gaming system and method of using same |
| EP1443498B1 (en) * | 2003-01-24 | 2008-03-19 | Sony Ericsson Mobile Communications AB | Noise reduction and audio-visual speech activity detection |
| GB2404297B (en) * | 2003-07-24 | 2007-12-05 | Hewlett Packard Development Co | Editing multiple camera outputs |
| JP4414708B2 (ja) * | 2003-09-19 | 2010-02-10 | 株式会社リコー | 動画表示用パーソナルコンピュータ、データ表示システム、動画表示方法、動画表示プログラムおよび記録媒体 |
| US7379875B2 (en) * | 2003-10-24 | 2008-05-27 | Microsoft Corporation | Systems and methods for generating audio thumbnails |
| US20050138674A1 (en) * | 2003-12-17 | 2005-06-23 | Quadrock Communications, Inc | System and method for integration and synchronization of interactive content with television content |
| US7563168B2 (en) * | 2004-02-13 | 2009-07-21 | Texas Instruments Incorporated | Audio effect rendering based on graphic polygons |
| GB2415639B (en) * | 2004-06-29 | 2008-09-17 | Sony Comp Entertainment Europe | Control of data processing |
| US7304585B2 (en) * | 2004-07-02 | 2007-12-04 | Nokia Corporation | Initiation of actions with compressed action language representations |
| US20060147063A1 (en) | 2004-12-22 | 2006-07-06 | Broadcom Corporation | Echo cancellation in telephones with multiple microphones |
| US7450752B2 (en) * | 2005-04-07 | 2008-11-11 | Hewlett-Packard Development Company, L.P. | System and method for automatic detection of the end of a video stream |
| US9300790B2 (en) * | 2005-06-24 | 2016-03-29 | Securus Technologies, Inc. | Multi-party conversation analyzer and logger |
| CN100596061C (zh) * | 2006-01-12 | 2010-03-24 | 大连理工大学 | 一种基于盲源分离的小波域数字音频多目的水印方法 |
| CA2544459A1 (en) * | 2006-04-21 | 2007-10-21 | Evertz Microsystems Ltd. | Systems and methods for synchronizing audio and video data signals |
| US8087044B2 (en) * | 2006-09-18 | 2011-12-27 | Rgb Networks, Inc. | Methods, apparatus, and systems for managing the insertion of overlay content into a video signal |
| US7688889B2 (en) * | 2006-09-18 | 2010-03-30 | Rgb Networks, Inc. | Methods, apparatus, and systems for insertion of overlay content into a video signal with transrating capabilities |
| US20080136623A1 (en) * | 2006-12-06 | 2008-06-12 | Russell Calvarese | Audio trigger for mobile devices |
| EP2119233B1 (en) * | 2007-02-20 | 2012-05-16 | ST-Ericsson SA | Mobile video conference terminal with face recognition |
| US8385233B2 (en) * | 2007-06-12 | 2013-02-26 | Microsoft Corporation | Active speaker identification |
| US8300080B2 (en) | 2007-06-29 | 2012-10-30 | Microsoft Corporation | Techniques for detecting a display device |
| US20090210789A1 (en) * | 2008-02-14 | 2009-08-20 | Microsoft Corporation | Techniques to generate a visual composition for a multimedia conference event |
| FR2952263B1 (fr) * | 2009-10-29 | 2012-01-06 | Univ Paris Descartes | Procede et dispositif d'annulation d'echo acoustique par tatouage audio |
| US8713593B2 (en) * | 2010-03-01 | 2014-04-29 | Zazum, Inc. | Detection system and method for mobile device application |
| US20110214143A1 (en) * | 2010-03-01 | 2011-09-01 | Rits Susan K | Mobile device application |
| US8635066B2 (en) * | 2010-04-14 | 2014-01-21 | T-Mobile Usa, Inc. | Camera-assisted noise cancellation and speech recognition |
| US8468012B2 (en) * | 2010-05-26 | 2013-06-18 | Google Inc. | Acoustic model adaptation using geographic information |
| US8723914B2 (en) | 2010-11-19 | 2014-05-13 | Cisco Technology, Inc. | System and method for providing enhanced video processing in a network environment |
| US8589167B2 (en) * | 2011-05-11 | 2013-11-19 | Nuance Communications, Inc. | Speaker liveness detection |
| US20120321062A1 (en) * | 2011-06-17 | 2012-12-20 | Fitzsimmons Jeffrey E | Telephonic Conference Access System |
| CN102368816A (zh) * | 2011-12-01 | 2012-03-07 | 中科芯集成电路股份有限公司 | 一种视频会议智能前端系统 |
| US8886011B2 (en) * | 2012-12-07 | 2014-11-11 | Cisco Technology, Inc. | System and method for question detection based video segmentation, search and collaboration in a video processing environment |
| US9065971B2 (en) * | 2012-12-19 | 2015-06-23 | Microsoft Technology Licensing, Llc | Video and audio tagging for active speaker detection |
-
2012
- 2012-12-19 US US13/719,314 patent/US9065971B2/en active Active
-
2013
- 2013-12-19 EP EP13818933.7A patent/EP2912841B1/en active Active
- 2013-12-19 AU AU2013361258A patent/AU2013361258B2/en not_active Ceased
- 2013-12-19 WO PCT/US2013/076671 patent/WO2014100466A2/en not_active Ceased
- 2013-12-19 JP JP2015549731A patent/JP6321033B2/ja active Active
- 2013-12-19 BR BR112015011758-9A patent/BR112015011758B1/pt active IP Right Grant
- 2013-12-19 KR KR1020157016315A patent/KR102110632B1/ko active Active
- 2013-12-19 CA CA2889706A patent/CA2889706C/en active Active
- 2013-12-19 RU RU2015123696A patent/RU2632469C2/ru active
- 2013-12-19 MX MX2015008119A patent/MX352445B/es active IP Right Grant
- 2013-12-19 CN CN201380066894.8A patent/CN104937926B/zh active Active
Also Published As
| Publication number | Publication date |
|---|---|
| BR112015011758B1 (pt) | 2023-04-18 |
| KR20150096419A (ko) | 2015-08-24 |
| MX352445B (es) | 2017-11-24 |
| CA2889706C (en) | 2020-04-28 |
| RU2632469C2 (ru) | 2017-10-05 |
| RU2015123696A (ru) | 2017-01-10 |
| WO2014100466A3 (en) | 2014-08-07 |
| CA2889706A1 (en) | 2014-06-26 |
| AU2013361258B2 (en) | 2017-03-09 |
| EP2912841B1 (en) | 2020-10-28 |
| WO2014100466A2 (en) | 2014-06-26 |
| US20140168352A1 (en) | 2014-06-19 |
| US9065971B2 (en) | 2015-06-23 |
| KR102110632B1 (ko) | 2020-05-13 |
| EP2912841A2 (en) | 2015-09-02 |
| CN104937926A (zh) | 2015-09-23 |
| AU2013361258A1 (en) | 2015-05-14 |
| JP6321033B2 (ja) | 2018-05-09 |
| BR112015011758A2 (pt) | 2017-07-11 |
| CN104937926B (zh) | 2018-05-25 |
| JP2016506670A (ja) | 2016-03-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| MX352445B (es) | Etiquetado de video o audio para deteccion de orador activo. | |
| EP4485461A3 (en) | Systems and methods for providing a slow motion video stream concurrently with a normal-speed video stream upon detection of an event | |
| GB2482087A (en) | Separation alarm | |
| EP4236332A3 (en) | Techniques and apparatus for editing video | |
| IN2015MN01766A (es) | ||
| IN2014CH00810A (es) | ||
| WO2018100233A3 (en) | Distributed audio capture and mixing controlling | |
| MX2014003104A (es) | Dispositivo de recepcion, metodo de recepcion programa, y sistema de procesamiento de informacion. | |
| JP2016506670A5 (es) | ||
| IN2014CN03446A (es) | ||
| WO2013101460A3 (en) | Clustering-based object classification | |
| MX366249B (es) | Deteccion de conversacion. | |
| GB2462567A (en) | Data processing apparatus | |
| WO2008080673A3 (en) | Audio detection using distributed mobile computing | |
| MX2015012443A (es) | Sistemas y metodos para detectar un atributo de documento utilizando acustica. | |
| GB2569741A (en) | Guardian system in a network to improve situational awareness of a crowd at an incident | |
| WO2008078736A1 (ja) | 同一性判定装置、同一性判定方法および同一性判定用プログラム | |
| WO2017027397A3 (en) | Event detection for playback management in an audio device | |
| EP4329302A3 (en) | Systems and methods for hybrid video encoding | |
| MY172793A (en) | Picture decoding device, picture decoding method and picture decoding program | |
| TW201612549A (en) | Apparatus, system and method for space status detection based on an acoustic signal | |
| MX2016000469A (es) | Metodo y aparato de notificacion por voz. | |
| MX2015006441A (es) | Metodo y dispositivo para difundir datos de medios de flujo continuo. | |
| EP4362459A3 (en) | A method for decoding video | |
| MX2015008811A (es) | Metodo y dispositivo de monitoreo por video. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FG | Grant or registration |