[go: up one dir, main page]

BRPI0812128A2 - Identificação ativa de locutor - Google Patents

Identificação ativa de locutor

Info

Publication number
BRPI0812128A2
BRPI0812128A2 BRPI0812128-1A2A BRPI0812128A BRPI0812128A2 BR PI0812128 A2 BRPI0812128 A2 BR PI0812128A2 BR PI0812128 A BRPI0812128 A BR PI0812128A BR PI0812128 A2 BRPI0812128 A2 BR PI0812128A2
Authority
BR
Brazil
Prior art keywords
speaker identification
active speaker
active
identification
speaker
Prior art date
Application number
BRPI0812128-1A2A
Other languages
English (en)
Inventor
Regis J Crinon
Humayun M Khan
Dalibor Kukoleca
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of BRPI0812128A2 publication Critical patent/BRPI0812128A2/pt

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • H04L65/4038Arrangements for multi-party communication, e.g. for conferences with floor control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1069Session establishment or de-establishment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/65Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
    • H04M3/569Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants using the instant speaker's algorithm
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres
    • H04M7/006Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/50Aspects of automatic or semi-automatic exchanges related to audio conference
    • H04M2203/5072Multiple active speakers

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
BRPI0812128-1A2A 2007-06-12 2008-05-30 Identificação ativa de locutor BRPI0812128A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/761,963 US8385233B2 (en) 2007-06-12 2007-06-12 Active speaker identification
PCT/US2008/065441 WO2008157005A1 (en) 2007-06-12 2008-05-30 Active speaker identification

Publications (1)

Publication Number Publication Date
BRPI0812128A2 true BRPI0812128A2 (pt) 2014-11-18

Family

ID=40133145

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0812128-1A2A BRPI0812128A2 (pt) 2007-06-12 2008-05-30 Identificação ativa de locutor

Country Status (8)

Country Link
US (3) US8385233B2 (pt)
EP (1) EP2163035B1 (pt)
JP (1) JP5579598B2 (pt)
KR (1) KR101486607B1 (pt)
CN (1) CN101689998A (pt)
BR (1) BRPI0812128A2 (pt)
RU (1) RU2483452C2 (pt)
WO (1) WO2008157005A1 (pt)

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8385233B2 (en) 2007-06-12 2013-02-26 Microsoft Corporation Active speaker identification
US7782802B2 (en) * 2007-12-26 2010-08-24 Microsoft Corporation Optimizing conferencing performance
US8325800B2 (en) 2008-05-07 2012-12-04 Microsoft Corporation Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers
US8379851B2 (en) 2008-05-12 2013-02-19 Microsoft Corporation Optimized client side rate control and indexed file layout for streaming media
US7925774B2 (en) 2008-05-30 2011-04-12 Microsoft Corporation Media streaming using an index file
US8265140B2 (en) 2008-09-30 2012-09-11 Microsoft Corporation Fine-grained client-side control of scalable media delivery
WO2010045869A1 (zh) * 2008-10-20 2010-04-29 华为终端有限公司 一种3d音频信号处理的方法、系统和装置
US8587634B1 (en) * 2008-12-12 2013-11-19 Cisco Technology, Inc. System and method for intelligent mode switching in a communications environment
CN102594776B (zh) * 2011-01-11 2016-08-03 中兴通讯股份有限公司 一种同步源标识更新的方法、装置和系统
CN103533294B (zh) * 2012-07-03 2017-06-20 中国移动通信集团公司 视频数据流的发送方法、终端及系统
CN104469255A (zh) 2013-09-16 2015-03-25 杜比实验室特许公司 改进的音频或视频会议
US20140114664A1 (en) * 2012-10-20 2014-04-24 Microsoft Corporation Active Participant History in a Video Conferencing System
US9210269B2 (en) * 2012-10-31 2015-12-08 Cisco Technology, Inc. Active speaker indicator for conference participants
US9065971B2 (en) * 2012-12-19 2015-06-23 Microsoft Technology Licensing, Llc Video and audio tagging for active speaker detection
US10348778B2 (en) * 2013-02-08 2019-07-09 Avaya Inc. Dynamic device pairing with media server audio substitution
CN104079870B (zh) * 2013-03-29 2017-07-11 杭州海康威视数字技术股份有限公司 单路视频多路音频的视频监控方法及系统
JP6534926B2 (ja) * 2013-06-10 2019-06-26 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America 話者識別方法、話者識別装置及び話者識別システム
CN105100698A (zh) * 2014-05-23 2015-11-25 中兴通讯股份有限公司 一种幼儿园视频监控方法及装置
US9257120B1 (en) 2014-07-18 2016-02-09 Google Inc. Speaker verification using co-location information
CN105376515B (zh) * 2014-09-02 2019-03-19 华为技术有限公司 用于视频通讯的通讯信息的呈现方法、装置及系统
US9318107B1 (en) 2014-10-09 2016-04-19 Google Inc. Hotword detection on multiple devices
US9812128B2 (en) 2014-10-09 2017-11-07 Google Inc. Device leadership negotiation among voice interface devices
US9424841B2 (en) 2014-10-09 2016-08-23 Google Inc. Hotword detection on multiple devices
US9704488B2 (en) * 2015-03-20 2017-07-11 Microsoft Technology Licensing, Llc Communicating metadata that identifies a current speaker
US9325853B1 (en) * 2015-09-24 2016-04-26 Atlassian Pty Ltd Equalization of silence audio levels in packet media conferencing systems
US11291235B2 (en) 2016-01-19 2022-04-05 Firmenich Sa Phloretin
US9779735B2 (en) 2016-02-24 2017-10-03 Google Inc. Methods and systems for detecting and processing speech signals
CN106648722B (zh) * 2016-05-10 2020-01-10 深圳前海信息技术有限公司 基于大数据的Flume接收端数据处理方法和装置
WO2018015425A1 (en) * 2016-07-19 2018-01-25 Schneider Electric Industries Sas Time-sensitive software defined networking
US9972320B2 (en) 2016-08-24 2018-05-15 Google Llc Hotword detection on multiple devices
KR102241970B1 (ko) 2016-11-07 2021-04-20 구글 엘엘씨 기록된 미디어 핫워드 트리거 억제
US10559309B2 (en) 2016-12-22 2020-02-11 Google Llc Collaborative voice controlled devices
US10522137B2 (en) 2017-04-20 2019-12-31 Google Llc Multi-user authentication on a device
US10170119B2 (en) * 2017-05-18 2019-01-01 International Business Machines Corporation Identifying speaker roles in a streaming environment
US10395650B2 (en) 2017-06-05 2019-08-27 Google Llc Recorded media hotword trigger suppression
US10708320B2 (en) * 2017-06-27 2020-07-07 Atlassian Pty Ltd Selective internal forwarding in conferences with distributed media servers
EP3503092A1 (en) * 2017-12-21 2019-06-26 Thomson Licensing Method for establishing a link between a device and a speaker in a gateway, corresponding computer program computer and apparatus
US10595083B2 (en) 2018-04-20 2020-03-17 The Nielsen Company (Us), Llc Methods and apparatus to determine audio source impact on an audience of media
US10692496B2 (en) 2018-05-22 2020-06-23 Google Llc Hotword suppression
US11258840B2 (en) 2018-12-20 2022-02-22 Cisco Technology, Inc Realtime communication architecture over hybrid ICN and realtime information centric transport protocol
KR102188537B1 (ko) * 2019-08-14 2020-12-08 라인플러스 주식회사 유니캐스트 및 멀티캐스트를 이용한 그룹 통화 방법 및 시스템
US10841357B1 (en) * 2019-09-12 2020-11-17 Dialpad, Inc. Using transport layer protocol packet headers to encode application layer attributes in an audiovisual over internet protocol (AVoIP) platform
CN111049792B (zh) * 2019-10-08 2022-03-22 广州视源电子科技股份有限公司 音频传输方法、装置、终端设备和存储介质
CN111245851B (zh) * 2020-01-13 2021-12-03 广州视源电子科技股份有限公司 多终端音频传输方法、装置、终端设备和存储介质
US10880315B1 (en) * 2020-02-28 2020-12-29 Cisco Technology, Inc. Active speaker naming and request in ICN-based real-time communication systems
US11863592B2 (en) * 2021-05-14 2024-01-02 Cisco Technology, Inc. Active speaker tracking using a global naming scheme
CN113271432B (zh) * 2021-06-30 2022-11-18 北京二六三企业通信有限公司 发送和接收说话者列表的方法及装置
US11546398B1 (en) * 2022-03-09 2023-01-03 Cisco Technology, Inc. Real-time transport (RTC) with low latency and high scalability
CN121040094A (zh) * 2023-03-29 2025-11-28 三星电子株式会社 用于管理关键任务服务中的同步源冲突的方法和装置

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4658398A (en) 1984-10-29 1987-04-14 Gte Laboratories Incorporated Framed digital voice summing for teleconferencing
US4658396A (en) * 1985-03-11 1987-04-14 Barden Robert A Redundancy arrangement for a local area network
US5317567A (en) * 1991-09-12 1994-05-31 The United States Of America As Represented By The Secretary Of The Air Force Multi-speaker conferencing over narrowband channels
US6453022B1 (en) * 1998-12-31 2002-09-17 At&T Corporation Multi-line telephone with input/output mixing and audio control
US6728221B1 (en) * 1999-04-09 2004-04-27 Siemens Information & Communication Networks, Inc. Method and apparatus for efficiently utilizing conference bridge capacity
US7006616B1 (en) 1999-05-21 2006-02-28 Terayon Communication Systems, Inc. Teleconferencing bridge with EdgePoint mixing
US6662211B1 (en) * 2000-04-07 2003-12-09 Lucent Technologies Inc. Method and system for providing conferencing services in a telecommunications system
US6970935B1 (en) * 2000-11-01 2005-11-29 International Business Machines Corporation Conversational networking via transport, coding and control conversational protocols
US6963561B1 (en) * 2000-12-15 2005-11-08 Atrica Israel Ltd. Facility for transporting TDM streams over an asynchronous ethernet network using internet protocol
US6728358B2 (en) * 2001-01-25 2004-04-27 Paltalk Holdings, Inc. Efficient buffer allocation for current and predicted active speakers in voice conferencing systems
US6894715B2 (en) * 2001-06-16 2005-05-17 Eric Harold Henrikson Mixing video signals for an audio and video multimedia conference call
US7161939B2 (en) 2001-06-29 2007-01-09 Ip Unity Method and system for switching among independent packetized audio streams
US6947417B2 (en) * 2001-06-29 2005-09-20 Ip Unity Method and system for providing media services
US6753375B2 (en) * 2001-07-02 2004-06-22 The Goodyear Tire & Rubber Company Process for preparing composite, composition and article thereof
US7006618B1 (en) * 2001-11-09 2006-02-28 Cisco Technology, Inc. Method and apparatus for managing incoming and outgoing calls at an endpoint placed on hold
WO2004006475A2 (en) 2002-07-04 2004-01-15 Nokia Corporation Managing a packet switched conference call
FR2844938B1 (fr) * 2002-09-23 2005-01-14 Cit Alcatel Procede d'interception de donnees de controle, notamment de qualite de service, et dispositif associe
US6888935B1 (en) * 2003-01-15 2005-05-03 Cisco Technology, Inc. Speak-louder signaling system for conference calls
RU2006101325A (ru) * 2003-08-18 2006-06-10 Алькатель (Fr) СПОСОБ СВЯЗИ VoIP С ПЕРЕДАЧЕЙ ДОПОЛНИТЕЛЬНЫХ ДАННЫХ
WO2005018192A1 (en) 2003-08-18 2005-02-24 Alcatel Method of voip communication with additional data transmission
US7460656B2 (en) * 2003-12-18 2008-12-02 Intel Corporation Distributed processing in conference call systems
US20050147261A1 (en) 2003-12-30 2005-07-07 Chiang Yeh Head relational transfer function virtualizer
US7558221B2 (en) * 2004-02-13 2009-07-07 Seiko Epson Corporation Method and system for recording videoconference data
KR100652655B1 (ko) * 2004-08-11 2006-12-06 엘지전자 주식회사 발언권 제어를 위한 피티티 서비스 시스템 및 방법
US7864209B2 (en) 2005-04-28 2011-01-04 Apple Inc. Audio processing in a multi-participant conference
DE102005049074B4 (de) * 2005-10-13 2008-04-03 Infineon Technologies Ag Verfahren zum rechnergestützten Vergeben eines Kommunikationsrechts, Verfahren zum rechnergestützten Erzeugen einer Kommunikationsrecht-Anforderungsnachricht, Kommunikationsrecht-Vergabe-Einheit, Kommunikations-Konferenz-Servereinheit, Kommunikations-Konferenz-Nachricht-Erzeugungseinheit, Kommunikations-Endgerät und Verfahren zum rechnergestützten Initialisieren eines Konferenz-Nachrichtenflusses in einer Kommunikations-Konferenz
US8487956B2 (en) * 2005-11-29 2013-07-16 Kyocera Corporation Communication terminal, system and display method to adaptively update a displayed image
US7664246B2 (en) * 2006-01-13 2010-02-16 Microsoft Corporation Sorting speakers in a network-enabled conference
US20080159507A1 (en) * 2006-12-27 2008-07-03 Nokia Corporation Distributed teleconference multichannel architecture, system, method, and computer program product
US8014322B2 (en) * 2007-02-26 2011-09-06 Cisco, Technology, Inc. Diagnostic tool for troubleshooting multimedia streaming applications
US20080260131A1 (en) * 2007-04-20 2008-10-23 Linus Akesson Electronic apparatus and system with conference call spatializer
US8385233B2 (en) 2007-06-12 2013-02-26 Microsoft Corporation Active speaker identification
US8300080B2 (en) * 2007-06-29 2012-10-30 Microsoft Corporation Techniques for detecting a display device

Also Published As

Publication number Publication date
US20140177482A1 (en) 2014-06-26
KR101486607B1 (ko) 2015-01-26
JP2010529814A (ja) 2010-08-26
JP5579598B2 (ja) 2014-08-27
EP2163035A1 (en) 2010-03-17
EP2163035A4 (en) 2012-04-25
US20080312923A1 (en) 2008-12-18
US9160775B2 (en) 2015-10-13
EP2163035B1 (en) 2013-06-26
RU2483452C2 (ru) 2013-05-27
WO2008157005A1 (en) 2008-12-24
US8385233B2 (en) 2013-02-26
RU2009146029A (ru) 2011-06-20
US8717949B2 (en) 2014-05-06
US20130138740A1 (en) 2013-05-30
CN101689998A (zh) 2010-03-31
KR20100021435A (ko) 2010-02-24

Similar Documents

Publication Publication Date Title
BRPI0812128A2 (pt) Identificação ativa de locutor
EP2154906A4 (en) SPEAKER SYSTEM
DK2846557T3 (da) Forbedret højtaleranordning
EP2177045A4 (en) INCREASED HEADER
DE602007003775D1 (de) Lautsprecher
EP2224751A4 (en) HEARING AIDS
BRPI0920589A2 (pt) dispositivo de alto-falante
EP2233952A4 (en) OMNIDIRECTIONAL RAINMETER INSTRUMENT
BRPI0821573A2 (pt) benzofuropirimidonas
BRPI0814314A2 (pt) Microbiocidas
BRPI0812753A2 (pt) Combinações de substância ativas acaricidas
BRPI0815670A2 (pt) Inseticidas
BRPI0816732A2 (pt) Dispositivo de alto-falante
BRPI0815547A2 (pt) Depsipeptídeos cícliocos
DK2082613T3 (da) Høreapparat
BRPI0820831A2 (pt) Combinações de composto ativo
BRPI0813781A2 (pt) microbiocidas
EP2301257A4 (en) EAR PROTECTION
BRPI0916990A2 (pt) Combinações inseticidas
DE112008003196A5 (de) Handpipettiergerät
EP1833278A4 (en) Speaker
CU20090155A7 (es) Macrolidos
EP2340650A4 (en) HEARING AID
DE112008001534A5 (de) Anreihgussform
DE102007038931A8 (de) Fadenlagennähwirkstoffe

Legal Events

Date Code Title Description
B25A Requested transfer of rights approved

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC (US)

B06F Objections, documents and/or translations needed after an examination request according [chapter 6.6 patent gazette]
B15K Others concerning applications: alteration of classification

Free format text: A CLASSIFICACAO ANTERIOR ERA: H04L 12/18

Ipc: H04L 12/18 (1990.01), H04L 29/06 (1990.01), H04M 3

B06U Preliminary requirement: requests with searches performed by other patent offices: procedure suspended [chapter 6.21 patent gazette]
B11B Dismissal acc. art. 36, par 1 of ipl - no reply within 90 days to fullfil the necessary requirements