[go: up one dir, main page]

WO2005020208A3 - Topological voiceprints for speaker identification - Google Patents

Topological voiceprints for speaker identification Download PDF

Info

Publication number
WO2005020208A3
WO2005020208A3 PCT/US2004/027193 US2004027193W WO2005020208A3 WO 2005020208 A3 WO2005020208 A3 WO 2005020208A3 US 2004027193 W US2004027193 W US 2004027193W WO 2005020208 A3 WO2005020208 A3 WO 2005020208A3
Authority
WO
WIPO (PCT)
Prior art keywords
topological
voiceprints
speaker identification
speaker
spectral
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2004/027193
Other languages
French (fr)
Other versions
WO2005020208A2 (en
Inventor
Bernardo Gabriel Mindlin
Marcos Alberto Trevisan
Manuel Camilo Eguia
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Universidad Nacional de Quilmes
University of California Berkeley
University of California San Diego UCSD
Original Assignee
Universidad Nacional de Quilmes
University of California Berkeley
University of California San Diego UCSD
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Universidad Nacional de Quilmes, University of California Berkeley, University of California San Diego UCSD filed Critical Universidad Nacional de Quilmes
Priority to US10/568,564 priority Critical patent/US20070198262A1/en
Priority to ARP040103030A priority patent/AR047710A1/en
Publication of WO2005020208A2 publication Critical patent/WO2005020208A2/en
Publication of WO2005020208A3 publication Critical patent/WO2005020208A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Collating Specific Patterns (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

The speaker recognition techniques of this application use a topological description of his/her voice spectral properties in order to use it as a biometric characterization for the speaker. Distinctly different from computing distances between spectral curves obtained from voices of different speakers in various spectral analysis methods, such topological features provide a one­to-one correspondence between a subject and a mold represented by a set of rational numbers.
PCT/US2004/027193 2003-08-20 2004-08-20 Topological voiceprints for speaker identification Ceased WO2005020208A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US10/568,564 US20070198262A1 (en) 2003-08-20 2004-08-20 Topological voiceprints for speaker identification
ARP040103030A AR047710A1 (en) 2003-08-20 2004-08-24 TOPOLOGICAL VOICE IMPRESSIONS FOR SPEAKER IDENTIFICATION

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US49700703P 2003-08-20 2003-08-20
US60/497,007 2003-08-20

Publications (2)

Publication Number Publication Date
WO2005020208A2 WO2005020208A2 (en) 2005-03-03
WO2005020208A3 true WO2005020208A3 (en) 2005-04-28

Family

ID=34216064

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/027193 Ceased WO2005020208A2 (en) 2003-08-20 2004-08-20 Topological voiceprints for speaker identification

Country Status (3)

Country Link
CN (1) CN1871639A (en)
AR (1) AR047710A1 (en)
WO (1) WO2005020208A2 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8086519B2 (en) 2004-10-14 2011-12-27 Cfph, Llc System and method for facilitating a wireless financial transaction
US7860778B2 (en) 2004-11-08 2010-12-28 Cfph, Llc System and method for implementing push technology in a wireless financial transaction
CN102129859B (en) * 2010-01-18 2013-10-30 盛乐信息技术(上海)有限公司 Voiceprint authentication system and method for rapid channel compensation
KR101357710B1 (en) * 2013-06-18 2014-02-04 (주) 엠티콤 Method for electronic document producing and inquiring, and recording medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4415767A (en) * 1981-10-19 1983-11-15 Votan Method and apparatus for speech recognition and reproduction
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US6006186A (en) * 1997-10-16 1999-12-21 Sony Corporation Method and apparatus for a parameter sharing speech recognition system
US6092039A (en) * 1997-10-31 2000-07-18 International Business Machines Corporation Symbiotic automatic speech recognition and vocoder
US6236963B1 (en) * 1998-03-16 2001-05-22 Atr Interpreting Telecommunications Research Laboratories Speaker normalization processor apparatus for generating frequency warping function, and speech recognition apparatus with said speaker normalization processor apparatus
US6256609B1 (en) * 1997-05-09 2001-07-03 Washington University Method and apparatus for speaker recognition using lattice-ladder filters
US6470315B1 (en) * 1996-09-11 2002-10-22 Texas Instruments Incorporated Enrollment and modeling method and apparatus for robust speaker dependent speech models
US6618702B1 (en) * 2002-06-14 2003-09-09 Mary Antoinette Kohler Method of and device for phone-based speaker recognition

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4415767A (en) * 1981-10-19 1983-11-15 Votan Method and apparatus for speech recognition and reproduction
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US6470315B1 (en) * 1996-09-11 2002-10-22 Texas Instruments Incorporated Enrollment and modeling method and apparatus for robust speaker dependent speech models
US6256609B1 (en) * 1997-05-09 2001-07-03 Washington University Method and apparatus for speaker recognition using lattice-ladder filters
US6006186A (en) * 1997-10-16 1999-12-21 Sony Corporation Method and apparatus for a parameter sharing speech recognition system
US6092039A (en) * 1997-10-31 2000-07-18 International Business Machines Corporation Symbiotic automatic speech recognition and vocoder
US6236963B1 (en) * 1998-03-16 2001-05-22 Atr Interpreting Telecommunications Research Laboratories Speaker normalization processor apparatus for generating frequency warping function, and speech recognition apparatus with said speaker normalization processor apparatus
US6618702B1 (en) * 2002-06-14 2003-09-09 Mary Antoinette Kohler Method of and device for phone-based speaker recognition

Also Published As

Publication number Publication date
WO2005020208A2 (en) 2005-03-03
CN1871639A (en) 2006-11-29
AR047710A1 (en) 2006-02-15

Similar Documents

Publication Publication Date Title
ATE246835T1 (en) SPEAKER RECOGNITION
ATE312398T1 (en) SPEAKER ADAPTATION FOR VOICE RECOGNITION
ATE410768T1 (en) SYSTEM AND METHOD FOR OPERATING A VOICE RECOGNITION SYSTEM IN A VEHICLE
EP0984431A3 (en) Speaker verification and speaker identification based on eigenvoices
WO2005059893A3 (en) System and method for providing improved claimant authentication
WO2009153788A3 (en) Speaker characterization through speech analysis
WO2003036617A1 (en) Speech recognition apparatus and speech recognition method
WO2004100638A3 (en) Source-dependent text-to-speech system
EP1189206A3 (en) Voice control of electronic devices
EP1103952A3 (en) Context-dependent acoustic models for speech recognition with eigenvoice training
WO2007095277A3 (en) Communication device having speaker independent speech recognition
Chen et al. Recognition of noisy speech using dynamic spectral subband centroids
Lei et al. Mel, linear, and antimel frequency cepstral coefficients in broad phonetic regions for telephone speaker recognition.
WO2008111190A1 (en) Accoustic model registration device, speaker recognition device, accoustic model registration method, and accoustic model registration processing program
CN105869657A (en) System and method for identifying voice emotion
DE602004015189D1 (en) Speech recognition apparatus and method with models adapted to the current noise conditions
WO2004068893A3 (en) Method and apparatus for noise suppression within a distributed speech recognition system
EP1376537A3 (en) Apparatus, method, and computer-readable recording medium for recognition of keywords from spontaneous speech
WO2005020208A3 (en) Topological voiceprints for speaker identification
Wildermoth et al. Use of voicing and pitch information for speaker recognition
McLaren et al. On the Issue of Calibration in DNN-Based Speaker Recognition Systems.
Chao Speaker identification using pairwise log-likelihood ratio measures
ATE441918T1 (en) VOICE DIALOGUE METHOD AND SYSTEM
Dumpala et al. Robust Vowel Landmark Detection Using Epoch-Based Features.
TW200710823A (en) Method and system for template inquiry dialogue system

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200480030850.0

Country of ref document: CN

AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 10568564

Country of ref document: US

Ref document number: 2007198262

Country of ref document: US

122 Ep: pct application non-entry in european phase
WWP Wipo information: published in national office

Ref document number: 10568564

Country of ref document: US