[go: up one dir, main page]

AU2048001A - System and method of templating specific human voices - Google Patents

System and method of templating specific human voices Download PDF

Info

Publication number
AU2048001A
AU2048001A AU20480/01A AU2048001A AU2048001A AU 2048001 A AU2048001 A AU 2048001A AU 20480/01 A AU20480/01 A AU 20480/01A AU 2048001 A AU2048001 A AU 2048001A AU 2048001 A AU2048001 A AU 2048001A
Authority
AU
Australia
Prior art keywords
voice
data
captured
template
specific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU20480/01A
Other languages
English (en)
Inventor
Katherine Axia Keough
Steven J. Keough
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of AU2048001A publication Critical patent/AU2048001A/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
AU20480/01A 1999-11-23 2000-11-23 System and method of templating specific human voices Abandoned AU2048001A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16716899P 1999-11-23 1999-11-23
US60167168 1999-11-23
PCT/US2000/032328 WO2001039180A1 (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Publications (1)

Publication Number Publication Date
AU2048001A true AU2048001A (en) 2001-06-04

Family

ID=22606225

Family Applications (1)

Application Number Title Priority Date Filing Date
AU20480/01A Abandoned AU2048001A (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Country Status (13)

Country Link
EP (1) EP1252620A1 (zh)
JP (1) JP2003515768A (zh)
KR (1) KR20020060975A (zh)
CN (1) CN1391690A (zh)
AP (1) AP2002002524A0 (zh)
AU (1) AU2048001A (zh)
BR (1) BR0015773A (zh)
CA (1) CA2392436A1 (zh)
EA (1) EA004079B1 (zh)
IL (1) IL149813A0 (zh)
NO (1) NO20022406L (zh)
WO (1) WO2001039180A1 (zh)
ZA (1) ZA200204036B (zh)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
CN101622659B (zh) * 2007-06-06 2012-02-22 松下电器产业株式会社 音质编辑装置及音质编辑方法
US9240182B2 (en) * 2013-09-17 2016-01-19 Qualcomm Incorporated Method and apparatus for adjusting detection threshold for activating voice assistant function
US9552810B2 (en) 2015-03-31 2017-01-24 International Business Machines Corporation Customizable and individualized speech recognition settings interface for users with language accents
RU2617918C2 (ru) * 2015-06-19 2017-04-28 Иосиф Исаакович Лившиц Способ формирования образа человека с учетом характеристик его психологического портрета, полученных под контролем полиграфа
KR101963195B1 (ko) * 2017-06-21 2019-03-28 구동하 사용자 음성을 이용한 생리 주기 결정 방법 및 이를 실행하는 서버
US11314215B2 (en) 2017-09-15 2022-04-26 Kohler Co. Apparatus controlling bathroom appliance lighting based on user identity
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US11093554B2 (en) 2017-09-15 2021-08-17 Kohler Co. Feedback for water consuming appliance
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
CN109298642B (zh) * 2018-09-20 2021-08-27 三星电子(中国)研发中心 采用智能音箱进行监控的方法及装置
KR102466736B1 (ko) * 2021-06-18 2022-11-14 주식회사 한글과컴퓨터 사용자에 의해 입력된 음성을 기초로 본인 인증을 수행하는 음성 기반의 사용자 인증 서버 및 그 동작 방법

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5007081A (en) * 1989-01-05 1991-04-09 Origin Technology, Inc. Speech activated telephone
US5594789A (en) * 1994-10-13 1997-01-14 Bell Atlantic Network Services, Inc. Transaction implementation in video dial tone network
US5717828A (en) * 1995-03-15 1998-02-10 Syracuse Language Systems Speech recognition apparatus and method for learning
US5774841A (en) * 1995-09-20 1998-06-30 The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration Real-time reconfigurable adaptive speech recognition command and control apparatus and method

Also Published As

Publication number Publication date
IL149813A0 (en) 2002-11-10
NO20022406L (no) 2002-07-12
EP1252620A1 (en) 2002-10-30
BR0015773A (pt) 2002-08-06
CA2392436A1 (en) 2001-05-31
ZA200204036B (en) 2003-08-21
JP2003515768A (ja) 2003-05-07
CN1391690A (zh) 2003-01-15
NO20022406D0 (no) 2002-05-21
KR20020060975A (ko) 2002-07-19
WO2001039180A1 (en) 2001-05-31
EA200200587A1 (ru) 2002-10-31
AP2002002524A0 (en) 2002-06-30
EA004079B1 (ru) 2003-12-25

Similar Documents

Publication Publication Date Title
US20020072900A1 (en) System and method of templating specific human voices
Goel et al. Audio flamingo 3: Advancing audio intelligence with fully open large audio language models
US20240361827A1 (en) Systems, Methods, And Devices to Curate and Present Content and Physical Elements Based on Personal Biometric Identifier Information
JP6876752B2 (ja) 応答方法及び装置
CN113010138B (zh) 文章的语音播放方法、装置、设备及计算机可读存储介质
US20050108011A1 (en) System and method of templating specific human voices
Gold et al. Speech and audio signal processing: processing and perception of speech and music
Rachman et al. DAVID: An open-source platform for real-time transformation of infra-segmental emotional cues in running speech
US20100324905A1 (en) Voice models for document narration
CN112164379A (zh) 音频文件生成方法、装置、设备及计算机可读存储介质
CN106847258A (zh) 用于共享调适语音简档的方法和设备
JP7696498B2 (ja) バーチャルコンサートの処理方法、処理装置、電子機器およびコンピュータプログラム
AU2048001A (en) System and method of templating specific human voices
CN112885326B (zh) 个性化语音合成模型创建、语音合成和测试方法及装置
Ramati Algorithmic ventriloquism: The contested state of voice in AI speech generators
CN114048299A (zh) 对话方法、装置、设备、计算机可读存储介质及程序产品
WO2025101781A1 (en) Synthetic narration generation
CN119993114A (zh) 基于多模态风格嵌入的语音合成方法、装置、设备及介质
Lee et al. The Sound of Hallucinations: Toward a more convincing emulation of internalized voices
WO2004008295A2 (en) System and method for voice characteristic medical analysis
Own et al. The Individual Perception in Synthetic Speech
KR102768266B1 (ko) 화자 설명 텍스트에 기초한 합성 음성 생성 방법 및 시스템
US12230244B1 (en) Graphical user interface for customized storytelling
JP4356334B2 (ja) 音声データ提供システムならびに音声データ作成装置
Lutsenko et al. Research on a voice changed by distortion