[go: up one dir, main page]

EA004079B1 - Система и способ моделирования голоса конкретных людей - Google Patents

Система и способ моделирования голоса конкретных людей Download PDF

Info

Publication number
EA004079B1
EA004079B1 EA200200587A EA200200587A EA004079B1 EA 004079 B1 EA004079 B1 EA 004079B1 EA 200200587 A EA200200587 A EA 200200587A EA 200200587 A EA200200587 A EA 200200587A EA 004079 B1 EA004079 B1 EA 004079B1
Authority
EA
Eurasian Patent Office
Prior art keywords
voice
data
computer
analysis
sufficient
Prior art date
Application number
EA200200587A
Other languages
English (en)
Russian (ru)
Other versions
EA200200587A1 (ru
Inventor
Стивен Дж. Киуг
Кэтрин Аксия Киуг
Original Assignee
Стивен Дж. Киуг
Кэтрин Аксия Киуг
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Стивен Дж. Киуг, Кэтрин Аксия Киуг filed Critical Стивен Дж. Киуг
Publication of EA200200587A1 publication Critical patent/EA200200587A1/ru
Publication of EA004079B1 publication Critical patent/EA004079B1/ru

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
EA200200587A 1999-11-23 2000-11-23 Система и способ моделирования голоса конкретных людей EA004079B1 (ru)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16716899P 1999-11-23 1999-11-23
PCT/US2000/032328 WO2001039180A1 (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Publications (2)

Publication Number Publication Date
EA200200587A1 EA200200587A1 (ru) 2002-10-31
EA004079B1 true EA004079B1 (ru) 2003-12-25

Family

ID=22606225

Family Applications (1)

Application Number Title Priority Date Filing Date
EA200200587A EA004079B1 (ru) 1999-11-23 2000-11-23 Система и способ моделирования голоса конкретных людей

Country Status (13)

Country Link
EP (1) EP1252620A1 (zh)
JP (1) JP2003515768A (zh)
KR (1) KR20020060975A (zh)
CN (1) CN1391690A (zh)
AP (1) AP2002002524A0 (zh)
AU (1) AU2048001A (zh)
BR (1) BR0015773A (zh)
CA (1) CA2392436A1 (zh)
EA (1) EA004079B1 (zh)
IL (1) IL149813A0 (zh)
NO (1) NO20022406L (zh)
WO (1) WO2001039180A1 (zh)
ZA (1) ZA200204036B (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2617918C2 (ru) * 2015-06-19 2017-04-28 Иосиф Исаакович Лившиц Способ формирования образа человека с учетом характеристик его психологического портрета, полученных под контролем полиграфа

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
CN101622659B (zh) * 2007-06-06 2012-02-22 松下电器产业株式会社 音质编辑装置及音质编辑方法
US9240182B2 (en) * 2013-09-17 2016-01-19 Qualcomm Incorporated Method and apparatus for adjusting detection threshold for activating voice assistant function
US9552810B2 (en) 2015-03-31 2017-01-24 International Business Machines Corporation Customizable and individualized speech recognition settings interface for users with language accents
KR101963195B1 (ko) * 2017-06-21 2019-03-28 구동하 사용자 음성을 이용한 생리 주기 결정 방법 및 이를 실행하는 서버
US11093554B2 (en) 2017-09-15 2021-08-17 Kohler Co. Feedback for water consuming appliance
US11314214B2 (en) 2017-09-15 2022-04-26 Kohler Co. Geographic analysis of water conditions
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
CN109298642B (zh) * 2018-09-20 2021-08-27 三星电子(中国)研发中心 采用智能音箱进行监控的方法及装置
KR102466736B1 (ko) * 2021-06-18 2022-11-14 주식회사 한글과컴퓨터 사용자에 의해 입력된 음성을 기초로 본인 인증을 수행하는 음성 기반의 사용자 인증 서버 및 그 동작 방법

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5007081A (en) * 1989-01-05 1991-04-09 Origin Technology, Inc. Speech activated telephone
US5594789A (en) * 1994-10-13 1997-01-14 Bell Atlantic Network Services, Inc. Transaction implementation in video dial tone network
US5717828A (en) * 1995-03-15 1998-02-10 Syracuse Language Systems Speech recognition apparatus and method for learning
US5774841A (en) * 1995-09-20 1998-06-30 The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration Real-time reconfigurable adaptive speech recognition command and control apparatus and method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2617918C2 (ru) * 2015-06-19 2017-04-28 Иосиф Исаакович Лившиц Способ формирования образа человека с учетом характеристик его психологического портрета, полученных под контролем полиграфа

Also Published As

Publication number Publication date
BR0015773A (pt) 2002-08-06
AU2048001A (en) 2001-06-04
CN1391690A (zh) 2003-01-15
ZA200204036B (en) 2003-08-21
NO20022406L (no) 2002-07-12
IL149813A0 (en) 2002-11-10
CA2392436A1 (en) 2001-05-31
KR20020060975A (ko) 2002-07-19
NO20022406D0 (no) 2002-05-21
EP1252620A1 (en) 2002-10-30
JP2003515768A (ja) 2003-05-07
EA200200587A1 (ru) 2002-10-31
AP2002002524A0 (en) 2002-06-30
WO2001039180A1 (en) 2001-05-31

Similar Documents

Publication Publication Date Title
Goel et al. Audio flamingo 3: Advancing audio intelligence with fully open large audio language models
US20020072900A1 (en) System and method of templating specific human voices
CN114242033B (zh) 语音合成方法、装置、设备、存储介质及程序产品
CN111667812A (zh) 一种语音合成方法、装置、设备及存储介质
CN112164379A (zh) 音频文件生成方法、装置、设备及计算机可读存储介质
CN116403558A (zh) 语音克隆模型的训练及语音合成的方法、装置和相关设备
US20050108011A1 (en) System and method of templating specific human voices
CN112863476B (zh) 个性化语音合成模型构建、语音合成和测试方法及装置
EA004079B1 (ru) Система и способ моделирования голоса конкретных людей
CN112885326B (zh) 个性化语音合成模型创建、语音合成和测试方法及装置
CN111477210A (zh) 语音合成方法和装置
US12400632B2 (en) System and method for posthumous dynamic speech synthesis using neural networks and deep learning by generating pixel coordinates using portable network graphic
CN119541451A (zh) 语音合成方法、装置、设备及计算机介质
WO2025101781A1 (en) Synthetic narration generation
CN115132204B (zh) 一种语音处理方法、设备、存储介质及计算机程序产品
Lee et al. The Sound of Hallucinations: Toward a more convincing emulation of internalized voices
WO2004008295A2 (en) System and method for voice characteristic medical analysis
KR102768266B1 (ko) 화자 설명 텍스트에 기초한 합성 음성 생성 방법 및 시스템
Alsabaan Pronunciation support for Arabic learners
Baral Preserving Indigenous Language: Text-To-Speech System for the Myaamia Language
US12230244B1 (en) Graphical user interface for customized storytelling
Rose Child phonology
Kraleva Design and development a children's speech database
Burgess et al. Voice AI and authenticity: current issues and emerging challenges
Gao et al. Advancing Speech Data Collection and Annotation: General Methods, Practical Experience, and Future Perspectives

Legal Events

Date Code Title Description
MM4A Lapse of a eurasian patent due to non-payment of renewal fees within the time limit in the following designated state(s)

Designated state(s): AM AZ BY KZ KG MD TJ TM RU