EA004079B1 - Система и способ моделирования голоса конкретных людей - Google Patents
Система и способ моделирования голоса конкретных людей Download PDFInfo
- Publication number
- EA004079B1 EA004079B1 EA200200587A EA200200587A EA004079B1 EA 004079 B1 EA004079 B1 EA 004079B1 EA 200200587 A EA200200587 A EA 200200587A EA 200200587 A EA200200587 A EA 200200587A EA 004079 B1 EA004079 B1 EA 004079B1
- Authority
- EA
- Eurasian Patent Office
- Prior art keywords
- voice
- data
- computer
- analysis
- sufficient
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/54—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Machine Translation (AREA)
- User Interface Of Digital Computer (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16716899P | 1999-11-23 | 1999-11-23 | |
| PCT/US2000/032328 WO2001039180A1 (en) | 1999-11-23 | 2000-11-23 | System and method of templating specific human voices |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EA200200587A1 EA200200587A1 (ru) | 2002-10-31 |
| EA004079B1 true EA004079B1 (ru) | 2003-12-25 |
Family
ID=22606225
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EA200200587A EA004079B1 (ru) | 1999-11-23 | 2000-11-23 | Система и способ моделирования голоса конкретных людей |
Country Status (13)
| Country | Link |
|---|---|
| EP (1) | EP1252620A1 (zh) |
| JP (1) | JP2003515768A (zh) |
| KR (1) | KR20020060975A (zh) |
| CN (1) | CN1391690A (zh) |
| AP (1) | AP2002002524A0 (zh) |
| AU (1) | AU2048001A (zh) |
| BR (1) | BR0015773A (zh) |
| CA (1) | CA2392436A1 (zh) |
| EA (1) | EA004079B1 (zh) |
| IL (1) | IL149813A0 (zh) |
| NO (1) | NO20022406L (zh) |
| WO (1) | WO2001039180A1 (zh) |
| ZA (1) | ZA200204036B (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2617918C2 (ru) * | 2015-06-19 | 2017-04-28 | Иосиф Исаакович Лившиц | Способ формирования образа человека с учетом характеристик его психологического портрета, полученных под контролем полиграфа |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
| CN101622659B (zh) * | 2007-06-06 | 2012-02-22 | 松下电器产业株式会社 | 音质编辑装置及音质编辑方法 |
| US9240182B2 (en) * | 2013-09-17 | 2016-01-19 | Qualcomm Incorporated | Method and apparatus for adjusting detection threshold for activating voice assistant function |
| US9552810B2 (en) | 2015-03-31 | 2017-01-24 | International Business Machines Corporation | Customizable and individualized speech recognition settings interface for users with language accents |
| KR101963195B1 (ko) * | 2017-06-21 | 2019-03-28 | 구동하 | 사용자 음성을 이용한 생리 주기 결정 방법 및 이를 실행하는 서버 |
| US11093554B2 (en) | 2017-09-15 | 2021-08-17 | Kohler Co. | Feedback for water consuming appliance |
| US11314214B2 (en) | 2017-09-15 | 2022-04-26 | Kohler Co. | Geographic analysis of water conditions |
| US11099540B2 (en) | 2017-09-15 | 2021-08-24 | Kohler Co. | User identity in household appliances |
| US10448762B2 (en) | 2017-09-15 | 2019-10-22 | Kohler Co. | Mirror |
| US10887125B2 (en) | 2017-09-15 | 2021-01-05 | Kohler Co. | Bathroom speaker |
| CN109298642B (zh) * | 2018-09-20 | 2021-08-27 | 三星电子(中国)研发中心 | 采用智能音箱进行监控的方法及装置 |
| KR102466736B1 (ko) * | 2021-06-18 | 2022-11-14 | 주식회사 한글과컴퓨터 | 사용자에 의해 입력된 음성을 기초로 본인 인증을 수행하는 음성 기반의 사용자 인증 서버 및 그 동작 방법 |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5007081A (en) * | 1989-01-05 | 1991-04-09 | Origin Technology, Inc. | Speech activated telephone |
| US5594789A (en) * | 1994-10-13 | 1997-01-14 | Bell Atlantic Network Services, Inc. | Transaction implementation in video dial tone network |
| US5717828A (en) * | 1995-03-15 | 1998-02-10 | Syracuse Language Systems | Speech recognition apparatus and method for learning |
| US5774841A (en) * | 1995-09-20 | 1998-06-30 | The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration | Real-time reconfigurable adaptive speech recognition command and control apparatus and method |
-
2000
- 2000-11-23 KR KR1020027006630A patent/KR20020060975A/ko not_active Withdrawn
- 2000-11-23 CN CN00816092A patent/CN1391690A/zh active Pending
- 2000-11-23 EA EA200200587A patent/EA004079B1/ru not_active IP Right Cessation
- 2000-11-23 CA CA002392436A patent/CA2392436A1/en not_active Abandoned
- 2000-11-23 JP JP2001540763A patent/JP2003515768A/ja active Pending
- 2000-11-23 WO PCT/US2000/032328 patent/WO2001039180A1/en not_active Ceased
- 2000-11-23 AP APAP/P/2002/002524A patent/AP2002002524A0/en unknown
- 2000-11-23 IL IL14981300A patent/IL149813A0/xx unknown
- 2000-11-23 AU AU20480/01A patent/AU2048001A/en not_active Abandoned
- 2000-11-23 EP EP00983768A patent/EP1252620A1/en not_active Withdrawn
- 2000-11-23 BR BR0015773-2A patent/BR0015773A/pt not_active IP Right Cessation
-
2002
- 2002-05-21 ZA ZA200204036A patent/ZA200204036B/xx unknown
- 2002-05-21 NO NO20022406A patent/NO20022406L/no unknown
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2617918C2 (ru) * | 2015-06-19 | 2017-04-28 | Иосиф Исаакович Лившиц | Способ формирования образа человека с учетом характеристик его психологического портрета, полученных под контролем полиграфа |
Also Published As
| Publication number | Publication date |
|---|---|
| BR0015773A (pt) | 2002-08-06 |
| AU2048001A (en) | 2001-06-04 |
| CN1391690A (zh) | 2003-01-15 |
| ZA200204036B (en) | 2003-08-21 |
| NO20022406L (no) | 2002-07-12 |
| IL149813A0 (en) | 2002-11-10 |
| CA2392436A1 (en) | 2001-05-31 |
| KR20020060975A (ko) | 2002-07-19 |
| NO20022406D0 (no) | 2002-05-21 |
| EP1252620A1 (en) | 2002-10-30 |
| JP2003515768A (ja) | 2003-05-07 |
| EA200200587A1 (ru) | 2002-10-31 |
| AP2002002524A0 (en) | 2002-06-30 |
| WO2001039180A1 (en) | 2001-05-31 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Goel et al. | Audio flamingo 3: Advancing audio intelligence with fully open large audio language models | |
| US20020072900A1 (en) | System and method of templating specific human voices | |
| CN114242033B (zh) | 语音合成方法、装置、设备、存储介质及程序产品 | |
| CN111667812A (zh) | 一种语音合成方法、装置、设备及存储介质 | |
| CN112164379A (zh) | 音频文件生成方法、装置、设备及计算机可读存储介质 | |
| CN116403558A (zh) | 语音克隆模型的训练及语音合成的方法、装置和相关设备 | |
| US20050108011A1 (en) | System and method of templating specific human voices | |
| CN112863476B (zh) | 个性化语音合成模型构建、语音合成和测试方法及装置 | |
| EA004079B1 (ru) | Система и способ моделирования голоса конкретных людей | |
| CN112885326B (zh) | 个性化语音合成模型创建、语音合成和测试方法及装置 | |
| CN111477210A (zh) | 语音合成方法和装置 | |
| US12400632B2 (en) | System and method for posthumous dynamic speech synthesis using neural networks and deep learning by generating pixel coordinates using portable network graphic | |
| CN119541451A (zh) | 语音合成方法、装置、设备及计算机介质 | |
| WO2025101781A1 (en) | Synthetic narration generation | |
| CN115132204B (zh) | 一种语音处理方法、设备、存储介质及计算机程序产品 | |
| Lee et al. | The Sound of Hallucinations: Toward a more convincing emulation of internalized voices | |
| WO2004008295A2 (en) | System and method for voice characteristic medical analysis | |
| KR102768266B1 (ko) | 화자 설명 텍스트에 기초한 합성 음성 생성 방법 및 시스템 | |
| Alsabaan | Pronunciation support for Arabic learners | |
| Baral | Preserving Indigenous Language: Text-To-Speech System for the Myaamia Language | |
| US12230244B1 (en) | Graphical user interface for customized storytelling | |
| Rose | Child phonology | |
| Kraleva | Design and development a children's speech database | |
| Burgess et al. | Voice AI and authenticity: current issues and emerging challenges | |
| Gao et al. | Advancing Speech Data Collection and Annotation: General Methods, Practical Experience, and Future Perspectives |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MM4A | Lapse of a eurasian patent due to non-payment of renewal fees within the time limit in the following designated state(s) |
Designated state(s): AM AZ BY KZ KG MD TJ TM RU |