CN109616096A - Method, apparatus, server and medium for constructing a multilingual speech decoding graph - Google Patents
- Publication number: CN109616096A
- Application number: CN201811643641.3A
- Authority: CN (China)
- Prior art keywords: word, languages, subject kind, pronunciation, subject
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
          - G10L2015/025—Phonemes, fenemes or fenones being the recognition units
        - G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
          - G10L15/063—Training
        - G10L15/08—Speech classification or search
          - G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
          - G10L15/18—Speech classification or search using natural language modelling
        - G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Probability & Statistics with Applications (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811643641.3A CN109616096B (zh) | 2018-12-29 | 2018-12-29 | Method, apparatus, server and medium for constructing a multilingual speech decoding graph |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811643641.3A CN109616096B (zh) | 2018-12-29 | 2018-12-29 | Method, apparatus, server and medium for constructing a multilingual speech decoding graph |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109616096A (zh) | 2019-04-12 |
| CN109616096B (zh) | 2022-01-04 |
Family
ID=66015929
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811643641.3A (Active), CN109616096B (zh) | 2018-12-29 | 2018-12-29 | Method, apparatus, server and medium for constructing a multilingual speech decoding graph |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109616096B (zh) |
Cited By (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
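| CN110070853A (zh) * | 2019-04-29 | 2019-07-30 | 盐城工业职业技术学院 | Speech recognition and conversion method and system |
| CN110517668A (zh) * | 2019-07-23 | 2019-11-29 | 普强信息技术(北京)有限公司 | Chinese-English mixed speech recognition system and method |
| CN110634487A (zh) * | 2019-10-24 | 2019-12-31 | 科大讯飞股份有限公司 | Bilingual mixed speech recognition method, apparatus, device and storage medium |
| CN111369974A (zh) * | 2020-03-11 | 2020-07-03 | 北京声智科技有限公司 | Dialect pronunciation annotation method, language recognition method and related apparatus |
| CN111916062A (zh) * | 2019-05-07 | 2020-11-10 | 阿里巴巴集团控股有限公司 | Speech recognition method, apparatus and system |
| CN112185346A (zh) * | 2020-09-25 | 2021-01-05 | 北京百分点信息科技有限公司 | Multilingual speech keyword detection and model generation methods, and electronic device |
| CN112466293A (zh) * | 2020-11-13 | 2021-03-09 | 广州视源电子科技股份有限公司 | Decoding graph optimization method, apparatus and storage medium |
| CN112837675A (zh) * | 2019-11-22 | 2021-05-25 | 阿里巴巴集团控股有限公司 | Speech recognition method and apparatus, and related system and device |
| CN113077786A (zh) * | 2021-03-23 | 2021-07-06 | 北京儒博科技有限公司 | Speech recognition method, apparatus, device and storage medium |
| CN114038463A (zh) * | 2020-07-21 | 2022-02-11 | 中兴通讯股份有限公司 | Mixed speech processing method, electronic device and computer-readable medium |
| CN114495897A (zh) * | 2022-02-24 | 2022-05-13 | 中国科学技术大学 | Speech synthesis system and method that do not rely on a pronunciation dictionary |
| WO2022178996A1 (zh) * | 2021-02-26 | 2022-09-01 | 平安科技(深圳)有限公司 | Multilingual speech model generation method, apparatus, computer device and storage medium |
| CN115132176A (zh) * | 2022-06-28 | 2022-09-30 | 广州小鹏汽车科技有限公司 | Speech recognition method and server |
| CN115810347A (zh) * | 2021-09-13 | 2023-03-17 | 北京猿力未来科技有限公司 | Speech recognition method, apparatus, storage medium and device |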
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
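| CN102063900A (zh) * | 2010-11-26 | 2011-05-18 | 北京交通大学 | Speech recognition method and system for overcoming confusable pronunciations |
| JP2015001695A (ja) * | 2013-06-18 | 2015-01-05 | 日本電信電話株式会社 | Speech recognition device, speech recognition method and program |
| CN106935239A (zh) * | 2015-12-29 | 2017-07-07 | 阿里巴巴集团控股有限公司 | Method and apparatus for constructing a pronunciation dictionary |
| CN107195296A (zh) * | 2016-03-15 | 2017-09-22 | 阿里巴巴集团控股有限公司 | Speech recognition method, apparatus, terminal and system |
| CN108986791A (zh) * | 2018-08-10 | 2018-12-11 | 南京航空航天大学 | Chinese and English speech recognition method and system for civil aviation air-ground communication |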
Cited By (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
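| CN110070853A (zh) * | 2019-04-29 | 2019-07-30 | 盐城工业职业技术学院 | Speech recognition and conversion method and system |
| CN111583905A (zh) * | 2019-04-29 | 2020-08-25 | 盐城工业职业技术学院 | Speech recognition and conversion method and system |
| CN111916062A (zh) * | 2019-05-07 | 2020-11-10 | 阿里巴巴集团控股有限公司 | Speech recognition method, apparatus and system |
| CN110517668A (zh) * | 2019-07-23 | 2019-11-29 | 普强信息技术(北京)有限公司 | Chinese-English mixed speech recognition system and method |
| CN110517668B (zh) * | 2019-07-23 | 2022-09-27 | 普强时代(珠海横琴)信息技术有限公司 | Chinese-English mixed speech recognition system and method |
| CN110634487A (zh) * | 2019-10-24 | 2019-12-31 | 科大讯飞股份有限公司 | Bilingual mixed speech recognition method, apparatus, device and storage medium |
| CN112837675A (zh) * | 2019-11-22 | 2021-05-25 | 阿里巴巴集团控股有限公司 | Speech recognition method and apparatus, and related system and device |
| CN111369974A (zh) * | 2020-03-11 | 2020-07-03 | 北京声智科技有限公司 | Dialect pronunciation annotation method, language recognition method and related apparatus |
| CN111369974B (zh) * | 2020-03-11 | 2024-01-19 | 北京声智科技有限公司 | Dialect pronunciation annotation method, language recognition method and related apparatus |
| CN114038463A (zh) * | 2020-07-21 | 2022-02-11 | 中兴通讯股份有限公司 | Mixed speech processing method, electronic device and computer-readable medium |
| CN112185346A (zh) * | 2020-09-25 | 2021-01-05 | 北京百分点信息科技有限公司 | Multilingual speech keyword detection and model generation methods, and electronic device |
| CN112466293A (zh) * | 2020-11-13 | 2021-03-09 | 广州视源电子科技股份有限公司 | Decoding graph optimization method, apparatus and storage medium |
| WO2022178996A1 (zh) * | 2021-02-26 | 2022-09-01 | 平安科技(深圳)有限公司 | Multilingual speech model generation method, apparatus, computer device and storage medium |
| CN113077786A (zh) * | 2021-03-23 | 2021-07-06 | 北京儒博科技有限公司 | Speech recognition method, apparatus, device and storage medium |
| CN113077786B (zh) * | 2021-03-23 | 2022-12-02 | 北京如布科技有限公司 | Speech recognition method, apparatus, device and storage medium |
| CN115810347A (zh) * | 2021-09-13 | 2023-03-17 | 北京猿力未来科技有限公司 | Speech recognition method, apparatus, storage medium and device |
| CN114495897A (zh) * | 2022-02-24 | 2022-05-13 | 中国科学技术大学 | Speech synthesis system and method that do not rely on a pronunciation dictionary |
| CN115132176A (zh) * | 2022-06-28 | 2022-09-30 | 广州小鹏汽车科技有限公司 | Speech recognition method and server |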
Also Published As
| Publication number | Publication date |
|---|---|
| CN109616096B (zh) | 2022-01-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
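| CN109616096A (zh) | | Method, apparatus, server and medium for constructing a multilingual speech decoding graph |
| US11942082B2 (en) | | Facilitating communications with automated assistants in multiple languages |
| EP3994683B1 (en) | | Multilingual neural text-to-speech synthesis |
| US11354521B2 (en) | | Facilitating communications with automated assistants in multiple languages |
| US10176804B2 (en) | | Analyzing textual data |
| US9472190B2 (en) | | Method and system for automatic speech recognition |
| CN111402861B (zh) | | Speech recognition method, apparatus, device and storage medium |
| CN109976702A (zh) | | Speech recognition method, apparatus and terminal |
| JP7158217B2 (ja) | | Speech recognition method, apparatus and server |
| US10242670B2 (en) | | Syntactic re-ranking of potential transcriptions during automatic speech recognition |
| CN109754809A (zh) | | Speech recognition method, apparatus, electronic device and storage medium |
| CN109448704A (zh) | | Method, apparatus, server and storage medium for constructing a speech decoding graph |
| CN110852075B (zh) | | Speech transcription method and apparatus with automatic punctuation insertion, and readable storage medium |
| CN104573099A (zh) | | Question search method and apparatus |
| KR20230156795A (ko) | | Word segmentation regularization |
| CN110196929A (zh) | | Method and apparatus for generating question-answer pairs |
| CN112037772A (zh) | | Multimodal response obligation detection method, system and apparatus |
| CN108305618A (zh) | | Speech acquisition and search method, smart pen, search terminal and storage medium |
| CN113488034A (zh) | | Speech information processing method, apparatus, device and medium |
| KR20190074508A (ko) | | Method for crowdsourcing data of a dialogue model for a chatbot |
| CN110647613A (zh) | | Courseware construction method, apparatus, server and storage medium |
| WO2022267405A1 (zh) | | Voice interaction method, system, electronic device and storage medium |
| CN111489742B (zh) | | Acoustic model training method, speech recognition method, apparatus and electronic device |
| CN113066473A (zh) | | Speech synthesis method, apparatus, storage medium and electronic device |
| CN116129879A (zh) | | Speech recognition method, apparatus, device and storage medium |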
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | CB02 | Change of applicant information | Address after: Room 508-598, Xitian Gezhuang Town Government Office Building, No. 8 Xitong Road, Miyun District Economic Development Zone, Beijing 101500; Applicant after: BEIJING ROOBO TECHNOLOGY Co.,Ltd.; Address before: Room 508-598, Xitian Gezhuang Town Government Office Building, No. 8 Xitong Road, Miyun District Economic Development Zone, Beijing 101500; Applicant before: BEIJING INTELLIGENT STEWARD Co.,Ltd. |
| | TA01 | Transfer of patent application right | Effective date of registration: 20210823; Address after: Room 301-112, floor 3, building 2, No. 18, YANGFANGDIAN Road, Haidian District, Beijing 100089; Applicant after: Beijing Rubu Technology Co.,Ltd.; Address before: Room 508-598, Xitian Gezhuang Town Government Office Building, No. 8 Xitong Road, Miyun District Economic Development Zone, Beijing 101500; Applicant before: BEIJING ROOBO TECHNOLOGY Co.,Ltd. |
| | GR01 | Patent grant | |
| | TR01 | Transfer of patent right | Effective date of registration: 20240927; Address after: Room 518, 5th Floor, Building A18, No. 9 Jiusheng Road, Shangcheng District, Hangzhou City, Zhejiang Province, 310000; Patentee after: HANGZHOU PINGZHI INFORMATION TECHNOLOGY CO.,LTD.; Country or region after: China; Address before: Room 301-112, floor 3, building 2, No. 18, YANGFANGDIAN Road, Haidian District, Beijing 100089; Patentee before: Beijing Rubu Technology Co.,Ltd.; Country or region before: China |