CN1342968A - High-accuracy high-resolution base frequency extracting method for speech recognization - Google Patents
High-accuracy high-resolution base frequency extracting method for speech recognization Download PDFInfo
- Publication number
- CN1342968A CN1342968A CN00124711A CN00124711A CN1342968A CN 1342968 A CN1342968 A CN 1342968A CN 00124711 A CN00124711 A CN 00124711A CN 00124711 A CN00124711 A CN 00124711A CN 1342968 A CN1342968 A CN 1342968A
- Authority
- CN
- China
- Prior art keywords
- fundamental frequency
- per
- frequency
- resolution
- score
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Auxiliary Devices For Music (AREA)
Abstract
用于语音识别的高精度高分辨率基频提取方法是一种将频域、时域分析以及动态规划(DP)相结合的基频提取方法。主要特点是:首先对语音信号在频域上进行FFT变换,再对其进行谐波分析,然后通过峰值检测,选若干个基频的候选,在时域上对这些候选基频用自相关系数评测,再用动态规划算法不定长回溯综合频域、时域的分析结果以及基频变化量确定一条最优的基频轮廓线。为保证基频提取的分辨率,采用了降采样率、插值等方法。
The high-precision and high-resolution fundamental frequency extraction method for speech recognition is a fundamental frequency extraction method that combines frequency domain, time domain analysis and dynamic programming (DP). The main features are: firstly perform FFT transformation on the voice signal in the frequency domain, then perform harmonic analysis on it, and then select several candidates for the fundamental frequency through peak detection, and use the autocorrelation coefficient for these candidate fundamental frequencies in the time domain Then use the dynamic programming algorithm to look back at an indefinite length to determine an optimal fundamental frequency contour based on the analysis results of the frequency domain and time domain and the variation of the fundamental frequency. In order to ensure the resolution of fundamental frequency extraction, methods such as downsampling rate and interpolation are adopted.
Description
Claims (7)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB001247115A CN1151490C (en) | 2000-09-13 | 2000-09-13 | High-accuracy high-resolution base frequency extracting method for speech recognization |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB001247115A CN1151490C (en) | 2000-09-13 | 2000-09-13 | High-accuracy high-resolution base frequency extracting method for speech recognization |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1342968A true CN1342968A (en) | 2002-04-03 |
| CN1151490C CN1151490C (en) | 2004-05-26 |
Family
ID=4590609
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB001247115A Expired - Lifetime CN1151490C (en) | 2000-09-13 | 2000-09-13 | High-accuracy high-resolution base frequency extracting method for speech recognization |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN1151490C (en) |
Cited By (26)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN100580768C (en) * | 2005-08-08 | 2010-01-13 | 中国科学院声学研究所 | A voiced sound detection method based on harmonic characteristics |
| CN100583235C (en) * | 2003-03-27 | 2010-01-20 | 法国电讯 | Method for analyzing fundamental frequency information and voice conversion method and system for implementing said analysis method |
| CN101030375B (en) * | 2007-04-13 | 2011-01-26 | 清华大学 | Method for extracting base-sound period based on dynamic plan |
| CN101051460B (en) * | 2006-04-05 | 2011-06-22 | 三星电子株式会社 | Speech signal pre-processing system and method of extracting characteristic information of speech signal |
| CN1835075B (en) * | 2006-04-07 | 2011-06-29 | 安徽中科大讯飞信息科技有限公司 | Speech synthetizing method combined natural sample selection and acaustic parameter to build mould |
| CN101727902B (en) * | 2008-10-29 | 2011-08-10 | 中国科学院自动化研究所 | Method for estimating tone |
| CN102163428A (en) * | 2011-01-19 | 2011-08-24 | 无敌科技(西安)有限公司 | Method for judging Chinese pronunciation |
| CN102176313B (en) * | 2009-10-10 | 2012-07-25 | 北京理工大学 | Formant-frequency-based Mandarin single final vioce visualizing method |
| WO2012103686A1 (en) * | 2011-02-01 | 2012-08-09 | Huawei Technologies Co., Ltd. | Method and apparatus for providing signal processing coefficients |
| CN102842305A (en) * | 2011-06-22 | 2012-12-26 | 华为技术有限公司 | Method and device for detecting keynote |
| CN103366736A (en) * | 2012-03-29 | 2013-10-23 | 北京中传天籁数字技术有限公司 | Phonetic tone identification method and phonetic tone identification apparatus |
| CN103680518A (en) * | 2013-12-20 | 2014-03-26 | 上海电机学院 | Voice gender recognition method and system based on virtual instrument technology |
| CN103871417A (en) * | 2014-03-25 | 2014-06-18 | 北京工业大学 | Specific continuous voice filtering method and device of mobile phone |
| CN104200818A (en) * | 2014-08-06 | 2014-12-10 | 重庆邮电大学 | Pitch detection method |
| CN104217722A (en) * | 2014-08-22 | 2014-12-17 | 哈尔滨工程大学 | Dolphin whistle signal spectrum contour extraction method |
| CN104251934A (en) * | 2013-06-26 | 2014-12-31 | 华为技术有限公司 | Harmonic analysis method and apparatus, and method and apparatus for determining clutter in harmonic wave |
| CN106205638A (en) * | 2016-06-16 | 2016-12-07 | 清华大学 | A kind of double-deck fundamental tone feature extracting method towards audio event detection |
| CN103943104B (en) * | 2014-04-15 | 2017-03-01 | 海信集团有限公司 | A kind of voice messaging knows method for distinguishing and terminal unit |
| CN106659428A (en) * | 2014-04-28 | 2017-05-10 | 麻省理工学院 | Vital signs monitoring via radio reflections |
| CN107045875A (en) * | 2016-02-03 | 2017-08-15 | 重庆工商职业学院 | Fundamental frequency detection method based on genetic algorithm |
| CN107833581A (en) * | 2017-10-20 | 2018-03-23 | 广州酷狗计算机科技有限公司 | A kind of method, apparatus and readable storage medium storing program for executing of the fundamental frequency for extracting sound |
| CN108447505A (en) * | 2018-05-25 | 2018-08-24 | 百度在线网络技术(北京)有限公司 | Audio signal zero-crossing rate processing method, device and speech recognition apparatus |
| CN109346109A (en) * | 2018-12-05 | 2019-02-15 | 百度在线网络技术(北京)有限公司 | Fundamental frequency extracting method and device |
| CN109410980A (en) * | 2016-01-22 | 2019-03-01 | 大连民族大学 | A kind of application of fundamental frequency estimation algorithm in the fundamental frequency estimation of all kinds of signals with harmonic structure |
| CN110379438A (en) * | 2019-07-24 | 2019-10-25 | 山东省计算中心(国家超级计算济南中心) | A kind of voice signal fundamental detection and extracting method and system |
| US10989803B1 (en) | 2017-08-21 | 2021-04-27 | Massachusetts Institute Of Technology | Security protocol for motion tracking systems |
-
2000
- 2000-09-13 CN CNB001247115A patent/CN1151490C/en not_active Expired - Lifetime
Cited By (40)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN100583235C (en) * | 2003-03-27 | 2010-01-20 | 法国电讯 | Method for analyzing fundamental frequency information and voice conversion method and system for implementing said analysis method |
| CN100580768C (en) * | 2005-08-08 | 2010-01-13 | 中国科学院声学研究所 | A voiced sound detection method based on harmonic characteristics |
| CN101051460B (en) * | 2006-04-05 | 2011-06-22 | 三星电子株式会社 | Speech signal pre-processing system and method of extracting characteristic information of speech signal |
| CN1835075B (en) * | 2006-04-07 | 2011-06-29 | 安徽中科大讯飞信息科技有限公司 | Speech synthetizing method combined natural sample selection and acaustic parameter to build mould |
| CN101030375B (en) * | 2007-04-13 | 2011-01-26 | 清华大学 | Method for extracting base-sound period based on dynamic plan |
| CN101727902B (en) * | 2008-10-29 | 2011-08-10 | 中国科学院自动化研究所 | Method for estimating tone |
| CN102176313B (en) * | 2009-10-10 | 2012-07-25 | 北京理工大学 | Formant-frequency-based Mandarin single final vioce visualizing method |
| CN102163428A (en) * | 2011-01-19 | 2011-08-24 | 无敌科技(西安)有限公司 | Method for judging Chinese pronunciation |
| WO2012103686A1 (en) * | 2011-02-01 | 2012-08-09 | Huawei Technologies Co., Ltd. | Method and apparatus for providing signal processing coefficients |
| CN102783034A (en) * | 2011-02-01 | 2012-11-14 | 华为技术有限公司 | Method and apparatus for providing signal processing coefficients |
| US9800453B2 (en) | 2011-02-01 | 2017-10-24 | Huawei Technologies Co., Ltd. | Method and apparatus for providing speech coding coefficients using re-sampled coefficients |
| CN102783034B (en) * | 2011-02-01 | 2014-12-17 | 华为技术有限公司 | Method and apparatus for providing signal processing coefficients |
| CN102842305A (en) * | 2011-06-22 | 2012-12-26 | 华为技术有限公司 | Method and device for detecting keynote |
| WO2012175054A1 (en) * | 2011-06-22 | 2012-12-27 | 华为技术有限公司 | Method and device for detecting fundamental tone |
| CN102842305B (en) * | 2011-06-22 | 2014-06-25 | 华为技术有限公司 | Method and device for detecting keynote |
| CN103366736A (en) * | 2012-03-29 | 2013-10-23 | 北京中传天籁数字技术有限公司 | Phonetic tone identification method and phonetic tone identification apparatus |
| WO2014206265A1 (en) * | 2013-06-26 | 2014-12-31 | 华为技术有限公司 | Harmonic analysis method and device and inter-harmonic clutter determination method and device |
| CN104251934B (en) * | 2013-06-26 | 2018-08-14 | 华为技术有限公司 | Harmonic analysis method and device and the method and apparatus for determining clutter between harmonic wave |
| CN104251934A (en) * | 2013-06-26 | 2014-12-31 | 华为技术有限公司 | Harmonic analysis method and apparatus, and method and apparatus for determining clutter in harmonic wave |
| CN103680518A (en) * | 2013-12-20 | 2014-03-26 | 上海电机学院 | Voice gender recognition method and system based on virtual instrument technology |
| CN103871417A (en) * | 2014-03-25 | 2014-06-18 | 北京工业大学 | Specific continuous voice filtering method and device of mobile phone |
| CN103943104B (en) * | 2014-04-15 | 2017-03-01 | 海信集团有限公司 | A kind of voice messaging knows method for distinguishing and terminal unit |
| CN106659428A (en) * | 2014-04-28 | 2017-05-10 | 麻省理工学院 | Vital signs monitoring via radio reflections |
| US10746852B2 (en) | 2014-04-28 | 2020-08-18 | Massachusetts Institute Of Technology | Vital signs monitoring via radio reflections |
| CN106659428B (en) * | 2014-04-28 | 2020-10-16 | 麻省理工学院 | Vital sign monitoring via radio reflection |
| CN104200818A (en) * | 2014-08-06 | 2014-12-10 | 重庆邮电大学 | Pitch detection method |
| CN104217722B (en) * | 2014-08-22 | 2017-07-11 | 哈尔滨工程大学 | A kind of dolphin whistle signal time-frequency spectrum contour extraction method |
| CN104217722A (en) * | 2014-08-22 | 2014-12-17 | 哈尔滨工程大学 | Dolphin whistle signal spectrum contour extraction method |
| CN109410980A (en) * | 2016-01-22 | 2019-03-01 | 大连民族大学 | A kind of application of fundamental frequency estimation algorithm in the fundamental frequency estimation of all kinds of signals with harmonic structure |
| CN107045875B (en) * | 2016-02-03 | 2019-12-06 | 重庆工商职业学院 | fundamental tone frequency detection method based on genetic algorithm |
| CN107045875A (en) * | 2016-02-03 | 2017-08-15 | 重庆工商职业学院 | Fundamental frequency detection method based on genetic algorithm |
| CN106205638A (en) * | 2016-06-16 | 2016-12-07 | 清华大学 | A kind of double-deck fundamental tone feature extracting method towards audio event detection |
| CN106205638B (en) * | 2016-06-16 | 2019-11-08 | 清华大学 | A two-layer pitch feature extraction method for audio event detection |
| US10989803B1 (en) | 2017-08-21 | 2021-04-27 | Massachusetts Institute Of Technology | Security protocol for motion tracking systems |
| CN107833581A (en) * | 2017-10-20 | 2018-03-23 | 广州酷狗计算机科技有限公司 | A kind of method, apparatus and readable storage medium storing program for executing of the fundamental frequency for extracting sound |
| CN107833581B (en) * | 2017-10-20 | 2021-04-13 | 广州酷狗计算机科技有限公司 | Method, device and readable storage medium for extracting fundamental tone frequency of sound |
| CN108447505A (en) * | 2018-05-25 | 2018-08-24 | 百度在线网络技术(北京)有限公司 | Audio signal zero-crossing rate processing method, device and speech recognition apparatus |
| CN109346109A (en) * | 2018-12-05 | 2019-02-15 | 百度在线网络技术(北京)有限公司 | Fundamental frequency extracting method and device |
| CN110379438B (en) * | 2019-07-24 | 2020-05-12 | 山东省计算中心(国家超级计算济南中心) | Method and system for detecting and extracting fundamental frequency of voice signal |
| CN110379438A (en) * | 2019-07-24 | 2019-10-25 | 山东省计算中心(国家超级计算济南中心) | A kind of voice signal fundamental detection and extracting method and system |
Also Published As
| Publication number | Publication date |
|---|---|
| CN1151490C (en) | 2004-05-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN1342968A (en) | High-accuracy high-resolution base frequency extracting method for speech recognization | |
| Drugman et al. | Glottal closure and opening instant detection from speech signals | |
| CN103617799B (en) | A kind of English statement pronunciation quality detection method being adapted to mobile device | |
| Deshmukh et al. | Use of temporal information: Detection of periodicity, aperiodicity, and pitch in speech | |
| CN103854662B (en) | Adaptive voice detection method based on multiple domain Combined estimator | |
| Dhingra et al. | Isolated speech recognition using MFCC and DTW | |
| CN108922563B (en) | Based on the visual verbal learning antidote of deviation organ morphology behavior | |
| Kane et al. | Identifying regions of non-modal phonation using features of the wavelet transform. | |
| CN109545188A (en) | A kind of real-time voice end-point detecting method and device | |
| Labied et al. | An overview of automatic speech recognition preprocessing techniques | |
| CN107610715A (en) | A kind of similarity calculating method based on muli-sounds feature | |
| CN103366735B (en) | The mapping method of speech data and device | |
| CN108305639B (en) | Speech emotion recognition method, computer-readable storage medium, and terminal | |
| CN105825852A (en) | Oral English reading test scoring method | |
| CN103366759A (en) | Speech data evaluation method and speech data evaluation device | |
| Zhou et al. | Classification of speech under stress based on features derived from the nonlinear Teager energy operator | |
| CN108682432B (en) | Voice emotion recognition device | |
| CN107564543A (en) | A kind of Speech Feature Extraction of high touch discrimination | |
| Rahman et al. | Dynamic time warping assisted svm classifier for bangla speech recognition | |
| Narayanan et al. | Speech rate estimation via temporal correlation and selected sub-band correlation | |
| Bouzid et al. | Voice source parameter measurement based on multi-scale analysis of electroglottographic signal | |
| Zhao et al. | A processing method for pitch smoothing based on autocorrelation and cepstral F0 detection approaches | |
| CN202758611U (en) | Speech data evaluation device | |
| Zolnay et al. | Extraction methods of voicing feature for robust speech recognition. | |
| Yadav et al. | Epoch detection from emotional speech signal using zero time windowing |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C06 | Publication | ||
| PB01 | Publication | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20020403 Assignee: The purple winter of Beijing is voice technology company limited with keen determination Assignor: Institute of Automation, Chinese Academy of Sciences Contract record no.: 2015110000014 Denomination of invention: High-accuracy high-resolution base frequency extracting method for speech recognization Granted publication date: 20040526 License type: Common License Record date: 20150519 |
|
| LICC | Enforcement, change and cancellation of record of contracts on the licence for exploitation of a patent or utility model | ||
| EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20020403 Assignee: Taro Technology (Hangzhou) Co., Ltd. Assignor: The purple winter of Beijing is voice technology company limited with keen determination Contract record no.: 2015110000050 Denomination of invention: High-accuracy high-resolution base frequency extracting method for speech recognization Granted publication date: 20040526 License type: Common License Record date: 20151130 |
|
| LICC | Enforcement, change and cancellation of record of contracts on the licence for exploitation of a patent or utility model | ||
| CX01 | Expiry of patent term | ||
| CX01 | Expiry of patent term |
Granted publication date: 20040526 |