[go: up one dir, main page]

CN106409297A - Voice recognition method - Google Patents

Voice recognition method Download PDF

Info

Publication number
CN106409297A
CN106409297A CN201610906925.1A CN201610906925A CN106409297A CN 106409297 A CN106409297 A CN 106409297A CN 201610906925 A CN201610906925 A CN 201610906925A CN 106409297 A CN106409297 A CN 106409297A
Authority
CN
China
Prior art keywords
digital information
voice
eigenvalue
identification
recognition method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610906925.1A
Other languages
Chinese (zh)
Inventor
李让剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Tianda Network Technology Co Ltd
Original Assignee
Anhui Tianda Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Tianda Network Technology Co Ltd filed Critical Anhui Tianda Network Technology Co Ltd
Priority to CN201610906925.1A priority Critical patent/CN106409297A/en
Publication of CN106409297A publication Critical patent/CN106409297A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a voice recognition method comprising the following steps that a recognized analog voice signal is digitalized; digital information is preprocessed; feature selection is performed on the preprocessed digital information, the feature values of voice required to be recognized are selected, and information including the feature values is extracted from the preprocessed digital information; the feature values required to be recognized are mapped to a feature space from an object space in the feature value extraction process; the digital information of the feature values required to be recognized is reserved in the preprocessed digital information, and other digital information is eliminated so that the digital information required to be recognized is obtained; and the digital information required to be recognized is restored into the analog voice signal. According to the computer-based voice recognition method, the voice is digitalized, feature selection is performed on the digitalized voice, and the voice is recognized through the feature values so that the recognition accuracy and the recognition speed can be greatly enhanced.

Description

A kind of audio recognition method
Technical field
The present invention relates to voice technology field, particularly a kind of digitized audio recognition method.
Background technology
Voice, as analogue signal, starts appearance in recent years and carries out speech recognition by computer, but is difficult to logical at present Cross computer and be digitized identifying, computer digital process is all high than traditional mode in processing speed, accuracy, There is presently no the means by digitized processing.
Content of the invention
For solving above-mentioned technical problem, the invention provides a kind of audio recognition method, it comprises the following steps:
Identified analog voice signal is digitized, becomes and turn to the digital information being suitable to computer disposal;
Described digital information is carried out with pretreatment, removes the interference information being mixed into and reduce deformation or distortion;
Pretreated digital information is carried out with feature selection, selects to need the eigenvalue of the voice of identification, in pretreatment The information comprising described eigenvalue is extracted in digital information afterwards;
The eigenvalue needing identification is mapped to feature space so that each needs from object space by characteristics extraction process One of eigenvalue available feature space of identification point represents;
By the described digital information by retaining the eigenvalue needing identification in pretreated digital information, other are counted Word information rejects the digital information obtaining needs identification;
The described digital information needing identification is carried out being reduced into analog voice signal, the analog voice letter after described reduction Number it is the final voice needing identification.
It is preferred that the digitized process of described voice is voice signal to be digitized sample.
It is preferred that described eigenvalue includes the amplitude of sound, frequency.
The invention has the advantages that:
The computer based audio recognition method that the present invention provides passes through voice digitization, by digitized language Sound carries out feature selection, by eigenvalue, voice is identified, and recognition accuracy is all greatly improved with recognition speed.
Certainly, the arbitrary product implementing the present invention it is not absolutely required to reach all the above advantage simultaneously.
Specific embodiment
Below in conjunction with the embodiment of the present invention, the technical scheme in the present invention is clearly and completely described it is clear that institute The embodiment of description is only a part of embodiment of the present invention, rather than whole embodiments.Based on the embodiment in the present invention, All other embodiment that those of ordinary skill in the art are obtained under the premise of not making creative work, broadly falls into this The scope of bright protection.
Embodiments provide a kind of audio recognition method, it comprises the following steps:
Identified analog voice signal is digitized, becomes and turn to the digital information being suitable to computer disposal;
Described digital information is carried out with pretreatment, removes the interference information being mixed into and reduce deformation or distortion;
Pretreated digital information is carried out with feature selection, selects to need the eigenvalue of the voice of identification, in pretreatment The information comprising described eigenvalue is extracted in digital information afterwards;
The eigenvalue needing identification is mapped to feature space so that each needs from object space by characteristics extraction process One of eigenvalue available feature space of identification point represents;
By the described digital information by retaining the eigenvalue needing identification in pretreated digital information, other are counted Word information rejects the digital information obtaining needs identification;
The described digital information needing identification is carried out being reduced into analog voice signal, the analog voice letter after described reduction Number it is the final voice needing identification.
In the present embodiment, the digitized process of described voice is voice signal to be digitized sample.
Wherein said eigenvalue includes the amplitude of sound, frequency.
Present invention disclosed above preferred embodiment is only intended to help illustrate the present invention.Preferred embodiment is not detailed Describe all of details, also do not limit the specific embodiment that this invention is only described.Obviously, the content according to this specification, Can make many modifications and variations.This specification is chosen and is specifically described these embodiments, is to preferably explain the present invention Principle and practical application so that skilled artisan can be best understood by and utilize the present invention.The present invention is only Limited by claims and its four corner and equivalent.

Claims (3)

1. a kind of audio recognition method is it is characterised in that comprise the following steps:
Identified analog voice signal is digitized, becomes and turn to the digital information being suitable to computer disposal;
Described digital information is carried out with pretreatment, removes the interference information being mixed into and reduce deformation or distortion;
Pretreated digital information is carried out with feature selection, selects to need the eigenvalue of the voice of identification, after the pre-treatment The information comprising described eigenvalue is extracted in digital information;
The eigenvalue needing identification is mapped to feature space so that each needs to identify from object space by characteristics extraction process One of eigenvalue available feature space point represent;
By the described digital information by retaining the eigenvalue needing identification in pretreated digital information, by other numeral letters Breath rejects the digital information obtaining needs identification;
The described digital information needing identification is carried out being reduced into analog voice signal, the analog voice signal after described reduction is For the final voice needing identification.
2. audio recognition method as claimed in claim 1 is it is characterised in that the digitized process of described voice is that voice is believed Number it is digitized sampling.
3. audio recognition method as claimed in claim 1 is it is characterised in that described eigenvalue includes the amplitude of sound, frequency.
CN201610906925.1A 2016-10-18 2016-10-18 Voice recognition method Pending CN106409297A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610906925.1A CN106409297A (en) 2016-10-18 2016-10-18 Voice recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610906925.1A CN106409297A (en) 2016-10-18 2016-10-18 Voice recognition method

Publications (1)

Publication Number Publication Date
CN106409297A true CN106409297A (en) 2017-02-15

Family

ID=58012347

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610906925.1A Pending CN106409297A (en) 2016-10-18 2016-10-18 Voice recognition method

Country Status (1)

Country Link
CN (1) CN106409297A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1104925A1 (en) * 1999-12-03 2001-06-06 Siemens Aktiengesellschaft Method for processing speech signals by substracting a noise function
US20080228478A1 (en) * 2005-06-15 2008-09-18 Qnx Software Systems (Wavemakers), Inc. Targeted speech
CN101807398A (en) * 2009-02-16 2010-08-18 宏正自动科技股份有限公司 Speech recognition device and operating method thereof
CN103761974A (en) * 2014-01-28 2014-04-30 上海力声特医学科技有限公司 Cochlear implant
CN104810024A (en) * 2014-01-28 2015-07-29 上海力声特医学科技有限公司 Double-path microphone speech noise reduction treatment method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1104925A1 (en) * 1999-12-03 2001-06-06 Siemens Aktiengesellschaft Method for processing speech signals by substracting a noise function
US20080228478A1 (en) * 2005-06-15 2008-09-18 Qnx Software Systems (Wavemakers), Inc. Targeted speech
CN101807398A (en) * 2009-02-16 2010-08-18 宏正自动科技股份有限公司 Speech recognition device and operating method thereof
CN103761974A (en) * 2014-01-28 2014-04-30 上海力声特医学科技有限公司 Cochlear implant
CN104810024A (en) * 2014-01-28 2015-07-29 上海力声特医学科技有限公司 Double-path microphone speech noise reduction treatment method and system

Similar Documents

Publication Publication Date Title
CN108305615B (en) Object identification method and device, storage medium and terminal thereof
CN110473566A (en) Audio separation method, device, electronic equipment and computer readable storage medium
CN108831440A (en) A kind of vocal print noise-reduction method and system based on machine learning and deep learning
WO2010148141A3 (en) Apparatus and method for speech analysis
CN101887722A (en) Rapid voiceprint authentication method
CN107274911A (en) A kind of similarity analysis method based on sound characteristic
CN111081223B (en) Voice recognition method, device, equipment and storage medium
CN107705791A (en) Caller identity confirmation method, device and Voiceprint Recognition System based on Application on Voiceprint Recognition
CN106782521A (en) A kind of speech recognition system
CN105227966A (en) To televise control method, server and control system of televising
CN110782915A (en) Waveform music component separation method based on deep learning
CN112349267A (en) Synthesized voice detection method based on attention mechanism characteristics
CN108665901B (en) Phoneme/syllable extraction method and device
CN106409297A (en) Voice recognition method
CN106228984A (en) Voice recognition information acquisition methods
CN114093369B (en) A method, device, electronic device and storage medium for separating talkers
CN106485280A (en) A kind of computer based image-recognizing method
CN106653040A (en) Voice audio signal sampling processing method
Dutta Text dependent speaker identification based on spectrograms
CN110265049A (en) A kind of audio recognition method and speech recognition system
CN104318931A (en) Emotional activity obtaining method and apparatus of audio file, and classification method and apparatus of audio file
Khonglah et al. Indoor/Outdoor Audio Classification Using Foreground Speech Segmentation.
Tahliramani et al. Performance analysis of speaker identification system with and without spoofing attack of voice conversion
CN102592592A (en) Voice data extraction method and device
CN113571054B (en) Speech recognition signal preprocessing method, device, equipment and computer storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170215