CN110797009A - Aircraft cabin instruction recognition device for Pakistani-accented English - Google Patents

Info

Publication number
CN110797009A
Authority
CN
China
Prior art keywords
english
instruction
module
english instruction
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810781122.7A
Other languages
Chinese (zh)
Inventor
李曜
夏小春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Aviation Electric Co Ltd
Original Assignee
Shanghai Aviation Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Aviation Electric Co Ltd
Priority to CN201810781122.7A
Publication of CN110797009A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0635 Training updating or merging of old and new templates; Mean values; Weighting
    • G10L2015/0636 Threshold criteria for the updating

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Toys (AREA)

Abstract

The invention discloses an aircraft cockpit instruction recognition device for Pakistani-accented English, which mainly comprises a voice acquisition module, an English instruction recognition module, an input/output module, an audio data storage module and a model updating module. The audio data storage module receives the English instruction audio signal from the voice acquisition module and the English instruction recognition result from the English instruction recognition module and, in a non-model-updating state, decides whether to store or discard each audio signal and its recognition result according to the recognition confidence, thereby generating the data to be updated. In a model updating state, the model updating module receives the data to be updated from the audio data storage module and uses it to update the matching model used by the English instruction recognition engine built into the English instruction recognition module. The advantage of the invention is improved recognition accuracy for Pakistani-accented English with a general-purpose English model and recognition engine.

Description

Aircraft cabin instruction recognition device for Pakistani-accented English
Technical Field
The invention relates to an aircraft cockpit instruction recognition device for Pakistani-accented English.
Background
Voice is the most natural mode of human interaction, so voice input is also used in human-computer interaction; where key or touch-screen input is inconvenient, voice input can greatly improve input efficiency. Against this background, completing part of the cockpit operations by voice control instead of by hand is a natural functional upgrade. An aircraft cockpit instruction recognition device has a built-in speech recognition engine and model and recognizes the voice instructions it collects. When the voice instructions to be recognized are in English, the corresponding model is an English model. Just as Chinese is spoken in different regional dialects, English spoken in different regions has its own accents.
Disclosure of Invention
The invention addresses the following problem: because the pronunciation of Pakistani-accented English differs markedly in places from that of standard English (American English), Pakistani-accented English is recognized with low accuracy by a general-purpose English model and recognition engine. The invention provides a novel aircraft cockpit instruction recognition device for Pakistani-accented English.
To achieve this purpose, the technical solution of the invention is an aircraft cockpit instruction recognition device for Pakistani-accented English, comprising:
a voice acquisition module for picking up externally provided English instruction speech and generating the corresponding English instruction audio signal, the English instruction speech including Pakistani-accented English;
an English instruction recognition module for receiving the English instruction audio signal from the voice acquisition module and matching the corresponding English instruction recognition result in a built-in English instruction recognition engine according to the English instruction audio signal;
an input/output module for receiving, in a non-model-updating state, the English instruction recognition result from the English instruction recognition module and matching the corresponding instruction code in a built-in instruction code query engine according to the English instruction recognition result, so as to output an instruction externally;
an audio data storage module for receiving the English instruction audio signal from the voice acquisition module and the English instruction recognition result from the English instruction recognition module and, in a non-model-updating state, deciding according to the recognition confidence whether to store or discard the English instruction audio signal and the English instruction recognition result, so as to generate the data to be updated; and
a model updating module for receiving, in a model updating state, the data to be updated from the audio data storage module and updating, according to that data, the matching model used by the English instruction recognition engine built into the English instruction recognition module.
In a preferred embodiment, the aircraft cockpit instruction recognition device for Pakistani-accented English further comprises
a keyboard, mouse and screen display module for selectively switching the instruction recognition device to the model updating state or the non-model-updating state.
In a preferred embodiment, the aircraft cockpit instruction recognition device for Pakistani-accented English further comprises
a power supply module for supplying power to the voice acquisition module, the English instruction recognition module, the input/output module, the audio data storage module, the model updating module and the keyboard, mouse and screen display module.
The invention also provides an aircraft cockpit instruction recognition method for Pakistani-accented English, comprising the following steps:
step S1, providing the instruction recognition device;
step S2, the voice acquisition module picks up externally provided English instruction speech and generates the corresponding English instruction audio signal;
step S3, the English instruction recognition module receives the English instruction audio signal from the voice acquisition module and matches the corresponding English instruction recognition result in the built-in English instruction recognition engine according to the English instruction audio signal; and
step S4, the input/output module receives the English instruction recognition result from the English instruction recognition module and matches the corresponding instruction code in the built-in instruction code query engine according to the English instruction recognition result, so as to output an instruction externally.
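Step S4 can be pictured as a table lookup from the recognized text to an instruction code. The following is a minimal sketch only: the patent does not specify how the instruction code query engine works, and the table contents, the code values and the name `lookup_instruction_code` are hypothetical.

```python
from typing import Mapping, Optional

# Hypothetical table: recognized English instruction -> instruction code.
# Neither the phrases nor the code values come from the patent.
INSTRUCTION_CODES = {
    "gear up": 0x01,
    "gear down": 0x02,
    "flaps ten": 0x03,
}


def lookup_instruction_code(
    recognition_result: str,
    table: Mapping[str, int] = INSTRUCTION_CODES,
) -> Optional[int]:
    """Return the code for a recognized instruction, or None if it is unknown."""
    # Lower-case and collapse whitespace so small formatting differences
    # in the recognizer output do not cause a miss.
    key = " ".join(recognition_result.lower().split())
    return table.get(key)
```

Returning `None` for an unknown phrase leaves the decision about what to output (nothing, or an error indication) to the caller, which the source does not describe.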
The invention also provides an aircraft cockpit instruction recognition method for Pakistani-accented English, comprising the following steps:
step S1, providing the instruction recognition device;
step S2, the voice acquisition module picks up externally provided English instruction speech and generates the corresponding English instruction audio signal;
step S3, the English instruction recognition module receives the English instruction audio signal from the voice acquisition module and matches the corresponding English instruction recognition result in the built-in English instruction recognition engine according to the English instruction audio signal;
step S4, the audio data storage module receives the English instruction audio signal from the voice acquisition module and the English instruction recognition result from the English instruction recognition module and, in a model updating state, decides according to the recognition confidence whether to store or discard the English instruction audio signal and the English instruction recognition result, so as to generate the data to be updated; and
step S5, the model updating module, in the model updating state, receives the data to be updated from the audio data storage module and updates, according to that data, the matching model used by the English instruction recognition engine built into the English instruction recognition module.
Compared with the prior art, the invention has the advantage of improved recognition accuracy for Pakistani-accented English with a general-purpose English model and recognition engine.
Drawings
Fig. 1 is a schematic structural diagram according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail below with reference to specific embodiments and drawings.
Fig. 1 shows an aircraft cockpit instruction recognition device for Pakistani-accented English. The device comprises a voice acquisition module 1, an English instruction recognition module 2, a model updating module 3, a data storage module 4, an instruction input/output module 5, a keyboard, mouse and screen display module 6, a device control switch 7 and a power supply module 8.
The complete instruction recognition and model update process is described as follows:
1. The whole aircraft cockpit English instruction recognition device is switched on through the control switch 7, and the power supply module 8 starts supplying power to the recognition device.
2. The user speaks a specific English instruction; the voice acquisition module 1 picks it up and sends the processed audio signal to the English instruction recognition module 2 and the data storage module 4.
3. The English instruction recognition module 2 recognizes the input audio signal and sends the recognition result to the instruction input/output module 5 and the data storage module 4.
4. The input/output module 5 matches the corresponding instruction code according to the recognition result and outputs the instruction code externally.
5. The data storage module 4 decides, according to the confidence of the recognition result, whether to store or discard the received audio data and recognition result. If they are stored, the two corresponding items are saved together as labeled data for the model update.
6. When the keyboard, mouse and screen display module 6 is connected, the model updating state can be entered by keyboard and mouse operation. In the model updating state, the user can make recordings repeatedly, following the prompts on the screen display interface. Every recording goes through steps 2, 3 and 5, and the valid data is stored in the data storage module 4.
7. When the stored valid data reaches a certain amount, the model updating state can be entered by keyboard and mouse operation. In the model updating state, the user updates the English model used in the English instruction recognition module 2 with the model update button on the screen display interface. Note that this step and English instruction recognition cannot be performed at the same time, so the user should schedule a suitable time for the update.
8. The whole aircraft cockpit English instruction recognition device is switched off through the control switch 7, and the power supply module 8 stops supplying power to the recognition device.
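Steps 2 to 7 above can be sketched as a small state-gated pipeline. This is an illustrative sketch under stated assumptions, not the patented implementation: the class `CabinCommandDevice`, the `Mode` enum, the `min_samples` gate and the stub recognizer `fake_recognize` are all hypothetical, and the model update itself is reduced to a version counter because the source does not describe the adaptation algorithm.

```python
from enum import Enum


class Mode(Enum):
    NORMAL = "non-model-updating"
    UPDATING = "model-updating"


def fake_recognize(audio: bytes):
    """Stub engine (assumption): treats the bytes as text, fixed confidence 95."""
    return audio.decode("ascii"), 95.0


class CabinCommandDevice:
    def __init__(self, recognize, codes, threshold=90.0, min_samples=3):
        self.recognize = recognize      # callable: audio -> (text, confidence)
        self.codes = codes              # recognized text -> instruction code
        self.threshold = threshold      # confidence threshold on a 0-100 scale
        self.min_samples = min_samples  # "a certain amount" of data (assumed value)
        self.mode = Mode.NORMAL
        self.stored = []                # (audio, text) pairs kept as labeled data
        self.model_version = 0          # stands in for the real matching model

    def handle_audio(self, audio: bytes):
        text, confidence = self.recognize(audio)   # step 3
        if confidence > self.threshold:             # step 5: store or discard
            self.stored.append((audio, text))
        if self.mode is Mode.NORMAL:                # step 4: output only in normal use
            return self.codes.get(text)
        return None

    def update_model(self) -> bool:
        """Step 7: update only in the model updating state with enough data."""
        if self.mode is not Mode.UPDATING:
            raise RuntimeError("model update requires the model updating state")
        if len(self.stored) < self.min_samples:
            return False
        self.model_version += 1   # placeholder for the actual model adaptation
        self.stored.clear()
        return True
```

Raising an error when `update_model` is called outside the model updating state mirrors the note in step 7 that updating and recognition cannot run at the same time.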
The confidence is a score that the built-in English instruction recognition engine produces during matching (the score range can be normalized to 0-100 points); its magnitude indicates how reliable the matching result is. A threshold is set (for example, 90): English instruction audio signals and recognition results whose matching confidence is above the threshold are stored, and those whose confidence is below it are discarded. This strategy keeps the stored data of good quality and suitable for use as update data. Because the device's users are predominantly Pakistani, a small amount of standard-accent English data from non-Pakistani users does not affect the subsequent model update.
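The store-or-discard rule can be sketched in a few lines. The sketch assumes a linear rescaling of the raw engine score onto 0-100 and treats a score exactly equal to the threshold as discarded; the source specifies neither point, and the names `Recognition`, `normalize_score` and `select_update_data` are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Recognition:
    audio: bytes       # English instruction audio signal
    text: str          # English instruction recognition result
    confidence: float  # engine score, already normalized to 0-100


def normalize_score(raw: float, lo: float, hi: float) -> float:
    """Linearly map a raw score in [lo, hi] onto 0-100, clamping outliers."""
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    scaled = (raw - lo) / (hi - lo) * 100.0
    return max(0.0, min(100.0, scaled))


def select_update_data(
    items: Iterable[Recognition], threshold: float = 90.0
) -> List[Recognition]:
    """Keep pairs whose confidence is above the threshold; discard the rest.

    Assumption: a score equal to the threshold is discarded, since the
    source only says "higher than" and "lower than".
    """
    return [r for r in items if r.confidence > threshold]
```

A high threshold trades quantity for label quality: because the recognition result itself serves as the label of the stored audio, only results the engine is already confident about are trusted as update data.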
The foregoing represents only embodiments of the present invention. Although they are described specifically and in detail, they should not be construed as limiting the scope of the invention. A person skilled in the art can make several variations and modifications without departing from the inventive concept, and these all fall within the scope of protection of the invention. The scope of protection of this patent is therefore defined by the appended claims.

Claims (5)

1. An aircraft cockpit instruction recognition device for Pakistani-accented English, characterized by comprising:
a voice acquisition module for picking up externally provided English instruction speech and generating the corresponding English instruction audio signal, the English instruction speech being Pakistani-accented English;
an English instruction recognition module for receiving the English instruction audio signal from the voice acquisition module and matching the corresponding English instruction recognition result in a built-in English instruction recognition engine according to the English instruction audio signal;
an input/output module for receiving, in a non-model-updating state, the English instruction recognition result from the English instruction recognition module and matching the corresponding instruction code in a built-in instruction code query engine according to the English instruction recognition result, so as to output an instruction externally;
an audio data storage module for receiving the English instruction audio signal from the voice acquisition module and the English instruction recognition result from the English instruction recognition module and, in a non-model-updating state, deciding according to the recognition confidence whether to store or discard the English instruction audio signal and the English instruction recognition result, so as to generate the data to be updated; and
a model updating module for receiving, in a model updating state, the data to be updated from the audio data storage module and updating, according to that data, the matching model used by the English instruction recognition engine built into the English instruction recognition module.
2. The aircraft cockpit instruction recognition device of claim 1, further comprising
a keyboard, mouse and screen display module for selectively switching the instruction recognition device to the model updating state or the non-model-updating state.
3. The aircraft cockpit instruction recognition device of claim 2, further comprising
a power supply module for supplying power to the voice acquisition module, the English instruction recognition module, the input/output module, the audio data storage module, the model updating module and the keyboard, mouse and screen display module.
4. An aircraft cockpit instruction recognition method for Pakistani-accented English, characterized by comprising the following steps:
step S1, providing the instruction recognition device of any one of claims 1 to 3;
step S2, the voice acquisition module picks up externally provided English instruction speech and generates the corresponding English instruction audio signal;
step S3, the English instruction recognition module receives the English instruction audio signal from the voice acquisition module and matches the corresponding English instruction recognition result in the built-in English instruction recognition engine according to the English instruction audio signal; and
step S4, the input/output module receives the English instruction recognition result from the English instruction recognition module and matches the corresponding instruction code in the built-in instruction code query engine according to the English instruction recognition result, so as to output an instruction externally.
5. An aircraft cockpit instruction recognition method for Pakistani-accented English, characterized by comprising the following steps:
step S1, providing the instruction recognition device of any one of claims 1 to 3;
step S2, the voice acquisition module picks up externally provided English instruction speech and generates the corresponding English instruction audio signal;
step S3, the English instruction recognition module receives the English instruction audio signal from the voice acquisition module and matches the corresponding English instruction recognition result in the built-in English instruction recognition engine according to the English instruction audio signal;
step S4, the audio data storage module receives the English instruction audio signal from the voice acquisition module and the English instruction recognition result from the English instruction recognition module and, in a model updating state, decides according to the recognition confidence whether to store or discard the English instruction audio signal and the English instruction recognition result, so as to generate the data to be updated; and
step S5, the model updating module, in the model updating state, receives the data to be updated from the audio data storage module and updates, according to that data, the matching model used by the English instruction recognition engine built into the English instruction recognition module.
CN201810781122.7A 2018-07-17 2018-07-17 Aircraft cabin instruction recognition device for Pakistani-accented English Pending CN110797009A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810781122.7A CN110797009A (en) 2018-07-17 2018-07-17 Aircraft cabin instruction recognition device for Pakistani-accented English

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810781122.7A CN110797009A (en) 2018-07-17 2018-07-17 Aircraft cabin instruction recognition device for Pakistani-accented English

Publications (1)

Publication Number Publication Date
CN110797009A true CN110797009A (en) 2020-02-14

Family

ID=69424958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810781122.7A Pending CN110797009A (en) 2018-07-17 2018-07-17 Aircraft cabin instruction recognition device for Pakistani-accented English

Country Status (1)

Country Link
CN (1) CN110797009A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2642482A1 (en) * 2012-03-23 2013-09-25 Tata Consultancy Services Limited Speech processing method and system adapted to non-native speaker pronunciation
CN103632668A (en) * 2012-08-21 2014-03-12 北京百度网讯科技有限公司 Method and apparatus for training English voice model based on Chinese voice information
CN104575516A (en) * 2013-10-07 2015-04-29 霍尼韦尔国际公司 Systems and methods for correcting accent-induced speech in an aircraft cockpit using a dynamic speech database
CN104570835A (en) * 2014-12-02 2015-04-29 苏州长风航空电子有限公司 Cockpit voice command control system and operating method thereof
CN105408952A (en) * 2013-02-21 2016-03-16 谷歌技术控股有限责任公司 Recognizing accented speech
US20170256254A1 (en) * 2016-03-04 2017-09-07 Microsoft Technology Licensing, Llc Modular deep learning model
US9786271B1 (en) * 2016-09-28 2017-10-10 International Business Machines Corporation Voice pattern coding sequence and cataloging voice matching system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
苏意玲等: "英语作为第二语言的多媒体语音数据库设计制作及初步测试", 《贵州大学学报(自然科学版)》 *

Similar Documents

Publication Publication Date Title
CN108520743B (en) Voice control method of intelligent device, intelligent device and computer readable medium
EP0840288B1 (en) Method and system for editing phrases during continuous speech recognition
US6363347B1 (en) Method and system for displaying a variable number of alternative words during speech recognition
CN105501121B (en) A kind of intelligence awakening method and system
US5829000A (en) Method and system for correcting misrecognized spoken words or phrases
CN107301865B (en) Method and device for determining interactive text in voice input
US5899976A (en) Method and system for buffering recognized words during speech recognition
KR102007478B1 (en) Device and method for controlling application using speech recognition under predetermined condition
CN206595039U (en) A kind of interactive system for vehicle-mounted voice
EP1346343A1 (en) Speech recognition using word-in-phrase command
CN109243462A (en) Voice awakening method and device
WO2001084535A2 (en) Error correction in speech recognition
US10714082B2 (en) Information processing apparatus, information processing method, and program
JPWO2016103415A1 (en) Head mounted display system and method for operating head mounted display device
JP2016521383A (en) Method, apparatus and computer readable recording medium for improving a set of at least one semantic unit
CN115410572A (en) Voice interaction method, device, terminal, storage medium and program product
US10847154B2 (en) Information processing device, information processing method, and program
CN104424942A (en) Method for improving character speed input accuracy
KR101385012B1 (en) Multi-modal input device using handwriting and voice recognition and control method thereof
CN110797009A (en) Aircraft cabin instruction recognition device for Pakistani-accented English
KR101250897B1 (en) Apparatus for word entry searching in a portable electronic dictionary and method thereof
EP0840287A2 (en) Method and system for selecting recognized words when correcting recognized speech
CN111261164A (en) Unmanned aerial vehicle speech recognition controlling means
US20220100959A1 (en) Conversation support device, conversation support system, conversation support method, and storage medium
CN112102812B (en) Anti-false wake-up method based on multiple acoustic models

Legal Events

Code Title
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200214