CN108492833A

CN108492833A - Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium

Info

Publication number: CN108492833A
Application number: CN201810277436.3A
Authority: CN
Inventors: 杨威
Original assignee: Jiangxi University of Technology
Current assignee: Jiangxi University of Technology
Priority date: 2018-03-30
Filing date: 2018-03-30
Publication date: 2018-09-04

Abstract

The present invention provides a kind of voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium, methods to include：Start timing when receiving voice input instruction, obtains current speech data in real time, and according to current speech data Dynamic Announce voice input picture；It when receiving Speech Record stop instruction, closes voice input picture and stops the acquisition of current speech data, to obtain voice input data, format conversion is carried out to voice input data, and transformed voice input data are stored；The preset icon image being locally stored is obtained, and obtains the sonic data in voice input data and voice input time；Sonic data and voice input time are shown on preset icon image by the way of sound spectrum image with mode that numerical value is shown respectively, the mode that the present invention is shown by using the mode and numerical value of sound spectrum image, so as to be distinctly displayed to voice input data, differentiation of the user to different voice input data is facilitated.

Description

Voice information collection method, instant messaging system, mobile terminal and storage medium

技术领域technical field

本发明涉及通信技术领域，特别涉及一种语音信息采集方法、即时通信系统、移动终端及存储介质。The invention relates to the technical field of communication, in particular to a voice information collection method, an instant communication system, a mobile terminal and a storage medium.

背景技术Background technique

语音采集是语音识别的前置阶段，通过对用户发音进行语音数据采集，提取所采集的语音数据的语音特征，根据所提取的语音特征进行语音识别，可实现用户发音内容的确定，识别用户身份等目的。目前的语音采集方式为，使用设置于终端设备(如智能手机、平板电脑等用户设备)上的语音采集装置(如麦克风等)对用户发音进行采集，得到语音数据，而后对所采集的语音数据进行特征提取，语音信息采集方法在即时通信系统的使用中尤为频繁，因此人们对语音信息采集方法的便利性提出了更高的要求。Speech collection is the pre-stage of speech recognition. By collecting speech data from the user's pronunciation, extracting the speech features of the collected speech data, and performing speech recognition based on the extracted speech features, the content of the user's pronunciation can be determined and the identity of the user can be identified. etc. purpose. The current voice collection method is to use a voice collection device (such as a microphone, etc.) installed on a terminal device (such as a smart phone, a tablet computer, etc.) to collect the user's pronunciation, obtain voice data, and then analyze the collected voice data. For feature extraction, voice information collection methods are especially frequently used in instant messaging systems, so people have put forward higher requirements for the convenience of voice information collection methods.

现有的语音信息采集方法中当进行语音录入时，显示的录入画面较为单一，降低了用户体验，且现有的语音信息采集方法中，当完成语音数据的录入后进行语音存储时，不同的语音数据存储后采用的显示图标均相同，进而导致用户对不同的语音数据区分较为困难。In the existing voice information collection method, when voice input is performed, the displayed input screen is relatively single, which reduces user experience, and in the existing voice information collection method, when voice storage is performed after the voice data is input, different The display icons used after the voice data are stored are all the same, which makes it difficult for users to distinguish different voice data.

发明内容Contents of the invention

基于此，本发明实施例的目的在于提供一种方便用户对不同语音数据进行区分的语音信息采集方法、即时通信系统、移动终端及存储介质。Based on this, the purpose of the embodiments of the present invention is to provide a method for collecting voice information, an instant messaging system, a mobile terminal, and a storage medium that facilitate users to distinguish different voice data.

第一方面，本发明提供了一种语音信息采集方法，所述方法包括：In a first aspect, the present invention provides a method for collecting voice information, the method comprising:

当接收到语音录入指令时开始计时，实时获取当前语音数据，并根据所述当前语音数据中的动态特征动态显示语音录入画面；Start timing when receiving the voice input instruction, obtain the current voice data in real time, and dynamically display the voice input picture according to the dynamic characteristics in the current voice data;

当接收到语音录停指令时，关闭所述语音录入画面并停止所述当前语音数据的获取，以得到语音录入数据，对所述语音录入数据进行格式转换，并将转换后的所述语音录入数据进行存储；When receiving the voice recording stop instruction, close the voice recording screen and stop the acquisition of the current voice data to obtain the voice recording data, convert the format of the voice recording data, and record the converted voice data storage;

获取本地存储的预设图标图像，并获取所述语音录入数据中的声波数据和语音录入时间；Acquiring the locally stored preset icon image, and obtaining the sound wave data and voice recording time in the voice recording data;

将所述声波数据和所述语音录入时间依序对应采用声谱图像的方式和数值显示的方式分别在所述预设图标图像上进行显示。The sound wave data and the voice recording time are respectively displayed on the preset icon image in a manner corresponding to a sound spectrum image and a numerical value display manner.

上述语音信息采集方法，通过将所述声波数据采用声谱图像的方式进行显示，有效的将不同的所述语音录入数据进行了区别显示，进而方便了后续用户对不同的所述语音录入数据的区分，通过所述语音录入时间的获取和采用数值显示的设计，进一步方便了用户后续对不同的所述语音录入数据的区分，通过动态显示所述语音录入画面的设计，有效的提高了用户体验，防止了用户语音过程中的枯燥现象发生。The above-mentioned voice information collection method, by displaying the sound wave data in the form of a sound spectrum image, effectively distinguishes and displays the different voice input data, thereby facilitating the follow-up users to understand the different voice input data. Distinguish, through the acquisition of the voice input time and the design of numerical display, it is further convenient for users to distinguish different voice input data in the future, and the design of dynamically displaying the voice input screen effectively improves the user experience , to prevent the boring phenomenon in the user's speech process.

进一步地，所述根据所述当前语音数据中的动态特征动态显示语音录入画面：Further, the voice entry screen is dynamically displayed according to the dynamic features in the current voice data:

实时获取当前计时时间和本地存储的预设背景图像、录音图标，并根据所述当前计时时间、所述预设背景图像和所述录音图标进行图像显示；Acquiring the current counting time and locally stored preset background image and recording icon in real time, and performing image display according to the current counting time, the preset background image and the recording icon;

实时获取所述当前语音数据中的语音分贝信息，并根据所述语音分贝信息对所述录音图标进行动态渲染。The voice decibel information in the current voice data is acquired in real time, and the recording icon is dynamically rendered according to the voice decibel information.

进一步地，所述根据所述语音分贝信息对所述录音图标进行动态渲染的步骤包括：Further, the step of dynamically rendering the recording icon according to the voice decibel information includes:

判定所述语音分贝信息的分贝等级，并根据所述分贝等级判定渲染区域；Determine the decibel level of the voice decibel information, and determine the rendering area according to the decibel level;

获取本地存储的渲染颜色，并根据所述渲染颜色对所述渲染区域进行颜色渲染。The locally stored rendering color is acquired, and color rendering is performed on the rendering area according to the rendering color.

进一步地，所述实时获取当前语音数据的步骤之后，所述方法还包括：Further, after the step of acquiring current voice data in real time, the method further includes:

判断预设时间内接收到的所述当前语音数据中的当前分贝数是否持续小于预设分贝数；judging whether the current decibel number in the current voice data received within the preset time is continuously smaller than the preset decibel number;

若是，则关闭所述语音录入画面并停止所述当前语音数据的获取。If yes, then close the voice input screen and stop acquiring the current voice data.

进一步地，所述对所述语音录入数据进行格式转换，并将转换后的所述语音录入数据进行存储的步骤包括：Further, the step of converting the format of the voice input data and storing the converted voice input data includes:

将所述语音录入数据转换为amr格式，并对所述语音录入数据进行文字识别以获取特征词，所述特征词为所述语音录入数据中重复次数最多的词语；The voice input data is converted into amr format, and the voice input data is carried out text recognition to obtain feature words, and the feature words are the most repeated words in the voice input data;

根据所述特征词对格式转换后的所述语音录入数据进行重命名。and renaming the speech input data after format conversion according to the characteristic words.

将所述语音录入数据转换为amr格式，并获取当前时间；Convert the voice input data into the amr format, and obtain the current time;

根据所述当前时间对格式转换后的所述语音录入数据进行重命名。Renaming the voice input data after format conversion according to the current time.

第二方面，本发明提供了一种即时通信系统，包括：In a second aspect, the present invention provides an instant messaging system, comprising:

即时通信平台，用于接收用户的操作指令；The instant messaging platform is used to receive the user's operation instructions;

语音采集设备，与所述即时通信平台通信连接，用于根据所述即时通信平台接收到的所述操作指令，以对应进行语音数据的采集或语音数据的播放；A voice collection device, connected in communication with the instant messaging platform, used to collect voice data or play voice data correspondingly according to the operation instruction received by the instant messaging platform;

所述语音采集设备包括：The voice collection equipment includes:

第一获取模块，用于当接收到语音录入指令时开始计时，实时获取当前语音数据，并根据所述当前语音数据中的动态特征动态显示语音录入画面；The first acquisition module is used to start timing when receiving the voice input instruction, acquire current voice data in real time, and dynamically display the voice input picture according to the dynamic characteristics in the current voice data;

存储模块，用于当接收到语音录停指令时，关闭所述语音录入画面并停止所述当前语音数据的获取，以得到语音录入数据，对所述语音录入数据进行格式转换，并将转换后的所述语音录入数据进行存储；The storage module is used to close the voice input screen and stop the acquisition of the current voice data when receiving the voice recording stop instruction, so as to obtain the voice input data, convert the format of the voice input data, and convert the converted The said speech input data of is stored;

第二获取模块，用于获取本地存储的预设图标图像，并获取所述语音录入数据中的声波数据和语音录入时间；The second acquisition module is used to acquire the locally stored preset icon image, and acquire the sound wave data and voice input time in the voice input data;

显示模块，用于将所述声波数据和所述语音录入时间依序对应采用声谱图像的方式和数值显示的方式分别在所述预设图标图像上进行显示。The display module is used to display the sound wave data and the voice recording time on the preset icon image respectively in the manner of adopting sound spectrum image and the manner of numerical value display in sequence.

上述即时通信系统，通过所述显示模块将所述声波数据采用声谱图像的方式进行显示，有效的将不同的所述语音录入数据进行了区别显示，进而方便了后续用户对不同的所述语音录入数据的区分，通过所述显示模块对所述语音录入时间的获取和采用数值显示的设计，进一步方便了用户后续对不同的所述语音录入数据的区分，通过所述第一获取模块动态显示所述语音录入画面的设计，有效的提高了用户体验，防止了用户语音过程中的枯燥现象发生。The above-mentioned instant messaging system displays the sound wave data in the form of a sound spectrum image through the display module, effectively distinguishing and displaying different voice input data, thereby facilitating subsequent users to understand different voice data. The distinction of input data, through the acquisition of the voice input time by the display module and the design of numerical display, further facilitates the user's subsequent distinction of different voice input data, which is dynamically displayed by the first acquisition module The design of the voice input screen effectively improves the user experience and prevents boring phenomena during the voice process of the user.

进一步地，所述第一获取模块包括：Further, the first acquisition module includes:

第一获取单元，用于实时获取当前计时时间和本地存储的预设背景图像、录音图标，并根据所述当前计时时间、所述预设背景图像和所述录音图标进行图像显示；The first acquiring unit is used to acquire the current counting time and locally stored preset background image and recording icon in real time, and perform image display according to the current counting time, the preset background image and the recording icon;

第二获取单元，用于实时获取所述当前语音数据中的语音分贝信息，并根据所述语音分贝信息对所述录音图标进行动态渲染。The second acquisition unit is configured to acquire voice decibel information in the current voice data in real time, and dynamically render the recording icon according to the voice decibel information.

第三方面，本发明提供了一种移动终端，包括存储器以及处理器，所述存储器用于存储计算机程序，所述处理器运行所述计算机程序以使所述移动终端执行上述的语音信息采集方法。In a third aspect, the present invention provides a mobile terminal, including a memory and a processor, the memory is used to store a computer program, and the processor runs the computer program to enable the mobile terminal to execute the above voice information collection method .

第四方面，本发明提供了一种存储介质，其上存储有上述移动终端中所使用的计算机程序。In a fourth aspect, the present invention provides a storage medium on which the computer program used in the above mobile terminal is stored.

附图说明Description of drawings

图1为本发明第一实施例提供的语音信息采集方法的流程图；Fig. 1 is the flowchart of the voice information collecting method that the first embodiment of the present invention provides;

图2为本发明第二实施例提供的语音信息采集方法的流程图；Fig. 2 is the flowchart of the voice information collection method that the second embodiment of the present invention provides;

图3为图2中步骤S21的具体实施步骤的流程图；Fig. 3 is the flowchart of the specific implementation steps of step S21 in Fig. 2;

图4为本发明第三实施例提供的即时通信系统的结构示意图；FIG. 4 is a schematic structural diagram of an instant messaging system provided by a third embodiment of the present invention;

图5为本发明第四实施例提供的即时通信系统的结构示意图；FIG. 5 is a schematic structural diagram of an instant messaging system provided by a fourth embodiment of the present invention;

主要元素符号说明Description of main element symbols

即时通信系统instant messaging system 100100 即时通信平台instant messaging platform 101101 语音采集设备voice collection equipment 102102 第一获取模块first acquisition module 1010 第一获取单元first acquisition unit 1111 第二获取单元second acquisition unit 1212 判定单元Judgment unit 1313 渲染单元rendering unit 1414 存储模块storage module 2020 第一转换单元first conversion unit 21twenty one 第一命名单元first naming unit 22twenty two 第二转换单元second conversion unit 23twenty three 第二命名单元Second naming unit 24twenty four 第二获取模块Second acquisition module 3030 显示模块display module 3131 判断模块judgment module 4040 停止模块stop module 4141

具体实施方式Detailed ways

为了便于更好地理解本发明，下面将结合相关实施例附图对本发明进行进一步地解释。附图中给出了本发明的实施例，但本发明并不仅限于上述的优选实施例。相反，提供这些实施例的目的是为了使本发明的公开面更加得充分。In order to facilitate a better understanding of the present invention, the present invention will be further explained below in conjunction with the accompanying drawings of related embodiments. Embodiments of the invention are shown in the drawings, but the invention is not limited to the preferred embodiments described above. Rather, these embodiments are provided so that the disclosure of the invention will be more thorough.

请参阅图1，为本发明第一实施例提供的语音信息采集方法的流程图，包括步骤S10至S50。Please refer to FIG. 1 , which is a flow chart of a voice information collection method provided by the first embodiment of the present invention, including steps S10 to S50 .

步骤S10，当接收到语音录入指令时开始计时，实时获取当前语音数据，并根据所述当前语音数据中的动态特征动态显示语音录入画面；Step S10, start timing when the voice input instruction is received, obtain the current voice data in real time, and dynamically display the voice input screen according to the dynamic features in the current voice data;

其中，所述语音录入指令采用电信号、按键信号、无线信号或语音信号的方式进行传输，所述当前语音数据的获取通过激活麦克风的方式以进行获取，具体的，本实施例中通过设置动态特征，以控制显示设备上进行所述语音录入画面的动态显示，所述动态特征可以为时间特征或语音分贝特征等。Wherein, the voice input instruction is transmitted by way of electric signal, button signal, wireless signal or voice signal, and the acquisition of the current voice data is performed by activating the microphone. Specifically, in this embodiment, by setting the dynamic feature, to control the dynamic display of the voice input screen on the display device, and the dynamic feature may be a time feature or a voice decibel feature.

步骤S20，当接收到语音录停指令时，关闭所述语音录入画面并停止所述当前语音数据的获取，以得到语音录入数据；Step S20, when receiving the voice recording stop command, closing the voice recording screen and stopping the acquisition of the current voice data, so as to obtain the voice recording data;

其中，所述语音录停指令的传输方式与所述语音录入指令相同，当接收到用户发出的所述语音录停指令时，控制显示设备上停止所述语音录入画面的显示，并关闭麦克风对所述当亲语音数据的获取，此时，从开始进行语音录入至录入停止时获取到的数据为所述语音录入数据。Wherein, the transmission method of the voice recording stop instruction is the same as the voice input instruction, and when the voice recording stop instruction sent by the user is received, the display device is controlled to stop the display of the voice input screen, and the microphone is turned off. When the voice data is acquired, at this time, the data acquired from the start of the voice recording to the stop of the voice recording is the voice recording data.

步骤S30，对所述语音录入数据进行格式转换，并将转换后的所述语音录入数据进行存储；Step S30, performing format conversion on the voice input data, and storing the converted voice input data;

其中，由于所述语音录入数据的原始文件较大，因此采用格式转换的方式以降低所述语音录入数据文件的大小，方便了后续对所述语音录入数据的存储；Wherein, since the original file of the voice input data is relatively large, format conversion is adopted to reduce the size of the voice input data file, which facilitates subsequent storage of the voice input data;

优选的，当在本地完成对所述语音录入数据的格式转换和存储时，由后台线程将所述语音录入数据上传到一服务器，并控制服务器端接收程序在接收录音文件的同时进行加密处理，加密完成后以文件的形式存储到服务器，进而当本地的所述语音录入数据丢失或损坏时，可通过在服务器中进行文件下载，以进行所述语音录入数据的获取，进而提高了所述语音信息采集方法的安全性能。Preferably, when the format conversion and storage of the voice input data is completed locally, the background thread uploads the voice input data to a server, and controls the server-side receiving program to perform encryption processing while receiving the recording file, After the encryption is completed, it is stored in the server in the form of a file, and when the local voice input data is lost or damaged, the voice input data can be obtained by downloading the file in the server, thereby improving the voice input data. Security performance of information collection methods.

步骤S40，获取本地存储的预设图标图像，并获取所述语音录入数据中的声波数据和语音录入时间；Step S40, acquiring locally stored preset icon images, and acquiring the sound wave data and voice input time in the voice input data;

其中，所述预设图标图像为用户预先设置的图片，该图片可为本地图片或基于网络进行下载得到的图片，所述声波数据通过声谱仪进行获取，所述语音录入时间用过计时器进行获取。Wherein, the preset icon image is a picture preset by the user, and the picture can be a local picture or a picture downloaded based on the network, the sound wave data is obtained by a spectrometer, and the voice input time is used by a timer Get it.

步骤S50，将所述声波数据和所述语音录入时间依序对应采用声谱图像的方式和数值显示的方式分别在所述预设图标图像上进行显示；Step S50, displaying the sound wave data and the voice input time respectively on the preset icon image in the manner of corresponding to the sound spectrum image and the numerical display manner;

其中，由于不同所述声波数据对应的声谱图像均不同，因此本实施例中采用声谱图像显示的方式进行所述语音录入数据的区别显示，方便了用户对所述语音录入数据的区分；Wherein, since the sound spectrum images corresponding to different sound wave data are different, in this embodiment, the sound spectrum image display is used to display the difference of the voice input data, which is convenient for the user to distinguish the voice input data;

优选的，由于不可控因素导致不同所述语音录入数据中所述语音录入时间均不相同，因此可通过将所述语音录入时间进行数值显示的方式进行所述语音录入的区别显示。Preferably, the speech entry time in different speech entry data is different due to uncontrollable factors, so the speech entry time can be displayed in a numerical manner to distinguish the speech entry.

本实施例中，通过将所述声波数据采用声谱图像的方式进行显示，有效的将不同的所述语音录入数据进行了区别显示，进而方便了后续用户对不同的所述语音录入数据的区分，通过所述语音录入时间的获取和采用数值显示的设计，进一步方便了用户后续对不同的所述语音录入数据的区分，通过基于所述动态特征进行所述语音录入画面的动态显示的设计，有效的提高了用户体验，防止了用户语音过程中的枯燥现象发生。In this embodiment, by displaying the sound wave data in the form of a sound spectrum image, different voice input data are effectively displayed differently, thereby facilitating subsequent users to distinguish different voice input data , through the acquisition of the voice input time and the design of numerical display, it further facilitates the user's subsequent distinction of different voice input data, and through the design of the dynamic display of the voice input screen based on the dynamic characteristics, The user experience is effectively improved, and the boring phenomenon in the user's voice process is prevented.

请参阅图2，为本发明第二实施例提供的语音信息采集方法的流程图，所述方法包括步骤S11至S71。Please refer to FIG. 2 , which is a flow chart of a voice information collection method provided by a second embodiment of the present invention, the method includes steps S11 to S71 .

步骤S11，当接收到语音录入指令时开始计时，实时获取当前语音数据，实时获取当前计时时间和本地存储的预设背景图像、录音图标，并根据所述当前计时时间、所述预设背景图像和所述录音图标进行图像显示；Step S11, start timing when the voice input instruction is received, obtain the current voice data in real time, obtain the current timing time and the preset background image and recording icon stored locally in real time, and according to the current timing time, the preset background image performing image display with the recording icon;

其中，所述语音录入指令采用电信号、按键信号、无线信号或语音信号的方式进行传输，所述当前语音数据的获取通过激活麦克风的方式以进行获取，通过所述当前计时时间的获取和显示，方便了用户的语音录入操作。Wherein, the voice input instruction is transmitted by way of electric signal, button signal, wireless signal or voice signal, the acquisition of the current voice data is acquired by activating the microphone, and the acquisition and display of the current counting time , which facilitates the user's voice input operation.

步骤S21，实时获取所述当前语音数据中的语音分贝信息，并根据所述语音分贝信息对所述录音图标进行动态渲染；Step S21, acquiring the voice decibel information in the current voice data in real time, and dynamically rendering the recording icon according to the voice decibel information;

其中，本实施例中通过设置动态特征，以控制显示设备上进行所述语音录入画面的动态显示，所述动态特征可以为时间特征或语音分贝特征等，由于用户在进行语音录入时所述语音分贝信息实时进行了变化，因此本实施例中根据该变化以对应进行所述录音图标的动态渲染，以防止用户语音过程中的枯燥现象发生。优选的，对所述录音图标的渲染可以采用颜色渲染或图像变化的方式进行动态渲染，以提高用户的观感体验。Wherein, in this embodiment, by setting the dynamic feature, to control the dynamic display of the voice input picture on the display device, the dynamic feature can be time feature or voice decibel feature, etc. The decibel information changes in real time, so in this embodiment, the dynamic rendering of the recording icon is performed correspondingly according to the change, so as to prevent the boring phenomenon during the user's speech process. Preferably, the rendering of the audio recording icon can be dynamically rendered in the manner of color rendering or image change, so as to improve the visual experience of the user.

请参阅图3，为图2中步骤S21的具体实施步骤：Please refer to Fig. 3, which is the specific implementation steps of step S21 in Fig. 2:

步骤S210，判定所述语音分贝信息的分贝等级，并根据所述分贝等级判定渲染区域；Step S210, determine the decibel level of the voice decibel information, and determine the rendering area according to the decibel level;

其中，本地存储有渲染区域表，所述渲染区域表中存储有不同分贝等级对于的渲染坐标，以根据查询到的渲染坐标对应进行不同区域的渲染，例如当分贝等级为0时，录音图标两边显示灰色的弧线，随着判定的分贝等级的增大，逐渐进行不同区域的颜色渲染，以形成动态显示效果，提高了用户体验。Wherein, a rendering area table is stored locally, and the rendering coordinates of different decibel levels are stored in the rendering area table, so as to render different areas according to the queried rendering coordinates. For example, when the decibel level is 0, the recording icon on both sides The gray arc is displayed, and as the determined decibel level increases, the color rendering of different areas is gradually performed to form a dynamic display effect and improve the user experience.

步骤S211，获取本地存储的渲染颜色，并根据所述渲染颜色对所述渲染区域进行颜色渲染；Step S211, acquiring a locally stored rendering color, and performing color rendering on the rendering area according to the rendering color;

其中，所述渲染颜色可根据用户的需求自主进行设置，本实施例中所述渲染颜色为蓝色，具体的，随着分贝等级的增大，从内到外逐渐用蓝色的弧线替换灰色的弧线，以在所述录音图标上形成动态显示效果。Wherein, the rendering color can be set independently according to the needs of users. In this embodiment, the rendering color is blue. Specifically, as the decibel level increases, it is gradually replaced with a blue arc from the inside to the outside. A gray arc to create a dynamic display effect on said recording icon.

请继续参阅图2，步骤S31，当接收到语音录停指令时，关闭所述语音录入画面并停止所述当前语音数据的获取，以得到语音录入数据；Please continue to refer to Fig. 2, step S31, when receiving the voice recording stop command, close the voice recording screen and stop the acquisition of the current voice data, to obtain the voice recording data;

步骤S41，将所述语音录入数据转换为amr格式，并对所述语音录入数据进行文字识别以获取特征词，所述特征词为所述语音录入数据中重复次数最多的词语；Step S41, converting the voice input data into amr format, and performing text recognition on the voice input data to obtain feature words, the feature words being the most repeated words in the voice input data;

其中，由于所述语音录入数据的原始文件较大，因此采用格式转换的方式以降低所述语音录入数据文件的大小，方便了后续对所述语音录入数据的存储，且通过对所述语音录入数据进行文字识别的设计，方便了后续所述特征词的获取；Wherein, since the original file of the voice input data is relatively large, format conversion is adopted to reduce the size of the voice input data file, which facilitates subsequent storage of the voice input data, and by converting the voice input The data is designed for character recognition, which facilitates the acquisition of the feature words described later;

优选的，当在本地完成对所述语音录入数据的格式转换和存储时，由后台线程将所述语音录入数据上传到一服务器，并控制服务器端接收程序在接收录音文件的同时进行加密处理，加密完成后以文件的形式存储到服务器，进而当本地的所述语音录入数据丢失或损坏时，可通过在服务器中进行文件下载，以进行所述语音录入数据的获取，进而提高了所述语音信息采集方法的安全性能。Preferably, when the format conversion and storage of the voice input data is completed locally, the background thread uploads the voice input data to a server, and controls the server-side receiving program to perform encryption processing while receiving the recording file, After the encryption is completed, it is stored in the server in the form of a file, and when the local voice input data is lost or damaged, the file download can be performed in the server to obtain the voice input data, thereby improving the voice input data. Security performance of information collection methods.

步骤S51，根据所述特征词对格式转换后的所述语音录入数据进行重命名；Step S51, renaming the voice input data after format conversion according to the feature words;

其中，由于所述特征词在所述语音录入数据中出现的次数最多，进而可通过用所述特征词表述对应所述语音录入数据，并采用重命名的方式进行所述语音录入数据对应文件的命名显示。Wherein, because the number of occurrences of the feature words in the voice input data is the largest, and then the corresponding voice input data can be expressed by using the feature words, and the file corresponding to the voice input data can be created by renaming. Named display.

优选的，所述对所述语音录入数据进行格式转换，并将转换后的所述语音录入数据进行存储的步骤还可包括：Preferably, the step of converting the format of the voice input data and storing the converted voice input data may further include:

根据所述当前时间对格式转换后的所述语音录入数据进行重命名；renaming the voice input data after format conversion according to the current time;

步骤S61，获取本地存储的预设图标图像，并获取所述语音录入数据中的声波数据和语音录入时间；Step S61, acquiring locally stored preset icon images, and acquiring the sound wave data and voice input time in the voice input data;

步骤S71，将所述声波数据和所述语音录入时间依序对应采用声谱图像的方式和数值显示的方式分别在所述预设图标图像上进行显示；Step S71, displaying the sound wave data and the voice input time respectively on the preset icon image in a manner corresponding to a sound spectrum image and a numerical display method;

优选的，所述实时获取当前语音数据的步骤之后，所述方法还包括：Preferably, after the step of acquiring current voice data in real time, the method further includes:

若是，则关闭所述语音录入画面并停止所述当前语音数据的获取；If so, then close the voice input screen and stop the acquisition of the current voice data;

其中，通过判断预设时间内接收到的所述当前语音数据中的当前分贝数是否持续小于预设分贝数的设计，有效的防止了由于用户忘记停止语音录入导致的电量损耗。Wherein, through the design of judging whether the current decibel number in the current voice data received within the preset time is continuously smaller than the preset decibel number, the power loss caused by the user forgetting to stop voice recording is effectively prevented.

本实施例中，通过将所述声波数据采用声谱图像的方式进行显示，有效的将不同的所述语音录入数据进行了区别显示，进而方便了后续用户对不同的所述语音录入数据的区分，通过所述语音录入时间的获取和采用数值显示的设计，进一步方便了用户后续对不同的所述语音录入数据的区分，通过动态显示所述语音录入画面的设计，有效的提高了用户体验，防止了用户语音过程中的枯燥现象发生。In this embodiment, by displaying the sound wave data in the form of a sound spectrum image, different voice input data are effectively displayed differently, thereby facilitating subsequent users to distinguish different voice input data , through the acquisition of the voice input time and the design of numerical display, it further facilitates the user's subsequent distinction of different voice input data, and through the design of dynamically displaying the voice input screen, the user experience is effectively improved, The boring phenomenon in the process of the user's voice is prevented from occurring.

请参阅图4，为本发明第三实施例提供的即时通信系统100的结构示意图，包括：Please refer to FIG. 4 , which is a schematic structural diagram of an instant messaging system 100 provided in a third embodiment of the present invention, including:

即时通信平台101，用于接收用户的操作指令；The instant messaging platform 101 is used to receive the user's operation instruction;

语音采集设备102，与所述即时通信平台101通信连接，用于根据所述即时通信平台接收到的所述操作指令，以对应进行语音数据的采集或语音数据的播放；The voice collection device 102 is communicatively connected with the instant messaging platform 101, and is used to collect voice data or play voice data correspondingly according to the operation instructions received by the instant messaging platform;

所述语音采集设备102包括：The voice collection device 102 includes:

第一获取模块10，用于当接收到语音录入指令时开始计时，实时获取当前语音数据，并根据所述当前语音数据中的动态特征动态显示语音录入画面；The first acquisition module 10 is used to start timing when receiving the voice input instruction, obtain current voice data in real time, and dynamically display the voice input picture according to the dynamic characteristics in the current voice data;

存储模块20，用于当接收到语音录停指令时，关闭所述语音录入画面并停止所述当前语音数据的获取，以得到语音录入数据，对所述语音录入数据进行格式转换，并将转换后的所述语音录入数据进行存储；The storage module 20 is used to close the voice input screen and stop the acquisition of the current voice data when receiving the voice recording stop instruction, so as to obtain the voice input data, convert the format of the voice input data, and convert The later described voice entry data is stored;

具体的，由于所述语音录入数据的原始文件较大，因此采用格式转换的方式以降低所述语音录入数据文件的大小，方便了后续对所述语音录入数据的存储；Specifically, since the original file of the voice input data is relatively large, format conversion is adopted to reduce the size of the voice input data file, which facilitates subsequent storage of the voice input data;

第二获取模块30，用于获取本地存储的预设图标图像，并获取所述语音录入数据中的声波数据和语音录入时间；The second acquiring module 30 is used to acquire the locally stored preset icon image, and acquire the sound wave data and the voice input time in the voice input data;

其中，所述预设图标图像为用户预先设置的图片，该图片可为本地图片或基于网络进行下载得到的图片，所述声波数据通过声谱仪进行获取，所述语音录入时间用过计时器进行获取，Wherein, the preset icon image is a picture preset by the user, and the picture can be a local picture or a picture downloaded based on the network, the sound wave data is obtained by a spectrometer, and the voice input time is used by a timer to get,

显示模块31，用于将所述声波数据和所述语音录入时间依序对应采用声谱图像的方式和数值显示的方式分别在所述预设图标图像上进行显示；The display module 31 is used to display the sound wave data and the voice input time on the preset icon image respectively in a manner corresponding to a sound spectrum image and a numerical display method;

所述第一获取模块10包括：The first acquisition module 10 includes:

第一获取单元11，用于实时获取当前计时时间和本地存储的预设背景图像、录音图标，并根据所述当前计时时间、所述预设背景图像和所述录音图标进行图像显示；The first acquiring unit 11 is configured to acquire the current timing time and locally stored preset background image and recording icon in real time, and perform image display according to the current timing time, the preset background image and the recording icon;

第二获取单元12，用于实时获取所述当前语音数据中的语音分贝信息，并根据所述语音分贝信息对所述录音图标进行动态渲染。The second acquiring unit 12 is configured to acquire voice decibel information in the current voice data in real time, and dynamically render the recording icon according to the voice decibel information.

所述第二获取单元12包括：The second acquisition unit 12 includes:

判定单元13，用于判定所述语音分贝信息的分贝等级，并根据所述分贝等级判定渲染区域；A determination unit 13, configured to determine the decibel level of the voice decibel information, and determine the rendering area according to the decibel level;

渲染单元14，用于获取本地存储的渲染颜色，并根据所述渲染颜色对所述渲染区域进行颜色渲染。The rendering unit 14 is configured to obtain a locally stored rendering color, and perform color rendering on the rendering area according to the rendering color.

所述存储模块20包括：The storage module 20 includes:

第一转换单元21，用于将所述语音录入数据转换为amr格式，并对所述语音录入数据进行文字识别以获取特征词，所述特征词为所述语音录入数据中重复次数最多的词语；The first conversion unit 21 is used to convert the voice input data into amr format, and perform character recognition on the voice input data to obtain feature words, and the feature words are the most repeated words in the voice input data ;

第一命名单元22，用于根据所述特征词对格式转换后的所述语音录入数据进行重命名；The first naming unit 22 is used to rename the speech input data after format conversion according to the feature words;

第二转换单元23，用于将所述语音录入数据转换为amr格式，并获取当前时间；The second conversion unit 23 is used to convert the voice input data into amr format, and obtain the current time;

第二命名单元24，用于根据所述当前时间对格式转换后的所述语音录入数据进行重命名。The second naming unit 24 is configured to rename the speech input data after format conversion according to the current time.

本实施例中，通过所述显示模块31将所述声波数据采用声谱图像的方式进行显示，有效的将不同的所述语音录入数据进行了区别显示，进而方便了后续用户对不同的所述语音录入数据的区分，通过所述显示模块31对所述语音录入时间的获取和采用数值显示的设计，进一步方便了用户后续对不同的所述语音录入数据的区分，通过所述第一获取模块10动态显示所述语音录入画面的设计，有效的提高了用户体验，防止了用户语音过程中的枯燥现象发生。In this embodiment, the sound wave data is displayed in the form of a sound spectrum image through the display module 31, effectively distinguishing and displaying different voice input data, thereby facilitating subsequent users to understand different voice data. The distinction of voice input data, through the acquisition of the voice input time by the display module 31 and the design of numerical display, further facilitates the user's subsequent distinction of different voice input data, through the first acquisition module 10. The design of dynamically displaying the voice input screen effectively improves the user experience and prevents boring phenomena during the user's voice process.

请参阅图5，为本发明第四实施例提供的即时通信系统100的结构示意图，该第四实施例与第三实施例的结构大抵相同，其区别在于，本实施例中所述语音采集设备102还包括：。Please refer to FIG. 5 , which is a schematic structural diagram of an instant messaging system 100 provided by the fourth embodiment of the present invention. The structure of the fourth embodiment is roughly the same as that of the third embodiment. The difference is that the voice collection device described in this embodiment 102 also includes: .

判断模块40，用于判断预设时间内接收到的所述当前语音数据中的当前分贝数是否持续小于预设分贝数；A judging module 40, configured to judge whether the current decibel number in the current voice data received within the preset time is continuously smaller than the preset decibel number;

停止模块41，用于当所述判断模块40的判断结果为是时，关闭所述语音录入画面并停止所述当前语音数据的获取。The stop module 41 is configured to close the voice entry screen and stop the acquisition of the current voice data when the determination result of the determination module 40 is yes.

本实施例中，通过所述判断模块40判断预设时间内接收到的所述当前语音数据中的当前分贝数是否持续小于预设分贝数的设计，有效的防止了由于用户忘记停止语音录入导致的电量损耗。In this embodiment, through the design of the judging module 40 judging whether the current decibel number in the current voice data received within the preset time is continuously less than the preset decibel number, it is effectively prevented that the user forgets to stop the voice recording from causing power loss.

本实施例还提供了一种移动终端，包括存储器以及处理器，所述存储器用于存储计算机程序，所述处理器运行所述计算机程序以使所述移动终端执行上述的语音信息采集方法。This embodiment also provides a mobile terminal, including a memory and a processor, the memory is used to store a computer program, and the processor runs the computer program to enable the mobile terminal to execute the above voice information collection method.

本实施例还提供了一种存储介质，其上存储有上述移动终端中所使用的计算机程序，该程序在执行时，包括如下步骤：This embodiment also provides a storage medium, on which is stored the computer program used in the above-mentioned mobile terminal, when the program is executed, it includes the following steps:

将所述声波数据和所述语音录入时间依序对应采用声谱图像的方式和数值显示的方式分别在所述预设图标图像上进行显示。所述的存储介质，如：ROM/RAM、磁碟、光盘等。The sound wave data and the voice recording time are respectively displayed on the preset icon image in a manner corresponding to a sound spectrum image and a numerical value display manner. The storage medium, such as: ROM/RAM, magnetic disk, optical disk, etc.

上述实施例描述了本发明的技术原理，这些描述只是为了解释本发明的原理，而不能以任何方式解释为本发明保护范围的限制。基于此处的解释，本领域的技术人员不需要付出创造性的劳动即可联想到本发明的其他具体实施方式，这些方式都将落入本发明的保护范围内。The above-mentioned embodiments describe the technical principle of the present invention, and these descriptions are only for explaining the principle of the present invention, and cannot be construed as limiting the protection scope of the present invention in any way. Based on the explanations herein, those skilled in the art can think of other specific implementation modes of the present invention without creative efforts, and these modes will all fall within the protection scope of the present invention.

Claims

1. A voice information collection method, characterized in that the method comprises:

Start timing when receiving the voice input instruction, obtain the current voice data in real time, and dynamically display the voice input picture according to the dynamic characteristics in the current voice data;

When receiving the voice recording stop instruction, close the voice recording screen and stop the acquisition of the current voice data to obtain the voice recording data, convert the format of the voice recording data, and record the converted voice data storage;

Acquiring the locally stored preset icon image, and obtaining the sound wave data and voice recording time in the voice recording data;

The sound wave data and the voice recording time are respectively displayed on the preset icon image in a manner corresponding to a sound spectrum image and a numerical value display manner.

2. the voice information collection method according to claim 1, is characterized in that, described according to the dynamic feature in the current voice data dynamic display voice input picture:

Acquiring the current counting time and locally stored preset background image and recording icon in real time, and performing image display according to the current counting time, the preset background image and the recording icon;

The voice decibel information in the current voice data is acquired in real time, and the recording icon is dynamically rendered according to the voice decibel information.

3. The voice information collection method according to claim 2, wherein the step of dynamically rendering the recording icon according to the voice decibel information comprises:

Determine the decibel level of the voice decibel information, and determine the rendering area according to the decibel level;

The locally stored rendering color is acquired, and color rendering is performed on the rendering area according to the rendering color.

4. voice information collection method according to claim 1, is characterized in that, after the step of described real-time acquisition current voice data, described method also comprises:

judging whether the current decibel number in the current voice data received within the preset time is continuously smaller than the preset decibel number;

If yes, then close the voice input screen and stop acquiring the current voice data.

5. voice information collection method according to claim 1, is characterized in that, described voice input data is carried out format conversion, and the step of storing described voice input data after conversion comprises:

The voice input data is converted into amr format, and the voice input data is carried out text recognition to obtain feature words, and the feature words are the most repeated words in the voice input data;

and renaming the speech input data after format conversion according to the characteristic words.

6. The method for collecting voice information according to claim 1, wherein the step of performing format conversion on the voice input data and storing the converted voice input data comprises:

Convert the voice input data into the amr format, and obtain the current time;

Renaming the voice input data after format conversion according to the current time.

7. An instant messaging system, characterized in that, comprising:

The instant messaging platform is used to receive the user's operation instructions;

A voice collection device, connected in communication with the instant messaging platform, used to collect voice data or play voice data correspondingly according to the operation instruction received by the instant messaging platform;

The voice collection equipment includes:

The first acquisition module is used to start timing when receiving the voice input instruction, acquire current voice data in real time, and dynamically display the voice input picture according to the dynamic characteristics in the current voice data;

The storage module is used to close the voice input screen and stop the acquisition of the current voice data when receiving the voice recording stop instruction, so as to obtain the voice input data, convert the format of the voice input data, and convert the converted The voice input data of the above is stored;

The second acquisition module is used to acquire the locally stored preset icon image, and acquire the sound wave data and voice input time in the voice input data;

The display module is used to display the sound wave data and the voice recording time on the preset icon image respectively in the manner of adopting sound spectrum image and the manner of numerical value display in sequence.

8. The instant messaging system according to claim 7, wherein the first acquiring module comprises:

The first acquiring unit is used to acquire the current counting time and locally stored preset background image and recording icon in real time, and perform image display according to the current counting time, the preset background image and the recording icon;

The second acquisition unit is configured to acquire voice decibel information in the current voice data in real time, and dynamically render the recording icon according to the voice decibel information.

9. A mobile terminal, characterized in that it includes a memory and a processor, the memory is used to store a computer program, and the processor runs the computer program to enable the mobile terminal to execute any one of the following claims 1 to 6. The voice information collection method described in item.

10. A storage medium, which stores the computer program used in the mobile terminal according to claim 9.