CN105810208A

CN105810208A - Meeting recording device and method thereof for automatically generating meeting record

Info

Publication number: CN105810208A
Application number: CN201410839876.5A
Authority: CN
Inventors: 刘扬伟
Original assignee: Shenzhen Yuzhan Precision Technology Co ltd; Hon Hai Precision Industry Co Ltd
Current assignee: Shenzhen Yuzhan Precision Technology Co ltd; Hon Hai Precision Industry Co Ltd
Priority date: 2014-12-30
Filing date: 2014-12-30
Publication date: 2016-07-27

Abstract

The invention provides a meeting recording device and a method thereof for automatically generating a meeting record. The method comprises the steps of recognizing a silent fragment in speech data; judging whether the time of the silent fragment is greater than a preset value or not; taking the silent fragment of which the time is greater than the preset value as the boundary, segmenting the speech data or characters obtained by converting the speech data; and generating an original meeting record according to the segmentation condition of the speech data or the characters and a meeting recording template. The meeting recording device and the method thereof for automatically generating a meeting record can automatically generate the meeting record according to the preset meeting recording template, and are thus more time-saving, convenient and humanized than the prior art.

Description

Conference recording device and method for automatically generating conference records

技术领域 technical field

本发明涉及一种会议记录装置及其自动生成会议记录的方法。 The invention relates to a conference record device and a method for automatically generating conference records.

背景技术 Background technique

现有的会议中报告及记录的方法,通常是利用摄像机、麦克风、录音笔等设备对会议过程中各人员的发言进行录音及录像。会后做会议记录的人员可以查看、回放录音及录像以整理会议记录。然而，通过人工对语音数据进行标注和提取，对使用者来说，费时且极为不便。 The existing method for reporting and recording in a meeting usually uses equipment such as a video camera, a microphone, and a recording pen to record and video the speeches of each person during the meeting. Those who make meeting minutes after the meeting can view and play back audio and video recordings to organize meeting minutes. However, manually marking and extracting voice data is time-consuming and extremely inconvenient for users.

发明内容 Contents of the invention

鉴于此，有必要提供一种会议记录装置及自动生成会议记录的方法，能够自动生成会议记录，以解决上述问题。 In view of this, it is necessary to provide a conference record device and a method for automatically generating conference records, which can automatically generate conference records to solve the above problems.

本发明提供一种会议记录装置，包括存储器和处理器。所述会议记录装置还包括由所述处理器控制的且存储于所述存储器中的如下模块：辨识模块，用于识别语音数据中的无声片段；判断模块，用于判断所述无声片段所历经的时间是否大于一预设值；分割模块，用于以历经的时间大于所述预设值的无声片段为界，将所述语音数据或所述语音数据转换得到的文字进行分割；及生成模块，用于根据所述语音数据或所述文字被分割的情况以及所述存储器中存储的会议记录模板生成一原始会议记录。 The invention provides a conference recording device, which includes a memory and a processor. The meeting recording device also includes the following modules controlled by the processor and stored in the memory: an identification module, used to identify the silent segment in the voice data; a judging module, used to judge the silent segment experienced Whether the time is greater than a preset value; the segmentation module is used to segment the voice data or the converted text of the voice data with the silent segment whose elapsed time is greater than the preset value; and a generation module is used to generate an original meeting record according to the situation where the voice data or the text is segmented and the meeting record template stored in the memory.

本发明还提供一种自动生成会议记录的方法，运行于包括存储器和处理器的至少一装置中。所述方法包括由所述处理器控制所述存储器中存储的模块执行的如下步骤：识别步骤：识别语音数据中的无声片段；判断步骤：判断所述无声片段所历经的时间是否大于一预设值；分割步骤：以历经的时间大于所述预设值的无声片段为界，将所述语音数据或所述语音数据转换得到的文字进行分割；及生成步骤：根据所述语音数据或所述文字被分割的情况以及所述存储器中存储的会议记录模板生成一原始会议记录。 The present invention also provides a method for automatically generating meeting minutes, which runs in at least one device including a memory and a processor. The method includes the following steps of being executed by the processor controlling the modules stored in the memory: identifying step: identifying the silent segment in the voice data; judging step: judging whether the elapsed time of the silent segment is greater than a preset Value; Segmentation step: with the silent segment whose elapsed time is greater than the preset value as the boundary, the speech data or the converted text of the speech data is segmented; and generation step: according to the speech data or the The situation that the text is segmented and the meeting record template stored in the memory generate an original meeting record.

本发明所述的会议记录装置及其自动生成会议记录的方法，可根据预设的会议记录模板自动生成会议记录，因而，相较于现有的方式更省时、方便及人性化。 The meeting record device and the method for automatically generating meeting minutes described in the present invention can automatically generate meeting minutes according to a preset meeting record template, thus, compared with the existing methods, it is more time-saving, convenient and humanized.

附图说明 Description of drawings

图1为本发明一实施方式的会议记录装置的应用环境示意图。 FIG. 1 is a schematic diagram of an application environment of a meeting recording device according to an embodiment of the present invention.

图2为图1所示的会议记录装置的一实施方式的功能模块图。 FIG. 2 is a functional block diagram of an embodiment of the conference recording device shown in FIG. 1 .

图3为本发明一实施方式中，生成的原始会议记录及编辑后的会议记录的示意图。 Fig. 3 is a schematic diagram of the generated original meeting minutes and the edited meeting minutes in one embodiment of the present invention.

图4-图7分别为本发明不同实施方式的自动生成会议记录的方法的步骤流程图。 4-7 are flow charts of the steps of the method for automatically generating meeting minutes in different embodiments of the present invention, respectively.

主要元件符号说明 Description of main component symbols

会议记录装置conference recording device 100100 云端装置cloud device 200200 用户user 11 原始会议记录original meeting minutes 310310 编辑后的会议记录Edited Minutes 320320 自动生成会议记录的方法How to Automatically Generate Meeting Minutes 400、500、600、700400, 500, 600, 700 存储器memory 1010 录音模块recording module 1111 转换模块conversion module 1212 辨识模块Identification module 1313 判断模块judgment module 1414 校对编辑模块Proofreading and editing module 1515 生成模块build module 1616 发送模块sending module 1717 分割模块Segmentation module 1818 控制模块control module 1919 语音输入单元voice input unit 2020 触摸屏touch screen 3030 通信单元communication unit 4040 定位模组positioning module 5050 处理器processor 6060 步骤step S401-S407、S501-S508、S601-S607、S701-S707S401-S407, S501-S508, S601-S607, S701-S707

如下具体实施方式将结合上述附图进一步说明本发明。 The following specific embodiments will further illustrate the present invention in conjunction with the above-mentioned drawings.

具体实施方式 detailed description

请参阅图1，其为本发明的一实施方式的会议记录装置100的应用环境示意图。本实施方式中，会议记录装置100可与一云端装置200相连接。其中，会议记录装置100处于各用户1的附近，可接收各用户1在会议或报告上的语音，即用户1的发言。会议记录装置100和/或云端装置200具备根据会议记录装置100接收的语音自动生成会议记录的功能。用户1为会议或报告的参与者。为了描述方便，以下将会议或报告统一称为会议。 Please refer to FIG. 1 , which is a schematic diagram of an application environment of a conference recording device 100 according to an embodiment of the present invention. In this embodiment, the conference recording device 100 can be connected with a cloud device 200 . Wherein, the meeting recording device 100 is located near each user 1 and can receive the voice of each user 1 in the meeting or report, that is, the speech of the user 1 . The meeting recording device 100 and/or the cloud device 200 has the function of automatically generating meeting minutes according to the voice received by the meeting recording device 100 . User 1 is a participant in a meeting or report. For convenience of description, meetings or reports are collectively referred to as meetings below.

在一实施方式中，会议记录装置100具有自动生成会议记录的功能，即，可以自行生成会议记录。且会议记录装置100不依赖云端装置200，而自行根据其接收的语音自动生成会议记录。当多个用户1举行会议或报告时，会议记录装置100可自动记录各用户1的语音，并自动将识别各用户的语音，并将识别到的语音转换为文字后，按照预设的会议记录模板自动生成会议记录，并按照预设的方式自动发送至相关人员。相关人员包括各用户1和/或其他会议相关人员，例如待办事项负责人、相关主管等人员。从而实现自动记录、生成及发送会议记录的功能。 In one embodiment, the meeting recording device 100 has a function of automatically generating meeting minutes, that is, it can generate meeting minutes by itself. Moreover, the conference recording device 100 does not rely on the cloud device 200 , but automatically generates conference records according to the voice it receives. When multiple users 1 hold meetings or reports, the meeting recorder 100 can automatically record the voices of each user 1, and automatically recognize the voices of each user, and convert the recognized voices into text, and record them according to the preset meeting records. The template automatically generates meeting minutes and automatically sends them to relevant personnel according to the preset method. Relevant personnel include each user 1 and/or other relevant personnel of the meeting, such as the person in charge of the to-do items, relevant supervisors, and the like. So as to realize the function of automatically recording, generating and sending meeting minutes.

为说明方便，本段落中的以下括号中的文字为其前面的文字的简化的功能说明。具体的请参见如下的说明。会议记录装置100可以自动辨识接收的语音的各相应的用户（辨识语音中的用户），然后将接收的语音转换为包括辨识出的用户的用户名的文字，或者，将接收的语音自动转换为文字（语音转换为文字），然后从文字中识别出各用户1的用户名（辨识文字中的用户）。之后根据上述从语音和/或文字中识别出的用户名对文字进行段落划分（根据文字划分段落）之后，再根据预设的会议记录模板自动生成会议记录（生成会议记录）。会议记录装置100还可以根据接收的语音自动识别其中的无声片段（根据语音识别无声片段），根据识别出的无声片段将语音划分为多个语音片段（根据语音划分段落），然后分别将该多个语音片段转换为对应的文字（语音转换为文字），再根据预设的会议记录模板自动生成会议记录（生成会议记录）。会议记录装置100还可以自动辨识语音和/或文字信息中多次重复出现的词句，并存储于常用语数据库中，因而在生成会议记录的过程中，可以自动将文字记录中的词句校对成常用的词句。 For the convenience of description, the words in the following brackets in this paragraph are simplified functional descriptions of the preceding words. For details, please refer to the following instructions. The meeting recording device 100 can automatically identify each corresponding user of the received voice (identify the user in the voice), and then convert the received voice into text including the user name of the recognized user, or automatically convert the received voice into text (convert voice to text), and then recognize the user name of each user 1 from the text (identify the user in the text). Then divide the text into paragraphs according to the above-mentioned user names recognized from the voice and/or text (divide paragraphs according to the text), and then automatically generate meeting minutes according to the preset meeting record template (generate meeting minutes). The meeting recorder 100 can also automatically recognize the silent segment in the received voice (identify the silent segment according to the voice), divide the speech into multiple voice segments according to the recognized silent segment (divide the paragraphs according to the voice), and then divide the multiple voice segments into segments respectively. Convert a speech clip into corresponding text (speech to text), and then automatically generate meeting minutes according to the preset meeting record template (generating meeting minutes). The meeting recording device 100 can also automatically recognize words and sentences that appear repeatedly in voice and/or text information, and store them in the database of commonly used words. words and sentences.

在另一实施方式中，会议记录装置100可以与云端装置200进行数据通信，从而由会议记录装置100和云端装置200一起或由云端装置200单独根据会议记录装置100接收的语音自动生成会议记录。因而，本发明还可以是由会议记录装置100对会议进行录音，并将所录的语音转换为语音信号，将转换的到的语音信号和/或其他数据（例如根据语音信号转换得到的文字等）传输至至云端装置200，而由会议记录装置100和/或及云端装置200分别执行在上一实施方式中全部由会议记录装置100执行的以下功能中的全部或一部分：语音转换为文字、辨识语音和/或文字中的用户、根据语音和/或文字识别无声片段、根据语音和/或文字中划分段落、生成会议记录、辨识语音和/或文字中的常用词句、存储常用词句于常用语数据库，以及根据常用词句自动校对/编辑文字或会议记录。 In another embodiment, the conference recording device 100 can perform data communication with the cloud device 200, so that the conference recording device 100 and the cloud device 200 together or the cloud device 200 alone automatically generates conference records according to the voice received by the conference recording device 100. Therefore, in the present invention, the conference recording device 100 can also record the conference, and convert the recorded voice into a voice signal, and convert the converted voice signal and/or other data (such as text converted from the voice signal, etc.) ) to the cloud device 200, and the conference recording device 100 and/or and the cloud device 200 respectively perform all or part of the following functions performed by the conference recording device 100 in the previous embodiment: voice conversion into text, Recognize users in voice and/or text, identify silent segments based on voice and/or text, divide paragraphs based on voice and/or text, generate meeting minutes, recognize common words and sentences in voice and/or text, store common words and sentences in frequently used language database, and automatically proofread/edit text or meeting minutes based on commonly used words and sentences.

请参阅图2，其为本发明一实施方式的。需要说明的是，图2所示仅仅是本发明的一实施方式中的会议记录装置100的功能模块图，对应以上所描述的实现本发明的各实施方式，会议记录装置100还可以是只包括图2中示出的一部分的功能单元/模块。而云端装置200则可以包括图2所示的其他功能单元/模块。例如，在单独由云端装置200执行自动生成会议记录的功能的实施方式中，会议记录装置100可以包括图2所示的语音输入单元20、通信单元40、处理器60，云端装置200可以包括相应的通信单元、处理器以及存储器10中存储的模块12-19。以下在需要时将作相应的描述。 Please refer to FIG. 2 , which is an embodiment of the present invention. It should be noted that, what is shown in FIG. 2 is only a functional block diagram of the meeting recording device 100 in one embodiment of the present invention. Corresponding to the various embodiments of the present invention described above, the meeting recording device 100 may also only include Figure 2 shows a part of the functional units/modules. The cloud device 200 may include other functional units/modules shown in FIG. 2 . For example, in an implementation in which the function of automatically generating conference minutes is performed solely by the cloud device 200, the conference recording device 100 may include the voice input unit 20, the communication unit 40, and the processor 60 shown in FIG. 2, and the cloud device 200 may include corresponding The communication unit, the processor and the modules 12-19 stored in the memory 10. Corresponding descriptions will be made below when necessary.

本实施方式中，会议记录装置100包括一存储器10、语音输入单元20、触摸屏30、通信单元40、定位模组50和处理器60。存储器10、语音输入单元20、触摸屏30、通信单元40通过信号线和数据线分别连接于处理器60。会议记录装置100为一智能手机，在其他实施方式中，会议记录装置100还可以是平板电脑、笔记本电脑、台式计算机以及会议电话等装置。 In this embodiment, the conference recording device 100 includes a memory 10 , a voice input unit 20 , a touch screen 30 , a communication unit 40 , a positioning module 50 and a processor 60 . The memory 10, voice input unit 20, touch screen 30, and communication unit 40 are respectively connected to the processor 60 through signal lines and data lines. The conference recording device 100 is a smart phone. In other embodiments, the conference recording device 100 may also be a tablet computer, a notebook computer, a desktop computer, a conference phone, and the like.

本实施方式中，会议记录装置100可独立自动生成会议记录。会议记录装置100自动根据其语音输入单元20所接收到的参加会议的用户1的语音，将接收的语音转换为文字，之后再根据预设的会议记录模板自动生成一会议记录。具体的，会议记录装置100可以执行前述的将接收的语音自动转换为文字、自动辨识接收的语音或转换后的文字中的用户、根据辨识出的用户名对文字进行段落划分，再根据预设的会议记录模板自动生成会议记录。会议记录装置100还可以根据接收的语音自动识别其中的无声片段，根据识别出的无声片段将语音划分为多个语音片段，然后分别将该多个语音片段转换为对应的文字，再根据预设的会议记录模板自动生成会议记录。会议记录装置100还可以自动辨识语音和/或文字信息中多次重复出现的词句，并存储于常用语数据库中，因而在生成会议记录的过程中，可以自动将文字记录中的词句校对成常用的词句。会议记录装置100还可以将生成的会议记录和/或待办事项根据预设方式自动发送至相关人员的通讯地址。其中，该预设方式包括预设的发送格式、预设的发送时间等等。相关人员的通讯地址至少包括以下中的一种：电子邮件地址、电话号码、社交账号（例如QQ号码、微信账号）等等。 In this embodiment, the meeting recording device 100 can independently and automatically generate meeting minutes. The meeting recording device 100 automatically converts the received voice into text according to the voice received by the voice input unit 20 of the user 1 participating in the meeting, and then automatically generates a meeting record according to a preset meeting record template. Specifically, the conference recorder 100 can perform the aforementioned automatic conversion of the received voice into text, automatically identify users in the received voice or the converted text, divide the text into paragraphs according to the recognized user name, and then The meeting minutes template automatically generates meeting minutes. The meeting recorder 100 can also automatically recognize the silent segment in the received voice, divide the voice into multiple voice segments according to the recognized silent segment, and then convert the multiple voice segments into corresponding text, and then according to the preset The meeting minutes template automatically generates meeting minutes. The meeting recording device 100 can also automatically recognize words and sentences that appear repeatedly in voice and/or text information, and store them in the database of commonly used words. words and sentences. The meeting recording device 100 can also automatically send the generated meeting minutes and/or to-do items to the communication addresses of relevant personnel according to a preset method. Wherein, the preset manner includes a preset sending format, a preset sending time, and the like. The correspondence address of the relevant person includes at least one of the following: email address, phone number, social account (such as QQ number, WeChat account), etc.

存储器10中存储了一用户语音特征表，该语音特征表记录了多个用户名及其语音特征参数的一一对应关系。本实施方式中，用户名可以是用户的真实姓名，也可以是昵称或代号等。该用户语音特征表可以预先训练得到，即，在会议/报告开始之前的一时间内，对各用户进行语音训练、采集而得到。存储器10中还可以存储由用户或系统预设的会议记录模板。存储器10还可以用于存储录制的语音数据、语音文字转换所需的语音文字数据库等，以及常用语数据库。其中，常用语数据库是在会议记录装置100执行其自动生成会议记录的功能的过程中，累积、筛选存储的，也可以是从一常用语数据库中下载并存储的。 A user voice feature table is stored in the memory 10, and the voice feature table records the one-to-one correspondence between multiple user names and their voice feature parameters. In this embodiment, the user name may be the real name of the user, or may be a nickname or a code name. The user voice feature table can be obtained through pre-training, that is, it is obtained by performing voice training and collection on each user within a period of time before the start of the meeting/report. The memory 10 may also store a meeting record template preset by the user or the system. The memory 10 can also be used to store recorded speech data, a speech-to-text database required for speech-to-text conversion, and a database of commonly used words. Wherein, the database of commonly used words is accumulated, screened and stored during the process of the conference recorder 100 executing its function of automatically generating conference minutes, or it may be downloaded and stored from a database of commonly used words.

本实施方式中，语音输入单元20用于采集会议时各用户的语音，并将采集到的语音转换为语音信号。语音输入单元20为一麦克风。通信单元40用于响应处理器60的控制而与云端装置200进行数据通信。定位模组50用于提供会议记录装置100的实时位置信息，其可以是一GPS定位模组。 In this embodiment, the voice input unit 20 is used to collect the voices of each user during the conference, and convert the collected voices into voice signals. The voice input unit 20 is a microphone. The communication unit 40 is used for performing data communication with the cloud device 200 in response to the control of the processor 60 . The positioning module 50 is used to provide real-time location information of the conference recording device 100 , which may be a GPS positioning module.

在一实施方式中，会议记录装置100还包括一触摸屏30。 In one embodiment, the conference recording device 100 further includes a touch screen 30 .

在本实施方式中，存储器10中还存储了多个功能模块，该多个功能模块被配置成由一个或多个处理器（本实施方式为一个处理器60）执行，以完成本发明。例如，参阅图1所示，存储器10中存储了录音模块11、转换模块12、辨识模块13、判断模块14、校对编辑模块15、生成模块16、发送模块17、分割模块18和控制模块19。在其他实施方式中，存储器10中存储的功能模块还可以根据实际需要作相应的变化，例如，当语音转换为文字、自动辨识语音和/或文字中的常用词句、存储常用词句于常用语数据库，以及根据常用词句自动校对文字等功能中的一或多个功能被设置为由云端装置200来执行时，会议记录装置100的存储器10中可以不存储执行该功能所需的功能模块。本发明所称的模块是完成一特定功能的程序段，比程序更适合于描述软件在处理器60中的执行过程。关于各模块的功能将在图4-图7的流程图中具体描述。 In this embodiment, the memory 10 also stores a plurality of functional modules configured to be executed by one or more processors (one processor 60 in this embodiment), so as to complete the present invention. For example, as shown in FIG. 1, a recording module 11, a conversion module 12, a recognition module 13, a judgment module 14, a proofreading and editing module 15, a generation module 16, a sending module 17, a segmentation module 18 and a control module 19 are stored in the memory 10. In other embodiments, the functional modules stored in the memory 10 can also be changed according to actual needs, for example, when the voice is converted into text, the common words and sentences in the voice and/or text are automatically recognized, and the common words and sentences are stored in the common language database. , and when one or more functions in functions such as automatic text proofreading according to commonly used words and phrases are set to be executed by the cloud device 200, the memory 10 of the meeting recording device 100 may not store the functional modules required to perform the functions. The module referred to in the present invention is a program segment that completes a specific function, and is more suitable for describing the execution process of software in the processor 60 than a program. The functions of each module will be described in detail in the flowcharts of Fig. 4-Fig. 7 .

需要说明的是，为说明方便，以下关于自动生成会议记录的方法的介绍中，均是以该方法运行于一包括相应的单元和/或功能模块的会议记录装置（例如会议记录装置100）中来进行介绍的。根据前面的介绍可知，以下的各自动生成会议记录的方法中，某些步骤还可以设置由一与会议记录装置连接的云端装置（例如云端装置200）来执行，因此，相应的，需要时，可以在下述的各自动生成会议记录的方法的步骤中增加会议记录装置将语音信号/数据、文字数据和/或其他数据传输至该云端装置，以及该云端装置接收信号/数据的步骤。因该些为本领域技术人员可以根据本说明书所揭露的内容实施得到的一些技术手段，因此，为节约篇幅起见，将不在本说明书中一一具体详细的描述。 It should be noted that, for the convenience of description, in the following introductions about the method for automatically generating meeting minutes, this method is used to run in a meeting recording device (such as the meeting recording device 100) including corresponding units and/or functional modules for the introduction. According to the previous introduction, it can be seen that in the following methods for automatically generating meeting minutes, some steps can also be set to be executed by a cloud device (such as cloud device 200) connected to the meeting recording device. Therefore, correspondingly, when necessary, The steps of the conference recording device transmitting voice signals/data, text data and/or other data to the cloud device, and the cloud device receiving the signals/data can be added to the steps of the following methods for automatically generating conference minutes. Since these are some technical means that can be implemented by those skilled in the art based on the content disclosed in this specification, for the sake of saving space, they will not be described in detail in this specification one by one.

如图4所示，是本发明一实施方式的自动生成会议记录的方法400的流程图。自动生成会议记录的方法400是在一会议记录装置（例如会议记录装置100）和/或云端装置（例如云端装置200）的会议记录功能被开启后，运行于该会议记录装置和/或云端装置的，其可以开始于步骤S401、步骤S402或步骤S403。 As shown in FIG. 4 , it is a flowchart of a method 400 for automatically generating meeting minutes according to an embodiment of the present invention. The method 400 for automatically generating conference minutes is to run on the conference recording device and/or the cloud device after the meeting recording function of the conference recording device (such as the conference recording device 100) and/or the cloud device (such as the cloud device 200) is enabled. Yes, it may start from step S401, step S402 or step S403.

步骤S401，接收步骤：语音输入单元20接收语音并将接收的语音转换为相应的语音信号。本实施方式中，会议记录装置100设在会议的用户1附近，语音输入单元20为设置于会议记录装置100中的麦克风。 Step S401, receiving step: the voice input unit 20 receives voice and converts the received voice into a corresponding voice signal. In this embodiment, the meeting recorder 100 is set near the user 1 in the meeting, and the voice input unit 20 is a microphone set in the meeting recorder 100 .

在另一实施方式中，还可以在本步骤S401同时或之前执行如下步骤：控制模块19控制开启定位模组50以获取一会议记录装置100的位置信息及当前的会议时间信息，并将获取的位置信息及时间信息存储于存储器10中。在其他实施方式中，会议记录装置100还可以接收经由触摸屏30输入的当前会议的相关信息并存储，例如，会议日期、时间、地点以及参加会议的人员名等等。 In another embodiment, the following steps may be executed simultaneously or before this step S401: the control module 19 controls to start the positioning module 50 to obtain the location information of a meeting recorder 100 and the current meeting time information, and the acquired The location information and time information are stored in the memory 10 . In other implementation manners, the conference recording device 100 may also receive and store information related to the current conference input via the touch screen 30 , for example, the date, time, location of the conference, and names of persons attending the conference.

步骤S402，录音步骤：录音模块11将所述语音信号录制成语音数据，并将录制好的语音数据存储于存储器10。在一实施方式中，响应用户的选择，本步骤也可以省略，而直接执行步骤S403。 Step S402 , recording step: the recording module 11 records the voice signal into voice data, and stores the recorded voice data in the memory 10 . In one embodiment, in response to the user's selection, this step may also be omitted, and step S403 is directly executed.

步骤S403，辨识步骤：辨识模块13根据所述语音信号以及存储器10中存储的用户语音特征表，识别出所述语音信号对应的一或多个用户。本实施方式中，辨识模块13根据所述语音信号分析得到一或多个语音特征，并从所述语音特征表中查询到相同/最相近的语音特征对应的一或多个用户，从而得到语音数据中对应的一或多个用户。会议或报告进行时，当有多个用户发言/说话的时候，辨识模块13即可根据所述语音信号及所述语音特征表识别出所述语音数据中包含了哪个用户的声音。 Step S403 , identification step: the identification module 13 identifies one or more users corresponding to the voice signal according to the voice signal and the user voice feature table stored in the memory 10 . In this embodiment, the recognition module 13 obtains one or more speech features according to the speech signal analysis, and queries one or more users corresponding to the same/most similar speech features from the speech feature table, thereby obtaining the speech The corresponding one or more users in the data. When a meeting or a report is in progress, when multiple users speak/speak, the identification module 13 can identify which user's voice is included in the voice data according to the voice signal and the voice feature table.

在另一实施方式中，辨识模块13还给不同的用户的语音片段加上不同的标签，同一用户的语音片段加上相同的标签。 In another embodiment, the identification module 13 also adds different tags to the voice segments of different users, and adds the same tag to the voice segments of the same user.

步骤S404，转换步骤：转换模块12将所述语音信号转换为包含所述一或多个用户的用户名的文字。本实施方式中，转换模块12根据所述语音信号以及存储器10中存储的语音文字数据库，将所述语音信号转换为文字，并在辨识模块13识别到的一或多个用户的各用户的语音信号对应的转换得到的文字的一预设位置自动添加对应的用户的用户名，本实施方式中，预设位置为各用户的语音信号对应的转换得到的文字的最前端。 Step S404, conversion step: the conversion module 12 converts the voice signal into text containing the usernames of the one or more users. In this embodiment, the conversion module 12 converts the speech signal into text according to the speech signal and the speech-text database stored in the memory 10, and recognizes the speech of each user of one or more users recognized by the recognition module 13 The user name of the corresponding user is automatically added to a preset position of the converted text corresponding to the signal. In this embodiment, the preset position is the front end of the converted text corresponding to each user's voice signal.

在另一实施方式中，在辨识模块13给不同的用户的语音片段加上了些标签时，转换模块12转换得到的所述文字还包括了该些标签。 In another embodiment, when the recognition module 13 adds some tags to speech segments of different users, the text converted by the conversion module 12 also includes these tags.

步骤S405，生成步骤：生成模块16根据转换得到的所述文字以及存储器10中存储的会议记录模板生成一原始会议记录。请参阅图3所示，其示出有一实施方式中，生成模块16生成的一原始会议记录310。 Step S405 , generating step: the generating module 16 generates an original meeting record according to the converted text and the meeting record template stored in the memory 10 . Please refer to FIG. 3 , which shows an original meeting record 310 generated by the generation module 16 in one embodiment.

在一实施方式中，生成模块16还将定位模组50所获取的位置信息及时间信息自动添加到生成的原始会议记录中。例如，将时间信息添加到会议记录模板中的会议日期/时间的栏位中，将位置信息添加到会议记录模板中的会议地点的栏位中，等等。 In one embodiment, the generating module 16 also automatically adds the location information and time information acquired by the positioning module 50 to the generated original meeting minutes. For example, time information is added to the field of meeting date/time in the meeting minutes template, location information is added to the field of meeting location in the meeting minutes template, and so on.

生成模块16还可以将用户通过触摸屏30输入的会议参加者/出席者自动添加到会议记录模板中的出席者/与会者的栏位中。 The generating module 16 can also automatically add the meeting participants/attendees input by the user through the touch screen 30 to the attendee/attendee column in the meeting record template.

在另一实施方式中，生成模块16还可以根据辨识模块13识别到的所述文字中包含的用户名或辨识模块13根据语音信号辨识得到的发出所述语音信号对应的语音的用户的用户名，自动将该些用户名添加到会议记录模板中的出席者/与会者的栏位中。 In another embodiment, the generation module 16 may also use the user name contained in the text recognized by the recognition module 13 or the user name of the user who uttered the voice corresponding to the voice signal recognized by the recognition module 13 according to the voice signal , to automatically add those usernames to the Attendees/Attendees field in the meeting minutes template.

步骤S406，校对编辑步骤：校对编辑模块15根据预设的校对编辑规则对所述原始会议记录进行校对和/或编辑，以得到一会议记录。 Step S406 , proofreading and editing step: the proofreading and editing module 15 proofreads and/or edits the original meeting record according to preset proofreading and editing rules to obtain a meeting record.

本实施方式中，所述预设的校对编辑规则为从所述文字中的每一用户名处对文字进行段落划分。辨识模块13还从转换得到的所述文字中辨识/识别出用户的用户名，校对编辑模块15则根据辨识模块13识别到的所述文字中包含的用户名对所述原始会议记录进行段落划分。例如，校对编辑模块15以用户名的第一个或最后一个字为界来划分段落。当所述文字中包含用户名为王大明时，校对编辑模块15则从以王大明这三个文字作为段落的段首。需要说明的是，本实施方式中，优选的，此处所说的用户名均是由辨识模块13通过辨识语音而得到的用户的用户名。在另一实施方式中，该些用户名还可以是辨识模块13根据存储器10中原先存储的用户名，从所述文字中自动识别出来的。请参阅图3所示，其示出有一实施方式中，校对编辑模块15对原始会议记录310进行校对和/或编辑后得到的编辑后的会议记录320。 In this embodiment, the preset proofreading and editing rule is to divide the text into paragraphs from each user name in the text. The recognition module 13 also recognizes/recognizes the user name of the user from the converted text, and the proofreading and editing module 15 divides the original meeting record into paragraphs according to the user name contained in the text recognized by the recognition module 13 . For example, the proof-editing module 15 divides paragraphs by the first or last letter of the username. When the text contains the user name Wang Daming, the proofreading and editing module 15 then uses the three texts of Wang Daming as the beginning of the paragraph. It should be noted that, in this embodiment, preferably, the user name mentioned here is the user name of the user obtained by the recognition module 13 through speech recognition. In another embodiment, the user names can also be automatically recognized by the recognition module 13 from the text according to the user names previously stored in the memory 10 . Please refer to FIG. 3 , which shows an edited meeting record 320 obtained after the proofreading and editing module 15 proofreads and/or edits the original meeting record 310 in one embodiment.

在另一实施方式中，所述预设的校对编辑规则为根据辨识模块13给不同的用户的语音片段加上的标签，从每一语音片段起始处所对应的文字处对文字段落进行切分。 In another embodiment, the preset proofreading and editing rule is to segment the text paragraphs from the text corresponding to the beginning of each voice segment according to the tags added by the recognition module 13 to the voice segments of different users .

在再一实施方式中，校对编辑模块15还将校对编辑后的所述会议记录存储于所述存储器10中。或者，发送模块17控制通过通信单元40将校对编辑后的所述会议记录发送至所述云端装置200，以控制将所述会议记录存储于所述云端装置200。 In yet another embodiment, the proofreading and editing module 15 also stores the proofreading and editing of the meeting minutes in the memory 10 . Alternatively, the sending module 17 controls to send the proofread and edited meeting minutes to the cloud device 200 through the communication unit 40 , so as to control to store the meeting minutes in the cloud device 200 .

在其他实施方式中，校对编辑模块15还根据触摸屏30生成的编辑信号对会议记录进行编辑。例如，用户可以通过触摸屏30输入对原始会议记录的编辑内容和/或编辑操作，从而提供了供用户手动编辑原始会议记录的功能。此外，所述预设的校对编辑规则还包括智能识别校对文字等，具体请结合以下根据图5进行的说明。 In other embodiments, the proofreading and editing module 15 also edits the meeting minutes according to the editing signal generated by the touch screen 30 . For example, the user can input editing content and/or editing operations on the original meeting record through the touch screen 30, thereby providing a function for the user to manually edit the original meeting record. In addition, the preset proofreading and editing rules also include intelligent identification of proofreading text, etc., please refer to the following description based on FIG. 5 for details.

步骤S407，发送步骤：发送模块17根据预设的发送规则将经校对和/或编辑后的所述会议记录自动发送至会议相关人员的通讯地址。本实施方式中，所述预设的发送规则可以为立即发送（即，会议记录生成后即发送）至会议相关人员的通讯地址，也可以是在会议记录生成后的一预设时间点发送至会议相关人员的通讯地址。所述会议相关人员可以包括以下人员中的一或多个：会议出席者、会议记录中出现了其用户名的用户、会议记录中涉及/提及的用户（例如，待办事项的用户）、预设的主管、负责人、责任人等等。 Step S407, sending step: the sending module 17 automatically sends the proofread and/or edited meeting minutes to the communication addresses of the meeting related personnel according to the preset sending rules. In this embodiment, the preset sending rule can be sent immediately (that is, sent immediately after the meeting record is generated) to the communication address of the relevant person in the meeting, or can be sent at a preset time after the meeting record is generated. The mailing address of the person involved in the meeting. The people involved in the meeting may include one or more of the following: meeting attendees, users whose usernames appear in the meeting minutes, users involved/mentioned in the meeting minutes (for example, users with to-do items), Default supervisor, responsible person, responsible person, etc.

在另一实施方式中，所述预设的发送规则还可以包括在待办事项的预设到期日前的预设天数发送生成的所述会议记录至待办事项相关的人员的通讯地址，例如，可以包括待办事项的直接责任人、相关主管及与该待办事项相关的其他相关人员。 In another embodiment, the preset sending rule may also include sending the generated meeting minutes to the mailing address of the person related to the to-do item within a preset number of days before the preset due date of the to-do item, for example , which may include the person directly responsible for the to-do item, the relevant supervisor, and other relevant personnel related to the to-do item.

在其他实施方式中，还可以不设置本步骤S407，而由用户直接手动发送会议记录至会议相关人员的通讯地址；或者，在云端装置200接收并存储了该会议记录时，由云端装置200将该会议记录发送至会议相关人员。 In other implementation manners, this step S407 may not be set, and the user directly manually sends the meeting record to the communication address of the meeting related personnel; or, when the cloud device 200 receives and stores the meeting record, the cloud device 200 will send the meeting record to The meeting minutes are sent to the relevant personnel of the meeting.

如图5所示，是本发明一实施方式的自动生成会议记录的方法500的流程图。自动生成会议记录的方法500是在一会议记录装置（例如会议记录装置100）的会议记录功能被开启后，运行于该会议记录装置的。需要说明的是，图5所示的自动生成会议记录的方法500与图4所示的自动生成会议记录的方法400中执行的步骤中，有一部分相同或相类似的，因此，上述对图4中的自动生成会议记录的方法400进行描述时，针对某步骤进行说明的一些替代的、可同时执行的其他实施方式也是适用于图5中的自动生成会议记录的方法500中相同或相类似的步骤，在此就不再一一赘述。自动生成会议记录的方法500可以开始于步骤S501。 As shown in FIG. 5 , it is a flowchart of a method 500 for automatically generating meeting minutes according to an embodiment of the present invention. The method 500 for automatically generating meeting minutes is run on a meeting recording device (such as the meeting recording device 100 ) after the meeting recording function is turned on. It should be noted that some of the steps performed in the method 500 for automatically generating meeting minutes shown in FIG. 5 and the method 400 for automatically generating meeting minutes shown in FIG. When describing the method 400 for automatically generating meeting minutes in , some alternative implementations that can be executed simultaneously for a certain step are also applicable to the same or similar ones in the method 500 for automatically generating meeting minutes in FIG. 5 The steps will not be repeated here. The method 500 for automatically generating meeting minutes may start at step S501.

步骤S501，接收步骤：语音输入单元20接收语音并将接收的语音转换为相应的语音信号。 Step S501, receiving step: the voice input unit 20 receives voice and converts the received voice into a corresponding voice signal.

步骤S502，录音步骤：录音模块11将所述语音信号录制成语音数据，并将录制好的语音数据存储于存储器10。在一实施方式中，响应用户的选择，本步骤也可以省略，而直接执行步骤S503。 Step S502 , recording step: the recording module 11 records the voice signal into voice data, and stores the recorded voice data in the memory 10 . In one embodiment, in response to the user's selection, this step can also be omitted, and step S503 is directly executed.

步骤S503，辨识步骤：辨识模块13根据所述语音信号识别出所述语音数据中的无声片段。本实施方式中，所述无声片段即为所述语音数据中的为静音数据的片段，即，为所述语音中为静音的片段。例如，当所述语音信号中某部分对应的语音数据的语音片段的音量小于一预设的无声临界值时，辨识模块13即识别该语音片段为无声片段。所述语音数据中可能包含了多个无声片段。 Step S503, identifying step: the identifying module 13 identifies the silent segment in the voice data according to the voice signal. In this implementation manner, the silent segment is a segment of mute data in the voice data, that is, a segment of mute in the voice. For example, when the volume of a voice segment of voice data corresponding to a certain part of the voice signal is smaller than a preset silence threshold, the identification module 13 identifies the voice segment as a silent segment. The voice data may contain multiple silent segments.

在一实施方式中，当未包含步骤S502时，本步骤中，辨识模块13根据所述语音信号识别出所述语音中的无声片段。 In one embodiment, when step S502 is not included, in this step, the recognition module 13 recognizes a silent segment in the speech according to the speech signal.

步骤S504，判断步骤：判断模块14判断所述无声片段所历经的时间是否大于一预设值，如果是，则执行步骤S505，否则，流程结束。在一实施方式中，所述预设值为3秒。 Step S504, judging step: the judging module 14 judges whether the elapsed time of the silent segment is greater than a preset value, if yes, execute step S505, otherwise, the process ends. In one embodiment, the preset value is 3 seconds.

步骤S505，分割步骤：分割模块18根据所述无声片段将所述语音数据分割为多个语音数据片段。本实施方式中，分割模块18从所述无声片段处对所述语音数据进行分割，当所述语音数据中包含历经的时间均大于所述预设值的多个无声片段时，分割模块18根据多个无声片段将所述语音数据分割为多个语音数据片段。 Step S505, dividing step: the dividing module 18 divides the voice data into multiple voice data segments according to the silent segment. In this embodiment, the segmentation module 18 divides the voice data from the silent segment, and when the voice data includes multiple silent segments whose elapsed time is greater than the preset value, the segmentation module 18 The plurality of silent segments divides the voice data into a plurality of voice data segments.

步骤S506，辨识步骤：辨识模块13根据分割得到的多个语音数据片段对应的语音信号以及存储器10中存储的用户语音特征表，识别出所述多个语音数据片段中对应的一或多个用户。在一实施方式中，本自动生成会议记录的方法500还可以不包括本步骤。 Step S506, identification step: the identification module 13 identifies one or more users corresponding to the plurality of voice data segments according to the voice signals corresponding to the multiple voice data segments obtained by segmentation and the user voice feature table stored in the memory 10 . In an implementation manner, the method 500 for automatically generating meeting minutes may not include this step.

步骤S507，转换步骤：转换模块12将分割得到的多个语音数据片段对应的语音信号转换为包含多个段落的文字。本实施方式中，转换模块12根据所述多个语音数据片段对应的语音信号、辨识模块13识别到的一或多个用户以及存储器10中存储的语音文字数据库，将所述多个语音数据片段对应的语音信号转换为包含与各语音数据片段一一对应的多个段落的文字。 Step S507, conversion step: the conversion module 12 converts the voice signals corresponding to the segmented multiple voice data segments into texts including multiple paragraphs. In this embodiment, the conversion module 12 converts the plurality of voice data segments to the corresponding voice signal according to the voice signals corresponding to the multiple voice data segments, one or more users identified by the recognition module 13, and the voice and text database stored in the memory 10. The corresponding voice signal is converted into text including a plurality of paragraphs corresponding to each voice data segment.

步骤S508，生成步骤：生成模块16根据转换得到的所述包含多个段落的文字以及存储器10中存储的会议记录模板生成一原始会议记录。本步骤S509具体的方式与自动生成会议记录的方法400可以相同，在此就不在赘述。 Step S508 , generating step: the generating module 16 generates an original meeting record according to the converted text containing multiple paragraphs and the meeting record template stored in the memory 10 . The specific manner of this step S509 may be the same as the method 400 for automatically generating meeting minutes, and will not be repeated here.

在本实施方式中，在本步骤S508之后还可以执行自动生成会议记录的方法400中的步骤S406（校对编辑步骤）及步骤S407（发送步骤），在此就不再赘述。 In this embodiment, step S406 (proofreading and editing step) and step S407 (sending step) in the method 400 for automatically generating meeting minutes may be performed after step S508 , and details will not be repeated here.

如图6所示，是本发明一实施方式的自动生成会议记录的方法600的流程图。自动生成会议记录的方法600是在一会议记录装置（例如会议记录装置100）的会议记录功能被开启后，运行于该会议记录装置的。需要说明的是，图6所示的自动生成会议记录的方法600与图5及图4所示的自动生成会议记录的方法中所执行的步骤中，有一部分是相同或相类似的，因此，上述对图4中的自动生成会议记录的方法400以及对图5中的自动生成会议记录的方法500进行描述时，针对某步骤进行说明的一些替代的、可同时执行的其他实施方式也是适用于图6中的自动生成会议记录的方法600中相同或相类似的步骤，在此也不再一一赘述。自动生成会议记录的方法600可以开始于步骤S601。 As shown in FIG. 6 , it is a flowchart of a method 600 for automatically generating meeting minutes according to an embodiment of the present invention. The method 600 for automatically generating meeting minutes is run on a meeting recording device (such as the meeting recording device 100 ) after the meeting recording function is turned on. It should be noted that some of the steps performed in the method 600 for automatically generating meeting minutes shown in FIG. 6 and the methods for automatically generating meeting minutes shown in FIGS. 5 and 4 are the same or similar. Therefore, When describing the method 400 for automatically generating meeting minutes in FIG. 4 and the method 500 for automatically generating meeting minutes in FIG. 5 , some alternative implementations that can be executed simultaneously for a certain step are also applicable to The same or similar steps in the method 600 for automatically generating meeting minutes in FIG. 6 will not be repeated here. The method 600 for automatically generating meeting minutes may start at step S601.

步骤S601，接收步骤：语音输入单元20接收语音并将接收的语音转换为相应的语音信号。 Step S601, receiving step: the voice input unit 20 receives voice and converts the received voice into a corresponding voice signal.

步骤S602，录音步骤：录音模块11将所述语音信号录制成包含录音时间戳的语音数据，并将录制好的语音数据存储于存储器10。在一实施方式中，响应用户的选择，本步骤也可以省略，而直接执行步骤S603。 Step S602 , recording step: the recording module 11 records the voice signal into voice data including a recording time stamp, and stores the recorded voice data in the memory 10 . In one embodiment, in response to the user's selection, this step can also be omitted, and step S603 is directly executed.

步骤S603，辨识步骤：辨识模块13根据所述语音信号以及存储器10中存储的用户语音特征表，识别出所述语音信号中对应的一或多个用户。在一实施方式中，辨识模块13根据所述包含录音时间戳的语音数据以及存储器10中存储的用户语音特征表，识别出所述语音信号对应的一或多个用户。在另一实施方式中，自动生成会议记录的方法600也可以不包括本步骤。 Step S603, identification step: the identification module 13 identifies one or more users corresponding to the voice signal according to the voice signal and the user voice feature table stored in the memory 10 . In one embodiment, the identification module 13 identifies one or more users corresponding to the voice signal according to the voice data including the recording time stamp and the user voice feature table stored in the memory 10 . In another implementation manner, the method 600 for automatically generating meeting minutes may not include this step.

步骤S604，转换步骤：转换模块12将所述语音信号转换为包含所述录音时间戳及所述一或多个用户的用户名的文字。本实施方式中，转换模块12将所述语音信号转换为包含所述录音时间戳及所述一或多个用户的用户名的文字。转换模块12根据所述语音信号、录音模块11所录制的包含了录音时间戳的语音数据、辨识模块13识别到的一或多个用户以及存储器10中存储的语音文字数据库，将所述语音信号转换为包含了所述录音时间戳的文字，并在各用户的语音信号转换得到的文字的最前端自动添加对应的用户的用户名。在另一实施方式中，转换模块12根据所述语音信号、录音模块11所录制的包含了录音时间戳的语音数据以及存储器10中存储的语音文字数据库，将所述语音信号转换为包含了所述录音时间戳的文字。 Step S604, conversion step: the conversion module 12 converts the voice signal into text including the recording time stamp and the user names of the one or more users. In this embodiment, the conversion module 12 converts the voice signal into text including the recording time stamp and the user names of the one or more users. The conversion module 12 converts the voice signal according to the voice signal, the voice data recorded by the recording module 11 that includes the recording time stamp, one or more users recognized by the recognition module 13, and the voice text database stored in the memory 10. It is converted into text including the recording time stamp, and the corresponding user's username is automatically added to the front of the text obtained by converting the voice signal of each user. In another embodiment, the conversion module 12 converts the speech signal into a speech data containing the time stamp according to the speech signal, the speech data recorded by the recording module 11 and the speech and text database stored in the memory 10. Text describing the timestamp of the recording.

步骤S605，判断步骤：判断模块14根据转换后的所述文字，判断是否有相邻的文字对应的录音时间戳所记载的时间间隔达到一预设值，如果是，则执行步骤S606，否则，流程结束。在一实施方式中，所述预设值为3秒。所述包含所述录音时间戳的相邻的文字中可能包含有多个时间间隔达到该预设值的。 Step S605, judging step: the judging module 14 judges whether the time interval recorded in the recording time stamp corresponding to the adjacent text reaches a preset value according to the converted text, if yes, execute step S606, otherwise, The process ends. In one embodiment, the preset value is 3 seconds. The adjacent text containing the recording time stamp may contain multiple time intervals reaching the preset value.

步骤S606，分割步骤：分割模块18将所述对应的录音时间戳所记载的时间间隔达到所述预设值的相邻的文字为界划分文字段落。本实施方式中，具体的，该相邻的文字分别被划分到前一个段落以及相邻的后一个段落，直至所有的对应的录音时间戳所记载的时间间隔达到所述预设值的各相邻的文字均被划分到不同的段落。 Step S606, segmenting step: the segmenting module 18 divides text into paragraphs by dividing the adjacent text whose time interval recorded in the corresponding recording time stamp reaches the preset value. In this embodiment, specifically, the adjacent text is divided into the previous paragraph and the next adjacent paragraph, until the time intervals recorded in all corresponding recording time stamps reach the preset value of each phase. Adjacent text is divided into different paragraphs.

步骤S607，生成步骤：生成模块16根据划分段落后的所述文字以及存储器10中存储的会议记录模板生成一原始会议记录。本步骤S607具体的方式与自动生成会议记录的方法500可以相同，在此就不在赘述。 Step S607, generating step: the generating module 16 generates an original meeting record according to the text after the paragraphs are divided and the meeting record template stored in the memory 10 . The specific manner of this step S607 may be the same as the method 500 for automatically generating meeting minutes, and will not be repeated here.

如图7所示，是本发明一实施方式的自动生成会议记录的方法700的流程图。自动生成会议记录的方法700是在一会议记录装置（例如会议记录装置100）的会议记录功能被开启后，运行于该会议记录装置的。需要说明的是，图7所示的自动生成会议记录的方法700与图5及图4所示的自动生成会议记录的方法中所执行的步骤中，有一部分是相同或相类似的，因此，上述对图4中的自动生成会议记录的方法400以及对图5中的自动生成会议记录的方法500进行描述时，针对某步骤进行说明的一些替代的、可同时执行的其他实施方式也是适用于图7中的自动生成会议记录的方法700中相同或相类似的步骤，在此也不再一一赘述。本自动生成会议记录的方法700可以开始于步骤S701。 As shown in FIG. 7 , it is a flowchart of a method 700 for automatically generating meeting minutes according to an embodiment of the present invention. The method 700 for automatically generating meeting minutes is executed on a meeting recording device (such as the meeting recording device 100 ) after the meeting recording function is turned on. It should be noted that some of the steps performed in the method 700 for automatically generating meeting minutes shown in FIG. 7 and the methods for automatically generating meeting minutes shown in FIGS. 5 and 4 are the same or similar. Therefore, When describing the method 400 for automatically generating meeting minutes in FIG. 4 and the method 500 for automatically generating meeting minutes in FIG. 5 , some alternative implementations that can be executed simultaneously for a certain step are also applicable to The same or similar steps in the method 700 for automatically generating meeting minutes in FIG. 7 will not be repeated here. The method 700 for automatically generating meeting minutes may start at step S701.

步骤S701，建库步骤：控制模块19建立一包含常用语及其校正对象的常用语数据库，并将所述常用语数据库存储于存储器10中。本实施方式中，可以是当会议记录装置100为首次使用自动生成会议记录的功能时，控制模块19自动建立所述常用语数据库。所述常用语数据库中包含至少一常用语及其校正对象的对应关系，每一常用语至少与一校正对象对应。所述常用语包括了以下中的一或多种：常用字、常用词、常用句子等，还可以是语音数据或文字数据。每一常用语的校正对象可以是在用户手动编辑、修改会议记录过程中累积、记载下来的。校正对象包括以下语音数据和/或文字数据中的以下中的一或多种：字、词、句子等。 Step S701 , database building step: the control module 19 creates a database of commonly used expressions including common expressions and correction objects, and stores the database of commonly used expressions in the memory 10 . In this embodiment, when the conference recording device 100 uses the function of automatically generating conference records for the first time, the control module 19 automatically establishes the common phrase database. The common phrase database includes at least one common phrase and its corresponding relationship with its correction object, and each common phrase corresponds to at least one correction object. The common words include one or more of the following: common words, common words, common sentences, etc., and may also be voice data or text data. The correction object of each commonly used term can be accumulated and recorded during the process of manual editing and modification of meeting minutes by the user. The correction object includes one or more of the following speech data and/or text data: characters, words, sentences, etc.

在另一实施方式中，本自动生成会议记录的方法700还可以不包括本步骤S701。而是在该会议记录装置中预先存储有一常用语数据库，常用语数据库是在会议记录装置100执行其自动生成会议记录的功能的过程中，累积、筛选存储的，也可以是从一常用语数据库中下载并存储的。 In another implementation manner, the method 700 for automatically generating meeting minutes may not include this step S701. Instead, a database of commonly used words is pre-stored in the meeting recording device, and the database of commonly used words is accumulated, screened and stored during the process of the meeting recording device 100 performing its function of automatically generating meeting records, or it can be obtained from a database of commonly used words. downloaded and stored in .

步骤S702，接收步骤：语音输入单元20接收语音并将接收的语音转换为相应的语音信号。 Step S702, receiving step: the voice input unit 20 receives the voice and converts the received voice into a corresponding voice signal.

步骤S703，转换步骤：转换模块12将所述语音信号转换为文字。在一实施方式中，还可以包括自动生成会议记录的方法400、500及600中任一方法中所包含的接收步骤至转换步骤之间的其他步骤。即，可以包含前面所描述的各种实施方式的将语音信号转换为文字的步骤。 Step S703, conversion step: the conversion module 12 converts the voice signal into text. In one embodiment, other steps between the receiving step and the transforming step included in any one of the methods 400, 500 and 600 for automatically generating meeting minutes may also be included. That is, the step of converting the voice signal into text in the various embodiments described above may be included.

步骤S704，识别存储常用词步骤：判断模块14在识别判断出所述语音数据和/或所述文字中包含重复出现一预设次数的词句时，将所述重复出现该预设次数的词句作为常用语存储于所述常用语数据库中。本实施方式中，重复出现该预设次数的词句可以为字、词、句子等语音数据和/或文字数据。在一实施方式中，本步骤S704还可以省略。所述预设次数为20次。 Step S704, step of identifying and storing commonly used words: when the judging module 14 recognizes and judges that the voice data and/or the text contain words and sentences that repeat a preset number of times, use the words and sentences that repeat the preset number of times as Common words are stored in the common language database. In this embodiment, the words and sentences that appear repeatedly for the preset number of times may be voice data and/or text data such as words, phrases, and sentences. In an implementation manner, this step S704 may also be omitted. The preset number of times is 20 times.

步骤S705，判断步骤：判断模块14判断转换后的所述文字否包含一校正对象，如果是，则执行步骤S706，否则，流程结束。 Step S705, judging step: the judging module 14 judges whether the converted text contains a correction object, if yes, execute step S706, otherwise, the process ends.

步骤S706，校对步骤：校对编辑模块15根据所述常用语数据库自动将所述文字包含的校正对象校正为对应的常用语。在一实施方中，本步骤S706还可以在步骤S707之后执行。 Step S706, proofreading step: the proofreading and editing module 15 automatically corrects the correction objects contained in the text into corresponding common phrases according to the common phrase database. In an embodiment, this step S706 may also be performed after step S707.

步骤S707，生成步骤：生成模块16根据校正后的所述文字以及存储器10中存储的会议记录模板生成一原始会议记录。本步骤S707具体的方式与自动生成会议记录的方法500可以相同，在此就不在赘述。 Step S707 , generating step: the generating module 16 generates an original meeting record according to the corrected text and the meeting record template stored in the memory 10 . The specific manner of this step S707 may be the same as the method 500 for automatically generating meeting minutes, and will not be repeated here.

本发明提供的上述会议记录装置100及其自动生成会议记录的方法，可根据预设的会议记录模板自动生成会议记录，并可对会议记录进行智能的语音文字识别、内容格式化及编辑校对。而且，还可以根据预设的规则将会议记录发送至相关人员。因而，相较于现有的方式更省时、方便及人性化。 The meeting recording device 100 and the method for automatically generating meeting minutes provided by the present invention can automatically generate meeting minutes according to a preset meeting record template, and can perform intelligent speech and text recognition, content formatting, editing and proofreading on the meeting records. Moreover, meeting minutes can also be sent to relevant personnel according to preset rules. Therefore, compared with the existing methods, it is more time-saving, convenient and humanized.

最后应说明的是，以上实施例仅用以说明本发明的技术方案而非限制，尽管参照较佳实施例对本发明进行了详细说明，本领域的普通技术人员应当理解，可以对本发明的技术方案进行修改或等同替换，而不脱离本发明技术方案的精神和范围。 Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention without limitation. Although the present invention has been described in detail with reference to the preferred embodiments, those of ordinary skill in the art should understand that the technical solutions of the present invention can be Modifications or equivalent replacements can be made without departing from the spirit and scope of the technical solutions of the present invention.

Claims

1. A method for automatically generating meeting minutes, running in at least one device comprising a memory and a processor, characterized in that the method comprises the following steps of being executed by a module stored in the memory controlled by the processor:

Recognition step: recognize the silent segment in the speech data;

Judging step: judging whether the elapsed time of the silent segment is greater than a preset value;

Segmentation step: Segment the speech data or the text obtained by converting the speech data, taking the silent segment whose elapsed time is greater than the preset value as the boundary; and

Generating step: generating an original conference record according to the speech data or the situation of the text being segmented and the conference record template stored in the memory.

2. The method according to claim 1, further comprising an editing step: editing the original meeting minutes according to preset proofreading and editing rules to obtain a meeting record.

3. The method of claim 1, wherein:

The identifying step is: identifying the silent segment in the voice data according to the voice signal corresponding to the voice data;

The dividing step is: when it is judged that the elapsed time of the silent segment is greater than a preset value, dividing the voice data into a plurality of voice data segments according to the silent segment;

After the segmentation step, a conversion step is also included: converting the voice signals corresponding to the segmented multiple voice data segments into text comprising multiple paragraphs;

The generating step is: generating an original meeting record according to the converted text containing multiple paragraphs and the meeting record template stored in the memory.

4. The method according to claim 3, further comprising an identification step: according to the voice signals corresponding to a plurality of voice data segments obtained by segmentation and the user voice feature table stored in the memory, identify the one or more users corresponding to the plurality of voice data segments; and

The conversion steps include:

According to the voice signals corresponding to the multiple voice data segments and the voice-text database stored in the memory, convert the voice signals corresponding to the multiple voice data segments into a plurality of paragraphs corresponding to each voice data segment one-to-one the text of; and

The user name of the corresponding user is automatically added to a preset position of the text corresponding to the voice signal of the one or more users.

5. The method according to claim 3 or 4, wherein: the silent segment is a segment of silent data in the voice data; the method also includes a recording step: according to the corresponding voice of the voice data The signal records voice data, and stores the recorded voice data in a memory.

6. The method according to claim 1 or 2, further comprising:

Recording step: recording voice data according to a voice signal corresponding to the voice data; the recorded voice data includes a recording time stamp;

Converting step: converting the voice signal into text containing the recording time stamp;

The judging step is: according to the converted text, it is judged whether the time interval recorded in the recording time stamp corresponding to the adjacent text reaches the preset value;

The segmentation step is: when the time interval recorded in the recording time stamp corresponding to the adjacent text reaches the preset value, divide the time interval recorded in the corresponding recording time stamp into the preset value. The adjacent text is bounded to demarcate paragraphs of text.

7. A meeting recording device, comprising a memory and a processor, characterized in that it also includes the following modules controlled by the processor and stored in the memory:

An identification module, configured to identify silent segments in the speech data;

A judging module, configured to judge whether the elapsed time of the silent segment is greater than a preset value;

A segmentation module, configured to segment the speech data or the text obtained by converting the speech data with the silent segment whose elapsed time is greater than the preset value as a boundary; and

A generating module, configured to generate an original meeting record according to the speech data or the situation in which the text is segmented and the meeting record template stored in the memory.

8. The meeting record device according to claim 7, further comprising a proofreading and editing module, configured to edit the original meeting record according to preset proofreading and editing rules to obtain a meeting record.

9. The conference recording device according to claim 7, characterized in that:

The identification module recognizes the silent segment in the voice data according to the voice signal corresponding to the voice data;

When the segmentation module judges that the elapsed time of the silent segment is greater than a preset value, the voice data is divided into a plurality of voice data segments according to the silent segment;

The meeting recording device also includes a conversion module, which is used to convert the voice signals corresponding to the segmented multiple voice data segments into text comprising multiple paragraphs;

The generating module generates an original meeting record according to the converted text containing multiple paragraphs and the meeting record template stored in the memory.

10. The conference recording device according to claim 9, characterized in that:

The identification module is also used to identify one or more users corresponding to the multiple voice data segments according to the voice signals corresponding to the multiple voice data segments obtained through segmentation and the user voice feature table stored in the memory;

The conversion module is also used to:

11. The conference recording device according to claim 9 or 10, wherein: the silent segment is a segment of silent data in the voice data; the conference recording device also includes a recording module for record the voice data corresponding to the voice signal corresponding to the voice data, and store the recorded voice data in the memory.

12. The conference recording device according to claim 7 or 8, further comprising a recording module and a conversion module, wherein:

The recording module is used to record voice data according to a voice signal corresponding to the voice data; the recorded voice data includes a recording time stamp;

The conversion module is used to convert the voice signal into text containing the recording time stamp;

The judgment module judges whether the time interval recorded in the recording time stamp corresponding to the adjacent text reaches the preset value according to the converted text;

When the time interval recorded in the recording time stamp corresponding to the adjacent text reaches the preset value, the segmentation module will record the time interval recorded in the corresponding recording time stamp reaching the preset value. Text boundaries divide text paragraphs.