CN112115696A

CN112115696A - Data processing method and device and recording equipment

Info

Publication number: CN112115696A
Application number: CN202010988800.4A
Authority: CN
Inventors: 崔文华; 路呈璋; 李健涛
Original assignee: Beijing Sogou Technology Development Co Ltd
Current assignee: Beijing Sogou Technology Development Co Ltd
Priority date: 2020-09-18
Filing date: 2020-09-18
Publication date: 2020-12-22

Abstract

Embodiments of the present invention provide a data processing method, device, and recording device, wherein the method includes: acquiring a target image by a recording device; performing text recognition on the target image, determining corresponding text information, and determining the text information The typesetting information in the target image; the text information is displayed according to the typesetting information of the text information in the target image; the typesetting of the displayed text information can be made the same or similar to the typesetting in the target image, which is convenient for users The reading and comprehension of the information in the image; thus improving the user experience.

Description

A data processing method, device and recording device

技术领域technical field

本发明涉及数据处理技术领域，特别是涉及一种数据处理方法、装置和录音设备。The present invention relates to the technical field of data processing, in particular to a data processing method, device and recording device.

背景技术Background technique

近年来，录音设备作为专业领域的产品，发展迅速并进入大众领域。记者、学生、教师等各种群体，通常都需要录音设备进行录音。此外各种电视节目、电影、音乐等录制也需要使用到录音设备。In recent years, recording equipment, as a product in the professional field, has developed rapidly and entered the public field. Reporters, students, teachers and other groups usually need recording equipment for recording. In addition, the recording of various TV programs, movies, music, etc. also requires the use of recording equipment.

随着录音设备使用的普遍性，用户对录音设备的功能也逐渐提高；目前录音设备的功能无法满足用户需求，导致用户使用体验差。With the popularization of the use of recording devices, the functions of the recording devices are gradually improved by users; at present, the functions of the recording devices cannot meet the needs of users, resulting in poor user experience.

发明内容SUMMARY OF THE INVENTION

本发明实施例提供一种数据处理方法，以便于用户对图像中信息的阅读和理解。The embodiment of the present invention provides a data processing method, so as to facilitate the user's reading and understanding of the information in the image.

相应的，本发明实施例还提供了一种数据处理装置和一种录音设备，用以保证上述方法的实现及应用。Correspondingly, the embodiments of the present invention also provide a data processing apparatus and a recording device, so as to ensure the implementation and application of the above method.

为了解决上述问题，本发明实施例公开了一种数据处理方法，具体包括：录音设备获取目标图像；对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；依据所述文本信息在目标图像中的排版信息，展示所述文本信息。In order to solve the above problem, an embodiment of the present invention discloses a data processing method, which specifically includes: acquiring a target image by a recording device; performing text recognition on the target image, determining corresponding text information, and determining that the text information is in the target image. The typesetting information in the image; the text information is displayed according to the typesetting information of the text information in the target image.

可选地，所述文本信息包括多个文本，所述确定所述文本信息在所述目标图像中的排版信息，包括：分别记录各文本在目标图像中对应的行信息，以及各文本所在行的位置信息；依据所述文本信息中各文本的行信息和位置信息，生成所述文本信息在所述目标图像中的排版信息。Optionally, the text information includes a plurality of texts, and the determining the typesetting information of the text information in the target image includes: respectively recording line information corresponding to each text in the target image, and the line where each text is located. The position information of the text information; according to the line information and position information of each text in the text information, the typesetting information of the text information in the target image is generated.

可选地，所述依据所述文本信息在目标图像中的排版信息，展示所述文本信息，包括：按照所述文本信息中各文本的的行信息和位置信息，对所述文本信息进行段落划分；分段落展示所述文本信息。Optionally, the displaying the text information according to the typesetting information of the text information in the target image includes: performing a paragraph on the text information according to the line information and position information of each text in the text information. Divide; display the textual information in paragraphs.

可选地，所述依据所述文本信息在目标图像中的排版信息，展示所述文本信息，包括：按照文本信息中各文本的行信息和位置信息，控制所述文本信息中各文本按照与所述文本在图像中相同的排版展示。Optionally, displaying the text information according to the typesetting information of the text information in the target image includes: controlling each text in the text information according to the line information and position information of each text in the text information according to the The text is shown in the same typography in the image.

可选地，所述的方法还包括：对所述文本信息进行翻译，得到对应的翻译结果并展示所述翻译结果。Optionally, the method further includes: translating the text information, obtaining a corresponding translation result, and displaying the translation result.

可选地，所述翻译结果包括：图片翻译结果和/或文本翻译结果。Optionally, the translation results include: picture translation results and/or text translation results.

可选地，所述的方法还包括：接收传输指令，所述传输指令包括以下至少一种：分享指令、转发指令和转存指令；将所述传输指令对应的数据，传输至其他设备；所述传输指令对应的数据包括以下至少一种：目标图像、文本信息和翻译结果。Optionally, the method further includes: receiving a transmission instruction, where the transmission instruction includes at least one of the following: a sharing instruction, a forwarding instruction, and a dump instruction; transmitting the data corresponding to the transmission instruction to other devices; The data corresponding to the transmission instruction includes at least one of the following: target image, text information and translation result.

可选地，所述的方法还包括：获取目标音频数据，所述目标音频数据与所述目标图像关联，所述目标图像是录音设备在录制目标音频数据过程中采集的；依据所述文本信息对所述目标音频数据进行语音识别，确定对应语音识别结果。Optionally, the method further includes: acquiring target audio data, the target audio data is associated with the target image, and the target image is collected by a recording device in the process of recording the target audio data; according to the text information Perform speech recognition on the target audio data to determine a corresponding speech recognition result.

本发明实施例还公开了一种数据处理装置，应用于录音设备中，具体包括：图像获取模块，用于获取目标图像；文本识别模块，用于对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；展示模块，用于依据所述文本信息在目标图像中的排版信息，展示所述文本信息。The embodiment of the present invention also discloses a data processing device, which is applied to a recording device, and specifically includes: an image acquisition module, used to acquire a target image; a text recognition module, used to perform text recognition on the target image, and determine the corresponding text information and determining the typesetting information of the text information in the target image; a display module, configured to display the text information according to the typesetting information of the text information in the target image.

可选地，所述文本信息包括多个文本，所述文本识别模块，用于分别记录各文本在目标图像中对应的行信息，以及各文本所在行的位置信息；依据所述文本信息中各文本的行信息和位置信息，生成所述文本信息在所述目标图像中的排版信息。Optionally, the text information includes a plurality of texts, and the text recognition module is used to respectively record the line information corresponding to each text in the target image, and the position information of the line where each text is located; The line information and position information of the text are used to generate the typesetting information of the text information in the target image.

可选地，所述展示模块，包括：第一排版展示子模块，用于按照所述文本信息中各文本的的行信息和位置信息，对所述文本信息进行段落划分；分段落展示所述文本信息。Optionally, the display module includes: a first typesetting display sub-module, configured to divide the text information into paragraphs according to the line information and position information of each text in the text information; text information.

可选地，所述展示模块，包括：第二排版展示子模块，用于按照文本信息中各文本的行信息和位置信息，控制所述文本信息中各文本按照与所述文本在图像中相同的排版展示。Optionally, the display module includes: a second typesetting display sub-module, configured to control each text in the text information to be the same as the text in the image according to the line information and position information of each text in the text information. typography display.

可选地，所述的装置还包括：翻译模块，用于对所述文本信息进行翻译，得到对应的翻译结果并展示所述翻译结果。Optionally, the apparatus further includes: a translation module, configured to translate the text information, obtain a corresponding translation result, and display the translation result.

可选地，所述的装置还包括：数据传输模块，用于接收传输指令，所述传输指令包括以下至少一种：分享指令、转发指令和转存指令；将所述传输指令对应的数据，传输至其他设备；所述传输指令对应的数据包括以下至少一种：目标图像、文本信息和翻译结果。Optionally, the device further includes: a data transmission module, configured to receive a transmission instruction, where the transmission instruction includes at least one of the following: a sharing instruction, a forwarding instruction, and a dump instruction; the data corresponding to the transmission instruction, The data is transmitted to other devices; the data corresponding to the transmission instruction includes at least one of the following: target image, text information and translation result.

可选地，所述的装置还包括：语音识别模块，用于获取目标音频数据，所述目标音频数据与所述目标图像关联，所述目标图像是录音设备在录制目标音频数据过程中采集的；依据所述文本信息对所述目标音频数据进行语音识别，确定对应语音识别结果。Optionally, the device further includes: a speech recognition module for acquiring target audio data, the target audio data is associated with the target image, and the target image is collected by the recording device during the recording of the target audio data ; Perform speech recognition on the target audio data according to the text information, and determine a corresponding speech recognition result.

本发明实施例还公开了一种可读存储介质，当所述存储介质中的指令由录音设备的处理器执行时，使得录音设备能够执行如本发明实施例任一所述的数据处理方法。The embodiment of the present invention also discloses a readable storage medium, when the instructions in the storage medium are executed by the processor of the recording device, the recording device can execute the data processing method according to any one of the embodiments of the present invention.

本发明实施例还公开了一种录音设备，包括有存储器，以及一个或者一个以上的程序，其中一个或者一个以上程序存储于存储器中，且经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令：获取目标图像；对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；依据所述文本信息在目标图像中的排版信息，展示所述文本信息。An embodiment of the present invention also discloses a recording device including a memory and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by one or more processors. Or one or more programs include instructions for performing the following operations: acquiring a target image; performing text recognition on the target image, determining corresponding text information and determining the typesetting information of the text information in the target image; The typesetting information of the text information in the target image, showing the text information.

可选地，还包含用于进行以下操作的指令：对所述文本信息进行翻译，得到对应的翻译结果并展示所述翻译结果。Optionally, it also includes instructions for performing the following operations: translating the text information, obtaining a corresponding translation result, and displaying the translation result.

可选地，还包含用于进行以下操作的指令：接收传输指令，所述传输指令包括以下至少一种：分享指令、转发指令和转存指令；将所述传输指令对应的数据，传输至其他设备；所述传输指令对应的数据包括以下至少一种：目标图像、文本信息和翻译结果。Optionally, it also includes an instruction for performing the following operations: receiving a transmission instruction, where the transmission instruction includes at least one of the following: a sharing instruction, a forwarding instruction, and a dump instruction; transmitting the data corresponding to the transmission instruction to other equipment; the data corresponding to the transmission instruction includes at least one of the following: target image, text information and translation result.

可选地，还包含用于进行以下操作的指令：获取目标音频数据，所述目标音频数据与所述目标图像关联，所述目标图像是录音设备在录制目标音频数据过程中采集的；依据所述文本信息对所述目标音频数据进行语音识别，确定对应语音识别结果。Optionally, it also includes instructions for performing the following operations: acquiring target audio data, the target audio data is associated with the target image, and the target image is collected by the recording device in the process of recording the target audio data; The text information is used to perform speech recognition on the target audio data, and the corresponding speech recognition result is determined.

本发明实施例包括以下优点：The embodiments of the present invention include the following advantages:

本发明实施例中，可以在录音设备中增加图像的文本识别功能；进而当用户使用录音设备针对目标图像进行文本识别时，录音设备可以获取目标图像；然后对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；并依据所述文本信息在目标图像中的排版信息，展示所述文本信息；进而能够使得展示的文本信息的排版与其在目标图像中的排版相同或相似，便于用户对图像中信息的阅读和理解；从而提高用户的使用体验。In the embodiment of the present invention, the text recognition function of the image can be added to the recording device; then when the user uses the recording device to perform text recognition on the target image, the recording device can obtain the target image; then perform text recognition on the target image, determine Corresponding text information and determine the typesetting information of the text information in the target image; and display the text information according to the typesetting information of the text information in the target image; and then make the typesetting of the displayed text information match the same. The layout in the target image is the same or similar, which facilitates the user to read and understand the information in the image, thereby improving the user experience.

附图说明Description of drawings

图1是本发明的一种数据处理方法实施例的步骤流程图；Fig. 1 is the step flow chart of a kind of data processing method embodiment of the present invention;

图2是本发明的一种数据处理方法可选实施例的步骤流程图；Fig. 2 is a flow chart of steps of an optional embodiment of a data processing method of the present invention;

图3是本发明的另一种数据处理方法实施例的步骤流程图；3 is a flow chart of steps of another data processing method embodiment of the present invention;

图4是本发明的又一种数据处理方法实施例的步骤流程图；4 is a flow chart of steps of another data processing method embodiment of the present invention;

图5是本发明的再一种数据处理方法实施例的步骤流程图；Fig. 5 is a flow chart of steps of still another data processing method embodiment of the present invention;

图6是本发明的一种数据处理装置实施例的结构框图；6 is a structural block diagram of an embodiment of a data processing apparatus of the present invention;

图7是本发明的一种数据处理装置可选实施例的结构框图；7 is a structural block diagram of an optional embodiment of a data processing apparatus of the present invention;

图8根据一示例性实施例示出的一种用于数据处理的录音设备的结构框图。Fig. 8 shows a structural block diagram of a recording device for data processing according to an exemplary embodiment.

具体实施方式Detailed ways

为使本发明的上述目的、特征和优点能够更加明显易懂，下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more clearly understood, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments.

本发明实施例提供的一种数据处理方法，应用于录音设备中，所述录音设备可以是指具有录音功能的设备，如录音笔、翻译设备如翻译笔、翻译机等等；本发明实施例对此不作限制。A data processing method provided by an embodiment of the present invention is applied to a recording device, and the recording device may refer to a device with a recording function, such as a recording pen, a translation device such as a translation pen, a translator, etc.; the embodiment of the present invention There is no restriction on this.

本发明实施例中，可以在所述录音设备中设置图像采集模块，以在录音设备中增加图像采集功能；进而使得用户可以使用录音设备进行图像采集。所述录音设备中还设置有显示组件，所述显示组件可以包括显示屏，可以用于信息显示。In the embodiment of the present invention, an image acquisition module may be set in the recording device, so as to add an image acquisition function in the recording device; thus, the user can use the recording device to perform image acquisition. The recording device is also provided with a display component, and the display component may include a display screen, which may be used for information display.

很多情况下，用户获取图像后，需要对图像进行文本识别，以获取图像中的文本信息。因此，本发明实施例还在录音设备中增加了图像的文本识别功能，以满足用户的使用需求，提高用户体验。In many cases, after a user obtains an image, it is necessary to perform text recognition on the image to obtain text information in the image. Therefore, the embodiment of the present invention also adds a text recognition function of an image to the recording device, so as to meet the user's usage requirements and improve the user experience.

在上述基础上，本发明实施例的核心构思之一在于，在识别图像中文本的同时识别文本的排版；然后基于图像中文本的排版，对识别得到的文本信息进行展示；能够使得展示的文本信息的排版与其在目标图像中的排版相同或相似；进而能够便于用户对图像中信息的阅读和理解。On the above basis, one of the core concepts of the embodiments of the present invention is to recognize the typesetting of the text while recognizing the text in the image; then, based on the typesetting of the text in the image, the recognized text information is displayed; The typesetting of the information is the same as or similar to the typesetting of the information in the target image; thus, it is convenient for the user to read and understand the information in the image.

参照图1，示出了本发明的一种数据处理方法实施例的步骤流程图，具体可以包括如下步骤：Referring to FIG. 1, a flow chart of steps of an embodiment of a data processing method of the present invention is shown, which may specifically include the following steps:

步骤102、录音设备获取目标图像。Step 102: The recording device acquires the target image.

本发明实施例中，用户可以在录音设备中执行图像采集操作，对应的，录音设备可以接收到图像采集指令，然后可以调用其中设置的图像采集模块进行图像采集，获取目标图像。当然，用户也可以从录音设备存储的图像中，选取需要进行文本识别的图像，作为目标图像。In this embodiment of the present invention, a user may perform an image acquisition operation in a recording device, and correspondingly, the recording device may receive an image acquisition instruction, and then call an image acquisition module set therein to perform image acquisition to acquire a target image. Of course, the user can also select an image that needs text recognition from the images stored in the recording device as the target image.

其中，录音设备中存储的目标图像，可以是预先由录音设备调用其中设置的图像采集模块采集并存储的；也可以是由其他设备发送给录音设备后，录音设备存储的，本发明对此也不作限制。所述其他设备可以是指除录音设备之外的设备。The target image stored in the recording device may be pre-collected and stored by the recording device calling the image acquisition module set in the recording device; it may also be stored by the recording device after being sent to the recording device by other devices. No restrictions apply. The other devices may refer to devices other than recording devices.

步骤104、对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息。Step 104: Perform text recognition on the target image, determine corresponding text information, and determine the typesetting information of the text information in the target image.

在获取目标图像后，可以对目标图像进行文本识别，确定目标图像对应的文本信息；以及在对目标图像进行文本识别的同时，确定文本信息在目标图像中的排版信息。其中，所述排版信息可以用于描述目标图像中文本排版。After acquiring the target image, text recognition can be performed on the target image to determine text information corresponding to the target image; and while the text recognition is performed on the target image, typesetting information of the text information in the target image can be determined. The typesetting information may be used to describe the typesetting of text in the target image.

步骤106、依据所述文本信息在目标图像中的排版信息，展示所述文本信息。Step 106: Display the text information according to the typesetting information of the text information in the target image.

本发明实施例中，在显示屏中展示文本信息时，可以依据所述文本信息在目标图像中的排版信息，控制文本信息按照与其在图像中相同或相似的排版进行展示。In the embodiment of the present invention, when displaying text information on the display screen, the text information can be controlled to be displayed according to the same or similar typesetting as the typesetting information in the target image according to the typesetting information of the text information in the target image.

其中，所述排版信息可以用于表征文本信息在目标图像中的排版，如可以表征文本信息所在的段落、文本信息在其所在段落的行、各行文本信息之间的行间距，同一行文本信息中各文本之间的间距等等。The typesetting information can be used to represent the typesetting of the text information in the target image, for example, it can represent the paragraph where the text information is located, the line of the paragraph where the text information is located, the line spacing between each line of text information, the same line of text information space between texts, etc.

其中，本发明实施例中，可以是由录音设备执行步骤102-步骤106；也可以是由录音设备执行步骤102后，将目标图像发送至服务器，由服务器执行步骤104；然后再将目标图像的文本信息和排版信息返回给录音设备，由录音设备执行步骤106；本发明实施例对此不作限制。Wherein, in this embodiment of the present invention, steps 102 to 106 may be executed by the recording device; or after the recording device executes step 102, the target image is sent to the server, and the server executes step 104; The text information and typesetting information are returned to the recording device, and the recording device performs step 106; this is not limited in this embodiment of the present invention.

综上，本发明实施例中，可以在录音设备中增加图像的文本识别功能；进而当用户使用录音设备针对目标图像进行文本识别时，录音设备可以获取目标图像；然后对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；并依据所述文本信息在目标图像中的排版信息，展示所述文本信息；进而能够使得展示的文本信息的排版与其在目标图像中的排版相同或相似，便于用户对图像中信息的阅读和理解；从而提高用户的使用体验。To sum up, in the embodiment of the present invention, the text recognition function of the image can be added to the recording device; and when the user uses the recording device to perform text recognition on the target image, the recording device can obtain the target image; Identify, determine the corresponding text information and determine the typesetting information of the text information in the target image; and display the text information according to the typesetting information of the text information in the target image; and then make the displayed text information The typesetting of the image is the same as or similar to the typesetting in the target image, which is convenient for the user to read and understand the information in the image; thereby improving the user's experience.

本发明实施例中，可以依据所述文本信息在目标图像中的排版信息，控制所述文本信息按照与所述文本信息在图像中相似的排版展示。具体如下：In this embodiment of the present invention, according to the typesetting information of the text information in the target image, the text information can be controlled to be displayed according to the typesetting similar to that of the text information in the image. details as follows:

参照图2，示出了本发明的一种数据处理方法可选实施例的步骤流程图，具体可以包括如下步骤：Referring to FIG. 2, a flowchart of steps of an optional embodiment of a data processing method of the present invention is shown, which may specifically include the following steps:

步骤202、录音设备获取目标图像。Step 202, the recording device acquires the target image.

本发明实施例中，目标图像可以是在录音设备在录制音频数据的过程中，调用其中设置的图像采集模块采集的；也可以是独立于录音设备的录音过程，调用其中设置的图像采集模块采集的，本发明实施例对此不作限制。In the embodiment of the present invention, the target image may be collected by invoking the image acquisition module set in the recording device during the process of recording audio data; or it may be acquired by invoking the image acquisition module set in the recording process independent of the recording device. is not limited in this embodiment of the present invention.

步骤204、对所述目标图像进行文本识别，确定对应的文本信息。Step 204: Perform text recognition on the target image to determine corresponding text information.

其中，可以采用OCR(Optical Character Recognition，光学字符识别)识别技术，对所述目标图像的各文本进行文本识别，得到目标图像对应的文本信息。Wherein, an OCR (Optical Character Recognition, Optical Character Recognition) recognition technology may be used to perform text recognition on each text of the target image to obtain text information corresponding to the target image.

其中，所述文本信息包括多个文本，所述文本可以包括文字和字符。Wherein, the text information includes a plurality of texts, and the texts may include characters and characters.

步骤206、分别记录所述文本信息中各文本在目标图像中对应的行信息，以及各文本所在行的位置信息。Step 206 , respectively record the line information corresponding to each text in the target image in the text information, and the position information of the line where each text is located.

步骤208、依据所述文本信息中各文本的行信息和位置信息，生成所述文本信息在所述目标图像中的排版信息。Step 208: Generate typesetting information of the text information in the target image according to the line information and position information of each text in the text information.

其中，本发明实施例在执行步骤204的同时，可以执行步骤206-步骤208。Wherein, in this embodiment of the present invention, when step 204 is performed, steps 206 to 208 may be performed.

本发明实施例中，可以在识别每个文本的同时，记录该文本在目标图像中的行信息，以及该文本在每行中的位置信息。所述行信息可以包括行序号如第1行、第2行等；也可以包括行坐标，所述行坐标可以包括该行四个顶点的像素点坐标。所述位置信息的表示方式可以包括多种，如可以包括：文本最左侧点的像素点坐标、文本最右侧点的像素点坐标、文本最上侧点的像素点坐标、文本最下侧点的像素点坐标；其中。又例如，还可以包括：文本中心位置的坐标。还例如，可以包括文本在其所在行的文本序号，如第1个、第2个等，本发明实施例对此不作限制。In this embodiment of the present invention, while identifying each text, the line information of the text in the target image and the position information of the text in each line can be recorded. The row information may include row serial numbers, such as row 1, row 2, etc.; and may also include row coordinates, and the row coordinates may include pixel coordinates of four vertices of the row. The position information can be represented in a variety of ways, such as: the pixel coordinates of the leftmost point of the text, the pixel coordinates of the rightmost point of the text, the pixel coordinates of the uppermost point of the text, and the lowermost point of the text. The pixel coordinates of ; where . For another example, it may also include: the coordinates of the center position of the text. Also for example, the text sequence number of the line where the text is located may be included, such as the first, the second, etc., which is not limited in this embodiment of the present invention.

本发明实施例中，一种依据所述文本信息在目标图像中的排版信息，展示所述文本信息的方式可以参照步骤In this embodiment of the present invention, a method for displaying the text information according to the typesetting information of the text information in the target image may refer to the steps

步骤210、按照所述文本信息中各文本的的行信息和位置信息，对所述文本信息进行段落划分。Step 210: Divide the text information into paragraphs according to the line information and position information of each text in the text information.

步骤212、分段落展示所述文本信息。Step 212 , displaying the text information in paragraphs.

本发明实施例中，可以按照文本信息中各文本的行信息和位置信息，将从目标图像中识别的所有文本信息进行段落划分，将文本信息划分为多个段落。其中，针对文本信息中的每个文本，可先根据该文本的行信息，确定该文本信息所在的行；然后可以依据该文本的位置信息，判断该文本是否是该文本所在行的第一个文本。若该文本是其所在行的第一个文本，则根据该文本的位置信息，判断该文本之前是否存在至少两个空格；当该文本之前存在至少两个空格，则可以确定该文本所在行是一个段落的起始行，该文本为一个段落起始行的起始文本。则可以将该文本所在行的上一行，作为该文本所在段落的上一个段落作为上一个段落的结束行。然后可以将文本信息划分为多个段落展示；展示的段落数量，与其在目标图像中的段落数量一致。In this embodiment of the present invention, all the text information identified from the target image can be divided into paragraphs according to the line information and position information of each text in the text information, and the text information can be divided into multiple paragraphs. Wherein, for each text in the text information, the row where the text information is located can be determined first according to the row information of the text; and then it can be determined whether the text is the first row of the row where the text is located according to the position information of the text text. If the text is the first text of the line where it is located, it is determined whether there are at least two spaces before the text according to the position information of the text; when there are at least two spaces before the text, it can be determined that the line where the text is located is The starting line of a paragraph, the text is the starting text of the starting line of a paragraph. Then the previous line of the line where the text is located can be used as the previous paragraph of the paragraph where the text is located as the end line of the previous paragraph. The text information can then be divided into multiple paragraphs for display; the number of paragraphs displayed is the same as the number of paragraphs in the target image.

综上，本发明实施例中，可以按照所述文本信息中各文本的的行信息和位置信息，对所述文本信息进行段落划分；分段落展示所述文本信息；进而能够使得文本信息展示的段落数量，与其在目标图像中的段落数量一致，便于用户清楚的分辨每个段落，进一步方便用户对图像中信息的阅读和理解。To sum up, in this embodiment of the present invention, the text information can be divided into paragraphs according to the line information and position information of each text in the text information; the text information is displayed in paragraphs; The number of paragraphs is consistent with the number of paragraphs in the target image, which is convenient for the user to clearly distinguish each paragraph, and further facilitates the user to read and understand the information in the image.

本发明实施例中，可以依据所述文本信息在目标图像中的排版信息，控制所述文本信息按照与所述文本信息在图像中相同的排版展示。具体如下：In this embodiment of the present invention, according to the typesetting information of the text information in the target image, the text information can be controlled to be displayed according to the same typesetting as the text information in the image. details as follows:

参照图3，示出了本发明的另一种数据处理方法实施例的步骤流程图，具体可以包括如下步骤：Referring to FIG. 3, a flowchart of steps of another data processing method embodiment of the present invention is shown, which may specifically include the following steps:

步骤302、录音设备获取目标图像。Step 302: The recording device acquires the target image.

步骤304、对所述目标图像进行文本识别，确定对应的文本信息。Step 304: Perform text recognition on the target image to determine corresponding text information.

步骤306、分别记录所述文本信息中各文本在目标图像中对应的行信息，以及各文本所在行的位置信息。Step 306 , respectively record the line information corresponding to each text in the target image in the text information, and the position information of the line where each text is located.

步骤302-步骤308，与上述步骤202-步骤308类似，在此不再赘述。Steps 302 to 308 are similar to the above-mentioned steps 202 to 308, and are not repeated here.

步骤310、按照文本信息中各文本的行信息和位置信息，控制所述文本信息中各文本按照与所述文本在图像中相同的排版展示。Step 310: Control each text in the text information to be displayed according to the same typesetting as the text in the image according to the line information and position information of each text in the text information.

然后可以按照文本信息中各文本的行信息和位置信息，控制所述文本信息中各文本按照与所述文本在图像中相同的排版展示；进而使得文本信息，与其在目标图像中完全相同的排版展示。Then, according to the line information and position information of each text in the text information, each text in the text information can be controlled to be displayed according to the same typesetting as the text in the image; further, the text information can be typed exactly the same as that in the target image. exhibit.

综上，本发明实施例中，可以按照文本信息中各文本的行信息和位置信息，控制所述文本信息中各文本按照与所述文本在图像中相同的排版展示；进而能够使得展示的文本信息，与其在目标图像中排版一致，进一步方便用户对图像中信息的阅读和理解。To sum up, in this embodiment of the present invention, according to the line information and position information of each text in the text information, each text in the text information can be controlled to be displayed according to the same typesetting as the text in the image; further, the displayed text can be displayed. The information, consistent with its typesetting in the target image, further facilitates the user's reading and understanding of the information in the image.

本发明实施例中，当目标图像中的文本信息所对应的语种，不是用户所熟练掌握的语种时，还可以对图像中的文本信息进行翻译，生成用户熟练掌握的语种对应的翻译结果，便于用户理解。In this embodiment of the present invention, when the language corresponding to the text information in the target image is not the language that the user is proficient in, the text information in the image can also be translated to generate a translation result corresponding to the language that the user is proficient in, which is convenient for the user. User understands.

参照图4，示出了本发明的又一种数据处理方法实施例的步骤流程图，具体可以包括如下步骤：Referring to FIG. 4 , a flowchart of steps of another data processing method embodiment of the present invention is shown, which may specifically include the following steps:

步骤402、录音设备获取目标图像。Step 402: The recording device acquires the target image.

步骤404、对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息。Step 404: Perform text recognition on the target image, determine corresponding text information, and determine the typesetting information of the text information in the target image.

步骤406、依据所述文本信息在目标图像中的排版信息，展示所述文本信息。Step 406: Display the text information according to the typesetting information of the text information in the target image.

其中，步骤402-步骤406，可以参照上述实施例，在此不再赘述。Wherein, for steps 402 to 406, reference may be made to the foregoing embodiments, and details are not described herein again.

步骤408、对所述文本信息进行翻译，得到对应的翻译结果。Step 408: Translate the text information to obtain a corresponding translation result.

步骤410、展示所述翻译结果。Step 410: Display the translation result.

在识别得到文本信息后，可以对文本信息进行翻译，确定对应的翻译结果。其中，可以确定目标语言，然后对文本信息进行翻译，得到目标语言对应的翻译结果。其中，所述目标语言可以是用户熟练掌握的语言。After the text information is identified, the text information can be translated to determine the corresponding translation result. The target language can be determined, and then the text information can be translated to obtain a translation result corresponding to the target language. Wherein, the target language may be a language that the user is proficient in.

本发明的一个可选实施例中，可以仅展示翻译结果，不展示文本信息；也可以同时展示翻译结果和文本信息；本发明实施例对此不作限制。In an optional embodiment of the present invention, only the translation result may be displayed without displaying the text information; the translation result and the text information may also be displayed at the same time; this is not limited in this embodiment of the present invention.

本发明的一个可选实施例中，所述翻译结果可以是图片翻译结果。其中，可以对所述文本信息进行翻译，确定对应的翻译文本信息后，基于所述翻译文本信息，生成图片翻译结果。其中，可以将目标图像与翻译文本信息进行合成，生成图片翻译结果。例如，可以将翻译文本信息，覆盖在目标图像中与该翻译文本信息对应的文本信息之上；又例如，可以将翻译文本信息，添加在目标图像中，与该翻译文本信息对应文本信息的关联位置；进而便于用户对照查看。例如，当目标图像是演示文稿的图像时，可以将每行文本对应的翻译文本信息，添加在该行文本与下一行/上一行文本之间的位置。例如，当目标图像是菜单的图像时，可以将翻译菜名，覆盖在目标图像中该翻译菜名对应的菜名之上。当然，也可以采用翻译文本信息，按照目标图像中文本的排版方式，生成一张新的图片，作为图片翻译结果。In an optional embodiment of the present invention, the translation result may be a picture translation result. The text information may be translated, and after the corresponding translated text information is determined, a picture translation result is generated based on the translated text information. Among them, the target image and the translated text information can be synthesized to generate a picture translation result. For example, the translated text information can be overlaid on the text information corresponding to the translated text information in the target image; for another example, the translated text information can be added in the target image, and the association with the text information corresponding to the translated text information location; thus, it is convenient for users to check and check. For example, when the target image is an image of a presentation, the translated text information corresponding to each line of text may be added at a position between the line of text and the next/previous line of text. For example, when the target image is an image of a menu, the translated dish name may be overlaid on the dish name corresponding to the translated dish name in the target image. Of course, it is also possible to use the translated text information to generate a new image according to the typesetting method of the text in the target image as the image translation result.

本发明实施例中，所述翻译结果还可以是文本翻译结果；即可以直接将翻译文本信息作为文本翻译结果。当翻译结果是文本翻译结果时，若同时展示翻译结果和文本信息，则可以将翻译结果和文本信息进行对照展示。In this embodiment of the present invention, the translation result may also be a text translation result; that is, the translated text information may be directly used as the text translation result. When the translation result is a text translation result, if the translation result and the text information are displayed at the same time, the translation result and the text information can be displayed in comparison.

本发明一个可选实施例中，所述的方法还包括：接收传输指令，所述传输指令包括以下至少一种：分享指令、转发指令和转存指令；将所述传输指令对应的数据，传输至其他设备；所述传输指令对应的数据包括以下至少一种：目标图像、文本信息和翻译结果。进而用户能够将目标图像、目标图像的文本信息，以及目标图像的翻译结果中的一种或多种，传输至其他设备中；便于用户在其他设备中使用将目标图像、目标图像的文本信息，以及目标图像的翻译结果。In an optional embodiment of the present invention, the method further includes: receiving a transmission instruction, where the transmission instruction includes at least one of the following: a sharing instruction, a forwarding instruction, and a dump instruction; to other devices; the data corresponding to the transmission instruction includes at least one of the following: target image, text information and translation result. Then the user can transmit one or more of the target image, the text information of the target image, and the translation result of the target image to other devices; it is convenient for the user to use the text information of the target image, the target image in other devices, and the translation result of the target image.

本发明的一个可选实施例中，当目标图像是在录音设备在录制目标音频数据过程中采集时，可以将目标图像与目标音频数据关联；实现将从多个维度记录数据的关联，便于用户后续同时使用记录的多个维度的数据，提高了用户体验。In an optional embodiment of the present invention, when the target image is collected during the recording of the target audio data by the recording device, the target image can be associated with the target audio data; the association of recorded data from multiple dimensions is realized, which is convenient for users Subsequent use of the recorded data of multiple dimensions at the same time improves user experience.

以下对如何在录制音频数据过程中，采集图像数据，以及如何将图像数据与音频数据关联进行说明。The following describes how to collect image data in the process of recording audio data, and how to associate image data with audio data.

在录音设备在录音过程中，接收图像采集指令。During the recording process, the recording device receives an image acquisition instruction.

本发明实施例中，当用户需要录音时，可以开启录音设备的录音功能，采用录音设备进行录音。在录音过程中，用户在需要记录其它维度的数据如图像资料，例如印刷资料、投屏图像等时，可以执行图像采集操作。待用户执行图像采集操作后，对应的录音设备可以接收到该图像采集操作对应的图像采集指令。In the embodiment of the present invention, when the user needs to record, the recording function of the recording device can be turned on, and the recording device is used for recording. During the recording process, users can perform image acquisition operations when they need to record data in other dimensions, such as image materials, such as printed materials, screen projection images, etc. After the user performs the image capture operation, the corresponding recording device can receive the image capture instruction corresponding to the image capture operation.

本发明的一个示例中，用户可以在录音设备中执行图像采集操作，对应的，可以录音设备可以根据接收到用户执行的图像采集操作，生成图像采集指令。In an example of the present invention, a user may perform an image capturing operation in a recording device, and correspondingly, the recording device may generate an image capturing instruction according to receiving an image capturing operation performed by the user.

本发明的一个示例中，当录音设备与其它设备连接时，用户也可以在其他设备的与该录音设备对应的应用程序中，执行图像采集设备。此时，可以由其他设备根据用户的图像采集操作，生成图像采集指令；然后将图像采集指令发送给录音设备。In an example of the present invention, when the recording device is connected to other devices, the user can also execute the image capturing device in the application program corresponding to the recording device of the other device. At this time, other devices may generate image capture instructions according to the user's image capture operation; and then send the image capture instructions to the recording device.

依据所述图像采集指令进行图像采集。Image acquisition is performed according to the image acquisition instruction.

然后录音设备可以根据图像采集指令，调用图像采集模块进行图像采集，得到图像数据。Then, the recording device can call the image acquisition module to perform image acquisition according to the image acquisition instruction to obtain image data.

在录音过程中，用户可以执行多次图像采集操作，对应的，录音设备可以接收到多次图像采集指令。录音设备可以在每接收到一次图像采集指令时，进行一次图像采集，得到对应的图像帧。During the recording process, the user may perform multiple image capturing operations, and correspondingly, the recording device may receive multiple image capturing instructions. The recording device may perform an image acquisition every time an image acquisition instruction is received to obtain a corresponding image frame.

将采集得到的图像数据与录音得到的音频数据进行关联并存储。The collected image data and the recorded audio data are associated and stored.

本发明实施例中，为了便于用户后续同时使用记录的多个维度的数据，在采集得到图像数据后，可以将采集得到的图像数据与录音得到的音频数据进行关联，并存储在录音设备中。其中，可以基于采集得到的图像数据的时间和录音得到的音频数据对应的时间，将图像数据和音频数据进行关联，本发明实施例对此不作限制。In this embodiment of the present invention, in order to facilitate the user to use the recorded data of multiple dimensions simultaneously, after the image data is collected, the collected image data can be associated with the recorded audio data and stored in the recording device. The image data and the audio data may be associated based on the time of the collected image data and the time corresponding to the recorded audio data, which is not limited in this embodiment of the present invention.

一个示例中，录音设备可以在每采集一个图像帧后，将该图像帧与录音过程中得到的与该图像帧对应的音频帧进行关联；进而实现将采集得到的图像数据与录音得到的音频数据进行关联。另一个示例中，录音设备可以将在每采集一个图像帧后存储在图像帧；并在录音结束后，将图像数据的每个图像帧与录音得到的音频数据中对应的音频帧进行关联。In an example, the recording device can associate the image frame with the audio frame corresponding to the image frame obtained in the recording process after collecting an image frame; and then realize the image data obtained by the collection and the audio data obtained by the recording. to associate. In another example, the recording device may store each image frame in the image frame; and after the recording ends, associate each image frame of the image data with the corresponding audio frame in the audio data obtained by the recording.

其中，将每个图像帧与对应音频帧进行关联的方式可以如下：针对所述图像数据中的目标图像帧，确定所述目标图像帧对应的目标时间戳；确定所述音频数据中时间戳与所述目标时间戳相同的目标音频帧；将所述目标图像帧和目标音频帧进行关联。Wherein, the way of associating each image frame with the corresponding audio frame may be as follows: for the target image frame in the image data, determine the target time stamp corresponding to the target image frame; The target audio frame with the same target time stamp; the target image frame and the target audio frame are associated.

其中，若录音设备是在每采集一个图像帧后，将该图像帧与录音过程中得到的与该图像帧对应的音频帧进行关联，则可以将每次采集的一个图像帧作为目标图像帧。若录音设备是在录音结束后，将图像数据的每个图像帧与录音得到的音频数据中对应的音频帧进行关联，则每次可以任意从图像数据中选取一图像帧作为目标图像帧，直到将图像数据中所有的图像帧与音频数据中对应的音频帧关联为止。Wherein, if the recording device associates the image frame with the audio frame corresponding to the image frame obtained in the recording process after each image frame is collected, one image frame collected each time can be used as the target image frame. If the recording device associates each image frame of the image data with the corresponding audio frame in the audio data obtained by the recording after the recording is completed, one image frame can be arbitrarily selected from the image data as the target image frame each time, until All the image frames in the image data are associated with the corresponding audio frames in the audio data.

本发明实施例中，针对一个目标图像帧，可以确定所述目标图像帧对应的目标时间戳，并从录音得到的音频数据中时间戳与所述目标时间戳相同的目标音频帧；然后将所述目标图像帧和目标音频帧进行关联。In the embodiment of the present invention, for a target image frame, a target time stamp corresponding to the target image frame may be determined, and a target audio frame with the same time stamp as the target time stamp in the audio data obtained from the recording; The target image frame and the target audio frame are associated.

进而当用户需要对目标音频数据进行语音识别时，可以结合该目标图像数据对该目标音频数据进行语音识别；从而通过结合与目标音频数据关联的信息，对所述目标音频数据进行语音识别，来提高语音识别的准确率。Then when the user needs to perform speech recognition on the target audio data, the target audio data can be speech recognized in combination with the target image data; thus, by combining the information associated with the target audio data, the target audio data is subjected to speech recognition, to Improve the accuracy of speech recognition.

参照图5、示出了本发明的再一种数据处理方法实施例的步骤流程图。Referring to FIG. 5 , a flow chart of the steps of another embodiment of a data processing method of the present invention is shown.

步骤502、录音设备获取目标图像。Step 502: The recording device acquires the target image.

步骤504、对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息。Step 504: Perform text recognition on the target image, determine corresponding text information, and determine the typesetting information of the text information in the target image.

步骤506、依据所述文本信息在目标图像中的排版信息，展示所述文本信息。Step 506: Display the text information according to the typesetting information of the text information in the target image.

步骤508、获取目标音频数据，所述目标音频数据与所述目标图像关联，所述目标图像是录音设备在录制目标音频数据过程中采集的。Step 508: Acquire target audio data, where the target audio data is associated with the target image, and the target image is collected by the recording device in the process of recording the target audio data.

步骤510、依据所述文本信息对所述目标音频数据进行语音识别，确定对应语音识别结果。Step 510: Perform speech recognition on the target audio data according to the text information, and determine a corresponding speech recognition result.

本发明实施例中，可以在录音设备录制目标音频数据的过程中，实时的对目标音频数据进行语音识别。其中，在录制目标音频数据的过程中，若录音设备获取到调用其中的图像采集模块采集的目标图像后，可以依据对目标图像识别出的文本信息，对在采集目标图像之后录制的目标音频数据进行语音识别，确定对应的语音识别结果。In this embodiment of the present invention, during the process of recording the target audio data by the recording device, speech recognition may be performed on the target audio data in real time. Wherein, in the process of recording the target audio data, if the recording device obtains the target image collected by the image acquisition module called therein, it can record the target audio data recorded after the target image is collected according to the text information identified on the target image. Perform speech recognition to determine the corresponding speech recognition result.

本发明实施例中，也可以是在录音设备录制目标音频数据后，对目标音频数据(即非实时的目标音频数据)进行语音识别。其中，可以依据在录制目标音频数据过程中，调用其中的图像采集模块采集的所有目标图像的文本信息，对目标音频数据进行语音识别，确定对应的语音识别结果；本发明实施例对此不作限制。In this embodiment of the present invention, after the recording device records the target audio data, speech recognition may also be performed on the target audio data (ie, the non-real-time target audio data). Wherein, according to the text information of all the target images collected by calling the image acquisition module in the process of recording the target audio data, the target audio data can be speech recognized, and the corresponding speech recognition result can be determined; the embodiment of the present invention does not limit this. .

其中，可以将该文本信息利用到对目标音频数据的语音识别过程中，来提高对目标音频数据的语音识别的准确率。Wherein, the text information can be used in the speech recognition process of the target audio data to improve the accuracy of the speech recognition of the target audio data.

当然，本发明实施例中，还可以接收针对目标音频数据和/或语音识别结果的传输指令，将所述目标音频数据和/或语音识别结果传输至其他设备中；本发明实施例对此不作限制。Of course, in this embodiment of the present invention, a transmission instruction for target audio data and/or speech recognition result may also be received, and the target audio data and/or speech recognition result are transmitted to other devices; this embodiment of the present invention does not make any limit.

综上，本发明实施例中，可以获取目标音频数据；然后依据目标图像的文本信息对所述目标音频数据进行语音识别，确定对应的语音识别结果；其中，所述目标图像是录音设备在录制目标音频数据过程中采集的，且所述目标音频数据与所述目标图像关联，进而通过结合与目标音频数据关联的信息，对所述目标音频数据进行语音识别，来提高语音识别的准确率。To sum up, in this embodiment of the present invention, the target audio data can be obtained; then, the target audio data is subjected to speech recognition according to the text information of the target image, and the corresponding speech recognition result is determined; The target audio data is collected in the process, and the target audio data is associated with the target image, and then the accuracy of speech recognition is improved by combining the information associated with the target audio data to perform speech recognition on the target audio data.

需要说明的是，对于方法实施例，为了简单描述，故将其都表述为一系列的动作组合，但是本领域技术人员应该知悉，本发明实施例并不受所描述的动作顺序的限制，因为依据本发明实施例，某些步骤可以采用其他顺序或者同时进行。其次，本领域技术人员也应该知悉，说明书中所描述的实施例均属于优选实施例，所涉及的动作并不一定是本发明实施例所必须的。It should be noted that, for the sake of simple description, the method embodiments are described as a series of action combinations, but those skilled in the art should know that the embodiments of the present invention are not limited by the described action sequences, because According to embodiments of the present invention, certain steps may be performed in other sequences or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.

本发明实施例还提供了一种数据处理装置，应用于录音设备中。The embodiment of the present invention also provides a data processing apparatus, which is applied to a recording device.

参照图6，示出了本发明的一种数据处理装置实施例的结构框图，具体可以包括如下模块：Referring to FIG. 6 , a structural block diagram of an embodiment of a data processing apparatus of the present invention is shown, which may specifically include the following modules:

图像获取模块602，用于获取目标图像；an image acquisition module 602, configured to acquire a target image;

文本识别模块604，用于对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；A text recognition module 604, configured to perform text recognition on the target image, determine corresponding text information and determine the typesetting information of the text information in the target image;

展示模块606，用于依据所述文本信息在目标图像中的排版信息，展示所述文本信息。The display module 606 is configured to display the text information according to the typesetting information of the text information in the target image.

参照图7，示出了本发明的一种数据处理装置可选实施例的结构框图。Referring to FIG. 7 , a structural block diagram of an optional embodiment of a data processing apparatus of the present invention is shown.

本发明一个可选的实施例中，所述文本信息包括多个文本，所述文本识别模块604，用于分别记录各文本在目标图像中对应的行信息，以及各文本所在行的位置信息；依据所述文本信息中各文本的行信息和位置信息，生成所述文本信息在所述目标图像中的排版信息。In an optional embodiment of the present invention, the text information includes a plurality of texts, and the text recognition module 604 is configured to respectively record the line information corresponding to each text in the target image, and the position information of the line where each text is located; According to the line information and position information of each text in the text information, the typesetting information of the text information in the target image is generated.

本发明一个可选的实施例中，所述展示模块606，包括：In an optional embodiment of the present invention, the display module 606 includes:

第一排版展示子模块6062，用于按照所述文本信息中各文本的的行信息和位置信息，对所述文本信息进行段落划分；分段落展示所述文本信息。The first typesetting display sub-module 6062 is configured to divide the text information into paragraphs according to the line information and position information of each text in the text information; and display the text information in paragraphs.

第二排版展示子模块6064，用于按照文本信息中各文本的行信息和位置信息，控制所述文本信息中各文本按照与所述文本在图像中相同的排版展示。The second typesetting display sub-module 6064 is configured to control each text in the text information to display according to the same typesetting as the text in the image according to the line information and position information of each text in the text information.

本发明一个可选的实施例中，所述的装置还包括：In an optional embodiment of the present invention, the device further includes:

翻译模块608，用于对所述文本信息进行翻译，得到对应的翻译结果并展示所述翻译结果。The translation module 608 is configured to translate the text information to obtain a corresponding translation result and display the translation result.

本发明一个可选的实施例中，所述翻译结果包括：图片翻译结果和/或文本翻译结果。In an optional embodiment of the present invention, the translation result includes: a picture translation result and/or a text translation result.

数据传输模块610，用于接收传输指令，所述传输指令包括以下至少一种：分享指令、转发指令和转存指令；将所述传输指令对应的数据，传输至其他设备；所述传输指令对应的数据包括以下至少一种：目标图像、文本信息和翻译结果。The data transmission module 610 is configured to receive transmission instructions, the transmission instructions include at least one of the following: a sharing instruction, a forwarding instruction, and a dump instruction; data corresponding to the transmission instruction is transmitted to other devices; the transmission instruction corresponds to The data includes at least one of the following: target image, text information and translation result.

语音识别模块612，用于获取目标音频数据，所述目标音频数据与所述目标图像关联，所述目标图像是录音设备在录制目标音频数据过程中采集的；依据所述文本信息对所述目标音频数据进行语音识别，确定对应语音识别结果。The speech recognition module 612 is configured to acquire target audio data, the target audio data is associated with the target image, and the target image is collected by the recording device in the process of recording the target audio data; The audio data is subjected to speech recognition, and the corresponding speech recognition result is determined.

对于装置实施例而言，由于其与方法实施例基本相似，所以描述的比较简单，相关之处参见方法实施例的部分说明即可。As for the apparatus embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and reference may be made to the partial description of the method embodiment for related parts.

图8是根据一示例性实施例示出的一种用于数据处理的录音设备800的结构框图。例如，录音设备800可以是录音笔、翻译笔、翻译机等等。FIG. 8 is a structural block diagram of a recording device 800 for data processing according to an exemplary embodiment. For example, the recording device 800 may be a voice recorder, a translator pen, a translator, or the like.

参照图8，录音设备800可以包括以下一个或多个组件：处理组件802，存储器804，电力组件806，多媒体组件808，音频组件810，输入/输出(I/O)的接口812，传感器组件814，以及通信组件816。8, the recording device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814 , and the communication component 816 .

处理组件802通常控制录音设备800的整体操作，诸如与显示，电话呼叫，数据通信，相机操作和记录操作相关联的操作。处理元件802可以包括一个或多个处理器820来执行指令，以完成上述的方法的全部或部分步骤。此外，处理组件802可以包括一个或多个模块，便于处理组件802和其他组件之间的交互。例如，处理部件802可以包括多媒体模块，以方便多媒体组件808和处理组件802之间的交互。The processing component 802 generally controls the overall operations of the recording device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing element 802 may include one or more processors 820 to execute instructions to perform all or part of the steps of the methods described above. Additionally, processing component 802 may include one or more modules that facilitate interaction between processing component 802 and other components. For example, processing component 802 may include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802.

存储器804被配置为存储各种类型的数据以支持在录音设备800的操作。这些数据的示例包括用于在录音设备800上操作的任何应用程序或方法的指令，联系人数据，电话簿数据，消息，图片，视频等。存储器804可以由任何类型的易失性或非易失性存储设备或者它们的组合实现，如静态随机存取存储器(SRAM)，电可擦除可编程只读存储器(EEPROM)，可擦除可编程只读存储器(EPROM)，可编程只读存储器(PROM)，只读存储器(ROM)，磁存储器，快闪存储器，磁盘或光盘。Memory 804 is configured to store various types of data to support operation of recording device 800 . Examples of such data include instructions for any application or method operating on the recording device 800, contact data, phonebook data, messages, pictures, videos, and the like. Memory 804 may be implemented by any type of volatile or nonvolatile storage device or combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.

电力组件806为录音设备800的各种组件提供电力。电力组件806可以包括电源管理系统，一个或多个电源，及其他与为录音设备800生成、管理和分配电力相关联的组件。Power component 806 provides power to the various components of recording device 800 . Power components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to recording device 800 .

多媒体组件808包括在所述录音设备800和用户之间的提供一个输出接口的屏幕。在一些实施例中，屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板，屏幕可以被实现为触摸屏，以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界，而且还检测与所述触摸或滑动操作相关的持续时间和压力。在一些实施例中，多媒体组件808包括一个前置摄像头和/或后置摄像头。当录音设备800处于操作模式，如拍摄模式或视频模式时，前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。Multimedia component 808 includes a screen that provides an output interface between the recording device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user. The touch panel includes one or more touch sensors to sense touch, swipe, and gestures on the touch panel. The touch sensor may not only sense the boundaries of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe action. In some embodiments, the multimedia component 808 includes a front-facing camera and/or a rear-facing camera. When the recording device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each of the front and rear cameras can be a fixed optical lens system or have focal length and optical zoom capability.

音频组件810被配置为输出和/或输入音频信号。例如，音频组件810包括一个麦克风(MIC)，当录音设备800处于操作模式，如呼叫模式、记录模式和语音识别模式时，麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器804或经由通信组件816发送。在一些实施例中，音频组件810还包括一个扬声器，用于输出音频信号。Audio component 810 is configured to output and/or input audio signals. For example, audio component 810 includes a microphone (MIC) that is configured to receive external audio signals when recording device 800 is in operating modes, such as call mode, recording mode, and voice recognition mode. The received audio signal may be further stored in memory 804 or transmitted via communication component 816 . In some embodiments, audio component 810 also includes a speaker for outputting audio signals.

I/O接口812为处理组件802和外围接口模块之间提供接口，上述外围接口模块可以是键盘，点击轮，按钮等。这些按钮可包括但不限于：主页按钮、音量按钮、启动按钮和锁定按钮。The I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to: home button, volume buttons, start button, and lock button.

传感器组件814包括一个或多个传感器，用于为录音设备800提供各个方面的状态评估。例如，传感器组件814可以检测到录音设备800的打开/关闭状态，组件的相对定位，例如所述组件为录音设备800的显示器和小键盘，传感器组件814还可以检测录音设备800或录音设备800一个组件的位置改变，用户与录音设备800接触的存在或不存在，录音设备800方位或加速/减速和录音设备800的温度变化。传感器组件814可以包括接近传感器，被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件814还可以包括光传感器，如CMOS或CCD图像传感器，用于在成像应用中使用。在一些实施例中，该传感器组件814还可以包括加速度传感器，陀螺仪传感器，磁传感器，压力传感器或温度传感器。Sensor assembly 814 includes one or more sensors for providing status assessments of various aspects of recording device 800 . For example, the sensor component 814 can detect the on/off state of the recording device 800, the relative positioning of the components, such as the display and the keypad of the recording device 800, the sensor component 814 can also detect the recording device 800 or a recording device 800. The position of the components changes, the presence or absence of user contact with the recording device 800, the orientation or acceleration/deceleration of the recording device 800, and the temperature of the recording device 800 changes. Sensor assembly 814 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact. Sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

通信组件816被配置为便于录音设备800和其他设备之间有线或无线方式的通信。录音设备800可以接入基于通信标准的无线网络，如WiFi，2G或3G，或它们的组合。在一个示例性实施例中，通信部件814经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中，所述通信部件814还包括近场通信(NFC)模块，以促进短程通信。例如，在NFC模块可基于射频识别(RFID)技术，红外数据协会(IrDA)技术，超宽带(UWB)技术，蓝牙(BT)技术和其他技术来实现。Communication component 816 is configured to facilitate wired or wireless communication between recording device 800 and other devices. The recording device 800 can access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In one exemplary embodiment, the communication component 814 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 814 also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

在示例性实施例中，录音设备800可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现，用于执行上述方法。In an exemplary embodiment, recording device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A programmed gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation is used to perform the above method.

在示例性实施例中，还提供了一种包括指令的非临时性计算机可读存储介质，例如包括指令的存储器804，上述指令可由录音设备800的处理器820执行以完成上述方法。例如，所述非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium including instructions, such as a memory 804 including instructions, executable by the processor 820 of the recording device 800 to perform the above method. For example, the non-transitory computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.

一种非临时性计算机可读存储介质，当所述存储介质中的指令由录音设备的处理器执行时，使得录音设备能够执行一种数据处理方法，所述方法包括：录音设备获取目标图像；对所述目标图像进行文本识别，确定对应的文本信息和确定所述文本信息在所述目标图像中的排版信息；依据所述文本信息在目标图像中的排版信息，展示所述文本信息。A non-transitory computer-readable storage medium, when an instruction in the storage medium is executed by a processor of a recording device, the recording device can execute a data processing method, the method comprising: acquiring a target image by the recording device; Perform text recognition on the target image, determine corresponding text information and determine the typesetting information of the text information in the target image; display the text information according to the typesetting information of the text information in the target image.

本说明书中的各个实施例均采用递进的方式描述，每个实施例重点说明的都是与其他实施例的不同之处，各个实施例之间相同相似的部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments may be referred to each other.

本发明实施例是参照根据本发明实施例的方法、终端设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器，使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。Embodiments of the present invention are described with reference to flowcharts and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the present invention. It will be understood that each process and/or block in the flowchart illustrations and/or block diagrams, and combinations of processes and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing terminal equipment to produce a machine that causes the instructions to be executed by the processor of the computer or other programmable data processing terminal equipment Means are created for implementing the functions specified in the flow or flows of the flowcharts and/or the blocks or blocks of the block diagrams.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中，使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品，该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer readable memory capable of directing a computer or other programmable data processing terminal equipment to operate in a particular manner, such that the instructions stored in the computer readable memory result in an article of manufacture comprising instruction means, the The instruction means implement the functions specified in the flow or flow of the flowcharts and/or the block or blocks of the block diagrams.

这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上，使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理，从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing terminal equipment, so that a series of operational steps are performed on the computer or other programmable terminal equipment to produce a computer-implemented process, thereby executing on the computer or other programmable terminal equipment The instructions executed on the above provide steps for implementing the functions specified in the flowchart or blocks and/or the block or blocks of the block diagrams.

尽管已描述了本发明实施例的优选实施例，但本领域内的技术人员一旦得知了基本创造性概念，则可对这些实施例做出另外的变更和修改。所以，所附权利要求意欲解释为包括优选实施例以及落入本发明实施例范围的所有变更和修改。Although preferred embodiments of the embodiments of the present invention have been described, additional changes and modifications to these embodiments may be made by those skilled in the art once the basic inventive concepts are known. Therefore, the appended claims are intended to be construed to include the preferred embodiment as well as all changes and modifications that fall within the scope of the embodiments of the present invention.

最后，还需要说明的是，在本文中，诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来，而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。Finally, it should also be noted that in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply these entities or there is any such actual relationship or sequence between operations. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion such that a process, method, article or terminal device that includes a list of elements includes not only those elements, but also a non-exclusive list of elements. other elements, or also include elements inherent to such a process, method, article or terminal equipment. Without further limitation, an element defined by the phrase "comprises a..." does not preclude the presence of additional identical elements in the process, method, article, or terminal device that includes the element.

以上对本发明所提供的一种数据处理方法、一种数据处理装置和一种录音设备，进行了详细介绍，本文中应用了具体个例对本发明的原理及实施方式进行了阐述，以上实施例的说明只是用于帮助理解本发明的方法及其核心思想；同时，对于本领域的一般技术人员，依据本发明的思想，在具体实施方式及应用范围上均会有改变之处，综上所述，本说明书内容不应理解为对本发明的限制。A data processing method, a data processing device and a recording device provided by the present invention have been described in detail above. Specific examples are used in this paper to illustrate the principles and implementations of the present invention. The description is only used to help understand the method of the present invention and its core idea; at the same time, for those skilled in the art, according to the idea of the present invention, there will be changes in the specific embodiments and application scope. , the contents of this specification should not be construed as limiting the invention.

Claims

1. a data processing method, is characterized in that, comprises:

The recording device obtains the target image;

Perform text recognition on the target image, determine the corresponding text information and determine the typesetting information of the text information in the target image;

The text information is displayed according to the typesetting information of the text information in the target image.

2. The method according to claim 1, wherein the text information comprises a plurality of texts, and the determining the typesetting information of the text information in the target image comprises:

Record the line information corresponding to each text in the target image in the text information, and the position information of the line where each text is located;

According to the line information and position information of each text in the text information, the typesetting information of the text information in the target image is generated.

3. The method according to claim 2, wherein the displaying the text information according to the typesetting information of the text information in the target image comprises:

According to the line information and position information of each text in the text information, the text information is divided into paragraphs;

The textual information is presented in paragraphs.

4. The method according to claim 2, wherein the displaying the text information according to the typesetting information of the text information in the target image comprises:

According to the line information and position information of each text in the text information, each text in the text information is controlled to be displayed according to the same typesetting as the text in the image.

5. The method according to claim 1, wherein the method further comprises:

Translate the text information to obtain a corresponding translation result and display the translation result.

6. The method according to claim 5, wherein the translation result comprises: a picture translation result and/or a text translation result.

7. The method according to claim 5, wherein the method further comprises:

receiving a transmission instruction, where the transmission instruction includes at least one of the following: a sharing instruction, a forwarding instruction, and a dumping instruction;

The data corresponding to the transmission instruction is transmitted to other devices; the data corresponding to the transmission instruction includes at least one of the following: target image, text information and translation result.

8. A data processing device, characterized in that, applied in a recording device, comprising:

The image acquisition module is used to acquire the target image;

a text recognition module, configured to perform text recognition on the target image, determine corresponding text information and determine the typesetting information of the text information in the target image;

The display module is configured to display the text information according to the typesetting information of the text information in the target image.

9. A recording device comprising a memory and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by one or more processors. More than one program contains instructions to:

Get the target image;

10. A readable storage medium, characterized in that, when the instructions in the storage medium are executed by a processor of a recording device, the recording device is enabled to execute the data processing method according to any one of method claims 1-7 .