CN108366182B - Calibration method and device for synchronous broadcast of text voice and computer storage medium - Google Patents

Publication number
CN108366182B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810150541.0A
Other languages
Chinese (zh)
Other versions
CN108366182A (en)
Inventor
武赢
杨珣
李月
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BOE Technology Group Co Ltd
Original Assignee
BOE Technology Group Co Ltd
Application filed by BOE Technology Group Co Ltd
Priority to CN201810150541.0A
Publication of CN108366182A
Application granted
Publication of CN108366182B
Legal status: Active

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00: Details of television systems
    • H04N5/222: Studio circuitry; Studio devices; Studio equipment
    • H04N5/262: Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/278: Subtitling
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47: End-user applications
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47: End-user applications
    • H04N21/488: Data services, e.g. news ticker
    • H04N21/4884: Data services, e.g. news ticker for displaying subtitles
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00: Details of television systems
    • H04N5/04: Synchronising
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00: Details of television systems
    • H04N5/222: Studio circuitry; Studio devices; Studio equipment
    • H04N5/262: Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Controls And Circuits For Display Device (AREA)
  • User Interface Of Digital Computer (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Embodiments of the invention provide a calibration method and device for synchronous broadcast of text and speech, and a computer storage medium. The calibration method comprises the following steps: a text display unit receives a speech recognition string sent by a voice broadcast unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the text display unit derives line-break information for each line from the characters displayed in that line and sends the line-break information to the voice broadcast unit; the text display unit receives a broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line; the text display unit determines the display line number currently scrolled to, compares it with the broadcast line number to determine an error value, and adjusts the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold.

Description

Calibration method and device for synchronous broadcast of text and speech, and computer storage medium

Technical field

The present invention relates to the technical field of intelligent systems, and in particular to a calibration method and device for synchronous broadcast of text and speech, and a computer storage medium.

Background

Most prior-art solutions for synchronous broadcast of text and speech rely on preset timestamps, as in movie subtitles and dynamically scrolling lyrics. These solutions assume that the broadcast content and the broadcast duration are known in advance.

However, in scenarios such as random voice question answering with synchronized text and speech playback, the broadcast content and duration are not fixed, so the preset-timestamp approach does not apply, and a good synchronized text-and-speech presentation cannot be offered to the user.

In summary, prior-art synchronous broadcast of text and speech gives a poor user experience.

Summary of the invention

In view of this, the present invention aims to provide a calibration method and device for synchronous broadcast of text and speech, and a computer storage medium, so as to improve the user experience.

To achieve the above object, the present invention provides the following technical solutions:

A calibration method for synchronous broadcast of text and speech, comprising:

a text display unit receives a speech recognition string sent by a voice broadcast unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed;

the text display unit derives, from the characters displayed in each line, the line-break information corresponding to each line, and sends the line-break information to the voice broadcast unit;

the text display unit receives a broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line;

the text display unit determines the display line number currently scrolled to, compares the display line number with the broadcast line number to determine an error value, and, when the error value exceeds a preset threshold, adjusts the scrolling speed of the characters displayed in each line.

Preferably, the text display unit receiving the speech recognition string sent by the voice broadcast unit and determining the characters displayed in each line comprises:

the text display unit receives the speech recognition string sent by the voice broadcast unit and determines the characters displayed in each line according to the width and height of the text display unit.

Preferably, the text display unit determining the display line number currently scrolled to comprises:

the text display unit determines the display line number currently scrolled to according to the initial line number, the text height in pixels, and the preset scrolling speed.

Preferably, comparing the display line number with the broadcast line number to determine the error value comprises:

subtracting the broadcast line number from the display line number and using the difference as the error value.

Preferably, adjusting the scrolling speed of the characters displayed in each line when the error value exceeds the preset threshold comprises:

when the difference between the display line number and the broadcast line number is greater than zero, reducing the scrolling speed of the characters;

and, when the difference between the display line number and the broadcast line number is less than zero, increasing the scrolling speed of the characters.

A calibration method for synchronous broadcast of text and speech, comprising:

a voice broadcast unit sends an acquired speech recognition string to a text display unit;

the voice broadcast unit receives line-break information sent by the text display unit and, when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line, sends a broadcast line number to the text display unit.

Preferably, after the voice broadcast unit sends the acquired speech recognition string to the text display unit and before the voice broadcast unit receives the line-break information sent by the text display unit, the method further comprises:

the voice broadcast unit starts broadcasting the acquired speech recognition string after a preset time.

A calibration device for synchronous broadcast of text and speech, comprising:

a character scrolling module, configured to receive the speech recognition string sent by the voice broadcast unit, determine the characters displayed in each line, and automatically scroll the characters displayed in each line at a preset scrolling speed;

a line-break information determination module, configured to derive the line-break information corresponding to each line from the characters displayed in each line, and to send the line-break information to the voice broadcast unit;

an information receiving module, configured to receive the broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line;

a synchronization calibration module, configured to determine the display line number currently scrolled to, compare the display line number with the broadcast line number to determine an error value, and, when the error value exceeds a preset threshold, adjust the scrolling speed of the characters displayed in each line.

A calibration device for synchronous broadcast of text and speech, comprising:

a string sending module, configured to send the acquired speech recognition string to the text display unit;

a line-break feedback information determination module, configured to receive the line-break information sent by the text display unit and, when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line, send the broadcast line number to the text display unit.

A computer storage medium storing a program corresponding to the above calibration method.

Compared with the prior art, the solution of the present invention has the following beneficial effects:

The calibration method for synchronous broadcast of text and speech provided by the embodiments of the present invention comprises: the text display unit receives the speech recognition string sent by the voice broadcast unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the text display unit derives the line-break information corresponding to each line from the characters displayed in each line and sends the line-break information to the voice broadcast unit; the text display unit receives the broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line; the text display unit determines the display line number currently scrolled to, compares it with the broadcast line number to determine an error value, and, when the error value exceeds a preset threshold, adjusts the scrolling speed of the characters displayed in each line. Because the method compares the display line number with the broadcast line number and adjusts the scrolling speed whenever the error value exceeds the preset threshold, it can correct the accumulated delay from drift errors caused by long playback, thereby improving the user experience.

Additional aspects and advantages of the present invention will be given in part in the following description, and will in part become apparent from that description or be learned through practice of the present invention.

Description of the drawings

The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:

FIG. 1 is a schematic diagram of one complete synchronous text-and-speech broadcast process according to an embodiment of the present invention;

FIG. 2 is a flowchart of a calibration method for synchronous broadcast of text and speech provided by an embodiment of the present invention;

FIG. 3 is a flowchart of another calibration method for synchronous broadcast of text and speech provided by an embodiment of the present invention;

FIG. 4 is a structural block diagram of a calibration device for synchronous broadcast of text and speech provided by an embodiment of the present invention;

FIG. 5 is a structural block diagram of another calibration device for synchronous broadcast of text and speech provided by an embodiment of the present invention.

Detailed description

Embodiments of the present invention are described in detail below, and examples of the embodiments are illustrated in the accompanying drawings, in which the same or similar reference numerals denote the same or similar elements, or elements having the same or similar functions, throughout. The embodiments described below with reference to the drawings are exemplary; they serve only to explain the present invention and are not to be construed as limiting it.

Those skilled in the art will understand that, unless expressly stated otherwise, the singular forms "a", "an", "said" and "the" used herein may also include the plural forms. It should be further understood that the word "comprising" used in the description of the present invention indicates the presence of the stated features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groups thereof. It should be understood that when an element is said to be "connected" or "coupled" to another element, it may be directly connected or coupled to the other element, or intervening elements may be present. Furthermore, "connected" or "coupled" as used herein may include a wireless connection or wireless coupling. The term "and/or" as used herein includes any and all combinations of one or more of the associated listed items.

Those skilled in the art will understand that, unless otherwise defined, all terms used herein (including technical and scientific terms) have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It should also be understood that terms such as those defined in general-purpose dictionaries should be interpreted as having a meaning consistent with their meaning in the context of the prior art, and, unless specifically defined as herein, are not to be interpreted in an idealized or overly formal sense.

The sources of error in the synchronous text-and-speech broadcast process are introduced first.

In the Android system there is a text display APK (Android Package) and a voice broadcast APK. One complete synchronous text-and-speech broadcast process is shown in FIG. 1.

As shown in FIG. 1, after the voice broadcast APK obtains the speech recognition string (String) returned from the cloud, it sends the String to the text display APK at time T0, and the text display APK receives it at time T1. Based on the width and height of the current device and on the display parameters, the text display APK obtains the total number of characters, the number of display lines, and the percentage of the total characters accounted for by each line, and sends a line-break array at time T3. The line-break array contains the percentage of the total characters accounted for by each line.

As shown in FIG. 1, the voice broadcast APK receives the line-break array at time T4 and compares it with the broadcast progress reported by its own broadcast function. When the broadcast progress reaches a value in the line-break array of the text display APK, the voice broadcast APK sends the broadcast line number to the text display APK at time T5. The text display APK receives the broadcast line number at time T6 and completes the line-break display at time T7.

The process above shows that there are many delays, both between the voice broadcast APK sending the String and the text display APK displaying it, and between the voice broadcast APK sending the broadcast line number and the text display APK actually completing the line-break display. These delays greatly affect the accuracy of the synchronous broadcast and the user experience.

The inventors of the present application found that two main factors affect the accuracy of the synchronous text-and-speech broadcast process. First, the broadcast delays that exist before the voice broadcast APK and the text display APK leave the factory (for example, the function-call delay between the voice broadcast APK and the text display APK, the function delay of the text display APK, and the function delay of the voice broadcast APK). Second, the accumulated delay from drift errors caused by long playback in actual use.

A specific embodiment of the present invention addresses the broadcast delays that exist before the voice broadcast APK and the text display APK leave the factory as follows:

Because the voice broadcast APK and the text display APK run in the same system, they share one clock. The main delays in this process are:

Propagation instruction delay: $T_{ds} = T_1 - T_0 = T_4 - T_3 = T_6 - T_5$

Display instruction delay: $T_{DspAc} = T_3 - T_1$

Line-break instruction delay: $T_{LineAc} = T_7 - T_6$

The instruction delays above differ slightly from one occurrence to the next, but the differences are small, so the delays can be treated as approximately equal.

The delays are measured as follows:

A specific embodiment of the present invention may use averaging over repeated measurements: before the software versions of the voice broadcast APK and the text display APK leave the factory, the delays are measured multiple times in advance and the measured values are averaged. During measurement, a timestamp (obtained from the system time function) is added to each sent instruction, which gives the absolute sending time of each instruction and hence each delay.
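The averaging step can be sketched as follows. This is only an illustration under stated assumptions: the function name `average_delays`, the dictionary layout of a measurement run, the millisecond unit, and the sample timestamps are all hypothetical and do not come from the patent; only the three delay formulas do.

```python
from statistics import mean


def average_delays(runs):
    """Average the three instruction delays over several measurement runs.

    Each run is a dict of timestamps T0..T7 taken from the shared system
    clock (hypothetical layout; milliseconds are assumed).
    """
    propagation, display, line_break = [], [], []
    for t in runs:
        # T_ds is observed three times per run: T1-T0, T4-T3 and T6-T5.
        propagation += [t["T1"] - t["T0"], t["T4"] - t["T3"], t["T6"] - t["T5"]]
        # T_DspAc = T3 - T1
        display.append(t["T3"] - t["T1"])
        # T_LineAc = T7 - T6
        line_break.append(t["T7"] - t["T6"])
    return {
        "T_ds": mean(propagation),
        "T_DspAc": mean(display),
        "T_LineAc": mean(line_break),
    }


# Made-up sample timestamps for two runs, for illustration only.
runs = [
    {"T0": 0, "T1": 2, "T3": 12, "T4": 14, "T5": 500, "T6": 502, "T7": 506},
    {"T0": 0, "T1": 4, "T3": 16, "T4": 20, "T5": 600, "T6": 604, "T7": 610},
]
delays = average_delays(runs)
```

With these sample values the averaged delays are 3 for `T_ds`, 11 for `T_DspAc` and 5 for `T_LineAc`.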

With the pre-factory broadcast delays calibrated, the embodiments of the present invention focus on how to correct the accumulated delay from drift errors caused by long playback, so as to improve the user experience.

The technical solutions of the embodiments of the present invention are described below with reference to the accompanying drawings.

As shown in FIG. 2, a specific embodiment of the present invention provides a calibration method for synchronous broadcast of text and speech, comprising:

S201: the text display unit receives the speech recognition string sent by the voice broadcast unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed;

S202: the text display unit derives the line-break information corresponding to each line from the characters displayed in each line, and sends the line-break information to the voice broadcast unit;

S203: the text display unit receives the broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line;

S204: the text display unit determines the display line number currently scrolled to, compares the display line number with the broadcast line number to determine an error value, and, when the error value exceeds a preset threshold, adjusts the scrolling speed of the characters displayed in each line.

Because the calibration method of steps S201 to S204 compares the display line number with the broadcast line number to determine an error value, and adjusts the scrolling speed of the characters displayed in each line when the error value exceeds the preset threshold, it can correct the accumulated delay from drift errors caused by long playback, thereby improving the user experience.

Preferably, in step S201, the text display unit receiving the speech recognition string sent by the voice broadcast unit and determining the characters displayed in each line comprises: the text display unit receives the speech recognition string sent by the voice broadcast unit and determines the characters displayed in each line according to the width and height of the text display unit.

Specifically, suppose the speech recognition string received by the text display unit contains 100 characters. The text display unit calculates in real time, from its width and height, how many characters each line can display. If each line displays at most 30 characters (the figure differs between text display units), the 100 characters occupy four lines in total.
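The 100-character example can be reproduced with a minimal sketch. It assumes a fixed number of characters per line, which is a simplification: a real text display unit would measure glyph widths against its own width and height. The function name `split_into_lines` is hypothetical.

```python
def split_into_lines(text, chars_per_line):
    """Split text into display lines holding at most chars_per_line characters.

    Assumes every character has the same width (a simplification made for
    this sketch, not a statement from the patent).
    """
    return [text[i:i + chars_per_line] for i in range(0, len(text), chars_per_line)]


# A placeholder 100-character string, wrapped at 30 characters per line.
lines = split_into_lines("x" * 100, 30)
line_lengths = [len(line) for line in lines]
```

Here `line_lengths` holds 30, 30, 30 and 10, matching the four lines described above.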

Regarding the line-break information in step S202, a specific embodiment of the present invention uses a line-break array as an example. Continuing the example from the previous paragraph, the text display unit can derive from the characters displayed in each line the line-break array LinePrecent[3]={30%,60%,90%}. Each value in LinePrecent[3] is a percentage: 30% means the first line displays 30 characters, 60% means the first and second lines together display 60 characters, and 90% means the first, second and third lines together display 90 characters. Because the last line has no line break after it, LinePrecent[3] contains only the percentage values at which the first three lines need to break.
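One way to derive such a cumulative line-break array from per-line character counts is sketched below. The function name `line_break_percentages` is hypothetical, and representing the percentages as floats is an assumption of this sketch; the patent only gives the resulting values.

```python
def line_break_percentages(line_lengths):
    """Return the cumulative percentage of characters shown at each line break.

    The last line is excluded because no line break follows it.
    """
    total = sum(line_lengths)
    if total == 0:
        return []
    breaks = []
    shown = 0
    for length in line_lengths[:-1]:
        shown += length
        breaks.append(100 * shown / total)
    return breaks


# Four lines of 30, 30, 30 and 10 characters, as in the example above.
line_precent = line_break_percentages([30, 30, 30, 10])
```

For this input the array is 30.0, 60.0 and 90.0, which corresponds to LinePrecent[3]={30%,60%,90%}.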

Preferably, in step S204, the text display unit determining the display line number currently scrolled to comprises: the text display unit determines the display line number currently scrolled to according to the initial line number, the text height in pixels, and the preset scrolling speed.

Specifically, in a specific embodiment of the present invention the text display unit holds a variable LineNum that records the display line number currently scrolled to. Its initial value is 1, and it is incremented by 1 when the display scrolls to the second line. The line scrolled to can be determined from the text height in pixels and the preset scrolling speed. For example, if the text height is 20 pixels and the preset scrolling speed is 4 px per scroll instruction, one line has been scrolled after five instructions. The unit of scrolling speed is pixels per instruction.
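The LineNum bookkeeping can be sketched as below. The function name `displayed_line` and its parameter names are hypothetical, and the sketch assumes the scrolling speed stays constant across all instructions; once the speed is adjusted during calibration, an implementation would accumulate the scrolled pixels instruction by instruction instead.

```python
def displayed_line(scroll_steps, text_height_px=20, speed_px_per_step=4, initial_line=1):
    """Return the display line number reached after scroll_steps instructions.

    Assumes a constant scrolling speed (pixels per instruction) for the whole
    run, which is a simplification made for this sketch.
    """
    scrolled_px = scroll_steps * speed_px_per_step
    # Each full text height scrolled advances the display by one line.
    return initial_line + scrolled_px // text_height_px


line_after_four_steps = displayed_line(4)   # 16 px scrolled, still on line 1
line_after_five_steps = displayed_line(5)   # 20 px scrolled, now on line 2
```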

较佳地,上述步骤S204中将显示行数与播报行数进行比较,确定误差值,包括:将显示行数与播报行数作差,将该差值作为误差值。Preferably, in the above step S204, the number of displayed lines and the number of broadcast lines are compared to determine the error value, including: taking the difference between the number of displayed lines and the number of broadcast lines, and using the difference as the error value.

具体地,本发明若显示行数与播报行数作差后的差值大于零,则表示文字显示单元实际换行的速度大于语音播报单元的播报速度;若显示行数与播报行数作差后的差值小于零,则表示文字显示单元实际换行的速度小于语音播报单元的播报速度。随着播报时间的延长,文字显示单元实际换行的速度与语音播报单元的播报速度的差值会越来越大,这样会导致文字显示与语音播放的不同步。Specifically, in the present invention, if the difference between the number of displayed lines and the number of broadcast lines is greater than zero, it means that the actual line wrapping speed of the text display unit is greater than the broadcast speed of the voice broadcast unit; If the difference is less than zero, it means that the actual line wrapping speed of the text display unit is lower than the broadcasting speed of the voice broadcasting unit. With the extension of the broadcast time, the difference between the actual line wrapping speed of the text display unit and the broadcast speed of the voice broadcast unit will become larger and larger, which will cause the text display and voice playback to be out of sync.

This embodiment assumes that the broadcast speed of the voice broadcast unit is fixed. To keep the text display synchronized with the voice playback, the text display unit therefore adjusts the scrolling speed of the characters displayed in each line.

Preferably, adjusting the scrolling speed of the characters displayed in each line when the error value exceeds the preset threshold in step S204 includes: reducing the scrolling speed when the display line number minus the broadcast line number is greater than zero; and increasing the scrolling speed when that difference is less than zero. In this way, even if the initial scrolling speed is set too high or too low, after a period of adjustment the scrolling speed converges to and stabilizes within a small interval, and the text display stays synchronized with the voice playback.
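The adjustment rule can be sketched as below. The patent does not say by how much the speed changes or how the threshold is applied, so the fixed `step`, the lower bound `min_speed`, and the reading of "exceeds the preset threshold" as an absolute-value comparison are all assumptions made for illustration; `adjust_scroll_speed` is a hypothetical name.

```python
def adjust_scroll_speed(display_line: int,
                        broadcast_line: int,
                        speed: float,
                        threshold: int = 0,
                        step: float = 0.5,
                        min_speed: float = 0.5) -> float:
    """Return the new scrolling speed in pixels per scroll instruction.

    Assumptions (not stated in the source): the threshold is compared with
    the absolute error, the speed changes by a fixed step, and the speed
    never drops below min_speed.
    """
    error = display_line - broadcast_line
    if abs(error) <= threshold:
        return speed                        # within tolerance: no change
    if error > 0:
        return max(min_speed, speed - step)  # display is ahead: slow down
    return speed + step                      # display is behind: speed up


print(adjust_scroll_speed(display_line=3, broadcast_line=2, speed=4.0))
```

A fixed step keeps the sketch simple; a step proportional to the error would be another reasonable choice, but the source does not specify either.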

Based on the same inventive concept, a specific embodiment of the present invention further provides a calibration method for synchronous text and voice broadcast. As shown in FIG. 3, the method includes:

S301: the voice broadcast unit sends the acquired speech recognition string to the text display unit;

S302: the voice broadcast unit receives the line-break information sent by the text display unit and, each time its actual broadcast progress reaches the line-break information corresponding to a line, sends the broadcast line number to the text display unit.

Preferably, in this embodiment, after the voice broadcast unit sends the acquired speech recognition string to the text display unit and before it receives the line-break information sent by the text display unit, the method further includes: the voice broadcast unit starts broadcasting the acquired speech recognition string after a preset time.

Specifically, as shown in FIG. 1, the preset time in this embodiment is T2-T0. Starting the broadcast only after this preset time compensates for the instruction delay error.

The calibration method for synchronous text and voice broadcast provided by the specific embodiments of the present invention is described in detail below with reference to a specific example.

In this embodiment, text and voice are synchronized line by line; the line is the basic unit of synchronization.

Assume that the voice broadcast speed is fixed at one character per second.

This embodiment takes random voice question answering as an example. The questioner asks: "Please introduce XXX company?" The voice broadcast unit obtains the speech recognition string corresponding to the question. Assume that this string, denoted string, contains 100 characters including punctuation (string=100). The voice broadcast unit sends string to the text display unit at time T0 and starts broadcasting once the preset time T2-T0 has elapsed, as shown in FIG. 1.

As shown in FIG. 1, after receiving string, the text display unit starts scrolling the subtitles automatically at a suitable random speed, speed, whose unit is pixels per scroll instruction. At the same time, the text display unit calculates in real time, from its own width and height, which characters each line can display. Assuming each line displays at most 30 characters (this differs between text display units), the 100 characters occupy four lines in total.

As shown in FIG. 1, this embodiment uses a line-break array as the line-break information. From the characters displayed in each line, the text display unit derives the line-break array LinePrecent[3]={30%, 60%, 90%}, where 30% is the line-break information for the first line, 60% for the second line, and 90% for the third line; the last line has no line break. The text display unit sends LinePrecent[3] to the voice broadcast unit at time T3.
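How such an array follows from the string length and the line capacity can be sketched as below. This is an illustrative reconstruction, not code from the patent: `line_break_percentages` is a hypothetical name, every line except the last is assumed to be completely filled, and percentages are rounded to whole numbers.

```python
import math
from typing import List


def line_break_percentages(total_chars: int, chars_per_line: int) -> List[int]:
    """Return the broadcast progress (in percent) at which each line ends.

    Assumes every line but the last holds exactly chars_per_line characters;
    the last line gets no entry because it is not followed by a line break.
    """
    line_count = math.ceil(total_chars / chars_per_line)
    return [round(100 * chars_per_line * i / total_chars)
            for i in range(1, line_count)]


print(line_break_percentages(100, 30))   # the 100-character, 30-per-line example
```

For the example in the text (100 characters, at most 30 per line) this gives four lines and the three break points 30, 60 and 90.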

In addition, the text display unit in this embodiment maintains a variable LineNum that records the display line currently scrolled to. Its initial value is 1, and it is incremented by 1 each time scrolling reaches the next line (for example, when the second line is reached). The display line scrolled to can be determined from the initial line number, the pixel height of the text, and the preset scrolling speed. For example, with a text height of 20 pixels and a preset scrolling speed of 4 px per instruction, five instructions scroll exactly one line.

As shown in FIG. 1, at time T4 the voice broadcast unit receives the line-break array LinePrecent[3] sent by the text display unit. While broadcasting, the voice broadcast unit tracks its own broadcast progress in real time. Each time its actual broadcast progress reaches the line-break information for a line (that is, reaches one of the values in LinePrecent[3]), it sends the broadcast line number to the text display unit.

For example, denote the broadcast line number as LineCmd; LineCmd does not itself perform a line break. Suppose the current broadcast progress of the voice broadcast unit is 25%. When the progress reaches 30%, the unit sends LineCmd=2, indicating that the broadcast has now reached the second line. When the progress reaches 60%, it sends LineCmd=3, indicating that the broadcast has reached the third line. When the progress reaches 90%, it sends LineCmd=4, indicating that the broadcast has reached the fourth line.
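The behaviour of the voice broadcast unit in this example can be sketched as follows. The function name `broadcast_line_commands` and the idea of feeding it a list of sampled progress values are assumptions made for illustration; the patent only states that LineCmd is sent when the progress reaches each value in the line-break array.

```python
from typing import List, Sequence


def broadcast_line_commands(progress_samples: Sequence[float],
                            line_percent: Sequence[float]) -> List[int]:
    """Return the LineCmd values sent while the broadcast progresses.

    progress_samples: hypothetical sequence of progress readings in percent.
    line_percent:     line-break array, for example [30, 60, 90].
    """
    commands = []
    next_break = 0  # index of the next line break not yet reached
    for progress in progress_samples:
        # A coarse sample may cross several break points at once.
        while next_break < len(line_percent) and progress >= line_percent[next_break]:
            # Passing the break after line k means line k + 1 has started;
            # with a zero-based index that is next_break + 2.
            commands.append(next_break + 2)
            next_break += 1
    return commands


print(broadcast_line_commands([25, 30, 45, 60, 61, 90, 100], [30, 60, 90]))
```

With the progress values above, the commands sent are 2, 3 and 4, matching the sequence described in the example.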

As shown in FIG. 1, the text display unit receives the broadcast line number at time T6 and completes the actual line-break operation at time T7. Specifically, each time the text display unit receives a broadcast line number from the voice broadcast unit, it compares that number (the line the voice broadcast unit has theoretically reached) with the actual display line number it has determined itself, and derives an error value, for example by taking the difference between the display line number and the broadcast line number. When the error value exceeds the preset threshold, the text display unit adjusts the scrolling speed of the characters displayed in each line.

In a specific implementation, when the display line number minus the broadcast line number is greater than zero, the scrolling speed of the characters displayed in each line is reduced; when that difference is less than zero, the scrolling speed is increased.

In this way, after a certain time the text display and the voice broadcast become synchronized, and the scrolling speed of the characters displayed in each line of the text display unit stabilizes within a reasonable interval. By adjusting this scrolling speed, the embodiment corrects the error that accumulates when a voice broadcast plays for a long time, and thereby keeps text and voice synchronized in real time.

Based on the same inventive concept, a specific embodiment of the present invention further provides a calibration device for synchronous text and voice broadcast. As shown in FIG. 4, the device includes:

a character scrolling module 41, configured to receive the speech recognition string sent by the voice broadcast unit, determine the characters displayed in each line, and automatically scroll the characters displayed in each line at a preset scrolling speed;

a line-break information determining module 42, configured to derive the line-break information corresponding to each line from the characters displayed in each line, and send the line-break information to the voice broadcast unit;

an information receiving module 43, configured to receive the broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line;

a synchronization calibration module 44, configured to determine the display line number currently scrolled to, compare the display line number with the broadcast line number to determine an error value, and adjust the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold.

Based on the same inventive concept, a specific embodiment of the present invention further provides a calibration device for synchronous text and voice broadcast. As shown in FIG. 5, the device includes:

a string sending module 51, configured to send the acquired speech recognition string to the text display unit;

a line-break feedback information determining module 52, configured to receive the line-break information sent by the text display unit and, when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line, send the broadcast line number to the text display unit.

Based on the same inventive concept, a specific embodiment of the present invention further provides a computer storage medium that stores a program corresponding to the above calibration method for synchronous text and voice broadcast.

Those skilled in the art will understand that the present invention covers devices for performing one or more of the operations described in this application. These devices may be specially designed and manufactured for the required purpose, or may comprise known devices in general-purpose computers. These devices store computer programs that are selectively activated or reconfigured. Such computer programs may be stored in a device-readable (for example, computer-readable) medium, or in any type of medium suitable for storing electronic instructions and coupled to a bus, including but not limited to any type of disk (including floppy disks, hard disks, optical disks, CD-ROMs, and magneto-optical disks), ROM (Read-Only Memory), RAM (Random Access Memory), EPROM (Erasable Programmable Read-Only Memory), EEPROM (Electrically Erasable Programmable Read-Only Memory), flash memory, magnetic cards, or optical cards. That is, a readable medium includes any medium in which information is stored or transmitted by a device (for example, a computer) in a readable form.

In summary, the calibration method for synchronous text and voice broadcast provided by the specific embodiments of the present invention includes: the text display unit receives the speech recognition string sent by the voice broadcast unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the text display unit derives the line-break information corresponding to each line from the characters displayed in each line and sends the line-break information to the voice broadcast unit; the text display unit receives the broadcast line number sent by the voice broadcast unit, the broadcast line number being the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line; the text display unit determines the display line number currently scrolled to, compares it with the broadcast line number to determine an error value, and adjusts the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold. Because the method compares the display line number with the broadcast line number and adjusts the scrolling speed whenever the resulting error exceeds the preset threshold, it can correct the accumulated drift delay caused by long playback and thereby improve the user experience.

The above describes only some embodiments of the present invention. It should be noted that those of ordinary skill in the art may make further improvements and refinements without departing from the principles of the present invention, and such improvements and refinements shall also fall within the scope of protection of the present invention.

Claims (10)

1. A calibration method for synchronous text and voice broadcast, characterized by comprising:
a text display unit receives a speech recognition string sent by a voice broadcast unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed;
the text display unit derives line-break information corresponding to each line from the characters displayed in each line, and sends the line-break information to the voice broadcast unit;
the text display unit receives a broadcast line number sent by the voice broadcast unit, wherein the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line;
and the text display unit determines the display line number currently scrolled to, compares the display line number with the broadcast line number to determine an error value, and adjusts the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold.
2. The calibration method according to claim 1, wherein the text display unit receiving the speech recognition string sent by the voice broadcast unit and determining the characters displayed in each line comprises:
the text display unit receives the speech recognition string sent by the voice broadcast unit, and determines the characters displayed in each line according to the width and height of the text display unit.
3. The calibration method according to claim 1, wherein the text display unit determining the display line number currently scrolled to comprises:
the text display unit determines the display line number currently scrolled to according to the initial line number, the pixel height of the text, and the preset scrolling speed.
4. The calibration method according to claim 1, wherein comparing the display line number with the broadcast line number to determine an error value comprises:
taking the difference between the display line number and the broadcast line number as the error value.
5. The calibration method according to claim 4, wherein adjusting the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold comprises:
when the difference between the display line number and the broadcast line number is greater than zero, reducing the scrolling speed of the characters;
and when the difference between the display line number and the broadcast line number is less than zero, increasing the scrolling speed of the characters.
6. A calibration method for synchronous text and voice broadcast, characterized by comprising:
a voice broadcast unit sends an acquired speech recognition string to a text display unit;
the voice broadcast unit receives line-break information sent by the text display unit, and sends a broadcast line number to the text display unit when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line.
7. The calibration method according to claim 6, wherein after the voice broadcast unit sends the acquired speech recognition string to the text display unit, and before the voice broadcast unit receives the line-break information sent by the text display unit, the method further comprises:
the voice broadcast unit starts broadcasting the acquired speech recognition string after a preset time.
8. A calibration device for synchronous text and voice broadcast, characterized by comprising:
a character scrolling module, configured to receive a speech recognition string sent by a voice broadcast unit, determine the characters displayed in each line, and automatically scroll the characters displayed in each line at a preset scrolling speed;
a line-break information determining module, configured to derive line-break information corresponding to each line from the characters displayed in each line, and send the line-break information to the voice broadcast unit;
an information receiving module, configured to receive a broadcast line number sent by the voice broadcast unit, wherein the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line;
and a synchronization calibration module, configured to determine the display line number currently scrolled to, compare the display line number with the broadcast line number to determine an error value, and adjust the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold.
9. A calibration device for synchronous text and voice broadcast, characterized by comprising:
a string sending module, configured to send an acquired speech recognition string to a text display unit;
and a line-break feedback information determining module, configured to receive line-break information sent by the text display unit, and send a broadcast line number to the text display unit when the actual broadcast progress of the voice broadcast unit reaches the line-break information corresponding to each line.
10. A computer storage medium, storing a program corresponding to the calibration method according to any one of claims 1 to 5, and/or a program corresponding to the calibration method according to any one of claims 6 to 7.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810150541.0A CN108366182B (en) 2018-02-13 2018-02-13 Calibration method and device for synchronous broadcast of text voice and computer storage medium

Publications (2)

Publication Number Publication Date
CN108366182A CN108366182A (en) 2018-08-03
CN108366182B CN108366182B (en) 2020-07-07

Family

ID=63002416

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810150541.0A Active CN108366182B (en) 2018-02-13 2018-02-13 Calibration method and device for synchronous broadcast of text voice and computer storage medium

Country Status (1)

Country Link
CN (1) CN108366182B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109413479A (en) * 2018-09-28 2019-03-01 四川长虹电器股份有限公司 The method that smart television voice Interaction Interface content of text is completely shown
CN110267080A (en) * 2019-04-30 2019-09-20 北京云迹科技有限公司 Data processing method and device for robot casting content synchronization
CN110232194B (en) * 2019-06-17 2024-04-09 安徽听见科技有限公司 Translation display method, device, equipment and readable storage medium
CN110910918B (en) * 2019-10-11 2021-03-23 广州优谷信息技术有限公司 Subtitle rolling control method, system and device
CN110991455B (en) * 2020-02-11 2023-05-05 上海肇观电子科技有限公司 Image text broadcasting method and equipment, electronic circuit and storage medium thereof
US11776286B2 (en) 2020-02-11 2023-10-03 NextVPU (Shanghai) Co., Ltd. Image text broadcasting
CN111917937A (en) * 2020-06-23 2020-11-10 广州优谷信息技术有限公司 Prompter display method, device, prompter system, operation terminal and storage medium
CN112423094A (en) * 2020-10-30 2021-02-26 广州佰锐网络科技有限公司 Double-recording service broadcasting method and device and storage medium
CN112632120A (en) * 2020-12-08 2021-04-09 深圳前海微众银行股份有限公司 Information processing method, device, equipment and computer storage medium
CN112887779B (en) * 2021-01-20 2022-11-18 杭州小众圈科技有限公司 Method, system and device for automatically rolling subtitles based on voice rhythm
CN115148183B (en) * 2021-03-15 2025-04-04 北京中关村科金技术有限公司 TTS voice corresponding subtitle scrolling method, device and storage medium
CN115248841A (en) * 2021-04-27 2022-10-28 华为技术有限公司 Method and device for simultaneous broadcast of text and speech
CN118522302B (en) * 2024-06-11 2024-11-15 广州市悦程信息科技有限公司 A method and device for speech recognition sound-word synchronization

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6130722A (en) * 1996-05-03 2000-10-10 Samsung Electronics Co., Ltd Closed-caption broadcasting and displaying method and apparatus suitable for syllable characters
CN102148937A (en) * 2010-02-05 2011-08-10 新奥特(北京)视频技术有限公司 Realization method and realization device for tail screen staying in scrolling design
CN102663988A (en) * 2012-04-28 2012-09-12 广东威创视讯科技股份有限公司 Method, device and system for broadcasting subtitles
US8826354B2 (en) * 2010-12-01 2014-09-02 At&T Intellectual Property I, L.P. Method and system for testing closed caption content of video assets

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010026609A (en) * 1999-09-07 2001-04-06 나현민 Audio play media device for combining automatic caption information extracting and storing function with character displaying function
CN1225721C (en) * 2001-03-29 2005-11-02 皇家菲利浦电子有限公司 Synchronise audio cursor and text cursor during editing
JP2004266577A (en) * 2003-02-28 2004-09-24 Casio Comput Co Ltd Media synchronized playback device, server device, media synchronized playback processing program, server processing program
JP3945778B2 (en) * 2004-03-12 2007-07-18 インターナショナル・ビジネス・マシーンズ・コーポレーション Setting device, program, recording medium, and setting method
CN101500127A (en) * 2008-01-28 2009-08-05 德信智能手机技术(北京)有限公司 Method for synchronously displaying subtitle in video telephone call
CN101382937B (en) * 2008-07-01 2011-03-30 深圳先进技术研究院 Speech recognition-based multimedia resource processing method and its online teaching system
JP2010136067A (en) * 2008-12-04 2010-06-17 Sony Corp Data processing device, data processing method, and program
CN101808202B (en) * 2009-02-18 2013-09-04 联想(北京)有限公司 Method, system and computer for realizing sound-and-caption synchronization in video file
KR101709053B1 (en) * 2010-04-01 2017-02-22 티제이커뮤니케이션 주식회사 Caption data structure and caption player for synchronizing syllables between a sound source and caption data
US8564721B1 (en) * 2012-08-28 2013-10-22 Matthew Berry Timeline alignment and coordination for closed-caption text using speech recognition transcripts
CN104038804B (en) * 2013-03-05 2017-09-29 三星电子(中国)研发中心 Captioning synchronization apparatus and method based on speech recognition
CN103297711B (en) * 2013-06-19 2016-11-02 青海汉拉信息科技股份有限公司 Chinese marks Chinese language subtitles recorded broadcast equipment the most in real time
CN103561217A (en) * 2013-10-14 2014-02-05 深圳创维数字技术股份有限公司 Method and terminal for generating captions
CN106033678A (en) * 2015-03-18 2016-10-19 珠海金山办公软件有限公司 Playing content display method and apparatus thereof
CN105791713A (en) * 2016-03-21 2016-07-20 安徽声讯信息技术有限公司 Intelligent device for playing voices and captions synchronously
CN106385548A (en) * 2016-09-05 2017-02-08 努比亚技术有限公司 Mobile terminal and method for generating video captions
CN106340291A (en) * 2016-09-27 2017-01-18 广东小天才科技有限公司 Bilingual subtitle making method and system
CN106340294A (en) * 2016-09-29 2017-01-18 安徽声讯信息技术有限公司 Synchronous translation-based news live streaming subtitle on-line production system

Also Published As

Publication number Publication date
CN108366182A (en) 2018-08-03

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
OL01 Intention to license declared