[go: up one dir, main page]

CN100382594C - Fast forward playing method for video and audio signal - Google Patents

Fast forward playing method for video and audio signal Download PDF

Info

Publication number
CN100382594C
CN100382594C CNB2004100476527A CN200410047652A CN100382594C CN 100382594 C CN100382594 C CN 100382594C CN B2004100476527 A CNB2004100476527 A CN B2004100476527A CN 200410047652 A CN200410047652 A CN 200410047652A CN 100382594 C CN100382594 C CN 100382594C
Authority
CN
China
Prior art keywords
video
audio signal
fast
audio
buffer storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2004100476527A
Other languages
Chinese (zh)
Other versions
CN1705365A (en
Inventor
林士生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ali Corp
Original Assignee
Ali Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ali Corp filed Critical Ali Corp
Priority to CNB2004100476527A priority Critical patent/CN100382594C/en
Publication of CN1705365A publication Critical patent/CN1705365A/en
Application granted granted Critical
Publication of CN100382594C publication Critical patent/CN100382594C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Television Signal Processing For Recording (AREA)

Abstract

The invention relates to a video signal fast-forward playing method, which is a video signal fast-forward playing method using a time scaling algorithm (timescaling algorithm). The purpose of compressing the video signal is achieved by an interpolation-coefficient algorithm (inter-coefficient) of audio range limitation (angement) and slope calculation (slope calculation), and the quality of the played video signal is improved. The purpose of fast forward playing of the invention is to perform a time scaling operation on each video signal unit in a video signal data stream (data stream), so that a plurality of video signal units are compressed into one video signal unit according to a required compression ratio, and good sound quality can be maintained.

Description

影音信号快进播放方法 Audio and video signal fast forward playback method

技术领域 technical field

本发明涉及一使用时间缩放算法的影音信号快进播放方法,通过音频范围限制与波形斜率计算来改善快进播放时的影音信号品质。The invention relates to a video and audio signal fast-forward playback method using a time scaling algorithm, which improves the video and audio signal quality during fast-forward playback through audio range limitation and waveform slope calculation.

背景技术 Background technique

一般使用者在使用如CD、VCD、DVD或卡带等的影音资料时,在影音信号播放时不时有前进或倒转的需要,尤其是要重复聆听或观看某个多媒体片段,若需要快速到达某一预定位置、并以慢速来播放,更是有快速/慢速前进(fast/slow forward)或快速/慢速倒转(fast/slow reverse)的需要。因此,公知技术发展出几种针对上述需求的解决方式,如取样频率(samplingfrequency)与时间缩放(time scale)等方法。Generally, when users use audio-visual materials such as CD, VCD, DVD or cassettes, they need to move forward or reverse from time to time when the audio-visual signal is played, especially to listen to or watch a certain multimedia clip repeatedly. Position, and play at a slow speed, there is a need for fast/slow forward or fast/slow reverse. Therefore, the known technologies have developed several solutions to the above requirements, such as sampling frequency and time scaling.

公知技术中时间缩放的概念可以参阅图1来说明。其中,有一输入影音信号M,并区分为多个开窗(window),如第一开窗11、第二开窗12、第三开窗13等;此开窗为声音信号中的最小单位,在一预先定义的压缩比下进行开窗重复的动作。如图示的输出影音信号N,使输入影音信号M的第一开窗11与第二开窗12有一重复信号位置P1,第二开窗12与第三开窗13有一重复信号位置P2,通过各相邻开窗重复的压缩步骤,达到时间压缩与影音信号快进的目的。The concept of time scaling in the known technology can be described with reference to FIG. 1 . Among them, there is an input audio-visual signal M, which is divided into multiple windows, such as the first window 11, the second window 12, the third window 13, etc.; this window is the smallest unit in the audio signal, Windowing repeats the action at a predefined compression ratio. As shown in the figure, the audio-visual signal N is output, so that the first window 11 and the second window 12 of the input audio-visual signal M have a repetitive signal position P1, and the second window 12 and the third window 13 have a repetitive signal position P2. The repeated compression steps of adjacent windows achieve the purpose of time compression and fast-forwarding of video and audio signals.

公知技术的时间缩放方法的另一实施例请参阅图2A至图2F所示,为音波与时间轴的关系示意图。图2A中有一最小波长Lmin与最大波长Lmax,通过一相似性(similarity)侦测步骤由最小波长Lmin增加到最大波长Lmax,得出一基本周期Lp,如图2B所示。再根据此基本周期Lp将该原始音波切分为第一波形A与第二波形B,如图2C所示。且由图2C可知,第一波形A为一斜率下降的波形(图2D),而第二波形B为一斜率上升的波形(图2E)。将第一波形A与第二波形B相加得图2F所示的混合波形(A+B),以代替原来的波形A、B,得到通过时间缩放修正方法而获取的结果。For another embodiment of the time scaling method in the prior art, please refer to FIG. 2A to FIG. 2F , which are schematic diagrams showing the relationship between the sound wave and the time axis. In FIG. 2A there is a minimum wavelength Lmin and a maximum wavelength Lmax. Through a similarity detection step, the minimum wavelength Lmin is increased to the maximum wavelength Lmax to obtain a fundamental period Lp, as shown in FIG. 2B . Then, according to the fundamental period Lp, the original sound wave is divided into a first waveform A and a second waveform B, as shown in FIG. 2C . And it can be seen from FIG. 2C that the first waveform A is a waveform with a falling slope ( FIG. 2D ), and the second waveform B is a waveform with a rising slope ( FIG. 2E ). The mixed waveform (A+B) shown in FIG. 2F is obtained by adding the first waveform A and the second waveform B to replace the original waveforms A and B to obtain the result obtained by the time scaling correction method.

再请参阅图3A,为美国专利No.6,424,789号所示的取样频率方法数据流示意图。此公知技术是依据影像内容来改变影像数据快转或慢转取样的方法,在此仅就其中的影音信号数据筛选方法来讨论。如图所示的影音信号数据流30,其中包括两组影音信号片段(shot):第一影音信号片段31与第二影音信号片段32;而各影音信号片段进一步分割成多个影音信号框(frame),如第一影音信号片段31中的影音信号框F1、F2、F3、F4、……、Fn,与第二影音信号片段32中的F1、F2、F3、……、Fm。若要将此影音信号数据以慢速播放,则需另外依据邻近的影音信号框来加入或复制额外的影音信号框,使播放的数据流拉长;若要快速播放,则需根据一选取原则(此原则并不在此详述)将选取的影音信号框删去,以缩短影音信号数据流长度。如图示,将第一影音信号片段31中的F2、F4丢弃,并将第二影音信号片段32中的F2、Fm’舍去,以达到影音信号快速/慢速前进或倒转播放的目的。Please refer to FIG. 3A again, which is a schematic diagram of the data flow of the sampling frequency method shown in US Patent No. 6,424,789. This known technology is a method of changing the fast-forward or slow-forward sampling of image data according to the image content, and here only discusses the screening method of the video and audio signal data. The audiovisual signal data stream 30 as shown in the figure includes two groups of audiovisual signal segments (shot): the first audiovisual signal segment 31 and the second audiovisual signal segment 32; and each audiovisual signal segment is further divided into a plurality of audiovisual signal frames ( frame), such as video and audio signal frames F1, F2, F3, F4, . . . If you want to play the audio and video signal data at a slow speed, you need to add or copy additional audio and video signal frames according to the adjacent audio and video signal frames to make the played data stream longer; if you want to play fast, you need to follow a selection principle (This principle will not be described in detail here) The selected video and audio signal frame is deleted to shorten the length of the audio and video signal data stream. As shown in the figure, F2 and F4 in the first audio-visual signal segment 31 are discarded, and F2 and Fm' in the second audio-visual signal segment 32 are discarded, so as to achieve the purpose of fast/slow forward or reverse playback of the audio-visual signal.

图3B为图3A所示公知技术的方法流程图。当影音信号开始播放(步骤301);影音设备即接收来自播放源的影音信号数据流(stream)(步骤302);由设备中的处理器,如数字信号处理器(DSP),将数据流区分成多个片段(shot)(步骤303);再由使用者控制决定速度改变的等级,如快转/慢转的速度、品质等(步骤304);再依使用者所决定的等级将各影音信号片段分类,使之分成多个不同的影音信号框(frame)(步骤305);运用此公知技术所述的适当的取样算法(sampling algorithm),将其中某些影音信号框丢弃或是复制,从而达到快转或慢速播放的目的(步骤306);再判断是否已处理至最后一个片段(步骤307);若否,表示仍有影音信号片段未处理,再跳至下一片段(步骤308),并继续进行依等级将各片段分类成不同的影音信号框等步骤(步骤305);若已处理至最后一个影音信号片段,则将各筛选下来的包含多个影音信号框的影音信号片段重组,而成为一修正后的影音信号数据流(步骤309),此新修正的影音信号数据流则为使用者所需的快转或慢速播放的影音信号数据。FIG. 3B is a flow chart of the method of the known technology shown in FIG. 3A . When the audio-visual signal starts to play (step 301); the audio-visual device receives the audio-visual signal data stream (stream) from the playback source (step 302); the processor in the device, such as a digital signal processor (DSP), distinguishes the data stream into multiple segments (shot) (step 303); then the level of speed change is controlled by the user, such as the speed and quality of fast forward/slow forward (step 304); Classify the signal segments so that they are divided into a plurality of different audio-visual signal frames (step 305); use the appropriate sampling algorithm (sampling algorithm) described in this known technology to discard or duplicate some of the audio-visual signal frames, So as to achieve the purpose of fast forward or slow playback (step 306); then judge whether the last segment has been processed (step 307); ), and continue to classify each segment into different audio-visual signal frames according to the level (step 305); Recombined to become a modified audio-visual signal data stream (step 309 ), the newly modified audio-visual signal data stream is the audio-visual signal data for fast forward or slow playback required by the user.

但上述公知技术中的第一实施例利用时间缩放压缩(time-scalecompress)技术而达到快转前进或倒转播放的目的;但由于在找相似点时,需要大量的运算,取定点的话,则会造成影音信号数据不连续的问题,因而尤其在复音(multi-tone)的快速播放时会出现爆音或噪声。而第二实施例的取样频率(sampling frequency)方法,会导致产生变频的现象,使播放出的声音为一不寻常的声响,通常会变成尖锐而音频较高的声响。因此,本发明针对公知技术快速前进或倒转的问题,提出一通过时间缩放技术所发展而来的影音信号快进播放方法。However, the first embodiment in the above-mentioned known technology utilizes time scaling and compression (time-scalecompress) technology to achieve the purpose of fast-forwarding or reversing playback; The problem of discontinuity of video and audio signal data is caused, so popping sound or noise will appear especially when multi-tone (multi-tone) is played quickly. And the sampling frequency (sampling frequency) method of the second embodiment can cause the phenomenon of frequency conversion, so that the sound played out is an unusual sound, which usually becomes a sharp and high-frequency sound. Therefore, the present invention proposes a fast-forward playback method of video and audio signals developed through time scaling technology to solve the problem of fast forward or reverse in the known technology.

发明内容 Contents of the invention

本发明的主要目的是要解决公知的影音信号快进时会产生变频或爆音的缺点。因此,本发明提出一由时间缩放技术所发展而来的影音信号快进播放的时间缩放算法(time scaling algorithm),通过其中的音频范围限制与波形斜率计算来改善快进播放的音质。The main purpose of the present invention is to solve the known shortcoming of frequency conversion or popping sound when the audio-visual signal is fast-forwarded. Therefore, the present invention proposes a time scaling algorithm for fast-forward playback of audio-visual signals developed from time-scaling technology. The sound quality of fast-forward playback is improved by limiting the audio range and calculating the slope of the waveform.

本发明提供一种影音信号快进播放方法,该方法是由一时间缩放技术(time scaling)所发展而来的影音信号快进播放的一内插系数算法(inter-coefficient),达到压缩一影音信号数据流的目的,该影音信号快进播放方法的步骤包括有:步骤一,将多个影音信号单元分组存入至少一个缓冲存储器中;步骤二,在该缓冲存储器中设定多个索引(index);步骤三,设定一参考点,该参考点为该内插系数算法的一基准点(alignment point);步骤四,通过该基准点地址开始该内插系数算法的演算,得出一新的压缩影音信号单元;步骤五,将该缓冲存储器其中一个的索引位移至下一个需要压缩的影音信号地址。并重复步骤四到步骤五,以完成影音信号的压缩,从而达到快进播放的目的。The present invention provides a method for fast-forward playback of audio-visual signals, which is an inter-coefficient algorithm (inter-coefficient) for fast-forward playback of audio-visual signals developed from a time scaling technology (time scaling), to achieve compression of an audio-visual The purpose of the signal data flow, the step of this audio-visual signal fast-forward playing method comprises: Step 1, a plurality of audio-visual signal units are grouped and stored in at least one buffer memory; Step 2, set a plurality of indexes in this buffer memory ( index); step 3, setting a reference point, which is a reference point (alignment point) of the interpolation coefficient algorithm; step 4, starting the calculation of the interpolation coefficient algorithm by the reference point address, and obtaining a A new compressed audio-visual signal unit; Step 5, shifting the index of one of the buffer memories to the address of the next audio-visual signal to be compressed. And repeat steps 4 to 5 to complete the compression of video and audio signals, so as to achieve the purpose of fast-forward playback.

根据上述构想,其中,在该步骤开始之初,将该影音信号数据流分为该多个影音信号单元。According to the above idea, wherein, at the beginning of the step, the video and audio signal data stream is divided into the plurality of audio and video signal units.

根据上述构想,其中,该影音信号单元中包括有多个取样影音信号。According to the above idea, wherein, the audio-video signal unit includes a plurality of sampled video-audio signals.

根据上述构想,其中,在将该多个影音信号单元分组存入至少一个缓冲存储器的步骤中,是依所需压缩比或快进播放速度而分组存入该至少一个缓冲存储器中。According to the above idea, wherein, in the step of grouping and storing the plurality of video and audio signal units into at least one buffer memory, grouping and storing the plurality of video and audio signal units into the at least one buffer memory is performed according to the required compression ratio or fast-forward playback speed.

根据上述构想,其中,计算出该参考点的步骤,是由影音信号单元的取样影音信号计算出一参考点,以此参考点作为初始点,然后从最前头的初始点开始逐点计算,找出最佳基准值,作为下一次运算的基准点。According to the above idea, wherein, the step of calculating the reference point is to calculate a reference point from the sampled audio-visual signal of the audio-visual signal unit, use this reference point as the initial point, and then start to calculate point by point from the initial point at the front, and find Get the best reference value as the reference point for the next operation.

本发明还提供一种影音信号快进播放方法,是由一时间缩放技术所发展而来的用于影音信号快进播放的一内插系数算法,用以压缩一影音信号数据流,其中,该影音信号快进播放方法的步骤包括:步骤一,将该影音信号数据流分为多个影音信号单元;步骤二,将该多个影音信号单元分组存入一第一缓冲存储器与一第二缓冲存储器;步骤三,在该第一缓冲存储器与该第二缓冲存储器中设定多个索引点;步骤四,设定一参考点,该参考点为该内插系数算法的一基准点;步骤五,由该基准点地址开始该内插系数算法的演算,得出一新的压缩影音信号单元;步骤六,以及将该缓冲存储器其中一个的索引位移至下一个需要压缩的影音信号地址;重复步骤五到步骤六以完成影音信号压缩,从而快进播放。The present invention also provides a method for fast-forward playback of audio-visual signals, which is an interpolation coefficient algorithm for fast-forward playback of audio-visual signals developed from a time scaling technique, and is used to compress an audio-visual signal data stream, wherein the The steps of the audio-visual signal fast-forward playback method include: Step 1, dividing the audio-visual signal data stream into a plurality of audio-visual signal units; Step 2, storing the plurality of audio-visual signal units in groups into a first buffer memory and a second buffer memory; Step 3, setting a plurality of index points in the first buffer memory and the second buffer memory; Step 4, setting a reference point, which is a reference point for the interpolation coefficient algorithm; Step 5 , start the calculation of the interpolation coefficient algorithm from the reference point address, and obtain a new compressed audio-visual signal unit; Step 6, and shift the index of one of the buffer memories to the next address of the audio-visual signal that needs to be compressed; repeat the steps Step 5 to step 6 to complete the video signal compression, so as to fast-forward playback.

根据上述构想,其中,该影音信号单元中包括有多个取样影音信号。According to the above idea, wherein, the audio-video signal unit includes a plurality of sampled video-audio signals.

根据上述构想,其中,在将该多个影音信号单元分组存入该第一缓冲存储器与该第二缓冲存储器的步骤中,是依所需压缩比或快进播放速度而分组存入该缓冲存储器中。According to the above idea, wherein, in the step of grouping and storing the multiple audio-visual signal units into the first buffer memory and the second buffer memory, they are grouped into the buffer memory according to the required compression ratio or fast-forward playback speed middle.

根据上述构想,其中,该基准点开始逐点计算的公式为:According to the above idea, wherein, the formula for starting point-by-point calculation of the reference point is:

temp[i]+=Buffer1[index1+i]×Buffer2[index2+j]temp[i]+=Buffer1[index1+i]×Buffer2[index2+j]

其中Buffer1[]为该第一缓冲存储器的地址函数,Buffer2[]为该第二缓冲存储器的地址函数,其中变量index1+i表示该第一缓冲存储器的影音信号单元中的取样影音信号地址,而变量index2+j表示该第二缓冲存储器的影音信号单元中的取样影音信号地址。Wherein Buffer1[] is the address function of the first buffer memory, and Buffer2[] is the address function of the second buffer memory, wherein the variable index1+i represents the sampling audio-visual signal address in the audio-visual signal unit of the first buffer memory, and The variable index2+j represents the address of the sampled audio/video signal in the audio/video signal unit of the second buffer memory.

根据上述构想,其中,该内插系数算法的计算公式为:According to the above idea, wherein, the calculation formula of the interpolation coefficient algorithm is:

Buffer1[alignment+i]=(Buffer2[i]×iBuffer1[alignment+i]=(Buffer2[i]×i

+Buffer1[alignment+i]×unit-Buffer1[alignment+i]×i)/unit+Buffer1[alignment+i]×unit-Buffer1[alignment+i]×i)/unit

其中Buffer1[]为该第一缓冲存储器的地址函数,Buffer2[]为该第二缓冲存储器的地址函数,其中变量alignment+i表示该第一缓冲存储器的影音信号单元中的基准点地址,而变量i表示位于该第二缓冲存储器的影音信号单元中的起始点地址。Wherein Buffer1[] is the address function of the first buffer memory, Buffer2[] is the address function of the second buffer memory, wherein the variable alignment+i represents the reference point address in the audio-visual signal unit of the first buffer memory, and the variable i represents the address of the starting point in the video and audio signal unit of the second buffer memory.

附图说明 Description of drawings

图1所示为公知技术的时间缩放概念的示意图;FIG. 1 is a schematic diagram of the concept of time scaling in the known technology;

图2A至图2F所示为音波与时间轴关系的示意图;2A to 2F are schematic diagrams showing the relationship between the sound wave and the time axis;

图3A所示为公知技术的取样频率方法数据流的示意图;Fig. 3 A shows the schematic diagram of the sampling frequency method data flow of known technology;

图3B所示为公知技术的取样频率方法的流程图;Fig. 3B shows the flowchart of the sampling frequency method of known technology;

图4A至图4C所示为本发明影音信号快进的时间缩放方法的示意图;4A to 4C are schematic diagrams of the time scaling method for fast-forwarding of video and audio signals according to the present invention;

图5为本发明影音信号快进播放方法的步骤流程图。FIG. 5 is a flow chart of the steps of the fast-forward playback method of audio-visual signals according to the present invention.

其中,附图标记说明如下:Wherein, the reference signs are explained as follows:

11第一开窗         12第二开窗              13第三开窗11 First window opening 12 Second window opening 13 Third window opening

M输入影音信号      N输出影音信号           P1,P2重复信号位置M input audio and video signal N output audio and video signal P1, P2 repeat signal position

30影音信号数据流   31第一影音信号片段      32第二影音信号片段30 AV signal data stream 31 First AV signal segment 32 Second AV signal segment

A第一波形          B第二波形               Lmax最大波长A The first waveform B The second waveform Lmax maximum wavelength

Lmin最小波长        Lp基本周期             40影音信号数据流Lmin Minimum Wavelength Lp Basic Period 40 AV signal data flow

41第一缓冲存储器                           42第二缓冲存储器41 First buffer memory 42 Second buffer memory

401,402,403,404影音信号单元401, 402, 403, 404 audio-visual signal unit

411,421记忆区块                           i401,i402索引点411, 421 memory blocks i401, i402 index points

具体实施方式 Detailed ways

本发明为一种影音信号快进播放方法,为解决公知技术中影音信号快进时使用时间缩放或取样频率等方法会产生太高或尖锐的变频或播放过程爆音的缺点,而提出一改善公知时间缩放技术所发展而来的影音信号快进播放的时间缩放算法(time scaling algorithm),通过其中的音频范围限制(rangerestriction)与波形斜率(slope)计算来改善快进播放的音质。The present invention is a fast-forward playback method of audio-visual signals. In order to solve the disadvantages of using time scaling or sampling frequency and other methods when fast-forwarding audio-visual signals in the known technology, which will produce too high or sharp frequency conversion or popping sound during playback, an improved known method is proposed. The time scaling algorithm for fast-forward playback of video and audio signals developed from time scaling technology improves the sound quality of fast-forward playback through the audio range restriction and waveform slope calculation.

请参阅图4A至图4C,所示为本发明的影音信号快进的时间缩放方法的示意图。Please refer to FIG. 4A to FIG. 4C , which are schematic diagrams of the time scaling method for fast-forwarding audio and video signals according to the present invention.

图4A所示为一影音信号数据流(stream)40,其中包括有多个影音信号单元401、402、403与404,每个影音信号单元进一步包含有多个影音信号数据流最小单元,即取样影音信号(sample)。而本发明快进播放的目的即为将其中各影音信号单元作一时间缩放的演算,使多个影音信号单元(unit)依所需的压缩比压缩为一个影音信号单元,并能维持良好的音质。例如,若为两倍快进播放时,即将两个影音信号单元压缩为一个新的影音信号单元;若为四倍快进播放时,则同理将四个影音信号单元压缩为一个新的影音信号单元。Figure 4A shows a video and audio signal data stream (stream) 40, which includes a plurality of audio and video signal units 401, 402, 403 and 404, and each audio and video signal unit further includes a plurality of audio and video signal data stream minimum units, namely sampling AV signal (sample). The purpose of fast-forward playback in the present invention is to perform a time-scaling calculation on each audio-visual signal unit, so that multiple audio-visual signal units (units) can be compressed into one audio-visual signal unit according to the required compression ratio, and can maintain a good sound quality. For example, if it is played at double fast-forward, two audio-visual signal units will be compressed into a new audio-visual signal unit; if it is played at four times fast-forward, four audio-visual signal units will be compressed into a new audio-visual signal unit in the same way signal unit.

请参阅图4B所示的两倍快进播放的实施例,其中包括有内存中配置的第一缓冲存储器41与第二缓冲存储器42。将影音信号数据流40中两个影音信号单元为一组分批存入此缓冲存储器,如影音信号单元401与402存入第二缓冲存储器42(如公式一的Buffer1)的记忆区块421,而影音信号单元403与404存入第一缓冲存储器41(如公式一的Buffer2)的记忆区块411。在两倍快进播放的实施例中,记忆区块411与记忆区块421的数据长度为两个影音信号单元的数据长度。Please refer to the embodiment of double fast-forward playback shown in FIG. 4B , which includes a first buffer memory 41 and a second buffer memory 42 configured in memory. Store the two audio-visual signal units in the audio-visual signal data stream 40 as a group into the buffer memory in batches, such as the audio-visual signal units 401 and 402 are stored in the memory block 421 of the second buffer memory 42 (such as Buffer1 in formula 1), The video and audio signal units 403 and 404 are stored in the memory block 411 of the first buffer memory 41 (such as Buffer2 in Formula 1). In the embodiment of double fast-forward playback, the data length of the memory block 411 and the memory block 421 is the data length of two audio-visual signal units.

缓冲存储器中定义索引地址(index),如在第一缓冲存储器41中设定索引点i401,在第二缓冲存储器42中设定索引点i402,索引值所标记的为影音信号单元中的取样影音信号(sample)。The index address (index) is defined in the buffer memory, such as setting the index point i401 in the first buffer memory 41, setting the index point i402 in the second buffer memory 42, and what the index value marks is the sample video and audio in the video and audio signal unit Signal (sample).

为了在快进播放时不发生变频或爆音的现象,故需要在压缩时先找到相似波形的基准点,此基准点即为内插系数(inter-coefficient)算法开始运算时的起始点。请参阅公式一。In order to avoid frequency conversion or popping during fast-forward playback, it is necessary to find the reference point of similar waveforms during compression. This reference point is the starting point when the inter-coefficient algorithm starts to operate. See formula one.

temp[i]+=Buffer1[index1+i]×Buffer2[index2+j]    (公式一)temp[i]+=Buffer1[index1+i]×Buffer2[index2+j] (Formula 1)

此式中,Buffer1[]为第一缓冲存储器41的地址函数,Buffer2[]为第二缓冲存储器42的地址函数,其中变量index1+i表示第一缓冲存储器41的影音信号单元中的取样影音信号(sample)地址,而变量index2+j表示第二缓冲存储器42的影音信号单元中的取样影音信号(sample)地址。In this formula, Buffer1[] is the address function of the first buffer memory 41, and Buffer2[] is the address function of the second buffer memory 42, wherein the variable index1+i represents the sample video signal in the video signal unit of the first buffer memory 41 (sample) address, and the variable index2+j represents the address of the sample video signal (sample) in the video signal unit of the second buffer memory 42 .

实际上,找内插系数的算法即为将影音信号单元401,402,403,404的数值带入公式一,求得一个最相似的图形。以图4B为例,Buffer1提供给影音信号单元401和402,而Buffer2提供给影音信号单元403和404。代入公式后,求得最大的temp[i],其意义即为此i点为两对缓冲存储器最相似的起始点,我们称此点为基准点(alignment)。求得基准点后,再代入公式二,求出取代的一段影音信号值。In fact, the algorithm for finding interpolation coefficients is to bring the values of the audio-visual signal units 401, 402, 403, and 404 into formula 1 to obtain a most similar figure. Taking FIG. 4B as an example, Buffer1 is provided to the video and audio signal units 401 and 402 , and Buffer2 is provided to the audio and video signal units 403 and 404 . After substituting into the formula, the maximum temp[i] is obtained, which means that point i is the most similar starting point of the two pairs of buffer memories, and we call this point the alignment. After the reference point is obtained, it is then substituted into formula 2 to obtain the value of a segment of audio-visual signal to be replaced.

Buffer 1[alignment+i]=(Buffer2[i]×iBuffer 1[alignment+i]=(Buffer2[i]×i

+Buffer 1[alignment+i]×unit-Buffer1[alignment+i]×i)/unit+Buffer 1[alignment+i]×unit-Buffer1[alignment+i]×i)/unit

(公式二)    i=0~unit(Formula 2) i=0~unit

其中Buffer1[]为该第一缓冲存储器的地址函数,Buffer2[]为该第二缓冲存储器的地址函数,其中变量alignment+i表示该第一缓冲存储器的影音信号单元中的基准点地址,而变量i表示位于该第二缓冲存储器的影音信号单元中的起始点地址。Wherein Buffer1[] is the address function of the first buffer memory, Buffer2[] is the address function of the second buffer memory, wherein the variable alignment+i represents the reference point address in the audio-visual signal unit of the first buffer memory, and the variable i represents the address of the starting point in the video and audio signal unit of the second buffer memory.

由于此找内插系数需要大量的乘法,所以在此提出通过搜寻斜率(slope)以及数值范围来找出相似点的方法,从而降低运算的复杂度。也就是,在影音信号单元403与404中取一点为一比较点,然后在401与402中取一起始索引点i401,定一个数值差范围(range A),从此起始点开始比较是否具有相同斜率以及数值差是否在range A内;如不是,就往后找,最多到索引点i402。当找到最佳化的基准点,我们会代入公式二,求出新的一段影音信号数值。此方法大大降低了求最相似波形的运算量。Since finding interpolation coefficients requires a lot of multiplications, a method of finding similarities by searching slopes and value ranges is proposed here, thereby reducing the complexity of operations. That is, take a point in the audio-visual signal units 403 and 404 as a comparison point, then take a starting index point i401 in 401 and 402, set a numerical difference range (range A), and compare whether it has the same slope from this starting point And whether the value difference is within range A; if not, search backward, up to index point i402. When the optimal reference point is found, we will substitute it into formula 2 to obtain a new segment of video and audio signal value. This method greatly reduces the amount of computation for finding the most similar waveform.

图4C所示为将第一缓冲存储器41中的索引点i401位移至下一组需要压缩的影音信号单元,而第二缓冲存储器42即以公式二所得到的基准点作为下一压缩步骤的起始点(如图示的索引点i402)。Figure 4C shows that the index point i401 in the first buffer memory 41 is shifted to the next set of audio-visual signal units to be compressed, and the second buffer memory 42 uses the reference point obtained by formula 2 as the starting point of the next compression step The starting point (as shown in the index point i402).

最后将第二缓冲存储器的结果输出,即得到本发明为达到影音信号快进播放的目的所需的压缩影音信号。Finally, the result of the second buffer memory is output to obtain the compressed audio-visual signal required by the present invention to achieve the purpose of fast-forward playback of the audio-visual signal.

图5所示即为本发明影音信号快进播放方法的步骤流程图,流程如下:Shown in Fig. 5 is exactly the step flow chart of audio-visual signal fast-forward playing method of the present invention, and flow process is as follows:

步骤S1:依需求将一影音信号数据流分为多个影音信号单元(unit);Step S1: divide an audio-visual signal data stream into a plurality of audio-visual signal units (units) according to requirements;

步骤S2:将该多个影音信号单元依所需压缩比或快进播放速度而分组存入至少一个缓冲存储器(buffer)中,如图4A所述的两倍压缩比快进播放即以两个影音信号单元作为分组储存的参考,而缓冲存储器即如图4B、图4C所示的第一缓冲存储器41与第二缓冲存储器42;Step S2: store the plurality of audio-visual signal units in groups according to the required compression ratio or fast-forward playback speed into at least one buffer memory (buffer), and the double compression ratio fast-forward playback as described in Figure 4A is two The audio-visual signal unit is used as a reference for group storage, and the buffer memory is the first buffer memory 41 and the second buffer memory 42 shown in FIG. 4B and FIG. 4C;

步骤S3:在上述的缓冲存储器中设定多个索引值(index)来标示其中影音信号的地址,如图4B所示的第一缓冲存储器41中有一第一索引i401,第二缓冲存储器42中有第二索引i402;Step S3: set a plurality of index values (index) in the above-mentioned buffer memory to mark the address of the audio-visual signal therein, as shown in Figure 4B, there is a first index i401 in the first buffer memory 41, and a first index i401 in the second buffer memory 42 have a second index i402;

步骤S4:通过各缓冲存储器中的多个影音信号单元所标记的取样影音信号(sample)计算出一参考点,计算此参考点是为得出内插系数算法(inter-coefficient)的初始点;Step S4: Calculate a reference point through the sampled audio-visual signal (sample) marked by a plurality of audio-visual signal units in each buffer memory, and calculate this reference point to obtain the initial point of the interpolation coefficient algorithm (inter-coefficient);

步骤S5:上述步骤的初始点为内插系数法的一基准点(alignment point),从最前头的初始点开始逐点计算,以内插系数算法找最佳基准值,以作为下一次运算的基准点(如公式二),而该第一个基准点是以经验法则得出,之后再以该内插系数算法得出下一基准点;Step S5: The initial point of the above steps is an alignment point of the interpolation coefficient method, which is calculated point by point from the initial point at the top, and the best alignment value is found by the interpolation coefficient algorithm as the benchmark for the next operation point (such as formula 2), and the first reference point is obtained by the rule of thumb, and then the next reference point is obtained by the interpolation coefficient algorithm;

每个影音信号单元中的取样影音信号(sample)循序代入公式二,最后加总即可得下一基准点;The sample audio-visual signal (sample) in each audio-visual signal unit is sequentially substituted into formula 2, and the next reference point can be obtained by summing up at last;

步骤S6:由基准点地址开始内插系数法的演算,即配合各索引地址,由缓冲存储器中可得出一新的压缩影音信号单元,并输出所得到的快进播放的输出结果;Step S6: start the calculation of the interpolation coefficient method from the reference point address, that is, coordinate with each index address, a new compressed audio-visual signal unit can be obtained from the buffer memory, and output the obtained output result of fast-forward playback;

步骤S7:判断影音信号压缩是否完成;Step S7: judging whether the audio-visual signal compression is completed;

步骤S8:若没有完成,将第一缓冲存储器的第一索引位移至下一个需要压缩的影音信号地址(以步骤S2所决定的压缩比为主),并重复上述步骤S5至步骤S7,以完成影音信号压缩而达到快进播放的目的;Step S8: If not completed, shift the first index of the first buffer memory to the address of the next audio-visual signal to be compressed (based on the compression ratio determined in step S2), and repeat the above steps S5 to S7 to complete Compress audio and video signals to achieve fast-forward playback;

步骤S9:若影音信号压缩完成,则结束此一影音信号快进方法的步骤。Step S9: If the audio-video signal compression is completed, then end the steps of the method for fast-forwarding the video-audio signal.

上述的步骤中,将多个影音信号单元依所需压缩比或快进播放速度而分组存入缓冲存储器中,若每读两个影音信号单元再由缓冲存储器输出一个新的压缩影音信号单元,即为两倍压缩比的快进播放;若每读四个影音信号单元再由缓冲存储器输出一个新的压缩影音信号单元,即为四倍快进播放。In the above-mentioned steps, a plurality of audio-visual signal units are grouped and stored in the buffer memory according to the required compression ratio or fast-forward playback speed. If two audio-visual signal units are read, a new compressed audio-visual signal unit is output from the buffer memory. That is, fast-forward playback with twice the compression ratio; if every four audio-visual signal units are read and a new compressed audio-visual signal unit is output from the buffer memory, it is four-fold fast-forward playback.

综上所述,本发明为解决公知的影音信号快进时会产生变频或爆音的缺点,而提出一由时间缩放技术(time scaling)所发展而来的影音信号快进播放的时间缩放算法(time scaling algorithm),其中的音频范围限制与波形斜率计算的内插系数算法(inter-coefficient)达到影音信号压缩的目的,并改善了快进播放的音质,且能降低在压缩计算时的功耗,更能减低内存的使用率,确实有着一举多得的功效。To sum up, the present invention proposes a time scaling algorithm for fast-forward playback of audio-visual signals developed from time scaling technology (time scaling) in order to solve the known shortcomings of frequency conversion or popping sound when audio-visual signals are fast-forwarded. time scaling algorithm), in which the interpolation coefficient algorithm (inter-coefficient) for audio range limitation and waveform slope calculation achieves the purpose of audio and video signal compression, improves the sound quality of fast-forward playback, and can reduce the power consumption during compression calculation , It can reduce the usage rate of memory, and it really has the effect of killing multiple birds with one stone.

以上所述仅为本发明的较佳可行实施例,并非以此限制本发明的专利范围,凡运用本发明说明书及附图内容所做的等效结构变化,均包含于本发明的专利保护范围内。The above descriptions are only preferred feasible embodiments of the present invention, and are not intended to limit the patent scope of the present invention. All equivalent structural changes made by using the description of the present invention and the contents of the accompanying drawings are included in the scope of patent protection of the present invention. Inside.

Claims (10)

1. video-audio signal fast-forward play method, this method is developed and a next interpolation coefficient algorithm that is used for the video-audio signal fast-forward play by a time zoom technology, in order to compress a video-audio signal data flow, wherein, the step of this video-audio signal fast-forward play method comprises:
Step 1 deposits a plurality of video-audio signal unit packet at least one buffer storage in;
Step 2 is set a plurality of index points in this buffer storage;
Step 3 is set a reference point, and this reference point is a datum mark of this interpolation coefficient algorithm;
Step 4 begins the calculation of this interpolation coefficient algorithm by this datum mark address, draws a new compression video-audio signal unit;
Step 5 is moved to the next video-audio signal address that needs compression with one of them index point of this buffer storage;
Repeat this step 4 and compress to finish video-audio signal to this step 5, thus fast-forward play.
2. video-audio signal fast-forward play method as claimed in claim 1 is characterized in that, at the beginning of the step of this video-audio signal fast-forward play method begins, this video-audio signal data flow is divided into this a plurality of video-audio signals unit.
3. video-audio signal fast-forward play method as claimed in claim 1 is characterized in that, includes a plurality of sampling video-audio signals in this video-audio signal unit.
4. video-audio signal fast-forward play method as claimed in claim 1, it is characterized in that, these a plurality of video-audio signal unit packet are being deposited in the step of at least one buffer storage, are to divide into groups to deposit in this at least one buffer storage according to required compression ratio or fast-forward play speed.
5. video-audio signal fast-forward play method as claimed in claim 1, it is characterized in that, in described step 3 and step 4, calculate the step of this datum mark, be that sampling video-audio signal by the video-audio signal unit calculates a reference point, as initial point, begin pointwise from foremost initial point then and calculate with this reference point, find out the optimal criteria value, as the datum mark of computing next time.
6. a video-audio signal fast-forward play method is developed and a next interpolation coefficient algorithm that is used for the video-audio signal fast-forward play by a time zoom technology, and in order to compress a video-audio signal data flow, wherein, the step of this video-audio signal fast-forward play method comprises:
Step 1 is divided into a plurality of video-audio signals unit with this video-audio signal data flow;
Step 2 should a plurality of video-audio signal unit packet deposit one first buffer storage and one second buffer storage in;
Step 3 is set a plurality of index points in this first buffer storage and this second buffer storage;
Step 4 is set a reference point, and this reference point is a datum mark of this interpolation coefficient algorithm;
Step 5 begins the calculation of this interpolation coefficient algorithm by this datum mark address, draws a new compression video-audio signal unit; And
Step 6 moves to the next video-audio signal address that needs compression with one of them index bit of this buffer storage;
Repeat this step 5 and compress to finish video-audio signal to this step 6, thus fast-forward play.
7. video-audio signal fast-forward play method as claimed in claim 6 is characterized in that, includes a plurality of sampling video-audio signals in this video-audio signal unit.
8. video-audio signal fast-forward play method as claimed in claim 6, it is characterized in that, these a plurality of video-audio signal unit packet are being deposited in the step of this first buffer storage and this second buffer storage, are to divide into groups to deposit in this buffer storage according to required compression ratio or fast-forward play speed.
9. video-audio signal fast-forward play method as claimed in claim 6 is characterized in that, the formula that this datum mark begins pointwise calculating is:
temp[i]+=Buffer1[index1+i]×Buffer2[index2+j]
Buffer1[wherein] be the address function of this first buffer storage, Buffer2[] be the address function of this second buffer storage, wherein variable i ndex1+i represents the sampling video-audio signal address in the video-audio signal unit of this first buffer storage, and variable i ndex2+j represents the sampling video-audio signal address in the video-audio signal unit of this second buffer storage.
10. video-audio signal fast-forward play method as claimed in claim 6 is characterized in that the computing formula of this interpolation coefficient algorithm is:
Buffer1[alignment+i]=(Buffer2[i]×i
+Buffer1[alignment+i]×unit-Buffer1[alignment+i]×i)/unit
Buffer1[wherein] be the address function of this first buffer storage, Buffer2[] be the address function of this second buffer storage, wherein variable alignment+i represents the datum mark address in the video-audio signal unit of this first buffer storage, and variable i represents to be arranged in the starting point address of the video-audio signal unit of this second buffer storage, and unit is the number of above-mentioned a plurality of video-audio signals unit.
CNB2004100476527A 2004-05-27 2004-05-27 Fast forward playing method for video and audio signal Expired - Fee Related CN100382594C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100476527A CN100382594C (en) 2004-05-27 2004-05-27 Fast forward playing method for video and audio signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100476527A CN100382594C (en) 2004-05-27 2004-05-27 Fast forward playing method for video and audio signal

Publications (2)

Publication Number Publication Date
CN1705365A CN1705365A (en) 2005-12-07
CN100382594C true CN100382594C (en) 2008-04-16

Family

ID=35577871

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100476527A Expired - Fee Related CN100382594C (en) 2004-05-27 2004-05-27 Fast forward playing method for video and audio signal

Country Status (1)

Country Link
CN (1) CN100382594C (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101022028B (en) * 2007-01-12 2010-08-18 炬力集成电路设计有限公司 Quick forward or quick backward method for digital copyright management multimedia player
CN100551043C (en) 2007-02-08 2009-10-14 华为技术有限公司 A kind of method of fast forwarding and fast rewinding playing video data and streaming media server
US20110040981A1 (en) * 2009-08-14 2011-02-17 Apple Inc. Synchronization of Buffered Audio Data With Live Broadcast
JP5056884B2 (en) * 2010-03-26 2012-10-24 カシオ計算機株式会社 Communication device and communication control program
CN102136289B (en) * 2010-12-30 2013-03-13 华为技术有限公司 Methods, devices and systems for storing and reading media contents
CN103475927B (en) * 2012-06-08 2015-04-08 腾讯科技(深圳)有限公司 Online-video real-time speed change play method and system thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0812112A2 (en) * 1996-06-05 1997-12-10 Sun Microsystems, Inc. System and method for indexing between trick play and normal play video streams in a video delivery system
CN1295758A (en) * 1998-04-03 2001-05-16 美商·提维股份有限公司 Method and apparatus implementing random access and time-based functions on continuos stream of formatted digital data
WO2004015688A1 (en) * 2002-08-08 2004-02-19 Cosmotan Inc. Audio signal time-scale modification method using variable length synthesis and reduced cross-correlation computations

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0812112A2 (en) * 1996-06-05 1997-12-10 Sun Microsystems, Inc. System and method for indexing between trick play and normal play video streams in a video delivery system
CN1295758A (en) * 1998-04-03 2001-05-16 美商·提维股份有限公司 Method and apparatus implementing random access and time-based functions on continuos stream of formatted digital data
WO2004015688A1 (en) * 2002-08-08 2004-02-19 Cosmotan Inc. Audio signal time-scale modification method using variable length synthesis and reduced cross-correlation computations

Also Published As

Publication number Publication date
CN1705365A (en) 2005-12-07

Similar Documents

Publication Publication Date Title
KR100385331B1 (en) Method of reproducing audio signal without causing tone variation upon fast or slow reproducing mode and reproducing apparatus for the same
JP4334355B2 (en) Trick mode audio playback
CN100382594C (en) Fast forward playing method for video and audio signal
JP4416244B2 (en) Pitch converter
US7507894B2 (en) Sound data encoding apparatus and sound data decoding apparatus
US8314321B2 (en) Apparatus and method for transforming an input sound signal
US20060107821A1 (en) Sound data encoding apparatus and sound data decoding apparatus
JP4542805B2 (en) Variable speed reproduction method and apparatus, and program
US20070055397A1 (en) Constant pitch variable speed audio decoding
TW200537346A (en) Swift-play method for playing audio signals
US20050195720A1 (en) Data processing apparatus, data processing method, reproducing apparatus, and reproducing method
JP3974408B2 (en) SAMPLING SIGNAL GENERATION DEVICE, SAMPLING SIGNAL REPRODUCTION DEVICE, AND METHOD THEREOF
JPH0573089A (en) Speech reproducing method
KR100802608B1 (en) How to Navigate the Recording Screen on Your Personal Video Recorder
JPH10282991A (en) Speech rate converting device
JP7343320B2 (en) Information processing device, information processing method, and program
US5841945A (en) Voice signal compacting and expanding device with frequency division
JP3875201B2 (en) Data playback method
JPH0883096A (en) Speech time base converter
JP2005242126A (en) Acoustic signal playback device
WO2005093750A1 (en) Digital dubbing device
CN1532833A (en) Synchronous recording management and playing method between audio frequency data and additional data
JP4862761B2 (en) Content reproduction apparatus and program thereof
JP2007179604A (en) Music player, reproduction control method, and program
CN100559855C (en) recording/reproducing device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080416

Termination date: 20120527