WO2019001072A1 - 一种判断视频帧的帧类型的方法及电子设备 - Google Patents
一种判断视频帧的帧类型的方法及电子设备 Download PDFInfo
- Publication number
- WO2019001072A1 WO2019001072A1 PCT/CN2018/082159 CN2018082159W WO2019001072A1 WO 2019001072 A1 WO2019001072 A1 WO 2019001072A1 CN 2018082159 W CN2018082159 W CN 2018082159W WO 2019001072 A1 WO2019001072 A1 WO 2019001072A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frame
- video
- key value
- value
- current video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/142—Detection of scene cut or scene change
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/177—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
Definitions
- the present application relates to the field of video processing technologies, and in particular, to a method and an electronic device for determining a frame type of a video frame.
- Video codec is mainly realized by prediction, conversion, and coding.
- the prediction is further divided into inter prediction and intra prediction.
- Intra prediction is used to compress spatial correlation
- inter prediction is used to compress temporal correlation.
- Intra prediction refers to the definition of some prediction algorithms in the program. For example, 4*4 macroblocks have 9 kinds of prediction algorithms (vertical, horizontal, diagonal, average........), and 16*16 macroblocks have 4 kinds. Predictive algorithm, 8*8 macroblocks have 4 kinds of prediction algorithms.
- Inter-frame prediction refers to calculating a motion vector of the current block from the reference image by motion estimation algorithm (global acquisition, diamond collection, hexagonal acquisition, etc.) from the image before and after the current slice. .
- motion estimation algorithm global acquisition, diamond collection, hexagonal acquisition, etc.
- the pixel point data of one macroblock can be converted into a macroblock prediction mode, such as macroblock type, prediction mode, motion vector, etc., and the size of the space occupied by the pixel data of the macroblock (data Size)
- a macroblock prediction mode such as macroblock type, prediction mode, motion vector, etc.
- data Size the size of the space occupied by the pixel data of the macroblock
- video frames are divided into three types: I, P, and B
- using B frames can save a lot of space, and at the same bit rate
- I frames can provide better picture quality, and how to distinguish video frames and control video.
- the type of frame to reduce the bit error rate of inter video coding is a problem to be solved by the present invention.
- the present invention provides a method for determining a frame type of a video frame, solving or at least partially solving the technical problem of how to identify a video frame and thereby control the type of the video frame to reduce the bit error rate of the inter-frame video coding.
- the present invention provides a method for determining a frame type of a video frame, the method comprising:
- the inter-frame coding cost Pcost of the current video frame is calculated by using a motion estimation method
- the dynamic offset value bias is obtained based on the maximum I frame spacing keyint_max, the minimum I frame spacing keyint_min, the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min, the last I frame is from the current video frame number GOP;
- a category of the current video frame based on an interframe coding cost value Pcost of the current video frame, an intraframe coding cost Icost of the current video frame, the dynamic offset value bias; a category of the current video frame Including an I frame and a P frame, wherein if the Pcost ⁇ (1.0-bias) * Icost, the current video frame is an I frame, and if Pcost ⁇ (1.0-bias) * Icost, the current video frame is a P frame.
- the previous video frame is used as a reference frame
- the inter-frame coding value Pcost of the current video frame is calculated by using a motion estimation method, including:
- the reducing the resolution of the current video frame comprises:
- the previous video frame is used as a reference frame
- the inter-frame coding cost Pcost of the current video frame is calculated by using a motion estimation method, including:
- the maximum I frame spacing keyint_max, the minimum I frame spacing keyint_min, the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min, the last I frame is away from the current video frame number GOP, and obtains a dynamic offset value.
- bias including:
- the maximum I frame spacing keyint_max, the minimum I frame spacing keyint_min, the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min, the last I frame is away from the current video frame number GOP, and obtains a dynamic offset value.
- bias including:
- Bias thresh_min*gop_size/keyint_min; where gop_size is the GOP of the current video frame.
- the method further includes:
- the method further includes:
- Bias thresh_min+(thresh_max-thresh_min)*(gop_size-keyint_min)/(keyint_max-keyint_min).
- the maximum scene switching threshold thresh_max and the minimum scene switching threshold thresh_min are equal.
- the present invention discloses an electronic device comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, the processor implementing the program to implement the method in the above technical solution.
- the present invention has the following beneficial effects or advantages:
- the invention discloses a method for determining a frame type of a video frame and an electronic device, which obtains an interframe coding cost value Pcost and an intraframe coding cost value Icost of the current video frame; and the dynamic offset value bias is based on the maximum I
- the frame spacing keyint_max, the minimum I frame spacing keyint_min, the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min are obtained by the last I frame from the number of frames of the current video frame GOP.
- the category of the current video frame is then determined based on the above three parameters.
- the present invention provides a method for affecting the type of a video frame by using a preset maximum I frame interval, a minimum I frame interval, a scene switching threshold, etc., thereby controlling the frequency of occurrence of the current i frame, and
- the i-frames are evenly distributed to the video queue. In the case of error, the i-frame does not affect the error to other frames, thus achieving the robustness of inter-frame video coding.
- FIG. 1 is a flowchart of a method for determining a frame type of a video frame according to an embodiment of the present invention
- 2A-2D are schematic diagrams showing compression of a video frame according to an embodiment of the present invention.
- 3A-3B are schematic diagrams showing the addition of pixel points in an embodiment of the present invention.
- FIG. 4 is a schematic diagram of an electronic device according to an embodiment of the present invention.
- the present invention discloses a method of determining the frame type of a video frame.
- I frame also known as internal picture
- I frame is usually the first frame of each GOP (a video compression technology, indicating the number of frames of the previous I frame from the current video frame), after moderate compression, do A reference point for random access can be used as an image.
- the P frame is predicted by the P frame or I frame in front of it, and it compares with the P frame or I frame in front of it.
- Interframe compression between the same information or data that is, considering the characteristics of motion.
- the P frame method compresses the frame data according to the difference between the current frame and the adjacent previous frame (I frame or P frame). The combination of P-frame and I-frame compression can achieve higher compression without obvious compression traces.
- the object of the present invention is to first calculate the interframe coding cost value Pcost and the intraframe coding cost value Icost of the current video frame. Then, the frequency of occurrence of the current i frame is controlled by a preset maximum I frame pitch keyint_max and a preset minimum I frame pitch keyint_min, and a scene switching threshold, that is, the smaller the threshold or the smaller the interframe spacing, the more the frequency of the i frame appears. small. Moreover, the i-frames can be evenly distributed into the video queue, and the robustness of the inter-frame video coding can be achieved by the fact that the i-frame does not affect the error to other frames in the event of an error.
- a flowchart of a method for determining a frame type of a video frame including:
- step 11 the previous video frame is used as a reference frame, and the inter-frame coding cost Pcost of the current video frame is calculated by using a motion estimation method.
- the present invention also performs data processing on the current video frame.
- the data processing process first expanding the length and width of the current video frame to an integer multiple of 16; then reducing the resolution of the current video frame; and further increasing the blank side of the edge pixel based on the current video frame. Corresponding pixel points, and then filling the edge pixel value of the current video frame into the corresponding pixel point.
- the length and width of the current video frame are expanded to an integral multiple of 16, by dividing the current video frame by an integer multiple of 16 in units of rows, and performing the current video frame as an integer multiple of 16 in units of columns.
- Each macroblock is a 16x16 pixel sub-block, so it will be expanded to an integer multiple of 16.
- the resolution of the current video frame When the resolution of the current video frame is lowered, it can be reduced based on the length and width of the current video frame (ie, line-by-line processing, column-by-column processing).
- the current video frame of 16*16 is taken as an example.
- the average value (average value of 56) is obtained by using the pixel values (52, 60) of the first and second pixels, and then The average value is filled in the first and second pixels, so that the pixel values of the first and second pixel points in the first row are the same, both are 56, and can be regarded as one pixel.
- FIG. 2B the results obtained after the above steps are performed row by row, it can be found that the resolution in FIG. 2B is reduced by 1/2 of the resolution in FIG. 2A after the above steps are performed in units of rows. .
- the purpose of this is to reduce the resolution of the current video frame, thereby reducing the amount of data of the current video frame and improving processing efficiency.
- the corresponding pixel points are added to the blank side of the top, bottom, left, and right pixel points of the edge of the current video frame.
- the thick black box indicates the pixels above and below, and thus visible. Each pixel in the thick black box has three pixels on the other side, and only one side is blank. In the present invention, the corresponding pixel is added on the blank side.
- the dotted line indicates the increase. pixel. Filling the edge pixel value of the current video frame into the corresponding pixel. That is, it can be seen that the pixel values in the thick black box and the corresponding pixel values in the dashed box are the same.
- inter-prediction and intra-prediction can be used as a reference to predict the pixel block.
- intra-prediction requires the previous block of the current block and the left column of pixels to be predicted according to the intra prediction mode.
- the added pixel block can be used as the region of the pixel block motion estimation offset, so the current video frame needs to be extended.
- the present invention increases the number of rows or columns of pixels in the upper, lower, left and right sides of the current video frame by 4 rows or 4 columns, which can increase the inter-frame prediction region and improve the accuracy.
- the general purpose is the diamond motion estimation. Algorithm, the algorithm first calculates the current macroblock by one pixel to the left to calculate the cost value, and then calculates the up, down, and right respectively........ until the optimal motion vector is obtained, it is obvious to sample 4 rows. The other four motion estimation regions are larger, so the accuracy is better, and the motion vector value calculation accuracy can be increased when performing interframe motion estimation.
- the previous video frame is first divided into N first macroblocks; wherein the macroblock is a 16*16 pixel sub-block, N Is a positive integer; then the current video frame is also divided into corresponding N second macroblocks, wherein the second macroblock is a 16*16 pixel sub-block. And calculating, according to the N first macroblocks, an interframe coding value of the N second macroblocks by using a motion estimation method, respectively, and accumulating interframe coding values of the N second macroblocks; Obtaining an interframe coding cost value Pcost of the current video frame.
- Step 12 Obtain an intra-frame coding cost Icost of the current video frame by using an intra prediction method.
- the intra prediction algorithm can be selected according to the actual situation.
- Step 13 based on the maximum I frame spacing keyint_max, the minimum I frame spacing keyint_min, the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min, the last I frame is away from the current video frame number GOP, and obtains the dynamic offset value bias.
- the specific value of the maximum I frame spacing keyint_max and the minimum I frame spacing keyint_min is not limited in the present invention, and may be selected according to actual conditions.
- the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min are determined according to the corresponding formula.
- the maximum scene switching threshold thresh_max and the minimum scene switching threshold thresh_min are equal.
- the maximum scene switching threshold thresh_max and the minimum scene switching threshold thresh_min are determined. Then further determine the dynamic offset value bias, the specific approach is:
- the maximum I frame pitch keyint_max, the specific value of the minimum I frame pitch keyint_min can be preset according to the actual situation.
- the dynamic offset value bias can be determined by:
- the GOP may further determine whether the last I frame is smaller than the minimum I frame from the current video frame.
- Step 14 Determine a category of the current video frame based on an inter-coding value Pcost of the current video frame, an intra-coding value Icost of the current video frame, and the dynamic offset value bias.
- the category of the current video frame includes an I frame and a P frame, where if the Pcost ⁇ (1.0-bias) * Icost, the current video frame is an I frame, if Pcost ⁇ (1.0-bias) *Icost, then the current video frame is a P frame.
- an embodiment of the present invention further discloses an electronic device 300 including a memory 310, a processor 320, and a computer program 311 stored on the memory and operable on the processor, wherein the processing
- the method in the above embodiment is implemented when the program is executed. Since the present invention has been described in detail above in the implementation of the method, it will not be described herein.
- the present invention has the following advantages or advantages:
- the invention discloses a method for determining a frame type of a video frame and an electronic device, which obtains an interframe coding cost value Pcost and an intraframe coding cost value Icost of the current video frame; and the dynamic offset value bias is based on the maximum I
- the frame spacing keyint_max, the minimum I frame spacing keyint_min, the maximum scene switching threshold thresh_max, and the minimum scene switching threshold thresh_min are obtained by the last I frame from the number of frames of the current video frame GOP.
- the category of the current video frame is then determined based on the above three parameters.
- the present invention can control the frequency of occurrence of the current i frame by using a preset maximum I frame pitch, minimum I frame spacing, scene switching threshold, etc., and can evenly distribute the i frame into the video queue, in the case of a bit error.
- the robustness of inter-frame video coding is achieved because there are i frames that do not affect the error to other frames.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
本发明公开了的一种判断视频帧的帧类型的方法及电子设备,通过计算得到当前视频帧的帧间编码代价值和帧内编码代价值;并且动态偏移值bias是基于最大I帧间距,最小I帧间距,最大场景切换阈值,最小场景切换阈值,上一个I帧距离当前视频帧的帧数GOP来获得。然后会根据上述三个参数确定当前视频帧的类别。本发明可以通过预先设置的最大I帧间距、最小I帧间距、场景切换阈值等来控制当前i帧出现的频率,而且可以将i帧均匀的分布到视频队列中,既在出现误码的情况下由于有i帧不会将误码影响到其他帧,从而实现帧间视频编码的鲁棒性。
Description
本申请涉及视频处理技术领域,尤其涉及一种判断视频帧的帧类型的方法及电子设备。
视频的编解码主要靠,预测,转换,编码,来实现的。预测又分为帧间预测和帧内预测,帧内预测用来压缩空间相关性,帧间预测用来压缩时间相关性。
帧内预测是指在程序中定义好一些预测算法,如4*4宏块有9种预测算法(垂直,水平,对角线,平均值……..),16*16宏块有4种预测算法,8*8宏块有4种预测算法。
帧间预测是指从当前片的前面和后面已编码的图像中通过运动估计算法(全局收索,钻石收索,六边形收索等等)计算出当前块相对于参考图像的一个运动矢量。
在经过预测处理操作后,可以将一个宏块的像素点数据,转换为宏块的预测方式既宏块类型,预测模式,运动矢等信息,与宏块的像素点数据占用的空间大小(数据大小)相比宏块类型,预测模式,运动矢量所占的空间会非常小,视频帧是以宏块为单位,从而达到将视频数据进行压缩的目的。
但是,由于视频帧分为I、P、B三个类型,使用B帧能节省大量空间,而在相同码率下,I帧多可以提供更好的画质,而如何辨别视频帧进而控制视频帧的类型来减少帧间视频编码的误码率,则是本发明需要解决的问题。
发明内容
本发明了提供了一种判断视频帧的帧类型的方法,解决或者至少部分解决如何辨别视频帧进而控制视频帧的类型来减少帧间视频编码的误码率的技术问题。
为解决上述技术问题,本发明提供了一种判断视频帧的帧类型的方法的方法,所述方法包括:
将前一视频帧作为参考帧,利用运动估计方法计算得到当前视频帧的帧间编码代价值Pcost;
利用帧内预测方法得到所述当前视频帧的帧内编码代价值Icost;
基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,所述上一个I帧距离当前视频帧的 帧数GOP,获得动态偏移值bias;
基于所述当前视频帧的帧间编码代价值Pcost、所述当前视频帧的帧内编码代价值Icost、所述动态偏移值bias确定所述当前视频帧的类别;所述当前视频帧的类别包括I帧和P帧,其中,若Pcost≥(1.0-bias)*Icost,则所述当前视频帧为I帧,若Pcost<(1.0-bias)*Icost,则所述当前视频帧为P帧。
优选的,所述将前一视频帧作为参考帧,利用运动估计方法计算得到当前视频帧的帧间编码代价值Pcost之前,包括:
将所述当前视频帧的长宽扩大到16的整数倍;
降低所述当前视频帧的分辨率;
基于所述当前视频帧的边缘像素点的空白一侧增加对应的像素点,然后将所述当前视频帧的边缘像素点值填入到所述对应的像素点中。
优选的,所述降低所述当前视频帧的分辨率,包括:
对所述当前视频帧的每一行像素点执行下述步骤:依次利用所述当前视频帧的每一行像素点中的每两个像素点的原像素值求均值,然后利用所述均值替换掉对应两个像素点的原像素值,使所述对应两个像素点变成一个像素点;
对所述当前视频帧的每一列像素点执行下述步骤:依次利用所述当前视频帧的每一列像素点中的每两个像素点的原像素值求均值,然后利用所述均值替换掉对应两个像素点的原像素值,使所述对应两个像素点变成一个像素点。
优选的,所述将前一视频帧作为参考帧,利用运动估计方法计算得到当前视频帧的帧间编码代价值Pcost,包括:
将前一视频帧划分为N个第一宏块;其中,所述第一宏块为16*16的像素子块,N为正整数;
将所述当前视频帧划分为N个第二宏块;其中,所述第二宏块为16*16的像素子块;
基于所述N个第一宏块,分别利用运动估计方法对应计算所述N个第二宏块的帧间编码代价值;
对所述N个第二宏块的帧间编码代价值进行累加,获得所述当前视频帧的帧间编码代价值Pcost。
优选的,所述基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,所述上一个I帧距离当前视频帧的帧数GOP,获得动态偏移值bias之前,包括:
根据公式thresh_max=float(scenecut_threshold*100%),获得所述最大场景切换阈值thresh_max;其中,scenecut_threshold为预设的场景切换阈值,float表示归一化处理;
根据公式thresh_min=float(thresh_max*0.25),获得所述最小场景切换阈值thresh_min。
优选的,所述基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,所述上一个I帧距离当前视频帧的帧数GOP,获得动态偏移值bias之前,包括:
判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min;
若是,则根据下述公式获得所述动态偏移值bias:
bias=thresh_min*gop_size/keyint_min;其中,gop_size为所述当前视频帧的GOP。
优选的,所述判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min之后,所述方法还包括:
判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min的四分之一;
若是,则根据下述公式获得所述动态偏移值bias:
bias=thresh_min/4。
优选的,所述判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min之后,所述方法还包括:
若否,则根据下述公式获得所述动态偏移值bias:
bias=thresh_min+(thresh_max-thresh_min)*(gop_size-keyint_min)/(keyint_max-keyint_min)。
优选的,若所述最大I帧间距keyint_max和最小I帧间距keyint_min相等,则所述最大场景切换阈值thresh_max和所述最小场景切换阈值thresh_min相等。
本发明公开了一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现上述技术方案中的方法。
通过本发明的一个或者多个技术方案,本发明具有以下有益效果或者优点:
本发明公开了的一种判断视频帧的帧类型的方法及电子设备,通过计算得到当前视频帧的帧间编码代价值Pcost和帧内编码代价值Icost;并且动态偏移值bias是基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,上一个I帧距离当前视频帧的帧数GOP来获得。然后会根据上述三个参数确定当前视频帧的类别。由此可见,本发明提供了一种可以通过预先设置的最大I帧间距、最小I帧间距、场景切换 阈值等来影响视频帧的类型的方法,进而控制当前i帧出现的频率,而且可以将i帧均匀的分布到视频队列中,既在出现误码的情况下由于有i帧不会将误码影响到其他帧,从而实现帧间视频编码的鲁棒性。
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:
图1为本发明实施例中一种判断视频帧的帧类型的方法的流程图;
图2A-图2D为本发明实施例中视频帧进行压缩的示意图;
图3A-图3B为本发明实施例中增加像素点的示意图;
图4为本发明实施例中一种电子设备的示意图。
为了使本申请所属技术领域中的技术人员更清楚地理解本申请,下面结合附图,通过具体实施例对本申请技术方案作详细描述。
本发明公开了一种判断视频帧的帧类型的方法。
其中,I帧,又称为内部画面,I帧通常是每个GOP(一种视频压缩技术,表示上一个I帧距离当前视频帧的帧数)的第一个帧,经过适度地压缩,做为随机访问的参考点,可以当成图像。
在针对连续动态图像编码时,将连续若干幅图像分成P、B、I三种类型,P帧由在它前面的P帧或者I帧预测而来,它比较与它前面的P帧或者I帧之间的相同信息或数据,也即考虑运动的特性进行帧间压缩。P帧法是根据本帧与相邻的前一帧(I帧或P帧)的不同点来压缩本帧数据。采取P帧和I帧联合压缩的方法可达到更高的压缩且无明显的压缩痕迹。
本发明的目的是,先计算出当前视频帧的帧间编码代价值Pcost和帧内编码代价值Icost。然后通过预先设置的最大I帧间距keyint_max和预先设置的最小I帧间距keyint_min,和场景切换阈值来控制当前i帧出现的频率,即:阈值越小或帧间间距越小i帧出现的频率越小。而且可以将i帧均匀的分布到视频队列中,既在出现误码的情况下由于有i帧不会将误码影响到其他帧,从而实现帧间视频编码的鲁棒性。
请参看图1,公开了一种判断视频帧的帧类型的方法的流程图,包括:
步骤11,将前一视频帧作为参考帧,利用运动估计方法计算得到当前视频 帧的帧间编码代价值Pcost。
在此之前,本发明还会对所述当前视频帧进行数据处理。在数据处理过程中,首先将所述当前视频帧的长宽扩大到16的整数倍;然后降低所述当前视频帧的分辨率;再基于所述当前视频帧的边缘像素点的空白一侧增加对应的像素点,然后将所述当前视频帧的边缘像素点值填入到所述对应的像素点中。
具体来说,将所述当前视频帧的长宽扩大到16的整数倍,是将当前视频帧以行为单位进行16的整数倍,且将当前视频帧以列为单位进行16的整数倍。以便于后续在将当前视频帧拆分为宏块。每个宏块就是一个16*16的像素子块,故而此处会将其扩展为16的整数倍。
而降低当前视频帧的分辨率时,可以分别以当前视频帧的长宽(即逐行处理,逐列处理)为基础进行降低。
对所述当前视频帧的每一行像素点执行下述步骤:依次利用所述当前视频帧的每一行像素点中的每两个像素点的原像素值求均值,然后利用所述均值替换掉对应两个像素点的原像素值,使所述对应两个像素点变成一个像素点。参看图2A,是16*16的当前视频帧为例,在第1行中,利用第1、第2两个像素点的像素值(52、60)求均值(均值为56),然后将该均值填入第1、第2两个像素点中,使第1行的第1、第2两个像素点的像素值相同,都为56,进而可看做一个像素点。参看图2B,是逐行执行了上述步骤之后得到的结果,可以发现图2B中的分辨率在以行为单位执行了上述步骤之后,得到的分辨率会比图2A中的分辨率降低1/2。
对所述当前视频帧的每一列像素点执行下述步骤:依次利用所述当前视频帧的每一列像素点中的每两个像素点的原像素值求均值,然后利用所述均值替换掉对应两个像素点的原像素值,使所述对应两个像素点变成一个像素点。承接上述举例,会在图2B的基础上,以列为单位进行缩小,参看图2C,在第1列中,第1、第2两个像素点的像素值(56、80)求均值(均值为68),然后将该均值填入第1列的第1、第2两个像素点中,使第1、第2两个像素点的像素值相同,都为68,进而变为一个像素点。参看图2D,是逐行执行了上述步骤之后得到的结果。
当然,本发明的上述两个步骤顺序可以调换,即在逐列执行完毕之后,再逐行执行。
这样的目的,是为了降低当前视频帧的分辨率,进而降低当前视频帧的数据量,提高处理效率。
在最后的步骤中,会在当前视频帧的最边缘的上下左右的像素点的空白一侧都增加对应的像素点,参看图3A,粗黑框表示的是上下左右的像素点,由此 可见,粗黑框中的每个像素点,三侧都是挨着其他的像素点的,只有一侧是空白的,本发明则在空白侧增加对应的像素点,参看图3B,虚线表示增加的像素点。将所述当前视频帧的边缘像素点值填入到所述对应的像素点中。即,可以看出,粗黑框中的像素值和虚线框中的对应的像素值是相同的。
这样做的目的是为了在进行帧间预测和帧内预测时增加的像素点可以作为参考对像素块进行预测,如帧内预测需要当前块上面一行像素和左边一列像素按照帧内预测模式进行预测,而帧间预测,增加的像素块可以作为像素块运动估计偏移的区域,故需要扩展当前视频帧。
本发明将当前视频帧上下左右边扩展的像素行或列数为4行或4列,可以增加帧间预测的区域并提高精度,例如在进行帧间预测时,比较通用的的是菱形运动估计算法,算法首先是计算当前宏块向左偏移一个像素计算代价值,然后分别计算向上,向下,向右........直到获得最优的运动矢量,很明显采样4行4列要其他的运动估计区域要大些,故精度要好些,进而在进行帧间运动估计时,能够增加运动矢量值计算准确率。
而利用运动估计方法计算得到当前视频帧的帧间编码代价值Pcost时,首先将前一视频帧划分为N个第一宏块;其中,所述宏块为16*16的像素子块,N为正整数;然后将所述当前视频帧也换分为对应的N个第二宏块,其中,所述第二宏块为16*16的像素子块。然后基于所述N个第一宏块,分别利用运动估计方法对应计算所述N个第二宏块的帧间编码代价值;对所述N个第二宏块的帧间编码代价值进行累加,获得所述当前视频帧的帧间编码代价值Pcost。
可以根据实际情况,选择合适的运动估计方法进行计算,再此本发明不再赘述。
步骤12,利用帧内预测方法得到所述当前视频帧的帧内编码代价值Icost。
帧内预测算法可根据实际情况选定。
步骤13,基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,所述上一个I帧距离当前视频帧的帧数GOP,获得动态偏移值bias。
在具体的实施过程中,最大I帧间距keyint_max,最小I帧间距keyint_min的具体数值本发明不做限定,可根据实际情况选定。而最大场景切换阈值thresh_max,最小场景切换阈值thresh_min是根据对应的公式进行确定。
若所述最大I帧间距keyint_max和最小I帧间距keyint_min相等,则所述最大场景切换阈值thresh_max和所述最小场景切换阈值thresh_min相等。
下面进行具体的介绍。
首先确定最大场景切换阈值thresh_max,最小场景切换阈值thresh_min。 然后进一步确定出动态偏移值bias,具体的做法是:
根据公式thresh_max=float(scenecut_threshold*100%),获得所述最大场景切换阈值thresh_max;其中,scenecut_threshold为预设的场景切换阈值,float表示归一化处理。
根据公式thresh_min=float(thresh_max*0.25),获得所述最小场景切换阈值thresh_min。
最大I帧间距keyint_max,最小I帧间距keyint_min的具体值可根据实际情况预先设置。
在确定了上述四个参数的具体值之后,可确定动态偏移值bias,具体做法是:
判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min;
若是,则根据下述公式获得所述动态偏移值bias:bias=thresh_min*gop_size/keyint_min;其中,gop_size为所述当前视频帧的GOP。
若否,则根据下述公式获得所述动态偏移值bias:bias=thresh_min+(thresh_max-thresh_min)*(gop_size-keyint_min)/(keyint_max-keyint_min)。
进一步的,判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min之后,还可以进一步判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min的四分之一;若是,则根据下述公式获得所述动态偏移值bias:bias=thresh_min/4。
步骤14,基于所述当前视频帧的帧间编码代价值Pcost、所述当前视频帧的帧内编码代价值Icost、所述动态偏移值bias确定所述当前视频帧的类别。
在本发明的实施例中,当前视频帧的类别包括I帧和P帧,其中,若Pcost≥(1.0-bias)*Icost,则所述当前视频帧为I帧,若Pcost<(1.0-bias)*Icost,则所述当前视频帧为P帧。
基于同一发明构思,本发明的实施例还公开了一种电子设备300,包括存储器310、处理器320及存储在存储器上并可在处理器上运行的计算机程序311,其特征在于,所述处理器执行所述程序时实现上述实施例中的方法。由于本发明已经在上述详细介绍了方法的实施过程,故而在此不再赘述。
通过本发明的一个或者多个实施例,本发明具有以下有益效果或者优点:
本发明公开了的一种判断视频帧的帧类型的方法及电子设备,通过计算得到当前视频帧的帧间编码代价值Pcost和帧内编码代价值Icost;并且动态偏移值bias是基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切 换阈值thresh_max,最小场景切换阈值thresh_min,上一个I帧距离当前视频帧的帧数GOP来获得。然后会根据上述三个参数确定当前视频帧的类别。本发明可以通过预先设置的最大I帧间距、最小I帧间距、场景切换阈值等来控制当前i帧出现的频率,而且可以将i帧均匀的分布到视频队列中,既在出现误码的情况下由于有i帧不会将误码影响到其他帧,从而实现帧间视频编码的鲁棒性。
尽管已描述了本申请的优选实施例,但本领域内的普通技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本申请范围的所有变更和修改。
显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的精神和范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。
Claims (20)
- [根据细则20.5改正24.05.2018]
一种判断视频帧的帧类型的方法,其特征在于,所述方法包括:将前一视频帧作为参考帧,利用运动估计方法计算得到当前视频帧的帧间编码代价值Pcost;利用帧内预测方法得到所述当前视频帧的帧内编码代价值Icost;基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,所述上一个I帧距离当前视频帧的帧数GOP,获得动态偏移值bias;基于所述当前视频帧的帧间编码代价值Pcost、所述当前视频帧的帧内编码代价值Icost、所述动态偏移值bias确定所述当前视频帧的类别;所述当前视频帧的类别包括I帧和P帧,其中,若Pcost≥(1.0-bias)*Icost,则所述当前视频帧为I帧,若Pcost≤ (1.0-bias)*Icost,则所述当前视频为P帧。 - [根据细则20.5改正24.05.2018]
如权利要求1所述的方法,其特征在于,所述将前一视频帧作为参考帧,利用运动:估计方法计算得到当前视频帧的帧间编码代价值Pcost之前,包括:将所述当前视频帧的长宽扩大到16的整数倍;降低所述当前视频帧的分辨率;基于所述当前视频帧的边缘像素点的空白一侧增加对应的像素点,然后将所述当前视频帧的边缘像素点值填入到所述对应的像素点中。 - [根据细则20.5改正24.05.2018]
如权利要求2所述的方法,其特征在于,所述降低所述当前视频帧的分辨率,包括:^所述当前视频帧的每一行像素点执行下述步骤:依次利用所述当前视频帧的每一行像素点中的每两个像素点的原像素值求均值,然后利用所述均值替换掉对应两个像素点的原像素值,使所述对应两个像素点变成一个像素点;对所述当前枧频帧的每一列像素点执行下述步骤:依次利用所述当前视频帧的每一列像素点中的每两个像素点的原像素值求均值,然后利用所述均值替换掉对应两个像素点的原像素值,使所述对应两个像素点变成一个像素点。 - [根据细则20.5改正24.05.2018]
如权利要求1所述的方法,其特征在于,所述将前一视频帧作为参考帧,利用运动估计方法计算得到当前视频帧的帧间编码代价值Pcost,包括:将前一视频帧划分为N个第一宏块;其中,所述第一宏块为16*16的像素子块,N为正整数:将所述当前视频帧划分为N个第二宏块;其中,所述第二宏块为16*16的像素子块;基于所述N个第一宏块,分别利用运动估计方法对应计算所述N个第二宏块的帧间编码代价值;对所述N个第二宏块的帧间编码代价值进行累加,获得所述当前视频帧的帧间编码代价值Pcost。 - [根据细则20.5改正24.05.2018]
如权利要求1所述的方法,其特征在于,所述基于最大I帧间距keyint_max,最小I帧、间距keyint_min,最大场景切换阈值thresh_max,最小场景切换阈值thresh_min,所述上一个I帧距离当前视频帧的帧数GOP,获得动态偏移值bias之包括:根据公式thresh_max=float (scenecut_threshold*100%),获得所述最大场景切换阈值thresh_max;其中,scenecut_threshold为预设的场景切换阈值,float表示归一化处理; 根据公式thresh_min=float (thresh_max*0.25),获得所述最小场景切换阈值thresh_min。 - [根据细则20.5改正24.05.2018]
权利要求5所述的方法,其特征在于,所述基于最大I帧间距keyint_max,最小I帧间距keyint_min,最大场景切换岡值thresh_max,最小场景"换阈值thresh_min,所述上一个I帧距离当前视频帧的帧数GOP,获得动态偏移值bias之前,包括: 判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min; 若是,则根据下述公式获得所述动态偏移值bias: bias=thresh_min*gop_size/keyint_min;其中,gop_size为所述当前视频帧的GOP。 - [根据细则20.5改正24.05.2018]
如权利要求6所述的方法,其特征在于,所述判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min之后,所述方法还包括: 判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min的四分之一;若是,则根据下述公式获得所述动态偏移值bias: bias=thresh_min/4。 - [根据细则20.5改正24.05.2018]
如权利要求6所述的方法,其特征在于,所述判断所述上一个I帧距离当前视频帧的帧数GOP是否小于最小I帧间距keyint_min之后,所述方法还包括:若否,则根据下述公式获得所述动态偏移值bias: bias=thresh_min+(thresh_max – thresh_min)*(gop_size – keyint_min)/(keyint_max – keyint_min)。 - [根据细则20.5改正24.05.2018]
如权利要求1所述的方法,其特征在于,若所述最大I帧间距keyint_max和最小I帧间距keyint_min相等,则所述最大场景切换阔值thresh_max和所述最小场景切换阈值thresh_min相等。 - [根据细则20.5改正24.05.2018]
一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其特征在于,所述处理器执行所述程序时实现权利要求1-9中任一权项的方法。 - 一种视频播放方法,应用于服务器,其特征在于,所述方法包括:接收用户终端发送的针对目标视频的视频流地址请求;如果所述用户终端对应的观看用户对所述目标视频有观看权限,向所述用户终端返回用于表征所述目标视频的视频类型的第一Key值和用于表征所述观看用户的权限范围的第二Key值;接收到所述用户终端发送的基于所述第一Key值与所述第二Key值生成的针对所述目标视频的终端侧鉴权Key值时,向所述用户终端返回所述目标视频。
- 如权利要求1所述的视频播放方法,其特征在于,如果所述视频流地址请求中携带有所述观看用户的用户识别信息,所述方法还包括:根据所述用户识别信息与所述权限范围的对应关系,判别所述观看用户是否对所述目标视频有观看权限。
- 如权利要求1所述的视频播放方法,其特征在于,如果所述视频流地址请求中携带有请求识别标识组,在所述接收用户终端发送的针对目标视频的视频流地址请求之后,所述方法还包括:基于所述第一Key值、所述第二Key值和所述请求识别标识组生成针对所述目标视频的服务器侧鉴权Key值;将接收到的鉴权Key值分别与针对所述目标视频的服务器侧鉴权Key值进行对比是否一致;如果一致,表征接收到的鉴权Key值为针对所述目标视频的终端侧鉴权Key值。
- 如权利要求3所述的视频播放方法,其特征在于,所述基于所述第一Key值、所述第二Key值和所述请求识别标识组生成针对所述目标视频的服务器侧鉴权Key值,包括:将所述第一Key值与所述第二Key值输入MD5算法,以运算得到中间Key值;将所述中间Key值和所述请求识别标识组输入RSA加密算法,以运算得到针对所述目标视频的服务器侧鉴权Key值。
- 如权利要求3或4所述的视频播放方法,其特征在于,所述请求识别标识组,包括:表征所述用户终端发送所述视频流地址请求所在时刻的时间标识,以及如下至少一种所述用户终端的终端特征:所述观看用户登录所述服务器时的令牌、所述用户终端的唯一ID、所述用户终端当前的IP地址。
- 如权利要求5所述的视频播放方法,其特征在于,如果所述目标视频为目标网络直播视频,则所述请求识别标识组还包括所述目标网络直播视频所在 的房间标识码,所述第二Key值包括所述观看用户的权限范围内的各个网络直播视频所属的直播类型识别码和所在的房间标识码。
- 一种视频请求方法,应用于用户终端,其特征在于,包括:向服务器发送针对目标视频的视频流地址请求;接收所述服务器返回的用于表征所述目标视频的视频类型的第一Key值和用于表征所述观看用户的权限范围的第二Key值;根据所述第一Key值与所述第二Key值生成针对所述目标视频的终端侧鉴权Key值并发送给所述服务器,以使所述服务器接收到针对所述目标视频的终端侧鉴权Key值时向所述用户终端返回所述目标视频;从所述服务器接收所述目标视频。
- 如权利要求7所述的视频请求方法,其特征在于,如果所述视频流地址请求中携带有请求识别标识组,所述根据所述第一Key值与所述第二Key值生成针对所述目标视频的终端侧鉴权Key值并发送给所述服务器,包括:基于所述第一Key值、所述第二Key值和所述请求识别标识组生成针对所述目标视频的终端侧鉴权Key值;将针对所述目标视频的终端侧鉴权Key值发送给所述服务器。
- 一种视频播放装置,应用于服务器,其特征在于,所述视频播放装置包括:地址请求接收模块,用于接收用户终端发送的针对目标视频的视频流地址请求;Key值发送模块,用于如果所述用户终端对应的观看用户对所述目标视频有观看权限,向所述用户终端返回用于表征所述目标视频的视频类型的第一Key值和用于表征所述观看用户的权限范围的第二Key值;视频返回模块,用于接收到所述用户终端发送的基于所述第一Key值与所述第二Key值生成的针对所述目标视频的终端侧鉴权Key值时,向所述用户终端返回所述目标视频。
- 一种视频请求装置,应用于用户终端,其特征在于,所述视频请求装置包括:地址请求发送模块,用于向服务器发送针对目标视频的视频流地址请求;Key值接收模块,用于接收所述服务器返回的用于表征所述目标视频的视频类型的第一Key值和用于表征所述观看用户的权限范围的第二Key值;鉴权Key值获取模块,用于根据所述第一Key值与所述第二Key值生成针对所述目标视频的终端侧鉴权Key值并发送给所述服务器,以使所述服务器接收到针对所述目标视频的终端侧鉴权Key值时向所述用户终端返回所述目标视 频;视频接收模块,用于从所述服务器接收所述目标视频。
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710523398.0A CN107277519B (zh) | 2017-06-30 | 2017-06-30 | 一种判断视频帧的帧类型的方法及电子设备 |
| CN201710523398.0 | 2017-06-30 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2019001072A1 true WO2019001072A1 (zh) | 2019-01-03 |
Family
ID=60070062
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2018/082159 Ceased WO2019001072A1 (zh) | 2017-06-30 | 2018-05-24 | 一种判断视频帧的帧类型的方法及电子设备 |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN107277519B (zh) |
| WO (1) | WO2019001072A1 (zh) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116546203A (zh) * | 2023-04-24 | 2023-08-04 | 北京达佳互联信息技术有限公司 | 视频帧处理方法、装置、电子设备及可读存储介质 |
| CN119788847A (zh) * | 2024-12-20 | 2025-04-08 | 摩尔线程智能科技(北京)股份有限公司 | 帧类型判断方法、装置、电子设备及存储介质 |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107277519B (zh) * | 2017-06-30 | 2019-11-15 | 武汉斗鱼网络科技有限公司 | 一种判断视频帧的帧类型的方法及电子设备 |
| CN112449182B (zh) * | 2019-08-28 | 2025-01-07 | 广州虎牙科技有限公司 | 视频编码方法、装置、设备及存储介质 |
| CN116567228A (zh) | 2022-01-27 | 2023-08-08 | 腾讯科技(深圳)有限公司 | 编码方法、实时通信方法、装置、设备及存储介质 |
| CN115361582B (zh) * | 2022-07-19 | 2023-04-25 | 鹏城实验室 | 一种视频实时超分辨率处理方法、装置、终端及存储介质 |
| CN115442602B (zh) * | 2022-08-12 | 2025-09-05 | 阿里巴巴(中国)有限公司 | 编码单元的划分决策方法、解码、转码方法、设备及介质 |
| CN119363979B (zh) * | 2024-12-20 | 2025-04-11 | 摩尔线程智能科技(北京)股份有限公司 | 帧类型确定方法、装置、电子设备及存储介质 |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002037859A2 (en) * | 2000-11-03 | 2002-05-10 | Compression Science | Video data compression system |
| CN102196291A (zh) * | 2011-05-20 | 2011-09-21 | 四川长虹电器股份有限公司 | 一种双目立体视频编码方法 |
| CN102300096A (zh) * | 2011-09-19 | 2011-12-28 | 盛乐信息技术(上海)有限公司 | 帧类型检测方法及帧类型检测系统 |
| US8891892B2 (en) * | 2012-03-26 | 2014-11-18 | Electronics and Teleommunications Research Institute | Image encoding method using adaptive preprocessing scheme |
| CN104394485A (zh) * | 2012-08-21 | 2015-03-04 | 华为技术有限公司 | 视频流的帧类型检测、帧大小检测方法及装置 |
| CN105898296A (zh) * | 2015-12-24 | 2016-08-24 | 乐视云计算有限公司 | 视频编码帧选择方法及装置 |
| CN107277519A (zh) * | 2017-06-30 | 2017-10-20 | 武汉斗鱼网络科技有限公司 | 一种判断视频帧的帧类型的方法及电子设备 |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5164828A (en) * | 1990-02-26 | 1992-11-17 | Sony Corporation | Video signal transmission and method and apparatus for coding video signal used in this |
| FR2696026B1 (fr) * | 1992-09-18 | 1994-12-30 | Sgs Thomson Microelectronics | Procédé de masquage d'erreurs de transmission d'image compressée en MPEG. |
| EP0935396A3 (en) * | 1998-02-09 | 2004-08-11 | Matsushita Electric Industrial Co., Ltd. | Video coding method and apparatus |
| CN108495130B (zh) * | 2017-03-21 | 2021-04-20 | 腾讯科技(深圳)有限公司 | 视频编码、解码方法和装置、终端、服务器和存储介质 |
-
2017
- 2017-06-30 CN CN201710523398.0A patent/CN107277519B/zh active Active
-
2018
- 2018-05-24 WO PCT/CN2018/082159 patent/WO2019001072A1/zh not_active Ceased
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002037859A2 (en) * | 2000-11-03 | 2002-05-10 | Compression Science | Video data compression system |
| CN102196291A (zh) * | 2011-05-20 | 2011-09-21 | 四川长虹电器股份有限公司 | 一种双目立体视频编码方法 |
| CN102300096A (zh) * | 2011-09-19 | 2011-12-28 | 盛乐信息技术(上海)有限公司 | 帧类型检测方法及帧类型检测系统 |
| US8891892B2 (en) * | 2012-03-26 | 2014-11-18 | Electronics and Teleommunications Research Institute | Image encoding method using adaptive preprocessing scheme |
| CN104394485A (zh) * | 2012-08-21 | 2015-03-04 | 华为技术有限公司 | 视频流的帧类型检测、帧大小检测方法及装置 |
| CN105898296A (zh) * | 2015-12-24 | 2016-08-24 | 乐视云计算有限公司 | 视频编码帧选择方法及装置 |
| CN107277519A (zh) * | 2017-06-30 | 2017-10-20 | 武汉斗鱼网络科技有限公司 | 一种判断视频帧的帧类型的方法及电子设备 |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116546203A (zh) * | 2023-04-24 | 2023-08-04 | 北京达佳互联信息技术有限公司 | 视频帧处理方法、装置、电子设备及可读存储介质 |
| CN119788847A (zh) * | 2024-12-20 | 2025-04-08 | 摩尔线程智能科技(北京)股份有限公司 | 帧类型判断方法、装置、电子设备及存储介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN107277519A (zh) | 2017-10-20 |
| CN107277519B (zh) | 2019-11-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2019001072A1 (zh) | 一种判断视频帧的帧类型的方法及电子设备 | |
| CN113766226B (zh) | 图像编码方法、装置、设备及存储介质 | |
| US11451787B2 (en) | Method and apparatus for video encoding and decoding | |
| US12125171B2 (en) | Video denoising method and apparatus, and storage medium | |
| CN107087200B (zh) | 针对高效率视频编码标准的跳过编码模式提前判决方法 | |
| US10356441B2 (en) | Method and apparatus for detecting quality defects in a video bitstream | |
| CN112637591B (zh) | 一种视频预测编码的方法及装置 | |
| CN111970511B (zh) | 一种基于vmaf的感知视频率失真编码优化方法及装置 | |
| US8000393B2 (en) | Video encoding apparatus and video encoding method | |
| US9756356B2 (en) | Application-assisted spatio-temporal error concealment for RTP video | |
| WO2023131059A1 (zh) | 图像编码方法、图像编码装置、电子设备和可读存储介质 | |
| KR20120139552A (ko) | 디지털 비디오 데이터를 엔코딩하기 위한 방법 및 디지털 비디오 엔코더 시스템 | |
| CN108989818B (zh) | 一种图像编码参数调整方法及装置 | |
| CN105898306A (zh) | 用于运动视频的码率控制方法及装置 | |
| CN100525454C (zh) | 帧间预测模式的选择方法 | |
| CN115866250A (zh) | 视频处理方法、装置、设备及计算机可读存储介质 | |
| CN104853191A (zh) | 一种hevc的快速编码方法 | |
| Chen et al. | Adaptive intra-refresh for low-delay error-resilient video coding | |
| Wang et al. | Region-classification-based rate control for flicker suppression of I-frames in HEVC | |
| CN110769255A (zh) | 一种图像编码方法及装置 | |
| CN109618152B (zh) | 深度划分编码方法、装置和电子设备 | |
| CN108600757A (zh) | 宏块类型判定方法、视频转码方法、电子设备和存储介质 | |
| CN116527944B (zh) | 样点自适应补偿方法、装置、设备、存储介质以及产品 | |
| CN101179734B (zh) | 一种视频压缩的帧内预测方法和系统 | |
| KR101678138B1 (ko) | 동화상 부호화 방법, 장치 및 프로그램 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18825521 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 18825521 Country of ref document: EP Kind code of ref document: A1 |