CN108366295B - Video classification feature extraction method, transcoding recompression detection method and storage medium - Google Patents
Video classification feature extraction method, transcoding recompression detection method and storage medium Download PDFInfo
- Publication number
- CN108366295B CN108366295B CN201810144370.0A CN201810144370A CN108366295B CN 108366295 B CN108366295 B CN 108366295B CN 201810144370 A CN201810144370 A CN 201810144370A CN 108366295 B CN108366295 B CN 108366295B
- Authority
- CN
- China
- Prior art keywords
- video
- frame
- compressed
- classification
- division type
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 59
- 238000000605 extraction Methods 0.000 title claims abstract description 21
- 230000000007 visual effect Effects 0.000 claims abstract description 11
- 238000000034 method Methods 0.000 claims description 35
- 238000007906 compression Methods 0.000 claims description 34
- 230000006835 compression Effects 0.000 claims description 29
- 238000005192 partition Methods 0.000 claims description 21
- 238000012706 support-vector machine Methods 0.000 claims description 19
- 238000012360 testing method Methods 0.000 claims description 14
- 238000012549 training Methods 0.000 claims description 9
- 238000011156 evaluation Methods 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 3
- 238000012935 Averaging Methods 0.000 claims 1
- 230000006870 function Effects 0.000 description 11
- 238000004422 calculation algorithm Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 239000011159 matrix material Substances 0.000 description 6
- 230000008859 change Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000013139 quantization Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440218—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/177—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
本发明提供了一种视频分类特征提取方法、转码重压缩检测方法及存储介质,所述视频分类特征提取方法包括:利用可视化分析器提取视频帧的PU划分类型,并将提取到的所述视频帧的PU划分类型以像素块为基本单元进行标记;统计视频中每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目;将所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值,得到所有组连续画面里第一个P帧的各个PU划分类型的分类特征。本发明使用数目较少维数特征,达到了更高的重压缩视频的检测率。
The present invention provides a video classification feature extraction method, a transcoding recompression detection method and a storage medium. The video classification feature extraction method includes: using a visual analyzer to extract the PU division type of a video frame, and extracting the extracted The PU division type of the video frame is marked with the pixel block as the basic unit; the number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures in the video is counted; The number of pixel blocks corresponding to each PU division type of the frame is averaged, and the classification features of each PU division type of the first P frame in all groups of consecutive pictures are obtained. The present invention uses a smaller number of dimensional features to achieve a higher detection rate of recompressed video.
Description
技术领域technical field
本发明涉及视频处理技术领域,特别涉及一种视频分类特征提取方法、视频转码重压缩检测方法及计算机可读存储介质。The present invention relates to the technical field of video processing, and in particular, to a video classification feature extraction method, a video transcoding recompression detection method and a computer-readable storage medium.
背景技术Background technique
在当下互联网的快速发展下,数字视频的获取和传输变得日益成熟和普及。同时,功能日益强大的视频编辑软件也越来越受到民众追捧,这使得数字视频被篡改的情况时有发生。当被篡改的视频用于司法、媒体等行业时,会歪曲事实真相,继而会引起司法误判、媒体失实的严重后果。于是,数字视频的真实性和完整性成为当下社会需要迫切解决的问题。With the rapid development of the current Internet, the acquisition and transmission of digital video has become increasingly mature and popular. At the same time, increasingly powerful video editing software has become more and more popular, which makes digital video tampering happen from time to time. When the tampered video is used in judicial, media and other industries, it will distort the truth, which will lead to serious consequences of judicial misjudgment and media misrepresentation. Therefore, the authenticity and integrity of digital video has become an urgent problem to be solved in the current society.
目前,国内外的研究主要集中在数字图像方面,也获得了大量的研究成果。例如,检测版权图像的非法拷贝,图像中的复制移动检测,以及将计算机生成的图像与摄影图像区分开。由于视频的信息量大、篡改方式多样等特点,导致视频取证研究困难重重。依托于图像取证研究的进展,近年来视频取证技术也获得了长足发展。At present, the research at home and abroad mainly focuses on digital images, and a lot of research results have also been obtained. For example, detecting illegal copies of copyrighted images, detection of copy movement in images, and distinguishing computer-generated images from photographic images. Due to the large amount of video information and various tampering methods, video forensics research is difficult. Relying on the progress of image forensics research, video forensics technology has also made great progress in recent years.
常见的视频篡改手段一般需经历解码、删除帧或插入帧等篡改操作,被篡改的视频序列都需要经过再次压缩才能重新生成视频码流。因此,检测视频是否被重压缩可作为检测视频是否被篡改的一种技术手段。现有技术中,对重压缩检测的方法有多种,例如:以同比特率对MPEG视频进行重压缩时,单次压缩与双次压缩的DCT系数变化数目要比双次压缩和三次压缩的变化数目多,并利用此现象进行视频的重压缩检测。通过检查重压缩视频的块效应强度规律及其均值的变化来检测MPEG视频的重压缩。利用了相邻P帧的运动补偿边际效应的不同,并通过判断傅里叶变换域中是否存在尖峰来检测MPEG视频的重压缩。利用重压缩视频中量化DCT系数统计直方图的凸特性区分MPEG-2单次压缩视频和重压缩视频,算法适用于检测采用不同MPEG-2编码器的重压缩,对帧删除篡改具有鲁棒性。一种以非零量化AC系数的概率为特征的检测算法用于区分H.264单次压缩视频和双重压缩视频,当第二次压缩量化参数小于首次压缩量化参数时取得了很高的分类准确率。一种相同量化参数下的H.264视频多重压缩取证算法,以相邻三次压缩间不同量化DCT系数的比率差构建了含有四分位数的特征集,作为支持向量机的输入,实现了对单次压缩视频和多重压缩视频的分类。所提算法有高分类精度,对复制/粘贴攻击和帧删除攻击具有较强的鲁棒性。Common video tampering methods generally need to undergo tampering operations such as decoding, deleting frames or inserting frames. The tampered video sequence needs to be compressed again to regenerate the video stream. Therefore, detecting whether the video is recompressed can be used as a technical means to detect whether the video has been tampered with. In the prior art, there are many methods for recompression detection. For example, when recompressing an MPEG video at the same bit rate, the number of DCT coefficient changes in single-compression and double-compression is larger than that in double-compression and triple-compression. The number of changes is large, and this phenomenon is used for video recompression detection. The recompression of MPEG video is detected by examining the regularity of the blockiness intensity of the recompressed video and the change of its mean value. The difference of the motion compensation side effects of adjacent P frames is exploited, and the recompression of MPEG video is detected by judging whether there is a spike in the Fourier transform domain. Distinguish MPEG-2 single-shot compressed video and recompressed video by using the convexity of the statistical histogram of quantized DCT coefficients in recompressed video. The algorithm is suitable for detecting recompression using different MPEG-2 encoders and is robust to frame deletion and tampering. . A detection algorithm characterized by the probability of non-zero quantized AC coefficients is used to distinguish H.264 single-compressed video and double-compressed video, and a high classification accuracy is achieved when the quantization parameter of the second compression is smaller than the quantization parameter of the first compression. Rate. A multi-compression forensics algorithm for H.264 video under the same quantization parameters, constructs a feature set containing quartiles based on the ratio difference of different quantized DCT coefficients between adjacent three compressions, as the input of the support vector machine, realizes the Classification of single-compressed video and multi-compressed video. The proposed algorithm has high classification accuracy and strong robustness to copy/paste attack and frame deletion attack.
作为最新的视频编码标准,HEVC已经吸引了相当多的研究者的关注。和H.264相比,相同视频质量情况下,HEVC提供双倍的数据压缩比,也即相同的比特率情况下,HEVC能大幅提高视频质量。它支持的分辨率高达8192×4320,其中也包括8k UHD。因此,现有技术中提出了针对HEVC视频重压缩加测的方法,例如:一种基于相邻DCT系数对奇偶组合统计特性的HEVC重压缩视频检测算法。同时在现有技术中提出了基于量化DCT系数的共生矩阵和基于Markov特征优化的HEVC视频重压缩检测算法。利用HEVC重压缩视频中I帧的各PU划分类型对应的块数目的突变现象,提出了不同比特率下HEVC视频重压缩取证算法。但是,现有技术中针对HEVC视频重压缩检测的方法的分类特征维数较多,计算量大,也没有达到较高的检测率。As the latest video coding standard, HEVC has attracted considerable attention of researchers. Compared with H.264, under the same video quality, HEVC provides double the data compression ratio, that is, under the same bit rate, HEVC can greatly improve the video quality. It supports resolutions up to 8192×4320, which also includes 8k UHD. Therefore, in the prior art, a method for recompression and measurement of HEVC video is proposed, for example, an HEVC recompression video detection algorithm based on the statistical characteristics of adjacent DCT coefficients on parity combination. At the same time, a co-occurrence matrix based on quantized DCT coefficients and an HEVC video recompression detection algorithm based on Markov feature optimization are proposed in the prior art. Taking advantage of the sudden change in the number of blocks corresponding to each PU division type of I frame in HEVC recompressed video, a forensic algorithm for HEVC video recompression at different bit rates is proposed. However, in the prior art methods for HEVC video recompression detection, the classification feature dimensions are large, the amount of calculation is large, and a high detection rate is not achieved.
因此,为了解决上述技术问题,需要一种分类特征维数少,达到高检测率的一种视频分类特征提取方法及视频转码重压缩检测方法。Therefore, in order to solve the above-mentioned technical problems, there is a need for a video classification feature extraction method and a video transcoding recompression detection method with few classification feature dimensions and high detection rate.
发明内容SUMMARY OF THE INVENTION
本发明的目的在于提供一种视频分类特征提取方法及视频转码重压缩检测方法,以解决现有技术中的至少一种缺陷。The purpose of the present invention is to provide a video classification feature extraction method and a video transcoding recompression detection method to solve at least one defect in the prior art.
本发明的一方面提供一种及视频转码重压缩检测方法,所述方法包括:One aspect of the present invention provides a video transcoding recompression detection method, the method includes:
利用可视化分析器提取视频帧的预测单元(PU)划分类型,并将提取到的所述视频帧的PU划分类型以像素块为基本单元进行标记;Extract the prediction unit (PU) division type of the video frame by using a visual analyzer, and mark the extracted PU division type of the video frame with pixel blocks as the basic unit;
统计视频中每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目;Count the number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures in the video;
将所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值,得到所有组连续画面里第一个P帧的各个PU划分类型的分类特征。The number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures is averaged, and the classification features of each PU division type of the first P frame in all groups of consecutive pictures are obtained.
优选地,所述视频帧的PU划分类型提取时,设定所述可视化分析器边界颜色的RGB分量,优选可以设置为(255,0,255)或者其他能够区分PU划分边框与视频内容的任何适宜的分量值。Preferably, when extracting the PU partition type of the video frame, the RGB component of the border color of the visual analyzer is set, preferably (255, 0, 255) or any other suitable value that can distinguish the PU partition border from the video content. component value.
优选地,所述视频帧的PU划分类型以N×N像素块为基本单元进行标记,其中N为4或者4的整数倍值。例如可以以8×8像素块为基本单元进行标记。Preferably, the PU division type of the video frame is marked with an N×N pixel block as a basic unit, where N is 4 or an integer multiple of 4. For example, an 8×8 pixel block can be used as a basic unit for marking.
优选地,所述所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值通过如下公式实现:Preferably, the average value of the number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures is achieved by the following formula:
其中Pi={pi,0,pi,1,...,pi,24}(i=1,2,...,M),M为视频中包含的连续画面的组数。 Wherein P i ={pi ,0 ,pi ,1 ,...,pi ,24 }(i=1,2,...,M), where M is the number of groups of consecutive pictures included in the video.
本发明的另一个方面在于提供一种视频转码重压缩检测方法,包括:Another aspect of the present invention is to provide a video transcoding recompression detection method, comprising:
随机选取数目相同的单次压缩视频和重压缩视频作为训练样本送入支持向量机;Randomly select the same number of single-shot compressed videos and re-compressed videos as training samples and send them to the support vector machine;
按照如下方法对所述单次压缩视频和重压缩视频进行视频分类特征提取:对PU划分类型分析,利用可视化分析器提取视频帧的PU划分类型,并将提取到的所述视频帧的PU划分类型以像素块为基本单元进行标记;统计视频中每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目;将所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值,得到所有组连续画面里第一个P帧的各个PU划分类型的分类特征;Perform video classification feature extraction on the single compressed video and the recompressed video according to the following method: analyze the PU division type, use a visual analyzer to extract the PU division type of the video frame, and divide the extracted PU of the video frame into The type is marked with pixel blocks as the basic unit; count the number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures in the video; divide each PU of the first P frame in all groups of consecutive pictures The number of pixel blocks corresponding to the type is averaged, and the classification features of each PU division type of the first P frame in all groups of consecutive pictures are obtained;
所述支持向量机根据提取的所述分类特征构建判决函数;The support vector machine constructs a decision function according to the extracted classification feature;
随机选取用于测试的单次压缩视频和重压缩视频作为测试样本送入所述支持向量机,所述支持向量机根据所述判决函数输出判定测试的视频为单次压缩视频还是重压缩视频的分类检测结果。Randomly select the single-shot compression video and the re-compressed video for testing and send them to the support vector machine as test samples, and the support vector machine determines whether the test video is a single-shot compressed video or a re-compressed video according to the output of the decision function. Classification detection results.
优选地,所述方法还包括,通过如下方式计算表示分类性能的评估指标:其中,AR为评估指标,TNR为判定为单次压缩视频的比率;TPR为判定为重压缩视频的比率。Preferably, the method further includes calculating an evaluation index representing the classification performance in the following manner: Among them, AR is the evaluation index, TNR is the ratio of the video that is judged as single compression; TPR is the ratio of the video that is judged to be re-compressed.
优选地,所述方法还包括:通过如下方式计算表示视频压缩检测的检测率:其中,n为测试和训练不同的视频样本的次数。Preferably, the method further comprises: calculating a detection rate representing video compression detection by: where n is the number of times to test and train different video samples.
所述重压缩视频为原始视频以第一比特率进行H.261、H.263、H.263+、H.264、MPEG-1、MPEG-2和MPEG-4中的任一种标准格式进行压缩。例如所述重压缩视频为原始视频以第一比特率进行H.264压缩,经解码后再对解码后视频以第二比特率进行HEVC压缩得到的视频。The recompressed video is the original video and is processed in any one of the standard formats of H.261, H.263, H.263+, H.264, MPEG-1, MPEG-2 and MPEG-4 at the first bit rate. compression. For example, the recompressed video is a video obtained by performing H.264 compression on the original video at a first bit rate, and then performing HEVC compression on the decoded video at a second bit rate after decoding.
优选地,所述单次压缩视频为原始视频以第二比特率进行HEVC压缩得到的视频。Preferably, the single-shot compressed video is a video obtained by performing HEVC compression on the original video at the second bit rate.
所述视频帧的PU划分类型以N×N像素块为基本单元进行标记,其中N为4或者4的整数倍值。例如,所述视频帧的PU划分类型提取时,选取所述可视化分析器边界颜色的RGB分量为(255,0,255);所述视频帧的PU划分类型以8×8像素块为基本单元进行标记。The PU division type of the video frame is marked with an N×N pixel block as a basic unit, where N is 4 or an integer multiple of 4. For example, when extracting the PU division type of the video frame, the RGB component of the border color of the visual analyzer is selected as (255, 0, 255); the PU division type of the video frame is marked with an 8×8 pixel block as the basic unit .
优选地,所述所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值通过如下方法实现:Preferably, the average value of the number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures is achieved by the following method:
其中Pi={pi,0,pi,1,...,pi,24}(i=1,2,...,M),M为视频中包含的连续画面的组数。 Wherein P i ={pi ,0 ,pi ,1 ,...,pi ,24 }(i=1,2,...,M), where M is the number of groups of consecutive pictures included in the video.
本发明的又一方面还提供一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序,在该计算机程序被执行时实现如上所述的方法步骤。Yet another aspect of the present invention also provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed, the above-mentioned method steps are implemented.
本发明提供的一种视频分类特征提取方法及视频转码重压缩检测方法,针对H.264至HEVC标准的视频转码重压缩视频进行重压缩检测,提取的分类特征维数少,能够达到更高的检测率。此外,本发明提供的方法同样也可以适用于检测第一次视频编码标准为其他HEVC标准之前的视频编码标准,例如也可以适用于H.261、H.263、H.263+、MPEG-1、MPEG-2和MPEG-4中的任一种标准。The invention provides a video classification feature extraction method and a video transcoding recompression detection method, which perform recompression detection for video transcoding recompressed videos of H.264 to HEVC standards, and the extracted classification features have fewer dimensions, which can achieve higher high detection rate. In addition, the method provided by the present invention can also be applied to detect the video encoding standard before the first video encoding standard is other HEVC standard, for example, it can also be applied to H.261, H.263, H.263+, MPEG-1 , MPEG-2 and MPEG-4.
应当理解,前述大体的描述和后续详尽的描述均为示例性说明和解释,并不应当用作对本发明所要求保护内容的限制。It is to be understood that both the foregoing general description and the following detailed description are exemplary illustrations and explanations, and should not be used as limitations on what is claimed in the present invention.
附图说明Description of drawings
参考随附的附图,本发明更多的目的、功能和优点将通过本发明实施方式的如下描述得以阐明,其中:Further objects, functions and advantages of the present invention will be elucidated by the following description of embodiments of the present invention with reference to the accompanying drawings, wherein:
图1是不同预测模式下PU划分方式的示意图;FIG. 1 is a schematic diagram of a PU division mode under different prediction modes;
图2是本发明视频分类特征提取方法的流程框图;Fig. 2 is the flow chart of the video classification feature extraction method of the present invention;
图3a至图3d是单次压缩视频和重压缩视频的P帧PU划分类型示意图;3a to 3d are schematic diagrams of P-frame PU division types of single-shot compressed video and recompressed video;
图4是本发明PU划分类型的标记示意图;FIG. 4 is a schematic diagram of marking of PU division types of the present invention;
图5是本发明视频转码重压缩检测方法的流程框图。Fig. 5 is a flow chart of the video transcoding and recompression detection method of the present invention.
具体实施方式Detailed ways
通过参考示范性实施例,本发明的目的和功能以及用于实现这些目的和功能的方法将得以阐明。然而,本发明并不受限于以下所公开的示范性实施例;可以通过不同形式来对其加以实现。说明书的实质仅仅是帮助相关领域技术人员综合理解本发明的具体细节。Objects and functions of the present invention and methods for achieving these objects and functions will be elucidated by referring to the exemplary embodiments. However, the present invention is not limited to the exemplary embodiments disclosed below; it may be implemented in various forms. The essence of the description is merely to assist those skilled in the relevant art to comprehensively understand the specific details of the present invention.
在下文中,将参考附图描述本发明的实施例。在附图中,相同的附图标记代表相同或类似的部件,或者相同或类似的步骤。下面通过具体的实施方式对本发明的内容进行说明,视频篡改者在对视频进行帧删除、插入等篡改后,需要另外的视频压缩格式对视频序列进行重压缩。与其他编码标准相比,H.264编码与HEVC编码最为相关,编码架构大致相似。篡改者对H.264格式的原始视频篡改后,再次压缩时采用HEVC编码重新压缩。本实例中采用H.264编码标准作为第一次视频压缩的标准。Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings. In the drawings, the same reference numbers represent the same or similar parts, or the same or similar steps. The content of the present invention will be described below through specific embodiments. After the video tamperer performs frame deletion, insertion, etc. tampering on the video, he needs another video compression format to recompress the video sequence. Compared with other encoding standards, H.264 encoding is most related to HEVC encoding, and the encoding architecture is roughly similar. After tampering with the original video in H.264 format, HEVC encoding is used for recompression when recompressing. In this example, the H.264 coding standard is used as the standard for the first video compression.
视频经转码重压缩后,由于关键帧像素值的改变,一组连续画面(GOP)里第一个P帧在以关键帧作为参考帧进行帧间预测编码时,其帧内PU划分类型也会相应地发生变化。基于此现象,本发明采用直方图统计此类帧内的各PU划分类型对应的像素块数目,以此作为分类特征来检测视频的转码重压缩。After the video is transcoded and recompressed, due to the change of the pixel value of the key frame, when the first P frame in a group of continuous pictures (GOP) uses the key frame as the reference frame for inter-frame prediction encoding, the intra-frame PU division type also changes. will change accordingly. Based on this phenomenon, the present invention uses a histogram to count the number of pixel blocks corresponding to each PU division type in such a frame, and uses this as a classification feature to detect video transcoding and recompression.
实施例中为了使本发明针对其它编码标准到HEVC的转码重压缩检测得以更加清晰的阐释,有必要对P帧PU划分类型进行说明。HEVC编码采用了预测加变换的混合编码框架。在HEVC编码中引入了三个基本单元,即编码单元(CU),预测单元(PU)和变换单元(TU),从而使得HEVC比H.264/AVC的编码方式更加灵活。其中,CU是编码基本单元,PU用于帧内和帧间预测,TU用于变换和量化。PU是包含了预测信息的基本单元,一个CU可以分割为一个或多个PU,PU预测模式可以分为跳过、帧内和帧间。如图1所示不同模式下PU划分类型的示意图,当预测模式为跳过模式a时,其PU大小只能为2N×2N;当预测模式为帧内模式b时,PU有2N×2N和N×N两种划分模式;当预测模式为帧间模式c时,PU有8种划分模式,包括2N×2N和N×N两种方形分割模式c1、2N×N和N×2N两个对称分割模式c2,以及2N×nU、2N×nD、nL×2N和nR×2N四个非对称分割模式c3。其中非对称分割模式c3为可选模式,能通过编码配置文件中的相关语法控制其开启或关闭。In the embodiment, in order to clarify the transcoding and recompression detection of the present invention for other coding standards to HEVC, it is necessary to describe the PU division type of the P frame. HEVC coding adopts a hybrid coding framework of prediction and transform. Three basic units are introduced in HEVC coding, namely coding unit (CU), prediction unit (PU) and transform unit (TU), thus making HEVC more flexible than H.264/AVC coding. Among them, CU is the coding basic unit, PU is used for intra and inter prediction, and TU is used for transform and quantization. A PU is a basic unit that contains prediction information. A CU can be divided into one or more PUs. The PU prediction modes can be divided into skip, intra and inter. Figure 1 shows a schematic diagram of PU division types in different modes. When the prediction mode is skip mode a, the PU size can only be 2N×2N; when the prediction mode is intra mode b, the PU has 2N×2N and N×N two division modes; when the prediction mode is inter frame mode c, the PU has 8 division modes, including 2N×2N and N×N two square division modes c1, 2N×N and N×2N two symmetrical The division mode c2, and four asymmetric division modes c3 of 2N×nU, 2N×nD, nL×2N and nR×2N. The asymmetric split mode c3 is an optional mode, which can be turned on or off through the relevant syntax in the encoding configuration file.
在转码视频重压缩过程中,PU块包含的预测信息阐释了CU块的预测过程。如图2所示为本发明视频分类特征提取方法的流程框图,如图2所示,本发明实施例提供的一种视频分类特征提取方法包括如下步骤:In the transcoded video recompression process, the prediction information contained in the PU block explains the prediction process of the CU block. Figure 2 is a flowchart of a method for extracting video classification features according to the present invention. As shown in Figure 2, a method for extracting video classification features provided by an embodiment of the present invention includes the following steps:
步骤S101,对PU划分类型分析,提取视频帧的PU划分类型并进行标记。Step S101, analyze the PU division type, extract the PU division type of the video frame and mark it.
对PU划分类型分析,如图3a至图3d所示为单次压缩视频和H.264至HEVC重压缩视频的P帧PU划分类型示意图,实施例中通过对PU划分类型分析,进而提取视频的分类特征。图3a为使用HEVC编码,以3.5M比特率压缩的bridge_far的第一组连续画面(GOP)里第一个P帧的PU划分类型。图3b为使用H.264编码以3M比特率压缩后,再用HEVC编码以3.5M比特率压缩后的第一组连续画面(GOP)里第一个P帧的PU划分类型。图3c为使用H.264编码以3.5M比特率压缩后,再用HEVC编码以3.5M比特率压缩bridge_far的第一组连续画面(GOP)里第一个P帧的PU划分类型。图3d为使用H.264编码以4M比特率压缩后,再用HEVC编码以3.5M比特率压缩bridge_far的第一组连续画面(GOP)里第一个P帧的PU划分类型。For the analysis of the PU division type, as shown in Figures 3a to 3d are schematic diagrams of the P-frame PU division type of the single compressed video and the H.264 to HEVC recompressed video. In the embodiment, the PU division type is analyzed to extract the video's Categorical features. Figure 3a shows the PU partition type of the first P frame in the first group of continuous pictures (GOP) of bridge_far compressed at a bit rate of 3.5M using HEVC encoding. Figure 3b shows the PU partition type of the first P frame in the first group of consecutive pictures (GOPs) after compression at a bit rate of 3M using H.264 encoding and compression at a bit rate of 3.5M using HEVC encoding. Figure 3c shows the PU partition type of the first P frame in the first group of continuous pictures (GOP) of bridge_far after being compressed at 3.5M bit rate using H.264 encoding and then using HEVC encoding to compress at 3.5M bit rate. Figure 3d shows the PU partition type of the first P frame in the first group of continuous pictures (GOP) in the first group of continuous pictures (GOP) of bridge_far after using H.264 encoding at 4M bit rate and then using HEVC encoding to compress at 3.5M bit rate.
表1.各PU划分类型对应的像素块数目Table 1. Number of pixel blocks corresponding to each PU partition type
从表1中可看出,3M-3.5M、3.5M-3.5M和4M-3.5的各PU划分类型对应的像素块数目分布趋势大致相同,与3.5M的PU划分类型对应的像素块数目分布相差较大。特别是PU划分类型为4×4、8×8、4×8、8×4时,3M-3.5M、3.5M-3.5M和4M-3.5的各PU划分类型对应的像素块数目与3.5M的PU划分类型对应的像素块数目分布具有较大的差别。It can be seen from Table 1 that the distribution trend of the number of pixel blocks corresponding to each PU division type of 3M-3.5M, 3.5M-3.5M and 4M-3.5 is roughly the same, and the distribution of the number of pixel blocks corresponding to the 3.5M PU division type Big difference. Especially when the PU division types are 4×4, 8×8, 4×8, and 8×4, the number of pixel blocks corresponding to each PU division type of 3M-3.5M, 3.5M-3.5M and 4M-3.5 is the same as that of 3.5M. The distribution of the number of pixel blocks corresponding to the PU partition type has a large difference.
由上述分析PU划分类型分析及表1显示,使用H.264编码后,再用HEVC编码压缩后的第一组连续画面(GOP)里第一个P帧的PU划分类型以小像素块为主;单次使用HEVC编码压缩后的第一组连续画面(GOP)里第一个P帧的PU划分类型以大像素块为主。由此,使用H.264编码后,再用HEVC编码压缩后的第一组连续画面(GOP)里第一个P帧的PU为精细的PU划分类型。According to the above analysis of PU partition type analysis and Table 1, after using H.264 encoding, the PU partition type of the first P frame in the first group of continuous pictures (GOP) compressed by HEVC encoding is mainly small pixel blocks. ; The PU division type of the first P frame in the first group of continuous pictures (GOP) compressed by HEVC encoding at a time is dominated by large pixel blocks. Therefore, after using H.264 encoding, the PU of the first P frame in the first group of consecutive pictures (GOPs) compressed by HEVC encoding is a fine PU division type.
应当理解,在H.264编码压缩时,分别对每个像素块进行DCT变换,块与块之间的相关性被忽略。块与块的边界处的像素值出现不连续跳变。当视频再以HEVC编码压缩时,由于块效应的影响,块与块的边界处便需要更小的PU划分类型来表达此处的图像跳变。除H.264之外的其它HEVC之前的视频编码标准亦采用分块DCT变换,同样存在块效应,因此可以预见采用HEVC之前的视频编码标准进行视频压缩之后,再转码为HEVC格式视频时,都会需要更小的PU划分类型来表达块边界处的图像跳变。It should be understood that during H.264 coding and compression, DCT transform is performed on each pixel block separately, and the correlation between blocks is ignored. The pixel values at the block-to-block boundary appear discontinuous jumps. When the video is compressed by HEVC encoding, due to the influence of block effect, a smaller PU partition type is required at the boundary between blocks to express the image transition here. In addition to H.264, other pre-HEVC video coding standards also use block DCT transform, and there is also block effect. Therefore, it is foreseeable that after video compression is performed using pre-HEVC video coding standards, and then transcoded to HEVC format video, the Both require smaller PU partition types to express image transitions at block boundaries.
根据本发明,实施例通过对PU划分类型分析后,利用可视化分析器提取视频帧的PU划分类型,并将提取到的视频帧的PU划分类型以像素块为基本单元进行标记。According to the present invention, the embodiment uses a visual analyzer to extract the PU division type of the video frame after analyzing the PU division type, and marks the PU division type of the extracted video frame with pixel blocks as the basic unit.
具体地,实施例中可视化分析器可以采用例如Gitl_HEVC_Analyze的视频分析软件或者其他任何适宜的软件对视频帧进行PU划分类型的提取。为了将视频背景和PU划分类型边界区分开,可视化分析器边界颜色的RGB分量设置为(255,0,255)。Specifically, in the embodiment, the visual analyzer may use video analysis software such as Gitl_HEVC_Analyze or any other suitable software to perform PU division type extraction on the video frame. To distinguish the video background from the PU partition type boundary, the RGB components of the visual analyzer boundary color are set to (255, 0, 255).
根据本发明,实施例中视频帧的PU划分类型以8×8像素块为基本单元进行标记,实施例中对PU划分类型以标号的方式进行标记,表2为标号对应的PU划分类型,根据本发明,实施例中共25种PU划分类型。如图4所示本发明PU划分类型的标记示意图。According to the present invention, in the embodiment, the PU division type of a video frame is marked with an 8×8 pixel block as the basic unit, and in the embodiment, the PU division type is marked in the form of a label. In the present invention, there are altogether 25 PU division types in the embodiment. As shown in FIG. 4 , a schematic diagram of marking of PU division types of the present invention is shown.
表2.标号对应的PU划分类型Table 2. PU partition types corresponding to labels
步骤S102,统计视频中每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目。Step S102, count the number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures in the video.
统计视频中每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目。将每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目记为:Pi={pi,0,pi,1,...,pi,24}(i=1,2,...,M),M为视频中包含的连续画面的组数,即每个Pi中记录了25种PU划分类型对应的8×8像素块数目。The number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures in the video is counted. The number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures is recorded as: P i ={pi ,0 ,pi ,1 ,...,pi ,24 }(i =1, 2, ..., M), M is the number of groups of consecutive pictures included in the video, that is, the number of 8×8 pixel blocks corresponding to 25 PU division types is recorded in each Pi .
步骤S103,提取视频的分类特征。Step S103, extracting classification features of the video.
将所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值,得到所有组连续画面里第一个P帧的各个PU划分类型的分类特征。所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值通过如下公式实现:The number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures is averaged, and the classification features of each PU division type of the first P frame in all groups of consecutive pictures are obtained. The average value of the number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures is achieved by the following formula:
其中Pi={pi,0,pi,1,...,pi,24}(i=1,2,...,M),M为视频中包含的连续画面的组数。 Wherein P i ={pi ,0 ,pi ,1 ,...,pi ,24 }(i=1,2,...,M), where M is the number of groups of consecutive pictures included in the video.
将所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目的平均值作为每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目的直方图,得到视频的分类特征。Taking the average value of the number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures as the histogram of the number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures, Get the classification features of the video.
通过本发明一种视频分类特征的提取方法对视频转码重压缩检测,如图5所示本发明视频转码重压缩检测方法的流程框图,具体地一种视频转码重压缩检测方法包括:The video transcoding recompression detection is performed by a method for extracting video classification features of the present invention, as shown in FIG.
步骤S201,随机选取数目相同的单次压缩视频和重压缩视频作为训练样本送入支持向量机。Step S201, randomly select the same number of single-shot compressed videos and re-compressed videos as training samples and send them to the support vector machine.
重压缩视频为原始视频以第一比特率进行H.264压缩,经解码后再对解码后视频以第二比特率进行HEVC压缩得到的视频。单次压缩视频为原始视频以第二比特率进行HEVC压缩得到的视频。The recompressed video is a video obtained by performing H.264 compression on the original video at the first bit rate, and then performing HEVC compression on the decoded video at the second bit rate after decoding. The single-shot compressed video is a video obtained by performing HEVC compression on the original video at the second bit rate.
具体地,实施例中首先制作单次压缩视频和重压缩视频作为检测目标。Specifically, in the embodiment, the single-shot compressed video and the re-compressed video are first made as detection targets.
采用34个未压缩的YUV序列作为初始视频,其中包括17个QCIF格式视频(分辨率为176×144)和17个CIF格式视频(分辨率为352×288)。为了增加样本容量,每个视频被分割成长度为100帧的非重叠视频片段。最后,总共生成36个QCIF视频片段和43个CIF视频片段。34 uncompressed YUV sequences are used as initial videos, including 17 QCIF format videos (with a resolution of 176×144) and 17 CIF format videos (with a resolution of 352×288). To increase the sample capacity, each video is segmented into non-overlapping video segments of length 100 frames. Finally, a total of 36 QCIF video clips and 43 CIF video clips are generated.
HM10.0采用encoder_lowdelay_P_main配置文件进行HEVC编码和解码过程。JM采用encoder_main配置文件进行H.264编码和解码过程。帧率、I帧周期和GOP大小分别设置为30、4和4。HM10.0 uses the encoder_lowdelay_P_main configuration file for the HEVC encoding and decoding process. JM uses the encoder_main configuration file for the H.264 encoding and decoding process. The frame rate, I-frame period, and GOP size are set to 30, 4, and 4, respectively.
制作单次压缩视频:对原始视频以第二比特率(B2)进行HEVC压缩得到。Making a single-shot compressed video: The original video is obtained by HEVC compression at the second bit rate (B 2 ).
制作重压缩视频:对原始视频以第一比特率(B1)进行H.264压缩,经解码后再对解码后视频以第二比特率(B2)进行HEVC压缩。Making a recompressed video: perform H.264 compression on the original video at the first bit rate (B 1 ), and then perform HEVC compression on the decoded video at the second bit rate (B 2 ) after decoding.
实施例中由于QCIF和CIF视频具有不同的空间分辨率,因此应选择不同的比特率来保证编码视频的视觉质量。对于QCIF视频,第一比特率(B1)和第二比特率(B2)的值分别从{100,200,300}(kbps)和{200,300,400}(kbps)中选择。对于CIF视频,第一比特率(B1)和第二比特率(B2)分别从{3,3.5,4}(Mbps)和{3.5,4,4.5}(Mbps)中选择。In the embodiment, since the QCIF and CIF videos have different spatial resolutions, different bit rates should be selected to ensure the visual quality of the encoded videos. For QCIF video, the values of the first bit rate (B 1 ) and the second bit rate (B 2 ) are selected from {100, 200, 300} (kbps) and {200, 300, 400} (kbps), respectively. For CIF video, the first bit rate (B 1 ) and the second bit rate (B 2 ) are selected from {3, 3.5, 4} (Mbps) and {3.5, 4, 4.5} (Mbps), respectively.
在上述制作的单次压缩视频和重压缩视频中,随即选取数目相同的单次压缩视频和重压缩视频作为训练样本送入支持向量机(SVM)进行训练。本实施例,对于QCIF格式的视频随机选择30个单次压缩视频和30个重压缩视频进行训练。对于CIF格式的视频随机选择35个单次压缩视频和35个重压缩视频进行训练。Among the single-shot compressed videos and re-compressed videos produced above, the same number of single-shot compressed videos and re-compressed videos are randomly selected as training samples and sent to a support vector machine (SVM) for training. In this embodiment, 30 single-compressed videos and 30 re-compressed videos are randomly selected for training for videos in QCIF format. For the videos in CIF format, 35 single-shot compressed videos and 35 re-compressed videos are randomly selected for training.
训练阶段,支持向量机(SVM)执行步骤2和步骤3构建判决函数。In the training phase, the support vector machine (SVM) performs
步骤202、对单次压缩视频和重压缩视频提取分类特征。Step 202: Extract classification features from the single-compressed video and the re-compressed video.
根据本发明,实施例按照如下方法对单次压缩视频和重压缩视频进行视频分类特征提取:According to the present invention, the embodiment performs video classification feature extraction on single compressed video and recompressed video according to the following method:
利用可视化分析器提取视频帧的PU划分类型,并将提取到的所述视频帧的PU划分类型以像素块为基本单元进行标记;Utilize the visual analyzer to extract the PU division type of the video frame, and mark the extracted PU division type of the video frame with pixel blocks as the basic unit;
统计视频中每一组连续画面里第一个P帧的各个PU划分类型对应的像素块数目;Count the number of pixel blocks corresponding to each PU division type of the first P frame in each group of consecutive pictures in the video;
将所有组连续画面里第一个P帧的各个PU划分类型对应的像素块数目求取平均值,得到所有组连续画面里第一个P帧的各个PU划分类型的分类特征。The number of pixel blocks corresponding to each PU division type of the first P frame in all groups of consecutive pictures is averaged, and the classification features of each PU division type of the first P frame in all groups of consecutive pictures are obtained.
步骤S203、支持向量机根据提取的分类特征构建判决函数。Step S203, the support vector machine constructs a decision function according to the extracted classification features.
对于步骤2中视频分类特征的提取在上文中已经给出了详细阐释,这里不再赘述。The extraction of the video classification features in
优选地,实施例中支持向量机可以选取具有SVMcg内核的LIBSVM开源软件或者其他具有类似功能的软件作为分类器。Preferably, in the embodiment, the support vector machine may select the LIBSVM open source software with the SVMcg kernel or other software with similar functions as the classifier.
步骤S204、随机选取用于测试的单次压缩视频和重压缩视频作为测试样本送入所述支持向量机,输出分类结果。Step S204 , randomly select the single-shot compressed video and the re-compressed video for testing as test samples and send them to the support vector machine, and output the classification result.
随机选取用于测试的单次压缩视频和重压缩视频作为测试样本送入所述支持向量机,所述支持向量机根据所述判决函数输出判定测试的视频为单次压缩视频还是重压缩视频的分类检测结果。Randomly select the single-shot compression video and the re-compressed video for testing and send them to the support vector machine as test samples, and the support vector machine determines whether the test video is a single-shot compressed video or a re-compressed video according to the output of the decision function. Classification detection results.
根据本发明,本实施例可通过如下方式计算表示分类性能的评估指标:According to the present invention, the present embodiment can calculate the evaluation index representing the classification performance in the following manner:
其中,AR为评估指标,TNR为判定为单次压缩视频的比率;TPR为判定为重压缩视频的比率。 Among them, AR is the evaluation index, TNR is the ratio of the video that is judged as single compression; TPR is the ratio of the video that is judged to be re-compressed.
视频压缩检测的检测率通过如下方式计算表示:The detection rate of video compression detection is calculated and expressed as follows:
其中,n为测试和训练不同的视频样本的次数。 where n is the number of times to test and train different video samples.
本实施例,选择20次训练和测试得到检测率的平均值,具体视频压缩检测的检测率的平均值通过如下方式计算:In this embodiment, 20 times of training and testing are selected to obtain the average value of the detection rate, and the average value of the detection rate of the specific video compression detection is calculated as follows:
其中,AR为评估指标,n=20。 Among them, AR is the evaluation index, n=20.
为了更加清楚的体现本发明所提供的一种视频分类特征提取方法及视频转码重压缩检测方法的优势,本实施例中分别采用本发明的视频分类特征提取方法及视频转码重压缩检测方法和采用I帧的PU划分类型对应的像素块数目的共生矩阵作为视频分类特征的视频转码重压缩检测方法进行对比。表3为本发明QCIF格式的重压缩视频的检测率,表3为本发明CIF格式的重压缩视频的检测率,表5为采用I帧的PU划分类型对应的像素块数目的共生矩阵作为视频分类特征的视频转码重压缩检测方法的QCIF格式的重压缩视频的检测率。In order to more clearly reflect the advantages of the video classification feature extraction method and the video transcoding recompression detection method provided by the present invention, the video classification feature extraction method and the video transcoding recompression detection method of the present invention are respectively adopted in this embodiment. It is compared with the video transcoding recompression detection method that uses the co-occurrence matrix of the number of pixel blocks corresponding to the PU division type of I frame as the video classification feature. Table 3 is the detection rate of the recompressed video of the QCIF format of the present invention, Table 3 is the detection rate of the recompressed video of the CIF format of the present invention, and Table 5 is the co-occurrence matrix of the number of pixel blocks corresponding to the PU division type of the I frame as the video Detection rate of recompressed video in QCIF format for video transcoding recompression detection method of classification features.
表3.本发明QCIF格式的重压缩视频的检测率Table 3. Detection rate of the recompressed video of the QCIF format of the present invention
表4.本发明CIF格式的重压缩视频的检测率Table 4. Detection rate of recompressed video in CIF format of the present invention
表5.采用I帧的PU划分类型对应的像素块数目的共生矩阵作为视频分类特征的视频转码重压缩检测方法的QCIF格式的重压缩视频的检测率Table 5. The detection rate of the recompressed video in the QCIF format using the co-occurrence matrix of the number of pixel blocks corresponding to the PU division type of the I frame as the video classification feature of the video transcoding recompression detection method
从表3和表4中可以看出,采用本发明的视频分类特征提取方法及视频转码重压缩检测方法,QCIF和CIF格式的重压缩检测率均达到了90%以上,最高达到98.75%,最低为92.08%。As can be seen from Table 3 and Table 4, using the video classification feature extraction method and the video transcoding recompression detection method of the present invention, the recompression detection rates of QCIF and CIF formats both reach more than 90%, and the highest reaches 98.75%, The lowest is 92.08%.
从表5可以看出,现有技术中采用采用I帧的PU划分类型对应的像素块数目的共生矩阵作为视频分类特征的视频转码重压缩检测方法,QCIF格式的重压缩视频检测率在77%-90%之间。本发明重压缩视频检测正确率在91%-97.5%之间,明显高于现有技术。同时,实施例中采用发明的视频分类特征提取方法及视频转码重压缩检测方法,PU划分类型为25种,现有技术中采用I帧的PU划分类型对应的像素块数目的共生矩阵作为视频分类特征的视频转码重压缩检测方法的PU划分类型为100种。本发明的PU划分类型为现有技术的1/4。本发明在PU划分类型维数低于现有技术,在降低了计算量同时,提高了重压缩视频的检测率。本发明一种视频分类特征提取方法及视频转码重压缩检测方法更为有效。As can be seen from Table 5, in the prior art, the co-occurrence matrix of the number of pixel blocks corresponding to the PU division type of I frame is adopted as the video transcoding recompression detection method of the video classification feature, and the recompression video detection rate of the QCIF format is 77%. %-90%. The detection accuracy rate of the recompressed video of the present invention is between 91% and 97.5%, which is obviously higher than that of the prior art. Meanwhile, the video classification feature extraction method and the video transcoding recompression detection method of the invention are adopted in the embodiment, the PU division types are 25, and the co-occurrence matrix of the number of pixel blocks corresponding to the PU division type of one frame is used as the video in the prior art. There are 100 types of PU division in the video transcoding and recompression detection method of categorical features. The PU division type of the present invention is 1/4 of the prior art. In the present invention, the dimension of the PU division type is lower than that of the prior art, and at the same time, the calculation amount is reduced, and the detection rate of the recompressed video is improved. The video classification feature extraction method and the video transcoding and recompression detection method of the present invention are more effective.
本发明提供的一种视频分类特征提取方法及视频转码重压缩检测方法,针对H.264至HEVC标准的视频转码重压缩视频进行重压缩检测,提取的分类特征维数少,能够达到更高的检测率。The invention provides a video classification feature extraction method and a video transcoding recompression detection method, which perform recompression detection for video transcoding recompressed videos of H.264 to HEVC standards, and the extracted classification features have fewer dimensions, which can achieve higher high detection rate.
本发明的各部分可以用硬件、软件、固件或者它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可以用本领域共知的下列技术中的任一项或者他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。Portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented using any one of the following techniques known in the art, or a combination thereof: having logic gates for implementing logic functions on data signals Circuits are discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.
在流程图中表示或者在此以其它方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读存储介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。“计算机可读存储介质”可以包括能够存储或传输信息的任何介质。机器可读介质的例子包括电子电路、半导体存储器设备、ROM、闪存、可擦除ROM(EROM)、软盘、CD-ROM、光盘、硬盘、光纤介质、射频(RF)链路,等等。Logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, and may be embodied in any computer-readable storage medium , for use by, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch and execute instructions from an instruction execution system, apparatus, or device), device or equipment. A "computer-readable storage medium" may include any medium capable of storing or transmitting information. Examples of machine-readable media include electronic circuits, semiconductor memory devices, ROM, flash memory, erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, radio frequency (RF) links, and the like.
如上针对一个实施例描述和/或示出的特征可以以相同或类似的方式在一个或更多个其它实施例中使用,和/或与其它实施例中的特征相结合或替代其它实施例中的特征使用。Features described and/or illustrated above for one embodiment may be used in the same or similar manner in one or more other embodiments, and/or in combination with or instead of features in other embodiments features are used.
还需要说明的是,本发明中提及的示例性实施例,基于一系列的步骤或者装置描述一些方法或系统。但是,本发明不局限于上述步骤的顺序,也就是说,可以按照实施例中提及的顺序执行步骤,也可以不同于实施例中的顺序,或者若干步骤同时执行。It should also be noted that the exemplary embodiments mentioned in the present invention describe some methods or systems based on a series of steps or devices. However, the present invention is not limited to the order of the above steps, that is, the steps may be performed in the order mentioned in the embodiments, or may be different from the order in the embodiments, or several steps may be performed simultaneously.
此外,本发明提供的方法同样也可以适用于其他HEVC标准之前压缩格式标准转码为HEVC格式的场景,例如也可以适用于H.261、H.263、H.263+、MPEG-1、MPEG-2和MPEG-4中的任一种标准。In addition, the method provided by the present invention can also be applied to other scenarios where the compression format standard before the HEVC standard is transcoded into the HEVC format, for example, it can also be applied to H.261, H.263, H.263+, MPEG-1, MPEG -2 and either standard MPEG-4.
结合这里披露的本发明的说明和实践,本发明的其他实施例对于本领域技术人员都是易于想到和理解的。说明和实施例仅被认为是示例性的,本发明的真正范围和主旨均由权利要求所限定。Other embodiments of the present invention will be readily apparent to and understood by those skilled in the art in conjunction with the specification and practice of the present invention disclosed herein. The description and examples are to be regarded as exemplary only, with the true scope and spirit of the invention being defined by the claims.
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810144370.0A CN108366295B (en) | 2018-02-12 | 2018-02-12 | Video classification feature extraction method, transcoding recompression detection method and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810144370.0A CN108366295B (en) | 2018-02-12 | 2018-02-12 | Video classification feature extraction method, transcoding recompression detection method and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108366295A CN108366295A (en) | 2018-08-03 |
CN108366295B true CN108366295B (en) | 2020-07-14 |
Family
ID=63006028
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810144370.0A Expired - Fee Related CN108366295B (en) | 2018-02-12 | 2018-02-12 | Video classification feature extraction method, transcoding recompression detection method and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108366295B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111837140B (en) * | 2018-09-18 | 2024-08-06 | 谷歌有限责任公司 | Video coding receptive field consistent convolution model |
CN109587498B (en) * | 2018-12-25 | 2022-07-08 | 华南农业大学 | A method for identifying HEVC video transcoded by AVC |
CN109982071B (en) * | 2019-03-16 | 2020-08-11 | 四川大学 | HEVC (high efficiency video coding) dual-compression video detection method based on space-time complexity measurement and local prediction residual distribution |
CN114630103B (en) * | 2020-12-11 | 2025-04-08 | 上海哔哩哔哩科技有限公司 | Video processing method and device |
CN113065533B (en) * | 2021-06-01 | 2021-11-02 | 北京达佳互联信息技术有限公司 | Feature extraction model generation method and device, electronic equipment and storage medium |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006279573A (en) * | 2005-03-29 | 2006-10-12 | Sanyo Electric Co Ltd | Encoder and encoding method, and decoder and decoding method |
CN101790097B (en) * | 2010-03-05 | 2012-05-09 | 天津大学 | Method for detecting multiple times of compression and coding of digital video |
US8711940B2 (en) * | 2010-11-29 | 2014-04-29 | Mediatek Inc. | Method and apparatus of motion vector prediction with extended motion vector predictor |
CN102413327A (en) * | 2011-11-10 | 2012-04-11 | 中山大学深圳研究院 | Video tampering detection method based on compressed sensing semi-fragile watermark |
CN105072444B (en) * | 2015-07-31 | 2018-01-05 | 宁波大学 | A kind of HEVC video second-compressed detection methods under different quantization parameters |
-
2018
- 2018-02-12 CN CN201810144370.0A patent/CN108366295B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN108366295A (en) | 2018-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108366295B (en) | Video classification feature extraction method, transcoding recompression detection method and storage medium | |
KR101599875B1 (en) | Method and apparatus for multimedia encoding based on attribute of multimedia content, method and apparatus for multimedia decoding based on attributes of multimedia content | |
JP5373910B2 (en) | CABAC stream compliance | |
CN101978698B (en) | Method and apparatus for encoding and decoding image | |
CN107197297B (en) | A Video Steganalysis Method for Detecting Steganography Based on DCT Coefficients | |
US9609338B2 (en) | Layered video encoding and decoding | |
CN103067713B (en) | Method and system of bitmap joint photographic experts group (JPEG) compression detection | |
Luo et al. | MPEG recompression detection based on block artifacts | |
CN110896483B (en) | Method for compressing and decompressing image data | |
CN108965887B (en) | Video information hiding method and device based on inter-block decoupling | |
Xu et al. | Detection of double MPEG-2 compression based on distributions of DCT coefficients | |
CN104853186B (en) | An Improved Video Steganalysis Method Based on Motion Vector Recovery | |
CN101313582A (en) | Encoder-assisted frame rate up-conversion using various motion models | |
KR20120049881A (en) | Vector embedded graphics coding | |
KR20100068529A (en) | Apparatus and method for detecting scene change | |
Tan et al. | GOP based automatic detection of object-based forgery in advanced video | |
EP2014099A2 (en) | Method and apparatus for encoding/transcoding and decoding | |
Xu et al. | Detection of video transcoding for digital forensics | |
Chen et al. | Detecting double MPEG compression with the same quantiser scale based on MBM feature | |
Neufeld et al. | A study of embedding operations and locations for steganography in H. 264 video | |
CN102547371B (en) | A secondary compression detection method based on H.264/AVC video | |
Idbeaa et al. | An adaptive compressed video steganography based on pixel-value differencing schemes | |
CN110728613B (en) | Non-additive distortion JPEG image steganography method based on blocking effect | |
US10536697B2 (en) | Method for re-encoding image and apparatus therefor | |
CN102595136A (en) | H.264/AVC (automatic volume control) video secondary compression detection method based on quantized coefficient statistical properties |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200714 Termination date: 20210212 |