CN1788499A

CN1788499A - Video quality evaluation device, video quality evaluation method, video quality evaluation program, video matching device, video matching method, and video matching program

Info

Publication number: CN1788499A
Application number: CN 200480013139
Authority: CN
Inventors: 冈本淳; 栗田孝昭
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: NTT Inc
Priority date: 2003-08-22
Filing date: 2004-08-20
Publication date: 2006-06-14
Anticipated expiration: 2024-08-20
Also published as: CN100591145C

Abstract

A subjective quality estimation unit (11) receives a reference video signal (RI) which is a video signal not to be degraded and a degraded video signal (PI) obtained by degrading the reference video signal, calculates a video signal feature amount of both signals, and estimates the subjective quality of the degraded video signal from the difference between the calculated video signal feature amounts of both signals. A feature value calculation unit (12) calculates a video signal feature value of a reference video signal. A correction information storage unit (13) stores correction information for correcting subjective quality in association with the video signal feature amount. And a correction calculation unit (14) that acquires correction information corresponding to the video signal feature of the reference video signal input from the feature calculation unit (12) from the correction information storage unit (13) and transfers the correction information to the correction unit (15). And a correction unit (15) for correcting the subjective quality of the degraded video signal estimated by the subjective quality estimation unit (11) on the basis of the forwarded correction information.

Description

Image quality evaluation device, image quality evaluation method, and image quality evaluation program, and image matching device, image matching method, and image matching program

技术领域technical field

本发明涉及不是在人们看到实际的影像后进行质量评价的主观质量评价试验，而是由影像信号的物理的特征量的测定推定主观质量的影像质量评价装置、影像质量评价方法和影像质量评价程序。The present invention relates to a video quality evaluation device, a video quality evaluation method, and a video quality evaluation that estimate subjective quality from measurement of physical feature quantities of video signals instead of a subjective quality evaluation test in which people view actual video. program.

另外，本发明还涉及在影像质量评价装置、影像质量评价方法和影像质量评价程序中，进行基准影像信号和通过经过编码或网络传送等、而其质量恶化的恶化影像信号间的空间以及时间位置的匹配的影像匹配装置、影像匹配方法和影像匹配程序。In addition, the present invention also relates to the determination of spatial and temporal positions between a reference video signal and a degraded video signal whose quality has deteriorated through encoding or network transmission in the video quality evaluation device, video quality evaluation method, and video quality evaluation program. The matching image matching device, image matching method and image matching program.

背景技术Background technique

影像信息，通过某种处理，例如编码或者网络传送等，一般其质量会恶化。关于这样的恶化的影像，把人们实际看见后而感觉到的恶化的程度称为主观质量。Generally, the quality of image information will deteriorate through certain processing, such as encoding or network transmission. Regarding such a deteriorated video, the degree of deterioration perceived by people after actually seeing it is called subjective quality.

历来，作为评价该主观质量的方法，进行根据人们实际看见影像的主管质量评价试验。但是，对于人们实际看见影像后评价该影像的质量，存在费事费工且在影像的质量评价上花费时间的问题。另外，精通影像质量评价的人和初学者，大多评价不同。Conventionally, as a method of evaluating this subjective quality, a subjective quality evaluation test based on what people actually see of a video has been performed. However, for people to evaluate the quality of the video after actually seeing the video, there is a problem that it takes a lot of time and effort to evaluate the quality of the video. In addition, people who are proficient in image quality evaluation and beginners often have different evaluations.

因此，作为由物理测定推定影像的主观质量的方法，有根据基准影像信号和恶化的恶化影像信号的物理特征量的差，或者根据仅由恶化影像信号的物理特征量求得的恶化量，客观地评价质量的恶化程度的方法。Therefore, as a method of estimating the subjective quality of video by physical measurement, there is the difference between the physical feature quantity of the reference video signal and the degraded degraded video signal, or based on the deterioration amount obtained only from the physical characteristic quantity of the degraded video signal. A method for accurately assessing the degree of deterioration in quality.

但是，影像的主观质量，如果限定影像的话，虽然有时也可以高精度地推定主观评价值(例如参照ANSI T1.801.03-1996，“Digital Transport ofOne-Way Video Signal Parameters for Objective Performance Assessment”；冈本、高桥“关于影像质量客观评价技术的适用法的探讨”，信学ソ大，Sep.2002；冈本、栗田、高桥“关于提高影像质量客观评价的性能的探讨”，信学ソ大，Mar.2003)，但是对于任意的影像，因为极大依赖影像的性质，所以即使在恶化量的程度相同的场合，作为结果判断的主管质量也多有不同。However, if the subjective quality of the video is limited, the subjective evaluation value can sometimes be estimated with high accuracy (for example, refer to ANSI T1.801.03-1996, "Digital Transport of One-Way Video Signal Parameters for Objective Performance Assessment"; Okamoto , Takahashi, "Discussion on Applicable Laws of Objective Image Quality Evaluation Technology", Shingaku University, Sep.2002; Okamoto, Kurita, Takahashi "Discussion on Improving the Performance of Image Quality Objective Evaluation", Shingaku University , Mar.2003), but for any image, because it greatly depends on the nature of the image, even if the degree of deterioration is the same, the quality of the result judged will be different.

从这样的理由出发，在求影像主观质量的场合，仍然大多是人们实际看见该影像后来进行评价其质量这样的主观评价试验，这样的技术状况一直在持续。For this reason, in the case of seeking the subjective quality of an image, there are still many subjective evaluation tests in which people actually see the image and evaluate its quality. Such a technical situation continues.

在这样的状况中，提出了使其能够像接近人看见的场合那样评价主观质量为目的，着眼于影像的边缘区域，对影像信号使用Sobel滤波器，计算其恶化量的客观评价法(参照PCT Pub.No.：WO99/45715)。In such a situation, an objective evaluation method has been proposed that focuses on the edge area of the image, applies a Sobel filter to the image signal, and calculates the amount of deterioration (see PCT Pub. No.: WO99/45715).

但是，仅用这样的特征量，还不能得到置换实际的主观评价的充分的评价精度。However, sufficient evaluation accuracy to replace actual subjective evaluation cannot be obtained only with such feature quantities.

另一方面，在推定影像的主观质量的方法中，作为比较基准影像信号和恶化的恶化影像信号的物理特征量的前提，需要匹配基准影像信号和恶化影像信号的空间及时间的位置。亦即，在基准影像信号和恶化影像信号之间，需要匹配时间方向的偏离或者空间位置的偏离。On the other hand, in the method of estimating the subjective quality of video, it is necessary to match the spatial and temporal positions of the reference video signal and the degraded video signal as a prerequisite for comparing the physical feature quantities of the reference video signal and the degraded degraded video signal. That is, it is necessary to match the deviation in the time direction or the deviation in the spatial position between the reference video signal and the degraded video signal.

原来，这样的匹配处理是用手动进行的。但是，不久，提出了自动进行匹配处理的方案。例如，在美国专利5,446,492中，作为客观评价的前处理，在时间匹配处理时发生了延迟的场合也能对其进行应对。Originally, such a matching process was performed manually. Soon, however, a scheme for automatically performing the matching process was proposed. For example, in US Pat. No. 5,446,492, delays in time matching processing can be dealt with as pre-processing for objective evaluation.

根据该文献的技术，像电视广播那样，如果对象是同大小同帧速率的信号的话，则一旦取得同步，以后就能取得良好的同步。According to the technology of this document, as in television broadcasting, if the target is a signal of the same size and frame rate, once synchronization is obtained, good synchronization can be obtained thereafter.

但是，在近年来开始普及的经由因特网等的IP网络的影像分发或通信服务中，因为在个人计算机(PC)终端中接收影像，所以接收的影像的大小或长宽比各种各样。另外，也有时损失像数据包那样某种程度集中的信息。因此，难于取得基准影像信号和恶化影像信号的空间的位置对应。另一方面，由于IP包到达间隔的波动或者包损失的发生，发生了诸如在时间轴上的影像显示的定时偏离或者影像显示定时的波动或者冻结这些过去不存在的现象。因此，没有取得基准影像信号和恶化影像信号之间这样的时间上偏离的匹配的方法。However, in video distribution or communication services via IP networks such as the Internet that have become popular in recent years, since a personal computer (PC) terminal receives video, the received video has various sizes and aspect ratios. In addition, there is a case where information concentrated to some extent such as data packets is lost. Therefore, it is difficult to obtain spatial position correspondence between the reference video signal and the degraded video signal. On the other hand, due to fluctuations in IP packet arrival intervals or occurrence of packet loss, phenomena such as timing deviation of video display on the time axis or fluctuations in video display timing or freezes occur that did not exist in the past. Therefore, there is no method of achieving such temporally offset matching between the reference video signal and the degraded video signal.

在推定影像的主观质量的方法中，如果不能充分地取得基准影像信号和恶化影像信号的空间以及时间的位置匹配的话，最终将不可能进行影像的主观质量的推定，即使能其精度也低。In the method of estimating the subjective quality of video, if the spatial and temporal position matching of the reference video signal and the degraded video signal cannot be sufficiently obtained, it will be impossible to estimate the subjective quality of video, and the accuracy will be low even if it is possible.

此外，在前面的PCT Pub.No.：WO99/45715中，作为记述了时间上匹配处理的一例的文献，可以举出ITU-T Contribution COM-12-29“Draft newrecommendation on multimedia communication delay，synchronization，and framerate measurement”，December 1997。In addition, in the aforementioned PCT Pub. No.: WO99/45715, as a document describing an example of temporal matching processing, ITU-T Contribution COM-12-29 "Draft new recommendation on multimedia communication delay, synchronization, and framerate measurement”, December 1997.

发明内容Contents of the invention

本发明是鉴于上述的现有技术提出的，其目的是，提供能够对任意影像高精度地推定统一的主观质量的影像质量评价装置、影像质量评价方法和影像质量评价程序。The present invention is made in view of the above-mentioned prior art, and an object of the present invention is to provide a video quality evaluation device, a video quality evaluation method, and a video quality evaluation program capable of estimating a unified subjective quality of an arbitrary video with high accuracy.

另外，本发明的目的是，提供在推定主观质量中比较基准影像信号和恶化影像信号的物理特征量时，能够确实取得它们的空间以及时间的位置匹配的影像匹配装置、影像匹配方法和影像匹配程序。Another object of the present invention is to provide an image matching device, an image matching method, and an image matching device that can reliably obtain their spatial and temporal position matching when comparing physical feature quantities of a reference image signal and a degraded image signal in estimating subjective quality. program.

为实现上述目的，第一形态的发明的要旨是一种影像质量评价装置，具有：输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号，计算两信号的影像信号特征量，根据计算出的两信号的影像信号特征量的差，推定所述恶化影像信号的主观质量的主观质量推定部；与影像信号特征量对应起来存储用于修正所述主观质量的修正信息的修正信息存储部；输入所述基准影像信号的所述影像信号特征量，从所述修正信息存储部取得与已输入的影像信号特征量对应的修正信息，根据取得的修正信息修正所述推定的主观质量的主观质量修正部。In order to achieve the above objects, the gist of the first aspect of the invention is an image quality evaluation device that includes: inputting a reference image signal that is an undegraded image signal and a deteriorated image signal after the reference image signal has been deteriorated, and calculating an image of the two signals. a signal feature value, a subjective quality estimating unit for estimating the subjective quality of the degraded video signal based on the calculated difference between the video signal feature values of the two signals; an information correction information storage unit; input the video signal feature value of the reference video signal, obtain correction information corresponding to the input video signal feature value from the correction information storage unit, and correct the The subjective quality correction part of the estimated subjective quality.

第二形态的发明的要旨是，在第一形态的发明中，所述主观质量推定部，根据表示在影像信号中包含的影像的各帧内的影像状态的空间信息以及表示在帧间的影像变化的时间信息的至少任何一个的恶化影像信号和基准影像信号间的差推定所述主观质量。The gist of the invention of the second aspect is that in the invention of the first aspect, the subjective quality estimating unit uses the spatial information indicating the video state in each frame of the video contained in the video signal and the video information between frames. The subjective quality is estimated from a difference between the degraded video signal and the reference video signal of at least any one of the changed temporal information.

第三形态的发明的要旨是，在第二形态的发明中，所述主观质量推定部，作为所述恶化影像信号和所述基准影像信号间的所述影像信号特征量的差，计算表示在影像信号中包含的影像的各帧内的恶化量的边缘电力量(E)、以及表示在帧间的恶化量的移动电力量(M)的至少一方。The gist of the third aspect of the invention is that, in the second aspect of the invention, the subjective quality estimating unit calculates, as a difference in the video signal feature value between the degraded video signal and the reference video signal, the At least one of an edge power amount (E) of a deterioration amount in each frame of a video included in the video signal and a mobile power amount (M) indicating a deterioration amount between frames.

第四形态的发明的要旨是，在第二形态的发明中，所述主观质量推定部，根据在ITU-R Recommendation P.910中规定的spatial information以及temporalinformation中至少任何一个的、恶化影像信号和基准影像信号间的差推定所述主观质量。The gist of the fourth aspect of the invention is that, in the second aspect of the invention, the subjective quality estimating unit, based on at least any one of spatial information and temporal information specified in ITU-R Recommendation P.910, degraded video signal and The difference between the reference video signals estimates the subjective quality.

第五形态的发明的要旨是，在第一形态的发明中，修正信息存储部，作为用于修正所述主观质量的修正信息，与表示在影像信号中包含的在影像的各帧内的影像状态的空间信息以及表示在帧间的影像变化的时间信息对应起来存储修正系数。The gist of the fifth aspect of the invention is that, in the first aspect of the invention, the correction information storage unit, as the correction information for correcting the subjective quality, is related to the image in each frame of the image included in the image signal. The spatial information of the state and the temporal information indicating the change of the image between frames are stored in association with correction coefficients.

第六形态的发明的要旨是，在第一形态的发明中，所述主观质量推定部，具有：输入所述基准影像信号和所述恶化影像信号，生成关于来自所述基准影像信号的基准影像帧和来自所述恶化影像信号的恶化影像帧的时间的偏离、和所述基准影像帧和所述恶化影像帧的空间的偏离的校准信息的校准信息生成部；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，根据表示所述基准影像帧和所述恶化影像帧的影像状态的空间信息计算各自的空间特征量的空间特征量计算部；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，对于所述基准影像帧和所述恶化影像帧，根据表示各自的帧间的影像变化的时间信息计算各自的时间特征量的时间特征量计算部；根据所述空间特征量和所述时间特征量推定所述恶化影像信号的主观质量的推定部。The gist of the sixth aspect of the invention is that, in the first aspect of the invention, the subjective quality estimating unit includes: inputting the reference video signal and the degraded video signal, and generating a reference video from the reference video signal; a calibration information generation unit for calibration information of a temporal deviation of a degraded video frame from the degraded video signal, and a calibration information of a spatial deviation between the reference video frame and the degraded video frame; After the spatial offset and the temporal offset are determined, a spatial feature calculation unit that calculates respective spatial feature quantities based on the spatial information representing the image states of the reference image frame and the degraded image frame; based on the calibration information, After the spatial deviation and the temporal deviation are eliminated, for the reference video frame and the degraded video frame, the temporal feature value calculation of calculating the respective temporal feature values based on the time information representing the image change between the respective frames and an estimating unit for estimating the subjective quality of the degraded video signal based on the spatial feature value and the temporal feature value.

另外，为实现上述目的，第七形态的发明是一种影像质量评价方法，该方法，是在具有与影像信号特征量对应起来存储用于修正作为未恶化的影像信号的基准影像信号恶化后的恶化影像信号被推定出的主观质量的修正信息的修正信息存储部、推定所述恶化影像信号的主观质量的影像质量评价装置中的影像质量评价方法；其特征在于，输入所述基准影像信号和所述恶化影像信号，计算两信号的影像信号特征量，根据算出的两信号的影像信号特征量的差、推定所述恶化影像信号的所述主观质量，从所述修正信息存储部取得与算出的所述基准影像信号的影像信号特征量对应的修正信息，根据取得的修正信息修正所述推定的主观质量。In addition, in order to achieve the above object, the seventh aspect of the invention is a method for evaluating video quality, which includes storing a degraded reference video signal for correcting a non-degraded video signal in association with the feature value of the video signal. A video quality evaluation method in a correction information storage unit for correction information of subjective quality of which a degraded video signal is estimated, and a video quality evaluation device for estimating the subjective quality of the degraded video signal; characterized in that the reference video signal and For the degraded video signal, the video signal feature values of the two signals are calculated, and the subjective quality of the degraded video signal is estimated based on the difference between the calculated video signal feature values of the two signals, and obtained and calculated from the correction information storage unit. Correction information corresponding to the video signal feature quantity of the reference video signal, and correcting the estimated subjective quality according to the obtained correction information.

另外，为实现上述目的，第八形态的发明是一种影像质量评价程序，其特征在于，使计算机作为下述单元工作：输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号，计算两信号的影像信号特征量，根据计算出的两信号的影像信号特征量的差，推定所述恶化影像信号的主观质量的主观质量推定单元；与影像信号特征量对应起来存储用于修正所述主观质量的修正信息的修正信息存储单元；从所述修正信息存储单元取得对应算出的所述基准影像信号的影像信号特征量的修正信息，根据取得的修正信息修正所述推定出的主观质量的主观质量修正单元。In addition, in order to achieve the above-mentioned object, the invention of the eighth aspect is a video quality evaluation program characterized in that a computer is operated as a unit for inputting a reference video signal which is an undegraded video signal and a degraded reference video signal. The degraded image signal, calculate the image signal characteristic quantity of two signals, according to the difference of calculated image signal characteristic quantity of two signals, estimate the subjective quality estimation unit of the subjective quality of described deteriorated image signal; Correspond to the image signal characteristic quantity a correction information storage unit that stores correction information for correcting the subjective quality; obtaining correction information corresponding to the calculated image signal feature quantity of the reference image signal from the correction information storage unit, and correcting the Subjective quality correction unit for the estimated subjective quality.

另外，为实现上述目的，第九形态的发明是一种影像质量评价装置，其特征在于，具有：输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号，生成关于来自所述基准影像信号的基准影像帧和来自所述恶化影像信号的恶化影像帧的时间的偏离、和所述基准影像帧和所述恶化影像帧的空间的偏离的校准信息的校准信息生成部；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，根据表示所述基准影像帧和所述恶化影像帧的影像状态的空间信息计算各自的空间特征量的空间特征量计算部；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，对于所述基准影像帧和所述恶化影像帧，根据表示各自的帧间的影像变化的时间信息计算各自的时间特征量的时间特征量计算部；根据所述空间特征量和所述时间特征量推定所述恶化影像信号的主观质量的主观质量推定部。In addition, in order to achieve the above object, the ninth aspect of the invention is a video quality evaluation device characterized by comprising: inputting a reference video signal which is an undegraded video signal and a degraded video signal obtained by degrading the reference video signal; Calibration information generation of calibration information regarding a temporal deviation between a reference video frame from the reference video signal and a degraded video frame from the degraded video signal, and a spatial deviation between the reference video frame and the degraded video frame according to the calibration information, after eliminating the spatial deviation and the temporal deviation, calculating the spatial characteristics of the respective spatial characteristic quantities according to the spatial information representing the image state of the reference image frame and the deteriorated image frame An amount calculation unit; after eliminating the spatial deviation and the temporal deviation based on the calibration information, for the reference video frame and the degraded video frame, calculate according to the time information representing the video change between the respective frames a temporal feature calculation unit for each temporal feature; and a subjective quality estimation unit for estimating the subjective quality of the degraded video signal based on the spatial feature and the temporal feature.

第十形态的发明的要旨是，在第九形态的发明中，进一步具有：在把在所述恶化影像信号中包含的恶化影像的文件形式变换为在对应的所述基准影像信号中包含的基准影像的文件形式的同时，输出与其统一后的文件形式关联的信息的形式变换部，和对应与所述文件形式关联的信息、存储用于推定所述恶化影像信号的主观质量的系数的修正系数存储部；所述主观质量推定部，从所述修正系数存储部取得对应与从所述形式变换部输入的所述统一的文件形式关联的信息的所述系数，根据所述空间特征量、所述时间特征量、以及所述取得的系数，推定所述恶化影像信号的主观质量。The gist of the tenth aspect of the invention is that, in the ninth aspect of the invention, further comprising: converting the file format of the degraded video included in the degraded video signal into a reference file format included in the corresponding reference video signal. A format conversion unit that outputs information related to the unified file format along with the file format of the video, and a correction coefficient that stores a coefficient for estimating the subjective quality of the degraded video signal corresponding to the information related to the file format a storage unit; the subjective quality estimating unit obtains, from the correction coefficient storage unit, the coefficient corresponding to the information associated with the unified document format input from the format conversion unit, and based on the spatial feature value, the The subjective quality of the degraded video signal is estimated based on the temporal feature value and the acquired coefficient.

第十一形态的发明的要旨是，在第十形态的发明中，所述形式变换部，作为与所述统一的文件形式关联的信息，输出所述恶化影像信号的信号形式、由所述恶化影像信号发送来的恶化影像的信息量、以及所述恶化影像信号的编码方式中至少任何一个，所述修正系数存储部，对应所述恶化影像信号的信号形式、由所述恶化影像信号发送来的恶化影像的信息量、以及所述恶化影像信号的编码方式中至少任何一个，存储最佳的系数。The gist of the eleventh aspect of the invention is that, in the tenth aspect of the invention, the format conversion unit outputs, as information related to the unified file format, the signal format of the degraded video signal, obtained by the degraded At least one of the amount of information of the degraded video transmitted from the video signal and the encoding method of the degraded video signal, the correction coefficient storage unit corresponding to the signal format of the degraded video signal and transmitted from the degraded video signal The optimal coefficient is stored in at least any one of the amount of information of the degraded video and the encoding method of the degraded video signal.

第十二形态的发明的要旨是，在第九或者第十形态的发明中，所述空间特征量计算部，根据所述基准影像信号和所述恶化影像信号，将定量化在帧内亮度值急剧变化的边界发生的恶化的指标作为所述空间特征量进行计算。The gist of the invention of the twelfth aspect is that, in the ninth or tenth aspect of the invention, the spatial feature value calculation unit quantifies the intra-frame luminance value based on the reference video signal and the degraded video signal An index of deterioration of sharply changing boundaries is calculated as the spatial feature quantity.

第十三形态的发明的要旨是，在第十二形态的发明中，所述空间特征量计算部，根据所述基准影像信号和所述恶化影像信号，将在ANSIT1.801.03-1995中规定的边缘电力量作为所述空间特征量进行计算。The gist of the invention of the thirteenth aspect is that, in the invention of the twelfth aspect, the spatial feature value calculating unit calculates the value specified in ANSIT1.801.03-1995 based on the reference video signal and the degraded video signal. The amount of marginal power is calculated as the spatial feature amount.

第十四形态的发明的要旨是，在第九、第十或者第十二形态的发明中，所述空间特征量计算部，比较所述基准影像帧，将定量化在对应该基准影像帧的所述恶化影像帧亮度值急剧变化的边界在水平垂直方向上发生的情况的指标作为所述空间特征量进行计算。The gist of the fourteenth aspect of the invention is that, in the ninth, tenth, or twelfth aspect of the invention, the spatial feature value calculation unit compares the reference video frame and quantifies the value corresponding to the reference video frame. An index of occurrence of a boundary in which the luminance value of the degraded video frame changes sharply in the horizontal and vertical directions is calculated as the spatial feature value.

第十五形态的发明的要旨是，在第九、第十、第十二或者第十四形态的发明中，所述空间特征量计算部，根据在每一个作为帧内的大于等于1像素以上的集合的块中基准影像在ITU-R Recommendation P.910中规定的Temporal Information值和恶化影像在ITU-R Recommendation P.910中规定的Temporal Information值的差，将影像的帧间的变化量作为所述时间特征量进行计算。The gist of the fifteenth aspect of the invention is that, in the ninth, tenth, twelfth, or fourteenth aspect of the invention, the spatial feature value calculation unit calculates a value of at least 1 pixel in each frame. The difference between the Temporal Information value specified in ITU-R Recommendation P.910 of the reference image in the block of the set and the Temporal Information value specified in ITU-R Recommendation P.910 of the deteriorated image, and the amount of change between frames of the image is taken as The temporal feature quantity is calculated.

第十六形态的发明的要旨是，在第九或者第十形态的发明中，进而具有与空间特征量和时间特征量对应起来存储用于修正所述主观质量的修正信息的修正信息存储部；输入所述基准影像信号的所述空间特征量以及所述时间特征量、从所述修正信息存储部取得对应输入的空间特征量和时间特征量的修正信息、根据取得的修正信息修正所述推定出的主观质量的主观质量修正部。The gist of the sixteenth aspect of the invention is that, in the ninth or tenth aspect of the invention, there is further provided a correction information storage unit that stores correction information for correcting the subjective quality in association with the spatial feature value and the temporal feature value; Inputting the spatial feature value and the temporal feature value of the reference video signal, obtaining correction information corresponding to the input spatial feature value and temporal feature value from the correction information storage unit, and correcting the estimation based on the obtained correction information. The subjective quality correction part of the subjective quality.

另外，为实现上述目的，第十七形态的发明的要旨是一种影像质量评价方法，其特征在于，输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号；生成关于来自所述基准影像信号的基准影像帧和来自所述恶化影像信号的恶化影像帧的时间的偏离、和所述基准影像帧和所述恶化影像帧的空间的偏离的校准信息；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，根据表示所述基准影像帧和所述恶化影像帧的影像状态的空间信息计算各自的空间特征量；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，对于所述基准影像帧和所述恶化影像帧，根据表示各自的帧间的影像变化的时间信息计算各自的时间特征量；根据所述空间特征量和所述时间特征量推定所述恶化影像信号的主观质量。In addition, in order to achieve the above object, the gist of the seventeenth aspect of the invention is a video quality evaluation method characterized by inputting a reference video signal which is an undegraded video signal and a degraded video signal obtained by degrading the reference video signal; generating calibration information regarding a temporal offset of a reference image frame from the reference image signal and a degraded image frame from the degraded image signal, and a spatial offset between the reference image frame and the degraded image frame; according to the The calibration information, after eliminating the spatial deviation and the temporal deviation, calculate respective spatial feature quantities according to the spatial information representing the image state of the reference image frame and the deteriorated image frame; according to the calibration information, After the spatial deviation and the temporal deviation are eliminated, for the reference image frame and the deteriorated image frame, calculate respective temporal feature quantities according to time information representing image changes between respective frames; according to the spatial The feature quantity and the temporal feature quantity estimate the subjective quality of the degraded video signal.

另外，为实现上述目的，第十八形态的发明的要旨是影像质量评价方法，其特征在于，输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化的恶化影像信号；把在所述恶化影像信号中包含的恶化影像的文件形式变换为在对应的所述基准影像信号中包含的基准影像的文件形式；生成与该统一过的文件形式关联的信息；生成关于来自所述基准影像信号的基准影像帧和来自所述恶化影像信号的文件形式被变换过的恶化影像的恶化影像帧间的时间的偏离、和所述基准影像帧和所述恶化影像帧间的空间的偏离的校准信息；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，根据表示所述基准影像帧和所述恶化影像帧的影像状态的空间信息计算各自的空间特征量；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，对于所述基准影像帧和所述恶化影像帧，根据表示各自的帧间的影像变化的时间信息计算各自的时间特征量；根据所述空间特征量、所述时间特征量和对应与所述统一过的文件形式关联的信息的、用于主观质量推定的系数，推定所述恶化影像信号的主观质量。In addition, in order to achieve the above object, the gist of the eighteenth aspect of the invention is a video quality evaluation method characterized in that a reference video signal which is an undegraded video signal and a degraded video signal degraded by the reference video signal are input; Converting the file format of the degraded image included in the degraded image signal to the file format of the reference image included in the corresponding reference image signal; generating information associated with the unified file format; generating information about the reference image from the reference image Calibration of a temporal offset between a reference image frame of a signal and a degraded image from a degraded image whose file format has been converted from said degraded image signal, and a spatial offset between said reference image frame and said degraded image frame information; according to the calibration information, after the spatial deviation and the temporal deviation are eliminated, the respective spatial feature quantities are calculated according to the spatial information representing the image state of the reference image frame and the deteriorated image frame; according to the The calibration information, after eliminating the spatial deviation and the temporal deviation, for the reference image frame and the deteriorated image frame, calculate respective time feature values according to time information representing image changes between respective frames; The subjective quality of the degraded video signal is estimated based on the spatial feature value, the temporal feature value, and a coefficient for subjective quality estimation corresponding to information associated with the unified file format.

另外，为实现上述目的，第十九形态的发明是一种影像质量评价程序，该程序使计算机作为下述单元工作：输入作为未恶化的影像信号的基准影像信号和其基准影像信号恶化的恶化影像信号、生成关于来自所述基准影像信号的基准影像帧和来自所述恶化影像信号的恶化影像帧间的时间的偏离、和所述基准影像帧和所述恶化影像帧间的空间的偏离的校准信息的校准信息生成单元；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，根据表示所述基准影像帧和所述恶化影像帧的影像状态的空间信息计算各自的空间特征量的空间特征量计算单元；根据所述校准信息，在消除了所述空间偏离和所述时间偏离之后，对于所述基准影像帧和所述恶化影像帧，根据表示各自的帧间的影像变化的时间信息计算各自的时间特征量的时间特征量计算单元；根据所述空间特征量和所述时间特征量，推定所述恶化影像信号的主观质量的主观质量推定单元。In addition, in order to achieve the above-mentioned object, the invention of the nineteenth aspect is a video quality evaluation program which causes a computer to operate as a means for inputting a reference video signal which is an undegraded video signal and a degradation result of the degradation of the reference video signal. A video signal, generating information about a temporal deviation between a reference video frame from the reference video signal and a degraded video frame from the degraded video signal, and a spatial deviation between the reference video frame and the degraded video frame A calibration information generation unit for calibration information; according to the calibration information, after eliminating the spatial deviation and the temporal deviation, calculate respective A spatial feature calculation unit for a spatial feature; according to the calibration information, after eliminating the spatial deviation and the temporal deviation, for the reference image frame and the deteriorated image frame, according to the respective inter-frame a temporal feature value calculation unit for calculating respective temporal feature values based on temporal information of video changes; and a subjective quality estimation unit for estimating subjective quality of the degraded video signal based on the spatial feature value and the temporal feature value.

第二十形态的发明的要旨是，在第十九形态的发明中，进而使计算机作为下述单元工作：在把在所述恶化影像信号中包含的恶化影像的文件形式，变换为在对应的所述基准影像信号中包含的基准影像的文件形式的同时，输出与该统一过的文件形式关联的信息的形式变换单元；和使其和与所述文件形式关联的信息对应起来、存储用于推定所述恶化影像信号的主观质量的系数的修正系数存储单元；所述主观质量推定单元，从所述修正系数存储单元中取得对应与从所述形式变换单元输入的所述统一过的文件形式关联的信息的所述系数，根据所述空间特征量、所述时间特征量以及所述取得的系数，推定所述恶化影像信号的主观质量。The gist of the invention of the twentieth aspect is that, in the invention of the nineteenth aspect, the computer is further operated as a unit for converting the file format of the degraded video included in the degraded video signal into the corresponding A format conversion unit for outputting information associated with the unified file format while simultaneously outputting the file format of the reference image included in the reference image signal; and making it correspond to the information associated with the file format and storing a correction coefficient storage unit for estimating a coefficient of the subjective quality of the degraded video signal; the subjective quality estimation unit acquires from the correction coefficient storage unit a file format corresponding to the unified file format input from the format conversion unit The coefficients of the associated information are used to estimate the subjective quality of the degraded video signal based on the spatial feature value, the temporal feature value, and the acquired coefficients.

另外，为实现上述目的，第二十一形态的发明是一种影像匹配装置，其特征在于，具有：输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号，把在所述恶化影像信号中包含的恶化影像的文件形式，变换为在对应的所述基准影像信号中包含的基准影像的文件形式的形式变换部；匹配在所述基准影像信号中包含的基准影像帧和所述恶化影像信号中包含的恶化影像帧的数目以及显示定时的显示定时匹配部；和对于所述基准影像帧以及所述恶化影像帧的对象帧以及其前后几个帧，监视在所述基准影像帧以及所述恶化影像帧之间的帧的偏离或者所述恶化影像的冻结状态，且取得帧间的对应关系以及像素间的对应关系的匹配的同步/位置匹配部。In addition, in order to achieve the above object, the invention of the twenty-first aspect is a video matching device characterized by comprising: inputting a reference video signal which is an undegraded video signal and a degraded video signal obtained by degrading the reference video signal, a format conversion unit for converting the file format of the degraded image included in the degraded image signal into the file format of the reference image included in the corresponding reference image signal; matching the reference image included in the reference image signal a display timing matching unit for the number of video frames and the degraded video frames included in the degraded video signal and display timing; A synchronization/position matching unit that obtains a frame deviation between the reference image frame and the degraded image frame or a frozen state of the degraded image, and obtains a correspondence between frames and a correspondence between pixels.

第二十二形态的发明的要旨是，在第二十一形态的发明中，所述形式变换部，变换所述恶化影像的数据形式、大小、长宽比中至少一个来使与所述基准影像吻合。The gist of the twenty-second aspect of the invention is that, in the twenty-first aspect of the invention, the format conversion unit converts at least one of the data format, size, and aspect ratio of the degraded video to match the reference The images match.

第二十三形态的发明的要旨是，在第二十一形态的发明中，所述显示定时匹配部，在所述基准影像帧和所述恶化影像帧的帧速率不同的场合，通过插补或者删除所述恶化影像帧，使所述基准影像帧和所述恶化影像帧的帧速率吻合。The gist of the twenty-third aspect of the invention is that, in the twenty-first aspect of the invention, the display timing matching unit, when the frame rates of the reference video frame and the degraded video frame are different, Or delete the degraded image frame, so that the frame rates of the reference image frame and the degraded image frame match.

第二十四形态的发明的要旨是，在第二十一形态的发明中，所述显示定时匹配部，对于所述基准影像帧的显示定时，调准所述恶化影像帧的显示定时。The gist of the twenty-fourth aspect of the invention is that, in the twenty-first aspect of the invention, the display timing matching unit aligns the display timing of the degraded video frame with the display timing of the reference video frame.

第二十五形态的发明的要旨是，在第二十一形态的发明中，所述显示定时匹配部，在所述基准影像帧的显示定时不一定的场合，把所述基准影像帧和所述恶化影像帧双方的显示定时取为预定的时间间隔。The gist of the twenty-fifth aspect of the invention is that, in the twenty-first aspect of the invention, the display timing matching unit matches the reference image frame with the display timing of the reference image frame if the display timing is not constant. The display timing of both of the degraded image frames is set at a predetermined time interval.

第二十六形态的发明的要旨是，在第二十一形态的发明中，所述同步/位置匹配部，对于所述基准影像帧和所述恶化影像帧的对象帧以及其前后几个帧，比较全部帧或其特定区域的各自的特征量的推移，执行决定所述特征量的偏离成为最小的所述基准影像帧和所述恶化影像帧的时间的对应关系的宏同步处理。The gist of the twenty-sixth aspect of the invention is that, in the twenty-first aspect of the invention, the synchronization/position matching unit, for the target frame of the reference video frame and the degraded video frame, and several frames before and after them and performing a macro-synchronization process for determining a temporal correspondence between the reference video frame and the degraded video frame in which the deviation of the feature values is the smallest by comparing the transition of each feature value of all frames or specific regions thereof.

第二十七形态的发明的要旨是，在第二十六形态的发明中，所述同步/位置匹配部，对于所述基准影像帧和所述恶化影像帧的对象帧以及其前后几个帧，一边挪移所述基准影像帧和所述恶化影像帧的时间的对应关系以及像素对应关系，一边比较全部帧或其特定区域的各自的特征量，执行决定所述特征量的差异成为最小的所述基准影像帧和所述恶化影像帧的时间的对应关系以及像素对应关系的微同步/位置匹配处理。The gist of the twenty-seventh aspect of the invention is that, in the twenty-sixth aspect of the invention, the synchronization/position matching unit, for the target frame of the reference video frame and the degraded video frame, and several frames before and after them , while shifting the time correspondence and pixel correspondence between the reference image frame and the degraded image frame, comparing the respective feature quantities of all frames or specific regions thereof, and performing the process of determining that the difference of the feature quantities becomes the smallest Micro-synchronization/position matching processing of the time correspondence between the reference image frame and the degraded image frame and the pixel correspondence.

第二十八形态的发明的要旨是，在第二十七形态的发明中，所述同步/位置匹配部，初期执行所述宏同步处理以及所述微同步/位置匹配处理。The gist of the twenty-eighth aspect of the invention is that, in the twenty-seventh aspect of the invention, the synchronization and position matching unit initially executes the macro synchronization and the micro synchronization and position matching.

第二十九形态的发明的要旨是，在第二十八形态的发明中，所述同步/位置匹配部，在所述恶化影像帧陷入了冻结状态的场合，通过对帧数进行计数来计量所述冻结状态的继续时间。The gist of the twenty-ninth aspect of the invention is that, in the twenty-eighth aspect of the invention, the synchronization/position matching unit counts the number of frames when the degraded video frame is frozen. The duration of the frozen state.

第三十形态的发明的要旨是，在第二十九形态的发明中，所述同步/位置匹配部，对于所述基准影像帧和所述恶化影像帧的对象帧以及其前后几个帧，导出全部帧的各自的特征量，在对于所述基准影像帧的特征量随时间变化而所述恶化影像帧的特征量不随时间变化时，判定所述恶化影像帧陷入冻结状态。The gist of the invention of the thirtieth aspect is that, in the invention of the twenty-ninth aspect, the synchronization/position matching unit, for the target frame of the reference video frame and the degraded video frame, and several frames before and after them, The respective feature values of all frames are derived, and when the feature values of the reference video frame change over time but the feature values of the degraded video frames do not change over time, it is determined that the degraded video frame is in a frozen state.

第三十一形态的发明的要旨是，在第二十八形态的发明中，所述同步/位置匹配部，在所述恶化影像帧陷入冻结状态的场合，或者不能与所述基准影像帧同步的场合，再次执行所述宏同步处理。The gist of the thirty-first aspect of the invention is that, in the twenty-eighth aspect of the invention, the synchronization/position matching unit cannot synchronize with the reference image frame when the degraded image frame is frozen. In the case of , execute the macro synchronization process again.

第三十二形态的发明的要旨是，在第二十八形态的发明中，所述同步/位置匹配部，在所述恶化影像帧陷入帧偏离的状态的场合，输出帧的偏离数。The gist of the thirty-second aspect of the invention is that, in the twenty-eighth aspect of the invention, the synchronization/position matching unit outputs the number of frame deviations when the degraded video frame falls into a frame deviation state.

第三十三形态的发明的要旨是，在第二十一形态的发明中，进而具有从所述同步/位置匹配部输入所述基准影像信号和所述恶化影像信号，使所述恶化影像的亮度以及颜色信息与所述基准影像吻合，并把所述亮度以及颜色信息已吻合的恶化影像返回到所述同步/位置匹配部的亮度/颜色修正部。The gist of the thirty-third aspect of the invention is that, in the twenty-first aspect of the invention, the reference image signal and the degraded image signal are input from the synchronization/position matching unit, and the degraded image matching the brightness and color information with the reference image, and returning the deteriorated image whose brightness and color information have matched to the brightness/color correction unit of the synchronization/position matching unit.

另外，为实现上述目的，第三十四形态的发明是一种影像匹配方法，其特征在于，输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号；把在所述恶化影像信号中包含的恶化影像的文件形式，变换为在对应的所述基准影像信号中包含的基准影像的文件形式；匹配在所述基准影像信号中包含的基准影像帧和所述恶化影像信号中包含的恶化影像帧的数目以及显示定时；和对于所述基准影像帧以及所述恶化影像帧的对象帧以及其前后几个帧，监视在所述基准影像帧以及所述恶化影像帧之间的帧的偏离或者所述恶化影像的冻结状态，且取得帧间的对应关系以及像素间的对应关系的匹配。In addition, in order to achieve the above object, the thirty-fourth aspect of the invention is a video matching method characterized by inputting a reference video signal which is an undegraded video signal and a degraded video signal after the reference video signal has been degraded; converting the file format of the degraded image included in the degraded image signal into the file format of the reference image included in the corresponding reference image signal; matching the reference image frame included in the reference image signal with the degraded The number and display timing of the degraded video frames contained in the video signal; and for the target frame of the reference video frame and the degraded video frame and several frames before and after it, monitor the reference video frame and the degraded video frame The frame deviation or the frozen state of the degraded image is obtained, and the matching between the corresponding relationship between the frames and the corresponding relationship between the pixels is obtained.

另外，为实现上述目的，第三十五形态的发明是一种影像匹配程序，其特征在于，使计算机作为下述单元工作：输入作为未恶化的影像信号的基准影像信号和该基准影像信号恶化后的恶化影像信号、把在所述恶化影像信号中包含的恶化影像的文件形式变换为在对应的所述基准影像信号中包含的基准影像的文件形式的形式变化单元；匹配在所述基准影像信号中包含的基准影像帧和在所述恶化影像信号中包含的恶化影像帧的数目以及显示定时的显示定时匹配单元；和对于所述基准影像帧以及所述恶化影像帧的对象帧以及其前后几个帧、监视在所述基准影像帧和所述恶化影像帧之间的帧的偏离或者所述恶化影像的冻结状态、同时取得帧间的对应关系以及像素间的对应关系的匹配的同步/位置匹配单元。In addition, in order to achieve the above object, the thirty-fifth aspect of the invention is a video matching program characterized by causing a computer to operate as a means for inputting a reference video signal which is a non-degraded video signal and a degraded reference video signal. After the degraded image signal, a format change unit for converting the file format of the degraded image included in the degraded image signal into the file format of the reference image included in the corresponding reference image signal; matching the reference image a reference video frame included in the signal and a display timing matching unit for the number of degraded video frames included in the degraded video signal and display timing; Synchronization/ Location matching unit.

另外，为实现上述目的，第三十六形态的发明是一种影像质量评价装置，其特征在于，具有：输入作为未恶化的影像信号的基准影像信号(RI)和该基准影像信号恶化后的恶化影像信号(PI)、把在所述恶化影像信号中包含的恶化影像的文件形式变换为在对应的所述基准影像信号中包含的基准影像的文件形式的形式变换部(41)；匹配在所述基准影像信号中包含的基准影像帧和在所述恶化影像信号中包含的恶化影像帧的数目以及显示定时的显示定时匹配部(42)；对于所述基准影像帧以及所述恶化影像帧的对象帧以及其前后几个帧、监视在所述基准影像帧和所述恶化影像帧之间的帧的偏离或者所述恶化影像的冻结状态、同时取得帧间的对应关系以及像素间的对应关系的匹配的同步/位置匹配部(43)；和输入通过所述同步/位置匹配部(43)已取得匹配的所述基准影像信号(RI)以及所述恶化影像信号(PI)、计算两信号的影像信号特征量、根据计算的两信号的影像信号特征量的差、推定所述恶化影像信号的主观质量的主观质量推定部(11)。Furthermore, in order to achieve the above object, the thirty-sixth aspect of the invention is a video quality evaluation device characterized by comprising: a reference video signal (RI) which is an undegraded video signal and a degraded reference video signal. A degraded image signal (PI), a format conversion unit (41) for converting the file format of the degraded image included in the degraded image signal into the file format of the reference image included in the corresponding reference image signal; A display timing matching unit (42) for displaying the number and display timing of a reference video frame included in the reference video signal and a degraded video frame included in the degraded video signal; for the reference video frame and the degraded video frame The target frame and several frames before and after it, monitor the deviation of the frame between the reference image frame and the degraded image frame or the freezing state of the degraded image, and simultaneously obtain the correspondence between frames and the correspondence between pixels a synchronization/position matching section (43) for matching the relationship; A subjective quality estimating unit (11) for estimating the subjective quality of the degraded video signal based on the video signal feature quantity of the signal and the calculated difference between the video signal feature quantity of the two signals.

附图说明Description of drawings

图1是表示本发明的影像质量评价装置的第一实施形态的结构的框图；FIG. 1 is a block diagram showing the configuration of the first embodiment of the video quality evaluation device of the present invention;

图2是表示在图1的修正信息数据库中存储的修正信息的图；FIG. 2 is a diagram showing correction information stored in the correction information database of FIG. 1;

图3是表示基于针对一次推定主观质量SQ的修正公式的修正后的最终推定主观质量Q和实测的主观质量的关系的图；3 is a graph showing the relationship between the final estimated subjective quality Q after correction based on the correction formula for the primary estimated subjective quality SQ and the actual measured subjective quality;

图4是表示本发明的影像质量评价装置的第二实施形态的结构的框图；4 is a block diagram showing the configuration of a second embodiment of the video quality evaluation device of the present invention;

图5是用于计算在图4的空间特征量计算部中使用的水平垂直边缘量的图；FIG. 5 is a diagram for calculating horizontal and vertical edge quantities used in the spatial feature quantity calculation section of FIG. 4;

图6是表示本发明的影像质量评价装置的第三实施形态的结构的框图；6 is a block diagram showing the configuration of a third embodiment of the video quality evaluation device of the present invention;

图7是表示在图6的加权系数数据库中存储的多个条件以及与条件对应的加权系数的图；7 is a diagram representing a plurality of conditions stored in the weighting coefficient database of FIG. 6 and weighting coefficients corresponding to the conditions;

图8是把在本发明的实施形态的验证中使用的标准影像数据分类为学习数据和验证数据列举的图；Fig. 8 is a diagram listing standard image data classified into learning data and verification data used in verification according to the embodiment of the present invention;

图9是根据图8的学习数据以及验证数据计算出的SI(空间信息)值和TI(时间信息)值的分布图；Fig. 9 is a distribution diagram of SI (spatial information) value and TI (time information) value calculated according to the learning data of Fig. 8 and verification data;

图10是表示使用现有的峰值SN比(PSNR)得到的学习数据的推定结果的图；FIG. 10 is a diagram showing estimation results of learning data obtained using a conventional peak-to-SN ratio (PSNR);

图11是表示使用现有的边缘电力量(Ave_EE)得到的学习数据的推定结果的图；FIG. 11 is a diagram showing estimation results of learning data obtained using conventional marginal electric power (Ave_EE);

图12是表示使用本发明的第二以及第三实施形态的影像质量评价装置得到的学习数据的推定结果的图；12 is a diagram showing estimation results of learning data obtained using the video quality assessment devices according to the second and third embodiments of the present invention;

图13是表示使用本发明的第二以及第三实施形态的影像质量评价装置得到的验证数据的推定结果的图；13 is a diagram showing estimation results of verification data obtained using the video quality evaluation devices according to the second and third embodiments of the present invention;

图14是表示仅使用现有的边缘电力量(Ave_EE)得到的学习数据的推定结果的图；FIG. 14 is a diagram showing estimation results of learning data obtained using only the existing marginal electric power (Ave_EE);

图15是表示水平垂直边缘量的最小值(Min_HV)和主观评价值的关系的图；Fig. 15 is a diagram showing the relationship between the minimum value (Min_HV) of the horizontal and vertical margin amount and the subjective evaluation value;

图16是表示块平均运动电力量(Ave_MEB)和主观评价值的关系的图；Fig. 16 is a diagram showing the relationship between block average motion electric power (Ave_MEB) and subjective evaluation value;

图17是表示本发明的影像匹配装置的实施形态的结构的框图；Fig. 17 is a block diagram showing the configuration of an embodiment of the video matching device of the present invention;

图18是表示本发明的影像匹配装置的实施形态的动作步骤的流程图；Fig. 18 is a flow chart showing the operation steps of the embodiment of the image matching device of the present invention;

图19是用于说明显示定时匹配部中的处理的图；FIG. 19 is a diagram for explaining processing in a display timing matching unit;

图20是用于说明同步/位置匹配部中宏时刻同步处理的图；FIG. 20 is a diagram for explaining macro time synchronization processing in the synchronization/position matching unit;

图21是用于说明同步/位置匹配部中微同步/位置导出处理的图。FIG. 21 is a diagram for explaining micro-synchronization/position derivation processing in a synchronization/position matching unit.

具体实施方式Detailed ways

下面参照附图详细说明本发明的影像质量评价装置、影像质量评价方法和影像质量评价程序以及影像匹配装置、影像匹配方法和影像匹配程序的实施形态。第一到第三实施形态是影像质量评价装置、影像质量评价方法和影像质量评价程序的发明的实施形态，第四实施形态是影像匹配装置、影像匹配方法和影像匹配程序的发明的实施形态。Embodiments of the image quality evaluation device, image quality evaluation method, and image quality evaluation program, as well as the image matching device, image matching method, and image matching program of the present invention will be described in detail below with reference to the accompanying drawings. The first to third embodiments are embodiments of the invention of the video quality evaluation device, video quality evaluation method, and video quality evaluation program, and the fourth embodiment is an embodiment of the invention of the video matching device, video matching method, and video matching program.

<第一实施形态><First Embodiment>

参照图1说明本发明的影像质量评价装置的第一实施形态的各装置部分。图1是表示本发明的影像质量评价装置的第一实施形态的结构的框图。Each device part of the first embodiment of the video quality evaluation device of the present invention will be described with reference to FIG. 1 . FIG. 1 is a block diagram showing the configuration of a first embodiment of the video quality evaluation device of the present invention.

第一实施形态的影像质量评价装置，至少具有主观质量推定部11、特征量计算部12、修正信息数据库13、修正计算部14以及修正部15。The video quality assessment device of the first embodiment includes at least a subjective quality estimation unit 11 , a feature value calculation unit 12 , a correction information database 13 , a correction calculation unit 14 , and a correction unit 15 .

主观质量推定部11，输入基准影像信号RI和恶化影像信号PI。基准影像信号RI是恶化以前的影像信号，另一方面，恶化影像信号PI，是基准影像信号RI例如被编码或经由网络而恶化了的影像信号。The subjective quality estimation unit 11 receives a reference video signal RI and a deteriorated video signal PI as inputs. The reference video signal RI is a video signal before degradation, while the degraded video signal PI is a video signal in which the reference video signal RI has been degraded, for example, by encoding or passing through a network.

接着，主观质量推定部11，关于基准影像信号RI和恶化影像信号PI计算作为物理特征量的各自的影像信号特征量的差。在影像信号特征量中，例如有表示在影像信号中包含的在有影像的帧的影像状态的空间信息(SI)、以及表示在影像信号中包含的在有影像的帧间的影像变化的时间信息(TI)。作为这些空间信息SI以及时间信息TI，例如有在ITU-R Recommendation P.910的”Subjective Video Quality Assessment Methods for Multimedia Applications”的Appendix A中规定的Spatial Information以及Temporal Information。Next, the subjective quality estimating unit 11 calculates a difference in video signal feature value as a physical feature value with respect to the reference video signal RI and the degraded video signal PI. The video signal feature value includes, for example, spatial information (SI) indicating the video state of video frames included in the video signal, and time indicating video changes between video frames included in the video signal. Information (TI). Such spatial information SI and temporal information TI include, for example, Spatial Information and Temporal Information specified in Appendix A of "Subjective Video Quality Assessment Methods for Multimedia Applications" of ITU-R Recommendation P.910.

然后，主观质量推定部11，从计算的影像信号特征量的差定量化恶化影像信号PI的恶化，根据该定量化了的恶化推定主观质量。亦即，主观质量推定部11，由基准影像信号RI和恶化影像信号PI定量化恶化影像信号PI的恶化，根据该被定量化了的恶化推定主观质量。推定出的主观质量，从主观质量推定部11作为一次推定主观质量SQ输出。一次推定主观质量SQ，例如若通过边缘电力量(E)和运动电力量(M)决定的话，则一般可用下式(1)所示的函数表示：Then, the subjective quality estimation unit 11 quantifies the degradation of the degraded video signal PI from the calculated difference of the video signal feature value, and estimates the subjective quality from the quantified degradation. That is, the subjective quality estimation unit 11 quantifies the degradation of the degraded video signal PI from the reference video signal RI and the degraded video signal PI, and estimates the subjective quality from the quantified degradation. The estimated subjective quality is output from the subjective quality estimation unit 11 as primary estimated subjective quality SQ. The primary estimated subjective quality SQ, for example, if determined by the marginal electric power (E) and the kinetic electric power (M), can generally be expressed by the function shown in the following formula (1):

SQ＝F(M，E) ...(1)。SQ=F(M, E) ... (1).

该函数F是预先通过主观评价实验求得的。另外，边缘电力量(E)和运动电力量(M)，在美国ANSI规定的影像质量客观评价尺度(ANSIT1.801.03-1996，“Digital Transport of One-Way Video Signal Parameters forObiective Performance Assessment”)中被规定了。This function F is obtained in advance through subjective evaluation experiments. In addition, the amount of edge power (E) and motion power (M) are measured in the objective evaluation scale of video quality specified by ANSI in the United States (ANSIT1.801.03-1996, "Digital Transport of One-Way Video Signal Parameters for Objective Performance Assessment") stipulated.

特征量计算部12，输入基准影像信号RI，根据该基准影像信号RI计算影像信号特征量FI。作为该影像信号特征量FI如上所述例如也有空间信息SI或时间信息TI。特征量计算部12，例如确定空间信息SI以及时间信息TI中至少任何一个定量计算其特征量。The feature quantity calculation unit 12 receives a reference video signal RI as input, and calculates a video signal feature quantity FI based on the reference video signal RI. The video signal feature FI also includes, for example, spatial information SI or temporal information TI as described above. The feature amount calculation unit 12, for example, specifies at least one of the spatial information SI and the temporal information TI to quantitatively calculate the feature amount.

修正信息数据库13，与影像信号特征量对应起来存储修正信息。影像信号特征量，如上所述例如是空间信息SI或时间信息TI。修正信息是用于修正作为主观质量推定部11输出的一次推定主观质量SQ的修正公式或修正系数。后面参照图2将详细说明这些修正信息。这些修正公式或修正系数，预先通过实验决定，存储在修正信息数据库13中。在具有影像信号特征量FI的影像中加入了恶化的场合，预先通过主观评价实验统一计算人们看见该影像后主观上感觉到何种程度的恶化这样的特性。根据该特性计算对于影像信号特征量FI的修正公式以及修正系数。The correction information database 13 stores correction information in association with video signal feature values. The video signal feature quantity is, for example, spatial information SI or temporal information TI as described above. The correction information is a correction formula or a correction coefficient for correcting the primary estimated subjective quality SQ output from the subjective quality estimation unit 11 . These correction information will be described in detail later with reference to FIG. 2 . These correction formulas or correction coefficients are determined experimentally in advance and stored in the correction information database 13 . When degradation is added to a video having the video signal feature value FI, characteristics such as how much degradation people perceive subjectively after seeing the video are collectively calculated in advance through a subjective evaluation experiment. A correction formula and a correction coefficient for the video signal feature value FI are calculated based on this characteristic.

亦即，预先通过主观评价试验统一计算基准影像信号RI的物理特征量和在具有其特征的影像上加上恶化时的主观评价特性，根据该计算出的物理特征量和主观评价特性间的对应关系，修正从基准影像信号RI的影像信号特征量由主观质量推定部11导出的一次推定主观质量SQ，使之可以进行高精度地向统一的客观评价值的变换。That is, the physical feature quantity of the reference video signal RI is collectively calculated in advance through a subjective evaluation test, and the subjective evaluation characteristic when adding deterioration to an image having its characteristics is calculated, and the correspondence between the calculated physical feature quantity and the subjective evaluation characteristic is calculated. relationship, the primary estimated subjective quality SQ derived by the subjective quality estimating unit 11 from the video signal feature quantity of the reference video signal RI is corrected so that it can be converted to a unified objective evaluation value with high precision.

修正计算部14，从特征量计算部12输入影像信号特征量FI，从修正信息数据库13抽出与该影像信号特征量FI对应的修正公式以及修正系数。亦即，修正计算部14，在修正信息数据库13中检索与影像信号特征量FI对应的修正公式以及修正系数，从修正信息数据库13抽出相应的修正公式以及修正系数。然后，修正计算部14，作为修正信息CI输出这些修正公式以及修正系数。The correction calculation unit 14 receives the video signal feature FI from the feature calculation unit 12 , and extracts a correction formula and correction coefficient corresponding to the video signal feature FI from the correction information database 13 . That is, the correction calculation unit 14 searches the correction information database 13 for a correction formula and correction coefficient corresponding to the video signal feature value FI, and extracts the corresponding correction formula and correction coefficient from the correction information database 13 . Then, the correction calculation unit 14 outputs these correction formulas and correction coefficients as correction information CI.

修正部15，从主观质量推定部11输入一次推定主观质量SQ，从修正计算部14输入修正信息CI。然后，修正部15，把一次推定主观质量SQ代入伴有在修正信息CI中包含的修正系数的修正公式中，作为最终推定主观质量Q输出修正的一次推定主观质量SQ。最终推定主观质量Q，修正了定量表示恶化影像信号PI的主观质量SQ的一次推定主观质量SQ。The correction unit 15 receives an estimated subjective quality SQ from the subjective quality estimation unit 11 and correction information CI from the correction calculation unit 14 . Then, the correction unit 15 substitutes the primary estimated subjective quality SQ into a correction formula with a correction coefficient included in the correction information CI, and outputs the corrected primary estimated subjective quality SQ as the final estimated subjective quality Q. The subjective quality Q is finally estimated, and the primary estimated subjective quality SQ quantitatively expressing the subjective quality SQ of the deteriorated video signal PI is corrected.

下面参照图2说明修正信息数据库13存储的、对应影像信号特征量的修正信息。图2是表示在图1的修正信息数据库13中存储的修正信息的图。Next, correction information corresponding to feature quantities of video signals stored in the correction information database 13 will be described with reference to FIG. 2 . FIG. 2 is a diagram showing correction information stored in the correction information database 13 of FIG. 1 .

如图2所示，在修正信息数据库13中，与影像信号特征量的各个值对应起来，把多个修正系数数据库化。根据图2，在通过特征量计算部12将基准影像信号RI的影像信号特征量(空间信息SI以及时间信息TI)计算为(SI，TI)＝(75，30)的场合，对应该影像信号特征量的修正系数是(α，β)＝(α2，β2)。实际上，修正计算部14从特征量计算部12输入影像信号特征量FI，从修正信息数据库13的数据库与修正公式一起抽出与该影像信号特征量FI对应的修正系数。进而，在对应影像信号特征量修正公式也变化的场合，在修正信息数据库13中，不仅修正系数，对应影像信号特征量的修正公式也被数据库化。As shown in FIG. 2 , in the correction information database 13 , a plurality of correction coefficients are databased in association with each value of the feature value of the video signal. According to FIG. 2, when the video signal feature data (spatial information SI and temporal information TI) of the reference video signal RI is calculated by the feature data calculation unit 12 as (SI, TI)=(75, 30), the corresponding video signal The correction coefficient of the feature quantity is (α, β)=(α2, β2). Actually, the correction calculation unit 14 receives the video signal feature FI from the feature calculation unit 12 , and extracts a correction coefficient corresponding to the video signal feature FI from the database of the correction information database 13 together with a correction formula. Furthermore, when the correction formula corresponding to the feature value of the video signal is also changed, not only the correction coefficient but also the correction formula corresponding to the feature value of the video signal are databased in the correction information database 13 .

最后，参照图3说明修正部15修正一次推定主观质量SQ后输出的最终推定主观质量Q。图3是表示使用对一次推定主观质量SQ的修正公式得到的修正后的最终推定主观质量Q和实测的主观质量间的关系的图。Finally, the final estimated subjective quality Q outputted after the correction unit 15 corrects the estimated subjective quality SQ once will be described with reference to FIG. 3 . FIG. 3 is a graph showing the relationship between the corrected final estimated subjective quality Q and the actually measured subjective quality obtained by using a correction formula for the primary estimated subjective quality SQ.

在第一实施形态中，修正公式为下面的式(2)。In the first embodiment, the correction formula is the following formula (2).

最终推定主观质量Q＝α×一次推定主观质量SQ+β……(2)Final estimated subjective quality Q=α×primary estimated subjective quality SQ+β...(2)

这里，α和β是修正系数。图3表示通过对于输入到影像信号评价装置中的3种输入影像信号计算的3个修正公式得到的图线。横轴表示作为主观质量推定部11的输出的一次推定主观质量SQ，纵轴表示对于一次推定主观质量SQ、通过人们实际看到通过输入影像信号得到的影像后评价其质量的主观质量评价试验计算出来的主观质量。图中用圆形记号、方形记号和三角形记号分别表示的，是表示对于每一输入影像信号针对一次推定主观质量SQ的主观质量评价点。另一方面，3条线是分别对应3种输入影像信号的基于本实施形态的修正公式，根据本实施形态，一次推定主观质量SQ以这样对应每一影像的修正公式进行修正。Here, α and β are correction coefficients. FIG. 3 shows graphs obtained by three correction formulas calculated for three types of input video signals input to the video signal evaluation device. The horizontal axis represents the primary estimated subjective quality SQ which is the output of the subjective quality estimation unit 11, and the vertical axis represents the subjective quality evaluation test calculation of the primary estimated subjective quality SQ by people actually seeing the video obtained from the input video signal and evaluating its quality. The subjective quality that comes out. In the figure, circles, squares, and triangles represent subjective quality evaluation points for each input video signal for one estimated subjective quality SQ. On the other hand, the three lines are correction formulas based on the present embodiment respectively corresponding to the three types of input video signals. According to the present embodiment, the primary estimated subjective quality SQ is corrected using such correction formulas corresponding to each video.

这样，根据第一实施形态，从基准影像的物理特征量求对于该影像的人的视觉特性，作为对于基准影像的特征量的修正信息数据库化，对于从基准影像和恶化影像的物理的特征量的差导出的推定主观质量，以该修正信息付以权重，这样，对于任意影像就可以以和作为现有方法的主观评价法相同的精度统一推定主观质量。In this way, according to the first embodiment, the human visual characteristics for the image are obtained from the physical feature quantities of the reference image, and the correction information for the feature quantities of the reference image is databased, and the physical feature quantities of the reference image and the deteriorated image By weighting the estimated subjective quality derived from the difference between the two values, the subjective quality can be uniformly estimated with the same accuracy as the conventional subjective evaluation method for any video.

此外，在上述实施形态中，把计算基准影像信号RI的影像信号特征量的结构作为特征量计算部12在逻辑上作为单体设置，但是也可以不特别设置这样的特征量计算部12，原样不变利用由主观质量推定部11导出的基准影像信号RI的影像信号特征量。In addition, in the above-mentioned embodiment, the structure for calculating the feature quantity of the video signal of the reference video signal RI is logically provided as a single feature quantity calculation unit 12, but such a feature quantity calculation unit 12 may not be specially provided, and it may be used as it is. The video signal feature value of the reference video signal RI derived by the subjective quality estimation unit 11 is used without change.

另外，修正计算部14和修正部15不仅物理上、也可以在逻辑上为一体。亦即，也可以在修正部15直接输入基准影像信号RI的影像信号特征量的同时，修正部15从修正信息数据库输入对应该影像信号特征量的修正信息。In addition, the correction calculation unit 14 and the correction unit 15 may be integrated not only physically but also logically. That is, while the correcting unit 15 directly inputs the video signal feature value of the reference video signal RI, the correcting unit 15 may input correction information corresponding to the video signal feature value from the correction information database.

<第二实施形态><Second Embodiment>

参照图4说明本发明的影像质量评价装置的第二实施形态的各装置部分。图4是表示本发明的影像质量评价装置的第二实施形态的结构的框图。Each device part of the second embodiment of the video quality evaluation device of the present invention will be described with reference to FIG. 4 . FIG. 4 is a block diagram showing the configuration of a second embodiment of the video quality evaluation device of the present invention.

第二实施形态的影像质量评价装置，如图4所示，具有校准信息生成部21、空间特征量计算部22、时间特征量计算部23以及主观质量推定部24。The video quality assessment device according to the second embodiment includes a calibration information generation unit 21 , a spatial feature calculation unit 22 , a temporal feature calculation unit 23 , and a subjective quality estimation unit 24 as shown in FIG. 4 .

校准信息生成部21，输入基准影像信号RI和恶化影像信号PI，分别从基准影像信号RI接收基准影像帧，从恶化影像信号PI接收恶化影像帧，检测基准影像帧和恶化影像帧的时间的以及空间的帧的偏离，生成关于这些时间的以及空间的帧的偏离的校准信息。The calibration information generator 21 receives the reference video signal RI and the degraded video signal PI as input, receives the reference video frame from the reference video signal RI, receives the degraded video frame from the degraded video signal PI, and detects the timing of the reference video frame and the degraded video frame. Spatial frame deviations, generating calibration information about these temporal and spatial frame deviations.

所谓基准影像帧和恶化影像帧的时间上的偏离，是指校准信息生成部21在某时刻接收到的基准影像帧和恶化影像帧的影像在时间上的偏离。例如，在校准信息生成部21在某时刻在基准影像帧中接收某影像A的帧、在该时刻在恶化影像帧中接收到比该影像A的帧3帧前的帧的场合，校准信息生成部21检测对于基准影像帧恶化影像帧晚了3帧的事实，将该信息作为校准信息生成。The temporal deviation between the reference video frame and the degraded video frame refers to the temporal deviation between the reference video frame and the degraded video frame received by the calibration information generation unit 21 at a certain point in time. For example, when the calibration information generation unit 21 receives a frame of a certain video A in the reference video frame at a certain time, and receives a frame 3 frames earlier than the frame of the video A in the degraded video frame at this time, the calibration information generation The unit 21 detects that the degraded video frame is three frames behind the reference video frame, and generates this information as calibration information.

另外，所谓基准影像帧和恶化影像帧的空间的偏离，是指校准信息生成部21在某时刻接收到的影像的空间位置偏离了。例如，在基准影像帧和恶化影像帧的帧在时间上不偏离的场合，校准信息生成部21在某时刻在基准影像帧中接收在影像中心球(ボ-ル)的中心映射的帧、在该时刻在恶化影像帧中在从影像的中心右1像素、上2像素偏离的位置球的中心映射的帧的场合，校准信息生成部21检测对于基准影像帧恶化影像帧向右1像素、上2像素偏离了的事实，将该信息作为校准信息生成。In addition, the spatial deviation between the reference video frame and the degraded video frame means that the spatial position of the video received by the calibration information generation unit 21 at a certain time is deviated. For example, when the frames of the reference video frame and the degraded video frame do not deviate in time, the calibration information generation unit 21 receives the frame mapped at the center of the center sphere of the video in the reference video frame at a certain time, At this point in time, in the case of a degraded video frame that is mapped to the center of the position sphere that is 1 pixel to the right and 2 pixels above the center of the video, the calibration information generation unit 21 detects that the degraded video frame is 1 pixel to the right and 2 pixels above the reference video frame. The fact that 2 pixels are shifted creates this information as calibration information.

空间特征量计算部22，输入基准影像信号RI和恶化影像信号PI和校准信息，使基准影像帧和恶化影像帧的空间的偏离以及时间的偏离消失，其后，根据基准影像信号RI和恶化影像信号PI计算空间特征量。在该第二实施形态中，作为该空间特征量，使用以下要详述的边缘电力量(Ave_EE)和水平垂直边缘量的最小值(Min_HV)的指标。The spatial feature calculation unit 22 receives the reference image signal RI, the degraded image signal PI, and the calibration information, so that the spatial and temporal deviations between the reference image frame and the degraded image frame disappear, and then, based on the reference image signal RI and the deteriorated image The signal PI calculates the spatial feature quantity. In the second embodiment, as the spatial feature quantity, indices of the edge electric power (Ave_EE) and the minimum value (Min_HV) of the horizontal and vertical edge quantities to be described in detail below are used.

边缘电力量(Ave_EE)：Edge power (Ave_EE):

该指标，是基于基准影像信号RI和恶化影像信号PI定量化在帧内亮度值急剧变化的边界(称为边缘)上发生的恶化(例如模糊情况)的指标。在第二实施形态中，作为一例，通过使用Sobel滤波器从像素的亮度值强调边缘，由此，来定量化边缘上的恶化。另外，这里定量化的边缘电力量(Ave_EE)，在ANSI TI.801.03-1996，“Digital Transport of One-Way Video Signal Parametersfor Objective Performance Assessment”中被规定了。This index is an index for quantifying deterioration (for example, blurring) occurring at a boundary (called an edge) where the luminance value changes rapidly within a frame based on the reference video signal RI and the degraded video signal PI. In the second embodiment, as an example, the deterioration on the edge is quantified by emphasizing the edge from the luminance value of the pixel using a Sobel filter. In addition, the quantified edge power (Ave_EE) here is specified in ANSI TI.801.03-1996, "Digital Transport of One-Way Video Signal Parameters for Objective Performance Assessment".

边缘电力量(Ave_EE)，用(3)式定量化。The amount of marginal electric power (Ave_EE) is quantified by the formula (3).

[数学式1][mathematical formula 1]

$Ave Ave.__EE EE = = \sqrt{\frac{11}{M m} {Σ Σ}_{m m = = 00}^{M m - - 11} {((\frac{S S {I I}_{in in} ((m m)) - - S S {I I}_{out out} ((m m))}{S S {I I}_{in in} ((m m))}))}^{22}} - - - - - - ((33))$

式中，SI_μ(m)(μ＝in或out)由(4)式给出。In the formula, SI _μ (m) (μ=in or out) is given by (4).

[数学式2][mathematical formula 2]

$S S {I I}_{μ μ} ((m m)) = = \sqrt{\frac{11}{N N} \underset{i i,, j j}{Σ Σ} {{S S {I I}^{22} h h__μ μ ((i i,, j j,, m m)) + + S S {I I}^{22} v v__μ μ ((i i,, j j,, m m))}} - - {((\frac{11}{N N} \sqrt{S S {I I}^{22} h h__μ μ ((i i,, j j,, m m)) + + S S {I I}^{22} v v__μ μ ((i i,, j j,, m m))}))}^{22}} - - - - - - ((44))$

μ＝in或outμ=in or out

另外，这里，SI_h-μ(i，j，m)、SI_v-μ(i，j，m)，分别表示位于第m帧的位置(i，j)处的Sobel滤波器，分别由(5)式以及(6)式给出。In addition, here, SI _h-μ (i, j, m), SI _v-μ (i, j, m), respectively represent the Sobel filter located at the position (i, j) of the m-th frame, respectively by ( 5) and (6) are given.

SI_h-μ(i，j，m)＝{-Y_μ(i-1，j-1，m)-2Y_μ(i，j-1，m)-Y_μ(i+1，j-1，m)SI _h-μ (i, j, m)={-Y _μ (i-1, j-1, m)-2Y _μ (i, j-1, m)-Y _μ (i+1, j-1 , m)

+Y_μ(i-1，j+1，m)+2Y_μ(i，j+1，m)+Y_μ(i+1，j+1，m)}+Y _μ (i-1, j+1, m)+2Y _μ (i, j+1, m)+Y _μ (i+1, j+1, m)}

……(5)... (5)

SI_v-μ(i，j，m)＝{-Y_μ(i-1，j-1，m)+Y_μ(i+1，j-1，m)-2Y_μ(i-1，j，m)SI _v-μ (i, j, m)={-Y _μ (i-1, j-1, m)+Y _μ (i+1, j-1, m)-2Y _μ (i-1, j , m)

+2Y_μ(i+1，j，m)-Y_μ(i-1，j+1，m)+Y_μ(i+1，j+1，m)}+2Y _μ (i+1, j, m)-Y _μ (i-1, j+1, m)+Y _μ (i+1, j+1, m)}

……(6)... (6)

式中，Y_in(i，j，m)表示位于基准影像帧的第m帧的位置(i，j)处的像素的亮度值(从0到255)，Y_out(i，j，m)表示位于恶化影像帧的第m帧的位置(i，j)处的像素的亮度值(从0到255)。In the formula, Y _in (i, j, m) represents the brightness value (from 0 to 255) of the pixel at the position (i, j) of the mth frame of the reference image frame, Y _out (i, j, m) Indicates the brightness value (from 0 to 255) of the pixel at position (i, j) of the mth frame of the degraded image frame.

此外，后面和图一起说明仅用边缘电力量(Ave_EE)推定的时的结果，但是存在尽管客观评价值几乎不变但主观评价值变化却很大的影像，仅用该边缘电力量(Ave_EE)不能完全捕捉在这些影像中发生的恶化。In addition, the results of estimation using only the marginal electric power (Ave_EE) will be described later together with the figure, but there are images in which the subjective evaluation value changes greatly although the objective evaluation value hardly changes, and only the marginal electric power (Ave_EE) is used. The deterioration that occurs in these images cannot be fully captured.

水平垂直边缘量的最小值(Min_HV)：Minimum value of horizontal and vertical margin (Min_HV):

该指标，是和基准影像帧比较，来定量化在对应该基准影像帧的所述恶化影像帧中亮度值急剧变化的边界(边缘)在水平垂直方向上发生的程度的指标。在该第二实施形态中，作为一例，通过由在帧的水平/垂直方向的场所发生的水平垂直边缘量和其以外的方向的场所发生的边缘量的比、捕捉失真量的特征量(Min_HV)，来定量化在水平垂直上边缘发生的程度。该程度，如图5所示，通过输入影像和恶化影像来求取进入由常数r0＝20和Δθ＝0.05236决定的水平/垂直方向区域的点部分的边缘量(HV)和进入这以外的区域的边缘量(HV短横线)的比。采用这样的指标的理由在于，大多用块单位编码影像信息，所以确实反映在这样的编码中的所谓的块失真。This index is an index for quantifying the degree of horizontal and vertical boundaries (edges) in which luminance values change sharply in the degraded video frame corresponding to the reference video frame compared with the reference video frame. In this second embodiment, as an example, the feature value (Min_HV ), to quantify the extent to which edges occur horizontally and vertically. To this extent, as shown in Fig. 5, the edge volume (HV) of the dots entering the horizontal/vertical area determined by the constant r0 = 20 and Δθ = 0.05236 and the area outside this are obtained from the input image and the degraded image. The ratio of edge volume (HV dashes). The reason for adopting such an index is that video information is often coded in units of blocks, so so-called block distortion is reliably reflected in such coding.

水平垂直边缘量的最小值(Min_HV)，用式(7)定量化。The minimum value (Min_HV) of the horizontal and vertical margins is quantified by formula (7).

[数学式3][mathematical formula 3]

$Min Min__HV HV = = \underset{m m}{min min} {{\frac{HV HV {R R}_{in in} ((M m)) - - HV HV {R R}_{out out} ((m m))}{HV HV {R R}_{in in} ((m m))}}} - - - - - - ((77))$

式中，HVR_μ(m)(μ＝in或out)，由(8)式给出。In the formula, HVR _μ (m) (μ = in or out), given by (8).

[数学式4][mathematical formula 4]

$HV HV {R R}_{μ μ} ((m m)) = = \frac{H h {V V}_{μ μ} (({r r}_{min min},, Δθ Δθ,, m m)) + + 0.5 0.5}{\overset{&OverBar; &OverBar;}{H h {V V}_{μ μ}} (({r r}_{min min},, Δθ Δθ,, m m)) + + 0.5 0.5} - - - - - - ((88))$

μ＝in或outμ=in or out

另外，这里有用(9)式表示的定义。In addition, here is the definition represented by the formula (9).

[数学式5][mathematical formula 5]

$H h {V V}_{μ μ} (({r r}_{min min},, Δθ Δθ,, m m)) = = \frac{11}{P P} \underset{i i,, j j}{Σ Σ} S S {I I}_{r r__μ μ ((i i,, j j,, m m))} - - - - - - ((99))$

μ＝in或outμ=in or out

其中，作为条件，满足(10)式以及(11)式。However, as a condition, Expression (10) and Expression (11) are satisfied.

SI_{r_μ}(i，j，m)≥r_min＞0 ……(10)SI _{r_μ} (i, j, m) ≥ r _min > 0 ... (10)

k_n/2-Δθ＜SI_{θ_μ}(i，j，m)＜k_n/2+Δθ(k＝0，1，2，3)k _n /2-Δθ<SI _{θ_μ} (i,j,m)<k _n /2+Δθ(k=0,1,2,3)

……(11)... (11)

这里，SI_{θ_μ}(i，j，m)＝tan^-1[SI_{v_μ}(i，j，m)/SI_{h_μ}(i，j，m)]。Here, SI _{θ_μ} (i, j, m) = tan ^-1 [SI _{v_μ} (i, j, m)/SI _{h_μ} (i, j, m)].

另外，(9)式的P，是图5的阴影占据的范围的像素数。In addition, P in the formula (9) is the number of pixels in the range occupied by the shade in FIG. 5 .

另一方面，有(12)式表示的定义。On the other hand, there is a definition represented by the formula (12).

[数学式6][mathematical formula 6]

$\overset{&OverBar; &OverBar;}{H h {V V}_{μ μ}} (({r r}_{min min},, Δθ Δθ,, m m)) = = \frac{11}{P P} \underset{i i,, j j}{Σ Σ} S S {I I}_{r r__μ μ} ((i i,, j j,, m m)) - - - - - - ((1212))$

μ＝in或outμ=in or out

其中，作为条件，满足(13)式以及(14)式。However, as a condition, Expression (13) and Expression (14) are satisfied.

SI_{r_μ}(i，j，m)≥r_min＞0 ……(13)SI _{r_μ} (i, j, m) ≥ r _min > 0 ... (13)

k_п/2+Δθ＜SI_{θ_μ}(i，j，m)＜(k+1)_п/2-Δθ(k＝0，1，2，3)k _п /2+Δθ<SI _{θ_μ} (i, j, m)<(k+1) _п /2-Δθ(k=0, 1, 2, 3)

……(14)... (14)

另外，(12)式的P是满足(13)式以及(14)式的范围的像素数。In addition, P in Expression (12) is the number of pixels satisfying the ranges of Expressions (13) and (14).

水平垂直边缘量的最小值(Min_HV)，是在边缘电力量(Ave_EE)中捕捉不到的恶化，具体说，是捕捉由于块失真新生成的边缘的特征量。参照附图后面会说明，通过该指标可以敏感地捕捉其恶化。该指标，以ANSI中规定的指标为基础，不过，如(7)式所示，通过使用最小值独自改进成能够高灵敏度地仅捕捉新边缘的发生数量。The minimum value (Min_HV) of the horizontal and vertical edge quantities is a deterioration that cannot be captured in the edge electric quantity (Ave_EE), specifically, it is a feature quantity that captures edges newly generated due to block distortion. As will be described later with reference to the drawings, this index can sensitively capture its deterioration. This index is based on the index stipulated in ANSI, however, as shown in the formula (7), it is independently improved by using the minimum value so that only the number of occurrences of new edges can be captured with high sensitivity.

然而，时间特征量计算部23，输入基准影像信号RI和恶化影像信号PI和校准信息，消除基准影像帧和恶化影像帧的空间偏离以及时间偏离，其后，根据基准影像信号RI和恶化影像信号PI计算时间特征量。在该第二实施形态中作为该时间特征量，使用以下要详细说明的作为基于TI(时间信息)值的差的指标的块平均运动电力量(Ave_MEB)。TI值是影像帧之间的像素的亮度值的差，是在ITU-R Recommendation P.910中被规定了的。However, the temporal feature quantity calculation unit 23 inputs the reference video signal RI, the degraded video signal PI and the calibration information, eliminates the spatial deviation and the temporal deviation between the reference video frame and the degraded video frame, and thereafter, based on the reference video signal RI and the degraded video signal, PI calculates the time feature quantity. In the second embodiment, the block average exercise electric power amount (Ave_MEB) which is an index based on a difference of TI (time information) values to be described in detail below is used as the temporal feature amount. The TI value is the difference in brightness values of pixels between image frames, and is specified in ITU-R Recommendation P.910.

该块平均运动电力量(Ave_MEB)：Average motion electric power of the block (Ave_MEB):

该指标，是对于作为帧内几个像素集合的每一块导出基准影像帧和恶化影像帧的TI值的差、根据基准影像帧每块的TI值对该差执行标准化的指标。This index is an index that derives the difference between the TI values of the reference video frame and the degraded video frame for each block, which is a set of several pixels in the frame, and normalizes the difference based on the TI value of each block of the reference video frame.

亦即，块平均运动电力量(Ave_MEB)用(15)式表示。That is, the block average motion electric power amount (Ave_MEB) is represented by equation (15).

[数学式7][mathematical formula 7]

$Ave Ave.__MEB MEB = = \frac{11}{M m} {Σ Σ}_{M m = = 00}^{M m - - 11} \frac{11}{{N N}_{b b}} \sqrt{\underset{((k k,, l l))}{Σ Σ} {((\frac{T T {I I}_{b b__in in} ((k k,, l l,, m m)) - - T T {I I}_{b b__out out} ((k k,, l l,, m m))}{T T {I I}_{b b,, in in} ((k k,, l l,, m m))}))}^{22}} - - - - - - ((1515))$

式中，TI_b-μ(k，l，m)用(16)式表示。In the formula, TI _b-μ (k, l, m) is represented by the formula (16).

[数学式8][mathematical formula 8]

$T T {I I}_{b b__μ μ} ((k k,, l l,, m m)) = = \frac{11}{6464} \underset{i i,, j j}{Σ Σ} {(({Y Y}_{μ μ} ((88 k k + + i i,, 88 l l + + j j,, m m)) - - {Y Y}_{μ μ} ((88 k k + + i i,, 88 l l + + j j,, m m - - 11))))}^{22} - - - - - - ((1616))$

μ＝in或outμ=in or out

块平均运动电力量(Ave_MEB)，是在边缘电力量(Ave_EE)中捕捉不尽的恶化，具体说，是捕捉基于每个区域的运动的恶化的发生的特征量。参照附图后面会说明，通过该指标可以敏感地捕捉其恶化。该指标，是为捕捉每一区域的运动导出每块的TI值，进而通过用基准影像的值进行标准化，而使灵敏度增高的独自的尺度。The block average motion electric power amount (Ave_MEB) is a characteristic amount that captures the endless deterioration in the edge electric power amount (Ave_EE), specifically, the occurrence of deterioration based on motion for each area. As will be described later with reference to the drawings, this index can sensitively capture its deterioration. This index is a unique measure to increase the sensitivity by deriving the TI value of each block in order to capture the motion of each area, and normalizing it with the value of the reference image.

主观质量推定部24，把由空间特征量计算部22计算的空间特征量和由时间特征量计算部23计算的时间特征量作为输入，推定作为对于基准影像信号RI的恶化影像信号PI的主观质量的主观评价值。该主观评价值(Y)用(17)式计算。The subjective quality estimating unit 24 receives the spatial feature calculated by the spatial feature calculating unit 22 and the temporal feature calculated by the temporal feature calculating unit 23 as inputs, and estimates the subjective quality of the degraded video signal PI as the reference video signal RI. subjective evaluation value. This subjective evaluation value (Y) is calculated by the formula (17).

Y＝αX₁+βX₂+γX₃+δ ……(17)Y=αX ₁ +βX ₂ +γX ₃ +δ ... (17)

式中，X₁＝Ave_EE，X₂＝Min_HV，X₃＝Ave_MEB。In the formula, X ₁ =Ave_EE, X ₂ =Min_HV, X ₃ =Ave_MEB.

该主观评价值是最终得到的值。This subjective evaluation value is a finally obtained value.

这里，α、β、γ、δ是为从与对于通过主观评价试验求得的恶化影像的主观评价值的关系，对于时间特征量以及空间特征量来决定与主观评价值的对应关系而预先求得的加权系数。另外，α、β、γ、δ，例如，是根据恶化影像信号PI的信号形式、由恶化影像信号PI发送来的恶化影像的信息量(大小)以及恶化影像信号的编码方式来决定。因此，从恶化影像信号PI的信号形式、通过恶化影像信号PI发送来的恶化影像的信息量(大小)以及恶化影像信号的编码方式，可以预先计算α、β、γ、δ。Here, α, β, γ, and δ are calculated in advance in order to determine the correspondence relationship between the temporal feature quantity and the spatial feature quantity and the subjective evaluation value from the relationship with the subjective evaluation value of the degraded image obtained through the subjective evaluation test. The obtained weighting coefficient. In addition, α, β, γ, and δ are determined according to, for example, the signal format of the degraded video signal PI, the information amount (size) of the degraded video transmitted from the degraded video signal PI, and the encoding method of the degraded video signal. Therefore, α, β, γ, and δ can be calculated in advance from the signal format of the degraded video signal PI, the information amount (size) of the degraded video transmitted by the degraded video signal PI, and the encoding method of the degraded video signal.

根据以上说明的第二实施形态的影像质量评价装置，可以对于任意的影像统一推定主观质量。后面参照附图要详细说明该第二实施形态的影像质量评价装置相对于现有的影像质量评价装置、精度可提高多少。According to the video quality assessment device of the second embodiment described above, it is possible to collectively estimate the subjective quality of any video. How much the accuracy of the video quality evaluation device of the second embodiment can be improved compared with the conventional video quality evaluation device will be described later in detail with reference to the drawings.

另外，也可以把在第二实施形态中说明过的图4所示的影像质量评价装置组装入到图1所示的影像质量评价装置的主观质量推定部11中，使用图1所示的影像质量评价装置。即，图4所示的影像质量评价装置输入基准影像信号RI和恶化影像信号PI，把主观质量推定部24输出的值作为一次推定主观质量SQ向修正部15输出。修正部15，从修正计算部14接收修正信息CI，输入一次推定主观质量SQ，计算最终推定主观质量Q。In addition, the video quality evaluation device shown in FIG. 4 described in the second embodiment may be incorporated into the subjective quality estimation unit 11 of the video quality evaluation device shown in FIG. 1, and the video quality evaluation device shown in FIG. quality evaluation device. That is, the video quality evaluation device shown in FIG. 4 inputs the reference video signal RI and the degraded video signal PI, and outputs the value output from the subjective quality estimation unit 24 to the correction unit 15 as the primary estimated subjective quality SQ. The correction unit 15 receives the correction information CI from the correction calculation unit 14, inputs the estimated subjective quality SQ once, and calculates the final estimated subjective quality Q.

<第三实施形态><Third Embodiment>

参照图6说明本发明的影像质量评价装置的第三实施形态的各装置部分。图6是表示本发明的影像质量评价装置的第三实施形态的结构的框图。Each device part of the third embodiment of the video quality evaluation device of the present invention will be described with reference to FIG. 6 . Fig. 6 is a block diagram showing the configuration of a third embodiment of the video quality evaluation device of the present invention.

第三实施形态的影像质量评价装置，与第二实施形态的影像质量评价装置比较，不同之处在于：向装置输入的基准影像信号RI和恶化影像信号PI是分别以相互不同的文件形式输入的；再者，恶化影像信号PI的信号形式、通过恶化影像信号PI发送来的恶化影像的信息量(大小)以及恶化影像信号的编码方式是未知的。因此，对和第二实施形态的影像质量评价装置同样的部分给以同样的符号，省略说明。The image quality evaluation device of the third embodiment differs from the image quality evaluation device of the second embodiment in that the reference image signal RI and the deteriorated image signal PI input to the device are input in different file formats. Furthermore, the signal form of the degraded image signal PI, the information amount (size) of the degraded image sent by the degraded image signal PI, and the encoding method of the degraded image signal are unknown. Therefore, the same reference numerals are assigned to the same parts as those in the image quality evaluation device of the second embodiment, and description thereof will be omitted.

第三实施形态的影像质量评价装置，如图6所示，具有格式变换部35、校准信息生成部21、空间特征量计算部22、时间特征量计算部23、主观质量推定部34、以及加权系数数据库36。The video quality evaluation device of the third embodiment, as shown in FIG. Coefficient database 36 .

格式变换部35，在输入基准影像文件和恶化影像文件后，而恶化影像文件的文件形式与基准影像文件的文件形式不同的场合，把恶化影像文件的文件形式变换为与基准影像文件的文件形式相同的形式。更详细说，例如，在恶化影像文件的信号形式、颜色分布、大小、长宽比、以及编码方式与基准影像文件的不同的场合，把恶化影像文件的文件形式变换为与基准影像文件的文件形式相同的形式。然后，格式变换部35从变换为了与基准影像文件相同文件形式的恶化影像文件输出恶化影像信号PI。来自格式变换部35的恶化影像信号PI被输出到校准信息生成部21、空间特征量计算部22、时间特征量计算部23。The format conversion unit 35 converts the file format of the degraded video file into the file format of the reference video file when the file format of the degraded video file is different from that of the reference video file after inputting the reference video file and the degraded video file. same form. More specifically, for example, when the signal form, color distribution, size, aspect ratio, and encoding method of the degraded video file are different from those of the reference video file, the file format of the degraded video file is converted into a file format similar to that of the reference video file. The form is the same. Then, the format conversion unit 35 outputs the degraded video signal PI from the degraded video file converted into the same file format as the reference video file. The degraded video signal PI from the format conversion unit 35 is output to the calibration information generation unit 21 , the spatial feature calculation unit 22 , and the temporal feature calculation unit 23 .

更具体说，格式变换部35，例如，在设想基准影像是VGA尺寸的YUV形式基准影像、基准影像和恶化影像的颜色格式不同的场合，如果恶化影像是RGB形式的话，则使用在Rec.IUT-R BT.601“STUDIO ENCODINGPARAMETERS OF DIGITAL TELEVISION FOR STANDARD 4:3 ANDWIDE-SCREEN 16:9 ASPECT RATIOS”等中记载的变换式变化该影像。另外，在格式变换部35把基准影像和恶化影像的尺寸或者长宽比变换为相同的场合，有时可以简单地作为整数倍计算，但是在这点不成立的场合需要变换为任意尺寸。在这种场合，使用周知的方法进行向任意尺寸的变换(例如参照，Muramatsu S.and Kiya H.：”Scale Factor of Resolution Conversion Based onOrthogonal Transforms，”IEICE Trans.Fundamentals.，E76-A，7，pp.1150-1153(July 1993)，或者松村正吾、贵家仁志，“对于变化编码的图像的任意的有理数倍率的分辨率变换法”电子信息通信学会论文集A，Vol.77-A，No.3，pp.369-378，March 1994)。另外，影像信号被压缩了的场合，预先变换为非压缩形式。More specifically, the format conversion unit 35, for example, assumes that the reference image is a VGA-sized YUV format reference image, and the reference image and the degraded image have different color formats. If the degraded image is in the RGB format, then -R BT.601 "STUDIO ENCODING PARAMETERS OF DIGITAL TELEVISION FOR STANDARD 4:3 ANDWIDE-SCREEN 16:9 ASPECT RATIOS" etc. to change the video. Also, when the format conversion unit 35 converts the size or aspect ratio of the reference video and the degraded video to be the same, it may be simply calculated as an integer multiple, but if this is not true, it needs to be converted to an arbitrary size. In this case, transformation to an arbitrary size is performed using a well-known method (see, for example, Muramatsu S. and Kiya H.: "Scale Factor of Resolution Conversion Based on Orthogonal Transforms," IEICE Trans. Fundamentals., E76-A, 7, pp.1150-1153 (July 1993), or Matsumura Shogo, Takaya Hitoshi, "Resolution conversion method for arbitrary rational number magnifications for images with change codes", Proceedings of the Society for Electronics, Information and Communications A, Vol.77-A, No. .3, pp.369-378, March 1994). Also, when the video signal is compressed, it is converted into an uncompressed format in advance.

另外，由于影像编码或传送系统的特性或不同国家规格的不同，使得恶化影像的颜色或亮度的分布不同，因此，为避免给客观评价值造成偏差，要对基准影像和恶化影像的亮度值等的分布进行标准化。亦即，对于基准影像和恶化影像两者，对于亮度、色差信号、或者RGB值的每一个，从特定帧的像素值的分布，导出其最大值、最小值、平均值、方差那样的统计量，为使其分布相同，通过例如以平均值偏离的数量移位恶化影像的全部像素值，或者通过使用在用于使基准影像和恶化影像的最小值和最大值的范围一致的(18)式所示的变换式进行变换，来对基准影像和恶化影像的分布进行标准化。In addition, due to the characteristics of the image coding or transmission system or the different national standards, the distribution of the color or brightness of the deteriorated image is different. The distribution is standardized. That is, for both the reference image and the degraded image, for each of the luminance, color difference signal, or RGB value, from the distribution of pixel values in a specific frame, statistics such as the maximum value, minimum value, average value, and variance are derived. , to make their distributions the same, either by shifting all the pixel values of the degraded image by, for example, the amount of deviation from the mean, or by using (18) in the ranges used to make the minimum and maximum values of the reference image and the degraded image consistent The transformation shown is transformed to normalize the distribution of the reference image and the degraded image.

[数学式9][mathematical formula 9]

式中，Y：变换后的恶化影像的像素值In the formula, Y: the pixel value of the transformed degraded image

X：变换前的恶化影像的像素值X: The pixel value of the degraded image before transformation

格式变换部35，向主观质量推定部34输出与变换过的恶化影像信号PI的文件形式、即和基准影像信号RI同样的文件形式关联的信息。与该文件形式关联的信息，例如是恶化影像文件的信号形式、颜色分布、尺寸、长宽比、以及编码方式，但是在该第三实施形态中，向主观质量推定部34输出恶化影像文件的信号形式、尺寸、以及编码方式。The format conversion unit 35 outputs information related to the file format of the converted degraded video signal PI, that is, the same file format as the reference video signal RI, to the subjective quality estimation unit 34 . The information related to the file format is, for example, the signal format, color distribution, size, aspect ratio, and encoding method of the degraded video file, but in this third embodiment, the subjective quality estimation unit 34 outputs the Signal form, size, and encoding method.

加权系数数据库36，把恶化影像文件的信号形式、尺寸、以及编码方式作为一组条件，存储有多个该条件，对应各条件预先设定加权系数α、β、γ、δ。这些加权系数，是从和通过主观评价实验求得的针对恶化影像的主观评价值间的关系、对于时间特征量以及空间特征量、预先求得和主观评价值间的对应关系的系数。图7表示加权系数数据库36存储的多个条件以及对应这些条件的加权系数。The weighting coefficient database 36 stores the signal format, size, and coding method of the degraded video file as a set of conditions, stores a plurality of these conditions, and sets weighting coefficients α, β, γ, and δ corresponding to each condition in advance. These weighting coefficients are coefficients obtained from the relationship with the subjective evaluation value for the degraded image obtained through the subjective evaluation experiment, and the correspondence relationship with the subjective evaluation value obtained in advance with respect to the temporal feature quantity and the spatial feature quantity. FIG. 7 shows a plurality of conditions stored in the weighting coefficient database 36 and weighting coefficients corresponding to these conditions.

主观质量推定部34，对应从格式变换部35输入的恶化影像文件的信号形式、尺寸、以及编码方式，从加权系数数据库36取得加权系数α、β、γ、δ，这点与第二实施形态的主观质量推定部24不同。其他和主观质量推定部24相同。Subjective quality estimation unit 34 obtains weighting coefficients α, β, γ, and δ from weighting coefficient database 36 corresponding to the signal format, size, and encoding method of the degraded video file input from format conversion unit 35, which is similar to that of the second embodiment. The subjective quality estimation unit 24 is different. Others are the same as the subjective mass estimation unit 24 .

此外，在图6中，格式变换部35和校准信息生成部21被分别配置，但是格式变换部35和校准信息生成部21也可以作为一个构成部组装到图6的影像质量评价装置中。In addition, in FIG. 6 , the format conversion unit 35 and the calibration information generation unit 21 are arranged separately, but the format conversion unit 35 and the calibration information generation unit 21 may be incorporated into the image quality evaluation device in FIG. 6 as a single component.

根据以上说明的本第三实施形态，即使在恶化影像文件的文件形式与基准影像文件不同的场合，也可以推定恶化影像的主观质量。再有，关于恶化影像文件的信号形式、尺寸、以及编码方式，因为可以对应多个模式，所以可以推定各种各样的恶化影像文件的主观质量。According to the third embodiment described above, even when the file format of the degraded video file is different from that of the reference video file, the subjective quality of the degraded video can be estimated. Furthermore, since a plurality of modes can be supported for the signal format, size, and coding method of the degraded video file, it is possible to estimate the subjective quality of various degraded video files.

下面比较现有的影像质量评价装置，参照图8到图16，来说明关于第二以及第三实施形态的影像质量评价装置推定的影像质量值的推定精度的实际结果。Next, the actual results of the estimation accuracy of the image quality values estimated by the image quality evaluation apparatuses of the second and third embodiments will be described with reference to FIGS. 8 to 16 in comparison with conventional image quality evaluation apparatuses.

最初，说明用于在推定精度的验证以及导出在加权系数数据库36中存储的加权系数使用的主观评价数据。主观评价数据，把在图8中所示的用ITU-R选定的标准影像(参照ITU-R BT.802-1，“Test Pictures and Sequences forsubjective Assessments of Digital Codecs’Coveying Signals produced Accordingto Recommendation ITU-RBT.601，”1994，以及ITU-R BT.1201-2，“Testmaterials to be used in subjective assessment，”2001)的36种作为影像，如图8所示，把标准影像分为用于验证影像质量评价值的推定精度的验证数据、和作为在导出系数中预先使用的数据的学习数据。First, subjective evaluation data used for verification of estimation accuracy and derivation of weighting coefficients stored in the weighting coefficient database 36 will be described. Subjective evaluation data, using the standard images selected by ITU-R shown in Figure 8 (refer to ITU-R BT.802-1, "Test Pictures and Sequences for subjective Assessments of Digital Codecs'Coveying Signals produced According to Recommendation ITU- RBT.601," 1994, and ITU-R BT.1201-2, "Testmaterials to be used in subjective assessment," 2001) 36 kinds of images, as shown in Figure 8, the standard images are divided into images for verification The verification data of the estimation accuracy of the quality evaluation value, and the learning data which are the data previously used for deriving the coefficient.

用于验证推定精度的主观评价数据，被选择为减小由于基准影像的特性的不平衡引起的影响。亦即，考虑在ITU-TP.910(参考ITU-TP.910，“SubjectiveVideo Quality assessment Methods for multimedia applicatons”Aug.1996.)中表示的空间信息(SI)、时间信息(TI)的分布，从图9的区域A到区域D选定相同数目的影像，这样，可以把具有各种各样的SI值以及TI值的影像作为基准影像。另外，对于这些基准影像，使用以MPEG4为基础的编码恶化在256kbps～8Mbps的范围内作做成为分配给4阶段的恶化影像。The subjective evaluation data used to verify the estimation accuracy is selected to reduce the influence due to the imbalance of the characteristics of the reference image. That is, considering the distribution of spatial information (SI) and temporal information (TI) expressed in ITU-TP.910 (refer to ITU-TP.910, "SubjectiveVideo Quality assessment Methods for multimedia applications" Aug.1996.), from The same number of images are selected from the regions A to D in FIG. 9 , so that images with various SI values and TI values can be used as reference images. In addition, for these reference images, MPEG4-based coding degradation is used to make degradation images assigned to 4 levels in the range of 256 kbps to 8 Mbps.

主观质量评价法，使用在MPEG的验证等多媒体数字信号编解码器的性能评价试验中经常使用的DSCQS法(参考ITU-R BT.500-10，“Methodology firthe subjective assessment of the quality of television pictures”March 2000.)，接受试验的人是从18岁到40岁的普通男女24名。The subjective quality evaluation method uses the DSCQS method often used in the performance evaluation tests of multimedia digital signal codecs such as MPEG verification (refer to ITU-R BT.500-10, "Methodology first subjective assessment of the quality of television pictures" March 2000.), the people who received the test were 24 ordinary men and women from the age of 18 to 40.

影像质量评价装置的目标推定精度，做成等于在评价中评测点的分散的程度是合适的。在把主观评价值的分散的程度作为单侧95％的可信区间导出时，在全部数据中是7.24。因此，作为影像质量评价装置的目标推定精度，以平均方差(RMSE)不超过该值作为目标。It is appropriate for the target estimation accuracy of the image quality evaluation device to be equal to the degree of dispersion of evaluation points in the evaluation. When the degree of dispersion of the subjective evaluation values was derived as a one-sided 95% confidence interval, it was 7.24 in all data. Therefore, as the target estimation accuracy of the video quality evaluation device, the mean square error (RMSE) should not exceed this value.

分别在图10以及图11中表示在现有的影像质量评价装置中、一般使用的、使用采用了PSNR(Peak Signal Noise Ratio：峰值SN比)和作为ANSI的参数已知的边缘电力量(Ave_EE)的数据、对于学习数据推定出的结果。图10是在使用了PSNR的场合的推定结果，图11是在使用了边缘电力量(Ave_EE)的场合的推定结果。根据图10以及图11，PSNR的RMSE是9.57，边缘电力量(Ave_EE)的RMSE是7.47，比之PSNR，边缘电力量(Ave_EE)一方表示出良好的特性，但是RMSE超过目标值，推定精度是不充分的。In FIG. 10 and FIG. 11, respectively, in the conventional image quality evaluation device, generally used, PSNR (Peak Signal Noise Ratio: peak SN ratio) and edge power known as ANSI parameters (Ave_EE ) data, the results estimated for the learning data. FIG. 10 shows the estimation results when PSNR is used, and FIG. 11 shows the estimation results when marginal electric power (Ave_EE) is used. According to Figure 10 and Figure 11, the RMSE of PSNR is 9.57, and the RMSE of marginal power (Ave_EE) is 7.47. Compared with PSNR, marginal power (Ave_EE) shows good characteristics, but RMSE exceeds the target value, and the estimation accuracy is inadequate.

图12以及图13表示使用使用了边缘电力量(Ave_EE)、水平垂直边缘量的最小值(Min_HV)、Ave_MEB全体的第二以及第三实施形态的影像质量评价装置，对于学习数据、验证数据的推定结果。根据图12以及图13，因为学习数据的RMSE是6.43，验证数据的RMSE是6.49，所以，不仅对于学习数据，就是对于作为非学习数据的验证数据也能实现足够的目标推定精度。因此，显见，第二以及第三实施形态的影像质量评价装置，具有能够代替人们看见实际影像后评价其质量的主观评价质量试验使用的推定精度，达到了实用水平。FIG. 12 and FIG. 13 show the video quality evaluation apparatuses of the second and third embodiments using the edge power amount (Ave_EE), the minimum value of the horizontal and vertical edge amount (Min_HV), and the entire Ave_MEB, for the learning data and the verification data. presumptive result. 12 and 13, since the RMSE of the learning data is 6.43 and the RMSE of the verification data is 6.49, sufficient target estimation accuracy can be achieved not only for the learning data but also for the verification data which is non-learning data. Therefore, it is obvious that the image quality evaluation devices of the second and third embodiments have the estimated accuracy that can be used instead of the subjective evaluation quality test used by people to evaluate the quality of an actual image after seeing it, and have reached a practical level.

图14是表示仅使用边缘电力量(Ave_EE)得到的学习数据的一部分的推定结果的图。如图14所示，显见，在输入影像[3]、[6]、[7]、[9]处评测点的倾斜度大，尽管客观评价值几乎不变化，主观评价值却变化很大，仅用边缘电力量(Ave_EE)捕捉不尽在这些影像中发生的恶化。FIG. 14 is a diagram showing estimation results of a part of learning data obtained using only the marginal electric power amount (Ave_EE). As shown in Figure 14, it is obvious that the inclinations of the evaluation points at the input images [3], [6], [7], [9] are large, and although the objective evaluation value hardly changes, the subjective evaluation value changes greatly. The degradation that occurs in these images cannot be fully captured with the marginal power level (Ave_EE).

图15是表示水平垂直边缘量的最小值(Min_HV)和主观评价值的关系的图。如图15所示，可知，根据水平垂直边缘量的最小值(Min_HV)，在输入影像[3]、[7]中，敏感地捕捉该恶化。FIG. 15 is a diagram showing the relationship between the minimum value (Min_HV) of the horizontal and vertical edge amounts and subjective evaluation values. As shown in FIG. 15 , it can be seen that the deterioration is sensitively captured in the input images [3] and [7] according to the minimum value (Min_HV) of the horizontal and vertical edge amounts.

图16是表示块平均运动电力量(Ave_MEB)和主观评价值的关系的图。如图16所示，可知，根据块平均运动电力量(Ave_MEB)，在输入影像[6]、[9]中，敏感地捕捉该恶化。Fig. 16 is a diagram showing the relationship between block average motion electric power (Ave_MEB) and subjective evaluation values. As shown in FIG. 16 , it can be seen that the deterioration is sensitively captured in the input images [6] and [9] according to the block average motion electric quantity (Ave_MEB).

如上所述，根据本发明的第二以及第三实施形态，作为影像的物理特征量，除了捕捉在边缘区域发生的恶化的参数之外，通过追加两个补充精度不足的独自参数，可以把在特定的影像中保持推定精度的技术扩展到任意影像。As described above, according to the second and third embodiments of the present invention, in addition to the parameter capturing the deterioration occurring in the edge region as the physical feature value of the image, by adding two unique parameters that supplement the lack of accuracy, it is possible to use the The technique of maintaining estimation accuracy in a specific image is extended to arbitrary images.

<第四实施形态><Fourth Embodiment>

下面说明涉及影像匹配装置、影像匹配方法以及影像匹配程序的发明的实施形态。Embodiments of the invention relating to the video matching device, video matching method, and video matching program will be described below.

图17是表示本发明的影像匹配装置的实施形态的结构的框图。Fig. 17 is a block diagram showing the configuration of an embodiment of the video matching device of the present invention.

第四实施形态的影像匹配装置，至少具有格式变换部41、显示定时匹配部42、同步/位置匹配部43、亮度/颜色修正部44、以及恶化量导出部45。The video matching device of the fourth embodiment includes at least a format conversion unit 41 , a display timing matching unit 42 , a synchronization/position matching unit 43 , a brightness/color correction unit 44 , and a deterioration amount deriving unit 45 .

格式变换部41，使在由于编码或者在网络中的损失恶化后的恶化影像信号中包含的文件形式，与在基准影像信号中包含的基准影像的形式吻合。显示定时匹配部42，使基准影像信号和恶化影像信号的影像显示定时吻合。同步/位置匹配部43，一边从恶化量导出部45取得恶化量或者同步偏离信息，一边取得基准影像信号和恶化影像信号的时间空间方向的匹配。另外，根据需要，亮度/颜色修正部44，对于基准影像信号和恶化影像信号的亮度或者颜色分布的不同进行修正。The format conversion unit 41 matches the file format included in the degraded video signal degraded by encoding or loss in the network with the format of the reference video included in the reference video signal. The display timing matching unit 42 matches the video display timings of the reference video signal and the degraded video signal. The synchronization/position matching unit 43 obtains the time-space matching between the reference video signal and the degraded video signal while acquiring the degradation amount or out-of-synchronization information from the degradation amount derivation unit 45 . In addition, the luminance/color correction unit 44 corrects the difference in luminance or color distribution between the reference video signal and the degraded video signal as necessary.

此外，在该实施形态中，在基准影像信号和恶化影像信号中，包含帧速率信息或者帧显示时刻/取入时刻信息。另外根据需要，还包含信号格式信息。由此，该影像匹配装置，对于基准影像以及恶化影像，一边在存储器中存储对象帧及其前后数帧，一边进行非实时处理。In addition, in this embodiment, the reference video signal and the degraded video signal include frame rate information or frame display time/capture time information. In addition, if necessary, signal format information is also included. As a result, the video matching device performs non-real-time processing on the reference video and the degraded video while storing the target frame and several frames before and after it in the memory.

图18是表示本发明的影像匹配装置的实施形态的动作步骤的流程图。Fig. 18 is a flowchart showing the operation procedure of the embodiment of the video matching device of the present invention.

首先，格式变换部41，在基准影像和恶化影像的信号形式、尺寸、长宽比不同的场合，进行恶化影像的信号形式的变换(步骤S1)。例如，若基准影像的数据形式是非压缩的YUV形式，恶化影像的数据形式是是非压缩的RGB形式的话，则使用Rec.ITU-R BT.601”STUDIO ENCODINGPARAMETERS OG DIGITAL TELEVISION FOR SDANDART 4:3 ANDWIDE-SCREEN 16:9 ASPECT RATIOS”的变换式变换恶化影像。另外，格式变换部41，起初在恶化影像是压缩形式的场合事前要变换为非压缩形式。进而，格式变换部41，在尺寸和长宽比不同的场合，进行变换以使它们变成相同。例如，有时可以简单地作为整数倍计算，但是在仅这样做还不够的场合，需要变换为任意尺寸，在这一场合，使用周知的方法进行向任意尺寸的变换(例如参照，Muramatsu S.and Kiya H.：”Scale Factor of Resolution ConversionBased on Orthogonal Transforms，”IEICE Trans.Fundamentals.，E76-A，7，PP.1150-1153(July 1993)，或者松村正吾、贵家仁志，“对于变化编码的图像的任意的有理数倍率的分辨率变换法”电子信息通信学会论文集A，Vol.77-A，No.3，pp.369-378，March 1994.)。其后，格式变换部41，把基准影像信号和变换后的恶化影像信号转交给显示定时匹配部42。First, the format conversion unit 41 converts the signal format of the degraded video when the signal format, size, and aspect ratio of the reference video and the degraded video are different (step S1). For example, if the data format of the reference image is uncompressed YUV format and the data format of the degraded image is uncompressed RGB format, use Rec.ITU-R BT.601”STUDIO ENCODINGPARAMETERS OG DIGITAL TELEVISION FOR SDANDART 4:3 ANDWIDE- SCREEN 16:9 ASPECT RATIOS" transformation transform degrades the image. In addition, the format conversion unit 41 first converts the degraded video to an uncompressed format when it is in a compressed format. Furthermore, the format conversion unit 41 performs conversion so that the sizes and aspect ratios are different when they are different. For example, sometimes it can simply be calculated as an integer multiple, but in the case where this is not enough, it needs to be converted to an arbitrary size. In this case, a known method is used to perform conversion to an arbitrary size (for example, refer to Muramatsu S. and Kiya H.: "Scale Factor of Resolution Conversion Based on Orthogonal Transforms," IEICE Trans. Fundamentals., E76-A, 7, PP.1150-1153 (July 1993), or Matsumura Shogo, Takaya Hitoshi, "For Change Coding The resolution transformation method of arbitrary rational number magnification of the image "Proceedings of the Electronic Information and Communication Society A, Vol.77-A, No.3, pp.369-378, March 1994.). Thereafter, the format conversion unit 41 transfers the reference video signal and the converted degraded video signal to the display timing matching unit 42 .

接着，显示定时匹配部42，为使由格式变换部41格式变换过的恶化影像信号与基准影像信号的显示定时吻合，进行帧的补齐等的处理(步骤S2)。Next, the display timing matching unit 42 performs processing such as frame complementation to match the display timing of the degraded video signal format-converted by the format converting unit 41 with the reference video signal (step S2).

具体说，如图19(a)所示，在恶化影像的显示间隔比基准影像的显示间隔恒定长的场合，显示定时匹配部42，如图19(b)的下段所示，通过补齐全帧对于恶化影像补充影像。Specifically, as shown in FIG. 19(a), when the display interval of the degraded image is constant longer than the display interval of the reference image, the display timing matching unit 42, as shown in the lower part of FIG. 19(b), complements the full frame Supplementary images for degraded images.

另外，如图19(b)所示，在恶化影像的显示间隔波动的场合，显示定时匹配部42，如图19(b)的下段所示，由与基准影像同步的显示定时，用紧接恶化影像之前的影像补充恶化影像。此外，此时，也可以做成不是用恶化影像之前的影像补充，而是用恶化影像中时间上接近的影像进行补充。即，例如在图19(b)的场合，作为变换恶化影像的第二帧补充了帧I，但是假定如果用恶化影像中时间上接近的影像补充的话，则要补充帧II。In addition, as shown in FIG. 19(b), when the display interval of the degraded video fluctuates, the display timing matching unit 42, as shown in the lower part of FIG. The image preceding the degraded image complements the degraded image. In addition, at this time, instead of supplementing the degraded video with a video preceding the degraded video, it may be supplemented with a temporally close video among the degraded video. That is, for example, in the case of FIG. 19( b ), frame I is supplemented as the second frame of the converted degraded video, but frame II is supposed to be supplemented with temporally close video among the degraded video.

进而，如图19(c)所示，在基准影像信号自身不是固定的帧速率的场合，则要补齐成以和它们不同的正确的时间间隔来显示基准影像信号和恶化影像信号。Furthermore, as shown in FIG. 19(c), when the reference video signal itself does not have a constant frame rate, it must be complemented so that the reference video signal and the degraded video signal are displayed at different correct time intervals.

其后，显示定时匹配部42，把基准影像信号和恶化影像信号转交给同步/位置匹配部43。Thereafter, the display timing matching unit 42 transfers the reference video signal and the degraded video signal to the synchronization/position matching unit 43 .

同步/位置匹配部43，设想(1)评价开始状态，(2)同步状态，(3)冻结状态的3种状态后，规定其动作。The synchronization/position matching unit 43 assumes three states (1) the evaluation start state, (2) the synchronization state, and (3) the freeze state, and prescribes its operation.

(1)首先，在评价开始状态中，同步/位置匹配部43，为取得时间方向的宏的匹配，比较对于一定时间的基准影像信号和恶化影像信号的1帧全体或其特定区域的信号的亮度/色差/RGB值的平均值那样的特征量的推移，取得匹配性成为最高的帧信息，由此，导出宏的时间方向的偏离(步骤S3)。具体说，如图20所示，使特征量在时间方向偏离，从各时间系列值的差为最小，或者互相关系数为最大的条件导出时间方向的偏离(帧差)。(1) First, in the evaluation start state, the synchronization/position matching unit 43 compares the reference video signal and the degraded video signal for a certain period of time in order to obtain a macro match in the time direction, for the entire frame or a specific region of the signal of the degraded video signal. Changes in feature quantities such as luminance/color difference/average values of RGB values are obtained to obtain frame information with the highest matching performance, thereby deriving the deviation in the time direction of the macro (step S3). Specifically, as shown in FIG. 20 , the feature values are shifted in the time direction, and the shift in the time direction (frame difference) is derived from the condition that the difference between the time-series values is the smallest or the correlation coefficient is the largest.

另外，同步/位置匹配部43，为取得微的时间空间方向的匹配，通过恶化量导出部45在基准影像中对于取得了匹配的影像的前后数帧，如图21所示，使恶化影像在上下左右移动像素，把差值为最小的位置作为取得了最匹配的位置，作为像素位置信息接收(步骤S4)。In addition, the synchronization/position matching unit 43 uses the degradation amount deriving unit 45 to make the degraded image in the reference image for several frames before and after the matched image as shown in FIG. The pixels are moved up, down, left, and right, and the position with the smallest difference is regarded as the position where the best match is obtained, and received as pixel position information (step S4).

进而，同步/位置匹配部43，为进行亮度/颜色修正向亮度/颜色修正部44转交取得了匹配的基准影像和恶化影像。亮度/颜色修正部44，使恶化影像的亮度以及颜色信息的平均值、最小值和最大值以及它们的分布，与基准影像的这些取得匹配。例如，在基准影像和恶化影像的亮度分布不同的场合，亮度/颜色修正部44，根据基准影像和恶化影像的亮度分布的各自的平均值和方差值，对于基准影像和恶化影像的亮度分布施行线性标准化，把关于为进行该线性标准化的变换式的信息作为修正信息转交给同步/位置匹配部43。同步/位置匹配部43，接收该修正信息，根据接收的修正信息进行亮度/颜色修正处理(步骤S5)。Furthermore, the synchronization/position matching unit 43 transfers the matched reference video and degraded video to the brightness/color correction unit 44 for brightness/color correction. The luminance/color correction unit 44 matches the average value, minimum value, and maximum value of the luminance and color information of the degraded image, and their distribution with those of the reference image. For example, when the luminance distributions of the reference image and the degraded image are different, the luminance/color correction unit 44 adjusts the luminance distributions of the reference image and the degraded image based on the respective average values and variance values of the luminance distributions of the reference image and the degraded image. Linear normalization is performed, and information on the conversion formula for this linear normalization is passed to the synchronization/position matching unit 43 as correction information. The synchronization/position matching unit 43 receives the correction information, and performs brightness/color correction processing based on the received correction information (step S5).

以下，作为(2)同步状态(3)冻结状态中的处理，同步/位置匹配部43，进行图18所示的步骤S6到步骤S22的处理。Hereinafter, the synchronization/position matching unit 43 performs the processing from step S6 to step S22 shown in FIG. 18 as processing in (2) the synchronization state (3) the freeze state.

在基准影像对象帧号码i比基准影像最终帧数N小，或者恶化影像对象帧号码j比恶化影像最终帧数M小时，执行以下的步骤S7到步骤S22的处理(步骤S6)。When the target frame number i of the reference video is smaller than the final frame number N of the reference video, or the target frame number j of the degraded video is smaller than the final frame number M of the degraded video, the following steps S7 to S22 are executed (step S6 ).

于是，首先，判断F1是否是1，即基准影像和恶化影像是否处于非同步状态(步骤S7)。在基准影像和恶化影像处于同步状态的场合(F1＝0)，转移到步骤S8，在基准影像和恶化影像处于非同步状态的场合(F1＝1)，转移到步骤S16。Therefore, first, it is judged whether F1 is 1, that is, whether the reference image and the deteriorated image are in an asynchronous state (step S7). If the reference image and the degraded image are synchronized (F1=0), go to step S8, and if the reference image and degraded video are not synchronized (F1=1), go to step S16.

首先，以下说明基准影像和恶化影像处于同步状态的场合(F1＝0)。First, the case where the reference image and the degraded image are synchronized (F1=0) will be described below.

同步/位置匹配部43，向恶化量导出部45转交在上述评价开始状态得到的、时间空间方向的偏离、或者修正了亮度/颜色信息的恶化影像和基准影像。恶化量导出部45，对于恶化影像的对象帧、把对应的基准影像的帧及其前后数帧作为对象进行微同步处理(参照图21)，在导出由此得到的恶化量的同时，把基准影像和恶化影像各自的前一个帧之间的帧间差值作为恶化量导出(步骤S8)。同步/位置匹配部43，接收由恶化量导出部45导出的那些恶化量，在那些恶化量中，根据帧间差值、判断恶化影像是否是冻结状态(步骤S9)。亦即，相对于关于基准影像的差值表示出某值，而恶化影像的差值如果表示出几近于0，则同步/位置匹配部43判断恶化影像是冻结状态。The synchronization/position matching unit 43 transfers to the degradation amount deriving unit 45 the degraded image and the reference image obtained in the above-mentioned evaluation start state, the deviation in the time-space direction, or the luminance/color information corrected. The deterioration amount deriving unit 45 performs micro-synchronization processing (refer to FIG. The frame-to-frame difference between the previous frames of the video and the degraded video is derived as a degradation amount (step S8). The synchronization/position matching unit 43 receives the degradation amounts derived by the degradation amount deriving unit 45, and judges whether the degraded video is frozen based on the inter-frame difference among the degradation amounts (step S9). That is, if the difference with respect to the reference image shows a certain value, and the difference with the degraded image is close to 0, the synchronization/position matching unit 43 judges that the degraded image is frozen.

在恶化影像是冻结状态时，同步/位置匹配部43，在设定为非同步状态(F1＝1)的同时，把冻结数作为“1”(count＝1)(步骤S10)，转移到步骤S22。When the degraded image is in the frozen state, the synchronization/position matching unit 43 sets the frozen state to "1" (count=1) while setting it to the asynchronous state (F1=1) (step S10), and transfers to step S10. S22.

另一方面，在恶化影像不是冻结状态时，接着，同步/位置匹配部43，判断在步骤S8通过恶化量导出部45的微同步处理得到的、恶化影像的对象帧的恶化量是否最小(步骤S11)。在恶化影像的对象帧的恶化量是最小时，设定为同步状态(F2＝0)(步骤S12)，转移到步骤S14。另一方面，在恶化影像的对象帧的恶化量不是最小时，判定为帧偏离状态(跳帧状态，返帧状态)，转移到步骤S13，把恶化量最小的帧号码设定为j值，若是跳帧的话设定为F2＝1，否则(返帧状态等)设定为F2＝2，其时的帧的偏离数设定为Count值。On the other hand, when the degraded video is not in the frozen state, then the synchronization/position matching unit 43 judges whether the degraded amount of the target frame of the degraded video obtained by the micro-synchronization process of the degraded amount deriving unit 45 in step S8 is the smallest (step S11). When the degradation amount of the target frame of the degraded video is the minimum, the synchronization state is set (F2=0) (step S12), and the process proceeds to step S14. On the other hand, when the amount of deterioration of the target frame of the degraded image is not the smallest, it is determined that it is a frame deviation state (frame skipping state, frame returning state), and transfers to step S13, and the frame number with the smallest amount of deterioration is set as the j value, If it is frame skipping, set F2=1, otherwise (frame return state, etc.) set F2=2, and set the number of frame deviations at that time as the Count value.

在步骤S14，在输出基准影像和恶化影像的同时输出F2值和Count值。其后，把Count值复位为0，转移到步骤S22。In step S14, the F2 value and the Count value are output together with the output of the reference image and the deteriorated image. Thereafter, the Count value is reset to 0, and the process moves to step S22.

接着，以下说明在步骤S7被判断为了基准影像和恶化影像处于非同步状态(F1＝1)的场合。Next, the case where it is judged in step S7 that the reference video and the degraded video are out of synchronization (F1=1) will be described below.

同步/位置匹配部43，在步骤S16，和步骤S8同样，对于恶化影像的对象帧，从恶化量导出部45接收通过把对应的基准影像的帧及其前后数帧作为对象进行的微同步处理得到的恶化量，同时，将与基准影像和恶化影像各自前一帧之间的帧间差值作为恶化量接收。同步/位置匹配部43，和步骤S9相同，在这些恶化量中，根据帧间差值，判断恶化影像是否是冻结状态(步骤S17)。The synchronization/position matching unit 43, in step S16, similarly to step S8, receives from the degradation amount deriving unit 45 the micro-synchronization process performed on the frame of the corresponding reference video and its several frames before and after it as the object. To obtain the deterioration amount, at the same time, the frame-to-frame difference between the reference image and the previous frame of the deteriorated image is received as the deterioration amount. The synchronization/position matching unit 43, similar to step S9, judges whether or not the degraded video is in a frozen state based on the difference between frames among these degraded amounts (step S17).

在恶化影像是冻结状态时，同步/位置匹配部43将冻结数(count值)加1(步骤S18)，转移到步骤S22。When the degraded video is frozen, the synchronization/position matching unit 43 increments the frozen count (count value) by 1 (step S18 ), and proceeds to step S22 .

另一方面，在恶化影像不是冻结状态时，同步/位置匹配部43判断冻结状态结束，进行和步骤S3同样的宏时刻同步处理(步骤S19)、输出冻结数(count值)(步骤S20)。其后，把F1值以及count值复位为0，转移到步骤S22。On the other hand, when the degraded image is not in the frozen state, the synchronization/position matching unit 43 judges that the frozen state is over, performs the same macro time synchronization process as in step S3 (step S19), and outputs the frozen number (count value) (step S20). Thereafter, the F1 value and the count value are reset to 0, and the process proceeds to step S22.

在步骤S22，将基准影像对象帧号码i以及恶化影像对象帧号码j加1，在该i值达到基准影像最终帧数N、而且该j值达到恶化影像最终帧数M的场合，处理结束。In step S22, add 1 to the target frame number i of the reference video and the target frame number j of the degraded video. When the value of i reaches the final frame number N of the reference video and the value j reaches the final frame number M of the degraded video, the process ends.

此外，假定总是在一定时间，对于基准影像信号和恶化影像信号，积蓄有1帧全部或者特定区域的亮度/色差/RGB值的平均值那样的特征量，而由于某种原因未能取同步时，例如在成为对象的帧中的特征量在离开一定时期的平均值某阈值(如3σ那样的标准偏差的常数倍)以上发生了断开时，认为进入(1)的评价初期状态，从步骤S3重新开始处理。In addition, it is assumed that the reference video signal and the degraded video signal have a feature quantity such as the average value of luminance/color difference/RGB values in the entire frame or in a specific area for a certain period of time, and synchronization is not possible for some reason. For example, when the feature quantity in the target frame is separated from the average value for a certain period by a certain threshold value (a constant multiple of the standard deviation such as 3σ), it is considered to enter the evaluation initial state of (1), and from step S3 restarts processing.

以上是本发明的实施形态的影像匹配处理的详情。The above are the details of the video matching processing according to the embodiment of the present invention.

根据本实施形态，在执行影像的格式变换的同时，通过把宏匹配处理和微匹配处理作为基础时常进行同步处理，即使在个人计算机终端中接收影像时接收的影像的尺寸或者长宽比不同的场合，或者在由于如数据包损失那样某程度集中的信息损失而不能取得基准影像和恶化影像的空间位置的对应的场合，或者进而由于IP数据包到达间隔的波动或数据包损失的发生、在时间轴的影像显示的定时偏离或者影像显示定时的波动或者冻结这样的历来不存在的现象发生的场合，都能够适当地取得基准影像和恶化影像的时间方向和空间方向的匹配。According to the present embodiment, by performing the format conversion of the video and performing synchronous processing at all times based on the macro-matching process and the micro-matching process, even if the received video is different in size or aspect ratio when the personal computer terminal receives the video, Occasionally, or when the correspondence of the spatial positions of the reference image and the deteriorated image cannot be obtained due to a certain degree of concentrated information loss such as packet loss, or further due to fluctuations in the arrival interval of IP packets or the occurrence of packet loss, in Even when the timing of video display on the time axis deviates, or phenomena such as fluctuations in video display timing or freezes that have never existed before occur, it is possible to appropriately match the time direction and space direction of the reference video and the degraded video.

但是，在本发明的影像匹配方法中，设想为把从同步/位置匹配部43输出的基准影像和恶化影像及其附属信息(时间方向的状态)输入影像质量评价装置中(例如被输入第一实施形态的影像质量评价装置的主观质量推定部11)，但是，也可以在恶化量导出部45中，不是导出恶化量而是导出客观评价值，若通过同步/位置匹配部43输出其结果，也可以作为影像质量评价装置使用。However, in the video matching method of the present invention, it is assumed that the reference video output from the synchronization/position matching unit 43, the degraded video and its auxiliary information (state in the time direction) are input into the video quality evaluation device (for example, the first In the subjective quality estimation unit 11) of the video quality assessment device of the embodiment, however, in the degradation amount derivation unit 45, the objective evaluation value may be derived instead of the degradation amount, and if the result is output by the synchronization/position matching unit 43, It can also be used as an image quality evaluation device.

此外，在以上说明的各实施形态中，在其处理步骤中表示的指示，可以基于作为软件的程序来执行。通用计算机系统，预先存储好该程序，通过读入该程序，可以让该通用计算机系统作为影像质量评价装置以及影像匹配装置来工作。上述各实施形态中记述的指示，作为可以让计算机执行的程序，记录在磁盘(软盘，硬盘等)、光盘(CD-ROM，CD-R，CD-RW，DVD-ROM，DVD±R，DVD±RW等)、半导体内存储器、或者与其类似的记录媒体上。计算机从该记录媒体读入程序，若根据该程序由CPU执行在程序中记述的指示，则计算机就作为上述实施形态的影像质量评价装置以及影像匹配装置来动作。In addition, in each embodiment described above, the instructions shown in the processing steps may be executed based on a program as software. The general-purpose computer system stores the program in advance, and by reading the program, the general-purpose computer system can be made to work as an image quality evaluation device and an image matching device. The instructions described in the above-mentioned embodiments are recorded on magnetic disks (floppy disks, hard disks, etc.), optical disks (CD-ROM, CD-R, CD-RW, DVD-ROM, DVD±R, DVD-ROM, etc.) ±RW, etc.), semiconductor memory, or similar recording media. The computer reads the program from the recording medium, and when the CPU executes the instructions described in the program based on the program, the computer operates as the image quality evaluation device and the image matching device of the above-mentioned embodiment.

此外，本发明并不限定于上述实施形态的照搬，而可以在实施阶段在不脱离其要旨的范围内变化结构要素来具体化。另外，根据在上述实施形态中公开的多个结构要素的适当的组合，可以形成各种发明。例如，也可以从实施形态中所示的全体结构要素中删除几个结构要素。进而，也可以适当组合涉及不同实施形态的结构要素。In addition, the present invention is not limited to the copying of the above-mentioned embodiments, but can be realized by changing the constituent elements within a range that does not deviate from the gist at the stage of implementation. In addition, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the above embodiments. For example, some structural elements may be deleted from all the structural elements shown in the embodiments. Furthermore, constituent elements related to different embodiments may be appropriately combined.

根据本发明的影像质量评价装置、影像质量评价方法以及影像质量评价系统，即使是未知的基准影像，也可以对于任意影像高精度地推定统一的主观质量。According to the video quality evaluation device, video quality evaluation method, and video quality evaluation system of the present invention, even if it is an unknown reference video, a unified subjective quality can be estimated with high accuracy for any video.

具体说，根据本发明的影像质量评价装置、影像质量评价方法以及影像质量评价系统，从基准影像的物理特征量求取针对该影像的人们的视觉特性，作为对于基准影像特征量的修正信息进行数据库化，通过对于从基准影像和恶化影像的物理特征量的差导出的推定主观质量以该修正信息附加权重，就能对于任意影像以和现有方法的主观评价法同等的精度统一地导出主观质量。Specifically, according to the video quality evaluation device, video quality evaluation method, and video quality evaluation system of the present invention, the visual characteristics of people for the video are obtained from the physical feature data of the reference video, and are used as correction information for the reference video feature data. By creating a database and adding weights to the correction information on the estimated subjective quality derived from the difference between the physical feature values of the reference image and the deteriorated image, the subjective quality can be uniformly derived for any image with the same accuracy as the existing subjective evaluation method. quality.

另外，作为影像的物理特征量，除了捕捉在边缘区域发生的恶化的参数之外，通过追加弥补精度不足的两个独自参数，就可以向任意的影像扩展在特定的影像中保持推定精度的技术。In addition, by adding two original parameters to compensate for the lack of accuracy as physical feature values of images, in addition to the parameters that capture the deterioration that occurs in the edge area, the technology that maintains the estimation accuracy in a specific image can be extended to any image. .

根据本发明的影像匹配装置、影像匹配方法以及影像匹配程序，在主观质量的推定中，在比较基准影像信号和恶化影像信号间的物理特征量时，可以确实地取得它们的空间的以及时间的位置的匹配。According to the image matching device, image matching method, and image matching program of the present invention, when comparing the physical feature values between the reference image signal and the degraded image signal in the estimation of subjective quality, their spatial and temporal characteristics can be reliably obtained. location match.

具体说，在进行影像的格式变换的同时，通过经常以宏匹配处理和微匹配处理为基础进行同步匹配，即使在个人计算机终端中接收影像时接收的影像的尺寸或者长宽比不同的场合，或者在由于如数据包损失那样某程度集中的信息损失而不能取得基准影像和恶化影像的空间位置的对应的场合，或者进而由于IP数据包到达间隔的波动或者数据包损失的发生、在时间轴的影像显示的定时偏离或者影像显示定时的波动或者冻结这样的历来不存在的现象发生的场合，都能够适当地取得基准影像和恶化影像的时间方向和空间方向的匹配。Specifically, by performing synchronous matching on the basis of macro-matching processing and micro-matching processing while performing video format conversion, even if the size or aspect ratio of the video received by the personal computer terminal is different, Or when the correspondence of the spatial positions of the reference image and the degraded image cannot be obtained due to a certain degree of concentrated information loss such as packet loss, or further due to fluctuations in the arrival interval of IP packets or the occurrence of packet loss, in the time axis Even when a phenomenon such as a deviation in timing of image display, fluctuation in image display timing, or freezing occurs, it is possible to appropriately match the time direction and space direction of the reference image and the degraded image.

因此，根据以这样的影像匹配装置以及方法为前提的影像质量评价装置以及方法，即使是在上述各场合也能够进行影像质量的客观评价，可以提高其精度。Therefore, according to the video quality evaluation device and method based on such a video matching device and method, it is possible to perform objective evaluation of video quality even in the above-mentioned cases, and the accuracy thereof can be improved.

Claims

1. a video quality evaluation device is characterized in that,

Have:

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) after this reference images signal degradation, calculate the signal of video signal characteristic quantity of two signals, poor according to the signal of video signal characteristic quantity of two signals that calculate inferred the subjective quality of the subjective quality of described deterioration signal of video signal and infers portion (11);

Be used to revise the update information storage part (13) of the update information of described subjective quality with the storage that is mapped of signal of video signal characteristic quantity; With

Import the described signal of video signal characteristic quantity of described reference images signal, obtain and the corresponding update information of having imported of signal of video signal characteristic quantity from described update information storage part (13), subjective quality correction portion (12 according to the described subjective quality of obtaining of inferring out of update information correction, 14,15).

2. video quality evaluation device according to claim 1 is characterized in that,

Described subjective quality is inferred portion (11), according to the spatial information (SI) of the image state in each frame that is illustrated in the image that comprises in the signal of video signal and be illustrated in poor between any one at least deterioration signal of video signal in the temporal information (TI) that the image of interframe changes and reference images signal, infer described subjective quality.

3. video quality evaluation device according to claim 2 is characterized in that,

Described subjective quality is inferred portion (11), poor as the described signal of video signal characteristic quantity between described deterioration signal of video signal and described reference images signal, computational chart be shown in the deterioration amount in each frame of the image that comprises in the signal of video signal edge electric power amount (E), be illustrated at least one side of mobile electric power amount (M) of the deterioration amount of interframe.

4. video quality evaluation device according to claim 2 is characterized in that,

Described subjective quality is inferred portion (11) and is inferred described subjective quality according to the difference between, deterioration signal of video signal and reference images signal any one at least among the spatial information of regulation and the temporal information in P.910 at ITU-R Recommendation.

5. the described video quality evaluation device of claim 1 is characterized in that,

Update information storage part (13), as the update information that is used to revise described subjective quality, with the spatial information (SI) that is illustrated in the image state in each frame of image that comprises in the signal of video signal and be illustrated in temporal information (TI) that the image of interframe changes be mapped the storage correction factor (α, β).

6. video quality evaluation device according to claim 1 is characterized in that,

Described subjective quality is inferred portion (11) and is had:

Import described reference images signal (RI) and described deterioration signal of video signal (PI), generate about from the reference images frame of described reference images signal with from the calibration information generating unit (21) of the calibration information that departs from the space of the departing from of time of the deterioration image frame of described deterioration signal of video signal, described reference images frame and described deterioration image frame;

According to described calibration information, eliminating after described spatial offset and described time departs from, calculate the space characteristics amount calculating part (22) of space characteristics amount separately according to the spatial information of the image state of described reference images frame of expression and described deterioration image frame;

According to described calibration information, eliminating after described spatial offset and described time departs from, for described reference images frame and described deterioration image frame, calculate the temporal characteristics amount calculating part (23) of temporal characteristics amount separately according to the temporal information of the image variation of representing interframe separately; With

Infer the portion of inferring (24) of the subjective quality of described deterioration signal of video signal according to described space characteristics amount and described temporal characteristics amount.

7. video quality evaluation method, this method is to be pushed the update information (α of the subjective quality of making having with the signal of video signal characteristic quantity deterioration signal of video signal (PI) of storage after being used to revise reference images signal (RI) as the signal of video signal that worsens and worsening that be mapped, update information storage part (13) β), infer the video quality evaluation method in the video quality evaluation device of subjective quality of described deterioration signal of video signal, it is characterized in that

Import described reference images signal and described deterioration signal of video signal;

Calculate the signal of video signal characteristic quantity of two signals;

Poor according to the signal of video signal characteristic quantity of two signals of calculating inferred the described subjective quality of described deterioration signal of video signal;

From described update information storage part (13) obtain the update information corresponding with the signal of video signal characteristic quantity of the described reference images signal of calculating (α, β);

(α β) revises described subjective quality of inferring out according to the update information that obtains.

8. a video quality evaluation program is characterized in that,

Make computer as following cell operation:

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) of this reference images signal degradation, calculate the signal of video signal characteristic quantity of two signals, poor according to the signal of video signal characteristic quantity of two signals that calculate inferred the subjective quality of the subjective quality of described deterioration signal of video signal and infers unit (11);

With the signal of video signal characteristic quantity be mapped the storage be used to revise described subjective quality update information (α, update information memory cell (13) β),

Obtain the update information (α corresponding from described update information memory cell (13) with the signal of video signal characteristic quantity of the described reference images signal of calculating, β), subjective quality amending unit (12,14,15) according to the described subjective quality of obtaining of inferring out of update information correction.

9. a video quality evaluation device is characterized in that,

Have:

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) of this reference images signal degradation, and generation is about from the reference images frame of described reference images signal with from the calibration information generating unit (21) of the calibration information that departs from the space of the departing from of time of the deterioration image frame of described deterioration signal of video signal, described reference images frame and described deterioration image frame;

Infer portion (24) according to the subjective quality that described space characteristics amount and described temporal characteristics amount are inferred the subjective quality of described deterioration signal of video signal.

10. video quality evaluation device according to claim 9 is characterized in that,

Also have:

In the document form of the reference images that comprises in the described reference images signal (RI) that the document form of the deterioration image that comprises in described deterioration signal of video signal (PI) is transformed in correspondence, the formal argument portion (35) that exports the information related with its document form of unifying; With

Make its corresponding information related, store coefficient (α, beta, gamma, the correction factor storage part (36) δ) of the subjective quality that is used to infer described deterioration signal of video signal with described document form;

Described subjective quality is inferred portion (34), obtain described coefficient corresponding and from described correction factor storage part (36) from the related information of the described unified document form of described formal argument portion (35) input, according to described space characteristics amount, described temporal characteristics amount and the described coefficient of obtaining, infer the subjective quality of described deterioration signal of video signal.

11. video quality evaluation device according to claim 10 is characterized in that,

Described formal argument portion (35), as with the described unified related information of document form, in the amount of information of the deterioration image of exporting the signal form of described deterioration signal of video signal, sending by described deterioration signal of video signal, the coded system of described deterioration signal of video signal at least any one, described correction factor storage part, in the amount of information of the signal form of corresponding described deterioration signal of video signal, the deterioration image that sends by described deterioration signal of video signal and the coded system of described deterioration signal of video signal at least any one, coefficient (the α that storage is best, β, γ, δ).

12. according to claim 9 or 10 described video quality evaluation devices, it is characterized in that,

Described space characteristics amount calculating part (22), according to described reference images signal and described deterioration signal of video signal, the index of the deterioration that quantification brightness value in frame border jumpy is taken place is calculated as described space characteristics amount.

13. the described video quality evaluation device of claim 12 is characterized in that,

Described space characteristics amount calculating part (22), according to described reference images signal and described deterioration signal of video signal, the edge electric power amount (Ave_EE) that will stipulate in ANSI T1.801.03-1995 is calculated as described space characteristics amount.

14. claim 9,10 or 12 described video quality evaluation devices is characterized in that,

Described space characteristics amount calculating part (22), more described reference images frame calculates in the index (Min_HV) of the situation that brightness value border jumpy in should the described deterioration image frame of reference images frame is taken place on the horizontal vertical direction quantification as described space characteristics amount.

15. claim 9,10,12 or 14 described video quality evaluation devices is characterized in that,

Described space characteristics amount calculating part (23), according to each as frame in more than or equal to the piece of the set more than 1 pixel in reference images in Temporal Information value that ITU-R Recommendation is prescribed in P.910 with worsen the poor of Temporal Information value that image is prescribed in P.910 at ITU-R Recommendation, the variable quantity (Ave_MEB) of the interframe of image is calculated as described temporal characteristics amount.

16. claim 9 or 10 described video quality evaluation devices is characterized in that,

And then have:

Be used to revise the update information storage part (13) of the update information of described subjective quality with the storage that is mapped of space characteristics amount and temporal characteristics amount; With

Import the described space characteristics amount of described reference images signal (RI) and described temporal characteristics amount, obtain the space characteristics amount of corresponding input and the update information (α of temporal characteristics amount from described update information storage part (13), β), according to the subjective quality correction portion (12 of the described subjective quality of inferring of update information correction obtained, 14,15).

17. a video quality evaluation method is characterized in that,

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) after this reference images signal degradation;

Generation is about from the reference images frame of described reference images signal with from the calibration information that departs from the space of the departing from of time of the deterioration image frame of described deterioration signal of video signal, described reference images frame and described deterioration image frame;

According to described calibration information, eliminating after described spatial offset and described time departs from, calculate separately space characteristics amount according to the spatial information of the image state of described reference images frame of expression and described deterioration image frame;

According to described calibration information, having eliminated after described spatial offset and described time departs from, for described reference images frame and described deterioration image frame, the temporal information that changes according to the image of expression interframe is separately calculated temporal characteristics amount separately;

According to described space characteristics amount and described temporal characteristics amount, infer the subjective quality of described deterioration signal of video signal.

18. a video quality evaluation method is characterized in that,

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (RI) of this reference images signal degradation;

The document form of the deterioration image that in described deterioration signal of video signal, comprises, be transformed to the document form of the reference images that in the described reference images signal of correspondence, comprises;

Generate the information related with this document form of unifying;

Generation is about the calibration information that departs from the space between the departing from of the time between the deterioration image frame of the deterioration image that was transformed from the reference images frame of described reference images signal with from the document form of described deterioration signal of video signal, described reference images frame and described deterioration image frame;

According to described space characteristics amount, described temporal characteristics amount with (α, beta, gamma δ), are inferred the subjective quality of described deterioration signal of video signal corresponding to the described document form of unifying the coefficient that subjective quality infers of being used for related, information.

19. a video quality evaluation program is characterized in that,

Make computer as following cell operation:

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) of this reference images signal degradation, and generation is about from the reference images frame of described reference images signal with from the calibration information generation unit (21) of the calibration information that departs from the space between the departing from of the time between the deterioration image frame of described deterioration signal of video signal, described reference images frame and described deterioration image frame;

According to described calibration information, eliminating after described spatial offset and described time departs from, calculating the space characteristics amount computing unit (22) of space characteristics amount separately according to the spatial information of the image state of described reference images frame of expression and described deterioration image frame;

According to described calibration information, eliminating after described spatial offset and described time departs from, for described reference images frame and described deterioration image frame, calculate the temporal characteristics amount computing unit (23) of temporal characteristics amount separately according to the temporal information of the image variation of representing interframe separately; With

According to described space characteristics amount and described temporal characteristics amount, infer the subjective quality of the subjective quality of described deterioration signal of video signal and infer unit (24).

20. video quality evaluation program according to claim 19 is characterized in that,

Also make computer as following cell operation:

In the document form of the reference images that comprises in the described reference images signal (RI) that the document form of the deterioration image that comprises is transformed in correspondence, export the formal argument unit (35) of the information related with this document form of unifying in described deterioration signal of video signal (PI); With

The coefficient (α, beta, gamma, the correction factor memory cell (36) δ) that make it be mapped with the information related, store the subjective quality that is used to infer described deterioration signal of video signal with described document form;

Described subjective quality is inferred unit (34), from described correction factor memory cell (36), obtain with the related corresponding described coefficient of information of the described document form of unifying from described formal argument unit (35) input, according to described space characteristics amount, described temporal characteristics amount and the described coefficient of obtaining, infer the subjective quality of described deterioration signal of video signal.

21. an image coalignment is characterized in that,

Have:

Input is as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) of this reference images signal degradation, the document form of the deterioration image that in described deterioration signal of video signal, comprises, be transformed to the formal argument portion (41) of the document form of the reference images that in the described reference images signal of correspondence, comprises;

The number of the deterioration image frame that comprises in reference images frame that coupling comprises in described reference images signal and the described deterioration signal of video signal and the Displaying timer matching part (42) of Displaying timer; With

For described reference images frame and described deterioration image frame to several frames before and after picture frame and its, departing from or the frozen state of described deterioration image of the frame of supervision between described reference images frame and described deterioration image frame, and obtain the corresponding relation of interframe and the corresponding relation between pixel coupling synchronously/location matches portion (43).

22. image coalignment according to claim 21 is characterized in that,

Described formal argument portion (41), at least one makes with described reference images and coincide in the data mode of the described deterioration image of conversion, size, the length-width ratio.

23. image coalignment according to claim 21 is characterized in that,

Described Displaying timer matching part (42) in the described reference images frame occasion different with the frame rate of described deterioration image frame, by interpolation or delete described deterioration image frame, coincide the frame rate of described reference images frame and described deterioration image frame.

24. image coalignment according to claim 21 is characterized in that,

Described Displaying timer matching part (42), for the Displaying timer of described reference images frame, the Displaying timer of the described deterioration image frame of harmonizing.

25. image coalignment according to claim 21 is characterized in that,

Described Displaying timer matching part (42) in the dissimilar occasion of the Displaying timer of described reference images frame, is taken as preset time to described reference images frame and described deterioration image frame both sides' Displaying timer at interval.

26. image coalignment according to claim 21 is characterized in that,

Described synchronously/location matches portion (43), for described reference images frame and described deterioration image frame to several frames before and after picture frame and its, the passing of the characteristic quantity separately of whole frames or its specific region, the grand Synchronous Processing of the corresponding relation that departs from the time that becomes minimum described reference images frame and described deterioration image frame of the described characteristic quantity of execution decision.

27. image coalignment according to claim 26 is characterized in that,

Described synchronously/location matches portion (43), for described reference images frame and described deterioration image frame to several frames before and after picture frame and its, move the corresponding time relationship and the pixel corresponding relation of described reference images frame and described deterioration image frame on one side, relatively whole characteristic quantity separately of frames or its specific region on one side, the difference of carrying out the described characteristic quantity of decision become the corresponding relation of time of minimum described reference images frame and described deterioration image frame and pixel corresponding relation little synchronously/location matches handles.

28. image coalignment according to claim 27 is characterized in that,

Described synchronously/location matches portion (43), the initial stage carry out described grand Synchronous Processing and described little synchronously/location matches handles.

29. image coalignment according to claim 28 is characterized in that,

Described synchronously/location matches portion (43), be absorbed in the occasion of frozen state at described deterioration image frame, by frame number being counted the continuation time of measuring described frozen state.

30. image coalignment according to claim 29 is characterized in that,

Described synchronously/location matches portion (43), for described reference images frame and described deterioration image frame to several frames before and after picture frame and its, derive whole frames characteristic quantity separately, change in time and the characteristic quantity of described deterioration image frame when not changing in time at characteristic quantity, be judged to be described deterioration image frame and be absorbed in frozen state for described reference images frame.

31. image coalignment according to claim 28 is characterized in that,

Described synchronously/location matches portion (43), be absorbed in the occasion of frozen state at described deterioration image frame, perhaps can not with the occasion of described reference images frame synchronization, carry out described grand Synchronous Processing once more.

32. image coalignment according to claim 28 is characterized in that,

Described synchronously/location matches portion (43), be absorbed in the occasion of the state that frame departs from described deterioration image frame, output frame depart from number.

33. image coalignment according to claim 21 is characterized in that,

And then have from described synchronously/location matches portion (43) input described reference images signal (RI) and described deterioration signal of video signal (PI), the brightness of described deterioration image and colouring information and described reference images are coincide, the deterioration image that described brightness and colouring information have been coincide turn back to described synchronous/the brightness/color correction portion (44) of location matches portion (43).

34. an image matching method is characterized in that,

The document form of the deterioration image that in described deterioration signal of video signal, comprises, be transformed to the document form (step S1) of the reference images that in the described reference images signal of correspondence, comprises;

The number of the deterioration image frame that comprises in reference images frame that coupling comprises in described reference images signal and the described deterioration signal of video signal and Displaying timer (step S2); With

For described reference images frame and described deterioration image frame to several frames before and after picture frame and its, departing from or the frozen state of described deterioration image of the frame of supervision between described reference images frame and described deterioration image frame, and obtain the coupling (step S3-S22) of the corresponding relation and the corresponding relation between pixel of interframe.

35. a video matching program is characterized in that,

Make computer as following cell operation:

Input is transformed to the variation unit (41) of the document form of the reference images that comprises in the described reference images signal of correspondence as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) after this reference images signal degradation, the document form of the deterioration image that comprises in described deterioration signal of video signal;

The number of reference images frame that coupling comprises in described reference images signal and the deterioration image frame that in described deterioration signal of video signal, comprises and the Displaying timer matching unit (42) of Displaying timer; With

For described reference images frame and described deterioration image frame to several frames before and after picture frame and its, departing from or the frozen state of described deterioration image of the frame of supervision between described reference images frame and described deterioration image frame, obtain simultaneously the corresponding relation of interframe and the corresponding relation between pixel coupling synchronously/location matches unit (43).

36. a video quality evaluation device is characterized in that,

Have:

Input is transformed to the formal argument portion (41) of the document form of the reference images that comprises in the described reference images signal of correspondence as the reference images signal (RI) of the signal of video signal that does not worsen and the deterioration signal of video signal (PI) after this reference images signal degradation, the document form of the deterioration image that comprises in described deterioration signal of video signal;

The number of reference images frame that coupling comprises in described reference images signal and the deterioration image frame that in described deterioration signal of video signal, comprises and the Displaying timer matching part (42) of Displaying timer;

For described reference images frame and described deterioration image frame to several frames before and after picture frame and its, departing from or the frozen state of described deterioration image of the frame of supervision between described reference images frame and described deterioration image frame, obtain simultaneously the corresponding relation of interframe and the corresponding relation between pixel coupling synchronously/location matches portion (43); With

Input by described synchronously/location matches portion (43) obtained coupling described reference images signal (RI) and described deterioration signal of video signal (PI), calculate the signal of video signal characteristic quantity of two signals, the subjective quality of inferring the subjective quality of described deterioration signal of video signal according to the difference of the signal of video signal characteristic quantity of two signals of calculating infers portion (11).