HK1229101B

HK1229101B - Method for preparing 3d image

Info

Publication number: HK1229101B
Application number: HK17102379.8A
Authority: HK
Inventors: 科林．R．杜特; 帕诺斯．纳西奥普洛斯
Original assignee: 杜比实验室特许公司
Priority date: 2011-05-14
Filing date: 2017-03-07
Publication date: 2019-09-20

Description

Method for preparing three-dimensional images

本申请是申请日为2012年5月11日、申请号为“201210147475.4”、发明名称为“用于准备三维视频图像的方法和设备”的发明专利申请的分案申请。This application is a divisional application of the invention patent application with the application date of May 11, 2012, application number "201210147475.4", and invention name "Method and device for preparing three-dimensional video images".

技术领域Technical Field

本发明涉及3D显示。本发明尤其涉及用于消除3D显示中的串扰的机制。The present invention relates to 3D displays and, in particular, to a mechanism for eliminating crosstalk in 3D displays.

背景技术Background Art

存在多种可用来向观看者显示3D图像的技术。这些技术向观看者的左眼和右眼呈现单独的图像。这些图像可以呈现相同场景或对象的不同视角。观看者的脑合成并解释左眼图像和右眼图像，以感知具有深度感的单个3D图像。There are various technologies available for displaying 3D images to a viewer. These technologies present separate images to the viewer's left and right eyes. These images can show different perspectives of the same scene or object. The viewer's brain combines and interprets the left and right eye images to perceive a single 3D image with a sense of depth.

在3D显示中使用的一些非限制性的示例技术包括：Some non-limiting example technologies used in 3D displays include:

-对左眼图像和右眼图像进行不同的偏振。观看者可以佩戴偏振眼镜，以阻挡右眼观看左眼图像，并阻挡左眼观看右眼图像；Polarize the left-eye image and the right-eye image differently. The viewer can wear polarized glasses to block the right eye from viewing the left-eye image, and block the left eye from viewing the right-eye image;

-以交替的序列显示左眼图像和右眼图像。观看者可以通过包含受控光闸的眼镜观看图像，该受控光闸随着图像的显示及时地打开和关闭；- Displaying left-eye and right-eye images in an alternating sequence. The viewer can view the images through glasses containing controlled light shutters that open and close in time with the display of the images;

-提供具有不同光谱特性的左眼图像和右眼图像的技术。观看者可以通过包含光谱过滤器的眼镜观看图像，该光谱过滤器使一个图像通过，但是阻挡另一个图像；A technique for providing left-eye and right-eye images with different spectral characteristics. The viewer can view the images through glasses containing a spectral filter that passes one image but blocks the other.

-将不同的图像导向空间中的不同位置(使得观看者的眼镜看到不同的图像)的多视点(multiview)技术。- Multiview technology that directs different images to different locations in space (so that the viewer's eyes see different images).

3D成像显示的一个问题是在观看者的左眼和右眼观看的图像之间可能存在串扰(crosstalk)。可以将这些图像称为“视图”。“串扰”是希望仅由观看者的一只眼睛观看的光在某种程度上对观看者的另一只眼睛也可见的情况。串扰可能由于很多种原因中的任意一种而发生。例如，佩戴在观看者的眼睛中的每个眼睛上的光闸、偏振器或滤光器不能完全阻止来自希望仅由一只眼睛观看的图像的光到达另一只眼睛，反之亦然。One problem with 3D imaging displays is the potential for crosstalk between the images viewed by a viewer's left and right eyes. These images can be referred to as "views." Crosstalk occurs when light intended for viewing by only one eye is also visible to some extent to the viewer's other eye. Crosstalk can occur for any of a variety of reasons. For example, a shutter, polarizer, or filter worn on each of the viewer's eyes may not completely prevent light from an image intended for viewing by only one eye from reaching the other eye, or vice versa.

串扰可能使观看者在3D图像中感知到“重影(ghosting)”、看到双重图像的效果，尤其是在图像中的高对比度边缘处。串扰可能大大降低感知到的3D图像质量，并且可能妨碍观看者融合图像和感知深度。Crosstalk can cause viewers to perceive "ghosting" in 3D images, the effect of seeing double images, especially at high-contrast edges in the image. Crosstalk can significantly degrade the perceived 3D image quality and can prevent viewers from blending the images and perceiving depth.

串扰可能在左眼图像中的像素非常亮而相应的右眼图像中的相应像素非常暗的情况、或在右眼图像中的像素非常亮而相应的左眼图像中的相应像素非常暗的情况下尤其明显。Crosstalk may be particularly noticeable where a pixel in the left-eye image is very bright and a corresponding pixel in the corresponding right-eye image is very dark, or where a pixel in the right-eye image is very bright and a corresponding pixel in the corresponding left-eye image is very dark.

存在对于用来消除3D显示中的串扰的有效并且实用的方式的需要。There is a need for an efficient and practical way to eliminate crosstalk in 3D displays.

发明内容Summary of the Invention

提供本概要以介绍下面在说明书中进一步描述的简要形式的本发明的代表性概念和各方面的选择。本概要不旨在标识要求保护的主题的关键特征或者必要特征，也不旨在以任何可能限制要求保护的主题的范围的方式使用。This summary is provided to introduce a selection of representative concepts and aspects of the invention in a simplified form that are further described below in the specification. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used in any way that might limit the scope of the claimed subject matter.

本发明提供与对3D图像进行处理和显示相关的方法和设备。一个方面涉及3D图像中的串扰的消除。可以以方法和设备实施本发明。该设备可以包括独立设备，或者可以与显示器或用于向显示器传送3D图像数据的通道中的其它设备集成。The present invention provides methods and apparatus for processing and displaying 3D images. One aspect relates to eliminating crosstalk in 3D images. The present invention can be implemented as a method and apparatus. The apparatus can comprise a standalone device or be integrated with a display or other device in a channel used to transmit 3D image data to the display.

本发明的一个示例方面提供一种用于准备3D图像的方法，该3D图像包括用于显示的左眼视图和右眼视图。该示例方法包括：基于图像数据确定为允许完全减法串扰消除而增大像素值的量；确定该量的最大值；以及通过以基于最大值而在量上进行的加法或按比例确定(scaling)中的一个或两个，全局地增大像素值。3D图像可以包括视频序列中的视频帧。在一些实施例中，该方法包括：对该最大值或者根据该最大值得出或与该最大值相关的量应用时间低通滤波器。One exemplary aspect of the present invention provides a method for preparing a 3D image comprising a left-eye view and a right-eye view for display. The exemplary method comprises: determining, based on image data, an amount by which pixel values are to be increased to allow for full subtractive crosstalk cancellation; determining a maximum value for the amount; and globally increasing the pixel values by one or both of addition and scaling based on the maximum value. The 3D image may comprise a video frame from a video sequence. In some embodiments, the method comprises: applying a temporal low-pass filter to the maximum value or to an amount derived from or related to the maximum value.

另一示例方面提供一种用于准备3D视频图像的方法，该3D视频图像包括用于显示的一系列帧中的每个帧的左眼视图和右眼视图。该方法包括：识别视图中的像素值过小而不允许完全减法串扰消除的局部区域；确定加到局部区域的亮度补丁的强度；对强度进行时间低通滤波；以及根据经时间滤波的强度生成亮度补丁。Another example aspect provides a method for preparing a 3D video image comprising a left-eye view and a right-eye view for each frame in a series of frames for display. The method includes: identifying a local region in the view where pixel values are too small to allow full subtractive crosstalk cancellation; determining an intensity of a luma patch to be applied to the local region; temporally low-pass filtering the intensity; and generating a luma patch based on the temporally filtered intensity.

在一些实施例中，该方法包括：对于亮度补丁中的每个，生成用于另一视图的相应亮度补丁。生成相应亮度补丁可以包括：定位亮度补丁的中心；进行块匹配以估计视图之间与局部区域的中心相对应的位置处的不一致；以及将在x方向上偏移了所估计的不一致的亮度补丁的副本加到另一视图上。In some embodiments, the method includes: for each of the luma patches, generating a corresponding luma patch for the other view. Generating the corresponding luma patch may include: locating a center of the luma patch; performing block matching to estimate an inconsistency between the views at a location corresponding to the center of the local region; and adding a copy of the luma patch to the other view that is offset in the x-direction by the estimated inconsistency.

另一方面提供一种用于准备3D视频图像的示例方法，该3D视频图像包括用于显示的一系列帧中的每个帧的左眼视图和右眼视图。该方法包括：识别视图中的像素值过小而不允许完全减法串扰消除的局部区域；确定加到局部区域的亮度补丁的强度；链接跨视频图像的帧的相应局部区域以提供一个或更多个系列的链接局部区域；识别这些系列的相应链接局部区域中的一个系列的第一链接局部区域；以及将与第一链接局部区域相对应的淡入亮度补丁加到与第一链接局部区域相对应的帧之前的一系列帧。Another aspect provides an example method for preparing a 3D video image that includes a left-eye view and a right-eye view for each frame in a series of frames for display. The method includes: identifying a local region in the view where pixel values are too small to allow full subtractive crosstalk cancellation; determining the intensity of a luminance patch applied to the local region; linking corresponding local regions across frames of the video image to provide one or more series of linked local regions; identifying a first linked local region in one of the series of corresponding linked local regions; and applying a faded-in luminance patch corresponding to the first linked local region to a series of frames preceding the frame corresponding to the first linked local region.

本发明的其它方面提供如这里所描述的设备。该设备可以被配置为和 /或能够进行操作以执行根据本发明的方法。Another aspect of the present invention provides a device as described herein. The device can be configured and/or operable to perform a method according to the present invention.

本发明的其它方面提供计算机程序产品，该计算机程序产品包括承载计算机可读指令的非易失性介质，当由一个或更多个数据处理器执行该计算机可读指令时，使数据处理器执行根据本发明的方法。Other aspects of the invention provide a computer program product comprising a non-volatile medium carrying computer-readable instructions which, when executed by one or more data processors, cause the data processors to perform a method according to the invention.

根据一个实施例，提供一种用于准备三维图像的方法，三维图像包括用于显示的左眼视图和右眼视图，该方法包括：基于左眼视图和右眼视图的图像数据识别左眼视图和右眼视图中的每个视图中的像素值不足以允许完全减法串扰消除的像素的集合；选择性地针对集合中的像素确定为允许完全减法串扰消除而增大像素的像素值的量；基于针对集合中的像素所确定的量来建立增大像素值的一个或更多个量；通过在所建立的一个或更多个量上进行的加法和按比例确定中的一个或两个，增大集合中的像素中的至少一些像素的像素值。According to one embodiment, a method for preparing a three-dimensional image is provided, the three-dimensional image including a left-eye view and a right-eye view for display, the method comprising: identifying, based on image data of the left-eye view and the right-eye view, a set of pixels in each of the left-eye view and the right-eye view whose pixel values are insufficient to allow complete subtractive crosstalk cancellation; selectively determining, for the pixels in the set, an amount by which the pixel values of the pixels are to be increased to allow complete subtractive crosstalk cancellation; establishing one or more amounts by which the pixel values are to be increased based on the amounts determined for the pixels in the set; and increasing the pixel values of at least some of the pixels in the set by one or both of addition and scaling performed on the established one or more amounts.

根据另一个实施例，提供一种用于准备三维图像的方法，三维图像包括用于显示的左眼视图和右眼视图，该方法包括：预处理左眼视图和右眼视图的图像数据以确定左眼视图和右眼视图中的区域，对于该区域，左眼视图和右眼视图的像素值之间的对比度超过阈值；生成标识所确定的区域的元数据；与图像数据相关联地存储元数据。According to another embodiment, a method for preparing a three-dimensional image is provided, the three-dimensional image including a left-eye view and a right-eye view for display, the method comprising: preprocessing image data of the left-eye view and the right-eye view to determine regions in the left-eye view and the right-eye view for which a contrast between pixel values of the left-eye view and the right-eye view exceeds a threshold; generating metadata identifying the determined regions; and storing the metadata in association with the image data.

除了上面描述的示例性方面和实施例之外，通过参考附图并且通过研究下面的详细描述，其它方面和实施例将变得明显。In addition to the exemplary aspects and embodiments described above, further aspects and embodiments will become apparent by reference to the drawings and by study of the following detailed description.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

附图示出了非限制性示例实施例。The accompanying drawings illustrate non-limiting example embodiments.

图1是根据本发明的一个示例实施例的方法的流程图。图1A-1B分别为图1所示的块100的两个可选具体实施方式的示例。Figure 1 is a flow chart of a method according to an exemplary embodiment of the present invention. Figures 1A-1B are examples of two alternative specific implementations of block 100 shown in Figure 1 .

图2是示出根据本发明的示例实施例的设备的框图。FIG2 is a block diagram illustrating an apparatus according to an example embodiment of the present invention.

图3是根据本发明的另一示例实施例的方法的流程图。FIG3 is a flowchart of a method according to another example embodiment of the present invention.

图4是示出亮度补丁(luminance patch)的平滑空间变化的曲线图。FIG4 is a graph showing smooth spatial variations of luminance patches.

图5A、5B和5C示出与被添加以增加一个视图中的底空间(foot room)的亮度补丁相对应的另一视图中的亮度补丁的添加。5A , 5B, and 5C illustrate the addition of luma patches in one view corresponding to luma patches added to increase foot room in another view.

图6是示出用于对图像数据进行处理以添加淡入(fade in)和淡出 (fade out)的亮度补丁以用于减小闪烁的示例方法的流程图。6 is a flow chart illustrating an example method for processing image data to add luminance patches that fade in and out for reducing flicker.

图7A和7B对比单独针对视频的每一帧进行串扰消除以及使用于辅助串扰消除而被添加的亮度随着时间淡入和淡出的处理。7A and 7B compare the process of performing crosstalk cancellation on each frame of video individually and fading the added brightness to assist in crosstalk cancellation in and out over time.

在全部附图和说明书中，使用相似的附图标记来指示相似的元素。Like reference numerals are used throughout the drawings and the description to refer to like elements.

具体实施方式DETAILED DESCRIPTION

在下面的描述中，叙述具体细节以向本领域技术人员提供更透彻的理解。然而，不示出或详细描述公知的元素，以避免不必要地使本公开不明确。相应地，说明书和附图应当被视为说明性的，而不是限制性的。In the following description, specific details are described to provide a more thorough understanding to those skilled in the art. However, well-known elements are not shown or described in detail to avoid unnecessarily obscuring the present disclosure. Accordingly, the specification and drawings should be regarded as illustrative rather than restrictive.

例如，假设左眼图像对于某颜色通道具有由V_L(x,y)给出的像素值，而相应的右眼图像对于该颜色通道的具有由V_R(x,y)给出像素值。在不对串扰进行补偿的情况下，观看者将在观看者的左眼中看到在该颜色通道中具有如下像素值的图像：V_L(x,y)+A_RL×V_R(x,y)，而观看者将在观看者的右眼中看到在该颜色通道中具有如下像素值的图像：V_R(x,y)+A_LR×V_L(x,y)。这里，A_RL是来自右眼图像的光的到达观看者左眼的部分，A_LR是来自左眼图像的光的到达观看者右眼的部分。在许多情况下，A_LR＝A_RL。For example, suppose the left-eye image has pixel values for a certain color channel given by V _L (x, y), and the corresponding right-eye image has pixel values for that color channel given by _VR (x, y). Without compensating for crosstalk, the viewer will see an image with the following pixel values for that color channel in their left eye: V _L (x, y) + A _RL × _VR (x, y), while the viewer will see an image with the following pixel values for that color channel in their right eye: _VR (x, y) + A _LR × V _L (x, y). Here, A _RL is the portion of light from the right-eye image that reaches the viewer's left eye, and A _LR is the portion of light from the left-eye image that reaches the viewer's right eye. In many cases, A _LR = A _RL .

可以通过对图像数据进行处理以减去估计的串扰，来消除串扰。例如，可以基于来自另一视图中的相应像素的预期串扰的量，来减小每个像素的强度。因此，在显示图像时，串扰应当被消除掉，观看者应当像没有串扰一样地感知左图像和右图像。然而，由于负光强度是不可能的，因此完全减法串扰消除需要每个图像中的像素的值大于预期串扰。尤其在一个视图中的像素暗而另一个视图中的相应像素非常亮的情况下，不会如此。Crosstalk can be eliminated by processing the image data to subtract the estimated crosstalk. For example, the intensity of each pixel can be reduced based on the amount of expected crosstalk from the corresponding pixel in the other view. Therefore, when the images are displayed, the crosstalk should be eliminated, and the viewer should perceive the left and right images as if there was no crosstalk. However, because negative light intensities are impossible, complete subtractive crosstalk cancellation requires that the value of the pixel in each image is greater than the expected crosstalk. This is especially not the case if the pixel in one view is dark and the corresponding pixel in the other view is very bright.

考虑下面的示例，其中，图像数据针对每个像素包括8位伽马编码的红、绿和蓝值。可以如下地将这些值转换为线性值：Consider the following example, where the image data includes 8-bit gamma-encoded red, green, and blue values for each pixel. These values can be converted to linear values as follows:

其中，I₈是针对一个颜色通道(红、绿或蓝)的8位伽马编码输入，γ是显示器伽马值(通常为2.2)，而I_lin是按比例确定到0到1的范围的该颜色通道的线性值。可以将各自包括来自另一个视图的串扰的到达观看者的眼睛的视图建模为：Where I ₈ is the 8-bit gamma-encoded input for one color channel (red, green, or blue), γ is the display gamma value (typically 2.2), and I _lin is the linear value for that color channel scaled to the range of 0 to 1. The views reaching the viewer's eye, each including crosstalk from the other view, can be modeled as:

I_L,eye＝I_L,lin+cI_R,lin (2)I _L,eye =I _L,lin +cI _R,lin (2)

以及as well as

I_R,eye＝I_R,lin+cI_L,lin (3)I _R,eye =I _R,lin +cI _L,lin (3)

其中，I_L,lin和I_R,lin分别是输入的左视图和右视图的线性值。I_L,eye和I_R,eye分别是曝露给观看者的左眼和右眼的线性值，而c是串扰的量。where IL _,lin and IR _,lin are the linear values of the input left and right views, _IL,eye and _IR,eye are the linear values exposed to the viewer's left and right eyes, and c is the amount of crosstalk.

减法串扰消除通过以下方式起作用：修正所显示的左视图和右视图的图像数据，使得在加入了串扰之后，到达观看者的信号将是希望的图像(没有串扰)。这例如可以通过这样的方式进行——对图像数据进行处理，使得对显示器的输入的值由下式给出：Subtractive crosstalk cancellation works by modifying the displayed left and right view image data so that after adding the crosstalk, the signal reaching the viewer is the desired image (without crosstalk). This can be done, for example, by processing the image data so that the value of the input to the display is given by the following equation:

以及as well as

其中，下标proc指示经处理的输入值。通过将式(4)和(5)的经处理的值代替式(2)和(3)中的I_L,lin和I_R,lin，可以验证输出I_L,eye和I_R,eye将是没有串扰的原始图像。注意，应用式(4)和(5)主要涉及在对所有项应用小增益因子 1/(1-c²)的情况下，从左图像中减去右图像，以及从右图像中减去左图像。除非满足下面两个条件，否则式(4)或(5)将产生负结果：where the subscript proc indicates the processed input value. By substituting the processed values of equations (4) and (5) for IL _,lin and IR _,lin in equations (2) and (3), it can be verified that the outputs IL _,eye and IR _,eye will be the original images without crosstalk. Note that applying equations (4) and (5) mainly involves subtracting the right image from the left image, and subtracting the left image from the right image, while applying a small gain factor of 1/(1-c ² ) to all terms. Equations (4) or (5) will produce negative results unless the following two conditions are met:

I_L,lin≥cI_R,lin (6)I _L,lin ≥cI _R,lin (6)

以及as well as

I_R,lin≥cI_L,lin (7)I _R,lin ≥cI _L,lin (7)

在一些实施例中，以足以允许串扰的减法消除的量对每个视图添加光强度。消除可能是完全的或部分的。添加的光强度的量可以基于图像数据，使得不添加比允许希望程度的串扰消除所需的光强度更大的光强度。In some embodiments, light intensity is added to each view in an amount sufficient to allow subtractive cancellation of crosstalk. The cancellation may be complete or partial. The amount of light intensity added may be based on the image data so that no more light intensity is added than is required to allow the desired degree of crosstalk cancellation.

在一些实施例中，对图像数据进行处理包括对图像数据进行压缩和/ 或向更高的强度偏移图像数据，使得黑水平(black level)升高。可以基于图像数据来计算黑水平升高的量。In some embodiments, processing the image data includes compressing the image data and/or shifting the image data toward higher intensities such that the black level is raised. The amount by which the black level is raised can be calculated based on the image data.

在示例实施例中，处理包括应用对颜色通道(例如红、绿和蓝通道) 进行线性按比例确定的函数将其范围压缩到[b,255]。该函数例如可以由下式给出：In an exemplary embodiment, processing includes applying a function that linearly scales the color channels (e.g., red, green, and blue channels) to compress their range to [b, 255]. This function may be given, for example, by the following equation:

I'＝b+(1-b/255)I (8)I'＝b+(1-b/255)I (8)

其中，I表示来自颜色通道I∈(R,G,B)中的任意一个的值。可以对每个颜色通道应用相同的按比例确定，以避免色调偏移(hue shift)。较大的b 值将更大地压缩图像的动态范围，这降低图像的对比度。因此，可能希望保持b的值较小而同时仍允许希望的程度的串扰消除。Where I represents a value from any one of the color channels I∈(R,G,B). The same scaling can be applied to each color channel to avoid hue shift. Larger values of b will further compress the dynamic range of the image, which reduces image contrast. Therefore, it may be desirable to keep the value of b small while still allowing a desired degree of crosstalk cancellation.

可以示出在b满足下式的情况下可以进行完全串扰消除：It can be shown that complete crosstalk cancellation is possible if b satisfies the following equation:

对于8位值的情况，其中下标S指示信号图像(即，当前所考虑的视图)，而下标C指示串扰图像(即，作为当前所考虑的视图中的串扰的源的视图)。可以应用式(9)以对每个视图中的每个像素的每个颜色通道(例如R、 G和B)的b的值提供约束。For the case of 8-bit values, where the subscript S indicates the signal image (i.e., the currently considered view) and the subscript C indicates the crosstalk image (i.e., the view that is the source of crosstalk in the currently considered view), Equation (9) can be applied to provide constraints on the value of b for each color channel (e.g., R, G, and B) of each pixel in each view.

在示例实施例中，对每个视图中的每个像素的每个颜色通道应用式 (9)。确定b的最大值，并且在式(8)中使用该值以对两个视图中的所有像素对所有颜色通道应用相同的图像数据压缩。这确保对于经处理的图像数据，式(6)和(7)的条件对所有像素适用。In an exemplary embodiment, equation (9) is applied to each color channel of each pixel in each view. A maximum value of b is determined and used in equation (8) to apply the same image data compression to all color channels for all pixels in both views. This ensures that the conditions of equations (6) and (7) hold for all pixels in the processed image data.

在一些实施例中，b的值被选择为稍小于由式(9)确定的值。虽然这意味着不能进行完全减法串扰消除，但是小量的未消除的串扰不会明显地影像图像质量，而b的减小有利地减小了压缩量。In some embodiments, the value of b is chosen to be slightly smaller than the value determined by equation (9). Although this means that full subtractive crosstalk cancellation cannot be performed, a small amount of uncancelled crosstalk does not significantly affect image quality, and reducing b advantageously reduces the amount of compression.

可以对处理进行调整，以避免针对每个像素的每个颜色通道进行b 的详细计算。例如，不需要针对满足下式的像素值计算b：The processing can be adjusted to avoid the detailed calculation of b for each color channel of each pixel. For example, b does not need to be calculated for pixel values that satisfy the following equation:

I_S≥c^1/γI_C (10)I _S ≥c ^1/γ I _C (10)

这是因为对于这些像素值，总是可以进行对任何串扰的完全补偿。在许多典型图像中，90％或更多的像素值满足式(10)。This is because for these pixel values, full compensation for any crosstalk can always be performed.In many typical images, 90% or more of the pixel values satisfy equation (10).

在图像包括视频帧的情况下，可能希望提供时间滤波以避免由于压缩的突变而分散观看者的注意力。这在如上所述的由诸如b的单个参数定义压缩的情况下可以较直接。在这种情况下，可以对b进行低通滤波，使得 b不在帧与帧之间发生突变。In the case where the images comprise video frames, it may be desirable to provide temporal filtering to avoid distracting the viewer due to sudden changes in compression. This can be relatively straightforward if the compression is defined by a single parameter such as b, as described above. In this case, b can be low-pass filtered so that b does not change suddenly between frames.

在一些实施例中，该滤波在时间上是双向的，使得在b在否则将发生突然跳变的帧的之前几个帧开始增大。In some embodiments, the filtering is bidirectional in time, so that b begins to increase several frames before the frame where the abrupt jump would otherwise occur.

在一些可选实施例中，可以针对不同的颜色通道单独进行使用相应b 值的压缩。然而，这可能使颜色失真，因此通常在关注颜色逼真度的情况下是不希望的。In some alternative embodiments, compression using the corresponding b-values can be performed separately for different color channels. However, this may distort the colors and is generally undesirable when color fidelity is a concern.

如上所述的处理不限于将像素值表示为8位数值的情况。可以以任意合适的方式表示像素值，例如，表示为10位、11位或12位数值，任意合适的范围内，例如0到1等的数值。此外，可以以除RGB之外的颜色模型或其它相加颜色模型(additive color model)进行处理。The processing described above is not limited to representing pixel values as 8-bit values. Pixel values may be represented in any suitable manner, for example, as 10-bit, 11-bit, or 12-bit values, within any suitable range, such as 0 to 1. Furthermore, processing may be performed using color models other than RGB or other additive color models.

在一些实施例中，使用提供与表示色调(或色度)的像素值分开的表示光强度的像素值的颜色模型进行处理。这种颜色模型的一个示例是 YCbCr颜色模型。存在许多其它颜色模型，诸如YUV、xvYCC、YPbPr、 YIQ等。In some embodiments, processing is performed using a color model that provides pixel values representing light intensity separately from pixel values representing hue (or chroma). An example of such a color model is the YCbCr color model. Many other color models exist, such as YUV, xvYCC, YPbPr, YIQ, and the like.

可以在不改变定义色调(色度)的像素值的情况下，对YCbCr颜色模型的亮度(luma)分量(或者其它类似的颜色模型上的相应分量)应用按比例确定。因此，对由YCbCr颜色模型(或类似的颜色模型)表示像素值的图像数据进行处理，可以提供更好地保持颜色的优点。Scaling can be applied to the luminance (luma) component of the YCbCr color model (or the corresponding component on other similar color models) without changing the pixel values that define hue (chrominance). Therefore, processing image data whose pixel values are represented by the YCbCr color model (or similar color model) can provide the advantage of better color preservation.

在HDTV普遍使用的ITU-R BT.709标准中，亮度(Y)具有16到 235的范围。可以通过提高最小水平而将最大值保持在235来压缩亮度范围。这例如可以通过如下按比例确定亮度来实现：In the ITU-R BT.709 standard commonly used for HDTV, luminance (Y) has a range of 16 to 235. The luminance range can be compressed by increasing the minimum level while keeping the maximum value at 235. This can be achieved, for example, by scaling the luminance as follows:

Y'＝b+(1-b/235)Y (11)Y'＝b+(1-b/235)Y (11)

其中，Y’是压缩后的亮度，b是压缩参数。如上，在仍然允许消除串扰的同时，选择尽可能小的b值是有利的。保证可以完全补偿串扰的一组条件是：Where Y' is the compressed brightness and b is the compression parameter. As above, it is advantageous to choose the smallest possible b value while still allowing crosstalk to be eliminated. A set of conditions that guarantee complete crosstalk compensation is:

R′_S≥c^1/γR′_C R′ _S ≥c ^1/γ R′ _C

G′_S≥c^1/γG′_C G′ _S ≥c ^1/γ G′ _C

B′_S≥c^1/γB′_C (12)B′ _S ≥c ^1/γ B′ _C (12)

在由下式给出RGB颜色空间和YCbCr颜色空间之间的转换的情况下：In the case of conversion between RGB color space and YCbCr color space given by:

R＝1.164(Y-16)+1.793(Cr-128)R＝1.164(Y-16)+1.793(Cr-128)

G＝1.164(Y-16)-0.213(Cb-128)-0.534(Cr-128)G＝1.164(Y-16)-0.213(Cb-128)-0.534(Cr-128)

B＝1.164(Y-16)+2.112(Cb-128) (13)B＝1.164(Y-16)+2.112(Cb-128) (13)

通过根据式(11)如下按比例确定亮度，并且应用条件(12)，可以针对将导致在所有颜色通道中满足条件(12)的b的最小值进行求解。在用(13) 表示变换的情况下，可以示出其针对红色通道产生下式：By scaling the brightness as follows according to equation (11), and applying condition (12), we can solve for the minimum value of b that will result in condition (12) being satisfied in all color channels. When the transformation is expressed as (13), it can be shown that it produces the following equation for the red channel:

类似地，由下式给出针对绿色通道的条件：Similarly, the condition for the green channel is given by:

由下式给出针对蓝色通道的条件：The condition for the blue channel is given by:

可以针对两个视图的所有像素进行如式(14)至(16)所示的计算，以确定对于进行希望程度的串扰减小足够大的b值。一旦确定了b值，则例如可以根据式(11)对两个视图的亮度通道进行按比例确定，根据式(13)转换到RGB空间，然后根据式(4)和(5)对串扰进行补偿。The calculations shown in equations (14) to (16) can be performed for all pixels of both views to determine a b value that is large enough to achieve the desired degree of crosstalk reduction. Once the b value is determined, the luminance channels of the two views can be scaled, for example, according to equation (11), converted to RGB space according to equation (13), and then crosstalk compensation can be performed according to equations (4) and (5).

在一些实施例中，可以在源处或者在显示器上游的分配路径中的另一点处对视频进行分析。分析可以包括将均与同一帧相对应的两个视图中的像素值进行比较，以识别对于该帧最具限制性的像素对(例如在视图之间展现最大对比度的像素对)。该分析不需要知道通常取决于显示图像的显示器的串扰c。与可以图像一起分配分析的结果(例如作为与图像相关联的元数据)。可以在使用减法串扰消除确定用于显示的适当按比例确定值(scaling value)时提取和使用该元数据。这种方法降低了显示器处的计算复杂度，并且可以允许在显示器处使用比其它方式所需要的硬件更简单的硬件进行图像处理和串扰消除。In some embodiments, the video can be analyzed at the source or at another point in the distribution path upstream of the display. The analysis can include comparing pixel values in two views that both correspond to the same frame to identify the most restrictive pixel pairs for the frame (e.g., the pixel pairs that exhibit the greatest contrast between the views). The analysis does not require knowledge of the crosstalk c that typically depends on the display displaying the image. The results of the analysis can be distributed with the image (e.g., as metadata associated with the image). The metadata can be extracted and used when determining the appropriate scaling value for display using subtractive crosstalk cancellation. This approach reduces computational complexity at the display and can allow image processing and crosstalk cancellation to be performed at the display using simpler hardware than would otherwise be required.

在显示器在如这里所述地显示图像之前进行图像处理(例如应用锐化滤波器)的情况下，希望在进行准备用于如这里所描述的减法串扰消除的图像的图像处理之前进行该图像处理。Where a display performs image processing (eg, applying a sharpening filter) prior to displaying an image as described herein, it may be desirable to perform such image processing prior to performing image processing to prepare the image for subtractive crosstalk cancellation as described herein.

图1示出根据本发明的示例实施例的方法100。在块102，方法100 确定左眼视图和右眼视图中的每个中的每个像素的减法串扰消除所需的附加底空间的量。如图1A所示，可以针对多个颜色通道中的每个颜色通道单独确定所需的附加底空间(如块101R、101G和101B(其对左眼视图进行处理)和块102R、102G和102B(其对右眼视图进行处理)所示)。FIG1 illustrates a method 100 according to an example embodiment of the present invention. At block 102, the method 100 determines the amount of additional floor space required for subtractive crosstalk cancellation for each pixel in each of the left-eye view and the right-eye view. As shown in FIG1A , the required additional floor space may be determined separately for each of a plurality of color channels (as shown in blocks 101R, 101G, and 101B (which process the left-eye view) and blocks 102R, 102G, and 102B (which process the right-eye view)).

在图1B所示的可选实施例中，以具有单独亮度值的颜色模型表示像素值，或者将像素值变换为具有单独亮度值的颜色模型，并且在分别对左眼视图和右眼视图进行处理的块103L和103R中，根据亮度值确定所需的附加底空间。In an optional embodiment shown in Figure 1B, the pixel values are represented in a color model with a separate luminance value, or the pixel values are transformed into a color model with a separate luminance value, and in blocks 103L and 103R that process the left eye view and the right eye view, respectively, the required additional bottom space is determined based on the luminance value.

在块104中，确定添加到每个颜色通道或亮度通道的压缩量(例如b) 或亮度量的值。块104例如可以基于两个视图的颜色通道上的所需的附加底空间的最大或几乎最大的值、两个视图的亮度通道上的所需的附加底空间的最大或几乎最大的值。几乎最大的值例如是第90个百分位值或者第 95个百分位值或者第99个百分位值等。In block 104, a value for the amount of compression (e.g., b) or luma to be added to each color channel or luma channel is determined. Block 104 may be based, for example, on a maximum or nearly maximum value of the additional headroom required for the color channels of both views, or a maximum or nearly maximum value of the additional headroom required for the luma channels of both views. The nearly maximum value may be, for example, the 90th, 95th, or 99th percentile value.

在块106中，在图像是视频帧的情况下，对来自块104的压缩或亮度的量应用低通时间滤波。In block 106, where the image is a video frame, a low-pass temporal filter is applied to the amount of compression or brightness from block 104.

在块108中，基于来自块106的经时间滤波的量，对图像进行压缩(或者添加亮度)。In block 108 , the image is compressed (or brightness is added) based on the temporally filtered amount from block 106 .

在块110中，进行减法串扰消除。In block 110, subtractive crosstalk cancellation is performed.

在块112中，显示图像。In block 112, the image is displayed.

图2示出根据示例实施例的设备200。设备200包括计算块202，计算块202确定减法串扰消除所需的附加底空间的量。计算块202可以具有对图像的左视图和右视图的不同颜色通道或亮度通道并行操作的多个部分。计算块202可以包括编程数据处理器，例如一个或更多个微处理器或图形处理器或数字信号处理器，诸如FPGA的可编程逻辑电路，硬连线逻辑电路，其组合等。FIG2 illustrates a device 200 according to an example embodiment. Device 200 includes a computation block 202 that determines the amount of additional headroom required for subtractive crosstalk cancellation. Computation block 202 may include multiple components that operate in parallel on different color channels or luminance channels for the left and right views of an image. Computation block 202 may include a programmed data processor, such as one or more microprocessors, graphics processors, or digital signal processors, programmable logic circuits such as FPGAs, hardwired logic circuits, combinations thereof, and the like.

压缩确定块204根据块202的输出确定希望的压缩。时间滤波器206 对块204的输出进行滤波。压缩器208根据时间滤波器206的输出对两个视图的图像数据进行压缩。块210进行减法串扰消除。块212是显示图像的显示器。Compression determination block 204 determines the desired compression based on the output of block 202. Temporal filter 206 filters the output of block 204. Compressor 208 compresses the image data of the two views based on the output of temporal filter 206. Block 210 performs subtractive crosstalk cancellation. Block 212 is a display that displays the image.

示例Example

对三个3D立体图像应用如上所述的方法。假设显示器伽马为2.2并且串扰为7％(c＝0.07)。为了进行比较，通过最坏情况RGB域线性按比例确定(将黑水平提高到255c^1/r＝76)对相同的图像进行处理。The method described above was applied to three 3D stereo images. Assuming a display gamma of 2.2 and a crosstalk of 7% (c=0.07). For comparison, the same images were processed by worst-case RGB domain linear scaling (raising the black level to 255c ^1/r =76).

在表Ｉ中给出了结果的客观比较，表Ｉ示出了每个图像的压缩范围以及使用CIEDE2000色差公式测量的原始图像和压缩图像之间的色度差。 CIEDE2000色差是基于CIELAB颜色空间比较两个颜色的感知度量 (perceptual metric)。为了防止该比较受到由于应用这些方法而导致的亮度添加的影响，在保留与色度和色调相关的项的同时，将CIEDE2000公式中的亮度项(L)设置为零。这产生仅测量颜色改变的度量。An objective comparison of the results is presented in Table 1, which shows the compression range for each image and the chromaticity difference between the original and compressed images, measured using the CIEDE2000 color difference formula. CIEDE2000 color difference is a perceptual metric for comparing two colors based on the CIELAB color space. To prevent the comparison from being affected by the added brightness caused by applying these methods, the lightness term (L) in the CIEDE2000 formula was set to zero, while retaining the terms related to chromaticity and hue. This produces a metric that only measures color change.

对图像进行比较示出最坏情况RGB按比例确定使图像看起来褪色。如上所述的最佳RGB按比例确定产生具有较好对比度的图像，但是与原始图像相比仍然稍微褪色。在YCbCr空间中进行按比例确定产生具有更饱和的颜色的图像，这更准确地反映输入图像的颜色(如由明显较低的 CIEDE2000色度差值所指示的)。Comparing the images shows that the worst-case RGB scaling makes the image appear washed out. The optimal RGB scaling as described above produces an image with better contrast, but is still slightly washed out compared to the original image. Scaling in YCbCr space produces an image with more saturated colors that more accurately reflect the colors of the input image (as indicated by the significantly lower CIEDE2000 chromaticity difference values).

由于串扰不在图像的所有部分中产生问题，因此不强制全局地进行允许进行减法串扰消除的处理。一些实施例识别需要或者希望进行串扰消除的区域，并且对这些局部区域进行处理以确保在这些局部区域内可以进行希望的程度的减法串扰消除。该处理可以通过亮度的相加和/或按比例确定来提高局部区域内的亮度水平。如上所述，可以对亮度通道或者对各个颜色通道进行按比例确定。Because crosstalk does not pose a problem in all parts of the image, it is not mandatory to perform the processing that allows subtractive crosstalk cancellation to be performed globally. Some embodiments identify areas where crosstalk cancellation is needed or desired and perform processing on these local areas to ensure that the desired degree of subtractive crosstalk cancellation can be performed within these local areas. This processing can increase the brightness level in the local area by adding and/or scaling the brightness. As described above, scaling can be performed on the brightness channel or on each color channel.

图3中示出根据示例局部实施例的方法300。在方法300中，在块302 中，对3D图像的一对视图进行处理，以检测减法串扰消除将失败的区域 (局部区域)。可选地，在块304中，根据一些标准对这些区域进行分析，以确定是否需要或希望在该区域中进行串扰消除。例如，在未消除的串扰局限于非常小的区域或者仅在非常短的时间内(例如在视频的情况下，在视频中仅针对一帧或者仅针对几个连续帧)发生的情况下，未消除的串扰不产生明显的图像假象。对于由块304选择的区域，处理在块305、块306 和块307处继续，块305确定要添加到每个区域的底空间的量，块306通过亮度的相加和/或通过按比例确定增加用于减法串扰消除的底空间，块 307向另一视图中的相应区域添加底空间。在一些实施例中，针对每个区域单独确定加上的亮度或按比例确定的量。可以忽略在块304处未选择的区域。在块308处进行减法串扰消除，并且在块310处显示图像。FIG3 illustrates a method 300 according to an example partial embodiment. In method 300, in block 302, a pair of views of a 3D image are processed to detect regions (local regions) where subtractive crosstalk cancellation would fail. Optionally, in block 304, these regions are analyzed based on criteria to determine whether crosstalk cancellation is necessary or desirable in the region. For example, if uncancelled crosstalk is confined to a very small region or occurs only for a very short period of time (e.g., in the case of video, for only one frame or a few consecutive frames in the video), uncancelled crosstalk does not produce noticeable image artifacts. For the regions selected by block 304, processing continues at blocks 305, 306, and 307. Block 305 determines the amount of headroom to be added to each region, block 306 increases the headroom for subtractive crosstalk cancellation by summing luminance and/or scaling, and block 307 adds the headroom to the corresponding region in the other view. In some embodiments, the luminance or scaling amount to be added is determined separately for each region. Areas not selected at block 304 may be ignored. Subtractive crosstalk cancellation is performed at block 308, and at block 310 the image is displayed.

有利的是使增加的亮度平滑地变化。例如，在局部区域周围的边界区域中，添加的亮度可以从零缓慢递增到添加到局部区域的量。在图像具有多个要进行处理以改善减法串扰消除的局部区域的情况下，可以对局部区域中的不同的区域应用不同的加上的亮度和/或按比例确定的量。It is advantageous to vary the added brightness smoothly. For example, in a border region around a local region, the added brightness can be slowly increased from zero to the amount added to the local region. In the case where an image has multiple local regions to be processed to improve subtractive crosstalk cancellation, different added brightness and/or proportional amounts can be applied to different regions within the local region.

这里，i(x,y)表示图像的红、绿和蓝色通道中的任意一个。可以向所有通道添加相同的信号α(x,y)以避免像素颜色改变太多。可以分别针对左眼视图和右眼视图生成单独版本的α(x,y)。为了针对局部区域构造这些信号，可以确定该局部区域中的完全减法串扰消除需要多少底空间，然后生成将充分提高亮度以用于有效的串扰消除的平滑信号。Here, i(x,y) represents any of the red, green, and blue channels of the image. The same signal α(x,y) can be added to all channels to avoid excessive pixel color changes. Separate versions of α(x,y) can be generated for the left-eye and right-eye views. To construct these signals for a local region, it is possible to determine how much floor space is required for full subtractive crosstalk cancellation in that local region, and then generate a smooth signal that sufficiently boosts brightness for effective crosstalk cancellation.

可以基于计算出的视图中每个像素需要提高的量R_K来识别可能希望进行改善串扰消除的处理的局部区域，以提供用于串扰消除的足够的底空间。R_K可以由下式给出：Local areas where improved crosstalk cancellation may be desirable can be identified based on the calculated amount R _K that each pixel in the view needs to be improved to provide sufficient floor space for crosstalk cancellation. R _K can be given by:

R_K(x,y)＝max(0,c×I_C,K(x,y)-I_S,K(x,y)) (18)R _K (x,y)＝max(0,c×I _C,K (x,y)-I _S,K (x,y)) (18)

其中，下标S指示被考虑的“信号”视图，下标C指示作为在显示图像时对该信号视图添加的串扰的源的“串扰”视图，下标K指示颜色通道 (其例如可以是红、绿和蓝之一)。因为任何负值指示信号视图已经具有用于减法串扰消除的足够的亮度，因此式(18)中结果被“max”函数箝位 (clip)为零。Wherein, the subscript S indicates the "signal" view being considered, the subscript C indicates the "crosstalk" view that is the source of crosstalk added to the signal view when displaying the image, and the subscript K indicates the color channel (which can be, for example, one of red, green, and blue). Because any negative value indicates that the signal view has sufficient brightness for subtractive crosstalk cancellation, the result in equation (18) is clamped to zero by the "max" function.

识别局部区域可以基于经修正的版本的R_K，其中，将小于阈值的值设置为零。阈值例如可以基于最大显示亮度。在范例实施例中，将小于最大显示亮度的1％的值设置为零。Identifying local regions may be based on a modified version of R _K where values less than a threshold are set to zero. The threshold may be based on, for example, maximum display brightness. In an exemplary embodiment, values less than 1% of the maximum display brightness are set to zero.

例如可以通过用小掩模腐蚀(eroding)和膨胀(dilating)R_K来去除在视觉上较不明显的小区域。例如，可以使用8像素的圆形掩模来去除小区域。For example, a small area that is less visually noticeable can be removed by using a small mask to erode and dilate R _K. For example, a circular mask of 8 pixels can be used to remove the small area.

例如可以通过将R_K(x,y)(如通过任何预处理修正后的)划分为多个连接的区域来识别要进行处理以改善串扰消除的局部区域。这例如可以用二值标记算法(binarylabeling algorithm)来进行。在R.M.Haralick和 L.G.Shapiro.Computer and RobotVision,Volume I.Addison-Wesley,1992, pp.40-48中描述了合适的示例二值标记算法，通过引用将其合并于此。For example, local regions to be processed to improve crosstalk cancellation can be identified by dividing R _K (x, y) (as modified by any preprocessing) into a plurality of connected regions. This can be performed, for example, using a binary labeling algorithm. A suitable example binary labeling algorithm is described in R M Haralick and L G Shapiro. Computer and Robot Vision, Volume 1. Addison-Wesley, 1992, pp. 40-48, which is incorporated herein by reference.

对于这些局部区域中的每个，可以添加使亮度平滑改变的补丁，该补丁具有局部区域中的像素的最大值，并且在周围像素的区域内基于其与该局部区域的距离而逐渐减小。作为一个示例，可以如下计算要对一个局部区域添加的亮度补丁：For each of these local areas, a patch that smoothly changes the brightness can be added, which has the maximum value of the pixels in the local area and gradually decreases within the area of surrounding pixels based on their distance from the local area. As an example, the brightness patch to be added to a local area can be calculated as follows:

其中，下标j是所考虑的局部区域的标签，M_j是局部区域j中的R_K的最大值，w是局部区域j外部的过渡区域的宽度，而d是x,y处的像素与局部区域j的距离(对于局部区域j内的像素d＝0)。图4是通过一个局部区域402的中心的通过示例平滑改变的α(x,y)的截面的曲线400。在区域402 内，α(x,y)具有恒定值403。在边界或者“过渡”区域404(在图4中示出了过渡区域404的部分404A和404B)，α(x,y)从值403开始平滑地(并且在该示例中为线性地)递减。Where subscript j is the label of the local region under consideration, _Mj is the maximum value of _Rk in local region j, w is the width of the transition region outside local region j, and d is the distance of the pixel at x,y from local region j (d=0 for pixels within local region j). Figure 4 is a curve 400 of a section of α(x,y) that smoothly varies by example through the center of a local region 402. Within region 402, α(x,y) has a constant value 403. At the boundary or "transition" region 404 (portions 404A and 404B of transition region 404 are shown in Figure 4), α(x,y) decreases smoothly (and in this example, linearly) from value 403.

对于HD分辨率视频，合适的过渡宽度(w)是200个像素。更大的宽度将导致过渡较不明显，并且还使图像的较大部分具有较低的对比度。For HD resolution video, a suitable transition width (w) is 200 pixels. A larger width will result in a less noticeable transition and also cause a larger portion of the image to have lower contrast.

还可以使用其它形式的α(x,y)。例如，不强制α(x,y)随着与局部区域 j的距离线性地减小。代替地，α(x,y)可以根据诸如高斯或者S形(sigmoid) 函数、多项式函数、样条曲线等的可选函数平滑地减小。也不强制α(x,y) 在整个局部区域j具有相同的值403。代替地，α(x,y)可以在局部区域j 内部和缓地达到峰值或者以其它某种平滑的方式改变。Other forms of α(x,y) may also be used. For example, α(x,y) is not required to decrease linearly with distance from local region j. Instead, α(x,y) may decrease smoothly according to an optional function such as a Gaussian or sigmoid function, a polynomial function, a spline curve, or the like. α(x,y) is also not required to have the same value 403 throughout local region j. Instead, α(x,y) may gently peak within local region j or vary in some other smooth manner.

可以基于局部区域内的每个像素需要提高以提供用于串扰消除的足够的底空间的量R_K来确定α(x,y)。可以针对每个颜色通道执行式(18)。α(x, y)可以基于R_K在所有颜色通道上在局部区域上的最大值。可以对局部区域中的所有颜色通道添加相同的信号α(x,y)。可以对不同的局部区域添加不同量的亮度。α(x, y) can be determined based on the amount R _K that each pixel in the local region needs to be increased to provide sufficient floor space for crosstalk cancellation. Equation (18) can be performed for each color channel. α(x, y) can be based on the maximum value of R _K over all color channels in the local region. The same signal α(x, y) can be added to all color channels in the local region. Different amounts of brightness can be added to different local regions.

在对一个视图的局部区域添加亮度的情况下，应当对另一视图的相应局部区域添加相同的亮度，以防止网膜竞争(retinal rivalry)。识别另一视图中的相应局部区域可以考虑估计视图之间的不一致。例如，在一个实施例中，识别另一视图中的相应局部区域包括：计算针对一个视图确定的局部区域的质心，然后进行块匹配以估计视图之间在局部区域的质心处的不一致。可以使用相对大的块大小(例如16个像素)来进行块匹配。When adding brightness to a local area of one view, the same brightness should be added to the corresponding local area of the other view to prevent retinal rivalry. Identifying the corresponding local area in the other view may take into account estimating inconsistencies between views. For example, in one embodiment, identifying the corresponding local area in the other view includes calculating the centroid of the local area determined for one view and then performing block matching to estimate inconsistencies between views at the centroid of the local area. A relatively large block size (e.g., 16 pixels) may be used for block matching.

可以通过对另一视图添加在x方向上偏移了所估计的不一致的α(x,y) 的副本，来生成对另一视图中的相应局部区域添加的亮度。图5A和5B 示出了这一点。在图5A中，示出了针对左图像和右图像中的各个局部区域的补丁α_j(x,y)。在图5B中，对每个视图添加与来自另一视图的补丁相对应并且偏移了所估计的不一致的附加补丁。The added brightness for the corresponding local area in the other view can be generated by adding a copy of α(x,y) to the other view that is offset in the x direction by the estimated inconsistency. Figures 5A and 5B illustrate this. In Figure 5A, patches _αj (x,y) for various local areas in the left and right images are shown. In Figure 5B, additional patches corresponding to the patches from the other view and offset by the estimated inconsistency are added to each view.

进行两次这里描述的整个过程，一次将左视图视为信号(S)而将右视图视为串扰(C)的源，一次将右视图视为信号而将左视图视为串扰(C) 的源。为了生成将对每个图像添加的最终亮度，可以取覆盖任意像素(x,y) 的所有各个补丁α(x,y)的最大值。图5C中示出了这样的示例。The entire process described here is performed twice, once with the left view considered as the signal (S) and the right view as the source of crosstalk (C), and once with the right view considered as the signal and the left view as the source of crosstalk (C). To generate the final brightness to be added to each image, the maximum value of all the individual patches α(x,y) covering any pixel (x,y) can be taken. An example of this is shown in Figure 5C.

对输入图像添加平滑的最终亮度信号，然后可以应用减法串扰消除 (例如关于式(4)和(5)所描述的)。如果需要或者希望，则可以将最终图像从线性空间转换到伽马编码空间，或者变换为希望的图像格式和/或颜色模型表示。A smoothed final luminance signal is added to the input image, and then subtractive crosstalk cancellation can be applied (e.g., as described with respect to equations (4) and (5)). If necessary or desired, the final image can be converted from linear space to gamma-coded space, or transformed into a desired image format and/or color model representation.

在图像是视频序列中的帧的情况下，则可以进行时间滤波以减少由于亮度补丁的出现和消失而导致的闪烁。此外，可以可选地去除短持续时间的补丁。在一些实施例中，使亮度补丁逐渐淡入和淡出。例如，在亮度补丁α(x,y)在特定视频帧中第一次出现的情况下，则可以对紧接在特定视频帧之前的一系列视频帧添加α(x,y)的一系列衰减副本。在亮度补丁α(x,y) 在特定视频帧中最后一次出现的情况下，则可以对紧接在特定视频帧之后的一系列视频帧添加α(x,y)的一系列衰减副本。衰减副本可以使补丁α(x, y)线性地向上或者根据某种其它函数递增。In the case where the images are frames in a video sequence, temporal filtering may be performed to reduce flicker due to the appearance and disappearance of luma patches. In addition, patches of short duration may optionally be removed. In some embodiments, the luma patches are gradually faded in and out. For example, where a luma patch α(x,y) appears for the first time in a particular video frame, a series of attenuated copies of α(x,y) may be added to the series of video frames immediately preceding the particular video frame. Where a luma patch α(x,y) appears for the last time in a particular video frame, a series of attenuated copies of α(x,y) may be added to the series of video frames immediately following the particular video frame. The attenuated copies may cause the patch α(x, y) to increase linearly upwards or according to some other function.

图6中示出用于对视频数据进行处理以使亮度补丁淡入或淡出的方法600。在块602中，对在时间上相邻的帧中的区域进行匹配。这例如可以通过以下方式进行：在第一帧处开始前进通过视频中的每个帧，每次将提供亮度补丁的局部区域与下一帧中的相应局部区域进行匹配。匹配可以包括：对于帧N中的每个局部区域，将质心与帧N+1中的局部区域的质心进行比较。如果帧N中的局部区域的中心与帧N+1中的局部区域的中心之间的距离小于阈值量(例如小于某数量的像素)，则将局部区域视为匹配并且“链接”在一起。如果多于一个局部区域满足匹配标准，则可以链接最佳匹配局部区域。最佳匹配可以基于中心距离和/或局部区域的大小和形状。例如，可以链接中心到中心距离最近的匹配局部区域。FIG6 shows a method 600 for processing video data to fade luma patches in or out. In block 602, regions in temporally adjacent frames are matched. This can be done, for example, by starting at the first frame and advancing through each frame in the video, each time matching the local region providing the luma patch with the corresponding local region in the next frame. Matching can include, for each local region in frame N, comparing the centroid with the centroid of the local region in frame N+1. If the distance between the center of the local region in frame N and the center of the local region in frame N+1 is less than a threshold amount (e.g., less than a certain number of pixels), the local regions are considered matched and "linked" together. If more than one local region meets the matching criteria, the best matching local region can be linked. The best match can be based on center-to-center distance and/or the size and shape of the local region. For example, the matching local region with the closest center-to-center distance can be linked.

链接可以通过提供数据结构来实现，该数据结构标识每一帧中的局部区域并且针对每个帧包括指向下一帧和前一帧中的任意匹配局部区域的指针。Linking may be achieved by providing a data structure that identifies the local regions in each frame and includes, for each frame, pointers to any matching local regions in the next and previous frames.

在可选块603中，丢弃具有非常短的持续时间的局部区域。观看者不太可能注意到在短时间内出现的串扰。为了实现这一点，块603可以是由一组链接的局部区域覆盖的帧的数量(不计任何复制的区域)。如果一组在时间上链接的区域的计数小于阈值(例如对应于1/2秒或一秒的帧数量，诸如30fps视频的30个帧)，则可以删除该组中的所有局部区域。In optional block 603, local regions with very short durations are discarded. A viewer is less likely to notice crosstalk that occurs over a short period of time. To achieve this, block 603 may be the number of frames covered by a group of linked local regions (excluding any duplicated regions). If the count of a group of temporally linked regions is less than a threshold (e.g., the number of frames corresponding to 1/2 second or one second, such as 30 frames for a 30 fps video), all local regions in the group may be deleted.

如果在帧N+1中没有找到针对帧N中的局部区域的匹配，则一些实施例对帧N+1中的局部区域进行复制。在方法600中，块604执行该功能。该复制可以用于如下目的中的一个或两个：允许局部区域的亮度补丁淡出，由此防止亮度的在视觉上明显的突然跌落；以及填充在一个或更多个帧中未检测到、但是在稍后的帧中检测到的局部区域中的时间间隙。块605在最后链接的区域之后应用淡出。当将亮度补丁从帧N复制到帧N+1 中时，Mj的值可以小量地减小，使得亮度补丁随着时间淡出。例如，范例实施例将亮度补丁的副本中的Mj减小每一帧中的显示亮度的0.1％。这使得典型的亮度补丁在2-3秒中淡出。If no match is found in frame N+1 for the local region in frame N, some embodiments copy the local region in frame N+1. In method 600, block 604 performs this function. This copying can be used for one or both of the following purposes: allowing the brightness patch of the local region to fade out, thereby preventing visually obvious sudden drops in brightness; and filling in time gaps in local regions that were not detected in one or more frames but were detected in later frames. Block 605 applies a fade after the last linked region. When copying the brightness patch from frame N to frame N+1, the value of Mj can be reduced by a small amount so that the brightness patch fades out over time. For example, an example embodiment reduces Mj in the copy of the brightness patch by 0.1% of the displayed brightness in each frame. This causes a typical brightness patch to fade out in 2-3 seconds.

块608识别在不位于视频的开头的帧中第一次出现的局部区域。为了防止亮度在局部区域首次出现时突然跳变，块609对应用于这些局部区域的亮度补丁应用淡入。实现这一点的一种方式是进行通过帧的经过，检查帧N中没有到帧N-1中的相应局部区域的后向链接的局部区域。如果找到了任何这种局部区域，则可以将亮度补丁的副本添加到之前的帧中，首先添加到帧N-1中，然后到N-2中等等。每次复制亮度补丁时，可以小量减小Mj的值，使得在播放视频时亮度补丁将淡入。Block 608 identifies local areas that appear for the first time in a frame that is not at the beginning of the video. To prevent brightness from jumping suddenly when a local area appears for the first time, block 609 fades in the brightness patches applied to these local areas. A way to achieve this is to go through the frames and check if there are no local areas in frame N that have backward links to the corresponding local areas in frame N-1. If any such local areas are found, a copy of the brightness patch can be added to the previous frame, first added to frame N-1, then to N-2, etc. Each time a brightness patch is copied, the value of Mj can be reduced by a small amount so that the brightness patch will fade in when the video is played.

与在上面讨论的淡出的情况下相同。可以将亮度补丁的每个副本相对于下一帧中的副本降低或衰减诸如显示亮度的0.1％的小的量，以实现例如几秒(例如在一些实施例中为2-3秒)的渐变过渡时间。As in the case of the fade-out discussed above, each copy of the brightness patch can be lowered or attenuated relative to the copy in the next frame by a small amount, such as 0.1% of the displayed brightness, to achieve a gradual transition time of, for example, a few seconds (e.g., 2-3 seconds in some embodiments).

为了防止闪烁，应用时间滤波以确保量Mj在帧之间是一致的，并且随着时间缓慢地改变。例如，可以对添加的亮度补丁应用具有0.5Hz的截止频率的低通滤波器以消除视频中的闪烁(基于人的视觉系统的瞬时频率响应)。在示例实施例中，随着时间对每个区域的Mj的值应用具有0.5 Hz的截止频率的80抽头低通滤波器。在滤波之后，针对每个局部区域使用式(19)中的经时间滤波的M_j的值创建亮度补丁。在图6中，在块610 中进行时间滤波。To prevent flicker, temporal filtering is applied to ensure that the quantity Mj is consistent between frames and changes slowly over time. For example, a low-pass filter with a cutoff frequency of 0.5 Hz can be applied to the added luminance patch to eliminate flicker in the video (based on the transient frequency response of the human visual system). In an example embodiment, an 80-tap low-pass filter with a cutoff frequency of 0.5 Hz is applied to the value of Mj for each region over time. After filtering, a luminance patch is created for each local region using the temporally filtered value of _Mj in equation (19). In FIG6 , temporal filtering is performed in block 610.

如上所述，相应地一旦针对一个视图建立了一个或更多个亮度补丁，则可以创建相应的亮度补丁以用于添加到另一视图。这可以通过针对每个局部区域使用块匹配进行不一致搜索，并且对另一视图应用相应亮度补丁的偏移版本来进行。为了节省计算量，可以使用先前帧中的区域的不一致作为初始估计值，对于大多数帧使用较小不一致搜索(并且例如搜索+/-2 个像素)。可以通过取覆盖像素的所有补丁的最大值来获得对每个视图添加的最终亮度。然后对每个视图添加适当的最终图像。然后可以进行减法串扰消除。在图6中，在块611中进行串扰消除。在块612中显示获得的 3D图像。As described above, once one or more luma patches are created for one view, corresponding luma patches can be created for addition to the other view. This can be done by performing an inconsistency search using block matching for each local region and applying an offset version of the corresponding luma patch to the other view. To save computational effort, the inconsistency of the region in the previous frame can be used as an initial estimate, with a smaller inconsistency search used for most frames (and searching for example +/- 2 pixels). The final luma added to each view can be obtained by taking the maximum value of all patches covering the pixel. The appropriate final image is then added to each view. Subtractive crosstalk cancellation can then be performed. In Figure 6, crosstalk cancellation is performed in block 611. The resulting 3D image is displayed in block 612.

图7A示出了在使用如这里所描述的方法时在10秒的视频过程中添加的补丁的强度。为了进行比较，图7B示出了单独针对每个帧计算的补丁的强度。如图7B所示，单独针对每个帧确定的补丁可能帧与帧之间完全不一致。有时未在每一帧中检测到应当应用处理以便于进行串扰消除的局部区域。在一个或两个帧中仅检测到几个局部区域。这在输出视频中产生烦人的非常明显的闪烁。FIG7A shows the intensity of patches added over the course of a 10-second video using the method described herein. For comparison, FIG7B shows the intensity of patches calculated for each frame individually. As shown in FIG7B , the patches determined individually for each frame may be completely inconsistent from frame to frame. Sometimes local regions to which processing should be applied for crosstalk cancellation are not detected in every frame. Only a few local regions are detected in one or two frames. This produces annoying, very noticeable flickering in the output video.

通过对比，如图7A所示，如这里所描述的方法给出了平滑得多的时间过渡，并且降低或避免了闪烁。By contrast, as shown in FIG. 7A , the method as described herein gives a much smoother temporal transition and reduces or avoids flicker.

可以实时地或者在非实时的预处理步骤中进行如这里所描述的方法。例如可以在3D计算机显示器、3D电视和3D影院显示中应用这些方法。The methods described herein can be performed in real time or in a non-real time pre-processing step. These methods can be applied, for example, in 3D computer displays, 3D televisions, and 3D cinema displays.

在接收到信号时实时对信号应用处理的一些实施例中，当检测到不在先前帧中的新的局部区域时，在该帧的起始处开始针对新区域的淡入。由于在淡入时间期间串扰可能产生可见的效果，因此可以使用相对快速的淡入。In some embodiments where processing is applied to the signal in real time as it is received, when a new local region is detected that was not in the previous frame, a fade-in for the new region is started at the beginning of the frame. Since crosstalk may have visible effects during the fade-in time, a relatively fast fade-in may be used.

如这里所描述的方法可以被适配为适合于式(2)和(3)所例示的线性模型之外的串扰模型。为了使用不同的串扰模型，可以根据串扰模型使用给出补偿来自另一视图的串扰所需的光的最小量的函数来计算R_K。串扰模型例如可以提供非线性串扰、在不同的颜色通道中不同量的串扰和/或在空间上改变的串扰。The method described herein can be adapted to be suitable for crosstalk models other than the linear model exemplified by equations (2) and (3). To use a different crosstalk model, R _K can be calculated using a function that gives the minimum amount of light required to compensate for the crosstalk from another view according to the crosstalk model. The crosstalk model can, for example, provide nonlinear crosstalk, different amounts of crosstalk in different color channels, and/or spatially varying crosstalk.

例如可以在显示器或者显示器的驱动器中、在附着的用于驱动3D显示器的计算机中、在视频播放器或者媒体播放器中、在3D数字影院投影仪等中，设置被配置为执行如这里所描述的方法的设备。该设备可以是独立的或者与显示器集成。在一些实施例中，该设备被配置为获得显示器中的表征串扰的信息，并且基于表征显示串扰的信息对图像数据进行处理以用于在显示器上显示。该设备可以被配置为例如从显示器本身或者例如从在线数据库获得表征串扰的信息。For example, a device configured to perform the method described herein can be provided in a display or a driver for the display, in an attached computer for driving a 3D display, in a video player or media player, in a 3D digital cinema projector, or the like. The device can be independent or integrated with the display. In some embodiments, the device is configured to obtain information characterizing crosstalk in the display and process image data for display on the display based on the information characterizing the display crosstalk. The device can be configured to obtain the information characterizing the crosstalk, for example, from the display itself or from an online database.

这里描述的系统和模块可以包括适合于这里描述的目的的软件、固件、硬件或者其任意组合。系统或者模块可以执行一个或更多个功能，一个软件、固件或者硬件可以执行一个或更多个系统或模块的功能。系统和模块可以驻留在诸如服务器、工作站、个人计算机、计算机化的平板电脑、个人数字助理(PDA)的通用计算机以及适合这里描述的目的的其它设备。相关领域技术人员将理解，可以通过其它通信、数据处理或者计算机系统配置来实现系统的各方面，这些通信、数据处理或者计算机系统配置包括：因特网设备、手持设备(包括PDA)、可佩戴计算机、所有方式的蜂窝电话或移动电话、多处理器系统、基于微处理器的或者可编程消费电子设备、机顶盒、网络PC、迷你计算机、大型计算机等。实际上，通常这里可互换地使用术语“计算机”、“服务器”、“主机”、“主机系统”等，这些术语是指上述设备和系统中的任意一个以及任意数据处理器。此外，可以在被特定地编程、配置或构造为执行这里详细说明的计算机可执行指令中的一个或更多个的特殊用途计算机或数据处理器中实施系统的各方面。The systems and modules described herein can include software, firmware, hardware, or any combination thereof suitable for the purposes described herein. A system or module can perform one or more functions, and a software, firmware, or hardware can perform the functions of one or more systems or modules. The systems and modules can reside on general-purpose computers such as servers, workstations, personal computers, computerized tablet computers, personal digital assistants (PDAs), and other devices suitable for the purposes described herein. Those skilled in the relevant art will understand that various aspects of the system can be implemented by other communications, data processing, or computer system configurations, including: Internet appliances, handheld devices (including PDAs), wearable computers, all types of cellular phones or mobile phones, multiprocessor systems, microprocessor-based or programmable consumer electronics, set-top boxes, network PCs, minicomputers, mainframe computers, etc. In fact, the terms "computer," "server," "host," "host system," etc. are usually used interchangeably here, and these terms refer to any one of the above-mentioned devices and systems and any data processor. In addition, various aspects of the system can be implemented in a special-purpose computer or data processor that is specifically programmed, configured, or constructed to execute one or more of the computer-executable instructions described in detail herein.

本发明的特定实施方式包括执行软件指令的计算机处理器，软件指令使处理器执行本发明的方法。例如，驱动3D显示器的计算机、3D投影仪、3D显示器、图形卡、连接以用于驱动3D投影仪或显示器的源单元、有线电视系统的机顶盒或者其它数据分配系统等中的一个或更多个处理器，可以通过执行处理器可访问的程序存储器中的软件指令来实施这里所描述的方法。还可以以程序产品的形式提供本发明。程序产品可以包括任意非易失性介质，其承载一组计算机可读信号，该计算机可读信号包括指令，当由数据处理器执行该指令时，使数据处理器执行本发明的方法。根据本发明的程序产品可以是很多种不同的形式中的任意种。程序产品例如可以包括诸如包括软盘、硬盘驱动器的磁数据存储介质、包括CD ROM、DVD的光学数据存储介质、包括ROM、闪速RAM等的电子数据存储介质等的物理介质。可以可选地对程序产品上的计算机可读信号进行压缩或者加密。Certain embodiments of the present invention include a computer processor that executes software instructions that cause the processor to perform the methods of the present invention. For example, one or more processors in a computer driving a 3D display, a 3D projector, a 3D display, a graphics card, a source unit connected to drive a 3D projector or display, a set-top box for a cable television system, or other data distribution system, can implement the methods described herein by executing software instructions in a program memory accessible to the processor. The present invention can also be provided in the form of a program product. The program product can include any non-volatile medium that carries a set of computer-readable signals that include instructions that, when executed by a data processor, cause the data processor to perform the methods of the present invention. The program product according to the present invention can be in any of a variety of different forms. For example, the program product can include physical media such as magnetic data storage media including floppy disks, hard drives, optical data storage media including CD ROMs, DVDs, electronic data storage media including ROMs, flash RAM, etc. The computer-readable signals on the program product can optionally be compressed or encrypted.

可以将本技术的示例存储或者分布在计算机可读介质上，计算机可读介质包括磁或光学可读计算机盘、硬配线或预编程芯片(例如EEPROM 半导体芯片)、纳米技术存储器、生物存储器或者其它数据存储介质。实际上，可以在一段时间内在因特网上或者在其它网络(包括无线网络)上、在传播介质(例如电磁波、声波等)上的传播信号上分发用计算机实施的指令、数据结构、屏幕显示和系统的方面的其它数据，或者可以在任意模拟或数字网络(包切换、电路切换或者其它方案)上提供它们。Examples of the present technology may be stored or distributed on computer-readable media, including magnetically or optically readable computer disks, hard-wired or pre-programmed chips (e.g., EEPROM semiconductor chips), nanotechnology memories, biological memories, or other data storage media. In practice, computer-implemented instructions, data structures, screen displays, and other data for aspects of the system may be distributed over a period of time over the Internet or other networks (including wireless networks), on propagated signals over propagation media (e.g., electromagnetic waves, acoustic waves, etc.), or may be provided over any analog or digital network (packet-switched, circuit-switched, or other schemes).

在将部件(例如组件、设备等)称为上述内容的情况下，除非另外指出，否则对该部件的引用(包括对设备的引用)应当被解释为作为该部件的等同物，包括执行与所描述的部件相同的功能的任意部件、包括结构上不等同于所公开的执行所示出的本发明的示例性实施例的功能的结构的部件。Where a component (e.g., assembly, device, etc.) is referred to as such, unless otherwise indicated, reference to the component (including reference to the device) should be interpreted as an equivalent of the component, including any component that performs the same function as the described component, including components that are not structurally equivalent to the disclosed structure for performing the function of the exemplary embodiments of the invention shown.

按照给定顺序呈现了处理、方法、列表等。可以按照不同的顺序执行可选示例，可以删除、移动、添加、细分、组合和/或修改一些元素，以提供替选或者子组合。可以以各种不同的方式实现这些元素中的每个。此外，虽然有时示出串行执行元素，但是可以代而并行执行它们，或者可以在不同的时间执行它们。一些元素可能具有条件性，为了简要而没有示出这一点。Processes, methods, lists, etc. are presented in a given order. Optional examples may be performed in a different order, and some elements may be deleted, moved, added, subdivided, combined, and/or modified to provide alternatives or sub-combinations. Each of these elements may be implemented in a variety of ways. Furthermore, although elements are sometimes shown as being performed serially, they may instead be performed in parallel, or may be performed at different times. Some elements may be conditional, which is not shown for brevity.

除非上下文清楚地需要其它情况，否则在全部说明书和权利要求中，应当以涵盖性的意义，也就是说，以“包括、但不限于”的意义，解释词语“包括”、“包含”等。如这里所使用的，术语“连接”、“耦合”或者任意其变化意为两个或更多个元素之间的直接或非直接的任意连接或耦合；元素之间的耦合或连接可以是物理的、逻辑的或者其组合。另外，词语“这里”、“上面”、“下面”和类似含义的词语应当是指本文的整体，而不是指任何特定部分。在上下文允许的情况下，使用单数或复数的词语也可能分别包括复数或者单数。关于两个或更多个项目的列举的词语“或者”覆盖以下所有对该词语的解释：列表中的项目中的任意一个；列表中的所有项目；以及列表中的项目的任意组合。Unless the context clearly requires otherwise, throughout the specification and claims, the words "including", "comprising", etc. should be interpreted in an inclusive sense, that is, in the sense of "including, but not limited to". As used herein, the terms "connected", "coupled" or any variation thereof means any connection or coupling, direct or indirect, between two or more elements; the coupling or connection between elements can be physical, logical, or a combination thereof. In addition, the words "herein", "above", "below" and words of similar meaning should refer to the entire document and not to any particular part. Where the context permits, words using the singular or plural number may also include the plural or singular number, respectively. The word "or" with respect to a list of two or more items covers all of the following interpretations of the word: any one of the items in the list; all the items in the list; and any combination of the items in the list.

虽然上面讨论了多个示例性方面和实施例，但是本领域技术人员将意识到特定变形、排列、附加及其子组合。因此，旨在将下面所附的权利要求以及之后引入的权利要求解释为包括所有这些变形、排列、附加和子组合，如在其真实精神和范围内。Although various exemplary aspects and embodiments have been discussed above, those skilled in the art will recognize certain variations, permutations, additions, and sub-combinations thereof. It is therefore intended that the following appended claims and claims hereafter introduced be interpreted as including all such variations, permutations, additions, and sub-combinations as are within their true spirit and scope.

本发明还包括以下实施方式：The present invention also includes the following embodiments:

(1)一种用于准备3D图像的方法，所述3D图像包括用于显示的左眼视图和右眼视图，所述方法包括：(1) A method for preparing a 3D image, the 3D image including a left-eye view and a right-eye view for display, the method comprising:

基于图像数据确定为允许完全减法串扰消除而增大像素值的量；determining, based on the image data, an amount to increase pixel values to allow for complete subtractive crosstalk cancellation;

确定所述量的最大值；determining a maximum value for said amount;

通过以基于所述最大值而在量上进行的加法或按比例确定中的一个或两个，全局地增大所述像素值。The pixel values are globally increased by one or both of addition and scaling in amounts based on the maximum value.

(2)根据(1)所述的方法，其中，所述3D图像包括视频序列中的视频帧，并且所述方法包括：对所述最大值或根据所述最大值得出的量应用时间低通滤波器。(2) The method of (1), wherein the 3D image comprises a video frame in a video sequence, and the method comprises applying a temporal low-pass filter to the maximum value or a quantity derived from the maximum value.

(3)根据(2)所述的方法，其中，所述时间滤波器包括双向滤波器。(3) The method according to (2), wherein the temporal filter comprises a bidirectional filter.

(4)根据(1)至(3)中任一项所述的方法，其中，确定增大像素值的量包括计算：(4) The method according to any one of (1) to (3), wherein determining the amount to increase the pixel value comprises calculating:

R_K(x,y)＝max(0,F(I_C,K(x,y))-I_S,K(x,y))R _K (x,y)=max(0,F(I _C,K (x,y))-I _S,K (x,y))

其中，R_k是增大位于位置(x,y)的像素的量，F()是串扰函数，I_S,K是所述图像的一个视图中的像素的颜色通道K的值，I_C,K是所述图像的另一视图中的像素的颜色通道K的值。where _Rk is the amount to increase the pixel at position (x,y), F() is the crosstalk function, I _S,K is the value of color channel K of the pixel in one view of the image, and I _C,K is the value of color channel K of the pixel in the other view of the image.

(5)根据(4)所述的方法，其中，F()包括与串扰因子c相乘。(5) The method of (4), wherein F() includes multiplication by a crosstalk factor c.

(6)根据(1)至(5)中任一项所述的方法，其中，全局地增大所述像素值包括：对具有单独的亮度通道和色度通道的颜色模型中的亮度通道中的值全局地进行按比例确定和/或相加。(6) The method of any one of (1) to (5), wherein globally increasing the pixel value comprises globally scaling and/or adding values in a luminance channel in a color model having separate luminance and chrominance channels.

(7)根据(6)所述的方法，其中，所述颜色模型包括YCbCr颜色模型。(7) The method according to (6), wherein the color model includes a YCbCr color model.

(8)一种用于准备3D视频图像的方法，所述3D视频图像包括用于显示的一系列帧中的每个帧的左眼视图和右眼视图，所述方法包括：(8) A method for preparing a 3D video image, the 3D video image comprising a left-eye view and a right-eye view for each frame in a series of frames for display, the method comprising:

识别视图中的像素值过小而不允许完全减法串扰消除的局部区域；identifying local regions in the view where pixel values are too small to allow full subtractive crosstalk cancellation;

确定加到所述局部区域的亮度补丁的强度；determining an intensity of a brightness patch applied to the local area;

对所述强度进行时间低通滤波；以及temporally low-pass filtering the intensities; and

根据经时间滤波的强度生成所述亮度补丁。The luminance patch is generated from the temporally filtered intensity.

(9)根据(8)所述的方法，包括：对于所述亮度补丁中的每个，生成用于另一视图的相应亮度补丁。(9) The method of (8), comprising: for each of the luma patches, generating a corresponding luma patch for another view.

(10)根据(9)所述的方法，其中，生成相应亮度补丁包括：定位所述亮度补丁的中心；进行块匹配以估计所述视图之间在与所述局部区域的中心相对应的位置处的不一致；以及把在x方向上偏移了所估计的不一致的所述亮度补丁的副本加到所述另一视图上。(10) A method according to (9), wherein generating the corresponding luminance patch includes: locating the center of the luminance patch; performing block matching to estimate the inconsistency between the views at a position corresponding to the center of the local area; and adding a copy of the luminance patch offset in the x direction by the estimated inconsistency to the other view.

(11)根据(8)所述的方法，包括：对与所述亮度补丁相关联的最后一帧之后的帧中的每个亮度补丁加上淡出。(11) The method of (8), comprising: applying a fade-out to each luma patch in a frame following the last frame associated with the luma patch.

(12)根据(8)所述的方法，包括：对与所述亮度补丁相关联的第一帧之前的帧中的每个亮度补丁加上淡入。(12) The method of (8), comprising fading in each luma patch in a frame preceding a first frame associated with the luma patch.

(13)根据(10)所述的方法，其中，使用16个像素或更大的块大小进行所述块匹配。(13) The method according to (10), wherein the block matching is performed using a block size of 16 pixels or larger.

(14)一种用于准备3D视频图像的方法，所述3D视频图像包括用于显示的一系列帧中的每个帧的左眼视图和右眼视图，所述方法包括：(14) A method for preparing a 3D video image, the 3D video image comprising a left-eye view and a right-eye view for each frame in a series of frames for display, the method comprising:

链接跨所述视频图像的帧的相应局部区域，以提供一个或更多个系列的链接局部区域；linking corresponding local regions across frames of the video image to provide one or more series of linked local regions;

识别所述系列的相应链接局部区域中的一个系列的第一链接局部区域；以及identifying a first linked local area of a series of the series of corresponding linked local areas; and

把与所述第一链接局部区域相对应的淡入亮度补丁加到与所述第一链接局部区域相对应的帧之前的一系列帧上。A faded-in luminance patch corresponding to the first linked local area is added to a series of frames preceding the frame corresponding to the first linked local area.

(15)根据(14)所述的方法，包括：识别所述系列的相应链接局部区域中的所述一个系列的最后的链接局部区域；以及把与所述最后的链接局部区域相对应的淡出亮度补丁加到接在与所述最后的链接局部区域相对应的帧之后的一系列帧上。(15) The method according to (14) includes: identifying the last linked local area of the series of corresponding linked local areas; and adding a faded brightness patch corresponding to the last linked local area to a series of frames following the frame corresponding to the last linked local area.

(16)一种非易失性介质，其承载一组计算机可读信号，所述计算机可读信号包括指令，当被数据处理器执行时，所述指令使所述数据处理器执行根据(1)至(15)中任一项所述的用于准备3D图像的方法。(16) A non-volatile medium carrying a set of computer-readable signals, the computer-readable signals comprising instructions that, when executed by a data processor, cause the data processor to perform the method for preparing a 3D image according to any one of (1) to (15).

(17)一种用于准备3D图像的设备，所述3D图像包括用于显示的左眼视图和右眼视图，所述设备包括：(17) An apparatus for preparing a 3D image, the 3D image comprising a left-eye view and a right-eye view for display, the apparatus comprising:

用于对图像数据进行处理以确定为允许完全减法串扰消除而增大像素值的量的装置；means for processing the image data to determine an amount by which pixel values are to be increased to allow for complete subtractive crosstalk cancellation;

用于确定所述量的最大值的装置；means for determining a maximum value of said quantity;

用于通过以基于所述最大值而在量上进行的加法或按比例确定中的一个或两个来全局地增大所述像素值的装置。Means for globally increasing the pixel values by one or both of adding or scaling in amounts based on the maximum value.

(18)根据(17)所述的设备，其中，所述3D图像包括视频序列中的视频帧，并且所述设备包括：用于对所述最大值或根据所述最大值得出的量应用时间低通滤波器的装置。(18) The apparatus of (17), wherein the 3D image comprises a video frame in a video sequence, and the apparatus comprises means for applying a temporal low-pass filter to the maximum value or a quantity derived from the maximum value.

(19)根据(18)所述的设备，其中，所述时间滤波器包括双向滤波器。(19) The apparatus according to (18), wherein the temporal filter comprises a bidirectional filter.

(20)根据(17)至(19)中任一项所述的设备，其中，用于对图像数据进行处理以确定增大像素值的量的装置被配置为计算：(20) The apparatus according to any one of (17) to (19), wherein the means for processing the image data to determine the amount to increase the pixel value is configured to calculate:

(21)根据(20)所述的设备，其中，F()包括与串扰因子c相乘。(21) The apparatus of (20), wherein F() includes multiplication by a crosstalk factor c.

(22)根据(17)至(21)中任一项所述的设备，其中，用于全局地增大所述像素值的装置包括：用于对具有单独的亮度通道和色度通道的颜色模型中的亮度通道中的值全局地进行按比例确定和/或相加的装置。(22) An apparatus according to any one of (17) to (21), wherein the device for globally increasing the pixel value includes: a device for globally scaling and/or adding values in a luminance channel in a color model having separate luminance and chrominance channels.

(23)根据(22)所述的设备，其中，所述颜色模型包括YCbCr颜色模型。(23) The apparatus according to (22), wherein the color model includes a YCbCr color model.

(24)一种用于准备3D视频图像的设备，所述3D视频图像包括用于显示的一系列帧中的每个帧的左眼视图和右眼视图，所述设备包括：(24) An apparatus for preparing a 3D video image comprising a left-eye view and a right-eye view for each frame in a series of frames for display, the apparatus comprising:

用于识别视图中的像素值过小而不允许完全减法串扰消除的局部区域的装置；means for identifying local regions in the view having pixel values too small to permit complete subtractive crosstalk cancellation;

用于确定加到所述局部区域的亮度补丁的强度的装置；means for determining an intensity of a brightness patch applied to said local area;

用于对所述强度进行时间低通滤波的装置；以及means for temporally low-pass filtering said intensities; and

用于根据经时间滤波的强度生成所述亮度补丁的装置。means for generating the luminance patch from the temporally filtered intensity.

(25)根据(23)所述的设备，其中，用于生成所述亮度补丁的装置被配置为对于所述亮度补丁中的每个生成用于另一视图的相应亮度补丁。(25) The apparatus according to (23), wherein the means for generating the luma patches is configured to generate a corresponding luma patch for another view for each of the luma patches.

(26)根据(24)所述的设备，包括：用于定位所述亮度补丁的中心的装置；用于进行块匹配以估计所述视图之间与所述亮度补丁的中心相对应的位置处的不一致的装置；以及用于把在x方向上偏移了所估计的不一致的所述亮度补丁的副本加到对所述另一视图上的装置。(26) The apparatus according to (24), comprising: means for locating the center of the luminance patch; means for performing block matching to estimate an inconsistency between the views at a position corresponding to the center of the luminance patch; and means for adding a copy of the luminance patch offset in the x direction by the estimated inconsistency to the other view.

(27)根据(24)所述的设备，包括：用于对与所述亮度补丁相关联的最后一帧之后的帧中的每个亮度补丁加上淡出的装置。(27) The apparatus of (24), comprising means for fading each luma patch in a frame subsequent to the last frame associated with the luma patch.

(28)根据(24)所述的设备，包括：用于对与所述亮度补丁相关联的第一帧之前的帧中的每个亮度补丁加上淡入的装置。(28) The apparatus of (24), comprising means for fading in each luma patch in a frame preceding a first frame associated with the luma patch.

(29)一种用于准备3D视频图像的设备，所述3D视频图像包括用于显示的一系列帧中的每个帧的左眼视图和右眼视图，所述设备包括：(29) An apparatus for preparing a 3D video image comprising a left-eye view and a right-eye view for each frame in a series of frames for display, the apparatus comprising:

用于识别视图中的像素值过小而不能进行完全减法串扰消除的局部区域的装置；means for identifying local regions in the view where pixel values are too small to permit full subtractive crosstalk cancellation;

用于确定加到所述局部区域的亮度补丁的强度的装置；means for determining the intensity of a brightness patch applied to said local area;

用于链接跨所述视频图像的帧的相应局部区域以提供一个或更多个系列的链接局部区域的装置；means for linking corresponding local regions across frames of said video image to provide one or more series of linked local regions;

用于识别所述系列的相应链接局部区域中的一个系列的第一链接局部区域的装置；以及means for identifying a first linked local area of one of said series of corresponding linked local areas; and

用于把与所述第一链接局部区域相对应的淡入亮度补丁加到与所述第一链接局部区域相对应的帧之前的一系列帧上的装置。Means for adding a faded-in luminance patch corresponding to said first linked local area to a series of frames preceding the frame corresponding to said first linked local area.

(30)根据(29)所述的设备，包括：用于识别所述系列的相应链接局部区域中的所述一个系列中的最后的链接局部区域并且把与所述最后的链接局部区域相对应的淡出亮度补丁加到接在与所述最后的链接局部区域相对应的帧之后的一系列帧上的装置。(30) The apparatus according to (29) comprises: a device for identifying the last linked local area in the series of corresponding linked local areas and adding a faded brightness patch corresponding to the last linked local area to a series of frames following the frame corresponding to the last linked local area.

(31)设置有根据(17)至(30)中任一项所述的设备的显示器、图形卡、3D投影仪、视频播放器、媒体播放器或3D数字影院投影仪。(31) A display, a graphics card, a 3D projector, a video player, a media player or a 3D digital cinema projector provided with the device according to any one of (17) to (30).

Claims

1. A method for preparing a three-dimensional image, the three-dimensional image including a left-eye view and a right-eye view for display, the method comprising:

Based on the image data of the left eye view and the right eye view, identify the set of pixels in each of the left eye view and the right eye view whose pixel values are insufficient to allow for complete subtraction crosstalk cancellation;

Selectively determine the amount by which the pixel value of the pixel in the set is increased to allow for complete subtractive crosstalk cancellation;

One or more quantities are established to increase the pixel value based on the quantities determined for the pixels in the set;

The pixel values of at least some pixels in the set are increased by adding to one or more of the established quantities and by proportionally determining one or two of them.

2. The method according to claim 1, wherein identifying the set of pixels comprises: identifying pixels with _IS ≥ ^{c 1/γ} _IC , wherein _IS is the pixel value of the currently considered view in the left-eye view and the right-eye view, _IC is the corresponding pixel value of another view in the left-eye view and the right-eye view, c is the amount of crosstalk between the left-eye view and the right-eye view when displayed, and γ is the gamma value.

3. The method of claim 1, wherein establishing the one or more quantities that increase the pixel value comprises: taking the maximum value of the determined quantities.

4. The method of claim 3, wherein establishing the one or more quantities that increase the pixel value comprises: taking the percentile value of the determined quantity.

5. The method of claim 4, wherein the percentile value is at least the 90th percentile of the determined quantity.

6. The method of claim 1, further comprising: globally increasing the pixel values in the left-eye view and the right-eye view by applying the same summation and/or proportional determination to all pixels in the left-eye view and the right-eye view.

7. The method of claim 1, wherein the pixel value is an R, G, B pixel value, and the method comprises: applying the same summation and/or proportional determination to all the R, G, B pixel values of the pixel for each of the pixels.

8. The method of claim 1, wherein the three-dimensional image comprises video frames in a video sequence, and the method comprises: applying a time low-pass filter to one or more quantities that increase the pixel values.

9. The method of claim 8, wherein the time low-pass filter comprises a bidirectional filter.

10. The method of claim 1, wherein the pixel value is represented by more than 8 bits.

11. The method of claim 1, further comprising: applying a sharpening filter before identifying a set of pixels in each of the left-eye and right-eye views whose pixel values are insufficient to allow for complete subtraction crosstalk cancellation.

12. The method of claim 1, further comprising: individually determining, for different regions within the left-eye view and the right-eye view, the amount by which the pixel value of the pixel is increased.

13. The method of claim 1, further comprising: identifying a plurality of regions within one of the left-eye view and the right-eye view, wherein the plurality of regions includes pixels whose pixel values are insufficient to allow for complete subtraction crosstalk cancellation; and for each of the plurality of regions, selectively increasing or not increasing the pixel values of pixels within that region based on whether the region is larger than a threshold size.

14. The method of claim 1, wherein the three-dimensional image comprises video frames in a video sequence, and the method comprises: identifying a plurality of regions within one of the left-eye view and the right-eye view, wherein the plurality of regions comprises pixels whose pixel values are insufficient to allow for complete subtraction crosstalk cancellation; and for each of the plurality of regions, selectively increasing or not increasing the pixel values of pixels in the region based on whether the region persists for more than a threshold number of frames in the video sequence.

15. The method of claim 14, wherein selectively increasing the pixel value of the pixels in the region or not increasing the pixel value of the pixels in the region further depends on whether the region is larger than a threshold size.

16. The method of claim 1, further comprising: identifying a plurality of regions within one of the left-eye view and the right-eye view by applying a binary labeling algorithm, wherein the plurality of regions include pixels whose pixel values are insufficient to allow for complete subtraction crosstalk cancellation.

17. The method of claim 1, further comprising: identifying a plurality of regions within one of the left-eye view and the right-eye view, wherein the plurality of regions includes pixels with pixel values insufficient to allow for complete subtraction crosstalk cancellation, wherein the established quantity includes a quantity for each of the plurality of regions, wherein the method further comprises: increasing pixel values in transition regions along the boundaries of the plurality of regions by the amount that decreases with distance from the boundaries of the plurality of regions.

18. The method of claim 17, wherein, within the transition region, the amount of reduction decreases non-linearly with distance from the boundary of the plurality of regions.

19. The method of claim 17, further comprising: increasing the pixel value in each of the plurality of regions by a different amount, such that the amount of brightness added to the region changes with position within the region to provide a gentle peak.