HK1231195B - Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities - Google Patents
Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities Download PDFInfo
- Publication number
- HK1231195B HK1231195B HK17104282.0A HK17104282A HK1231195B HK 1231195 B HK1231195 B HK 1231195B HK 17104282 A HK17104282 A HK 17104282A HK 1231195 B HK1231195 B HK 1231195B
- Authority
- HK
- Hong Kong
- Prior art keywords
- digital code
- gsdf
- display
- value
- values
- Prior art date
Links
Description
本申请是申请号为201280060408.7、申请日为2012年12月6日、 发明名称为“改进不同显示能力之间基于感知照度非线性的图像数据 交换的装置和方法”的发明专利申请的分案申请。This application is a divisional application of the invention patent application with application number 201280060408.7, application date December 6, 2012, and invention name "Apparatus and method for improving image data exchange based on perceived illumination nonlinearity between different display capabilities".
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求于2011年12月6日提交的美国临时专利申请 No.61/567,579;于2012年7月23日提交的美国临时专利申请 No.61/674,503;以及于2012年9月20日提交的美国临时专利申请 No.61/703,449的优先权,这些申请的全部内容都出于所有目的通过引 用并入此。This application claims priority to U.S. Provisional Patent Application No. 61/567,579, filed December 6, 2011; U.S. Provisional Patent Application No. 61/674,503, filed July 23, 2012; and U.S. Provisional Patent Application No. 61/703,449, filed September 20, 2012, the entire contents of which are incorporated herein by reference for all purposes.
技术领域Technical Field
本发明总地涉及图像数据。更特别地,本发明的实施例涉及不同 显示能力之间基于感知非线性的图像数据交换。The present invention relates generally to image data. More particularly, embodiments of the present invention relate to the exchange of image data between different display capabilities based on perceptual nonlinearity.
背景技术Background Art
技术的发展使得现代显示器设计可以呈现图像和视频内容,与相 同内容呈现在落后的显示器上相比,各种质量特性得到显著改进。例 如,一些更现代的显示器能够以比常规或标准显示器的标准动态范围 (SDR)高的动态范围(DR)呈现内容。Technological advances have enabled modern display designs to present image and video content with significantly improved quality characteristics compared to the same content presented on less advanced displays. For example, some more modern displays are capable of presenting content with a higher dynamic range (DR) than the standard dynamic range (SDR) of conventional or standard displays.
例如,一些现代的液晶显示器(LCD)具有光单元(背光单元、 侧光单元等),该光单元提供光场,在该光场中可以与有源LCD元 件的液晶取向状态的调制分开地对各个部分进行调制。该双调制方法 是可扩展的(例如,可扩展到N个调制层,其中,N包括大于2的整数),诸如可以通过显示器的电光学构造中的可控中间层(例如,可 分别控制的LCD层的多个层)进行扩展。For example, some modern liquid crystal displays (LCDs) have light units (backlight units, sidelight units, etc.) that provide a light field in which individual components can be modulated separately from the modulation of the liquid crystal orientation state of the active LCD elements. This dual modulation approach is scalable (e.g., to N modulation layers, where N includes integers greater than 2), such as by controllable intermediate layers in the display's electro-optical construction (e.g., multiple layers of separately controllable LCD layers).
相比之下,一些现有的显示器具有比高动态范围(HDR)明显窄 的动态范围(DR)。使用典型的阴极射线管(CRT)、具有恒定荧光 白色背光照明或等离子体屏幕技术的液晶显示器(LCD)的移动装置、 计算机平板、游戏装置、电视(TV)和计算机监视器设备的DR呈现能力可能限于大约三个数量级。这样的现有显示器因此代表标准动态 范围(SDR),相对于HDR,有时也被称为“‘低’动态范围”或“LDR”。In contrast, some existing displays have a dynamic range (DR) significantly narrower than high dynamic range (HDR). Mobile devices, computer tablets, gaming devices, televisions (TVs), and computer monitors using typical cathode ray tubes (CRTs), liquid crystal displays (LCDs) with constant fluorescent white backlighting, or plasma screen technology can have DR rendering capabilities limited to approximately three orders of magnitude. Such existing displays therefore represent standard dynamic range (SDR), sometimes also referred to as "low" dynamic range or "LDR" relative to HDR.
HDR照相机所捕捉的图像可具有远大于大多数(如果不是所有) 显示装置的动态范围的场景相关(scene-referred)HDR。场景相关 HDR图像可以包括大量数据,并且可以被转换为便利于传输和存储的 后期制作格式(例如,具有8位RGB、YCbCr或深色选项的HDMI视频信号;具有10位4:2:2采样率的1.5Gbps SDI视频信号;具有 12位4:4:4或10位4:2:2采样率的3Gbps SDI;以及其他视频或图像 格式)。后期制作图像可以包括比场景相关HDR图像的动态范围小 得多的动态范围。此外,当图像被递送到终端用户的显示装置进行呈 现时,沿途发生装置特定和/或制造商特定的图像变换,使得与原始的 场景相关HDR图像相比,在呈现图像中引起大量视觉上明显的错误。Images captured by HDR cameras can have scene-referred HDR, a dynamic range far greater than that of most, if not all, display devices. Scene-referred HDR images can include large amounts of data and must be converted to post-production formats that facilitate transmission and storage (e.g., HDMI video signals with 8-bit RGB, YCbCr, or Deep Color options; 1.5 Gbps SDI video signals with 10-bit 4:2:2 sampling rates; 3 Gbps SDI with 12-bit 4:4:4 or 10-bit 4:2:2 sampling rates; and other video or image formats). Post-production images can include a dynamic range significantly smaller than that of scene-referred HDR images. Furthermore, when the images are delivered to an end-user's display device for presentation, device-specific and/or manufacturer-specific image transformations occur along the way, causing numerous visually noticeable errors in the rendered image compared to the original scene-referred HDR image.
在本部分中描述的方法是可从事的方法,但未必是以前已经构想 或从事的方法。因此,除非另外指出,否则,不应仅凭借包含于本部 分中而认为在本部分中描述的方法中的任一种为现有技术。类似地, 除非另外指出,否则,关于一种或更多种方法识别的问题不应基于本 部分而认为在任何现有技术中已被识别。The approaches described in this section are approaches that could be pursued, but not necessarily approaches that have been previously conceived or pursued. Therefore, unless otherwise indicated, no approach described in this section should be considered prior art solely by virtue of its inclusion in this section. Similarly, unless otherwise indicated, any issues identified with respect to one or more approaches should not be considered as having been identified in any prior art based on this section.
GB2311432A公开了用于将图像数据的输入帧转换成具有不同空 间频率谱的多个输出帧的方法。在根据人类视觉系统的模型向每个输 出帧引用单独的增益之后,输出帧被组合,由此锐化或软化输入图像。GB2311432A discloses a method for converting an input frame of image data into multiple output frames having different spatial frequency spectra. After applying a separate gain to each output frame according to a model of the human visual system, the output frames are combined, thereby sharpening or softening the input image.
US2003/0174284A1公开了表现为使用随机交错空间频率的光栅 刺激的两个交替强制选择检查的对比敏感度测试。US 2003/0174284 A1 discloses a contrast sensitivity test performed as two alternating forced-choice tests using grating stimuli of randomly interleaved spatial frequencies.
WO2007/014681A公开了调节显示系统以使得显示系统符合针对 所选择的参数范围的强制标准的方法。根据标准化的灰阶标准显示函 数编码的图像被转换成显示器特定数字驱动水平。数字驱动水平被基 于人类对比敏感度根据显示系统的本征传递函数曲线转换成照度输出。WO2007/014681A discloses a method for adjusting a display system so that it complies with mandatory standards for a selected parameter range. An image encoded according to a standardized grayscale standard display function is converted into display-specific digital drive levels. The digital drive levels are converted into luminance output according to the display system's intrinsic transfer function curve based on human contrast sensitivity.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
在附图中以举例的方式、而非限制的方式例示本发明,在附图中, 相似的标号表示类似的元件,其中:The present invention is illustrated by way of example, and not by way of limitation, in the accompanying figures and in which like references indicate similar elements and in which:
图1例示根据本发明的示例实施例的跨越多个光适应水平的对比 敏感度函数曲线的示例族群;FIG1 illustrates an example family of contrast sensitivity function curves across multiple light adaptation levels according to an example embodiment of the present invention;
图2例示根据本发明的示例实施例的示例积分路径;FIG2 illustrates an example integration path according to an example embodiment of the present invention;
图3例示根据本发明的示例实施例的示例灰阶(grayscale)显示 函数;FIG3 illustrates an example grayscale display function according to an example embodiment of the present invention;
图4例示根据本发明的示例实施例的描绘Weber分数的曲线;FIG4 illustrates a graph depicting Weber scores according to an example embodiment of the present invention;
图5例示根据本发明的示例实施例的与不同GSDF的装置交换图 像数据的示例框架;FIG5 illustrates an example framework for exchanging image data with devices of different GSDFs according to an example embodiment of the present invention;
图6例示根据本发明的示例实施例的示例转换单元;FIG6 illustrates an example conversion unit according to an example embodiment of the present invention;
图7例示根据本发明的示例实施例的示例SDR显示器;FIG7 illustrates an example SDR display according to an example embodiment of the present invention;
图8A和图8B例示根据本发明的示例实施例的示例处理流程;8A and 8B illustrate example process flows according to an example embodiment of the present invention;
图9例示根据本发明的示例实施例的在其上可以实现如本文中所 描述的计算机或计算装置的示例硬件平台;FIG9 illustrates an example hardware platform upon which a computer or computing device as described herein may be implemented according to an example embodiment of the present invention;
图10A例示根据一些示例实施例的多个代码空间中的以JND为 单位的代码错误的最大量,所述多个代码空间中的每个代码空间具有 一个或多个不同位深中的不同的一个位深;FIG. 10A illustrates a maximum amount of code errors in a unit of JND in a plurality of code spaces, each of the plurality of code spaces having a different one of one or more different bit depths according to some example embodiments;
图10B至图10E例示根据一些示例实施例的代码错误的分布;以 及Figures 10B to 10E illustrate distribution of code errors according to some example embodiments; and
图11例示根据示例实施例的函数模型中的参数的值。FIG11 illustrates values of parameters in a function model according to an example embodiment.
具体实施方式DETAILED DESCRIPTION
本文中描述与不同能力的显示器之间基于感知照度非线性的图像 数据交换相关的示例实施例。在以下的描述中,出于解释的目的,为 了使得能够彻底理解本发明,阐述了大量的特定细节。但应理解,可 以在没有这些特定细节的情况下实施本发明。在其它情况下,为了避 免不必要地遮蔽、掩盖或混淆本发明,不以详尽的细节描述公知的结 构和装置。This document describes example embodiments related to the exchange of image data between displays of varying capabilities based on perceived luminance nonlinearity. In the following description, for purposes of explanation and to provide a thorough understanding of the present invention, numerous specific details are set forth. However, it should be understood that the present invention can be practiced without these specific details. In other instances, well-known structures and devices are not described in exhaustive detail to avoid unnecessarily obscuring, obscuring, or obfuscating the present invention.
本文中根据下面的大纲来描述示例实施例:Example embodiments are described herein according to the following outline:
1.总体概述1. General Overview
2.对比敏感度函数(CSF)模型2. Contrast Sensitivity Function (CSF) Model
3.感知非线性3. Perceptual nonlinearity
4.数字代码值和灰度级(gray level)4. Digital code value and gray level
5.模型参数5. Model Parameters
6.可变空间频率6. Variable spatial frequency
7.函数模型7. Function Model
8.基于参考GSDF交换图像数据8. Exchange image data based on reference GSDF
9.转换参考编码图像数据9. Convert reference coded image data
10.示例处理流程10. Example Processing Flow
11.实现机构——硬件概述11. Implementation mechanism - Hardware overview
12.枚举示例实施例、等同、扩展、替代和其它12. Enumerate example embodiments, equivalents, extensions, alternatives and others
1.总体概述1. General Overview
本概述给出本发明的示例实施例的一些方面的基本描述。应当注 意,本概述不是示例性实施例的各方面的详尽或全面的概括。并且, 应当注意,本概述不应被理解为标识示例性实施例的任何特别重要的 方面或要素,也不应被理解为特别地将示例性实施例的任何范围划界, 也不总体上将本发明划界。本概述仅以简要和简化的格式给出与示例 性实施例有关的一些概念,并且应被理解为仅是以下给出的示例性实 施例的更详细的描述的概念性序言。This summary provides a basic description of some aspects of the exemplary embodiments of the present invention. It should be noted that this summary is not an exhaustive or comprehensive overview of all aspects of the exemplary embodiments. Furthermore, it should be noted that this summary should not be construed as identifying any particularly important aspects or elements of the exemplary embodiments, nor as delimiting any scope of the exemplary embodiments in particular, nor as delimiting the present invention generally. This summary merely presents some concepts related to the exemplary embodiments in a concise and simplified format and should be understood as merely a conceptual prelude to the more detailed description of the exemplary embodiments presented below.
如果两个照度值彼此相差不大,则人类视觉可能感知不到这两个 照度值之间的差异。相反,如果照度值相差不小于最小可觉差(JND), 则人类视觉仅感知到差异。由于人类视觉的感知非线性,各个JND的 数量在光水平(light level)范围上为不均匀的大小或比例,而是随着 不同的各个光水平而变化。另外,由于感知非线性,各个JND的数量 在特定光水平处的空间频率范围上为不均匀的大小或比例,而是随着 低于截止空间频率的不同空间频率而变化。If two illuminance values are not significantly different from each other, human vision may not perceive the difference between them. Conversely, if the illuminance values differ by at least a just noticeable difference (JND), human vision only perceives the difference. Due to the nonlinearity of human vision, the magnitude of each JND is not uniform across a range of light levels, but rather varies with each light level. Furthermore, due to perceptual nonlinearity, the magnitude of each JND is not uniform across a range of spatial frequencies at a specific light level, but rather varies with different spatial frequencies below the cutoff spatial frequency.
通过用相等大小或线性缩放大小的照度量化步长被编码的图像数 据与人类视觉的感知非线性不匹配。用通过在一个固定空间频率上的 照度量化步长被进行编码的图像数据与人类视觉的感知非线性也不匹 配。根据这些技术,当分配码字来表示量化照度值时,太多的码字可 能分布在光水平范围的特定区域(例如,明亮区域)中,而太少的码 字则可能分布在光水平范围的不同区域(例如,黑暗区域)中。Image data encoded using luminance quantization steps of equal or linearly scaled size does not match the perceptual nonlinearity of human vision. Image data encoded using luminance quantization steps at a fixed spatial frequency also does not match the perceptual nonlinearity of human vision. According to these techniques, when assigning codewords to represent quantized luminance values, too many codewords may be distributed in a particular region of the light level range (e.g., bright areas), while too few codewords may be distributed in different regions of the light level range (e.g., dark areas).
在过于密集的区域中,大量码字可能不能生成感知差异,并且因 此实际上被浪费。在稀疏的区域中,两个相邻码字可以生成比JND大 得多的感知差异,并且可能生成轮廓畸变(也称为条带状)视觉伪像。In overly dense areas, a large number of codewords may not generate a perceptual difference and are therefore effectively wasted. In sparse areas, two adjacent codewords may generate a perceptual difference much larger than the JND and may generate contour distortion (also known as banding) visual artifacts.
根据如本文中所描述的技术,可以使用对比敏感度函数(contrast sensitivityfunction,CSF)来确定宽范围的光水平(例如,0至12,000 cd/m2)上的JND。在示例实施例中,选择在特定光水平的作为空间 频率的函数的峰值JND来表示在该特定光水平的人类感知量。峰值 JND的选择根据如下的人类视觉的行为:即当具有接近的、但是不同 的照度值的背景被观看时人类视觉适应于提高的水平的视觉感知性水 平,这在视频和图像显示视场中有时被称为勾边效应(crispening effect) 和/或Whittle的勾边效应,并且在本文中可以像这样进行描述。如本 文中所使用的,术语“光适应水平(light adaptation level)”可以用 于表示这样的光水平,在该光水平,在假定人类视觉适应于该光水平 的情况下,选择/确定(例如,峰值)JND。如本文中所描述的峰值JND 在不同光适应水平随空间频率而变化。According to the techniques described herein, a contrast sensitivity function (CSF) can be used to determine the JND over a wide range of light levels (e.g., 0 to 12,000 cd/m 2 ). In an example embodiment, a peak JND as a function of spatial frequency at a particular light level is selected to represent the amount of human perception at that particular light level. The selection of the peak JND is based on the behavior of human vision, which adapts to an increased level of visual perceptibility when backgrounds with similar, but different illuminance values are viewed, which is sometimes referred to as the crispening effect and/or Whittle's crispening effect in video and image display fields of view, and may be described as such herein. As used herein, the term "light adaptation level" may be used to refer to a light level at which a (e.g., peak) JND is selected/determined assuming that human vision is adapted to that light level. The peak JND as described herein varies with spatial frequency at different light adaptation levels.
如本文中所使用的,术语“空间频率”可以是指图像中的空间调 制/变化的速率(其中,与相对于时间计算速率相对比,相对于空间距 离或在空间距离上计算速率)。与可将空间频率固定为特定值的常规 方法相对比,如本文中所描述的空间频率可以例如在一个范围中或在 一个范围上变化。在一些实施例中,峰值JND可被限于特定空间频率 范围(例如,0.1至5.0、0.01至8.0个周期/度、或者更小的或更大的 范围)内。As used herein, the term "spatial frequency" may refer to the rate of spatial modulation/variation in an image (wherein the rate is calculated relative to or over spatial distance, as opposed to relative to time). In contrast to conventional approaches that may fix the spatial frequency to a specific value, the spatial frequency as described herein may vary, for example, within or over a range. In some embodiments, the peak JND may be limited to a specific spatial frequency range (e.g., 0.1 to 5.0, 0.01 to 8.0 cycles/degree, or a smaller or larger range).
参考灰阶显示函数(GSDF)可以基于CSF模型而产生。在一些 实施例中,对于CSF模型假定非常广的视场,以用于产生更好地支持 娱乐显示场的参考GSDF。GSDF是指参考数字代码值(或参考码字) 集合、参考灰度级(或参考照度值)集合、以及这两个集合之间的映射。在示例实施例中,每个参考数字代码值对应于如JND所表示的(例 如,在某一光适应水平的峰值JND)的人类感知量。在示例实施例中, 相同数量的参考数字代码值可以对应于人类感知量。A reference grayscale display function (GSDF) can be generated based on a CSF model. In some embodiments, a very wide field of view is assumed for the CSF model to generate a reference GSDF that better supports entertainment display fields. A GSDF refers to a set of reference digital code values (or reference codewords), a set of reference grayscale levels (or reference illuminance values), and a mapping between these two sets. In an example embodiment, each reference digital code value corresponds to a human perceptual quantity represented by a JND (e.g., a peak JND at a certain light adaptation level). In an example embodiment, the same number of reference digital code values can correspond to a human perceptual quantity.
可以通过从初始值开始累积JND来获得GSDF。在示例实施例中, 将中间码字值(例如,对于12位代码空间,2048)作为初始值给予参 考数字代码。参考数字代码的初始值可以对应于初始参考灰度级(例 如,100cd/m2)。在参考数字代码一个接一个地递增时可以通过正累 积JND(相加)来获得对于参考数字代码的其他值的其他参考灰度级, 当参考数字代码一个接一个地递减时,可以通过负累积JND(相减) 来获得对于参考数字代码的其他值的其他参考灰度级。在示例实施例 中,在计算GSDF中的参考值时,可以使用诸如对比度阈值的量值来 代替JND。在计算GSDF中实际上所使用的这些量值可以被定义为无 单位比率,并且可以与相应的JND仅相差已知的或可确定的乘数、除 法因子和/或偏差。The GSDF can be obtained by accumulating JNDs starting from an initial value. In an exemplary embodiment, an intermediate codeword value (e.g., 2048 for a 12-bit code space) is assigned as an initial value to the reference digital code. The initial value of the reference digital code can correspond to an initial reference grayscale level (e.g., 100 cd/m 2 ). As the reference digital codes increase one by one, other reference grayscale levels for other values of the reference digital code can be obtained by accumulating positive JNDs (addition), and as the reference digital codes decrease one by one, other reference grayscale levels for other values of the reference digital code can be obtained by accumulating negative JNDs (subtraction). In an exemplary embodiment, when calculating the reference values in the GSDF, a quantity such as a contrast threshold can be used instead of the JND. These quantities actually used in calculating the GSDF can be defined as unitless ratios and can differ from the corresponding JND only by a known or determinable multiplier, division factor, and/or bias.
代码空间可被选择为包括GSDF中的所有参考数字代码值。在一 些实施例中,所有的参考数字代码值都驻存在其中的代码空间可以是 下列代码空间之一:10位代码空间、11位代码空间、12位代码空间、 13位代码空间、14位代码空间、15位代码空间、或者更大的或更小 的代码空间。The code space may be selected to include all reference digital code values in the GSDF. In some embodiments, the code space in which all reference digital code values reside may be one of the following code spaces: a 10-bit code space, an 11-bit code space, a 12-bit code space, a 13-bit code space, a 14-bit code space, a 15-bit code space, or a larger or smaller code space.
尽管可以使用大的代码空间(>15位)来托管所有的参考数字代 码值,但是在特定实施例中,使用最高效的代码空间(例如,10位、12位等)来托管参考GSDF中所产生的所有参考数字代码值。Although a large code space (>15 bits) can be used to host all reference digital code values, in certain embodiments, the most efficient code space (e.g., 10 bits, 12 bits, etc.) is used to host all reference digital code values generated in the reference GSDF.
可以使用参考GSDF来对图像数据进行编码,所述图像数据例如 由具有远大于大多数(如果不是所有)显示装置的动态范围的场景相 关HDR的HDR照相机、演播室系统、或其他系统捕捉或产生。可以 用多种分发或发送方法(例如,具有8位RGB、YCbCr或深色选项 的HDMI视频信号;具有10位4:2:2采样率的1.5Gps SDI视频信号; 具有12位4:4:4或10位4:2:2采样率的3Gps SDI;以及其他视频或 图像格式)来将编码图像数据提供给下游装置。The reference GSDF can be used to encode image data captured or generated, for example, by an HDR camera, studio system, or other system with scene-dependent HDR that has a dynamic range far greater than that of most, if not all, display devices. The encoded image data can be provided to downstream devices using a variety of distribution or transmission methods (e.g., HDMI video signals with 8-bit RGB, YCbCr, or Deep Color options; 1.5 Gps SDI video signals with a 10-bit 4:2:2 sampling rate; 3 Gps SDI with a 12-bit 4:4:4 or 10-bit 4:2:2 sampling rate; and other video or image formats).
在一些实施例中,因为参考GSDF中的相邻参考数字代码值对应 于在JND内的灰度级,所以人类视觉能够区分的细节可以完全或基本 上保留在基于参考GSDF被编码的图像数据中。完全支持参考GSDF 的显示器可以呈现不具有条带状或轮廓畸变伪像的图像。In some embodiments, because adjacent reference digital code values in the reference GSDF correspond to grayscale levels within the JND, details distinguishable by human vision can be fully or substantially preserved in image data encoded based on the reference GSDF. A display that fully supports the reference GSDF can present images without banding or contouring artifacts.
可以使用基于参考GSDF被编码的图像数据(或参考编码图像数 据)来支持可能不能完全支持参考GSDF中的所有参考照度值的多种 能力较低的显示器。因为参考编码图像数据包括所支持的照度范围(其 可以被设计为显示器所支持的范围的超集)中的所有感知细节,所以 参考数字代码值可被以如下方式最佳地且高效率地转码为显示器特定 数字代码值,即尽可能多地保留特定显示器能够支持的细节并且引起 尽可能少的视觉上明显的错误。附加地和/或可选地,可以与从参考数 字代码值到显示器特定数字代码值的转码相结合地或者作为该转码的 一部分,执行去轮廓(decontour)和抖动(dither),以进一步改进 图像或视频质量。Image data encoded based on a reference GSDF (or reference coded image data) can be used to support a variety of lower-capability displays that may not fully support all reference luminance values in the reference GSDF. Because the reference coded image data includes all perceptual detail within the supported luminance range (which can be designed to be a superset of the range supported by the display), the reference digital code values can be optimally and efficiently transcoded into display-specific digital code values in a manner that preserves as much detail as possible that can be supported by the specific display and introduces as few visually noticeable errors as possible. Additionally and/or alternatively, decontouring and dithering can be performed in conjunction with or as part of the transcoding from reference digital code values to display-specific digital code values to further improve image or video quality.
如本文中所描述的技术是无关于颜色空间的。它们可以用于RGB 颜色空间、YCbCr颜色空间或不同的颜色空间中。此外,可以将使用 随空间频率变化的JND导出参考值(例如,参考数字代码值和参考灰 度级)的技术应用于可以包括照度通道或者可以不包括照度通道的不 同颜色空间(例如,RGB)中的除了照度通道之外的不同通道(例如, 红色、绿色和蓝色通道之一)。例如,作为参考灰度级的代替,可以 使用可应用于蓝色颜色通道的JND来导出参考蓝色值。因此,在一些 实施例中,可以用灰阶来取代颜色。附加地和/或可选地,还可以使用 不同的CSF模型来代替Barten的模型。所以,对于同一CSF模型, 可以使用不同的模型参数。The techniques described herein are color space agnostic. They can be used in the RGB color space, the YCbCr color space, or different color spaces. Furthermore, the techniques for deriving reference values (e.g., reference digital code values and reference grayscale levels) using spatial frequency-dependent JNDs can be applied to different channels (e.g., one of the red, green, and blue channels) in addition to the luminance channel in different color spaces (e.g., RGB), which may or may not include a luminance channel. For example, instead of a reference grayscale, a JND applicable to the blue color channel can be used to derive a reference blue value. Thus, in some embodiments, grayscale can be used instead of color. Additionally and/or alternatively, a different CSF model can be used instead of Barten's model. Thus, different model parameters can be used for the same CSF model.
在一些实施例中,如本文中所描述的机构形成媒体处理系统的一 部分,所述媒体处理系统包括但不限于:手持装置、游戏机、电视、 膝上型计算机、上网本计算机、蜂窝无线电话、电子书阅读器、销售 点终端、台式计算机、计算机工作站、计算机亭、或各种其他类型的 终端和媒体处理单元。In some embodiments, mechanisms as described herein form part of a media processing system including, but not limited to, a handheld device, a game console, a television, a laptop computer, a netbook computer, a cellular wireless telephone, an electronic book reader, a point-of-sale terminal, a desktop computer, a computer workstation, a computer kiosk, or various other types of terminals and media processing units.
对于本文中所描述的优选实施例以及总体原理和特征的各种修改 对于本领域的技术人员将是清楚的。因此,本公开并非意图局限于所 示的实施例,而是要被给予与本文中所描述的原理和特征一致的最广 泛的范围。Various modifications to the preferred embodiments and overall principles and features described herein will be apparent to those skilled in the art. Therefore, the present disclosure is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features described herein.
2.对比敏感度函数(CSF)模型2. Contrast Sensitivity Function (CSF) Model
可以用对比敏感度函数(CSF)来最好地描述人类视觉对于所呈 现的图像中的空间结构的敏感度,CSF将对比敏感度描述为空间频率 (或人类观察者所感知的图像中的空间调制/变化的速率)的函数。如 本文中所使用的,对比敏感度S可以被认为是人类视觉神经信号处理 中的增益,而对比度阈值CT可以从对比敏感度的倒数确定,例如:The sensitivity of human vision to spatial structure in a presented image can be best described by the contrast sensitivity function (CSF), which describes contrast sensitivity as a function of spatial frequency (or the rate of spatial modulation/change in an image perceived by a human observer). As used herein, contrast sensitivity S can be thought of as the gain in human visual neural signal processing, while the contrast threshold CT can be determined from the inverse of contrast sensitivity, for example:
对比敏感度=S=1/CT 表达式(1)Contrast sensitivity = S = 1/ CT expression (1)
如本文中所使用的,术语“对比度阈值”可以是指人眼感知到对 比度差异所必要的(相对)对比度的最低值(例如,最小可觉差), 或者与该最低值相关;在一些实施例中,还可以将对比度阈值描绘为 照度值范围上除以光适应水平的最小可觉差的函数。As used herein, the term "contrast threshold" may refer to, or be related to, the minimum value of (relative) contrast necessary for the human eye to perceive a difference in contrast (e.g., a just noticeable difference); in some embodiments, the contrast threshold may also be described as a function of the just noticeable difference over a range of illuminance values divided by the light adaptation level.
在一些实施例中,可以在实验中直接测量对比度阈值,而不使用 任何CSF模型。然而,在一些其他实施例中,可以基于CSF模型来 确定对比度阈值。可以用若干个模型参数来构建CSF模型,并且可以 使用CSF模型来导出GSDF,该GSDF的灰度级中的量化步长取决于由照度值和空间频率表征的光水平并且随该光水平而变化。示例实施 例可以基于各种CSF模型中的一个或多个来实现,诸如下列文献中所 描述的CSF模型:Peter G.J.Barten的Contrast Sensitivity of the Human Eye and its Effects on Image Quality(1999)(在下文中,Barten 的模型或Barten的CSF模型)、或A.B.Watson,MIT Press(1933) 的Digital Images and Human Vision中的Scott Daly,第17章(在下 文中,Daly的模型)。关于本发明的示例实施例,可以用实验、从理 论上、用CSF模型或它们的组合来导出用于产生参考灰阶显示函数 (GSDF)的对比度阈值。In some embodiments, the contrast threshold can be measured directly in an experiment without using any CSF model. However, in some other embodiments, the contrast threshold can be determined based on a CSF model. A CSF model can be constructed using several model parameters, and the CSF model can be used to derive a GSDF whose quantization step size in grayscale depends on and varies with the light level characterized by illuminance value and spatial frequency. Example embodiments can be implemented based on one or more of various CSF models, such as those described in Peter GJ Barten's Contrast Sensitivity of the Human Eye and its Effects on Image Quality (1999) (hereinafter, Barten's model or Barten's CSF model), or Scott Daly's Digital Images and Human Vision , Chapter 17, by A.B. Watson, MIT Press (1933) (hereinafter, Daly's model). With respect to example embodiments of the present invention, the contrast threshold used to generate a reference grayscale display function (GSDF) can be derived experimentally, theoretically, using a CSF model, or a combination thereof.
如文本中所使用的,GSDF可以是指多个数字代码值(例如, 1,2,3,…,N)到多个灰度级(L1,L2,L3,…,LN)的映射,其中,如表1 中所示,数字代码值表示对比度阈值的索引值,灰度级对应于对比度 阈值。As used herein, GSDF may refer to a mapping of multiple digital code values (e.g., 1, 2, 3, ..., N) to multiple gray levels ( L1 , L2 , L3 , ..., LN ), where the digital code values represent index values of contrast thresholds and the gray levels correspond to the contrast thresholds as shown in Table 1.
表1Table 1
在实施例中,可以如下地与对比度(例如,C(i))相关地计算与 数字代码值(例如,i)相应的灰度级(例如,Li)和相邻灰度级(例 如,Li+1):In an embodiment, a gray level (eg, Li) and an adjacent gray level (eg, Li + 1) corresponding to a digital code value (eg, i) may be calculated in relation to a contrast ratio (eg, C(i) ) as follows:
其中,C(i)表示限于Li与Li+1之间的照度范围的对比度。Lmean(i,i+1) 包括两个相邻灰度级Li和Li+1的算术平均值或均值。对比度C(i)在算 术上以因子2与Weber分数ΔL/L相关。这里,ΔL表示(Li+1-Li),L 表示Li、Li+1、或Li与Li+1之间的中间值之一。Where C(i) represents the contrast ratio limited to the illumination range between Li and Li +1 . Lmean (i,i+1) comprises the arithmetic mean or average of two adjacent gray levels Li and Li +1 . The contrast ratio C(i) is mathematically related to the Weber fraction ΔL/L by a factor of 2. Here, ΔL represents ( Li+1 - Li ), and L represents Li , Li +1 , or one of the intermediate values between Li and Li +1 .
在一些实施例中,GSDF产生器可以如下地将对比度C(i)设置为 与在Li与Li+1之间(包括Li与Li+1在内)的照度水平L处的对比度 阈值(例如,CT(i))相等的或者成比例的值:In some embodiments, the GSDF generator may set the contrast C(i) to a value equal to or proportional to a contrast threshold (e.g., CT (i)) at illumination levels L between Li and Li + 1 , inclusive, as follows:
C(i)=kCT(i) 表达式(3)C(i)=kC T (i) Expression (3)
其中,k表示乘法常数。关于本发明的实施例,可以使用其他描述性 统计/定义(例如,几何均值、中间值、众数、方差或标准差)和/或 缩放(x2、x3、除以或乘以缩放因子等)和/或偏移(+1、+2、-1、-2、 减去或加上偏移等)和/或加权(例如,为两个相邻灰度级分配相同的 或不同的权重因子)来将对比度阈值与用于计算GSDF中的灰度级的 目的的对比度相关。Where k represents a multiplication constant. With respect to embodiments of the present invention, other descriptive statistics/definitions (e.g., geometric mean, median, mode, variance, or standard deviation) and/or scaling (x2, x3, division or multiplication by a scaling factor, etc.) and/or offset (+1, +2, -1, -2, subtraction or addition of an offset, etc.) and/or weighting (e.g., assigning the same or different weight factors to two adjacent gray levels) may be used to relate the contrast threshold to the contrast for the purpose of calculating the gray levels in the GSDF.
如表达式(1)、(2)和(3)中所计算的,对比度或对比度阈值可以包括 相对值,因此可以包括无单位量值(例如,所以S也可以是无单位的)。As calculated in expressions (1), (2), and (3), the contrast or contrast threshold may include relative values and therefore may include unitless quantities (e.g., so S may also be unitless).
可以从基于描绘CSF模型的CSF的基本对比度阈值计算或基本 对比度阈值测量来构建CSF模型。人类视觉不幸的是复杂的、自适应 的和非线性的,使得不存在描述人类视觉的单个CSF曲线。相反,可 以基于CSF模型来产生CSF曲线族群。即使就同一CSF模型而言,模型参数的不同值对于CSF曲线族群也生成不同的绘图。A CSF model can be constructed from a basic contrast threshold calculation or basic contrast threshold measurement based on the CSF that depicts the CSF model. Unfortunately, human vision is complex, adaptive, and nonlinear, so there is no single CSF curve that describes human vision. Instead, a family of CSF curves can be generated based on the CSF model. Even for the same CSF model, different values of the model parameters produce different plots for the CSF curve family.
3.感知非线性3. Perceptual nonlinearity
图1例示跨越多个光适应水平的示例性CSF曲线族群。仅仅为了 例示的目的,在图1中描绘的最高CSF曲线是针对照度值为每平方米 1000坎德拉(cd/m2或“尼特”)的光适应水平,其他高度降低的曲 线是针对照度值以10降低因子连续减小的光适应水平。从CSF曲线 可以读取的显著特征是,随着照度增大(光适应水平增大),包括最 大(或峰值)对比敏感度的总体对比敏感度增大。在图1中的CSF曲 线上的对比敏感度达到峰值的峰值空间频率移位到更高的空间频率。 类似地,CSF曲线上的最大可感知空间频率(截止频率)也增大,所 述最大可感知空间频率是水平(空间频率)轴对CSF曲线的截取。FIG1 illustrates a family of exemplary CSF curves spanning multiple light adaptation levels. For illustrative purposes only, the highest CSF curve depicted in FIG1 is for a light adaptation level with an illuminance value of 1000 candelas per square meter (cd/m 2 or "nits"), with the other highly decreasing curves being for light adaptation levels with illuminance values successively decreasing by a factor of 10. A notable feature that can be read from the CSF curves is that as illuminance increases (light adaptation level increases), the overall contrast sensitivity, including the maximum (or peak) contrast sensitivity, increases. The peak spatial frequency at which the contrast sensitivity on the CSF curves in FIG1 peaks is shifted to a higher spatial frequency. Similarly, the maximum perceptible spatial frequency (cutoff frequency) on the CSF curves, which is the intercept of the CSF curve by the horizontal (spatial frequency) axis, also increases.
在示例实施例中,可以用考虑了与人类感知相关的若干个关键效 应的Barten的CSF模型来导出产生如图1中所示的CSF曲线族群的 CSF函数。可以如下面的表达式(4)中所示那样计算根据Barten的CSF 模型的示例CSF S(u)(或相应的对比度阈值mi的倒数)。In an example embodiment, Barten's CSF model, which takes into account several key effects related to human perception, can be used to derive a CSF function that produces a family of CSF curves as shown in FIG1. An example CSF S(u) (or the inverse of the corresponding contrast threshold mi ) according to Barten's CSF model can be calculated as shown in Expression (4) below.
上面的表达式(4)中所使用的示例模型参数包括下面列出的表示:Example model parameters used in expression (4) above include the following listed representations:
○2(数值因子)对应于双眼视觉(如果是单眼视觉,则为4);○2 (numerical factor) corresponds to binocular vision (4 if monocular vision);
○k表示信噪比,例如,3.0;○k represents the signal-to-noise ratio, for example, 3.0;
○T表示眼睛的积分时间,例如,0.1秒;○T represents the integration time of the eye, for example, 0.1 seconds;
○X0表示对象(例如,方形形状)的角度大小;○X 0 represents the angular size of the object (e.g., a square shape);
○Xmax表示眼睛的积分区域的最大角度大小(例如,12度)○ Xmax represents the maximum angular size of the integration area of the eye (e.g., 12 degrees)
○Nmax表示通过概率求和而累积的周期的最大数量,例如,15 个周期N max represents the maximum number of cycles accumulated by probability summation, for example, 15 cycles
○η表示眼睛的量子效率,例如,.03;○η represents the quantum efficiency of the eye, for example, .03;
○p表示光子转换因子;○p represents the photon conversion factor;
○E表示例如以特罗兰得为单位的视网膜照度;○E represents the retinal illuminance in, for example, troland;
○Φ0表示神经噪声的谱密度,例如,3×10-8秒*度2;以及○Φ 0 represents the spectral density of neural noise, for example, 3× 10-8 seconds* degrees2 ; and
○μ0表示用于侧抑制的最大空间频率,例如,7个周期/度。○μ 0 represents the maximum spatial frequency for lateral suppression, for example, 7 cycles/degree.
可以如下给出光学调制传递函数Mopt:The optical modulation transfer function M opt can be given as:
其中,σ表示与瞳孔和/或光水平相关的模型参数。where σ represents a model parameter related to the pupil and/or light level.
可以使用如上面所讨论的Barten的CSF模型来描述与照度相关 的感知非线性。还可以使用其他CSF模型来描述感知非线性。例如, Barten的CSF模型没有考虑眼调节(accommodation)效应,该眼调 节效应使得CSF的高空间频率区域中的截止空间频率降低。可以将由 于眼调节而导致的该降低效应表达为减小观看距离的函数。Barten's CSF model, as discussed above, can be used to describe illuminance-related perceptual nonlinearity. Other CSF models can also be used to describe perceptual nonlinearity. For example, Barten's CSF model does not account for the effect of accommodation, which reduces the cutoff spatial frequency in the high-spatial-frequency region of the CSF. This reduction due to accommodation can be expressed as a function of decreasing viewing distance.
例如,对于1.5米以上的观看距离,可以实现如Barten的CSF 模型所描绘的最大截止空间频率,而不影响Barten的模型作为适合于 描述感知非线性的模型的有效性。然而,对于小于1.5米的距离,眼 调节效应开始变得显著,降低了Barten的模型的精度。For example, for viewing distances above 1.5 meters, the maximum cutoff spatial frequency as described by Barten's CSF model can be achieved without affecting the effectiveness of Barten's model as a model suitable for describing perceptual nonlinearities. However, for distances less than 1.5 meters, eye accommodation effects begin to become significant, reducing the accuracy of Barten's model.
因此,对于具有更近的观看距离(诸如0.5米)的平板显示器、 以及可以具有近如0.125米的观看距离的智能电话,可能不能最佳地 调整Barten的CSF模型。Therefore, Barten's CSF model may not be optimally adjusted for flat-panel displays that have closer viewing distances, such as 0.5 meters, and smartphones that may have viewing distances as close as 0.125 meters.
在一些实施例中,可以使用考虑了眼调节效应的Daly的CSF模 型。在特定实施例中,可以部分地基于上面的表达式(4)中的Barten 的CSF S(u)例如通过修正表达式(5)中的光学调制传递函数Mopt来构 造Daly的CSF模型。In some embodiments, Daly's CSF model that takes into account the effects of eye accommodation can be used. In certain embodiments, Daly's CSF model can be constructed based in part on Barten's CSF S(u) in expression (4) above, for example, by modifying the optical modulation transfer function M opt in expression (5).
4.数字代码值和灰度级4. Digital code value and grayscale
如表1中所示的GSDF通过使用数字代码值表示与人类视觉中的 对比度阈值相关联的灰度级来映射感知非线性。包括所有的被映射照 度值的灰度级可被分布为使得它们被最佳地间隔以匹配人类视觉的感 知非线性。The GSDF, as shown in Table 1, maps perceptual nonlinearity by using digital code values to represent gray levels associated with contrast thresholds in human vision. The gray levels comprising all mapped luminance values can be distributed so that they are optimally spaced to match the perceptual nonlinearity of human vision.
在一些实施例中,当GSDF中的最大数量的灰度级相对于最大照 度值范围足够大时,GSDF中的数字代码值可被如下方式使用,即在 不造成灰度级步长跃变(steptransition)可见(例如,在图像中可见 为错误轮廓或条带;或图像的黑暗区域中的色移)的情况下实现最低 数量(例如,低于总共4096个数字代码值)的灰度级。In some embodiments, when the maximum number of gray levels in the GSDF is sufficiently large relative to the maximum range of luminance values, the digital code values in the GSDF can be used in a manner that achieves a minimum number of gray levels (e.g., less than a total of 4096 digital code values) without causing gray level step transitions to be visible (e.g., visible as false contours or banding in the image; or color shifts in dark areas of the image).
在一些其他实施例中,仍然可以使用有限数量的数字代码值来表 示宽动态范围的灰度级。例如,当GSDF中的灰阶级(grayscale level) 的最大数量相对于灰阶级的最大范围不够大(例如,在灰阶级范围为 从0至12,000尼特的情况下8位表示的数字代码值)时,GSDF仍然 可被以如下方式使用,即实现最低数量(例如,低于总共256个数字 代码值)的灰度级,以降低或最小化灰度级步长跃变的可见性。就这 样的GSDF而言,可以在GSDF中的相对少的灰度级的整个层级结构 中均匀地分布步长跃变的可感知错误/伪像的数量/程度。如本文中所 使用的,术语“灰阶级”或“灰度级”可以互换地使用,并且可以是 指所表示的照度值(GSDF中所表示的量化照度值)。In some other embodiments, a limited number of digital code values can still be used to represent a wide dynamic range of grayscale levels. For example, when the maximum number of grayscale levels in a GSDF is not large enough relative to the maximum range of grayscale levels (e.g., an 8-bit representation of digital code values for a grayscale range from 0 to 12,000 nits), the GSDF can still be used in a manner that achieves a minimum number of grayscale levels (e.g., less than a total of 256 digital code values) to reduce or minimize the visibility of grayscale step transitions. With such a GSDF, the number/degree of perceptible errors/artifacts from step transitions can be evenly distributed throughout the hierarchy of the relatively few grayscale levels in the GSDF. As used herein, the terms "grayscale" or "gray level" are used interchangeably and can refer to the represented luminance value (the quantized luminance value represented in the GSDF).
可以通过对光适应水平上(不同照度值处)的对比度阈值进行堆 叠或求积分来导出GSDF中的灰度级。在一些实施例中,灰度级之间 的量化步长可被选择为使得任何两个相邻灰度级之间的量化步长落在 JND内。特定光适应水平(或照度值)处的对比度阈值可以不大于在 该特定自适应水平的最小可觉差(JND)。可以通过对对比度阈值(或 JND)的分数进行求积分或堆叠来导出灰度级。在一些实施例中,数 字代码值的数量多于足够表示所表示的照度动态范围中的所有JND 的数量。Grayscale in the GSDF can be derived by stacking or integrating contrast thresholds across light adaptation levels (at different illuminance values). In some embodiments, the quantization step size between grayscale levels can be selected so that the quantization step size between any two adjacent grayscale levels falls within a JND. The contrast threshold at a particular light adaptation level (or illuminance value) can be no greater than the just noticeable difference (JND) at that particular adaptation level. Grayscale can be derived by integrating or stacking fractions of the contrast threshold (or JND). In some embodiments, the number of digital code values is greater than the number sufficient to represent all JNDs within the represented illuminance dynamic range.
可以对于特定光适应水平(或照度值)在除固定空间频率之外的 不同空间频率处从CSF曲线选择用于计算灰阶级的对比度阈值或作 为倒数的对比敏感度。在一些实施例中,对于光适应水平在与(例如, 由于Whittle的勾边效应导致的)峰值对比敏感度相应的空间频率处 从CSF曲线选择每个对比度阈值。另外,可以对于不同光适应水平在 不同空间频率处从CSF曲线选择对比度阈值。A contrast threshold for calculating grayscale levels or contrast sensitivity as a reciprocal can be selected from the CSF curve at different spatial frequencies other than a fixed spatial frequency for a particular light adaptation level (or illuminance value). In some embodiments, each contrast threshold is selected from the CSF curve at a spatial frequency corresponding to a peak contrast sensitivity (e.g., due to Whittle's edge effect) for a light adaptation level. Furthermore, contrast thresholds can be selected from the CSF curve at different spatial frequencies for different light adaptation levels.
计算/堆叠GSDF中的灰度级的示例表达式如下:An example expression for computing/stacking gray levels in a GSDF is as follows:
JND=1/S(f,LA) 表达式(6)JND=1/S(f, LA ) Expression (6)
其中,f表示空间频率,其根据如本文中所描述的技术可以不同于固 定数;LA表示光适应水平。Lmin可以是所有被映射灰度级中的最低照 度值。如本文中所使用的,术语“尼特”或其缩写“nt”可以同义地 或可互换地涉及或者指的是等同于或等于一(1)坎德拉每平方米(1尼 特=1nt=1cd/m2)的图像强度、明亮度、亮度和/或照度的单位。在一 些实施例中,Lmin可以包括零值。在一些其他实施例中,Lmin可以包 括非零值(例如,可以低于显示装置通常能够实现的水平的某一暗黑 水平,10-5尼特、10-7尼特等)。在一些实施例中,Lmin可被用除最小 初始值之外的允许用减法或负数加法来进行堆叠计算的值(诸如中间 值或最大值)取代。Wherein, f represents the spatial frequency, which may be different from a fixed number according to the techniques described herein; LA represents the light adaptation level. Lmin may be the lowest illuminance value among all mapped grayscale levels. As used herein, the term "nit" or its abbreviation "nt" may synonymously or interchangeably relate to or refer to a unit of image intensity, brightness, luminance and/or illuminance that is equivalent to or equal to one (1) candela per square meter (1 nit = 1 nt = 1 cd/m 2 ). In some embodiments, Lmin may include a zero value. In some other embodiments, Lmin may include a non-zero value (e.g., a certain dark level that may be lower than the level typically achievable by a display device, 10 -5 nits, 10 -7 nits, etc.). In some embodiments, Lmin may be replaced by a value other than the minimum initial value that allows stacking calculations to be performed using subtraction or negative addition (such as an intermediate value or a maximum value).
在一些实施例中,例如,如表达式(6)中所示,通过求和来执行JND 的堆叠以导出GSDF中的灰度级。在一些其他实施例中,可以使用积 分来代替离散求和。积分可以沿着从CSF确定的积分路径来进行积分 (例如,表达式(4))。例如,积分路径可以包括对于CSF的(参考) 动态范围中的所有光适应水平的峰值对比敏感度(例如,与不同空间 频率相应的不同峰值敏感度)。In some embodiments, for example, as shown in Expression (6), stacking of JNDs is performed by summing to derive the grayscale levels in the GSDF. In some other embodiments, integration can be used instead of discrete summation. The integration can be performed along an integration path determined from the CSF (e.g., Expression (4)). For example, the integration path can include peak contrast sensitivities for all light adaptation levels within the (reference) dynamic range of the CSF (e.g., different peak sensitivities corresponding to different spatial frequencies).
如本文中所使用的,积分路径可以是指用于表示人类感知非线性 并且建立数字代码值集合与参考灰度级(量化照度值)集合之间的映 射的可见动态范围(VDR)曲线。可能要求该映射满足下述标准,即, 在相应的光适应水平(照度值)以上或以下,每个量化步长(例如, 表1中的两个相邻灰度级的照度差)小于JND。积分路径的在特定光 适应水平(照度值)的瞬时导数(以尼特/空间-周期为单位)与该特 定自适应水平的JND成比例。如本文中所使用的,术语“VDR”或 “视觉动态范围”可以是指比标准动态范围广的动态范围,并且可以 包括但不限于直到人类视觉眨眼间可以感知到的瞬时可感知的动态范 围和色域的宽动态范围。As used herein, an integration path may refer to a visible dynamic range (VDR) curve that represents the nonlinearity of human perception and establishes a mapping between a set of digital code values and a set of reference grayscale levels (quantized luminance values). This mapping may be required to meet the following criteria: each quantization step (e.g., the luminance difference between two adjacent grayscale levels in Table 1) is less than the JND above or below the corresponding light adaptation level (luminance value). The instantaneous derivative (in nits/spatial-period) of the integration path at a particular light adaptation level (luminance value) is proportional to the JND for that particular adaptation level. As used herein, the term "VDR" or "visual dynamic range" may refer to a dynamic range wider than the standard dynamic range and may include, but is not limited to, a wide dynamic range up to the instantaneous perceptible dynamic range and color gamut that can be perceived by human vision within a blink of an eye.
基于如本文中所描述的技术,可以开发与任何特定显示器或图像 处理装置无关的参考GSDF。在一些实施例中,可以将除光适应水平 (照度)、空间频率和角度大小之外的一个或多个模型参数设置为常 数(或固定)值。Based on the techniques described herein, a reference GSDF can be developed that is independent of any particular display or image processing device. In some embodiments, one or more model parameters other than light adaptation level (illuminance), spatial frequency, and angular size can be set to constant (or fixed) values.
5.模型参数5. Model Parameters
在一些实施例中,用覆盖宽范围的显示装置的保守模型参数值来 构造CSF模型。保守模型参数值的使用提供了比现有的标准GSDF 小的JND。因此,在一些实施例中,根据本文中所描述的技术的参考 GSDF能够支持超过这些显示装置的要求的高精度的照度值。In some embodiments, the CSF model is constructed using conservative model parameter values that cover a wide range of display devices. The use of conservative model parameter values provides a smaller JND than existing standard GSDFs. Therefore, in some embodiments, the reference GSDF according to the techniques described herein can support high-precision illuminance values that exceed the requirements of these display devices.
在一些实施例中,如本文中所描述的模型参数包括视场(FOV) 参数。可以将FOV参数设置为45度、40度、35度、30度、25度的 值、或者支持宽范围的显示装置和观看场景(包括演播室、影院或高 端娱乐系统中所使用的那些)的另一个更大的或更小的值。In some embodiments, the model parameters described herein include a field of view (FOV) parameter. The FOV parameter can be set to a value of 45 degrees, 40 degrees, 35 degrees, 30 degrees, 25 degrees, or another larger or smaller value to support a wide range of display devices and viewing scenarios, including those used in studios, theaters, or high-end entertainment systems.
如本文中所描述的模型参数可以包括可以与例如视场有关的角度 大小参数。可以将角度大小参数设置为45度×45度、40度×40度、 35度×35度、30度×30度、25度×25度的值、或者支持宽范围的显 示装置和观看场景的另一个更大的或更小的值。在一些实施例中,可 以将部分用于导出参考GSDF的角度大小参数设置为n度×m度,其 中,n和m均可以是30与40之间的数值,并且n和m可以相等或者 可以不相等。Model parameters as described herein can include an angular size parameter that can be related to, for example, the field of view. The angular size parameter can be set to a value of 45 degrees × 45 degrees, 40 degrees × 40 degrees, 35 degrees × 35 degrees, 30 degrees × 30 degrees, 25 degrees × 25 degrees, or another larger or smaller value to support a wide range of display devices and viewing scenarios. In some embodiments, a portion of the angular size parameter used to derive the reference GSDF can be set to n degrees × m degrees, where n and m can both be values between 30 and 40, and n and m can be equal or unequal.
在一些实施例中,使用更大的角度大小(例如,40度×40度)来 生成具有更多灰阶级、因此具有更大对比敏感度的参考GSDF。可以 使用GSDF来支持可能要求~30至40度的广视角的宽范围的观看和/ 或显示场景(例如,大屏幕视频显示器)。还可以使用敏感度由于选 择大的角度大小而提高的GSDF来支持高度可变的观看和/或显示场 景(例如,影院)。可以选择甚至更大的角度大小;然而,将角度大 小提高到远高于某一角度大小(例如,40度)可以生成相对有限的余 裕益处。In some embodiments, a larger angular size (e.g., 40 degrees x 40 degrees) is used to generate a reference GSDF with more grayscale levels and, therefore, greater contrast sensitivity. The GSDF can be used to support a wide range of viewing and/or display scenarios (e.g., large-screen video displays) that may require a wide viewing angle of ~30 to 40 degrees. A GSDF, whose sensitivity is enhanced by selecting a larger angular size, can also be used to support highly variable viewing and/or display scenarios (e.g., movie theaters). Even larger angular sizes can be selected; however, increasing the angular size well above a certain angular size (e.g., 40 degrees) can yield relatively limited margin benefits.
在一些实施例中,参考GSDF模型覆盖大照度范围。例如,参考 GSDF模型所表示的灰度级或量化照度值的范围为从0或大约0(例 如,10-7cd/m2)至12,000cd/m2。参考GSDF模型中所表示的照度值 的下界限可以为10-7cd/m2、或者更低的或更高的值(例如,0、10-5、10-8、10-9cd/m2等)。可以使用GSDF来支持具有不同周围光水平的 宽范围的观看和/或显示场景。可以使用GSDF来支持具有不同暗黑水 平的宽范围的显示装置(影院里、室内或户外)。In some embodiments, the reference GSDF model covers a wide range of illuminances. For example, the grayscale or quantized illuminance values represented by the reference GSDF model range from 0 or approximately 0 (e.g., 10-7 cd/ m2 ) to 12,000 cd/ m2 . The lower bound of the illuminance values represented in the reference GSDF model can be 10-7 cd/ m2 , or lower or higher values (e.g., 0, 10-5 , 10-8 , 10-9 cd/ m2 , etc.). The GSDF can be used to support a wide range of viewing and/or display scenarios with different ambient light levels. The GSDF can be used to support a wide range of display devices (in theaters, indoors, or outdoors) with different black levels.
参考GSDF模型中的所表示的照度值的上界限可以为12,000 cd/m2、或者更低的或更高的值(6000-8000、8000-10000、10000-12000、 12000-15000cd/m2等)。可以使用GSDF来支持具有高动态范围的宽 范围的观看和/或显示场景。可以使用GSDF来支持具有不同最大照度 水平的宽范围的显示装置(HDR TV、SDR显示器、膝上型计算机、 平板计算机、手持装置等)。The upper bound of the illuminance values represented in the reference GSDF model can be 12,000 cd/m 2 , or lower or higher values (6000-8000, 8000-10000, 10000-12000, 12000-15000 cd/m 2 , etc.). The GSDF can be used to support a wide range of viewing and/or display scenarios with high dynamic range. The GSDF can be used to support a wide range of display devices with different maximum illuminance levels (HDR TVs, SDR displays, laptops, tablets, handheld devices, etc.).
6.可变空间频率6. Variable spatial frequency
图2例示根据本发明的示例实施例的示例积分路径(被表示为 VDR),其可以用作如本文中所描述的用于获得参考GSDF中的灰度 级的积分路径。在实施例中,使用VDR曲线来精确地捕捉人类视觉 在高动态范围的照度值上的峰值对比敏感度。FIG2 illustrates an example integration path (denoted as VDR) according to an example embodiment of the present invention, which can be used as the integration path for obtaining gray levels in a reference GSDF as described herein. In an embodiment, a VDR curve is used to accurately capture the peak contrast sensitivity of human vision over a high dynamic range of luminance values.
如图2中所示,随着光适应水平(照度值)降低,峰值对比敏感 度不在固定空间频率值处发生,而是在更小的空间频率处发生。这意 味着,具有固定空间频率的技术(例如,DICOM)可能显著地低估人 类视觉对于暗光适应水平(低照度值)的对比敏感度。较低的对比敏 感度导致对比度阈值更高,导致量化照度值中的量化步长大小更大。As shown in Figure 2, as the light adaptation level (illuminance value) decreases, peak contrast sensitivity occurs not at a fixed spatial frequency value, but at smaller spatial frequencies. This means that technologies with a fixed spatial frequency (e.g., DICOM) can significantly underestimate the contrast sensitivity of human vision for dark light adaptation levels (low illuminance values). Lower contrast sensitivity results in a higher contrast threshold, which leads to a larger quantization step size in quantizing illuminance values.
与医学数字成像和通信(DICOM)标准不同,根据如本文中所描 述的技术的VDR曲线不将空间频率模型参数固定为一个固定值,诸 如每度4个周期。相反,VDR曲线随空间频率而变化,并且精确地捕 捉在多个光适应水平的人类视觉的峰值对比敏感度。VDR曲线适当地 考虑了由于人类视觉对于宽范围的光适应水平的自适应性而导致的勾 边效应,并且帮助产生高精度的参考GSDF。这里,术语“高精度” 意味着,基于在固定大小代码空间(例如,10位、12位等之一)的约 束内最好地并且最高效率地捕捉人类视觉非线性的参考GSDF,去除 了或大幅减小了由于照度值的量化而导致的感知错误。Unlike the Digital Imaging and Communications in Medicine (DICOM) standard, the VDR curve according to the techniques described herein does not fix the spatial frequency model parameters to a fixed value, such as 4 cycles per degree. Instead, the VDR curve varies with spatial frequency and accurately captures the peak contrast sensitivity of human vision at multiple light adaptation levels. The VDR curve appropriately accounts for edge effects caused by the adaptability of human vision to a wide range of light adaptation levels and helps generate a high-precision reference GSDF. Here, the term "high-precision" means that perceptual errors caused by quantization of illuminance values are eliminated or significantly reduced, based on a reference GSDF that best and most efficiently captures the nonlinearities of human vision within the constraints of a fixed-size code space (e.g., 10 bits, 12 bits, etc.).
可以使用计算参考GSDF中的灰度级的计算处理(例如,表1)。 在示例实施例中,该计算处理是迭代的或递归的,重复地从VDR曲 线确定对比度阈值(或调制阈值,例如,表达式4中的ml),并且应 用对比度阈值来获得参考GSDF中的连续灰度级。该计算处理可以用下面的表达式(7)来实现:A computational process for computing gray levels in a reference GSDF (e.g., Table 1) may be used. In an exemplary embodiment, the computational process is iterative or recursive, repeatedly determining a contrast threshold (or modulation threshold, e.g., ml in Expression 4) from a VDR curve and applying the contrast threshold to obtain continuous gray levels in the reference GSDF. The computational process may be implemented using the following Expression (7):
所以:并且So: and
表达式(7)Expression (7)
其中,j-1、j和j+1表示三个相邻的数字代码值的索引;Lj-1、Lj和Lj+1表示数字代码值j-1、j和j+1分别映射到的灰度级。Lmax和Lmin分别 表示JND或JND的一小部分上的最大照度值和最小照度值。使用JND 或其一小部分保持参考GSDF的高精度。Where j-1, j, and j+1 represent the indices of three adjacent digital code values; Lj -1 , Lj , and Lj +1 represent the grayscale levels to which digital code values j-1, j, and j+1 are mapped, respectively. Lmax and Lmin represent the maximum and minimum illuminance values, respectively, over a JND or a fraction of the JND. Using the JND or a fraction of the JND maintains high accuracy of the reference GSDF.
可以将与JND相关联的对比度阈值ml定义为相对量,例如,除 以Lmax或Lmin的特定照度值的Lmax与Lmin之间的差值、或者Lmax与 Lmin之间的值(例如,Lmax和Lmin的平均值)。在一些实施例中,可 替代地,可以将ml定义为除以Lmax或Lmin的特定照度值的倍数(例 如,2倍)的Lmax与Lmin之间的差值、或者Lmax与Lmin之间的值。在 将GSDF中的照度值量化为多个灰度级时,Lmax和Lmin可以是指这些 多个灰度级中的相邻灰度级。结果,如表达式(7)中所示,Lj可以通过 ml分别与Lj-1和Lj+1相关。The contrast threshold ml associated with the JND can be defined as a relative quantity, for example, the difference between L max and L min divided by a specific illuminance value of L max or L min , or a value between L max and L min (e.g., the average of L max and L min ). In some embodiments, ml can alternatively be defined as the difference between L max and L min divided by a multiple (e.g., 2) of the specific illuminance value of L max or L min , or a value between L max and L min . When the illuminance values in the GSDF are quantized into multiple gray levels, L max and L min can refer to adjacent gray levels among these multiple gray levels. As a result, as shown in Expression (7), L j can be related to L j-1 and L j+1 , respectively, by ml.
在替代实施例中,不是使用如表达式(7)中所示的线性表达式,而 是可以使用非线性表达式来将JND或对比度阈值与灰度级相关。例如, 可以使用基于被均值除的标准差的替代表达式来代替如所示的对于对 比度阈值的简单比率。In an alternative embodiment, rather than using a linear expression as shown in Expression (7), a nonlinear expression may be used to relate the JND or contrast threshold to the gray level. For example, instead of using a simple ratio for the contrast threshold as shown, an alternative expression based on the standard deviation divided by the mean may be used.
在一些实施例中,在数字代码值被表示为12位整数值的情况下, 参考GSDF覆盖0至12,000cd/m2的范围。为了进一步改进参考GSDF 的精度,可以将ml与分数值f相乘。此外,可以将中心数字值L2048 (指出,如与SDI兼容的12位代码空间中那样,数字代码值至少限于0和4096)映射到100cd/m2。表达式(7)可以得到下面的表达式(8):In some embodiments, the reference GSDF covers a range of 0 to 12,000 cd/m 2 , where the digital code values are represented as 12-bit integer values. To further improve the accuracy of the reference GSDF, ml can be multiplied by a fractional value f. In addition, the center digital value L2048 (note that, as in the 12-bit code space compatible with SDI, the digital code values are limited to at least 0 and 4096) can be mapped to 100 cd/m 2. Expression (7) can be derived from the following expression (8):
并且and
表达式(8)Expression (8)
其中,分数值f被设置为0.918177。在示例实施例中,数字代码的最 小容许值被设置为码字(或整数值)16,其被设置为0(cd/m2)。第 二最低数字代码值17以5.27×10-7cd/m2结束,而数字代码值4076 以12,000cd/m2结束。Here, the fractional value f is set to 0.918177. In the exemplary embodiment, the minimum allowable value of the digital code is set to codeword (or integer value) 16, which is set to 0 (cd/m 2 ). The second lowest digital code value 17 ends at 5.27×10-7 cd/m 2 , while the digital code value 4076 ends at 12,000 cd/m 2 .
图3例示根据本发明的示例实施例的在12位代码空间中在多个灰 度级(以对数照度值表示)与多个数字代码值之间映射的示例GSDF。3 illustrates an example GSDF mapping between multiple gray levels (expressed as logarithmic luminance values) and multiple digital code values in a 12-bit code space according to an example embodiment of the present invention.
图4例示描绘基于图3的示例GSDF的灰度级的Weber分数 (Delta L/L或ΔL)的曲线。如图4所示的人类视觉的感知非线性被 表示为对数照度轴上的照度值的函数。人类视觉的可比较的视觉差(例 如,JND)对应于较低照度值处的更大的Delta L/L。Weber分数的曲 线对于高照度值渐进为常数值(例如,在更高照度值处满足Weber法 则的情况下,Weber分数为0.002)。FIG4 illustrates a graph depicting the Weber score (Delta L/L or ΔL) for grayscale levels based on the example GSDF of FIG3 . The perceptual nonlinearity of human vision, as shown in FIG4 , is represented as a function of illuminance value on a logarithmic illuminance axis. Comparable visual differences (e.g., JNDs) in human vision correspond to larger Delta L/L at lower illuminance values. The Weber score curve asymptotically becomes constant at high illuminance values (e.g., when the Weber law is satisfied at higher illuminance values, the Weber score is 0.002).
7.函数模型7. Function Model
可以使用一个或多个分析函数来获得如本文中所描述的GSDF (参考GSDF或装置特定GSDF)中的数字代码值与灰度级之间的映 射。所述一个或多个分析函数可以是专有的、基于标准的、或者从基 于标准的函数的扩展。在一些实施例中,GSDF产生器(例如,图5的504)可以基于一个或多个分析函数(或公式)来产生一个或多个 正向查找表(LUT)和/或一个或多个反向LUT的形式的GSDF。可 以将这些LUT中的至少一些提供给各种图像数据编解码器(例如,图 5的506)或多种显示装置,以用于为了对参考图像数据进行编码的目 的而在参考灰度级与参考数字代码级之间进行转换。附加地,可选地, 或可替代地,可以直接将分析函数(它们的系数用整数或浮点数表示) 中的至少一些提供给图像数据编解码器或多种显示装置,以用于获得 如本文中所描述的GSDF中的数字代码值与灰度级之间的映射和/或 用于为了对图像数据进行编码的目的而在灰度级与数字代码级之间进 行转换。One or more analysis functions can be used to obtain the mapping between digital code values and grayscale levels in a GSDF (a reference GSDF or a device-specific GSDF) as described herein. The one or more analysis functions can be proprietary, standard-based, or extensions of standard-based functions. In some embodiments, a GSDF generator (e.g., 504 of FIG. 5 ) can generate a GSDF in the form of one or more forward lookup tables (LUTs) and/or one or more inverse LUTs based on the one or more analysis functions (or formulas). At least some of these LUTs can be provided to various image data codecs (e.g., 506 of FIG. 5 ) or various display devices for converting between reference grayscale levels and reference digital code levels for the purpose of encoding reference image data. Additionally, optionally, or alternatively, at least some of the analysis functions (whose coefficients are represented as integers or floating point numbers) can be provided directly to an image data codec or various display devices for use in obtaining a mapping between digital code values and grayscale levels in a GSDF as described herein and/or for converting between grayscale levels and digital code levels for the purpose of encoding image data.
在一些实施例中,如本文中所描述的分析函数包括如下的可以用 于基于相应的灰度级预测数字代码值的正向函数:In some embodiments, the analysis functions as described herein include the following forward function that can be used to predict digital code values based on corresponding gray levels:
其中,D表示数字代码的(例如,12位)值,L表示以尼特为单位的 照度值或灰度级,n可以表示如表达式(9)所给出的logD/logL曲线的 中间部分中的斜率,m可以表示logD/logL曲线的拐点的锐度,c1、 c2和c3可以定义logD/logL曲线的端点和中间点。Wherein, D represents the (e.g., 12-bit) value of the digital code, L represents the luminance value or grayscale in nits, n can represent the slope in the middle part of the logD/logL curve as given by Expression (9), m can represent the sharpness of the inflection point of the logD/logL curve, and c1, c2, and c3 can define the endpoints and the middle point of the logD/logL curve.
在一些实施例中,分析函数包括如下的逆函数,其与表达式(9)中 的正向函数相对应,并且可用于基于相应的数字代码值来预测照度值:In some embodiments, the analysis function comprises the following inverse function, which corresponds to the forward function in expression (9) and can be used to predict the illuminance value based on the corresponding digital code value:
可以将使用表达式(9)基于多个照度值预测的数字代码值与所观 察的数字代码值进行比较。所观察的数字代码值可以是、但不仅仅限 于下列中的任何一个:如前面所讨论的基于CSF模型的数值计算。在 实施例中,可以计算并且最小化所预测的数字代码值与所观察的数字 代码值之间的偏差,以导出表达式(9)中的参数n、m、c1、c2和c3的 最佳值。The digital code values predicted based on the plurality of illumination values using expression (9) can be compared with the observed digital code values. The observed digital code values can be, but are not limited to, any of the following: numerical calculations based on the CSF model as discussed above. In an embodiment, the deviation between the predicted digital code values and the observed digital code values can be calculated and minimized to derive optimal values for the parameters n, m, c 1 , c 2 , and c 3 in expression (9).
同样地,可以将使用表达式(10)基于多个数字代码值预测的照度 值与所观察的照度值进行比较。所观察的照度值可以(但不限于)如 前面所讨论的通过使用基于CSF模型的数值计算而产生,或者通过使 用人类视觉实验数据而产生。在实施例中,可以将所预测的照度值与 所观察的照度值之间的偏差推导为参数n、m、c1、c2和c3的函数,并 且最小化该偏差以导出表达式(10)中的参数n、m、c1、c2和c3的最佳 值。Similarly, the illuminance values predicted based on the plurality of digital code values using expression (10) can be compared with the observed illuminance values. The observed illuminance values can be generated, but are not limited to, by numerical calculations based on the CSF model as discussed above, or by using human vision experimental data. In an embodiment, the deviation between the predicted illuminance values and the observed illuminance values can be derived as a function of the parameters n, m, c 1 , c 2 , and c 3 , and the deviation can be minimized to derive the optimal values of the parameters n, m, c 1 , c 2 , and c 3 in expression (10).
用表达式(9)确定的参数n、m、c1、c2和c3的最佳值集合可以与 用表达式(10)确定的参数n、m、c1、c2和c3的最佳值集合相同,或者 可以不相同。在这两个集合之间不同的情况下,可以使用这两个集合 中的一个或两者来产生数字代码值与照度值之间的映射。在一些实施 例中,参数n、m、c1、c2和c3的这两个最佳值集合在不同的情况下可 以例如基于往返错误的最小化来被协调,所述往返错误是由于用表达 式(9)和(10)两者执行正向译码(coding)操作和逆向译码操作两者而 引入的。在一些实施例中,可以进行多个往返以研究导致的数字代码 值和/或照度值或灰度级中的错误。在一些实施例中,表达式(9)和(10)中的参数的选择可以至少部分基于在一次、两次或更多次往返中不发 生显著错误的准则。无显著往返错误的例子可以包括,但不仅仅限于 下列错误中的任何一个:小于0.0001%、0.001%、0.01%、1%、2% 或其他可配置值的错误。The set of optimal values for the parameters n, m, c 1 , c 2 and c 3 determined by expression (9) may be the same as or different from the set of optimal values for the parameters n, m, c 1 , c 2 and c 3 determined by expression (10). In the event that the two sets differ, one or both of the two sets may be used to generate a mapping between digital code values and luminance values. In some embodiments, the two optimal value sets for the parameters n, m, c 1 , c 2 and c 3 may be coordinated under different circumstances, for example, based on minimization of round-trip errors introduced by performing both forward decoding and reverse decoding operations using expressions (9) and (10). In some embodiments, multiple round trips may be performed to investigate the resulting errors in digital code values and/or luminance values or grayscale levels. In some embodiments, the selection of the parameters in expressions (9) and (10) may be based at least in part on the criterion that no significant errors occur in one, two or more round trips. Examples of no significant round trip error may include, but are not limited to, any of the following errors: errors of less than 0.0001%, 0.001%, 0.01%, 1%, 2%, or other configurable values.
实施例包括使用一个或多个不同位长度的代码空间来表示数字控 制值。可以对于多个代码空间中的每个获得表达式(9)和(10)中的参数 的优化值,所述多个代码空间中的每个代码空间具有一个或多个不同 位长度中的不同的一个位长度。基于表达式(9)和(10)的优化值,可以 确定代码错误的分布(例如,基于表达式(9)和(10)的数字代码值中的正向变换错误、逆向变换错误、或往返错误)。在一些实施例中,两 个数字代码值中的数值差一(1)对应于这两个数字代码值所表示的两 个照度值之间的在某一光水平的对比度阈值(或对应于JND)。图10A 例示根据一些示例实施例的多个代码空间中的以JND为单位的代码 错误的最大量,所述多个代码空间中的每个代码空间具有一个或多个 不同精度(具有不同位长度)中的不同的一个精度。例如,基于如本 文中所描述的函数模型,具有无限或无限制的位长度的代码空间的最 大代码错误为11.252。与之相比,基于如本文中所描述的函数模型, 12位长度(或4096)的代码空间的最大代码错误为11.298。这表明, 就表达式(9)和(10)所表示的函数模型而言,用于数字代码值的12位长 度的代码空间是优良的选择。Embodiments include using one or more code spaces of different bit lengths to represent digital control values. Optimized values for the parameters in expressions (9) and (10) can be obtained for each of a plurality of code spaces, each of the plurality of code spaces having a different one of the one or more different bit lengths. Based on the optimized values of expressions (9) and (10), a distribution of code errors (e.g., forward transform errors, inverse transform errors, or round-trip errors in digital code values based on expressions (9) and (10)) can be determined. In some embodiments, a numerical difference of one (1) between two digital code values corresponds to a contrast threshold (or to a JND) at a certain light level between two illuminance values represented by the two digital code values. FIG10A illustrates a maximum amount of code errors in JNDs for a plurality of code spaces, each of the plurality of code spaces having a different one of one or more different precisions (having different bit lengths), according to some example embodiments. For example, based on the functional model as described herein, the maximum code error for a code space having an infinite or unrestricted bit length is 11.252. In contrast, based on the functional model described herein, the maximum code error for a code space of 12 bits (or 4096) is 11.298. This indicates that a code space of 12 bits is an excellent choice for digital code values for the functional model represented by expressions (9) and (10).
图10B例示根据示例实施例的在表达式(9)所指定的正向变换(从 照度值到数字代码值)的情况下12位长(或4096)的代码空间的代 码错误的分布。图10C例示根据示例实施例的在表达式(10)所指定的 反向变换(从数字代码值到照度值)的情况下12位长度(或4096) 的代码空间的代码错误的分布。图10B和图10C都指示最大代码错误 小于12.5。FIG10B illustrates the distribution of code errors for a 12-bit (or 4096) long code space under the forward transformation (from luminance values to digital code values) specified by Expression (9) according to an exemplary embodiment. FIG10C illustrates the distribution of code errors for a 12-bit (or 4096) long code space under the reverse transformation (from digital code values to luminance values) specified by Expression (10) according to an exemplary embodiment. Both FIG10B and FIG10C indicate that the maximum code error is less than 12.5.
图11例示根据示例实施例的表达式(9)和(10)中可以使用的参数 的值。在一些实施例中,如所示,在如本文中所描述的函数模型的特 定实现中,可以使用基于整数的公式来表示/逼近这些非整数值。在一 些其他的实施例中,在如本文中所描述的函数模型的特定实现中,可 以使用具有一个或多个精度(例如,14位、16位或32位)之一的定 点值、浮点值来表示的这些非整数值。FIG11 illustrates values of parameters that can be used in expressions (9) and (10) according to example embodiments. In some embodiments, as shown, in a particular implementation of the function model as described herein, these non-integer values can be represented/approximated using integer-based formulas. In some other embodiments, in a particular implementation of the function model as described herein, these non-integer values can be represented using fixed-point values, floating-point values, or one of one or more precisions (e.g., 14 bits, 16 bits, or 32 bits).
实施例包括使用具有除了表达式(9)和(10)中所给出的那些公式 (其可以是色调映射曲线)之外的公式的函数模型。例如,如本文中 所描述的函数模型可以使用具有如下的Naka-Rushton公式的圆锥体 模型:Embodiments include using a function model having a formula other than those given in expressions (9) and (10) (which may be a tone mapping curve). For example, the function model described herein may use a cone model having the following Naka-Rushton formula:
其中,L表示照度值,n、m和σ表示与圆锥体模型相关联的模型参数, Ld表示可以用数字代码值编码的预测值。可以使用通过最小化偏差来 获得模型参数的类似方法来导出用于表达式(11)的模型参数的最佳值。 图10D例示根据示例实施例的在表达式(11)所指定的正向变换(从照 度值到数字代码值)的情况下对于12位长度(或4096)的代码空间 的代码错误的分布。在实施例中,如图10D中所示的最大代码错误为 25个JND。Wherein, L represents the illuminance value, n, m and σ represent the model parameters associated with the cone model, and Ld represents the predicted value that can be encoded with a digital code value. A similar method for obtaining the model parameters by minimizing the deviation can be used to derive the optimal values of the model parameters for expression (11). Figure 10D illustrates the distribution of code errors for a code space of 12 bits in length (or 4096) in the case of the forward transformation (from illuminance values to digital code values) specified by expression (11) according to an example embodiment. In an embodiment, the maximum code error as shown in Figure 10D is 25 JNDs.
在另一个例子中,可以产生具有如下的Raised mu公式的函数模 型:In another example, a functional model with the following Raised mu formula can be generated:
其中,x表示照度值,y表示预测数字代码值。可以通过最小化偏差来 获得模型参数μ的最佳值。图10E例示根据示例实施例的在表达式(12) 所指定的正向变换(从照度值到数字代码值)的情况下对于12位长度 (或4096)的代码空间的代码错误的分布。在实施例中,如图10D中 所示的最大代码错误为17个JND。Where x represents the illuminance value and y represents the predicted digital code value. The optimal value of the model parameter μ can be obtained by minimizing the deviation. FIG10E illustrates the distribution of code errors for a code space of 12 bits (or 4096) in the case of the forward transformation (from illuminance values to digital code values) specified by expression (12) according to an example embodiment. In an embodiment, the maximum code error as shown in FIG10D is 17 JNDs.
如本文中所示,在一些实施例中,可以使用函数模型来从照度值 预测代码值或者从代码值预测照度值。函数模型所使用的公式可以是 可逆的。可以实现执行这些值之间的正向和逆向变换的相同的或类似 的处理逻辑。在一些实施例中,包括但不仅限于任何指数的模型参数 可以用基于定点值或整数的公式来表示。因此,可以仅用硬件、仅用 软件或硬件和软件的组合来高效率地实现所述处理逻辑的至少一部分。 类似地,可以仅用硬件、仅用软件或硬件和软件的组合(包括ASIC 或FPGA)来高效率地实现用函数模型或模型公式(诸如表达式(9)至 表达式(12))产生的LUT的至少一部分。在一些实施例中,一个、两 个或更多个函数模型可以在单个计算装置、多个计算装置的构造、服 务器等中实现。在一些实施例中,在照度值的可见动态范围的整个范 围上,预测代码值中的错误可以在目标或观察值的14个代码值内。在 一些实施例中,这适用于正向和逆向变换两者。在正向变换和逆向变 换中可以使用相同的或不同的模型参数集合。可以用模型参数的最佳 值来最大化往返精度。可以使用不同的代码空间。在特定实施例中, 可以使用12位长度(4096)的代码空间来在可见动态范围的整个范围 上以最小代码错误托管数字代码值。As shown herein, in some embodiments, a function model can be used to predict code values from illuminance values or to predict illuminance values from code values. The formula used by the function model can be reversible. The same or similar processing logic can be implemented to perform forward and inverse transformations between these values. In some embodiments, model parameters, including but not limited to any exponentials, can be represented by formulas based on fixed-point values or integers. Therefore, at least a portion of the processing logic can be efficiently implemented using only hardware, only software, or a combination of hardware and software. Similarly, at least a portion of the LUT generated using the function model or model formula (such as Expressions (9) to (12)) can be efficiently implemented using only hardware, only software, or a combination of hardware and software (including ASICs or FPGAs). In some embodiments, one, two, or more function models can be implemented in a single computing device, a configuration of multiple computing devices, a server, or the like. In some embodiments, the error in the predicted code value can be within 14 code values of the target or observed value over the entire visible dynamic range of the illuminance value. In some embodiments, this applies to both forward and inverse transformations. The same or different sets of model parameters can be used in the forward and inverse transforms. Optimal values of the model parameters can be used to maximize round-trip accuracy. Different code spaces can be used. In a particular embodiment, a 12-bit (4096) code space can be used to manage digital code values with minimal code error across the entire visible dynamic range.
如本文中所使用的,参考GSDF可以是指包括参考数字代码值和 参考灰度级的GSDF,这些参考数字代码值和参考灰度级根据函数模 型(其模型参数可以用CSF模型下的目标值或观察值来确定)而相关, 通过基于CSF模型的数值计算(例如,不确定数字代码值与照度值之 间的映射的任何函数表示)而确定,或者用来自人类视觉研究的数据 而确定。在一些实施例中,装置GSDF还可以包括可以在分析上用如 本文中所描述的函数模型表示的数字代码值与灰度级之间的映射。As used herein, a reference GSDF may refer to a GSDF that includes reference digital code values and reference grayscale levels, which are related according to a functional model (whose model parameters can be determined using target values or observed values under the CSF model), determined through numerical calculations based on the CSF model (e.g., any functional representation of the mapping between uncertain digital code values and illuminance values), or determined using data from human vision research. In some embodiments, the device GSDF may also include a mapping between digital code values and grayscale levels that can be analytically represented using a functional model as described herein.
8.基于参考GSDF交换图像数据8. Exchange image data based on reference GSDF
为了例示的目的,描述了数字代码值驻存在12位代码空间中。然 而,本发明不限于此。在参考GSDF中可以使用具有不同代码空间(例 如,除了12位之外的不同位深)的数字代码值。例如,可以使用10 位整数值来表示数字代码。不是在数字代码的12位表示中将数字代码 值4076映射到照度值12000cd/m2,而是可以在数字代码的10位表示 中将数字代码值1019映射到照度值12000cd/m2。因此,对于参考 GSDF中的数字代码值,可以使用代码空间(位深)中的这些变化和 其他变化。For purposes of illustration, the digital code values are described as residing in a 12-bit code space. However, the present invention is not limited thereto. Digital code values having different code spaces (e.g., different bit depths other than 12 bits) can be used in the reference GSDF. For example, a 10-bit integer value can be used to represent the digital code. Instead of mapping the digital code value 4076 to an illuminance value of 12000 cd/m 2 in the 12-bit representation of the digital code, the digital code value 1019 can be mapped to an illuminance value of 12000 cd/m 2 in the 10-bit representation of the digital code. Thus, these and other variations in code space (bit depth) can be used for the digital code values in the reference GSDF.
可以使用参考GSDF来在不同GSDF之间交换图像数据,该不同 GSDF可分别针对每种类型的图像获取装置或图像呈现装置进行设计。 例如,用特定类型的图像获取装置或图像呈现装置实现的GSDF可以 隐含地或明确地取决于如下这样的模型参数,该模型参数与另一种类 型的图像获取装置或图像呈现装置的装置特定GSDF或标准GSDF的 模型参数不匹配。A reference GSDF can be used to exchange image data between different GSDFs, each designed for a specific type of image acquisition device or image rendering device. For example, a GSDF implemented for a particular type of image acquisition device or image rendering device may implicitly or explicitly depend on model parameters that do not match the model parameters of a device-specific GSDF or a standard GSDF for another type of image acquisition device or image rendering device.
参考GSDF可以对应于如图3和图4中所描绘的曲线形状。一般 来讲,GSDF的形状取决于用于导出或设计GSDF的参数。因此,参 考GSDF取决于参考CSF模型、以及用于从参考CSF模型产生参考 GSDF的参考模型参数。装置特定GSDF的曲线形状取决于特定装置, (如果所述特定装置是显示器,则包括显示参数和观看条件)。The reference GSDF can correspond to the curve shape depicted in Figures 3 and 4. Generally speaking, the shape of a GSDF depends on the parameters used to derive or design the GSDF. Therefore, the reference GSDF depends on the reference CSF model and the reference model parameters used to generate the reference GSDF from the reference CSF model. The curve shape of a device-specific GSDF depends on the specific device (including display parameters and viewing conditions if the specific device is a display).
在例子中,如图3中所示,其所支持的照度值范围限于小于500 cd/m2的显示器在高照度值区域处可能体验不到斜率增大(这在人类 视觉转变到对数行为时对于所有频率发生)。驱动具有图3的曲线形 状的显示器可以导致灰度级的非最佳(例如,次优)分配,其中太多 灰度级分配在明亮区域中,而在黑暗区域中则分配不足。In an example, as shown in FIG3, a display whose supported luminance value range is limited to less than 500 cd/ m2 may not experience an increase in slope at high luminance value regions (which occurs for all frequencies when human vision transitions to logarithmic behavior). Driving a display having the curve shape of FIG3 can result in a non-optimal (e.g., suboptimal) distribution of gray levels, where too much gray level is distributed in bright areas and not enough in dark areas.
在另一个例子中,低对比度显示器被设计为在户外在各种日光条 件下使用。该显示器的照度范围可以大部分地或几乎全部出现在图3 的对数行为区域中。驱动具有图3的曲线形状的该低对比度显示器也 可能导致灰度级的非最佳(例如,次优)分配,其中太多灰度级分配 在黑暗区域中,而在明亮区域中则分配不足。In another example, a low-contrast display is designed for outdoor use under various daylight conditions. The illuminance range of the display may fall largely or almost entirely within the logarithmic behavior region of FIG. 3 . Driving the low-contrast display with the curve shape of FIG. 3 may also result in a non-optimal (e.g., suboptimal) distribution of gray levels, where too much gray level is distributed in dark areas and too little in bright areas.
根据如本文中所描述的技术,每个显示器可以使用其特定GSDF (不仅取决于显示器参数,而且还取决于例如影响实际黑色级的观看 条件)来最佳地支持用参考GSDF编码的图像数据中的感知信息。用 于图像数据的总体编码的一个或多个上游(例如,编码)装置使用参 考GSDF来尽可能多地保留感知细节。然后将在参考GSDF中编码的 图像数据递送给一个或多个下游(例如,解码)装置。在示例实施例 中,基于参考GSDF的图像数据的编码与随后将对该图像数据进行解 码和/或呈现的特定装置无关。According to the techniques described herein, each display can use its specific GSDF (which depends not only on display parameters but also on viewing conditions that affect, for example, actual black levels) to optimally support the perceptual information in image data encoded with a reference GSDF. One or more upstream (e.g., encoding) devices used for the overall encoding of the image data use the reference GSDF to preserve as much perceptual detail as possible. The image data encoded in the reference GSDF is then delivered to one or more downstream (e.g., decoding) devices. In example embodiments, the encoding of the image data based on the reference GSDF is independent of the specific device that will subsequently decode and/or present the image data.
每个装置(例如,显示器)具有其特定GSDF,其中,装置特定 灰度级被支持/优化。特定灰度级对于显示器的制造商可以是已知的, 或者可以被制造商专门设计为支持装置特定GSDF(其可以基于标准 或者可以不基于标准)。装置的线路驱动器可以用特定于该装置的量 化照度值来实现。对于该装置,可以基于特定于该装置的量化照度值 来最好地进行优化。另外,可以至少部分基于目前的周围光水平和/ 或装置的光学反射率(其对于制造商可以是已知的)来设置可以用作 装置特定灰度级范围的下界限的暗黑级(例如,最低装置特定灰度级)。 一旦暗黑级被如此设置,就可以通过隐含地或明确地累积(例如,堆 叠/求取积分)装置的线路驱动器中的量化步长来获得或设置装置特定 灰度级。当装置同时呈现图像时,灰度级的推导和/或调整可在运行时 进行或者可以不在运行时进行。Each device (e.g., a display) has a specific GSDF, in which device-specific grayscale levels are supported/optimized. The specific grayscale levels may be known to the display manufacturer or may be specifically designed by the manufacturer to support the device-specific GSDF (which may or may not be based on a standard). The device's line driver may be implemented using quantized luminance values specific to the device. For each device, optimization may be best performed based on the device-specific quantized luminance values. Furthermore, a dark black level (e.g., a minimum device-specific grayscale level) that serves as the lower limit of the device-specific grayscale range may be set based at least in part on the current ambient light level and/or the device's optical reflectivity (which may be known to the manufacturer). Once the dark black level is set, the device-specific grayscale level may be obtained or set by implicitly or explicitly accumulating (e.g., stacking/integrating) the quantization step size in the device's line driver. Grayscale derivation and/or adjustment may or may not be performed at runtime when the device is simultaneously presenting images.
因此,根据如本文中所描述的技术,本发明的实施例可以包括, 但不仅仅限于,用参考GSDF对图像数据进行编码,并且用装置特定 GSDF来解码和呈现该图像数据。Thus, according to the techniques as described herein, embodiments of the present invention may include, but are not limited to, encoding image data using a reference GSDF and decoding and rendering the image data using a device-specific GSDF.
可以使用如本文中所描述的技术来在具有不同GSDF的各种装置 之间交换图像数据。图5例示根据本发明的示例实施例的与不同GSDF 的装置交换图像数据的示例框架(500)。如图5中所示,可以使用自 适应CSF模型(502)来产生参考GSDF(504)。术语“自适应”可 以是指CSF模型对于人类视觉非线性和行为的自适应性。可以至少部 分基于多个CSF参数(或模型参数)来构建自适应CSF模型。所述 多个模型参数包括例如光适应水平、以度宽度计的显示区域、噪声水 平、眼调节(物理观看距离)、照度或颜色调制矢量(其可以例如与自适应CSF模型(502)中所使用的测试图像或图像模式相关)。[0014] Techniques as described herein can be used to exchange image data between various devices having different GSDFs. FIG5 illustrates an example framework (500) for exchanging image data with devices having different GSDFs according to an example embodiment of the present invention. As shown in FIG5, an adaptive CSF model (502) can be used to generate a reference GSDF (504). The term "adaptive" can refer to the adaptability of the CSF model to the nonlinearities and behavior of human vision. The adaptive CSF model can be constructed based at least in part on a plurality of CSF parameters (or model parameters). The plurality of model parameters include, for example, light adaptation level, display area in degrees of width, noise level, eye accommodation (physical viewing distance), illuminance, or color modulation vector (which can, for example, be related to a test image or image pattern used in the adaptive CSF model (502)).
上游(例如,编码)装置可以在将用参考GSDF(504)编码的图 像数据或其派生物被发送或分发到下游(例如,解码)装置之前接收 该将用参考GSDF(504)编码的图像数据。将被编码的图像数据可以 一开始为多种格式(基于标准的、专有的、其扩展等)中的任何一种格式,和/或可以从多个图像源(照相机、图像服务器、有形介质等) 中的任何一个导出。将被编码的图像数据的例子包括但不仅仅限于原 始的或其他的高位深图像(530)。原始的或其他的高位深图像可以来 自照相机、演播室系统、艺术总监系统、另一上游图像处理系统、图 像服务器、内容数据库等。图像数据可以包括但不仅仅限于,数字照 片、视频图像帧、3D图像、非3D图像、计算机产生的图形等的图像 数据。图像数据可以包括场景相关图像、装置相关图像、或具有各种 动态范围的图像。将被编码的图像数据的例子可以包括原始图像的高 质量版本,该高质量版本将与元数据一起被编辑、下采样和/或压缩为 用于分发给图像接收系统(下游图像处理系统,诸如各种制造商的显 示器)的译码的位流。原始的或其他的高位深图像可以是专家、艺术 工作室、广播公司、高端媒体制作实体等所使用的高采样率。将被编 码的图像数据还可以整个地或部分地由计算机产生,或者甚至可以整 个地或部分地从现有的图像源(诸如老电影和纪录片)获得。An upstream (e.g., encoding) device may receive image data to be encoded using a reference GSDF (504) before the image data or its derivatives are sent or distributed to a downstream (e.g., decoding) device. The image data to be encoded may initially be in any of a variety of formats (standard-based, proprietary, extensions thereof, etc.) and/or may be derived from any of a variety of image sources (cameras, image servers, tangible media, etc.). Examples of image data to be encoded include, but are not limited to, original or other high-bit-depth images (530). Original or other high-bit-depth images may come from a camera, a studio system, an art director's system, another upstream image processing system, an image server, a content database, etc. The image data may include, but are not limited to, image data of a digital photograph, a video image frame, a 3D image, a non-3D image, a computer-generated graphic, etc. The image data may include scene-dependent images, device-dependent images, or images having various dynamic ranges. Examples of image data to be encoded may include a high-quality version of the original image that is edited, downsampled, and/or compressed along with metadata into a decoded bitstream for distribution to image receiving systems (downstream image processing systems, such as displays from various manufacturers). The original or other high-bit-depth images may be of the high sampling rates used by professionals, art studios, broadcasters, high-end media production entities, and the like. The image data to be encoded may also be generated in whole or in part by a computer, or even obtained in whole or in part from existing image sources, such as old films and documentaries.
如本文中所使用的,词组“将被编码的图像数据”可以是指一个 或多个图像的图像数据;将被编码的图像数据可以包括浮点图像数据 或定点图像数据,并且可以在任何颜色空间中。在示例实施例中,所 述一个或多个图像可以在RGB颜色空间中。在另一示例实施例中, 所述一个或多个图像可以在YUV颜色空间中。在例子中,如本文中 所描述的图像中的每个像素包括颜色空间中所定义的所有通道(例如, RGB颜色空间中的红色、绿色和蓝色颜色通道)的浮点像素值。在另 一个例子中,如本文中所描述的图像中的每个像素包括颜色空间中所 定义的所有通道的定点像素值(例如,用于RGB颜色空间中的红色、 绿色和蓝色颜色通道的16位或更高/更低位数的固定点像素值)。每 个像素可以可选地和/或可替代地包括对于颜色空间中的通道中的一 个或多个的下采样的像素值。As used herein, the phrase "image data to be encoded" may refer to image data of one or more images; the image data to be encoded may include floating-point image data or fixed-point image data and may be in any color space. In an example embodiment, the one or more images may be in an RGB color space. In another example embodiment, the one or more images may be in a YUV color space. In an example, each pixel in an image as described herein includes floating-point pixel values for all channels defined in the color space (e.g., red, green, and blue color channels in an RGB color space). In another example, each pixel in an image as described herein includes fixed-point pixel values for all channels defined in the color space (e.g., 16-bit or higher/lower bit fixed-point pixel values for the red, green, and blue color channels in an RGB color space). Each pixel may optionally and/or alternatively include downsampled pixel values for one or more of the channels in the color space.
在一些实施例中,响应于接收到将被编码的图像数据,框架(500) 中的上游装置将由该图像数据指定的或从该图像数据确定的照度值映 射到参考GSDF中的参考数字代码值,并且基于将被编码的图像数据, 产生用参考数字代码值编码的参考编码图像数据。从基于将被编码的 图像数据的照度值到参考数字代码值的映射操作可以包括:选择数字 代码值,这些数字代码值的相应的参考灰度级(例如,如表1中所示) 匹配或者如参考GSDF中的任何其他参考照度值那样接近地逼近由将 被编码的图像数据指定的或从该图像数据确定的照度值;并且用参考 编码图像数据中的参考数字代码值来取代照度值。In some embodiments, in response to receiving image data to be encoded, an upstream device in the framework (500) maps luminance values specified by or determined from the image data to reference digital code values in a reference GSDF, and generates reference encoded image data encoded with the reference digital code values based on the image data to be encoded. The mapping operation from luminance values based on the image data to be encoded to reference digital code values may include: selecting digital code values whose corresponding reference grayscale levels (e.g., as shown in Table 1) match or approximate as closely as any other reference luminance value in the reference GSDF to the luminance value specified by or determined from the image data to be encoded; and replacing the luminance value with the reference digital code value in the reference encoded image data.
附加地,可选地或可替代地,可以将预处理和后处理步骤(其可 以包括,但不仅仅限于,颜色空间转换、下采样、上采样、色调映射、 颜色分级、解压缩、压缩等)作为产生参考编码图像数据的一部分执 行。Additionally, optionally or alternatively, pre-processing and post-processing steps (which may include, but are not limited to, color space conversion, downsampling, upsampling, tone mapping, color grading, decompression, compression, etc.) may be performed as part of generating the reference encoded image data.
在示例实施例中,框架(500)可以包括被配置为将参考编码图像 数据编码和/或格式化为一个或多个译码位流或图像文件的软件和/或 硬件组件(例如,编码或格式化单元(506))。译码位流或图像文件 可以是基于标准的格式、专有格式、或至少部分基于基于标准的格式 的扩展格式。附加地和/或可选地,译码位流或图像文件可以包括元数 据,该元数据包含与用于产生参考编码图像数据的预处理或后处理相 关、参考GSDF的相关参数中的一个或多个(例如,模型参数;如表 1、图3和图4中所示的最小照度值、最大照度值、最小数字代码值、 最大数字代码值等;识别多个CSF之中的一个CSF的识别字段;参 考观看距离)。In an example embodiment, the framework (500) may include software and/or hardware components (e.g., encoding or formatting unit (506)) configured to encode and/or format the reference encoded image data into one or more decoded bitstreams or image files. The decoded bitstreams or image files may be in a standard-based format, a proprietary format, or an extension of a standard-based format. Additionally and/or alternatively, the decoded bitstreams or image files may include metadata containing one or more parameters of the reference GSDF related to pre-processing or post-processing used to generate the reference encoded image data (e.g., model parameters; minimum illuminance value, maximum illuminance value, minimum digital code value, maximum digital code value, etc. as shown in Table 1, Figures 3 and 4; an identification field identifying one of a plurality of CSFs; a reference viewing distance).
在一些实施例中,框架(500)可以包括一个或多个分立的上游装 置。例如,框架(500)中的所述一个或多个上游装置中的至少一个可 以被配置为基于参考GSDF来对图像数据进行编码。上游装置可以包 括被配置为执行与图5的502、504和506相关的功能的软件和/或硬 件组件。译码位流或图像文件可以由上游装置(图5的502、504和 506)通过网络连接、数字接口、有形存储介质等输出,并且在图像数 据流程(508)中被递送给其他图像处理装置进行处理或呈现。In some embodiments, the framework (500) may include one or more separate upstream devices. For example, at least one of the one or more upstream devices in the framework (500) may be configured to encode image data based on a reference GSDF. The upstream device may include software and/or hardware components configured to perform the functions associated with 502, 504, and 506 of FIG. 5 . The decoded bitstream or image file may be output by the upstream device (502, 504, and 506 of FIG. 5 ) via a network connection, a digital interface, a tangible storage medium, etc., and delivered to other image processing devices for processing or presentation in the image data flow (508).
在一些示例实施例中,框架(500)还包括作为一个或多个分立装 置的一个或多个下游装置。下游装置可以被配置为从图像数据流(508) 接收/访问所述一个或多个上游装置输出的译码位流或图像文件。例如, 下游装置可以包括被配置为对译码位流和图像文件进行解码和/或重 新格式化并且恢复/检索其中的参考编码图像数据的软件和/或硬件组件(例如,解码或重新格式化单元(510))。如图5中所示,下游装 置可以包括不同的一组显示装置。In some example embodiments, the framework (500) further includes one or more downstream devices that are one or more separate devices. The downstream devices can be configured to receive/access the decoded bitstream or image file output by the one or more upstream devices from the image data stream (508). For example, the downstream devices can include software and/or hardware components (e.g., a decoding or reformatting unit (510)) configured to decode and/or reformat the decoded bitstream and image file and recover/retrieve the reference encoded image data therein. As shown in FIG5 , the downstream devices can include a different set of display devices.
在一些实施例中,显示装置(未示出)可以被设计和/或实现为支 持参考GSDF。如果显示装置支持参考GSDF中的每一个灰度级,则 可以提供高精度HDR图像呈现。显示装置可以以比人类视觉可能检 测到的水平更细微的水平或相同的水平来呈现图像细节。In some embodiments, a display device (not shown) may be designed and/or implemented to support the reference GSDF. If the display device supports every grayscale level in the reference GSDF, high-precision HDR image rendering can be provided. The display device can render image details at a level finer than or equal to that detectable by human vision.
在一些实施例中,显示装置的装置特定GSDF中的原生数字代码 值(其可以被实现为显示系统中的数字化电压值,例如,数字驱动水 平或DDL)可以对应于与参考GSDF中的数字代码值不同的装置特定 灰度级(或照度值)。装置特定灰度级可以被设计为支持sRGB、Rec.709 或其他规范(包括使用与互补密度相关的表示的那些规范)。附加地, 可选地或可替代地,装置特定灰度级可以基于显示器驱动的基本DAC 特性。In some embodiments, the raw digital code values in a display device's device-specific GSDF (which can be implemented as digitized voltage values in the display system, such as digital drive levels or DDLs) can correspond to device-specific grayscale levels (or luminance values) that differ from the digital code values in the reference GSDF. The device-specific grayscale levels can be designed to support sRGB, Rec.709, or other specifications (including those using complementary density-dependent representations). Additionally, optionally, or alternatively, the device-specific grayscale levels can be based on the underlying DAC characteristics of the display driver.
在一些实施例中,显示装置A(512-A)可以被设计和/或实现为 支持可见动态范围(VDR)显示器的装置特定GSDF A(514-A)。 GSDF A(514-A)可以基于用于装置特定数字代码值的12位的位深 (12位代码空间)、10,000:1对比率(CR)、以及>P3域。GSDF A (514-A)可以支持参考GSDF(504)的整个范围中的第一子范围(例 如,0至5,000cd/m2)内的灰度级。可替代地和/或可选地,GSDF A (514-A)可以支持参考GSDF(504)中的整个范围(例如,0至12,000 cd/m2),但是可以包括比参考GSDF(504)中的所有参考灰度级少 的参考灰度级。In some embodiments, display device A (512-A) can be designed and/or implemented as a device-specific GSDF A (514-A) that supports a visible dynamic range (VDR) display. GSDF A (514-A) can be based on a bit depth of 12 bits (12-bit code space) for device-specific digital code values, a 10,000:1 contrast ratio (CR), and a >P3 domain. GSDF A (514-A) can support grayscale levels within a first sub-range (e.g., 0 to 5,000 cd/m 2 ) of the entire range of a reference GSDF (504). Alternatively and/or optionally, GSDF A (514-A) can support the entire range (e.g., 0 to 12,000 cd/m 2 ) in the reference GSDF (504), but can include fewer reference grayscale levels than all reference grayscale levels in the reference GSDF (504).
在一些实施例中,显示装置B(512-B)可以被设计和/或实现为 支持用于比VDR窄的动态范围的装置特定GSDF B(514-B)。例如, 显示装置B(512-B)可以是标准动态范围(SDR)显示器。如本文中 所使用的,术语“标准动态范围”和“低动态范围”和/或它们相应的缩写“SDR”和“LDR”可以同义地和/或互换地使用。在一些实施例 中,GSDF B(514-B)可以支持用于装置特定数字代码值的8位的位 深、500-5,000:1对比率(CR)、以及如Rec.709中所定义的色域。 在一些实施例中,GSDF B(514-B)可以提供参考GSDF(504)的第 二子范围(例如,0至2000cd/m2)内的灰度级。In some embodiments, display device B (512-B) may be designed and/or implemented to support a device-specific GSDF B (514-B) for a dynamic range narrower than the VDR. For example, display device B (512-B) may be a standard dynamic range (SDR) display. As used herein, the terms "standard dynamic range" and "low dynamic range" and/or their corresponding abbreviations "SDR" and "LDR" may be used synonymously and/or interchangeably. In some embodiments, GSDF B (514-B) may support a bit depth of 8 bits for device-specific digital code values, a contrast ratio (CR) of 500-5,000:1, and a color gamut as defined in Rec. 709. In some embodiments, GSDF B (514-B) may provide grayscale levels within a second sub-range (e.g., 0 to 2000 cd/m 2 ) of the reference GSDF (504).
在一些实施例中,显示装置C(512-C)可以被设计和/或实现为 支持用于甚至比SDR窄的动态范围的装置特定GSDF C(514-C)。 例如,显示装置C(512-C)可以是平板显示器。在一些实施例中, GSDF C(514-C)可以支持用于装置特定数字代码值的8位的位深、100-800:1对比率(CR)、以及比Rec.709中所定义的色域小的色域。 在一些实施例中,GSDFC(514-C)可以提供参考GSDF(504)的第 三子范围(例如,0至1,200cd/m2)内的灰度级。In some embodiments, the display device C (512-C) can be designed and/or implemented to support a device-specific GSDF C (514-C) for a dynamic range even narrower than SDR. For example, the display device C (512-C) can be a flat panel display. In some embodiments, the GSDF C (514-C) can support a bit depth of 8 bits for device-specific digital code values, a contrast ratio (CR) of 100-800:1, and a color gamut smaller than that defined in Rec. 709. In some embodiments, the GSDFC (514-C) can provide grayscale levels within a third sub-range (e.g., 0 to 1,200 cd/m 2 ) of the reference GSDF (504).
在一些实施例中,显示装置(例如,显示装置D(512-D))可以 被设计和/或实现为支持用于比SDR窄得多的非常有限的动态范围的 装置特定GSDF(例如,GSDF D(514-D))。例如,显示装置D(512-D) 可以是电子纸显示器。在一些实施例中,GSDF D(514-D)可以支持用于装置特定数字代码值的6位的位深、10:1或更小的对比率(CR)、 以及比Rec.709中所定义的色域小得多的色域。在一些实施例中, GSDF D(514-D)可以提供参考GSDF(504)的第四子范围(例如, 0至100cd/m2)内的灰度级。In some embodiments, a display device (e.g., display device D (512-D)) may be designed and/or implemented to support a device-specific GSDF (e.g., GSDF D (514-D)) for a very limited dynamic range that is much narrower than SDR. For example, display device D (512-D) may be an electronic paper display. In some embodiments, GSDF D (514-D) may support a bit depth of 6 bits for device-specific digital code values, a contrast ratio (CR) of 10:1 or less, and a color gamut that is much smaller than the color gamut defined in Rec. 709. In some embodiments, GSDF D (514-D) may provide grayscale levels within a fourth sub-range (e.g., 0 to 100 cd/m 2 ) of the reference GSDF (504).
图像呈现的精度可以随显示装置A至D(512-A至D)中的每个 而适当地按比例降低。在一些实施例中,装置特定GSDF A至D (514-A至D)中的每个中的灰度级的子集可以与参考GSDF(504) 中所支持的参考灰度级相关或者映射到该所支持的参考灰度级,以便 在该显示装置所支持的灰度级范围中均匀地分布感知上显著的错误。The accuracy of the image rendering can be appropriately scaled down for each of the display devices A to D (512-A to D). In some embodiments, a subset of the gray levels in each of the device-specific GSDFs A to D (514-A to D) can be correlated with or mapped to the reference gray levels supported in the reference GSDF (504) so as to evenly distribute perceptually significant errors across the range of gray levels supported by the display device.
在一些实施例中,具有装置特定GSDF(例如,514-A至514-D 之一)的显示装置(例如,512-A至512-D之一)接收/提取基于参考 GSDF编码的参考编码图像数据。作为响应,显示装置或其中的转换 单元(516-A至516-D之一)将参考编码图像数据中所指定的参考数字代码值映射到显示装置本地的装置特定数字代码值。这可以以几种 方式之一执行。在例子中,从参考数字代码值到装置特定数字代码值 的映射包括选择如下这样的装置特定灰度级(对应于装置特定数字代 码值),该装置特定灰度级匹配或者如任何其他装置特定灰度级那样 接近地逼近参考灰度级(对应于参考数字代码值)。在另一个例子中, 从参考数字代码值到装置特定数字代码值的映射包括:(1)基于与参考GSDF相关联的参考灰度级(对应于参考数字代码值)来确定色调映 射的照度值,并且(2)选择如下这样的装置特定灰度级(对应于装置特 定数字代码值),该装置特定灰度级匹配或者如任何其他装置特定灰 度级那样接近地逼近色调映射的照度值。In some embodiments, a display device (e.g., one of 512-A through 512-D) having a device-specific GSDF (e.g., one of 514-A through 514-D) receives/extracts reference encoded image data encoded based on a reference GSDF. In response, the display device or a conversion unit (e.g., one of 516-A through 516-D) therein maps reference digital code values specified in the reference encoded image data to device-specific digital code values native to the display device. This can be performed in one of several ways. In one example, mapping the reference digital code values to the device-specific digital code values includes selecting a device-specific grayscale level (corresponding to the device-specific digital code value) that matches or approximates the reference grayscale level (corresponding to the reference digital code value) as closely as any other device-specific grayscale level. In another example, mapping from a reference digital code value to a device-specific digital code value includes: (1) determining a tone-mapped luminance value based on a reference gray level (corresponding to the reference digital code value) associated with a reference GSDF, and (2) selecting a device-specific gray level (corresponding to the device-specific digital code value) that matches or approximates the tone-mapped luminance value as closely as any other device-specific gray level.
随后,显示装置或其中的驱动器芯片(518-A至518-D之一)可 以使用显示器特定数字代码值来用与显示器特定代码值相对应的装置 特定灰度级来呈现图像。The display device, or a driver chip therein (one of 518-A through 518-D), can then use the display specific digital code value to render an image using a device specific grayscale level corresponding to the display specific code value.
一般来讲,参考GSDF可以基于与显示器特定GSDF所基于的 CSF模型不同的CSF模型。参考GSDF与装置特定GSDF之间的转 换/映射是必要的。即使使用相同的CSF模型来产生参考GSDF和装 置特定GSDF两者,在导出GSDF中也可以使用模型参数的不同值。 对于参考GSDF,可以保守地设置模型参数值以为多种下游装置保留 细节,而对于装置特定GSDF,模型参数可以反映显示装置将呈现图 像的特定设计/实现和观看条件。参考GSDF与装置特定GSDF之间的 转换/映射仍然是必要的,因为特定显示装置的观看条件参数(例如, 周围光水平、显示装置的光学反射率等)与用于导出参考GSDF的模 型参数值不同。这里,观看条件参数可以包括损害显示质量(例如, 对比率等)并且抬高黑色级(例如,最低灰度级等)的那些参数。根 据如本文中所描述的技术的参考GSDF与装置特定GSDF之间的转换 /映射改进了图像呈现质量(例如,通过增大高值区域处的照度值来改 进对比率等等)。In general, the reference GSDF can be based on a different CSF model than the CSF model on which the display-specific GSDF is based. Conversion/mapping between the reference GSDF and the device-specific GSDF is necessary. Even if the same CSF model is used to generate both the reference GSDF and the device-specific GSDF, different values of the model parameters can be used in the derived GSDF. For the reference GSDF, the model parameter values can be conservatively set to preserve details for a variety of downstream devices, while for the device-specific GSDF, the model parameters can reflect the specific design/implementation and viewing conditions under which the display device will present the image. Conversion/mapping between the reference GSDF and the device-specific GSDF is still necessary because the viewing condition parameters of a specific display device (e.g., ambient light level, optical reflectivity of the display device, etc.) are different from the model parameter values used to derive the reference GSDF. Here, the viewing condition parameters can include those that degrade display quality (e.g., contrast ratio, etc.) and elevate black levels (e.g., minimum grayscale level, etc.). The conversion/mapping between a reference GSDF and a device-specific GSDF according to the techniques as described herein improves image rendering quality (e.g., improving contrast ratio by increasing luminance values at high-value areas, etc.).
9.转换参考编码数据9. Convert reference code data
图6例示根据本发明的一些实施例的示例转换单元(例如,516)。 转换单元(516)可以是,但不仅仅限于,如图5中所示的多个转换单 元(例如,516-A至516-D)之一(例如,516-A)。在一些实施例中, 转换单元(516)可以接收用于参考GSDF(REF GSDF)的第一定义数据、以及用于装置特定GSDF(例如,GSDF-A(图5的514-A)) 的第二定义数据。如本文中所使用的,如果装置是显示器,则术语“装 置特定”和“显示器特定”可以同义地使用。FIG6 illustrates an example conversion unit (e.g., 516) according to some embodiments of the present invention. The conversion unit (516) may be, but is not limited to, one of the multiple conversion units (e.g., 516-A to 516-D) shown in FIG5 (e.g., 516-A). In some embodiments, the conversion unit (516) may receive first definition data for a reference GSDF (REF GSDF) and second definition data for a device-specific GSDF (e.g., GSDF-A (514-A in FIG5)). As used herein, if the device is a display, the terms "device-specific" and "display-specific" may be used synonymously.
基于所接收的定义数据,转换单元(516)将参考GSDF与显示 器特定GSDF级联以形成转换查找表(转换LUT)。两个GSDF之间 的级联可以包括:将两个GSDF中的灰度级进行比较,并且基于比较 灰度级的结果,建立参考GSDF中的参考数字代码值与显示器特定 GSDF中的显示器特定代码值之间的映射。Based on the received definition data, the conversion unit (516) concatenates the reference GSDF with the display-specific GSDF to form a conversion lookup table (conversion LUT). The concatenation between the two GSDFs may include comparing the grayscale levels in the two GSDFs and, based on the result of the grayscale comparison, establishing a mapping between the reference digital code value in the reference GSDF and the display-specific code value in the display-specific GSDF.
更具体地,在给定参考GSDF中的参考数字代码值的情况下,可 以基于参考GSDF来确定其相应的参考灰度级。可以使用如此确定的 参考灰度级来找出显示器特定GSDF中的装置特定灰度级。在示例实 施例中,所找出的装置特定灰度级可以匹配或者如显示器特定GSDF 中的任何其他显示器特定灰度级那样接近地逼近参考灰度级。在另一 个示例实施例中,可以通过对参考灰度级起作用的整体或局部色调映 射算子来获得色调映射的照度值;所找出的装置特定灰度级可以匹配 或者如显示器特定GSDF中的任何其他显示器特定灰度级那样接近地 逼近色调映射的照度值。More specifically, given a reference digital code value in a reference GSDF, its corresponding reference grayscale can be determined based on the reference GSDF. The thus-determined reference grayscale can be used to find a device-specific grayscale in a display-specific GSDF. In an exemplary embodiment, the found device-specific grayscale can match or approximate the reference grayscale as closely as any other display-specific grayscale in the display-specific GSDF. In another exemplary embodiment, a tone-mapped luminance value can be obtained by applying a global or local tone mapping operator to the reference grayscale; the found device-specific grayscale can match or approximate the tone-mapped luminance value as closely as any other display-specific grayscale in the display-specific GSDF.
就装置特定灰度级而言,可以从显示器特定GSDF识别相应的显 示器特定数字代码值。可以在转换LUT中添加或定义由参考数字代码 值和显示器特定代码值组成的条目。For device-specific gray levels, the corresponding display-specific digital code values can be identified from the display-specific GSDF. An entry consisting of a reference digital code value and a display-specific code value can be added or defined in the conversion LUT.
可以对参考GSDF中的其他参考数字代码值重复如上所述的步骤。The steps described above may be repeated for other reference digital code values in the reference GSDF.
在一些实施例中,转换LUT可被预先构建并且在接收并处理如下 这样的图像数据之前存储,该图像数据的处理将至少部分基于转换 LUT执行。在替代实施例中,对将用转换LUT进行处理的图像数据 进行分析。可以使用分析结果来设置或至少调整参考数字代码值与装 置特定数字代码值之间的对应关系。例如,如果图像数据指示照度值 的特定集中度或分布,则可以以保留照度值的集中区域中的大量细节 的方式设置转换LUT。In some embodiments, a conversion LUT may be pre-built and stored prior to receiving and processing image data whose processing will be performed at least in part based on the conversion LUT. In alternative embodiments, the image data to be processed using the conversion LUT is analyzed. The results of the analysis can be used to set or at least adjust the correspondence between reference digital code values and device-specific digital code values. For example, if the image data indicates a particular concentration or distribution of luminance values, the conversion LUT can be configured to preserve a high degree of detail in the concentrated region of luminance values.
在一些实施例中,转换单元(516)包括被配置为比较参考GSDF 和显示器特定GSDF(514-A)两者中的量化步长(例如,相邻数字代 码值之间的照度值差或ΔL)的一个或多个软件和/或硬件组件(比较子 单元(602))。例如,参考GSDF中的参考数字代码值处的量化步长可以是参考照度值差(参考GSDFΔL),而显示器特定GSDF中 的显示器特定数字代码值处的量化步长可以是显示器特定照度值差 (显示器特定GSDFΔL)。这里,显示器特定数字代码值对应于参 考数字代码值(或者与参考数字代码值在转换LUT中形成对)。在一 些实施例中,比较子单元(602)将这两个照度值差进行比较。该操作 实质上是可以基于ΔL值执行的或者可选地和/或可替代地基于两个 GSDF曲线的相对斜率执行的测试。In some embodiments, the conversion unit (516) includes one or more software and/or hardware components (comparison subunit (602)) configured to compare the quantization step sizes (e.g., luminance value differences or ΔL between adjacent digital code values) in both the reference GSDF and the display-specific GSDF (514-A). For example, the quantization step size at the reference digital code value in the reference GSDF can be a reference luminance value difference (reference GSDFΔL), while the quantization step size at the display-specific digital code value in the display-specific GSDF can be a display-specific luminance value difference (display-specific GSDFΔL). Here, the display-specific digital code value corresponds to the reference digital code value (or forms a pair with the reference digital code value in the conversion LUT). In some embodiments, the comparison subunit (602) compares the two luminance value differences. This operation is essentially a test that can be performed based on the ΔL value or, alternatively and/or alternatively, based on the relative slopes of the two GSDF curves.
显示器特定GSDF中的照度值的量化步长典型地可以超过参考 GSDF的量化步长,这是因为来自参考GSDF的(例如,对应于高位 深域等的)一个或多个参考灰度级被合并到来自显示器特定GSDF的 (例如,对应于低位深域等的)显示器特定灰度级。在这些情况下,使用抖动来去除条带状伪像。作为总体抖动的一部分,还对局部周围 输出像素执行抖动(在空间中和/或在时间中)。在某种意义上,人眼 可以被表示为低通滤波器。至少在这个意义上,如本文中所描述的对 局部周围像素进行平均因此创建了减少和/或去除条带状视觉伪像的 期望输出灰度级,否则可能由于显示器特定GSDF中的大量化步长而 存在这些条带状视觉伪像。The quantization step size of luminance values in a display-specific GSDF can typically exceed the quantization step size of a reference GSDF because one or more reference grayscale levels from the reference GSDF (e.g., corresponding to a high bit-depth domain, etc.) are merged into the display-specific grayscale levels from the display-specific GSDF (e.g., corresponding to a low bit-depth domain, etc.). In these cases, dithering is used to remove banding artifacts. As part of the overall dithering, dithering (in space and/or in time) is also performed on the local surrounding output pixels. In a sense, the human eye can be represented as a low-pass filter. At least in this sense, averaging the local surrounding pixels as described herein creates a desired output grayscale level that reduces and/or removes visual banding artifacts that might otherwise be present due to the large quantization step size in the display-specific GSDF.
在不是那么通常的情况下,参考GSDF的照度值的量化步长有时 可能超过显示器特定GSDF的量化步长。使用基于去轮廓算法的处理, 例如通过对相邻输入像素进行平均来基于输入灰度级合成输出灰度级。In less common cases, the quantization step size of the luminance values of the reference GSDF may sometimes exceed the quantization step size of the display-specific GSDF. A process based on a decontouring algorithm is used, for example, to synthesize output gray levels based on input gray levels by averaging adjacent input pixels.
相应地,如果参考GSDFΔL大于显示器特定GSDFΔL(在图6 中,为“Y”路径),则对于转换LUT中包括参考数字代码值和显示 器特定数字代码值的条目设置去轮廓算法标志。Accordingly, if the reference GSDFΔL is greater than the display specific GSDFΔL ("Y" path in Figure 6), the decontouring algorithm flag is set for the entry in the conversion LUT that includes the reference digital code value and the display specific digital code value.
如果参考GSDFΔL小于显示器特定GSDFΔL(在图6中,为 “N”路径),则对于转换LUT中包括参考数字代码值和显示器特定 数字代码值的条目设置抖动算法标志。If the reference GSDFΔL is less than the display specific GSDFΔL (in Figure 6, the "N" path), the dither algorithm flag is set for the entry in the conversion LUT that includes the reference digital code value and the display specific digital code value.
如果参考GSDFΔL等于显示器特定GSDFΔL,则对于转换LUT 中包括参考数字代码值和显示器特定数字代码值的条目既不设置去轮 廓算法标志,也不设置抖动算法标志。If the reference GSDFΔL is equal to the display specific GSDFΔL, neither the decontouring algorithm flag nor the dithering algorithm flag is set for the entry in the conversion LUT that includes the reference digital code value and the display specific digital code value.
去轮廓和抖动算法标志可以与转换LUT中的条目一起存储,或者 可以被存储在转换LUT外部的、但是与转换LUT操作地链接的相关 数据结构中。The decontouring and dithering algorithm flags may be stored with the entries in the conversion LUT, or may be stored in associated data structures external to the conversion LUT, but operatively linked to the conversion LUT.
在一些实施例中,转换单元(516)被配置为:接收可以为高位深 或浮点输入图像的形式的参考编码图像数据,并且将参考GSDF中所 指定的参考数字代码值映射到显示器特定GSDF中所指定的显示器特 定数字代码值。除了映射GSDF之间的数字代码值之外,转换单元(516) 还可以被配置为基于前面所讨论的算法标志(去轮廓算法标志或抖动 算法标志)的设置来执行去轮廓或抖动。In some embodiments, the conversion unit (516) is configured to receive reference encoded image data, which may be in the form of a high bit depth or floating point input image, and map reference digital code values specified in a reference GSDF to display-specific digital code values specified in a display-specific GSDF. In addition to mapping digital code values between GSDFs, the conversion unit (516) may also be configured to perform decontouring or dithering based on the settings of the algorithm flags (decontouring algorithm flag or dithering algorithm flag) discussed above.
如所指出的,参考GSDF有可能包含比显示器特定GSDF多的细 节;因此,图6的“Y”路径可能不发生,或者可能不那么频繁地发 生。在一些实施例中,可以省略“Y”路径和相关的处理以简化转换 单元的实现。As noted, the reference GSDF is likely to contain more details than the display-specific GSDF; therefore, the "Y" path of Figure 6 may not occur, or may occur less frequently. In some embodiments, the "Y" path and related processing can be omitted to simplify the implementation of the conversion unit.
在一些实施例中,在对于参考编码图像数据中的像素确定的参考 数字代码值被给定的情况下,转换单元(516)在转换LUT中查找相 应的显示器特定数字代码值,并且用相应的显示器特定数字代码值取 代参考数字代码值。附加地和/或可选地,转换单元(516)基于转换 LUT中的包括参考数字代码值和显示器特定数字代码值的条目的算 法标志的存在/设置来确定对于像素是应执行去轮廓算法、还是抖动算 法。In some embodiments, given a reference digital code value determined for a pixel in the reference encoded image data, the conversion unit (516) looks up the corresponding display-specific digital code value in the conversion LUT and replaces the reference digital code value with the corresponding display-specific digital code value. Additionally and/or alternatively, the conversion unit (516) determines whether a decontouring algorithm or a dithering algorithm should be performed on the pixel based on the presence/setting of an algorithm flag for an entry in the conversion LUT that includes the reference digital code value and the display-specific digital code value.
如果确定既不应执行去轮廓算法、又不应执行抖动算法(例如, 没有关于执行任何一种算法的指示或标志),则暂时对该像素不执行 去轮廓也不执行抖动。If it is determined that neither the decontouring algorithm nor the dithering algorithm should be executed (e.g., there is no indication or flag to execute either algorithm), neither decontouring nor dithering is temporarily executed on the pixel.
如果确定应执行去轮廓算法,则转换单元(516)可以执行一种或 多种去轮廓算法(Decontour Algo)。执行所述一种或多种去轮廓算 法可以包括:接收输入的局部相邻像素的图像数据,并将这些局部相 邻像素的图像数据输入到去轮廓算法。If it is determined that a decontour algorithm should be executed, the conversion unit (516) may execute one or more decontour algorithms. Executing the one or more decontour algorithms may include receiving input image data of local adjacent pixels and inputting the image data of the local adjacent pixels into the decontour algorithm.
如果确定应执行抖动算法,则转换单元(516)可以执行一种或多 种抖动算法(Dithering Algo)。If it is determined that a dithering algorithm should be executed, the conversion unit (516) may execute one or more dithering algorithms.
如果转换单元(516)确定对相邻像素需要执行去轮廓或抖动,则 在去轮廓或抖动中仍可能涉及该像素。在例子中,可以使用像素的装 置特定(输出)灰度级来对局部相邻像素进行抖动。在另一个例子中, 可以使用像素的参考(输入)灰度级来对局部相邻像素进行去轮廓。If the conversion unit (516) determines that decontouring or dithering needs to be performed on a neighboring pixel, the pixel may still be included in the decontouring or dithering. In one example, the device-specific (output) gray level of the pixel may be used to dither the local neighboring pixels. In another example, the reference (input) gray level of the pixel may be used to decontouring the local neighboring pixels.
在一些实施例中,转换单元(516)将前面的步骤的处理结果输出 到下游处理单元或子单元。处理结果包括显示器特定编码图像数据, 其格式为用显示器特定GSDF(例如,GSDF-A)中的数字代码值编 码的显示器特定位深输出图像。In some embodiments, the conversion unit (516) outputs the processing results of the previous steps to a downstream processing unit or sub-unit. The processing results include display-specific encoded image data in the form of a display-specific bit-depth output image encoded using digital code values in a display-specific GSDF (e.g., GSDF-A).
图7例示实现8位图像处理的示例SDR显示器(700)。SDR显 示器(700)或其中的VDR解码单元(702)接收编码输入。编码输 入包括可以是多种图像数据容器格式之一的图像数据容器中的参考译 码图像数据。VDR解码单元(702)对编码输入进行解码,并且从其 中确定/检索参考编码图像数据。参考编码图像数据可以包括颜色空间 (例如,RGB颜色空间、YCbCr颜色空间等)中的各个像素的图像 数据。各个像素的图像数据可以用参考GSDF中的参考数字代码值进 行编码。FIG7 illustrates an example SDR display (700) implementing 8-bit image processing. The SDR display (700) or a VDR decoding unit (702) therein receives an encoded input. The encoded input includes reference decoded image data in an image data container that can be one of a variety of image data container formats. The VDR decoding unit (702) decodes the encoded input and determines/retrieves reference encoded image data therefrom. The reference encoded image data can include image data for each pixel in a color space (e.g., RGB color space, YCbCr color space, etc.). The image data for each pixel can be encoded using a reference digital code value in a reference GSDF.
附加地和/或可选地,SDR显示器(700)包括保持用于SDR显示 器(700)的显示器参数的显示器管理单元(704)。显示器参数可以 至少部分定义与SDR显示器(700)相关联的显示器特定GSDF(例 如,图5的GSDF-B)。定义显示器特定GSDF的显示器参数可以包 括SDR显示器(700)所支持的最大(max)和最小(min)灰度级。 显示器参数还可以包括SDR显示器所支持的颜色基元(原色)、显示 器大小(大小)、SDR显示器的图像呈现表面的光学反射率、周围光 水平。显示器参数中的一些可以预先配置有固定值。显示器参数中的 一些可以由SDR显示器(700)实时地或近乎实时地测量。显示器参 数中的一些可以由SDR显示器(700)的用户配置。显示器参数中的 一些可以预先配置有默认值,并且可以通过测量或者由用户被覆写。 显示器管理单元(704)基于参考GSDF来建立显示器特定灰度级的 感知非线性/使显示器特定灰度级的感知非线性成形,并且可以附加地 和/或可选地将色调映射作为建立显示器特定灰度级/使显示器特定灰 度级成形的一部分执行。例如,为了根据参考GSDF建立显示器特定 灰度级的感知非线性/使显示器特定灰度级的感知非线性成形的目的, 显示器管理单元(704)可以建立如图5中所示的转换LUT和/或其他 相关元数据(例如,抖动和去轮廓处理标志等)。显示器管理单元(704) 可以实现如前面所讨论的级联操作来创建与参考GSDF和显示器特定 GSDF中的一个或两者相关的转换LUT和/或其他相关元数据(712)。 转换LUT和/或其他相关元数据(712)可被SDR显示器(700)中的 其他单元或子单元访问并且使用。此外,可以使用转换LUT和/或其 他相关元数据作为或者导出用于对感知非线性进行逆操作的元数据 (714)。如本文中所使用的,对感知非线性进行逆操作可以包括将显 示器特定数字代码值转换为显示器特定数字驱动水平(例如,显示装 置中的数字化电压电平)。Additionally and/or alternatively, the SDR display (700) includes a display management unit (704) that maintains display parameters for the SDR display (700). The display parameters may at least partially define a display-specific GSDF (e.g., GSDF-B of FIG. 5 ) associated with the SDR display (700). The display parameters that define the display-specific GSDF may include the maximum (max) and minimum (min) grayscale levels supported by the SDR display (700). The display parameters may also include the color primitives (primaries) supported by the SDR display, the display size (size), the optical reflectivity of the image rendering surface of the SDR display, and the ambient light level. Some of the display parameters may be pre-configured with fixed values. Some of the display parameters may be measured by the SDR display (700) in real time or near real time. Some of the display parameters may be configured by a user of the SDR display (700). Some of the display parameters may be pre-configured with default values and may be overwritten by measurement or by the user. The display management unit (704) establishes/shapes the perceptual nonlinearity of display-specific grayscales based on a reference GSDF and may additionally and/or alternatively perform tone mapping as part of establishing/shaping the display-specific grayscales. For example, to establish/shape the perceptual nonlinearity of display-specific grayscales based on a reference GSDF, the display management unit (704) may establish a conversion LUT and/or other related metadata (e.g., dithering and decontouring flags, etc.) as shown in FIG5 . The display management unit (704) may implement a cascade operation as discussed above to create a conversion LUT and/or other related metadata (712) associated with one or both of the reference GSDF and the display-specific GSDF. The conversion LUT and/or other related metadata (712) may be accessed and used by other units or subunits in the SDR display (700). Furthermore, the conversion LUT and/or other related metadata may be used as or derived from metadata (714) for performing an inverse operation on the perceptual nonlinearity. As used herein, inverting perceptual nonlinearity may include converting display-specific digital code values to display-specific digital drive levels (e.g., digitized voltage levels in a display device).
附加地和/或可选地,SDR显示器(700)包括如图5和图6中所 示的转换单元(516)、以及8位感知量化器(706)。在一些实施例 中,SDR显示器(700)或其中的转换单元(516)和8位感知量化器 (706)将参考编码图像数据转换为用与显示器特定GSDF(例如,图 5的GSDF-A或GSDF-B)相关联的显示器特定数字代码值编码的显 示器特定位深输出图像,并且将显示器特定位深输出图像量化为8位 代码空间中的感知编码图像数据。如本文中所使用的,术语“感知编 码”可以是指基于人类视觉感知模型(诸如产生参考GSDF的CSF)的 一种编码。Additionally and/or alternatively, the SDR display (700) includes a conversion unit (516) and an 8-bit perceptual quantizer (706) as shown in Figures 5 and 6. In some embodiments, the SDR display (700), or the conversion unit (516) and the 8-bit perceptual quantizer (706) therein, converts the reference-encoded image data into a display-specific bit-depth output image encoded with a display-specific digital code value associated with a display-specific GSDF (e.g., GSDF-A or GSDF-B of Figure 5), and quantizes the display-specific bit-depth output image into perceptually encoded image data in an 8-bit code space. As used herein, the term "perceptual encoding" may refer to an encoding based on a human visual perception model, such as a CSF that generates the reference GSDF.
附加地和/或可选地,SDR显示器(700)包括视频后处理单元(708), 其可以(但不仅仅限于)对8位照度表示的感知编码图像数据不执行 图像处理操作、执行图像处理操作中的一个或多个。这些图像处理操 作可以包括(但不仅仅限于)压缩、解压缩、颜色空间转换、下采样、 上采样或颜色分级。这些操作的结果可以输出到SDR显示器(700) 的其他部分。Additionally and/or alternatively, the SDR display (700) includes a video post-processing unit (708) that may, but is not limited to, perform no image processing operation, perform one or more image processing operations on the perceptually coded image data represented by 8-bit luminance. These image processing operations may include, but are not limited to, compression, decompression, color space conversion, downsampling, upsampling, or color grading. The results of these operations may be output to other parts of the SDR display (700).
在示例实施例中,SDR显示器(700)包括8位逆感知量化器(710), 其被配置为将图像处理操作的结果中的显示器特定数字代码值转换为 显示器特定数字驱动水平(例如,数字化电压电平)。逆感知量化器 (710)所产生的(或者从数字代码值转换回的)显示器特定数字驱动 水平可以特别地支持SDR显示器(700)中可支持的几种照度非线性 之一。在例子中,逆感知量化器(710)将显示器特定数字代码值转换 为显示器特定数字驱动水平以支持与Rec.709相关联的照度非线性。 在另一个例子中,逆感知量化器(710)将显示器特定数字代码值转换 为显示器特定数字驱动水平以支持与线性照度域或对数照度域(其可 以相对容易地与局部调光操作整合)相关联的照度非线性。在另一个 例子中,逆感知量化器(710)将显示器特定数字代码值转换为显示器 特定数字驱动水平以支持显示器特定CSF(或其相关联的GSDF), 其中,显示器特定灰度级对于特定显示器(700)被最佳地放置,并且 可对于特定于显示器(700)的观看条件被调整。In an example embodiment, an SDR display (700) includes an 8-bit inverse perceptual quantizer (710) configured to convert display-specific digital code values from a result of an image processing operation into display-specific digital drive levels (e.g., digitized voltage levels). The display-specific digital drive levels generated by (or converted back from) the digital code values by the inverse perceptual quantizer (710) can specifically support one of several luminance nonlinearities supportable in the SDR display (700). In one example, the inverse perceptual quantizer (710) converts the display-specific digital code values into display-specific digital drive levels to support luminance nonlinearities associated with Rec. 709. In another example, the inverse perceptual quantizer (710) converts the display-specific digital code values into display-specific digital drive levels to support luminance nonlinearities associated with a linear luminance domain or a logarithmic luminance domain (which can be relatively easily integrated with local dimming operations). In another example, the inverse perceptual quantizer (710) converts display-specific digital code values into display-specific digital drive levels to support a display-specific CSF (or its associated GSDF), wherein the display-specific gray levels are optimally positioned for the specific display (700) and can be adjusted for viewing conditions specific to the display (700).
10.示例处理流程10. Example Processing Flow
图8A例示根据本发明的实施例的示例处理流程。在一些实施例 中,一个或多个计算装置或组件(诸如框架(500)中的一个或多个计 算装置)可以执行该处理流程。在方框802中,计算装置接收将被编 码的图像数据。FIG8A illustrates an example process flow according to an embodiment of the present invention. In some embodiments, one or more computing devices or components (such as one or more computing devices in the framework (500)) can perform the process flow. In block 802, a computing device receives image data to be encoded.
在方框804中,计算装置基于参考数字代码值集合与参考灰度级 集合之间的参考映射来将将被编码的图像数据编码为参考编码图像数 据。这里,将被编码的图像数据中的照度值用参考数字代码值集合表 示。参考数字代码值集合中的两个相邻参考数字代码值所表示的两个 参考灰度级之间的照度差可以与适于特定光水平的人类视觉的峰值对 比敏感度成反比。At block 804, the computing device encodes the image data to be encoded into reference encoded image data based on a reference mapping between a set of reference digital code values and a set of reference grayscale levels. Here, luminance values in the image data to be encoded are represented by the set of reference digital code values. The luminance difference between two reference grayscale levels represented by two adjacent reference digital code values in the set of reference digital code values may be inversely proportional to the peak contrast sensitivity of human vision for a particular light level.
在方框806中,计算装置输出参考编码图像数据。In block 806 , the computing device outputs the reference encoded image data.
在实施例中,计算装置基于对比敏感度函数(CSF)模型来确定 参考灰阶显示函数(GSDF);参考GSDF指定参考数字代码值集合 与参考灰度级集合之间的参考映射。CSF模型包括一个或多个模型参 数,这些模型参数可以具有落在包括下列中的一个或多个的范围中的 角度大小:25度×25度与30度×30度之间、30度×30度与35度× 35度之间、35度×35度与40度×40度之间、40度×40度与45度× 45度之间、或大于45度×45度。In an embodiment, a computing device determines a reference grayscale display function (GSDF) based on a contrast sensitivity function (CSF) model; the reference GSDF specifies a reference mapping between a set of reference digital code values and a set of reference grayscale levels. The CSF model includes one or more model parameters, and these model parameters may have an angular size within a range including one or more of: between 25 degrees x 25 degrees and 30 degrees x 30 degrees, between 30 degrees x 30 degrees and 35 degrees x 35 degrees, between 35 degrees x 35 degrees and 40 degrees x 40 degrees, between 40 degrees x 40 degrees and 45 degrees x 45 degrees, or greater than 45 degrees x 45 degrees.
在实施例中,计算装置将参考灰度级集合所支持的照度值范围内 的中间照度值分配给托管参考数字代码值集合的代码空间中的中间数 字代码值,并且通过执行堆叠或积分计算中的一个或多个来导出多个 子映射,每个子映射将参考数字代码值集合中的参考数字代码值映射 到参考灰度级集合中的参考灰度级。中间照度值可以在包括下列中的一个或多个的范围内选择:小于50尼特、50尼特与100尼特之间(包 括50和100尼特在内)、100与500尼特之间(包括100和500尼特 在内)、或不小于500尼特。In one embodiment, a computing device assigns an intermediate luminance value within the range of luminance values supported by the reference grayscale set to an intermediate digital code value in a code space hosting the reference digital code value set, and derives a plurality of submaps by performing one or more of stacking or integration calculations, each submap mapping a reference digital code value in the reference digital code value set to a reference grayscale level in the reference grayscale set. The intermediate luminance value may be selected from a range comprising one or more of: less than 50 nits, between 50 nits and 100 nits (inclusive), between 100 and 500 nits (inclusive), or not less than 500 nits.
在示例实施例中,参考灰度级集合覆盖上限的值为下列值的动态 范围:小于500尼特、500尼特与1000尼特之间(包括500和1000 尼特在内)、1000尼特与5000尼特之间(包括1000和5000尼特在 内)、5000尼特与10000尼特之间(包括5000和10000尼特在内)、10000尼特与15000尼特之间(包括10000和15000尼特在内)、或 大于15000尼特。In an exemplary embodiment, the value of the upper limit of the reference grayscale set coverage is a dynamic range of the following values: less than 500 nits, between 500 nits and 1000 nits (including 500 and 1000 nits), between 1000 nits and 5000 nits (including 1000 and 5000 nits), between 5000 nits and 10000 nits (including 5000 and 10000 nits), between 10000 nits and 15000 nits (including 10000 and 15000 nits), or greater than 15000 nits.
在实施例中,从基于对比敏感度函数(CSF)模型确定的多个对 比敏感度曲线之中的一个对比敏感度曲线确定峰值对比敏感度,所述 CSF模型具有包括照度值变量、空间频率变量或者一个或多个其他变 量中的一个或多个的模型参数。In an embodiment, the peak contrast sensitivity is determined from a contrast sensitivity curve among a plurality of contrast sensitivity curves determined based on a contrast sensitivity function (CSF) model, wherein the CSF model has model parameters including one or more of a luminance value variable, a spatial frequency variable, or one or more other variables.
在实施例中,基于所述多个对比敏感度曲线中的至少两个对比敏 感度曲线确定的至少两个峰值对比敏感度发生于两个不同的空间频率 值处。In an embodiment, at least two peak contrast sensitivities determined based on at least two contrast sensitivity curves of the plurality of contrast sensitivity curves occur at two different spatial frequency values.
在实施例中,计算装置将通过来自输入视频信号的将被编码的图 像数据被表示、接收、发送或存储的一个或多个输入图像转换为通过 输出视频信号中所包含的参考编码图像数据被表示、接收、发送或存 储的一个或多个输出图像。In an embodiment, a computing device converts one or more input images represented, received, sent or stored by image data to be encoded from an input video signal into one or more output images represented, received, sent or stored by reference encoded image data contained in an output video signal.
在实施例中,将被编码的图像数据包括以下列之一编码的图像数 据:高分辨率高动态范围(HDR)图像格式、与电影艺术与科学学院 (AMPAS)的学院颜色编码规范(ACES)标准相关联的RGB颜色 空间、数字电影倡导联盟的P3颜色空间标准、参考输入媒介度量/参考输出媒介度量(RIMM/ROMM)标准、sRGB颜色空间、与国际电 信联盟(ITU)的BT.709推荐标准相关联的RGB颜色空间等。In an embodiment, the image data to be encoded includes image data encoded in one of the following: a high-resolution high dynamic range (HDR) image format, an RGB color space associated with the Academy Color Encoding Specification (ACES) standard of the Academy of Motion Picture Arts and Sciences (AMPAS), the P3 color space standard of the Digital Cinema Initiatives Alliance, the Reference Input Medium Metrics/Reference Output Medium Metrics (RIMM/ROMM) standard, the sRGB color space, the RGB color space associated with the BT.709 recommendation standard of the International Telecommunication Union (ITU), etc.
在实施例中,两个相邻参考数字代码值所表示的两个参考灰度级 之间的照度差小于在特定光水平的最小可觉差阈值。In an embodiment, the luminance difference between two reference gray levels represented by two adjacent reference digital code values is less than a just noticeable difference threshold at a specific light level.
在实施例中,所述特定光水平是两个照度值之间(包括该两个照 度只在内)的照度值。In an embodiment, the specific light level is an illuminance value between two illuminance values (inclusive of the two illuminance values).
在实施例中,参考数字代码值集合包括位深为下列位深的代码空 间中的整数值:少于12位;12位与14位之间(包括在内(inclusive)); 至少14位;或者14位或更多位。In an embodiment, the set of reference digital code values comprises integer values in a code space having a bit depth of: less than 12 bits; between 12 bits and 14 bits (inclusive); at least 14 bits; or 14 bits or more.
在实施例中,参考灰度级集合可以包括量化照度值集合。In an embodiment, the set of reference gray levels may include a set of quantized luminance values.
图8B例示根据本发明的实施例的另一示例处理流程。在一些实 施例中,一个或多个计算装置或组件(诸如框架(500)中的一个或多 个计算装置)可以执行该处理流程。在方框852中,计算装置确定参 考数字代码值集合与装置特定数字代码值集合之间的数字代码映射。 这里,在参考映射中将参考数字代码值集合映射到参考灰度级集合, 而在装置特定映射中将装置特定数字代码值集合映射到装置特定灰度 级集合。FIG8B illustrates another example process flow according to an embodiment of the present invention. In some embodiments, one or more computing devices or components (such as one or more computing devices in the framework (500)) can perform the process flow. In block 852, the computing device determines a digital code mapping between a reference set of digital code values and a device-specific set of digital code values. Here, the reference set of digital code values is mapped to a reference set of grayscale levels in the reference mapping, while the device-specific set of digital code values is mapped to a device-specific set of grayscale levels in the device-specific mapping.
在方框854中,计算装置接收用参考数字代码值编码的参考编码 图像数据。参考编码图像数据中的照度值基于参考数字代码值集合。 参考数字代码值集合中的两个相邻参考数字代码值所表示的两个参考 灰度级之间的照度差可以与适于特定光水平的人类视觉的峰值对比敏 感度成反比。At block 854, the computing device receives reference encoded image data encoded with reference digital code values. Luminance values in the reference encoded image data are based on a set of reference digital code values. The luminance difference between two reference grayscale levels represented by two adjacent reference digital code values in the set of reference digital code values can be inversely proportional to the peak contrast sensitivity of human vision for a particular light level.
在方框856中,计算装置基于数字代码映射来将用参考数字代码 值集合编码的参考编码图像数据转码为用装置特定数字控制代码集合 编码的装置特定图像数据。装置特定图像数据中的照度值基于装置特 定数字代码值集合。In block 856, the computing device transcodes the reference encoded image data encoded with the reference digital code value set into device-specific image data encoded with the device-specific digital control code set based on the digital code mapping. Luminance values in the device-specific image data are based on the device-specific digital code value set.
在实施例中,计算装置确定参考数字代码值集合与装置特定数字 代码值集合之间的对应关系集合。这里,对应关系集合中的一个对应 关系将参考数字代码值集合中的参考数字代码值与装置特定数字代码 值相关。计算装置还将参考数字代码值处的第一照度差与装置特定数 字代码值处的第二照度差进行比较,并且基于第一照度差与第二照度 差的比较来存储关于对于参考数字代码值是应执行抖动、去轮廓操作、 还是不执行操作的算法标志。In one embodiment, a computing device determines a set of correspondences between a set of reference digital code values and a set of device-specific digital code values. Here, one correspondence in the set of correspondences relates a reference digital code value in the reference digital code value set to a device-specific digital code value. The computing device also compares a first luminance difference at the reference digital code value with a second luminance difference at the device-specific digital code value, and based on the comparison of the first luminance difference with the second luminance difference, stores an algorithm flag indicating whether a dithering operation, a decontouring operation, or no operation should be performed on the reference digital code value.
在实施例中,计算装置从像素的参考编码图像数据确定参考数字 代码值,并且进一步确定是否针对参考数字代码值设置算法标志。响 应于确定算法标志被设置为去轮廓,计算装置对所述像素执行去轮廓 算法。可替代地,响应于确定算法标志被设置为抖动,计算装置对所 述像素执行抖动操作。In one embodiment, a computing device determines a reference digital code value from reference encoded image data for a pixel and further determines whether an algorithm flag is set for the reference digital code value. In response to determining that the algorithm flag is set to decontouring, the computing device performs a decontouring algorithm on the pixel. Alternatively, in response to determining that the algorithm flag is set to dithering, the computing device performs a dithering operation on the pixel.
在实施例中,计算装置基于用装置特定数字控制代码集合编码的 装置特定图像数据来在显示器上呈现一个或多个图像。这里,显示器 可以是(但不仅仅限于)下列显示器之一:可见动态范围(VDR)显 示器、标准动态范围(SDR)显示器、平板计算机显示器或手持装置 显示器。In one embodiment, a computing device presents one or more images on a display based on device-specific image data encoded with a set of device-specific digital control codes. The display may be, but is not limited to, one of the following: a visible dynamic range (VDR) display, a standard dynamic range (SDR) display, a tablet computer display, or a handheld device display.
在实施例中,装置特定灰阶显示函数(GSDF)指定了装置特定 数字代码值集合与装置特定灰度级集合之间的装置特定映射。In an embodiment, a device-specific grayscale display function (GSDF) specifies a device-specific mapping between a set of device-specific digital code values and a set of device-specific gray levels.
在实施例中,基于一个或多个显示器参数以及零个或更多个观看 条件参数来导出装置特定映射。In an embodiment, a device-specific mapping is derived based on one or more display parameters and zero or more viewing condition parameters.
在实施例中,装置特定灰度级集合覆盖上限的值为下列值的动态 范围:小于100尼特;不小于100尼特但小于500尼特;500尼特与 1000尼特之间(包括);1000尼特与5000尼特之间(包括);5000 尼特与10000尼特之间(包括);或大于10000尼特。In an embodiment, the device-specific grayscale set covers an upper limit value of a dynamic range of the following values: less than 100 nits; not less than 100 nits but less than 500 nits; between 500 nits and 1000 nits (inclusive); between 1000 nits and 5000 nits (inclusive); between 5000 nits and 10000 nits (inclusive); or greater than 10000 nits.
在实施例中,计算装置将通过来自输入视频信号的参考编码图像 数据表示、接收、发送或存储的一个或多个输入图像转换为通过输出 视频信号中所包含的装置特定图像数据表示、接收、发送或存储的一 个或多个输出图像。In an embodiment, a computing device converts one or more input images represented, received, sent, or stored by reference encoded image data from an input video signal into one or more output images represented, received, sent, or stored by device-specific image data contained in an output video signal.
在实施例中,装置特定图像数据支持下列之一的图像呈现:高分 辨率高动态范围(HDR)图像格式、与电影艺术与科学学院(AMPAS) 的学院颜色编码规范(ACES)标准相关联的RGB颜色空间、数字电 影倡导联盟的P3颜色空间标准、参考输入媒介度量/参考输出媒介度 量(RIMM/ROMM)标准、sRGB颜色空间、与国际电信联盟(ITU) 的BT.709推荐标准相关联的RGB颜色空间。In an embodiment, the device-specific image data supports image presentation in one of the following: a high-resolution high dynamic range (HDR) image format, an RGB color space associated with the Academy Color Encoding Specification (ACES) standard of the Academy of Motion Picture Arts and Sciences (AMPAS), the P3 color space standard of the Digital Cinema Initiatives Alliance, the Reference Input Medium Measurement/Reference Output Medium Measurement (RIMM/ROMM) standard, the sRGB color space, and the RGB color space associated with the BT.709 recommendation of the International Telecommunication Union (ITU).
在实施例中,装置特定数字代码值集合包括位深为下列位深的代 码空间中的整数值:8位;多于8位、但少于12位;12位或更多位。In an embodiment, the set of device-specific digital code values comprises integer values in a code space having a bit depth of: 8 bits; more than 8 bits but less than 12 bits; 12 bits or more.
在实施例中,所述装置特定灰度级集合可以包括量化照度值集合。In an embodiment, the set of device specific gray levels may comprise a set of quantized luminance values.
在各种实施例中,编码器、解码器、系统等执行如所描述的前面 的方法的任何一种方法或一部分。In various embodiments, an encoder, decoder, system, etc. performs any one or part of the foregoing methods as described.
11.实现机构——硬件概述11. Implementation mechanism - Hardware overview
根据一个实施例,本文中所描述的技术用一个或多个专用计算装 置来实现。专用计算装置可以被硬连线来执行这些技术,或者可以包 括被持久地编程为执行这些技术的数字电子装置(诸如一个或多个专 用集成电路(ASIC)或现场可编程门阵列(FPGA)),或者可以包 括被编程为按照固件、存储器、其它储存器或组合中的程序指令执行 这些技术的一个或多个通用硬件处理器。这样的专用计算装置还可以 通过定制编程来组合定制的硬连线逻辑、ASIC或FPGA以实现这些 技术。专用计算装置可以是台式电脑系统、便携式电脑系统、手持装 置、联网装置、或结合有实现这些技术的硬连线和/或程序逻辑的任何 其它装置。According to one embodiment, the techniques described herein are implemented using one or more special-purpose computing devices. The special-purpose computing devices may be hardwired to perform these techniques, or may comprise digital electronics permanently programmed to perform these techniques (such as one or more application-specific integrated circuits (ASICs) or field-programmable gate arrays (FPGAs)), or may comprise one or more general-purpose hardware processors programmed to execute these techniques according to program instructions in firmware, memory, other storage, or a combination thereof. Such special-purpose computing devices may also incorporate customized hardwired logic, ASICs, or FPGAs through custom programming to implement these techniques. The special-purpose computing devices may be desktop computer systems, portable computer systems, handheld devices, networked devices, or any other device incorporating hardwired and/or program logic to implement these techniques.
例如,图9是例示在其上可以实现本发明的示例实施例的计算机 系统900的框图。计算机系统900包括用于传送信息的总线902或其 他通信机构、以及与总线902耦合的用于处理信息的硬件处理器504。 硬件处理器904可以是例如通用微处理器。For example, Figure 9 is a block diagram illustrating a computer system 900 on which an example embodiment of the present invention may be implemented. The computer system 900 includes a bus 902 or other communication mechanism for transmitting information, and a hardware processor 904 coupled to the bus 902 for processing information. The hardware processor 904 may be, for example, a general-purpose microprocessor.
计算机系统900还包括耦合到总线902的用于存储将被处理器 904执行的信息和指令的主存储器906,诸如随机存取存储器(RAM) 或其他动态存储装置。主存储器906还可以用于存储处理器904执行 指令期间的临时变量或其他中间信息。这样的指令在被存储在处理器 904可访问的非暂态存储介质上时使计算机系统900变为被定制为执 行这些指令中所指定的操作的专用机器。Computer system 900 also includes a main memory 906, such as a random access memory (RAM) or other dynamic storage device, coupled to bus 902 for storing information and instructions to be executed by processor 904. Main memory 906 may also be used to store temporary variables or other intermediate information during execution of instructions by processor 904. Such instructions, when stored on a non-transitory storage medium accessible to processor 904, transform computer system 900 into a special-purpose machine customized to perform the operations specified in the instructions.
计算机系统900还包括耦合到总线902的用于存储用于处理器 904的静态信息和指令的只读存储器(ROM)908或其他静态存储装 置。提供存储装置910,诸如磁盘或光学盘,并且将存储装置910耦 合到总线902以用于存储信息和指令。Computer system 900 also includes a read-only memory (ROM) 908 or other static storage device coupled to bus 902 for storing static information and instructions for processor 904. A storage device 910, such as a magnetic or optical disk, is provided and coupled to bus 902 for storing information and instructions.
计算系统900可以通过总线902耦合到用于向计算机用户显示信 息的显示器912,诸如液晶显示器。包括字母数字键和其他键的输入 装置914耦合到总线902,以用于将信息和命令选择传送到处理器904。 另一种类型的用户输入装置是用于将方向信息和命令选择传送给处理 器904并且用于控制显示器912上的光标移动的光标控件916,诸如 鼠标、轨迹球、或光标方向键。该输入装置通常具有两个轴(第一轴 (例如,x)和第二轴(例如,y))上的两个自由度,这使得装置可 以指定平面中的位置。The computing system 900 may be coupled to a display 912, such as a liquid crystal display, via bus 902 for displaying information to a computer user. An input device 914, including alphanumeric and other keys, is coupled to bus 902 for communicating information and command selections to processor 904. Another type of user input device is a cursor control 916, such as a mouse, trackball, or cursor direction keys, for communicating directional information and command selections to processor 904 and for controlling the movement of a cursor on display 912. This input device typically has two degrees of freedom along two axes (a first axis (e.g., x) and a second axis (e.g., y)), which allows the device to specify a position in a plane.
计算机系统900可以使用定制的硬连线逻辑、一个或多个ASIC 或FPGA、固件和/或程序逻辑(其与计算机系统组合来使计算机系统 900变为专用机器)来实现本文中所描述的技术。根据一个实施例, 本文中的技术由计算机系统900响应于处理器904执行包含在主存储 器906中的一个或多个指令的一个或多个序列来执行。可以将这样的 指令从另一存储介质(诸如存储装置910)读取到主存储器906中。 包含在主存储器906中的指令序列的执行使处理器904执行本文中所 描述的处理步骤。在替代实施例中,可以代替软件指令来使用硬连线 电路系统,或者可以与软件指令组合使用硬连线电路系统。Computer system 900 may implement the techniques described herein using custom hardwired logic, one or more ASICs or FPGAs, firmware, and/or program logic (which, in combination with the computer system, transforms computer system 900 into a special-purpose machine). According to one embodiment, the techniques described herein are performed by computer system 900 in response to processor 904 executing one or more sequences of one or more instructions contained in main memory 906. Such instructions may be read into main memory 906 from another storage medium, such as storage device 910. Execution of the sequences of instructions contained in main memory 906 causes processor 904 to perform the process steps described herein. In alternative embodiments, hardwired circuitry may be used in place of, or in combination with, software instructions.
这里使用的术语“存储介质”指的是存储导致机器以特定的方式 操作的数据和/或指令的任何非暂态介质。这种存储介质可包含非易失 性介质和/或易失性介质。非易失性介质包含例如光盘或磁盘,诸如存 储装置910。易失性介质包含诸如主存储器906的动态存储器。存储 介质的常见形式包含例如软盘、柔性盘、硬盘、固态驱动器、磁带或 任何其它的磁性数据存储介质、CD-ROM、任何其它光学数据存储介 质、具有孔图案的任何物理介质、RAM、PROM和EPROM、 FLASH-EPROM、NVRAM、任何其它存储芯片或盒。As used herein, the term "storage media" refers to any non-transitory medium that stores data and/or instructions that cause a machine to operate in a specific manner. Such storage media may include non-volatile media and/or volatile media. Non-volatile media include, for example, optical or magnetic disks, such as storage device 910. Volatile media include dynamic memory, such as main memory 906. Common forms of storage media include, for example, floppy disks, flexible disks, hard disks, solid-state drives, magnetic tape or any other magnetic data storage medium, CD-ROMs, any other optical data storage medium, any physical medium with a pattern of holes, RAM, PROM and EPROM, FLASH-EPROM, NVRAM, or any other memory chip or cartridge.
存储介质与传输介质明显不同,但可与其组合使用。传输介质参 与存储介质之间的信息传送。例如,传输介质包含共轴电缆、铜线和 光纤,包括包含总线902的导线。传输介质也可采取声波或光波的形 式,诸如在无线电波和红外数据通信期间产生的那些。Storage media are distinct from, but may be used in combination with, transmission media. Transmission media are involved in the transfer of information between storage media. For example, transmission media include coaxial cables, copper wire, and optical fiber, including the wires that comprise bus 902. Transmission media can also take the form of acoustic or light waves, such as those generated during radio wave and infrared data communications.
各种形式的介质可涉及将一个或更多个指令的一个或更多个序列 承载到处理器902以供执行。例如,可初始地在磁盘或远程计算机的 固态驱动器上承载指令。远程计算机可将指令加载到其动态存储器中 并且通过使用调制解调器在电话线上发送指令。计算机系统900本地 的调制解调器可接收电话线上的数据,并且使用红外发射器以将数据 转换成红外信号。红外检测器可接收在红外信号中承载的数据并且适 当的电路可将该数据放置在总线902上。总线902将数据承载到主存 储器906,从该主存储器906,处理器904检索和执行指令。由主存储 器906接收的指令可以可选地在被处理器904执行之前或之后被存储于存储装置910上。Various forms of media may be involved in carrying one or more sequences of one or more instructions to processor 902 for execution. For example, the instructions may initially be carried on a magnetic disk or solid-state drive of a remote computer. The remote computer may load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computer system 900 may receive the data on the telephone line and use an infrared transmitter to convert the data into an infrared signal. An infrared detector may receive the data carried in the infrared signal and appropriate circuitry may place the data on bus 902. Bus 902 carries the data to main memory 906, from which processor 904 retrieves and executes the instructions. The instructions received by main memory 906 may optionally be stored on storage device 910 before or after execution by processor 904.
计算机系统900还包括耦合到总线902的通信接口918。通信接 口918提供耦合到网络链路920的双向数据通信,网络链路920连接 到局域网922。例如,通信接口918可以是综合服务数字网络(ISDN) 卡、电缆调制解调器、卫星调制解调器、或提供与相应类型的电话线 的数据通信连接的调制解调器。另举一例,通信接口918可以是提供 与兼容的LAN的数据通信连接的局域网(LAN)卡。还可以实现无 线链路。在任何这样的实现中,通信接口918发送并接收传载表示各 种类型的信息的数字数据流的电信号、电磁信号或光学信号。Computer system 900 also includes a communication interface 918 coupled to bus 902. Communication interface 918 provides bidirectional data communication coupled to network link 920, which is connected to local area network 922. For example, communication interface 918 can be an integrated services digital network (ISDN) card, a cable modem, a satellite modem, or a modem that provides a data communication connection with a corresponding type of telephone line. As another example, communication interface 918 can be a local area network (LAN) card that provides a data communication connection with a compatible LAN. Wireless links can also be implemented. In any such implementation, communication interface 918 sends and receives electrical signals, electromagnetic signals, or optical signals that carry digital data streams representing various types of information.
网络链路920通常通过一个或多个网络提供与其他数据装置的数 据通信。例如,网络链路920可以通过局域网922提供与主机924或 由互联网服务提供商(ISP)926操作的数据设备的连接。ISP 926继 而通过广域数据包数据通信网络(现在被称为“互联网”928)提供数 据通信服务。局域网922和互联网928都使用传载数字数据流的电信 号、电磁信号或光学信号。通过各种网络的信号以及网络链路920上 通过通信接口918的信号是示例形式的传输介质,这些信号与计算机 系统900来回传载数字数据。Network link 920 typically provides data communication with other data devices through one or more networks. For example, network link 920 may provide a connection to a host computer 924 or data equipment operated by an Internet service provider (ISP) 926 via a local area network 922. ISP 926, in turn, provides data communication services through a wide-area packet data communication network, commonly referred to as the "Internet" 928. Both local area network 922 and Internet 928 use electrical, electromagnetic, or optical signals to carry digital data streams. The signals passing through the various networks and the signals on network link 920 through communication interface 918 are exemplary forms of transmission media that carry digital data to and from computer system 900.
计算机系统900可以通过一个(多个)网络、网络链路920和通 信接口918来发送消息和接收数据(包括程序代码)。在互联网例子 中,服务器930可以通过互联网928、ISP926、局域网922和通信接 口918来发送所请求的应用程序代码。Computer system 900 can send messages and receive data (including program code) through one or more networks, network link 920, and communication interface 918. In the Internet example, server 930 can send requested application code through Internet 928, ISP 926, local area network 922, and communication interface 918.
所接收的代码可以在被处理器904接收到时被执行、和/或被存储 在存储装置910或其他非易失性储存器中以供以后执行。The received code may be executed as it is received by processor 904 and/or stored in storage device 910 or other non-volatile storage for later execution.
12.枚举示例实施例、等同、扩展、改变和其他12. Enumerate example embodiments, equivalents, extensions, variations and others
以上已经关于不同能力的显示器之间基于感知照度非线性的图像 数据交换描述了本发明的枚举示例实施例(“EEE”)。因此,本发 明的实施例可以涉及在下面的表2中枚举的例子中的一个或多个。The enumerated example embodiments ("EEE") of the present invention have been described above with respect to image data exchange based on perceived luminance nonlinearity between displays of different capabilities. Therefore, embodiments of the present invention may involve one or more of the examples enumerated in Table 2 below.
表2.枚举示例实施例Table 2. Enumerated example embodiments
下面的表3描述了用于将数字视频代码值转换为显示点处的绝对 线性照度水平的感知曲线EOTF的计算。还包括用于将绝对线性照度 转换为数字代码值的逆OETF计算。Table 3 below describes the calculation of the perceptual curve EOTF used to convert digital video code values to absolute linear luminance levels at the display point. Also included is the inverse OETF calculation used to convert absolute linear luminance to digital code values.
表3.感知曲线EOTF的示例性规范Table 3. Example specifications for perceptual curve EOTFs
下面的表4示出10位的示例性值。Table 4 below shows exemplary values of 10 bits.
表4. 10位的值的示例性表Table 4. Example table of 10-bit values
在以上的说明书中,参照可随实现而改变的大量的具体细节描述 了本发明的实施例。因此,什么是本发明以及本发明的申请人意图什 么成为本发明的专有和专用的指示是包含任何随后的校正的一组权利 要求,这些权利要求以这些权利要求发布的特定的形式从本申请发发 布。这里对于包含于这些权利要求中的术语明确阐述的任何定义应掌 控在权利要求中使用的这些术语的意思。由此,没有在权利要求中明 确详述的限制、要素、性能、特征、优点或属性不应以任何的方式限 制这些权利要求的范围。因此,说明书和附图应视为解释性而不是限 制性的。In the foregoing specification, embodiments of the present invention have been described with reference to numerous specific details that may vary from implementation to implementation. Therefore, the only indication of what is the invention, and what the applicants intend to be the sole and exclusive indication of the invention, is the set of claims, including any subsequent amendments, that issue from this application in the specific form in which such claims issue. Any definitions expressly set forth herein for terms contained in such claims shall govern the meaning of such terms as used in the claims. Accordingly, no limitation, element, property, feature, advantage, or attribute that is not expressly recited in a claim should in any way limit the scope of such claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.
Claims (14)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US61/567,579 | 2011-12-06 | ||
| US61/674,503 | 2012-07-23 | ||
| US61/703,449 | 2012-09-20 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| HK15100277.7A Addition HK1199955B (en) | 2011-12-06 | 2012-12-06 | Device and method of improving the perceptual luminance nonlinearity-based image data |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| HK15100277.7A Division HK1199955B (en) | 2011-12-06 | 2012-12-06 | Device and method of improving the perceptual luminance nonlinearity-based image data |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1231195A1 HK1231195A1 (en) | 2017-12-15 |
| HK1231195B true HK1231195B (en) | 2020-07-31 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7699279B2 (en) | Apparatus and method for improving perceptual luminance nonlinearity-based image data exchange between different display capabilities - Patents.com | |
| HK1231195B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231192B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231193B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231194B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231196B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231191B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1229922A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231193A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1197471B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231196A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231192A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231194A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1231195A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1199955B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data |