TWI843747B - Multi-range hdr video coding - Google Patents
Multi-range hdr video coding Download PDFInfo
- Publication number
- TWI843747B TWI843747B TW108131502A TW108131502A TWI843747B TW I843747 B TWI843747 B TW I843747B TW 108131502 A TW108131502 A TW 108131502A TW 108131502 A TW108131502 A TW 108131502A TW I843747 B TWI843747 B TW I843747B
- Authority
- TW
- Taiwan
- Prior art keywords
- brightness
- image
- dynamic range
- hdr
- idr
- Prior art date
Links
- 238000013507 mapping Methods 0.000 claims abstract description 242
- 238000000034 method Methods 0.000 claims abstract description 107
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 55
- 238000004364 calculation method Methods 0.000 claims abstract description 29
- 230000009466 transformation Effects 0.000 claims description 22
- 230000006835 compression Effects 0.000 claims description 14
- 238000007906 compression Methods 0.000 claims description 14
- 230000001419 dependent effect Effects 0.000 claims description 11
- 235000019557 luminance Nutrition 0.000 abstract description 252
- 238000005516 engineering process Methods 0.000 abstract description 23
- 241000023320 Luma <angiosperm> Species 0.000 abstract description 19
- OSWPMRLSEDHDFF-UHFFFAOYSA-N methyl salicylate Chemical compound COC(=O)C1=CC=CC=C1O OSWPMRLSEDHDFF-UHFFFAOYSA-N 0.000 abstract description 11
- 230000006870 function Effects 0.000 description 379
- 230000000875 corresponding effect Effects 0.000 description 94
- 238000012545 processing Methods 0.000 description 49
- 230000006978 adaptation Effects 0.000 description 37
- 238000004891 communication Methods 0.000 description 32
- 241000282414 Homo sapiens Species 0.000 description 21
- 238000013459 approach Methods 0.000 description 21
- 239000003086 colorant Substances 0.000 description 18
- 238000005457 optimization Methods 0.000 description 15
- 230000008859 change Effects 0.000 description 13
- 238000009826 distribution Methods 0.000 description 13
- 230000000007 visual effect Effects 0.000 description 12
- 230000009471 action Effects 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 11
- 238000012937 correction Methods 0.000 description 11
- 238000001228 spectrum Methods 0.000 description 11
- 230000006399 behavior Effects 0.000 description 10
- 230000007246 mechanism Effects 0.000 description 10
- 230000036961 partial effect Effects 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 238000013461 design Methods 0.000 description 9
- 238000012905 input function Methods 0.000 description 8
- 239000011159 matrix material Substances 0.000 description 6
- 238000000844 transformation Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- RCTYDUUDOSRQTI-WSZWBAFRSA-N (2s)-1-formyl-n-[(2s)-1-oxopropan-2-yl]pyrrolidine-2-carboxamide;propane Chemical compound CCC.O=C[C@H](C)NC(=O)[C@@H]1CCCN1C=O RCTYDUUDOSRQTI-WSZWBAFRSA-N 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000000670 limiting effect Effects 0.000 description 4
- 238000009877 rendering Methods 0.000 description 4
- 238000009795 derivation Methods 0.000 description 3
- 238000013213 extrapolation Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 238000010606 normalization Methods 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 101100356682 Caenorhabditis elegans rho-1 gene Proteins 0.000 description 2
- 241000519995 Stachys sylvatica Species 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 230000000996 additive effect Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000003595 mist Substances 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 241000282412 Homo Species 0.000 description 1
- 240000004050 Pentaglottis sempervirens Species 0.000 description 1
- 235000004522 Pentaglottis sempervirens Nutrition 0.000 description 1
- 101100012902 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FIG2 gene Proteins 0.000 description 1
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000005282 brightening Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004737 colorimetric analysis Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000012885 constant function Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000013523 data management Methods 0.000 description 1
- 230000006837 decompression Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- 238000005562 fading Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 238000010191 image analysis Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000011017 operating method Methods 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000002310 reflectometry Methods 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000010963 scalable process Methods 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 239000005315 stained glass Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000012384 transportation and delivery Methods 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
Landscapes
- Image Processing (AREA)
- Picture Signal Circuits (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
Description
本發明係關於用於編碼高動態範圍影像,且特別用於可根據像MPEG-HEVC(例如,電視廣播)之壓縮技術壓縮之係影像之時間序列的視訊,特別係藉由使用第二動態範圍的(多個)影像(用於傳達至解碼器)以表示第一動態範圍之(多個)(主)影像的方法及設備,該動態範圍變化涉及藉由施加一般作為後設資料與(多個)第二動態範圍影像共同傳達的函數改變影像像素輝度(例如,從正規化至1.0值的第一者至正規化至1.0值的第二者)。The present invention relates to methods and apparatus for encoding high dynamic range images, and in particular for video being a temporal sequence of images which may be compressed according to a compression technique such as MPEG-HEVC (e.g. television broadcasting), in particular by using (multiple) image(s) of a second dynamic range (for communication to a decoder) to represent (multiple) (primary) image(s) of a first dynamic range, the dynamic range change involving changing the image pixel brightness (e.g. from a first one normalized to a value of 1.0 to a second one normalized to a value of 1.0) by applying a function which is generally communicated as metadata together with the (multiple) second dynamic range image(s).
約5年前,高動態範圍視訊編碼的新穎技術被引入世界(例如,導致待於1000尼特(nit)的高階超高畫質(UltraHD Premium) tv上觀看的特殊HDR藍光光碟)。About 5 years ago, the new technology of high dynamic range video encoding was introduced to the world (leading to special HDR Blu-ray discs to be viewed on high-end UltraHD Premium TVs at 1000 nits, for example).
此在技術上處理影像的新穎方式在技術上與直到當時的過去50年所有視訊均根據其編碼的傳統(legacy)視訊編碼的許多方式形成對比,該傳統視訊編碼如今稱為標準動態範圍(Standard Dynamic Range, SDR)視訊編碼(亦稱為低動態範圍(low dynamic range)視訊編碼;LDR)。為表示影像,像素顏色的經數位編碼表示係必要的,且Rec. 709之SDR明亮度(luma)碼定義(亦稱為光電轉移函數(Opto-electrical transfer function) OETF)因為其近似平方根函數形狀(square root function shape)(明亮度:Y=sqrt(輝度L))而能夠編碼(使用8或10位元的明亮度字組)僅約1000:1的輝度動態範圍。然而,此完美地適於編碼待於該等時代具有大約在0.1及100尼特之間的(當時的所有顯示器之)一般輝度演現能力之顯示器上顯示的影像,後值係所謂的峰亮度(peak brightness, PB),亦稱為最大輝度。This technically novel way of processing images contrasts technically with many of the ways in which all video for the past 50 years had been coded until then, which is now known as Standard Dynamic Range (SDR) video coding (also known as low dynamic range (LDR) video coding). To represent images, a digitally coded representation of the pixel colors is necessary, and the SDR luma code definition of Rec. 709 (also known as the Opto-electrical transfer function (OETF)) is capable of encoding (using 8 or 10-bit luma words) a luminance dynamic range of only about 1000:1 because of its approximately square root function shape (luminance: Y=sqrt(luminance L)). However, this was perfectly suited for encoding images to be displayed on monitors of that era, which had a typical brightness rendering capability (of all monitors at the time) of between about 0.1 and 100 nits, the latter value being the so-called peak brightness (PB), also known as maximum brightness.
見Rec. 709明亮度碼定義函數無法數學地表示HDR場景影像輝度的巨大範圍(例如,在0.001尼特與10,000尼特的期望影像編碼峰亮度PB_C之間),HDR研究者最初藉由設計新的HDR碼分配解決此問題,該新的HDR碼分配在形狀上係更加對數的,使得可編碼更加多輝度(因為視覺系統需要較低的準確度,亦即,較亮區域的碼值比暗區域小,因此可理解到各十進位輝度分配2^8=256(其中^代表冪函數)中的50個碼,可已編碼100,000:1的動態範圍)。此係藉由使用SMPTE 2084-標準化(所謂的感知量化器(perceptual quantizer, PQ))函數來編碼HDR影像顏色的簡單「自然」方式。See Rec. 709 brightness code definition function cannot mathematically represent the huge range of HDR scene image brightness (for example, between 0.001 nit and the desired image coding peak brightness PB_C of 10,000 nits). HDR researchers initially solved this problem by designing a new HDR code allocation that was more logarithmic in shape, allowing for encoding of a wider range of brightness (because the visual system requires lower accuracy, i.e., the code values for brighter areas are smaller than for darker areas, so it can be understood that 50 codes in each decimal brightness allocation 2^8=256 (where ^ represents the brightness function) can encode a dynamic range of 100,000:1). This is a simple, "natural" way to encode the colors of HDR images by using the SMPTE 2084-standardized (so-called perceptual quantizer (PQ)) function.
可天真地認為編碼及解碼HDR影像就是全部,但事情並非如此簡單,因此額外編碼方法出現,特別係對本申請人之HDR視訊編碼及處理的先前發展方法。One might naively think that encoding and decoding HDR images is all there is to it, but things are not that simple, and therefore additional coding methods have emerged, particularly in relation to the applicant's previously developed methods for HDR video coding and processing.
為得到在HDR視訊編碼中所涉及或需要之事物的合宜理解,圖1總結一些重要態樣。To get a proper understanding of what is involved or required in HDR video encoding, Figure 1 summarizes some important aspects.
假設左側上具有所有可能(經PQ解碼)的HDR輝度的輝度範圍,至多PB_C=5000尼特。暫且假設為使此影像看起來如期望般完美,將稱為主HDR影像之此者的所有物體像素係在電腦上建立(於下文以圖2說明如何從,例如,廣播攝影機開始)。使用固有HDR編解碼器(其僅提供編碼至多10,000尼特之輝度的技術,亦即,在此實例中亦如期望地至多5000尼特)的問題係若消費者亦具有昂貴的5000尼特顯示峰亮度(PB_D)的顯示器(且若他在標準化觀看環境條件下觀看影像),他可完美地觀看創作者(例如,電影導演)所意欲的影像,但若他具有不同的顯示器(例如,PB_D=750尼特,或PB_D=100尼特),有尚未解決且亦不簡單的問題:如何在750尼特PB_D的顯示器上顯示5000尼特PB_C的影像?此似乎沒有優雅的簡單解決方案。施加準確輝度顯示將以至多750尼特的輝度完美地顯示所有物體,但將所有更明亮的物體像素修剪至相同的PB_D=750尼特,使許多影像物體消失在白色斑點區域中,其當然係不好看的。可認為內容的線性按比例調整係解決方案(將所有HDR輝度除以5000/750,其係所謂的內容白色映射至顯示器白色方法(map content-white-on-display-white approach)),但較暗物體(像實例場景影像ImSCN3中在洞穴的暗區域中具有HDR輝度(0.05尼特)的人,其對較小動態範圍顯示器可能已經太低)於是在750尼特顯示器上得到不可感知的暗輝度(0.05*750/5000= 0.0075尼特)。Assume on the left the luminance range of all possible (PQ decoded) HDR luminances up to PB_C=5000 nits. And assume for the moment that in order for this image to look as perfect as desired, all object pixels of what will be called the master HDR image are created on a computer (how to start from, for example, a broadcast camera is explained below with Figure 2). The problem with using native HDR codecs (which only provide the technology to encode brightness up to 10,000 nits, i.e. in this example also up to 5000 nits as desired) is that if the consumer also has an expensive monitor with a 5000 nit display peak brightness (PB_D) (and if he views the image under standardized viewing environment conditions), he can perfectly view the image as intended by the creator (e.g., movie director), but if he has a different monitor (e.g., PB_D=750 nits, or PB_D=100 nits), there is an unresolved and not simple problem: how to display an image with 5000 nits PB_C on a monitor with 750 nits PB_D? There seems to be no elegant simple solution to this. Applying an accurate brightness display will show all objects perfectly at a brightness of up to 750 nits, but will clip all brighter object pixels to the same PB_D=750 nits, causing many image objects to disappear into the white spot area, which is of course not a good look. One could argue that linear scaling of the content is a solution (dividing all HDR luminance by 5000/750, which is the so-called map content-white-on-display-white approach), but darker objects (like the person in the dark area of the cave in the example scene image ImSCN3 with HDR luminance (0.05 nits) which might already be too low for a smaller dynamic range display) would thus end up being imperceptibly dark on a 750 nit display (0.05*750/5000= 0.0075 nits).
圖1亦教示(不同典型HDR場景的)不同HDR影像可具有相當不同的需求,該等需求有關於如何將各種(可能在沿著HDR輝度範圍DR_2的「任意」輝度位置)HDR輝度擠壓成更小者,例如,LDR輝度動態範圍DR_1。FIG. 1 also teaches that different HDR images (of different typical HDR scenes) may have quite different requirements regarding how to squeeze various HDR luminances (possibly at “arbitrary” luminance positions along the HDR luminance range DR_2) into a smaller, e.g., LDR luminance dynamic range DR_1.
真實世界輝度的範圍可在,例如,當室內及室外的物體二者同時透過illumination_contrast*object_reflection_contrast =(1:100)*(1:100)觀看時,且雖然在表示場景之影像中的輝度一般不必或不會等同於原始場景輝度,對於好的HDR表示影像,將預期像素輝度的範圍可能至多至少1000尼特,並以至少0.1尼特或更小的最小值開始(因此DR_im >= 10,000:1)。此外,好的HDR影像可比有關於實體動態範圍本身更加有關於各種物體輝度沿著輝度範圍的智慧分配(更別說誤解其正在導引的位元量,其對非線性明亮度碼定義不為真,且10位元明亮度影像可僅係一些HDR影像以及SDR影像)。The range of real-world luminance can be, for example, when both indoor and outdoor objects are viewed simultaneously through illumination_contrast*object_reflection_contrast =(1:100)*(1:100), and while the luminance in the represented image need not or will not generally be identical to the original scene luminance, for a good HDR representation image one would expect pixel luminance to range up to at least 1000 nits, and to start with a minimum of at least 0.1 nit or less (so DR_im >= 10,000:1). Furthermore, good HDR images can be more about the intelligent distribution of the luminance of various objects along the luminance range than about physical dynamic range itself (not to mention misunderstanding the amount of bits it's directing, which by definition isn't true for non-linear luminance codes, and 10-bit luminance images can be only some HDR images as well as SDR images).
圖1顯示許多可能HDR場景的數個典型說明性實例,未來的HDR系統(例如,連接至1000尼特PB_D顯示器)可能需要能夠正確地處理,亦即,藉由顯示影像中的所有物體/像素的適當輝度。Figure 1 shows several typical illustrative examples of the many possible HDR scenes that a future HDR system (e.g. connected to a 1000 nit PB_D display) may need to be able to handle correctly, i.e. by displaying the appropriate brightness for all objects/pixels in the image.
例如,ImSCN1係來自西方電影的晴朗室外影像(其具有大部分明亮的區域,比陰天影像的平均更明亮,該等區域理想上應演現得比100尼特顯示器上更明亮,以提供比雨天模樣更晴朗的模樣,例如,具有比如說400尼特的平均輝度)。另一方面,ImSCN2係非常不同類型的影像,亦即,由暗區域(及,例如其等的良好可見度)主導的夜間影像,然而使此係HDR影像而非簡單的暗SDR影像的原因係亦有亮像素在街燈下方的地點中,且可能在房屋的發光窗戶中,且甚至有非常亮的像素(例如,3000尼特)在街燈的燈表面上。For example, ImSCN1 is a sunny outdoor image from a western movie (it has mostly bright areas, brighter than the average for cloudy images, which should ideally appear brighter than on a 100 nit display to provide a sunny look than a rainy look, e.g., with an average brightness of, say, 400 nits). On the other hand, ImSCN2 is a very different type of image, namely a night image dominated by dark areas (and, e.g., good visibility thereof), however what makes this an HDR image and not a simple dark SDR image is that there are also bright pixels in spots below street lights, and possibly in lit windows of houses, and there are even very bright pixels (e.g., 3000 nits) on the lamp surface of the street lights.
相對於ImSCN2的暗,什麼使此ImSCN1晴朗?不必然係相對輝度,至少在SDR範型中不是(將有遍布在0.1與100尼特之間的範圍上的像素輝度對二者影像係可能的,雖然此類輝度的空間分布,且特別係直方圖可係不同的)。使HDR影像演現與在僅於數年前終止的SDR時代中如何總是如此的演現不同的事物在於SDR具有此一有限動態範圍(約PB=100尼特,且最小黑色位準MB約0.1至1尼特),僅有物體的大部分固有反射率可在SDR中顯示(其將落於用於良好白色的90%及用於良好黑色的1%之間)。在經技術控制的均勻照明下,對於辨識物體(具有來自其等反射之某些量的亮度及當然來自其等色度)將係良好的,但對自然場景中可具有之照明本身中的美麗變化上,及可在觀看者上具有的何種衝擊性上(出自窗戶的陽光,或巫師輻射出的電漿)沒有那麼好。若顯示器允許其(且因此影像編碼及處理技術應如此),在森林中漫步真的看到陽光通過樹,亦即,而不係像SDR顯示器上僅有些更淡黃的印象,當人從陰影走入艷陽下時,想要看到明亮且鮮豔的陽光照射的衣服。且至少在PB_D允許時,火及爆炸也應具有最佳的視覺衝擊。What makes this ImSCN1 clear relative to the darkness of ImSCN2? Not necessarily the relative brightness, at least not in the SDR paradigm (it will be possible for both images to have pixel brightness spread over the range between 0.1 and 100 nits, although the spatial distribution of such brightness, and in particular the histogram, may be different). The thing that makes HDR image rendering different from how it was always done in the SDR era that ended only a few years ago, is that SDR has such a limited dynamic range (about PB=100 nits, and a minimum black level MB of about 0.1 to 1 nit) that only most of the intrinsic reflectivity of objects can be displayed in SDR (it will fall between 90% for a good white and 1% for a good black). Under technically controlled uniform illumination, it will be good for identifying objects (with some amount of brightness from their reflections and of course from their chromaticity), but not so good for the beautiful variations in lighting itself that can be present in natural scenes, and what kind of impact that can have on the viewer (sunlight coming out of a window, or plasma from wizard radiation). If the display allows it (and therefore the image encoding and processing technology should), walking in a forest really sees the sunlight through the trees, i.e., one wants to see bright and vivid sunlight-lit clothes as one walks from shadow into bright sunlight, rather than just a somewhat more yellowish impression as on an SDR display. And fire and explosions should also have optimal visual impact, at least when PB_D allows it.
在SDR中,可使夜間影像比正常照明影像更暗些,如可在明亮度直方圖中所感知者,但不會太暗,否則其將僅演現成太暗及醜(甚至可能大部分不可見)的影像(此係引入使夜間影像仍係相對明亮(但係藍色)之習知作法的原因)。而且,在100尼特TV或在100尼特的編碼中,正好沒有可用於過於明亮之任何者的空間。所以必須獨立於其等照明顯示物體,且無法同時如實地顯示場景之有時可發生的所有高對比照明。實務上,意謂著高度明亮的晴朗場景必須使用與陰暗雨天場景且甚至夜間場景幾乎相同的顯示器輝度(0至100尼特)演現。In SDR, night images can be made somewhat darker than normally lit images, as can be perceived in the luminance histogram, but not too dark, otherwise it would just render an image that is too dark and ugly (maybe even mostly invisible) (this is where the common practice of making night images still relatively bright (but blue) comes in). Moreover, in 100 nit TV, or in encoding at 100 nits, there is just no room for anything that is too bright. So objects must be displayed independently of their lighting, and all the high-contrast lighting that can sometimes occur in a scene cannot be faithfully displayed at the same time. In practice, this means that a highly bright sunny scene must be rendered using almost the same display luminance (0 to 100 nits) as a dark rainy scene, and even a night scene.
在現實生活中,人類視力也將適應可用光量,但沒有那麼多(大多數人在現實生活中確實認知到逐漸變暗,或他們在較暗或相當明亮的環境中)。且不應忘記顯示影像的電視不係對適應性眼睛的模擬,而係對現實生活環境的模擬,如其鑑於觀看環境及其他技術限制所得到的良好者。所以,想要以可藝術地設計至影像中的所有壯觀的局部亦且時間的照明效果顯示影像,以得到更逼真的演現影像(至少若終端觀看者具有可用的HDR顯示器)。精確地什麼是合適的輝度,比方說暗室中的光劍,將留給建立(多個)主分級(grading)的顏色分級者決定(為了此專利中之教示的簡單性起見,假設各種動態範圍影像,至少二個最極端不同的動態範圍係由人類分級者建立,但影像可類似地由自動軟體建立),且此申請案將聚焦在針對具有潛在不同需求的各種市場播放器建立及處理此類影像的所需技術成分上。In real life, human vision will also adapt to the amount of available light, but not that much (most people do perceive gradual darkening in real life, or they are in darker or quite bright environments). And one should not forget that the television displaying the image is not a simulation of the adapting eye, but a simulation of the real life environment, as good as it can be given the viewing environment and other technical limitations. So, one would like to display the image with all the spectacular local and also temporal lighting effects that can be artistically designed into the image, to get a more realistic rendering (at least if the end viewer has an HDR display available). Exactly what is the appropriate brightness for, say, a lightsaber in a dark room, will be left to the color grader who creates the master grading(s) (for simplicity of the teachings in this patent, it is assumed that various motion range images, at least two of the most extreme different motion ranges, are created by a human grader, but the images could similarly be created by automated software), and this application will focus on the technical components required to create and process such images for a variety of market players with potentially different needs.
在圖1的左軸上係當想要以(例如)5000尼特PB_C主HDR分級看到其等時可在5000尼特PB_D(參考)顯示器上直接顯示的物體輝度(亦即,影像分級者產生假設家中的一般高品質HDR TV將具有5000尼特PB_D的影像,且他可實際坐在此一家庭觀看室及在此一分級顯示器上之分級的表示中)。若想要不僅輸送已擷取之原始HDR場景的大約幻象,以及係在明亮的陽光照射環境中之牛仔的實際場景,必須將此等像素輝度指定且演現成足夠明亮,約例如平均500尼特。On the left axis of Figure 1 are the luminances of objects that can be directly displayed on a 5000 nit PB_D (reference) display when one wants to see them with a (for example) 5000 nit PB_C master HDR grade (i.e. the image grader generated the image assuming that a typical high quality HDR TV at home would have a 5000 nit PB_D and that he could actually sit in such a home viewing room and in the representation of the grade on such a graded display). These pixel luminances must be specified and rendered bright enough, around for example 500 nits on average, if one wants to deliver not only an approximate illusion of the original HDR scene that was captured, but also an actual scene of a cowboy in a bright sunlit environment.
對於夜晚場景,想要主要係暗輝度,但機車上的主要角色應係可良好辨識的,亦即不會太暗(例如,約5尼特),且同時可有,例如街燈之相當高輝度的像素,例如在5000尼特顯示器上的3000尼特左右,或在任何其他HDR顯示器(例如,1000尼特PB_D)上的峰亮度左右。第三實例ImSCN3顯示何者現在也可能在HDR顯示器上:可同時演現許多(在語義上更相關於不僅燈,亦即,相關於許多區域內細節,像是陽光照射的樹)非常明亮及許多重要而非常暗的像素二者。ImSCN3顯示何者可被視為係典型且相對難以處理的HDR場景影像,具有通過其可看見晴朗外側之開口的黑暗洞穴。針對此場景,可能想要產生比想要演現僅有明亮晴朗風景之印象的場景更不亮一些的陽光照射物體(像是樹),例如,400尼特左右,其應與洞穴內側之基本上黑暗的角色更協調。顏色分級者可能想要最佳地協調所有物體的輝度(已在PB_HDR=5000尼特主HDR影像中),使得沒有東西看起來不適當地暗或亮且對比良好,例如站在此洞穴中之暗處中的人可以0.05尼特左右編碼在主HDR經分級影像中。For night scenes, one wants mainly dimness, but the main character on the motorcycle should be well recognizable, i.e. not too dim (e.g. around 5 nits), and at the same time there can be quite high brightness pixels, e.g. street lights, e.g. around 3000 nits on a 5000 nit display, or around the peak brightness on any other HDR display (e.g. 1000 nit PB_D). The third example ImSCN3 shows what is now also possible on an HDR display: both many (semantically more related to not only lights, i.e. related to many in-area details, like a sunlit tree) very bright and many important but very dark pixels can be rendered at the same time. ImSCN3 shows what can be regarded as a typical and relatively difficult HDR scene image, with a dark cave with an opening through which the clear outside can be seen. For this scene, one might want to produce sunlit objects (like trees) less bright than a scene where one wants to give the impression of just a bright sunny scenery, e.g. around 400 nits, which should be more in tune with the essentially dark characters inside the cave. A color grader might want to best harmonize the brightness of all objects (already in the PB_HDR=5000 nit master HDR image) so that nothing looks unduly dark or bright and the contrast is good, e.g. a person standing in the dark in this cave could be encoded at around 0.05 nits in the master HDR graded image.
此主HDR影像已建立的情況下,藝術問題(甚至在將其以實現技術公式化之前)係之後應如何將此影像重新分級至不同的動態範圍,例如至少100尼特PB_C傳統SDR影像。Once this master HDR image has been created, the artistic question (even before it is formulated in terms of implementation technology) is how this image should then be regraded to a different dynamic range, such as a conventional SDR image of at least 100 nits PB_C.
其有助於當給定輝度之間的關係時的清晰度,因此在方便時將於此專利中如此做。實際上技術方面,輝度將經由明亮度碼分配函數(亦稱為光電轉移函數(OETF))編碼為明亮度,且因此亦可將輝度間的所有關係公式化,例如,從輸入L_in計算輸出輝度L_out的函數,等效明亮度之間的關係亦如此。It helps to clarify when the relationship between luminances is given, so this will be done in this patent when convenient. In practical technical terms, luminance will be encoded as brightness via a brightness code allocation function (also known as a photoelectric transfer function (OETF)), and therefore all relationships between luminances can also be formulated, such as a function that calculates the output luminance L_out from the input L_in, and the same is true for the relationship between equivalent brightness.
或許有些令人混淆,亦可以正規化(例如,使用等於1.0的最大正規化輝度)方式將輝度公式化,並定義此類經正規化輝度上的所有動作。此具有經正規化HDR色域精確重疊LDR色域的優點(條件係二者的影像像素顏色均定義在相同的RGB原色組中),並因此可以此單一經正規化色域顯示輝度變化。明顯地,應以與定義在具有PB_C=5000之HDR輝度輝度範圍中的HDR輝度完全相同的絕對輝度顯示的經正規化LDR輝度的相對位置將具有不同的相對高度(亦即,然後可以此一色域表示顯示用於當隨此經正規化色域中之高度的相對/經正規化變化而建立對應的LDR影像像素輝度時所需的特定HDR像素輝度的輝度映射)。絕對與相對輝度之間的關係係簡單的:L_norm=L_abs/PB_C,其中PB_C係編碼的任何最大輝度,例如HDR編碼的5000尼特,及按照標準協議之用於SDR的100尼特。Perhaps somewhat confusingly, it is also possible to formulate luminance in a normalized (e.g., using a maximum normalized luminance equal to 1.0) way, and define all actions on this normalized luminance. This has the advantage that the normalized HDR color gamut exactly overlaps the LDR color gamut (provided that the image pixel colors of both are defined in the same set of RGB primaries), and thus luminance variations can be displayed in this single normalized color gamut. Obviously, the relative position of a normalized LDR luminance that should be displayed with exactly the same absolute luminance as an HDR luminance defined in the HDR luminance luminance range with PB_C=5000 will have a different relative height (i.e., the luminance mapping required for a particular HDR pixel luminance when creating the corresponding LDR image pixel luminance as a function of relative/normalized variation of height in this normalized color gamut can then be displayed in this color gamut representation). The relationship between absolute and relative luminance is simple: L_norm=L_abs/PB_C, where PB_C is any maximum luminance encoded, e.g. 5000 nits for HDR encoding, and 100 nits for SDR as per standard protocol.
習自圖1的最後重要事項(因為所有技術均必須據此行為)係取決於在何種類型的HDR場景中處理何種種類的物體(亦即,其像素輝度),在如何重新分級該(多個)像素輝度上(亦即,輝度變換)可有不同的高階方法。The last important thing to learn from Figure 1 (because all techniques must behave accordingly) is that depending on what kind of objects (i.e., their pixel brightness) are being processed in what type of HDR scene, there can be different high-level approaches on how to rescale the pixel brightness (i.e., the brightness transformation).
例如,黑暗中的物體(像是機車騎士)可藉由使等同所有重新分級影像的絕對輝度(其涉及經正規化輝度的對應按比例調整變化)而演現,特別係在左方的起始主HDR影像、在右方的對應SDR影像、及在其間的任何中等動態範圍(medium dynamic range, MDR)影像,例如,以對在800尼特PB_D顯示器(例如,對於已購買此一顯示器並,例如從他的有線電視供應商、或經由衛星機上盒、或從網際網路等得到5000尼特PB_C HDR影像的消費者)上的直接顯示最佳化(具有正確的物體輝度)的PB_C=PB_MDR=800尼特顯示。此係合理的,因為內容的創作者想要輸送機車僅係可見的黑暗氣氛,且僅為了此一顯示器可如此作的原因(因為其具有在較高的PB_D結束以顯示場景中的所有物體輝度的較大輝度範圍),在較亮的顯示器上使其演現得更明亮將係不佳的。For example, an object in the dark (like a motorcyclist) can be rendered by equating the absolute luminance of all regraded images (which involves corresponding proportionally adjusted changes in normalized luminance), in particular the starting master HDR image on the left, the corresponding SDR image on the right, and any medium dynamic range (MDR) images in between, e.g., with a PB_C = PB_MDR = 800 nit display optimized (with correct object luminance) for direct display on an 800 nit PB_D display (e.g., for a consumer who has purchased such a display and gets a 5000 nit PB_C HDR image, e.g., from his cable TV provider, or via a satellite set-top box, or from the Internet, etc.). This makes sense since the creators of the content wanted to convey a dark atmosphere where only the motorcycle is visible, and for the sole reason that this display can do so (since it has a larger luminance range ending at a higher PB_D to show the luminance of all objects in the scene), making it appear brighter on a brighter display would be poor.
物體(像是太陽)或許將遵循完全不同的原則,亦即,映射白色在白色上的方法(map white-on-white method),其中總是給定任何影像表示中的最高可能值,亦即PB_C。明顯地,其他類型的物體可遵循其他類型的規則,並可繼續下去(例如,牛仔將遵循經按比例調整中間灰色原則),但讀者理解必須具有允許所有像素輝度的幾乎「任意」分配之技術的一者係足夠的,而非例如固定者(像是簡單技術將指定的)。Objects (like the sun) would probably follow a completely different principle, namely the map white-on-white method, where the highest possible value in any image representation is always given, namely PB_C. Obviously, other types of objects may follow other types of rules, and could go on (for example, cowboys would follow the scaled middle gray principle), but the reader understands that it must be sufficient to have one of the techniques that allows an almost "arbitrary" distribution of brightness for all pixels, rather than, for example, a fixed one (as the simple technique would specify).
而圖1簡單地總結多樣化HDR影像建立的需求(跨越此類受不同技術限制的應用,像是電影、即時運動廣播等),然後HDR技術開發者的問題仍係如何編碼HDR影像,及亦如何變換HDR影像以能夠將其等最佳地顯示在具有比經編碼PB_C(亦即,可能在視訊中發生至少一次的最亮像素)更小之PB_D的任何顯示器上。擷取HDR場景影像,且同樣重要的美術指導與照明HDR場景亦係技術技能,但本申請案不必聚焦在該態樣上。While Figure 1 simply summarizes the requirements for creating diverse HDR images (across such applications as movies, live sports broadcasting, etc., which are subject to different technical constraints), the problem for HDR technology developers remains how to encode HDR images, and also how to transform HDR images so that they can be optimally displayed on any display with a PB_D smaller than the encoded PB_C (i.e., the brightest pixel that may occur at least once in the video). Capturing HDR scene images, and equally important, artistically directing and lighting HDR scenes are also technical skills, but this application does not necessarily focus on that aspect.
所設想的最簡單的事情係僅編碼HDR像素輝度(忽略顯示調適(display adaptation, DA)的複雜度,亦即,如何將PB_C1影像映射成用於能力較小之顯示器的影像)。問題係Rec. 709 OETF僅可編碼1000:1的輝度動態範圍,亦即,必須發明新的HDR OETF(或實際上其反式,EOTF)。稱為HDR10的第一HDR編解碼器已引入市場,其用以例如建立新的黑帶HDR藍光光碟,且其將在SMPTE 2084中標準化之稱為感知量化器(Perceptual Quantizer, PQ)函數的更對數地成形的函數使用為OETF,且其允許定義用於在1/10,000尼特與10,000尼特之間的輝度的明亮度(對實務的HDR視訊生產足夠的)。此外,其具有其所產生的明亮度碼適應人類視力如何運作的不錯性質(大腦用以特性化場景中之不同輝度的感知灰度值種類,其係用於有效率地重新分級某些灰度值物體,及用於有效率地表示輝度二者(如大腦所作的)的不錯性質)。在明亮度的計算之後,僅具有10位元的像素平面(或更確切地說,亦具有二個色度平面,Cb及Cr 3位元平面),其可進一步在該線下方受經典地處理,「彷彿」其等在數學上係SDR影像,例如,經MPEG壓縮(此係重要限制,因為其避免總視訊管線中的數個預先存在的技術的重新設計及重新部署)。The simplest thing to envision is to just encode HDR pixel brightness (ignoring the complexity of display adaptation (DA), i.e. how to map a PB_C1 image to an image for a less capable display). The problem is that the Rec. 709 OETF can only encode a brightness dynamic range of 1000:1, i.e. a new HDR OETF (or actually its inverse, the EOTF) had to be invented. The first HDR codec called HDR10 has been introduced on the market, which is used for example to create new black band HDR Blu-ray discs, and it uses a more logarithmically shaped function called Perceptual Quantizer (PQ) function standardized in SMPTE 2084 as OETF, and it allows to define brightness for luminances between 1/10,000 nits and 10,000 nits (enough for practical HDR video production). Furthermore, it has the nice property that the brightness codes it produces are adapted to how human vision works (the kinds of perceived gray values the brain uses to characterize different luminances in a scene, which is a nice property for efficiently re-grading certain gray value objects, and for efficiently representing luminance both (as the brain does)). After the calculation of the luminance, one has only a 10-bit pixel plane (or more precisely, also two chrominance planes, Cb and Cr 3-bit planes), which can be further processed classically below the line, "as if" they were mathematically SDR images, e.g. MPEG compressed (this is an important limitation, since it avoids the redesign and redeployment of several pre-existing technologies in the overall video pipeline).
使用HDR10影像的顯著技術困難仍係如何適當地將其等顯示在能力較小的顯示器上(例如,能力小於HDR內容針對其製造的2000 PB_C)。例如,若僅線性地映射白色在白色上(經編碼影像最大白色,亦稱為將峰亮度PB_C編碼至例如SDR顯示器峰亮度PB_D),具有PB_C=1000尼特之影像的最感興趣(較暗)部分一般將看起來10x太暗,其將意謂著夜間場景ImSCN2變得不可見。由於PQ OETF的對數本質,HDR10影像係可見的(當僅演現為明亮度時,亦即以錯誤的EOTF解碼),但具有醜陋的惡化對比,使其等除了其他事物之外看起來褪色且係亮度不正確的。A significant technical difficulty with HDR10 images remains how to properly display them on less capable displays (e.g. less than the 2000 PB_C for which the HDR content was made). For example, the most interesting (darker) parts of an image with PB_C = 1000 nits will typically look 10x too dark if one just linearly maps white on white (encoded image maximum white, aka encoding peak luminance PB_C to e.g. SDR display peak luminance PB_D), which would mean that night scenes ImSCN2 become unviewable. Due to the logarithmic nature of the PQ OETF, HDR10 images are visible (when only luminance is rendered, i.e. decoded with a wrong EOTF), but with an ugly deteriorated contrast that makes them look among other things washed out and brightness incorrect.
以圖 2 解釋用於建立HDR視訊內容的簡單系統(例如,在廣播場景中)。再者,為保持說明簡單,仍不考慮非線性明亮度或R’G’B’像素顏色碼分配的細節(所謂的光-光學方法(Opto-optical approach):OOTF,在整個鏈中使用正常(絕對)輝度)。使用攝影機(201)曝光(EXP),可選擇如實地記錄哪個物體的輝度,及在其的相對值(因為攝影機的功能如同所有空間位置的相對輝度計,或產生RGB三元組(triplet)的相對色度計)。因為攝影機感測器及顏色分量的N-位元數學表示二者實際上均具有最終範圍(在最小值開始並在最大值結束),不暴露十億尼特之太陽的細節係合理的,但至少將該等輝度或RGB值修剪至其等最大值。在實質無限的範圍中,曝光選擇可藉由稍後的輝度重新映射「校正」,但在任何情形中,此事實對讀者說明沒有場景輝度至待顯示輝度的「自然」明顯映射(後者輝度的參考係已知為顯示器相關比色法,且其事實上係是最重要的該一者)。線性輝度影像LIN_HDR一般首先係經受OOTF映射(202)。此在SDR時代已某種程度地存在,並對觀看電視之晚上客廳的一般較暗的觀看環境中的人類視力對類似的視覺體驗需要較高之對比的事實進行校正,因此OOTF一般係軟伽瑪函數(soft gamma function)。然而,特別當將可觀的動態範圍的場景映射在較小動態範圍的一般顯示器(205)上時(甚至在其係高品質4000尼特的參考監視器時),各種物體像素輝度的某種藝術最佳化可藉由分級單元203施加可能任意的曲線(其在本文中將稱為分級)而就緖。特別係對於離線高品質生產,分級效果可能係可觀的,以將所謂的創意視覺或模樣置於主HDR影像MAST_HDR中(其根據本發明仍必須受進一步技術性處理,例如,經有利地編碼)。然後所得影像看起來最佳並可經由某個影像通訊連接204發送至顯示器205,在該顯示器上,人類分級者可檢查影像是否已如所期望的,或經由使用者介面控制單元206(例如,分級控制台)繼續微調至少一個輝度映射函數。此任意分級形成主模樣(不與任意重新分級(次模樣)混淆)以獲得,例如,盡可能最佳的對應SDR影像,其可稱為主SDR影像(例如,當如下文所述地形成視訊編碼原則的一部分時)。雖然僅為讀者說明一個更簡單的拓樸,讀者可理解實務上取決於,例如,是否有僅使用單一HDR攝影機或許多混合的SDR與HDR攝影機進行的現實生活廣播,或先前判定的HDR影像與對應的重新分級SDR主影像,其等現在需要根據編碼原理(例如,見於下文的ETSI1或ETSI2原理)共編碼等而可能有不同的實際實施例。A simple system for creating HDR video content (e.g. in a broadcast scenario) is explained with reference to FIG2 . Again, to keep the explanation simple, the details of non-linear brightness or R'G'B' pixel color code assignment are not considered (the so-called Opto-optical approach: OOTF, using normal (absolute) luminance throughout the chain). Using the camera (201) exposure (EXP), it is possible to choose which object's luminance is faithfully recorded and at what relative value (because the camera functions as a relative luminometer for all spatial positions, or a relative colorimeter producing RGB triplets). Since both the camera sensor and the N-bit mathematical representation of the color components actually have an ultimate range (starting at a minimum and ending at a maximum), it is reasonable not to expose the details of the billion-nit sun, but at least to clip the luminance or RGB values to their maximum values. In a virtually infinite range, the exposure choice can be "corrected" by a later luminance remapping, but in any case this fact indicates to the reader that there is no "natural" obvious mapping of scene luminance to the luminance to be displayed (the reference to the latter luminance is known as display-dependent colorimetry, and is in fact the most important one). The linear luminance image LIN_HDR is generally first subjected to OOTF mapping (202). This already exists to some extent in the SDR era, and corrects for the fact that human vision in the generally darker viewing environment of a nighttime living room when watching TV requires higher contrast for a similar visual experience, so the OOTF is generally a soft gamma function. However, particularly when mapping scenes with a considerable dynamic range on a general display (205) with a smaller dynamic range (even when it is a high quality 4000 nit reference monitor), some kind of artistic optimization of the brightness of the pixels of various objects can be achieved by applying a possibly arbitrary curve (which will be referred to herein as a grading) by the grading unit 203. Particularly for offline high quality production, the grading effect may be appreciable in order to place a so-called creative look or look in the master HDR image MAST_HDR (which according to the invention still has to be further technically processed, e.g. advantageously encoded). The resulting image then looks optimal and can be sent via a certain image communication connection 204 to a display 205, where a human grader can check whether the image is as desired, or continue to fine-tune at least one brightness mapping function via a user interface control unit 206 (e.g. a grading console). This arbitrary grading forms the master look (not to be confused with an arbitrary re-grading (secondary look)) in order to obtain, e.g., the best possible corresponding SDR image, which can be called the master SDR image (e.g. when forming part of a video coding strategy as described below). Although only a simpler topology is explained to the reader, the reader will understand that in practice there may be different actual implementations depending on, for example, whether there is a real-life broadcast using only a single HDR camera or many mixed SDR and HDR cameras, or previously determined HDR images and corresponding re-graded SDR master images, which now need to be co-encoded according to coding principles (e.g., ETSI1 or ETSI2 principles seen below), etc.
申請人理解到因為在從主HDR開始的各種可能的重新分級MDR影像之間有數學重新分級關係,倘若可技術務實地擷取該等函數,可事實上編碼不同動態範圍函數的整體頻譜(藉由發送其等之僅一者)及至少一個輝度映射函數以從已實際發送的該者建立另一影像(已以圖1說明)。此可能性及隨後的技術編碼概念的首次引入在WO2011107905中完成。The applicants understood that since there is a mathematical re-grading relationship between the various possible re-graded MDR images starting from the master HDR, if these functions can be technically and practically extracted, it is possible to actually encode the entire spectrum of different dynamic range functions (by sending only one of them) and at least one brightness mapping function to create another image from the one that has actually been sent (already illustrated with FIG. 1). The first introduction of this possibility and the subsequent technical coding concept was done in WO2011107905.
已發現定義用於將(例如,5000尼特PB_C)主HDR影像輝度變換成SDR影像輝度的輝度映射函數F_L係合理的,亦即,使分級者在最極端的影像表示之間定義所需的重新分級行為,且接著重新計算經顯示調適的輝度映射函數F_L_DA以用於計算對應於任何可能的5000尼特PB_C M_HDR影像輝度的其間MDR影像像素輝度。It has been found to be reasonable to define a brightness mapping function F_L for transforming a (e.g. 5000 nits PB_C) master HDR image luminance to an SDR image luminance, i.e., to allow the grader to define the desired regrading behavior between the most extreme image representations, and then recalculate the display-adapted brightness mapping function F_L_DA for calculating the intermediate MDR image pixel luminances corresponding to any possible 5000 nits PB_C M_HDR image luminance.
在申請人於隨後標準化時,之後有影像的二個邏輯選擇以實際傳輸(作為用於不同動態範圍之可重新分級影像的整體頻譜的單一影像,特別係PB_C端點,此係因為往往可假設下端點MB係大致固定的,例如0.01尼特)至任何接收器:主HDR影像或對應的SDR影像(應停止一秒以理解在該情況下,因為L_HDR_reconstructed=F_L_inverse[L_SDR],且事實上因為HDR影像仍亦傳達至F_L函數,所以傳輸實際普通的SDR影像而非HDR影像)。When the applicant subsequently standardizes, there are then two logical choices of the image to actually transmit (as a single image of the entire spectrum of rescalable images for different dynamic ranges, in particular the PB_C endpoint, since it can often be assumed that the lower endpoint MB is roughly fixed, e.g. 0.01 nit) to any receiver: the main HDR image or the corresponding SDR image (one should stop for a second to understand that in this case, since L_HDR_reconstructed = F_L_inverse[L_SDR], and in fact since the HDR image is still also passed to the F_L function, the actual normal SDR image is transmitted instead of the HDR image).
第二編碼選項首先在ETSI TS 103 433-1下標準化(注意該-1 ;將其縮寫為 ETSI1 ),該第二編碼選項在技術限制係許多傳統顯示器需要以不受干擾的方式服務時係相當有用的(事實上舊的SDR顯示器僅得到SDR影像,且不需要知道此亦編碼HDR影像,其可直接顯示SDR影像並立即得到HDR場景的非常不錯的SDR演現,事實上顯示器可儘可能顯示此一HDR場景)。注意到具有對能夠以足夠精密度在任何接收側重建原始主HDR影像的技術限制(像是對SDR影像顏色之可逆性的需求),其係導致如其所定義的標準編(解)碼方法之技術設想的一部分。The second coding option, first standardized under ETSI TS 103 433-1 (note the -1 ; abbreviated to ETSI1 ), is quite useful when the technical limitation is that many legacy displays need to be served in an undisturbed way (in fact an old SDR display only gets an SDR image and does not need to know that this also encodes an HDR image, it can directly display the SDR image and immediately get a very good SDR rendition of the HDR scene, in fact the display can display this HDR scene as much as possible). Note that there are technical limitations to be able to reconstruct the original master HDR image with sufficient precision on any receiving side (like the need for reversibility of SDR image colors), which is part of the technical assumption that led to the standard encoding (de)coding method as defined.
ETSI TS 103 433-2 (ETSI2 )係實際上將主HDR影像傳達至接收器的編碼替代方案,且其中(多個)函數F_L(實際上如將於下文所顯示的,雖然為了說明,可將系統設想為彷彿有用於該經傳達影像中的所有像素輝度的單一全域F_L函數,為了技術原因,使用一組隨後施加的映射函數)作用以計算用於在具有PB_D<PB_C_master之顯示器上最佳地顯示的影像(亦即,用於所謂的顯示調適)。各種消費者可選擇其等欲採用的系統,例如,傳達ETSI2 HDR的有線電視營運商將對其使用者部署STB,該STB將針對使用者恰巧在家中所具有的任何顯示器解碼及最佳化。ETSI TS 103 433-2 ( ETSI2 ) is a coding alternative that actually delivers a master HDR image to a receiver, and in which function(s) F_L (actually as will be shown below, although for illustration the system can be imagined as if there were a single global F_L function for all pixel luminances in the delivered image, for technical reasons a set of subsequently applied mapping functions is used) acts to compute the image for optimal display on a display with PB_D < PB_C_master (i.e. for so-called display adaptation). Various consumers can choose the system they wish to adopt, for example a cable operator delivering ETSI2 HDR will deploy STBs to its users that will decode and optimize for whatever display the user happens to have at home.
圖 3 首先以鳥瞰圖層級顯示一般單一影像增強功能(single-image-plus-function) HDR視訊通訊系統(編碼器+解碼器)的組件,未限制用於解釋基本觀念之目的的SDR通訊類型的一般系統。 FIG3 first shows the components of a general single-image-plus-function HDR video communication system (encoder + decoder) at a bird's-eye view level, without limiting the general system to SDR communication type for the purpose of explaining the basic concepts.
顏色變換器302從影像源301得到MAST_HDR影像作為輸入(例如,當其等由攝影機擷取並藉由以圖2所說明之系統分級,且接著透過一些專業視訊通訊系統傳達至廣播台側編碼器321時,其例如將透過空氣或經由電視有線網路傳輸電視節目)。然後施加一組顏色變換函數F_ct(在此實例中,例如,藉由分級自動軟體判定,諸如,申請人的自動HDR至SDR轉換技術,其基於影像特性(諸如,直方圖等)定義F_ct函數;特定細節可對此申請案的說明擱置,因為其僅需要用於任何影像或時間連續影像組的此類最佳化函數存在),包含至少亮度映射函數F_L,以獲得主HDR影像(MAST_HDR)像素之輝度的對應SDR輝度。為易於瞭解,讀者可為了簡單性而假設F_L係4分之一次方的輝度映射函數(L_out_SDR=power(L_in_HDR; ¼)),用於推導經正規化至100尼特PB_C SDR輸出影像Im_LDR(亦即,圖1的右側輝度範圍)中之像素的1.0 SDR輸出輝度。The color converter 302 gets as input the MAST_HDR images from the image source 301 (e.g. when they are captured by a camera and graded by the system illustrated in FIG. 2 , and then passed through some professional video communication system to the broadcast station side encoder 321 , which will transmit a TV program over the air or via a TV cable network, for example). A set of color transformation functions F_ct is then applied (in this example, for example, as determined by graded automatic software, e.g., the applicant's automatic HDR to SDR conversion technique, which defines the F_ct function based on image characteristics (e.g., histogram, etc.); the specific details may be left out of the description of this application, as it is only required that such an optimization function exists for any image or set of time-sequential images), including at least a brightness mapping function F_L, to obtain the corresponding SDR brightness of the brightness of the pixels of the master HDR image (MAST_HDR). For ease of understanding, the reader may assume for simplicity that F_L is a 4th power luminance mapping function (L_out_SDR=power(L_in_HDR; ¼)) used to derive the 1.0 SDR output luminance of a pixel in the PB_C SDR output image Im_LDR (i.e., the luminance range on the right side of Figure 1) normalized to 100 nits.
因為現在有「正常」SDR影像,其可用標準視訊壓縮技術(例如,MPEG標準(像是HEVC或MPEG2),或類似標準(像是AV1))壓縮,該壓縮係由視訊壓縮器303執行。Because there are now “normal” SDR images, they can be compressed using standard video compression techniques (e.g., MPEG standards (such as HEVC or MPEG2), or similar standards (such as AV1)), which are performed by the video compressor 303.
因為接收器必須能夠從所接收的對應經壓縮SDR影像Im_COD重建主HDR影像,除了待傳輸的實際像素化影像外,顏色映射函數F_ct亦必須進入視訊壓縮器。在沒有限制的情況下,可假設函數儲存在後設資料中,例如,藉由SEI(補充增強資訊(supplemental enhancement information))機制或類似技術的方式。最後,格式器304進行所有需要者以針對任何技術的通訊媒體305格式化(放入資料塊等)視訊串流,例如,針對藍光光碟上的儲存,或針對透過衛星的DVB通訊等格式化(此細節可由各別技術領域中具有通常知識者發現,且與理解本發明概念無關)。Since the receiver must be able to reconstruct the master HDR image from the corresponding compressed SDR image Im_COD received, in addition to the actual pixelated image to be transmitted, the color mapping function F_ct must also enter the video compressor. Without limitation, it can be assumed that the function is stored in metadata, for example, by means of a SEI (supplemental enhancement information) mechanism or similar techniques. Finally, the formatter 304 performs all the necessary steps to format (put data blocks, etc.) the video stream for a communication medium 305 of any technology, for example, for storage on a Blu-ray disc, or for DVB communication via satellite, etc. (this detail can be found by a person having ordinary knowledge in the respective technical field and is not relevant for understanding the concept of the invention).
在視訊接收器320中的MPEG解壓縮由視訊解壓縮器307執行後(在已通過未格式器(unformatter) 306後),SDR影像可藉由施加標準Rec. 709 EOTF由接收器解譯(以得到用於SDR顯示器的影像),但接收器亦可不同地解碼所接收的Im_COD影像,以獲得經重建HDR影像Im_RHDR。After MPEG decompression in the video receiver 320 is performed by the video decompressor 307 (after having passed through the unformatter 306), the SDR image can be interpreted by the receiver by applying the standard Rec. 709 EOTF (to obtain an image for an SDR display), but the receiver can also decode the received Im_COD image differently to obtain a reconstructed HDR image Im_RHDR.
此係由顏色變換器308執行,該顏色轉換器經配置以將如經壓縮Im_RLDR的SDR影像變換成任何非SDR動態範圍(亦即,高於100尼特之PB_C,且一般至少高於6x)的影像。例如,5000尼特的原始主影像Im_RHDR可藉由施加在編碼側使用的顏色變換F_ct的逆顏色變換IF_ct而重建,以從MAST_HDR產生Im_LDR(且其等在後設資料中接收並通過顏色變換器308)。或者,可包含顯示調適單元309,其將SDR影像Im_RLDR變換成不同的動態範圍,例如,假如顯示器310係3000尼特PB顯示器,受最佳分級的Im3000尼特,或用於對應較低PB_D顯示器的1500尼特或1000尼特PB影像等。已非限制性地假設視訊解碼器及顏色變換器係在單一視訊接收器320中。具有通常知識的讀者可理解可類似地設計具有,例如,分開在待連接至顯示器之機上盒中的解碼功能的許多不同拓樸,該顯示器僅作用為用於如所接收之預最佳化影像的基本型顯示器(dumb display),或其作進一步的影像顏色變換等。This is performed by a color converter 308, which is configured to convert an SDR image such as compressed Im_RLDR to an image of any non-SDR dynamic range (i.e., PB_C above 100 nits, and typically at least above 6x). For example, the original master image Im_RHDR at 5000 nits can be reconstructed by applying the inverse color transform IF_ct of the color transform F_ct used on the encoding side to generate Im_LDR from MAST_HDR (and which are received in the metadata and passed through the color converter 308). Alternatively, a display adaptation unit 309 may be included which converts the SDR image Im_RLDR to a different dynamic range, e.g., if the display 310 is a 3000 nit PB display, the optimally graded Im3000 nit, or a 1500 nit or 1000 nit PB image for a lower PB_D display, etc. It has been assumed without limitation that the video decoder and the color converter are in a single video receiver 320. A reader with ordinary knowledge will appreciate that many different topologies may be similarly designed with, for example, the decoding function separated in a set-top box to be connected to a display, the display acting only as a basic display (dumb display) for pre-optimized images as received, or it performing further image color conversion, etc.
圖 4 簡短地總結如在ETSI2中標準化之申請人的輝度及顏色映射技術的原理(事實上其詳述顏色變換器302,其根據ETSI2解碼原則(或類似地ETSI1編碼原則)大致在圖3中介紹),因為必須理解其以理解本申請案的一些更特定實施例技術。 FIG . 4 briefly summarizes the principles of the applicant's brightness and color mapping technique as standardized in ETSI2 (in fact it details a color converter 302 which is roughly introduced in FIG. 3 according to the ETSI2 decoding principles (or similarly the ETSI1 encoding principles)), because it must be understood in order to understand some of the more specific embodiment techniques of the present application.
輸入應係經PQ定義的YCbCr像素顏色(亦即,每像素的明亮度Y及色度Cb與Cr顏色分量)。首先,藉由必須使用SMPTE 2084 PQ EOTF的EOTF施加單元401將明亮度線性化成正常線性輝度L_in。然後可再次使用正常(實體SI及CIE普遍定義的)輝度定義從輸入HDR像素顏色獲得SDR輸出像素顏色的完整重新分級程序。在此之後,輝度處理可由輝度處理器401執行,該輝度處理器如期望但藉由明智選擇的子單元(此等單元402、403等在技術上設計成有利於各種HDR應用的需要,諸如,自動分級、易於人類分級、IC設計的複雜度等)實現總F_L映射。The input should be PQ defined YCbCr pixel colors (i.e., luminance Y and chrominance Cb and Cr color components per pixel). First, the luminance is linearized to normal linear brightness L_in by the EOTF application unit 401 which must use the SMPTE 2084 PQ EOTF. The complete re-grading process from the input HDR pixel colors to the SDR output pixel colors can then be defined again using the normal (physical SI and CIE universally defined) brightness. After this, luminance processing may be performed by a luminance processor 401 which implements the overall F_L mapping as desired but through judiciously chosen sub-units (these units 402, 403, etc. are technically designed to facilitate the needs of various HDR applications, e.g., automatic grading, ease of human grading, complexity of IC design, etc.).
首先,輝度均勻化器藉由施加定義如下之曲線族的取決於PB之一者而施加其形狀僅取決於輸入HDR影像之峰亮度PB_C_H(PB_C_H=例如,5000尼特)的固定曲線變換: Y’HP=log(1+(RHO-1)*power(L_in/PB_C_H; 1/(2.4)))/log(RHO) [方程式1] 其中 RHO= 1+32*power(PB_C_H/10000;1/2.4) [方程式2]First, the brightness equalizer applies a fixed curve transformation whose shape depends only on the peak brightness PB_C_H of the input HDR image (PB_C_H = e.g., 5000 nits) by applying one of the PB-dependent curves defined as follows: Y’HP=log(1+(RHO-1)*power(L_in/PB_C_H; 1/(2.4)))/log(RHO) [Equation 1] Where RHO= 1+32*power(PB_C_H/10000;1/2.4) [Equation 2]
此將所有輝度映射至在感知上均勻的灰色明亮度Y’HP。若PB_C_HDR =10000尼特,則此曲線緊密對應於SMPTE 2084 PQ曲線,其已知係感知均勻的。對於較低PB_C_HDR的輸入影像,曲線良好地按比例調整(事實上,其在絕對意義上表示在10000尼特曲線上在,例如,3000尼特結束的次曲線),導致經正規化[0-1.0]/[0-1.0]輸入/輸出輝度軸表示中的最暗顏色對數伽瑪曲線較不陡峭。亦即,該處理的其餘部分已開始良好地預正規化。This maps all luminances to a perceptually uniform grey brightness Y'HP. If PB_C_HDR = 10000 nits, this curve corresponds closely to the SMPTE 2084 PQ curve, which is known to be perceptually uniform. For input images with lower PB_C_HDR, the curve scales well (in fact, it represents a subcurve on the 10000 nit curve that ends at, for example, 3000 nits), resulting in a less steep log-gamma curve for the darkest colors in the normalized [0-1.0]/[0-1.0] input/output luminance axis representation. That is, the rest of the process is already well pre-normalized to begin with.
隨後,黑色-白色位準偏移器403可在所期望處施加某個添加白色位準偏移WLO及某個黑色位準偏移BLO。The black-white level shifter 403 may then apply an additive white level offset WLO and a black level offset BLO where desired.
白色位準偏移的效用可理解如下。假設內容創作者在設定在PB_C=4000尼特的系統(亦即,例如他的參考分級監視器具有4000尼特的PB_D)上分級他的影像,然而,在整個視訊中,他從未使用高於,例如,1500尼特(視訊最大值,係與可編碼最大值PB_D不同的事物)的像素最大亮度實際產生影像。然後,因為SDR輝度動態範圍按原樣足夠小,重新按比例調整將該等未使用值下降1500至4000尼特的輸入HDR係合理的(因為正在使用可動態調整的輝度映射,無論如何可每影像/視訊瞬間地將其最佳化)。1500/4000對應於0.375之經正規化(輸入)HDR輝度,所以可藉由除以2.6而將此值映射至經按比例調整HDR明亮度Y’HPS的最大值。The utility of white level shifting can be understood as follows. Assume that the content creator grades his images on a system set at PB_C=4000 nits (i.e., his reference grading monitor has a PB_D of 4000 nits, for example), however, throughout the entire video, he never actually produces the image using a pixel maximum brightness higher than, say, 1500 nits (the video maximum, which is a different thing from the encodable maximum PB_D). Then, because the SDR luminance dynamic range is small enough as it is, it makes sense to rescale the input HDR to drop those unused values by 1500 to 4000 nits (because a dynamically adjustable luminance mapping is being used, which can be optimized instantaneously per image/video anyway). 1500/4000 corresponds to a normalized (input) HDR luminance of 0.375, so this value can be mapped to the maximum value of the scaled HDR brightness Y’HPS by dividing by 2.6.
為了精確,根據ETSI2標準,執行以下計算: Y’HPS=(Y’HP-BLO)/(1-WLO-BLO) [方程式3]To be precise, according to the ETSI2 standard, the following calculation is performed: Y’HPS=(Y’HP-BLO)/(1-WLO-BLO) [Equation 3]
其中WLO及BLO係在後設資料中傳達,該後設資料與所接收視訊影像共同傳達或可與其關聯。The WLO and BLO are communicated in metadata that is communicated together with or may be associated with the received video image.
黑色位準偏移對得到SDR對應重新分級影像的更高對比模樣係有用的,但應留意ESTI1經接收影像應保持對HDR影像係可逆映射的,亦即,不應遺失太多的黑色像素細節(其係亦有平行增益限制器的原因,未顯示於圖4中)。Black level shifting is useful to get a higher contrast look for the SDR corresponding regraded image, but care should be taken that the ESTI1 received image should remain reversibly mapped to the HDR image, i.e. not too much black pixel detail should be lost (this is also the reason for the parallel gain limiter, not shown in Figure 4).
基本上,可將黑色位準偏移簡單地理解為將某種HDR「黑」色置於SDR中的0.0,或更精確地說,經由準備用於HDR至SDR輝度映射的單元403(亦即,使用仍在HDR中的經正規化輝度,意謂著具有可用於得到HDR顯示器上的良好模樣,及SDR顯示器上的不良仍未最佳化模樣的相對分布)。Basically, black level shift can be simply understood as putting a certain HDR "black" color at 0.0 in SDR, or more precisely, via a unit 403 that is prepared for HDR to SDR brightness mapping (i.e., using normalized brightness that is still in HDR, meaning having a relative distribution that can be used to get a good look on an HDR display, and a bad, still unoptimized look on an SDR display).
隨後,粗略動態範圍變換器404施加主要輝度變換以得到SDR輝度(亦即,具有物體輝度的良好第一重新分布,以在SDR顯示器上得到合理模樣)。對此,ETSI2使用由用於最暗的HDR經正規化輝度的斜率可控線性區段(此區段之斜率稱為陰影增益)、用於最亮經正規化HDR輸入輝度Y’HPS的另一線性壓縮部分(具有斜率控制參數高亮度增益)、及藉由針對中間色調提供良好SDR外觀而一起平滑化其的可控拋物線部分(具有控制參數中間色調寬度,且數學在該標準中係可讀的,且在此申請案中僅重新解釋(視需要以簡單易消化的方式)至理解根據本見解之新發明實施例所需的程度)組成的曲線。所以此粗略動態範圍變換器404的輸出明亮度Y’CL首次定義在SDR範圍中,或SDR相對明亮度分布統計中。Then, a coarse dynamic range converter 404 applies the main brightness conversion to obtain SDR brightness (i.e., with a good first redistribution of the object brightness to get a reasonable look on an SDR display). For this, ETSI2 uses a curve consisting of a slope controllable linear section for the darkest HDR normalized luminance (the slope of this section is called shadow gain), another linear compression section for the brightest normalized HDR input luminance Y'HPS (with slope control parameter high brightness gain), and a controllable parabolic section that smooths it together by providing a good SDR look for the mid-tones (with control parameter mid-tone width, and the math is readable in the standard, and is only re-explained in this application (in a simple and digestible way as necessary) to the extent necessary to understand the new inventive embodiments according to the present insight). So the output brightness Y'CL of this coarse dynamic range converter 404 is defined for the first time in the SDR range, or in the SDR relative brightness distribution statistics.
對此單元404之內容創作者的技術(及藝術)提議係分級者可以含有更亮像素之其他物體的物體內對比為代價(因為有限的SDR輝度範圍)而良好地最佳化他所需要多明亮以產生最暗像素,但他可共調諧,例如,高亮度增益。可將陰影增益理解成用於,例如,站在洞穴的暗陰影區域中之0.05尼特輝度的人。若以白色在白色上準則(亦即,正規化映射函數,其係具有係經正規化輝度函數圖表之對角線的45度斜率的恆等函數)將他顯示在SDR顯示器上,將發現他在HDR中的經正規化輝度係0.05/5000,其由於用於粗略映射SDR輝度的恆等映射而停留在相等的經正規化輝度,亦即,在使其等絕對化後,該等像素應以(1/100000)*100顯示在SDR顯示器上,亦即,該顯示器上的最小黑色(「0」驅動信號)且係不可見的。因此,必須可觀地提高此類輝度,甚至在更對數均勻的HDR及SDR相對灰色值或明亮度表示中,以獲得充分可見且導致個人物體(例如,顯示在SDR顯示器上之跨越0.3至1.2尼特的個人像素輝度)內的物體紋理可辨別性的SDR輝度。因此,取決於該人恰巧落在HDR輝度範圍上多深(其如上文所教示的將取決於如HDR場景架構、場景照明、攝影機曝光、及由內容創作者所選擇之藝術主HDR分級的此類因子的組合係如何),編碼器(例如,作出合適F_L部分係此第一粗略輝度映射之選擇以將主HDR輸入重新分級成最佳或合適的對應SDL像素輝度的人類分級者)將選擇用於處理此特定影像的該等最暗像素的適當陰影增益(亦即,影像內容最佳化)。注意到實際上在ETSI中,將陰影增益SG定義為基於輸入與輸出影像之峰亮度(至少其等之明亮度表示)之比率的自動按比例調整的校正。在等明亮度(equiluma)原則下,藉由從經正規化HDR輝度作為:L_200=Y’200*L_HDR開始,應提高表示在經正規化明亮度範圍上的輝度係合理的,該經正規化明亮度範圍對應於,例如,僅200尼特PB_C(或更確切地說,根據以上方程式1及方程式2的值:Y’HP=Y’200=v(PB_C_H/200;RHO(200)),v係上述方程式1的虛擬對數(pseudo-logarithmic)方程式)。然而,此一般提供太亮且低對比的影像,所以分級者可使用曝光增益校正:SG=expgain*Y’200,其將係使SG朝向對角值1.0移回並將一些暗度帶回到SDR影像中(他一般不會選擇expgain=1/Y’200,因為SDR經正規化明亮度接著將等於HDR經正規化明亮度且再度太暗;SG將例如落在1.0與1.8之間)的調光(dimming)因子。The technical (and artistic) suggestion to the content creator of this unit 404 is that the grader can optimize well how bright he needs to be to produce the darkest pixel at the expense of intra-object contrast of other objects containing brighter pixels (because of the limited SDR luminance range), but he can co-tune, for example, high brightness gain. Shadow gain can be understood as for, for example, a person standing in a dark shadow area of a cave at 0.05 nit luminance. If it is displayed on an SDR display with a white-on-white criterion (i.e., a normalized mapping function, which is a constant function with a 45 degree slope of the diagonal line of the normalized luminance function graph), it will be found that its normalized luminance in HDR is 0.05/5000, which stays at the same normalized luminance due to the constant mapping used to roughly map the SDR luminance, i.e., after making it equiabsolute, these pixels should be displayed on the SDR display at (1/100000)*100, i.e., the minimum black ("0" drive signal) on the display and are invisible. Therefore, such brightness must be significantly improved, even in more log-uniform HDR and SDR relative gray value or brightness representations, to obtain SDR brightness that is sufficiently visible and results in object texture discernibility within individual figures (e.g., individual pixel brightness spanning 0.3 to 1.2 nits displayed on an SDR display). Thus, depending on how deep into the HDR luminance range that person happens to fall (which, as taught above, will depend on a combination of factors such as the HDR scene architecture, scene lighting, camera exposure, and the artistic master HDR grading chosen by the content creator), the encoder (e.g., one that makes a choice of the appropriate F_L portion of this first coarse luminance mapping to regrade the master HDR input to an optimal or appropriate human grading of corresponding SDL pixel luminance) will choose the appropriate shadow gain for processing those darkest pixels of this particular image (i.e., image content optimization). Note that in practice in ETSI, shadow gain SG is defined as an automatically scaled correction based on the ratio of the peak luminance (at least their brightness representation) of the input and output images. Under the isolumina principle, by starting from the normalized HDR luminance as: L_200=Y’200*L_HDR, it is reasonable to increase the luminance over the normalized luminance range corresponding to, for example, only 200 nits PB_C (or more precisely, according to the values of Equation 1 and Equation 2 above: Y’HP=Y’200=v(PB_C_H/200;RHO(200)), v is the pseudo-logarithmic equation of Equation 1 above). However, this generally gives an image that is too bright and low-contrast, so the grader may use an exposure gain correction: SG=expgain*Y’200, which will be a dimming factor that moves SG back towards the diagonal value of 1.0 and brings some darkness back to the SDR image (he would generally not choose expgain=1/Y’200, because the SDR normalized brightness would then be equal to the HDR normalized brightness and be too dark again; SG would, for example, fall between 1.0 and 1.8).
此曲線類似實作用於將可能大得多的輝度動態範圍中的許多HDR輝度擠壓至小得多的SDR DR中的非線性輝度壓縮「彈簧」。因為使用的不係其「永遠不應在平均上太不合理」的固定曲線,但編碼器可施加已經最佳化的曲線,所得的SDR影像對許多HDR場景將不會不好(並非所有HDR場景均同樣複雜,例如,有時在均勻的陽光照射區域旁邊恰有某個弱陰影區域,然後雖然最簡單的系統將產生像是修剪成白色的問題,但不太複雜的智慧HDR至SDR映射(像是單元404的三部分曲線)在建立HDR主影像(例如,出自現實生活事件擷取內容創作者之HDR攝影機的該一者)的合適SDR重新分級影像上往往將已做得好。This curve acts like a non-linear luminance compression "spring" that squeezes much of the HDR luminance from a potentially much larger luminance dynamic range into the much smaller SDR DR. Because what is used is not a fixed curve which "should never be too unreasonable on average", but the encoder can apply a curve that has been optimized, the resulting SDR image will not be bad for many HDR scenes (not all HDR scenes are equally complex, for example, sometimes there is a slightly shadowed area right next to an evenly sunny area, and then although the simplest system will produce problems like clipping to white, a less complex intelligent HDR to SDR mapping (like the three-part curve of unit 404) will often do a good job of creating a suitable SDR regraded image of an HDR master image (for example, the one from the HDR camera of the content creator of the real life event capture).
然而,數種其他場景可更複雜,且一些內容創作者在精細調諧他們的藝術內容時(例如,好萊塢電影導演或DOP)亦可具有更高程度的專業需求。However, several other scenarios may be more complex, and some content creators may also have a higher degree of professional needs in fine-tuning their artistic content (e.g., a Hollywood film director or DOP).
因此,次一單元(可客製化曲線施加器405)允許內容創作者(再次不論係人類或具有編碼在其演算法中之各種規則的智慧自動化)將可客製化且可能任意成形的精細分級輝度映射函數F_L_CU施加至Y’CL預分級明亮度,產生經分級LDR明亮度Y’GL(對該函數的唯一要求係其係不減少的,且一般甚至係單調增加的,且一般至少如在ETSI2中所選擇地將1.0輸入映射至1.0輸出)。實務上,此函數F_L_CU的形狀可作為一組形狀定義參數(例如,多項式的係數)或作為LUT等的其中一者傳達至解碼器。Thus, the next unit (customizable curve applicator 405) allows the content creator (again, whether human or intelligent automation with various rules encoded in its algorithm) to apply a customizable and possibly arbitrarily shaped fine-scale graded brightness mapping function F_L_CU to the Y'CL pre-graded brightness, producing the graded LDR brightness Y'GL (the only requirement on the function is that it is non-decreasing, and typically even monotonically increasing, and typically maps 1.0 input to 1.0 output, at least as chosen in ETSI2). In practice, the shape of this function F_L_CU can be communicated to the decoder as a set of shape-defining parameters (e.g., coefficients of a polynomial) or as one of a LUT or the like.
因為視覺系統具有判定經感知影像物體灰色值印象的複雜方式、及/或因為將大跨度(span)的HDR輝度擠壓至有限SDR DR中有時可需要相當的理解能力、及/或因為內容創作者明顯地期望將某種額外藝術氣息置於此客製化曲線F_L_CU(然後該形狀一般將由另一顏色使用者介面電腦硬體及在編碼側所連接的軟體(未圖示)判定),此一精細分級可能係必要的。事實上,在一方面,可說所有的MDR影像均應係(僅)在主HDR影像中的所有資訊的某種壓縮表示,但另一方面(因為可給出相當弱印象的影像,例如,具有太少對比,彷彿透過霧見到的),內容創作的其他重要要求可係將所有影像彌補成SDR影像模樣-鑑於其等的更有限的DR能力-儘可能像HDR場景一樣真實或至少盡可能美麗。人類視力係高度非線性及精巧的,且若已用於太簡單的功能,可快速地感知。所以除了粗略輝度擠壓函數F_C外,內容創作者可能使用對可客製化函數F_L_CU的理解能力以在近乎不可能產生其看起來仍儘可能地好到用於HDR場景且較佳地像是HDR場景的SDR影像時做得更好(例如,降低像素的某個輝度子範圍的亮度以產生恰好多一些的物體間對比,例如,用於彩色玻璃窗的亮度之於教堂內部,或SDR影像中的室內室外視覺對比,或藉由選擇經由F_L_CU曲線的特殊局部形狀將場景中的一些物體的色彩度之於輝度最佳化等)。Such a fine grading may be necessary because visual systems have a complex way of determining the impression of the grey values of perceived image objects, and/or because squeezing a large span of HDR brightness into a limited SDR DR may sometimes require considerable understanding, and/or because content creators may clearly wish to put some additional artistic touch on this customized curve F_L_CU (which shape will then generally be determined by another color user interface computer hardware and software connected on the encoding side (not shown)). In fact, on the one hand, one could say that all MDR images should be some kind of compressed representation of all the information in the master HDR image (only), but on the other hand (since images with too little contrast could give a rather weak impression, e.g., as if seen through fog), other important requirements for content creation could be to make all images look like SDR images - as realistic as possible or at least as beautiful as possible, given their more limited DR capabilities. Human vision is highly non-linear and delicate, and can quickly perceive if it has been used to too simple functions. So in addition to the crude luminance squeezing function F_C, content creators may use the ability to understand the customizable function F_L_CU to do better when it is nearly impossible to produce an SDR image that still looks as good as possible to be used in an HDR scene, and preferably looks like an HDR scene (for example, lowering the brightness of a certain luminance subrange of pixels to produce just a little more contrast between objects, e.g. for the brightness of stained glass windows in church interiors, or indoor-outdoor visual contrast in SDR images, or optimizing the chromaticity of some objects in the scene with respect to luminance by choosing a particular local shape of the F_L_CU curve, etc.).
可用顯示於圖 6 中的「陰影人」影像的單一簡單實例啟發讀者並將可客製化之輝度映射函數的最小必要理解提供給他。A single simple example of the "Shadow Man" image shown in FIG6 may be used to enlighten the reader and provide him with the minimum necessary understanding of a customizable brightness mapping function.
圖6A幾何地顯示可在影像中見到的事物,而圖6B顯示L_HDR輝度與L_SDR輝度之間的函數關係。該影像顯示機器人602正移動通過其的黑暗太空站(DRKSPST)。在某個影像呈現時間,其遭遇陰影人601(其色度地定義成一組非常亮的HDR像素),其中在組成陰影人身體的各種像素之間有些微輝度差。此係因為他正站在充滿霧氣氛的強烈光照環境中的窗後方而發生。霧將一分量加至源自陰影人身體(例如,他的衣服)的輝度,將例如L_pants=20尼特+ L_mist= 4500尼特= 4520尼特,L_shirt= 50尼特+ L_mist= 4800尼特= 4850尼特等之HDR影像中的最終輝度給予觀看者。當使用具有對最亮像素太小之斜率的進程輝度映射函數的問題係陰影人可變成對比不足且在較小的動態範圍影像(諸如,SDR影像)中難以看見。一種解決方案係定義F_L_CU函數,使得其在輸入HDR輝度區域4500至5000尼特中局部地具有較大斜率,導致陰影人之更大的SDR輝度子範圍RS,使他及他的細節(例如,其穿戴的領帶)在霧中(甚至在SDR影像中)更加可見。可理解到可能有許多比僅有粗略映射函數F_C具有更多一些的額外重新分級控制可係有利的情況。FIG6A shows geometrically what can be seen in the image, while FIG6B shows the functional relationship between L_HDR brightness and L_SDR brightness. The image shows a robot 602 moving through its dark space station (DRKSPST). At a certain image rendering time, it encounters a shadow man 601 (chromatically defined as a set of very bright HDR pixels), where there are some slight brightness differences between the various pixels that make up the shadow man's body. This occurs because he is standing behind a window in a strongly lit environment with a foggy atmosphere. The fog adds a component to the luminance originating from the shadow person's body (e.g., his clothes), giving the viewer a final luminance in the HDR image of, for example, L_pants = 20 nits + L_mist = 4500 nits = 4520 nits, L_shirt = 50 nits + L_mist = 4800 nits = 4850 nits, etc. The problem when using a progressive luminance mapping function with a slope that is too small for the brightest pixel is that the shadow person can become lacking in contrast and difficult to see in smaller dynamic range images (e.g., SDR images). One solution is to define the F_L_CU function so that it has a steeper slope locally in the input HDR luminance region 4500 to 5000 nits, resulting in a larger SDR luminance sub-range RS for the shadowed man, making him and his details (e.g. the tie he is wearing) more visible in the fog (even in the SDR image). It can be appreciated that there may be many situations where having a bit more additional regrading control than just the coarse mapping function F_C may be beneficial.
回到圖4,在已定義適當的(均勻視覺表示)SDR明亮度之後,線性化器406將其等轉換成(經正規化)SDR明亮度Ls。但是由於SDR輝度此時係以對應於PB_C_S=100尼特(其輸入至單元406中)的RHO而非在輝度處理鏈開始處用於感知不均勻化的5000尼特產生,因此施加上述方程式1的反方程式。Returning to Figure 4, having defined the appropriate (uniform visual representation) SDR brightness, the linearizer 406 converts it to a (normalized) SDR brightness Ls. However, since the SDR brightness is now generated with an RHO corresponding to PB_C_S = 100 nits (which is input to unit 406) rather than the 5000 nits used for perceptual non-uniformity at the beginning of the brightness processing chain, the inverse of equation 1 above applies.
顏色當然不係1維的(除非僅使用消色灰色值影像運作),其使動態範圍轉換及編碼變得更相當複雜,但在其需要用於像素之色度Cb及Cr的平行處理軌的任何情形中,以獲得作為輸出顏色分量Rs、Gs、及Bs的更適合的對應SDR色度,或事實上如最終於圖4所示的合適的SDR RGB顏色。Color is of course not 1-dimensional (unless operating only with achromatic grey value images), which makes dynamic range conversion and encoding considerably more complex, but in any case it requires parallel processing tracks for the chrominance Cb and Cr of the pixel to obtain more appropriately corresponding SDR chrominance as the output color components Rs, Gs, and Bs, or indeed proper SDR RGB colors as ultimately shown in Figure 4.
ETSI2的色度處理軌450執行下列者(僅以所需程度再次簡短地解釋)。輸入像素色度Cb與Cr類似地藉由乘法器452乘以值F_C[Y],產出輸出色度Cb*、Cr*。困難在於始終獲得適當的輸出色度,知道有許多困難:可實現顏色的不規則形狀色域(見圖5中的解釋)、數學的非線性、以及觀看者的人類視覺系統等。此外,如將於以下之本申請案的實施例中所示的,市場甚至有更多需求,導致甚至更複雜的HDR處理系統。The chroma processing track 450 of ETSI2 performs the following (explained again briefly only to the extent necessary). The input pixel chroma Cb and Cr are similarly multiplied by the value F_C[Y] by multiplier 452, producing the output chroma Cb*, Cr*. The difficulty is to always get the appropriate output chroma, and there are many difficulties: the irregular shape of the gamut of achievable colors (see explanation in Figure 5), the non-linearity of the mathematics, and the human visual system of the viewer. In addition, as will be shown in the embodiments of the present application below, the market has even more requirements, resulting in even more complex HDR processing systems.
ETSI2使用飽和處理判定器451,其可取決於輸入像素恰巧具有的何者明亮度值Y而載入定義待發送至乘法器之輸出值的LUT。再者,內容創作者可自由定義/最佳化此明亮度相依之飽和乘法器定義函數的形狀。至少係因為如將於下文見的,有時發明顏色數學需要將此F_C[Y] LUT定義至所需程度。ETSI2 uses a saturation processing determinator 451 which can load a LUT defining the output value to be sent to the multiplier depending on what brightness value Y the input pixel happens to have. Again, the content creator is free to define/optimize the shape of this brightness-dependent saturation multiplier definition function. Not least because, as will be seen below, sometimes inventing color mathematics requires defining this F_C[Y] LUT to the desired degree.
矩陣施加單元453單純地將Cb、Cr顏色規格轉換成對應的經正規化RGB表示(此數學對本申請案係無趣的,且有興趣的讀者可在ETSI2以及ETSI1中找到)。The matrix application unit 453 simply converts the Cb, Cr color specifications into corresponding normalized RGB representations (this mathematics is uninteresting for this application and can be found in ETSI2 and ETSI1 for interested readers).
可藉由將「非HDR輝度」的經正規化R/Lh等值乘以在輝度處理軌401中計算的經正規化Ls值而定義RGB三元組。注意到所得的RN、GN、及BN值事實上仍係經正規化輝度,而非絕對SDR輝度(Rs等),但其等係「SDR正確」經正規化輝度,因為其等現在將SDR恰巧得到的輝度(Ls)列入考慮。The RGB triplet can be defined by multiplying the normalized R/Lh etc. values of "non-HDR brilliance" by the normalized Ls value calculated in the brilliance processing track 401. Note that the resulting RN, GN, and BN values are in fact still normalized brilliance, not absolute SDR brilliance (Rs etc.), but they are "SDR correct" normalized brilliance because they now take into account the brilliance that the SDR happens to get (Ls).
有對於不係比色技術中具有通常知識者的人最初有些困難概念的可能性,為了加快讀者的速度,說明在圖 5 中的經正規化(普遍的,亦即,當如上文所解釋地正規化時,SDR及HDR色域良好地重疊,但當然必須偏移HDR顏色以變成合適的SDR顏色,即使該變換對目前HDR場景影像的需求不係高度精巧且最佳化的,但係簡單地將絕對SDR輝度等同於輸入HDR絕對輝度)YCbCr色域中發生什麼。There is the possibility that this may be initially somewhat difficult to conceptualize for someone who is not generally knowledgeable in colorimetric techniques, so to speed up the reader it is explained what happens in the normalized (general, i.e., when normalized as explained above, the SDR and HDR color domains overlap well, but of course the HDR colors must be shifted to become proper SDR colors, even if the transformation is not highly sophisticated and optimized for the needs of the present HDR scene image, but is simply to equate the absolute SDR brightness to the input HDR absolute brightness) YCbCr color domain in FIG. 5 .
純輝度變換將在垂直方向上發生,所以一般將HDR輝度或其對應明亮度Y(亦即,ColHDR的明亮度)向上移動至最佳的新位置(ColSDR),因為對於HDR至SDR輝度映射,F_L曲線在經正規化軸圖表上將始終落在對角線上方(亦即,具有某個x座標的輸入HDR經正規化輝度或明亮度亦具有該對角線在該x座標之位置的高度的y座標,且始終位於對角線上方之函數的經正規化SDR輸出輝度因此將始終產生較高的經正規化輸出值)。哪個實際(絕對)SDR輝度對應於此經正規化明亮度值Y係藉由首先EOTF成經正規化輝度(由單元406執行,因為Y’HP上至Y’GL的處理明亮度係藉由施加方程式1的對應EOTF而定義),並藉由乘法器455將該等經正規化輝度簡單地乘以100(例如,0.7*100= 70尼特)而發現。亦即,讀者現在看到使用此框架,所需的任何者均可從輸入HDR影像顏色,特別係從其之經PQ定義的明亮度Y(例如,如HDR藍光光碟上所儲存者)定義,一直到待顯示在SDR顯示器上之對應像素的絕對SDR輝度,以將最佳對應SDR影像顯示成HDR輸入影像(及來自所接收HDR影像之SDR影像的所得解碼)。Pure brilliance transformations will occur in the vertical direction, so generally HDR brilliance or its corresponding brightness Y (i.e., brightness for ColHDR) will be moved upwards to an optimal new position (ColSDR), because for the HDR to SDR brilliance mapping, the F_L curve will always fall above the diagonal line on the normalized axis graph (i.e., an input HDR normalized brilliance or brightness with a certain x-coordinate also has a y-coordinate that is the height of the diagonal line at the position of the x-coordinate, and the normalized SDR output brilliance of the function that is always above the diagonal line will therefore always produce higher normalized output values). Which actual (absolute) SDR brightness corresponds to this normalized brightness value Y is found by first EOTFing the normalized brightness (performed by unit 406, since the processed brightness from Y'HP to Y'GL is defined by applying the corresponding EOTF of equation 1), and then simply multiplying those normalized brightness by 100 (e.g., 0.7*100= 70 nits) by multiplier 455. That is, the reader now sees that using this framework, anything required can be defined from the input HDR image color, and in particular from its PQ-defined luminance Y (e.g., as stored on an HDR Blu-ray disc), all the way to the absolute SDR brightness of the corresponding pixels to be displayed on an SDR display, in order to display the best corresponding SDR image to the HDR input image (and the resulting decode of the SDR image from the received HDR image).
到這裡為止,讀者現在瞭解至少根據申請人的ETSI標準化編碼原則的HDR編碼的基本起點。對於多數消費者而言,ETSI1或ETSI2之其中一者的選擇(以及隨後技術地發生的一切者)均將對於他們的目的係足夠的,亦即,以美麗的HDR影像供應他們的市場(當然他們仍將需要產生該等美麗的HDR影像,包括針對至少F_C函數且較佳地亦針對F_L_CU函數判定良好形狀,或至少在根據他們自有的特定藝術需求非手動地最佳化該等函數時,購買及使用自動針對各HDR影像種類產生相當不錯模樣的申請人的自動化,及隨後的編解碼器函數形狀)。例如,將去作完全修補以獲得未來性(future-proof)的高品質多樣化HDR的消費者可部署ETSI2系統,且重視其等之SDR影像或SDR消費者之其中一者的市場播放器可更將他們的HDR系統部署為ETSI1系統(此亦可涉及取決於在HDR視訊處理鏈的何處的各種討論,例如,內容創作者之於有線電視通訊系統營運商,並可能涉及轉碼等)。By now the reader knows the basic starting point of HDR coding at least according to the applicant's ETSI standardized coding principles. For most consumers, the choice of either ETSI1 or ETSI2 (and everything that technically follows) will be sufficient for their purposes, namely to supply their market with beautiful HDR images (of course they will still need to produce such beautiful HDR images, including determining good shapes for at least the F_C function and preferably also for the F_L_CU function, or at least buy and use the applicant's automation that automatically produces quite good looking and subsequent codec function shapes for each HDR image category when optimizing these functions non-manually according to their own specific artistic needs). For example, a consumer who is going to do a full patching for future-proof high quality diverse HDR may deploy an ETSI2 system, and a market player that values their SDR images or one of the SDR consumers may instead deploy their HDR system as an ETSI1 system (this may also involve various discussions depending on where in the HDR video processing chain, e.g. content creators versus cable operators, and possibly involving transcoding, etc.).
然而,市場有另一需求或有提議給不喜歡部署精確標準化之ETSI1或ETSI2之消費者的市場。若選擇將HDR影像傳達為表示所有各種PB_D顯示器所需之影像的整體頻譜的單獨影像,傳達(例如,5000尼特PB_C)主HDR影像自身係非常合理的,不僅因為該等影像已經可用,亦因為影像HDR場景的最佳品質表示(事實上其等係內容創作者的「財富」,由他特別創作且認可,且常係創意視覺電影之起點的影像,若已非他主動創作的唯一事物,若重新分級的其餘部分藉由所選技術自動地運作)。然而,尤其是未來幾年,有可從另一額外方法獲益的市場情況。遺憾的係並非市場中之非基本型傳統SDR顯示器(亦即,不能進行涉及在HDR解碼、顯示調適等中的所有數學)的所有電視(或一般視訊解碼或處置裝置)將始終係立即有ETSI2(或ETSI1)能力的電視。市場上有許多將非常不同的方法(像是,例如,根據最近標準化的混合對數伽瑪方法)施加至HDR編碼及顯示的電視。或者可能某些tv可僅解碼經PQ明亮度編碼的HDR影像,僅此而已。可能一些電視可僅使用該方法,所以很可能它們所能作的最好事情係完全不處理傳入的ETSI2 HDR視訊。類似地,市場中可能有不遵循任何標準原則的一些電視,至少不關於顯示調適,亦即,在接收時將例如2000尼特影像重新分級成用於900尼特PB_D顯示器的例如900尼特影像。此一電視將需要解碼能力以理解所接收的影像含有哪些像素顏色且尤其係輝度,但其等可將其等自有(色調映射)的啟發使用在如何產生900尼特的影像上。至少從想要每個消費者均可看見與他原本創作其時一樣好的電影的內容創作者的觀點,缺點係此一變異性將建立高度不確定性,任何特定品牌的電視將在該不確定性上產生任何經接收HDR影像。例如,近來執行的HDR影像的簡單顯示重新解譯係HDR影像輝度的絕對演現。此意謂著將至多900尼特的所有HDR影像輝度以如編碼在影像中的輝度精確地顯示,但將所有更高的輝度修剪至顯示器的最白的可能白色(亦即,PB_D)。使用實例影像,像是圖7的太空站,可意謂著地球的一些部分將修剪成醜陋的白色斑點(在其右方的太陽過度照射地球處)。而此TV在某種程度上仍將係美麗的HDR TV,因為其將顯示通過太空站的頂部觀看口看見的與黑暗內部成良好地對比的大部分係非常明亮的藍色的地球,該影像的至少一部分將看起來係醜陋的(且一些其他場景至少在一些TV上可顯示更為嚴重的錯誤,像是例如,修剪掉圖1之洞穴或市集等外側的每個影像細節)。執行另一簡化色調映射重新解譯(像是例如,輝度的線性壓縮,像是白色在白色上策略)可產生若干其他問題。因此,雖然此一系統可為終端觀看者運作並產生某種種類的HDR影像(例如,在吾等的ETSI2系統中,此一TV僅可使用401的PQ函數,但忽略所有其他輝度映射函數後設資料及隨之發生的循序輝度映射402、403等,其等在ESTI2中執行顯示調適的功能),但結果將不會係最佳視覺品質也不會係可預測的(其可能係更糟的)。However, there is another market need or there is a market proposal for consumers who do not like the deployment of an exact standardized ETSI1 or ETSI2. If the choice is to convey the HDR image as a single image representing the entire spectrum of the image needed for all kinds of PB_D displays, it is very reasonable to convey the (for example, 5000 nits PB_C) master HDR image itself, not only because these images are already available, but also because they are the best quality representation of the image HDR scene (in fact they are the "wealth" of the content creator, the images specifically created and approved by him, and often the starting point of a creative visual film, if it has not been the only thing he has created actively, if the rest of the re-grading works automatically by the chosen technology). However, especially in the coming years, there are market situations that can benefit from another additional approach. Unfortunately not all TVs (or video decoding or processing devices in general) on the market that are not basic conventional SDR displays (i.e. cannot do all the math involved in HDR decoding, display adaptation, etc.) will always be ETSI2 (or ETSI1) capable TVs out of the box. There are many TVs on the market that apply very different approaches to HDR encoding and display (like, for example, according to the recently standardized Hybrid Log-Gamma method). Or maybe some TVs can just decode PQ brightness encoded HDR images and nothing more. Maybe some TVs can just use that method, so it's likely that the best they can do is not process incoming ETSI2 HDR video at all. Similarly, there may be some TVs in the market that do not follow any standard principles, at least not with regard to display adaptation, i.e. re-grading, upon reception, e.g. a 2000 nit image into, e.g., a 900 nit image for a 900 nit PB_D display. Such a TV will need decoding capabilities to understand which pixel colors and especially the brightness the received image contains, but they can use their own (tone mapping) inspiration on how to produce the 900 nit image. The disadvantage, at least from the point of view of the content creator who wants every consumer to see a movie that is as good as it was when he originally created it, is that this variability will create a high degree of uncertainty over which any particular brand of TV will produce any received HDR image. For example, a simple display re-interpretation of an HDR image that has been performed recently is an absolute representation of the brightness of the HDR image. This means that all HDR image luminances up to 900 nits are displayed exactly as encoded in the image, but all higher luminances are clipped to the display's whitest possible white (i.e., PB_D). Using an example image, such as the space station in Figure 7, this may mean that some parts of the Earth will be clipped to ugly white spots (where the Sun over-illuminates the Earth to its right). While this TV will still be a somewhat beautiful HDR TV, since it will show a mostly very bright blue Earth seen through the top viewing port of the space station contrasting well with the dark interior, at least part of the image will look ugly (and some other scenes may show more severe errors on at least some TVs, such as, for example, clipping off every image detail outside of the cave or marketplace in Figure 1). Performing another simplified tone mapping reinterpretation (such as, for example, linear compression of brightness, like a white-on-white strategy) may create several other problems. So while such a system may work and produce some kind of HDR image for the end viewer (e.g., in our ETSI2 system, such a TV may only use the PQ function of 401, but ignore all other brightness mapping function metadata and the resulting sequential brightness mapping 402, 403, etc., which perform display adaptation functions in ETSI2), the result will not be the best visual quality nor predictable (it may be worse).
此導致除了主HDR影像外,基於第二類型之HDR影像的新編碼拓撲,所謂的中間動態範圍 (intermediate dynamic range, IDR) 影像,其首先在WO2016020189中引入。然後優點係可以位於本領域中之許多電視的範圍中的PB_C(例如,1000尼特、或750尼特;雖然亦可選擇使用相同技術的500尼特,或可能甚至400尼特PB_IDR)定義此類次級HDR影像(經IDR編碼影像,其將傳達至接收器而非典型ETSI2編解碼器原則中的主HDR影像)。但就所需的藝術地或實際地技術限制(例如,可用分級監視器)而言,無論如何仍可期望產生PB_MHDR主HDR。該想法係無論任何電視所使用的顯示重新解譯(包括色調映射)技術,其應係平滑的,從這個意義而言若PB_D接近PB_IDR(所接收之IDR影像的峰亮度),處理不應與所接收影像偏離太多。例如,即使係僅修剪所有高於PB_D的像素輝度之如此基本型的電視,當時亦不應修剪過多(例如,並非整個地球或洞穴影像的晴朗外側)。且內容創作者取回一些控制,因為即使他一方面期望產生美麗的超明亮影像區域,例如,5000尼特PB_C_H之主影像中的在4000尼特的平均值左右,他可控制他重新分級IDR影像中之該等區域的方式,使得它們下降至,例如,充分地低於1000尼特,使得即使800尼特的基本型電視應僅修剪最亮且視覺上破壞性最小的像素,例如,僅修剪圖7之太空站實例中的太陽的光線。因此需要某種新技術以迎合該新方法。This leads to a new coding topology based on a second type of HDR image, in addition to the main HDR image, the so called intermediate dynamic range (IDR) images, first introduced in WO2016020189. The advantage is then that such secondary HDR images (IDR coded images, which will be delivered to the receiver instead of the main HDR image in the typical ETSI2 codec principles) can be defined at PB_C in the range of many TVs in the art (e.g. 1000 nits, or 750 nits; although one could also choose 500 nits, or possibly even 400 nits PB_IDR using the same technology). But in terms of desired artistic or practical technical limitations (e.g. available graded monitors), it is nevertheless desirable to produce a PB_MHDR main HDR. The idea is that whatever display re-interpretation (including tone mapping) technique any TV uses, it should be smooth, in the sense that if PB_D is close to PB_IDR (the peak brightness of the received IDR image), the processing should not deviate too much from the received image. For example, even a basic TV that just clips all pixel brightness above PB_D should not clip too much (e.g. not the clear outer side of an image of the entire Earth or a cave). And the content creator gets back some control, because even if he on the one hand wishes to produce beautiful super bright image areas, e.g. around an average of 4000 nits in a main image of a 5000 nit PB_C_H, he can control the way he regrades these areas in the IDR image so that they drop to, e.g., well below 1000 nits, so that even a basic TV of 800 nits should clip only the brightest and least visually disruptive pixels, e.g. clip only the light of the sun in the space station example of Figure 7. So some new technology is needed to cater to this new approach.
圖 7 顯示迎合通道調適方法之WO2016020189的編解碼器原則(該通道傳達影像係IDR影像,藉此可說特定通訊通道經組態以用於發送,例如,1000尼特PB_CH影像)。再次選擇對一些主要概念的說明有趣的實例。應理解的一件事係若沿著該範圍的所有不同的PB_C影像精確地或至少非常接近於若內容創作者分開地且未受任何技術系統限制地分級其等之各者所產生的事物,雖然其可係有用的,但此需求並不一定始終如此,尤其是對於IDR影像。可能涉及某種放寬(另一方面,亦可能有對HDR場景類別的某種特定影像分級X之於Y何時或為何係最佳的某種爭論,且似乎有足夠的偏差用於該偏差;例如,可想像街燈之像素的輝度比面部之像素更不重要,尤其係若其應被視為係半隱藏在黑暗中,亦已因為在現實生活中,無論如何任何街燈均很有可能更亮或更不亮些)。 Figure 7 shows the codec principle of WO2016020189 catering for the channel adaptation approach (the channel conveyed image is an IDR image, whereby one can say that a particular communication channel is configured for sending, for example, a 1000 nit PB_CH image). Again an interesting example is chosen to illustrate some of the main concepts. One thing that should be understood is that while it can be useful if all the different PB_C images along the range are exactly or at least very close to what would be produced if the content creator graded each of them separately and without any technical system constraints, this requirement is not necessarily always the case, especially for IDR images. There may be some relaxation involved (on the other hand, there may also be some debate about when or why a certain image classification X for Y of the HDR scene class is best, and there seems to be enough bias for that bias; for example, one could imagine that the brightness of a pixel of a street light is less important than that of a face, especially if it should be seen as semi-hidden in darkness, but also because in real life any street light is likely to be brighter or less bright anyway).
WO2016020189提供從作為某個中間點的IDR影像(亦即,向上朝向待從當由接收器接收時的IDR影像重建的主HDR,及向下以進行針對PB_D < PB_IDR的MDR顯示器的顯示調適)定義函數(不同函數)的方式。使用此一技術,主HDR範圍可良好地選擇成始終固定為10000尼特PB_C範圍,其係連結至PQ函數的範圍。WO2016020189 provides a way to define a function (different functions) from an IDR image as some midpoint (i.e., upwards towards the master HDR to be reconstructed from the IDR image as received by the receiver, and downwards for display adaptation for MDR displays with PB_D < PB_IDR). Using this technique, the master HDR range can be well chosen to always be fixed to the 10000 nit PB_C range, which is linked to the range of the PQ function.
看到可能再次有涉及如何轉變各種可能輝度的不同考量,且此等可能有利地在所選擇IDR影像的左方上相當不同於右方上。因為事實上,在概念上可能正在作不同的事。在左方,從主HDR影像產生次級(「較小」)HDR影像。所以一個考量可係此IDR影像必須與主HDR影像「一樣好」(儘管較低的PB_IDR)(及然後如何優雅地解決該表面矛盾?)。在右方,正朝向甚至更小的PB_MDR壓縮(其對一些高複雜度係可觀的,意謂著除了其他事物之外之遍布輝度範圍的許多關鍵物體,及高PB_C_H影像),亦即,似乎具有不同的經顯示調適影像產生任務。所以,可想像此可導致(相當)不同的技術處置,特別係在吾等的影像+輝度映射視覺不同地經成形/設計的輝度映射函數中。See that again there may be different considerations involved in how to transform the various possible luminances, and these may advantageously be quite different on the left than on the right for the chosen IDR image. Because in fact, conceptually different things may be being done. On the left, a secondary ("smaller") HDR image is generated from the main HDR image. So one consideration may be that this IDR image must be "as good" as the main HDR image (despite the lower PB_IDR) (and then how to resolve this seeming contradiction elegantly?). On the right, one is heading towards even smaller PB_MDR compression (which is appreciable for some high complexity, meaning many key objects across the luminance range, and high PB_C_H images among other things), i.e., there seems to be a different displayed adapted image generation task. So, one can imagine that this could lead to a (quite) different technical treatment, in particular in that our image+brightness mapping function is shaped/designed differently in vision.
在此實例中,該暗太空站輝度可(至少原則上)在每個合理的電視上顯示,因為其等比60尼特更暗。但該等較亮像素首先必須相當和緩地壓縮至IDR影像,然後在第一部分中完成較少壓縮,更多者必須朝向SDR影像完成。且可能再度有不同準則用於例示性的二個亮物體,亮藍色地球之於遠為明亮但幾乎無色的太陽與其光線。當分別在主HDR影像輝度範圍(BE)及用於明亮地球物體的IDR輝度範圍(Be2)上的輝度子範圍指示理想上此內容創作者可期望用於地球的最大亮度從未高於,例如,750尼特,無論任何影像或顯示器的PB_C能力(因為否則地球可能開始太過發光而看起來不真實)。然而,然後太陽輝度必須做的事變成隨數個因素而變動,不僅係藝術需求,亦係留下用於編碼在所選(800尼特PB_IDR)IDR影像中高於750尼特之太陽物體的輝度量(當然在一些情況中,內容傳達者可選擇另一較高的PB_IDR值,但此處已假設任何連接至通訊通道之接收端的設備始終預期用於任何視訊內容(無論係好萊塢電影或新聞節目)的800尼特PB_IDR)。使用二個箭號顯示針對作為子集之所有此等最亮像素用於從主HDR影像輝度建立IDR影像輝度之最終選擇的F_H2h輝度映射函數:將解決方案選擇成定義一起用於二個物體的總壓縮動作,其亦減少一些最不明亮的地球物體輝度。此係未100%完全滿足(因為可能對應於一些其他技術困難)內容創作者之理想重新分級需求之情況的實例,而IDR影像對多數人而言足夠接近。地球像素在IDR影像中是否僅有些更暗真的沒那麼重要,且甚至可能將有些期望其用於較低品質的HDR影像。但重點係此IDR影像仍可滿足原始ETSI2原則的所有要求(同時此額外編解碼器步驟亦滿足接近800尼特PB_D的基本型顯示器在顯示已接收IDR影像之前不使其劣化太多的需求):藉由採用右側輝度變換函數,所有MDR影像上至SDR主影像仍可依內容創作者的期望由接收器產生,且(甚至具有經暗化的明亮地球物體像素)主HDR 2000尼特PB_C或10,000尼特PB_C影像仍可藉由逆向F_H2h函數而重建(亦可根據其技術及/或藝術需求藉由其自身對各影像、編碼電影之特定鏡頭的一組連續影像最佳化)。In this example, the dim space station brightness can (at least in principle) be displayed on every reasonable TV, since it is dimmer than 60 nits. But the brighter pixels must first be compressed quite gently to the IDR image, then less compression is done in the first part, more must be done towards the SDR image. And there may again be different criteria for the exemplary two bright objects, the bright blue Earth versus the much brighter but almost colorless Sun and its light. As the luminance sub-ranges on the main HDR image luminance range (BE) and the IDR luminance range for bright Earth objects (Be2) respectively indicate, ideally the content creator can expect the maximum brightness for the Earth to never be higher than, for example, 750 nits, regardless of the PB_C capabilities of any image or display (because otherwise the Earth might start to shine too brightly and look unrealistic). However, what must be done with the solar brilliance then becomes a function of several factors, not only artistic requirements but also the amount of brilliance left to encode solar objects above 750 nits in the chosen (800 nits PB_IDR) IDR image (of course in some cases the content provider could choose another higher PB_IDR value, but it is assumed here that any device connected to the receiving end of the communication channel always expects a PB_IDR of 800 nits for any video content, be it a Hollywood movie or a news show). The final chosen F_H2h luminance mapping function for building the IDR image luminance from the main HDR image luminance is shown with two arrows for all these brightest pixels as a subset: the solution was chosen to define an overall compression action for both objects together, which also reduces the luminance of some of the least bright Earth objects. This is an example of a situation where the ideal regrading needs of the content creator are not 100% fully met (because probably corresponds to some other technical difficulties), while the IDR image is close enough for most people. It is really not that important if the Earth pixels are just a little darker in the IDR image, and it might even be somewhat desirable for lower quality HDR images. But the point is that this IDR image still meets all the requirements of the original ETSI2 principles (while this additional codec step also meets the requirement of basic monitors close to 800 nits PB_D not to degrade the received IDR image too much before displaying it): by adopting the right luminance transform function, all MDR images up to the SDR master image can still be produced by the receiver as the content creator intended, and (even with darkened bright earth object pixels) the master HDR 2000 nits PB_C or 10,000 nits PB_C image can still be reconstructed by the inverse F_H2h function (which can also be optimized by itself for each image, a set of consecutive images of a specific shot of the encoded movie, according to its technical and/or artistic needs).
值得討論關於其等的無關度(因為不應混淆不同的技術態樣)而非其等重要性(但由於潛在混淆,其等值得討論)的二個文件:US20160307602及EP2689392(亦稱為WO2012127401),其等二者考慮所謂的「顯示最佳化」而非視訊影像編碼框架設計。此主要差異係使用圖23對具有通常知識者說明,其顯示一般實例總視訊處置鏈。在內容創作側2301,假設有藉由攝影機2302方式的HDR場景的現場(或先前記錄的)擷取。人類分級者(或著色者)判定,例如,除了其他事物之外之擷取的主分級(亦即,各種影像物體像素輝度在主HDR影像之輝度動態範圍上的相對位置-其在例如PB_C_H50=5000尼特的最大表示值處結束;並在某個小的黑色值開始,例如,MB_C_H50=0.001尼特,可針對目前討論將其假設等同於零:例如,針對太空站,他藉由影像處理而改變原始攝影機擷取,使得太陽像素變成4500尼特,地球的亮藍色變成,例如,200尼特等)。其次,在吾等之方法中的分級者一般想要以至少一個亮度映射函數(實務上,此一亮度映射函數可經不同地定形以用於HDR視訊的連續影像,且甚至在吾等的ETSI標準中解釋甚至針對一個單一瞬間視訊影像定義數個函數在技術上可係如何相當便利的,但其等之進一步複雜度對說明本領域的本創新貢獻不係必要的)指示哪個一般將係指定如何必須將5000尼特PB_C_H50之經正規化主HDR輝度重新分級至100尼特LDR經正規化輝度的函數:FL_50t1。Two documents worth discussing regarding their irrelevance (because different technical aspects should not be confused) rather than their importance (but worth discussing due to potential confusion) are US20160307602 and EP2689392 (also known as WO2012127401), both of which consider so-called "display optimization" rather than video image coding framework design. The main difference is illustrated to those of ordinary skill using Figure 23, which shows a general example overall video processing chain. On the content creation side 2301, it is assumed that there is a live (or previously recorded) capture of an HDR scene by means of a camera 2302. The human grader (or colorist) determines, e.g., the main grade of the capture among other things (i.e., the relative position of the brightness of the various image object pixels on the brightness dynamic range of the main HDR image - which ends at a maximum representable value of e.g. PB_C_H50=5000 nits; and starts at some small black value, e.g. MB_C_H50=0.001 nit, which can be assumed equal to zero for the present discussion: e.g., for the space station he alters the original camera capture by image processing so that the sun pixels become 4500 nits, the bright blue of the Earth become, e.g., 200 nits, etc.). Secondly, the grader in our method generally wants to indicate with at least one luminance mapping function (in practice, such a luminance mapping function may be shaped differently for successive images of HDR video, and even in our ETSI standard it is explained how it may be technically quite convenient to define several functions even for a single instantaneous video image, but such further complexity is not necessary to illustrate the present innovative contribution to the art) which generally will be the function that specifies how the normalized main HDR luminance of 5000 nits PB_C_H50 must be regraded to a 100 nits LDR normalized luminance: FL_50t1.
然後第三重要態樣係用於主HDR影像之編碼的編碼技術,以(經由至少一個編碼技術)傳達出至一或多個接收器。在HDR視訊研究開始時,且對應地在由申請人標準化的較簡單版本中,此將係相對簡單的編碼,諸如,例如LDR 100尼特影像,然後其不錯地回溯相容,使得其可以良好的視覺外觀直接顯示在沒有對HDR的理解能力或處理能力的舊型LDR電視上。WO2016020189編碼方法及本教示係更先進的第二代方法的實例,其更複雜但可在一些特定HDR視訊通訊或處理技術生態系統中迎合額外需求。藉由例如人類分級者2304執行的分級(假如此非自動化的,諸如,通常在現實生活的廣播節目中)在分級設備2303(其一般將含有數個工具以改變像素輝度,針對本說明仍可假設成由提供使用者介面以指定FL_50t1形狀,及傳達出此一函數形狀(例如,作為包含定義函數之形狀的一些參數的後設資料)的元件組成)上完成。The third important aspect is then the coding technique used for encoding of the main HDR image to be communicated (via at least one coding technique) to one or more receivers. At the beginning of HDR video research, and correspondingly in the simpler versions standardized by the applicants, this would be a relatively simple encoding of, say, an LDR 100 nit image, which would then be well retroactively compatible so that it could be displayed directly with good visual appearance on older LDR TVs that have no HDR understanding or processing capabilities. The WO2016020189 coding method and the present teachings are examples of more advanced second generation methods, which are more complex but can cater for additional needs in some specific HDR video communication or processing technology ecosystems. Grading, performed by for example a human grader 2304 (if this is not automated, e.g., as is common in real-life broadcast programmes), is done on a grading device 2303 (which will generally contain several tools to change pixel brightness, but for the present description can still be assumed to consist of components providing a user interface to specify the FL_50t1 shape, and communicating this function shape (e.g. as metadata containing some parameters defining the shape of the function)).
雖然視訊編碼器2305(非限制地假設其之輸入主HDR影像係用於所有像素的一組輝度,其將執行產生主HDR影像之實際編碼的所有技術,亦即,例如及一般YCbCr像素顏色三元組的8位元、10位元、或12位元像素化矩陣連同鑑於所選擇之無論哪個編碼技術描述所有進一步資訊(像是輝度映射函數)的後設資料)在原則上可包括在分級設備2303中,一般已將其顯示為可進一步連接的設備。此表示足以說明本發明之針對讀者的簡化,其中其總結在外側廣播軌中發生的,例如,擷取(及可能的分級),及或許在一些中間通訊中繼站中發生之編碼(例如,在將本地廣告插入在信號中等,其中亦可涉及各種影像內容的各別調和,但其係不需要說明的細節)的各種實際變化。重點係瞭解在創作側發生什麼(見例如貢獻與分布之間的不同),可將其定義為當,例如,藉由衛星天線2306及通訊衛星2340方式(或任何等效視訊通訊通道,例如,經由網際網路等)將最終經編碼視訊信號傳達至一些消費者時正式結束。Although the video encoder 2305 (assuming, without limitation, that its input master HDR image is a set of luminances for all pixels, which will perform all techniques for the actual encoding of the master HDR image, i.e., for example, an 8-bit, 10-bit, or 12-bit pixelation matrix of generic YCbCr pixel color triplets together with metadata describing all further information (such as luminance mapping functions) in view of whichever encoding technique is chosen) may in principle be included in the grading device 2303, it is generally shown as a further connected device. This representation is sufficient to illustrate the simplification of the invention for the reader, which summarizes the various actual variations that occur in the outer broadcast track, such as capture (and possible classification), and perhaps encoding that occurs in some intermediate communication relays (for example, in inserting local advertisements in the signal, etc., which may also involve respective adjustments of various image contents, but which are details that do not require explanation). The key point is to understand what happens on the creation side (see, for example, the difference between contribution and distribution), which can be defined as the official end when the final encoded video signal is transmitted to some consumer, for example, by means of satellite antenna 2306 and communication satellite 2340 (or any equivalent video communication channel, for example, via the Internet, etc.).
在接收側,一般面臨在最終消費者家的消費性設備,諸如,在輸入側上連接至本地衛星接收碟(local satellite dish) 2351,及在輸出側上連接至HDR顯示器2353的衛星tv機上盒,或任何等效解碼及最終處理設備2352,該顯示器可具有各種顯示器能力,例如,1000尼特、或700尼特、或2500尼特的PB_D。而可能係針對機上盒藉由解碼器2381僅對需要顯示的輝度值再度執行解碼就係足夠的,該解碼器常時執行編碼器的反操作,此一般僅在有限量的情況將係有用的。通常將有藉由顯示最佳化器2382的顯示最佳化程序,該顯示最佳化器再度改變絕對各別地經正規化輝度分布(所接收之,例如,LDR影像或經解碼主HDR,例如,5000尼特影像的任一者),因為主影像可已針對,例如,5000尼特PB_C_H50編碼,亦即,可能含有2000尼特輝度像素,特定消費者的HDR顯示器仍可僅,例如,顯示至多700尼特(其之可顯示最白的白色)。On the receiving side, it is generally faced with consumer equipment in the end consumer's home, such as a satellite TV set connected to a local satellite dish 2351 on the input side and to an HDR display 2353 on the output side. box, or any equivalent decoding and final processing device 2352, the display may have various display capabilities, for example, 1000 nits, or 700 nits, or 2500 nits PB_D. While it may be sufficient for the set-top box to only decode the brightness values that need to be displayed again by means of the decoder 2381, which often performs the inverse operation of the encoder, this will generally only be useful in a limited number of cases. . Typically there will be a display optimization process by a display optimizer 2382 which again modifies the absolutely individually normalized brightness distribution (the received, e.g., LDR image or decoded master HDR, For example, any of the 5000 nit images), since the main image may have been encoded for, for example, 5000 nits PB_C_H50, i.e., may contain 2000 nit brightness pixels, a particular consumer's HDR display may still only, for example, Displays up to 700 nits (which can display the whitest white).
所以,在一方面,在其中一側上的器具(及其等的技術設計原則等)之間有主要的技術差異,例如,創作/編碼/傳輸側將僅具有視訊編碼器2370以將主HDR視訊(MsterHDR)編碼為某個經通道編碼的中間動態範圍影像IDR,而接收側亦可顯示最佳化經重建5000尼特HDR影像(RecHDR)在,例如,700尼特PB_C影像ImDA最佳化的顯示器中,最佳地適用於經連接的700尼特PB_D顯示器。二者之間的技術差異可見於一者可進行作為(可選的)後處理的顯示最佳化,而編碼/解碼僅係影像重建技術,一般不需要與顯示最佳化有關的任何教示。二側的設備(及操作程序等)一般亦由相當不同的技術人員處置。內容創作設備可由專業視訊裝備生產商設計,且由廣播工程師等操作。機上盒及電視一般係由亞裔的消費性電子器具製造商製造。So, on one hand, there is a major technical difference between the apparatus on one side (and their technical design principles etc.), e.g. the authoring/encoding/transmitting side will only have a video encoder 2370 to encode the master HDR video (MsterHDR) into some channel coded intermediate dynamic range image IDR, while the receiving side can also display an optimized reconstructed 5000 nit HDR image (RecHDR) in, e.g., a 700 nit PB_C image ImDA optimized for a display, optimally suited for a connected 700 nit PB_D display. The technical difference between the two can be seen in that one can perform display optimization as an (optional) post-processing, while the encoding/decoding is only an image reconstruction technique, generally not requiring any teachings related to display optimization. The equipment (and operating procedures, etc.) on both sides are generally handled by quite different technical personnel. Content creation equipment may be designed by professional video equipment manufacturers and operated by broadcast engineers, etc. Set-top boxes and TVs are generally manufactured by Asian consumer electronics manufacturers.
US20160307602係申請人的第一個顯示最佳化專利。總結而言,此中的想法係內容創作者可給出用於可存在於影像中之各種(至少二個)區域(區域係一種概念,該概念係影像中的一組像素,及當具有各種動態範圍的各種可用顯示器時該等像素所需要的重新分級行為二者)的引導重新分級行為規則及演算法。雖然此首先實現內容創作者的需求與最終消費地點的實際顯示之間的連接,但實際上顯示調適的受控行為必須在此終端地點發生。且理想上,機上盒或電視(假如至少顯示調適在該電視中發生)的製造者將大部分遵循內容創作者針對視訊影像中的各種區域物體指定為良好行為者(例如,某人從暗區域淡入而在任何顯示器能力上既不太過可見亦不太過不可見,甚至100尼特PB_D LDR顯示器),因為其係此內容需要的,而非他自身盲目地作任何事。但此清楚地係在消費側發生的最終行為,且與視訊通訊技術供應商想要如何發展、任何實作者分別想要如何部署的任何特定視訊編解碼器原則完全正交。亦不應與任何特設色調映射技術混淆,已針對此一映射一般將係不可逆的事實,應具有經由較低動態範圍IDR影像編碼的性質。US20160307602 is the applicant's first display optimization patent. In summary, the idea here is that content creators can come up with rules and algorithms for guided re-rating behavior for various (at least two) regions that may exist in an image (region is a concept that is both a set of pixels in an image, and the re-rating behavior required for those pixels when there are various available displays with various dynamic ranges). While this first achieves the connection between the needs of the content creator and the actual display at the final consumption location, the actual controlled behavior of the display adaptation must occur at this end location. And ideally, the manufacturer of a set-top box or TV (if at least display adaptation occurs in that TV) would mostly follow what the content creator specifies as a good bet for various area objects in the video image (e.g. someone fading in from a dark area to be neither too visible nor too invisible on any display capability, even a 100 nit PB_D LDR display), because it is required by the content, rather than blindly doing anything himself. But this is clearly the ultimate behavior that occurs on the consumer side, and is completely orthogonal to any specific video codec principles of how the video communications technology vendors want to develop, and how any implementation creator wants to deploy respectively. Nor should it be confused with any ad hoc tone mapping techniques, which are targeted at the fact that such a mapping will generally be irreversible, should it have the properties of being encoded via lower dynamic range IDR images.
WO2012127401亦係用於指定顯示最佳化行為的早期HDR年代技術,其可藉由DATGRAD結構的各種實施例方式完成,該DATGRAD結構指定各種影像內容應如何針對不同的輝度動態範圍能力重新分級。此DATGRAD結構將用以針對MDR顯示器PB_D在主HDR可編碼峰亮度PB_C(亦即,本表示法中的PB_C_H50)與最低必要重新分級規格(p. 16)的100尼特LDR PB_C之間產生任何需要的中等動態範圍影像(MDR)。MDR影像的推導不僅藉由使用如編碼在DATGRAD資料結構中之影像的重新分級需求,亦使用特定顯示器側觀看態樣(諸如,例如觀看環境亮度或最終觀看者偏好設定(見p. 5))最佳地完成。WO2012127401 is also an early HDR era technology for specifying display optimization behavior, which can be accomplished by various implementations of a DATGRAD structure that specifies how various image content should be regraded for different luminance dynamic range capabilities. This DATGRAD structure will be used to generate any required medium dynamic range image (MDR) for an MDR display PB_D between the main HDR encodeable peak luminance PB_C (i.e., PB_C_H50 in this notation) and the 100 nits LDR PB_C of the minimum necessary regrading specification (p. 16). The derivation of the MDR image is best done by using not only the image regrading requirements as encoded in the DATGRAD data structure, but also specific display-side viewing conditions such as, for example, the viewing environment brightness or the final viewer preference settings (see p. 5).
應該清楚的係自始當不具有進一步相當具體的洞察力時,此類教示未帶給通常知識者關於編解碼器重新設計的任何事物。It should be clear from the outset that such teachings do not bring the person of ordinary skill anything regarding codec redesign without further quite specific insights.
與已可在先前技術中發現者相比,除了在產生特定函數上的差異外,更重要的係創新的編解碼器結構/框架本身,亦應提到第二PB_C值(除了實際傳達之IDR影像的較低一者外,主內容的最高一者)的實際通訊亦與可使用在WO2016020189中的(可選的)種類特徵化器不同。除了二者在字面上並不相同的事實外,列舉器(enumerator)可扮演不同角色,且特別係若考慮該框架相較於本教示的細節。此一‘189之特徵化器在有例如二個向上重新分級輝度映射函數的情形中可係有用的。然後其可能對選擇哪一個以獲得任何事物(像是創作側之主HDR影像的緊密重建)有用。但此類資訊既不係嚴格需要的,亦不必要施加在先前技術中。可使用來自主HDR影像的升級函數以取代5000尼特的重建影像而獲得4000或6000尼特的重建影像。有中間影像的二個側,降級函數通常係具有臨界影像內容(特別係必須足夠明亮且如實的顯示在所有PB_D顯示器上的內容)之一者,但升級函數在針對最非常明亮的物體(像是車頭燈、日光反射等)指定重新分級行為的此一方式上將具體地不同。其等係一般的HDR影響目的/效應,然而亦不可以任何方式正確地複製,因為其在各種能力的顯示器改變最多之輝度範圍的上區域中。因此,從,例如,600尼特IDR影像產生4000尼特PB_C重建影像可具有相較於其等的理想輝度值(即使此一值可在4000尼特輝度範圍上表示)略微太暗的一些車頭燈,但若僅施加,例如,多重線性經正規化重新分級函數在[0-1]/[0-1]軸系統上,其仍將係相當好看的HDR影像,其中水平軸表示IDR影像之經PB_C正規化之輝度,且垂直軸對應於被選以計算經重建HDR影像PB_C的任何事物,該HDR影像PB_C並未太不合理地遠離主HDR PB_C(可能並不知道而僅係假設)。在吾等的技術中,主動在後設資料中傳達PB_C_H50輝度值自身,因為其亦在解碼器的演算法中使用。Besides the difference in generating specific functions, more importantly than what can already be found in the prior art, the innovative codec structure/framework itself, it should also be mentioned that the actual communication of the second PB_C value (the highest one of the main content, in addition to the lower one of the actually communicated IDR image) is also different from the (optional) kind of characterizer that can be used in WO2016020189. Apart from the fact that the two are literally not the same, the enumerator can play a different role, and especially if the details of the framework compared to the present teaching are considered. Such a characterizer of '189 can be useful in the case where there are, for example, two upward re-scaled brightness mapping functions. It may then be useful to choose which one to use in order to obtain anything like a close reconstruction of the main HDR image on the authoring side. But such information is neither strictly needed nor necessary in the prior art. Instead of a 5000 nit reconstructed image, an upscaling function from the main HDR image can be used to obtain a 4000 or 6000 nit reconstructed image. There are two sides of the intermediate image, the downscaling function is usually one of critical image content (especially content that must be bright enough to be displayed faithfully on all PB_D displays), but the upscaling function will specifically differ in the way it specifies the regrading behavior for the very brightest objects (like headlights, sunlight reflections, etc.). These are general HDR impact purposes/effects, which however cannot be reproduced correctly in any way, because they are in the upper area of the brightness range that changes the most on displays of various capabilities. Thus, a 4000 nit PB_C reconstructed image produced from, for example, a 600 nit IDR image may have some headlights that are slightly too dim compared to their ideal brightness value (even though such a value is representable on the 4000 nit brightness range), but it will still be a fairly good looking HDR image if one simply applies, for example, a multi-linear normalized rescaling function on a [0-1]/[0-1] axis system, where the horizontal axis represents the PB_C normalized brightness of the IDR image and the vertical axis corresponds to whatever was chosen to compute the reconstructed HDR image PB_C that is not unreasonably far from the master HDR PB_C (which may not be known but only assumed). In our technique, the PB_C_H50 brightness value itself is actively communicated in the metadata, since it is also used in the decoder's algorithm.
本專利申請案的發明人想要以許多方式限制通用IDR方法,特別係環繞已於當今部署之ETSI2編碼原則及系統(IC、電視、機上盒)。The inventors of this patent application want to limit the generic IDR approach in many ways, especially around the ETSI2 coding principles and systems (ICs, TVs, set-top boxes) that are already deployed today.
許多技術考量係由發明人產生。在一方面,其等想要其等的系統與已部署的ETSI2解碼器相容。因此,若傳達例如1500尼特的IDR影像(該ETSI2解碼器不知道任何有關IDR架構原理的事物,因此假設此僅係該HDR場景的一原始HDR主影像),則應共同傳達其係圖7之進行該正確顯示調適之F_I2s函數的一F_L_IDR輝度映射函數(及根據ETSI2的所有其他顏色映射資訊)。因此,無論是否曾使用一IDR額外技術,ETSI2(亦稱為SLHDR2)解碼器應能夠正常地建立所有MDR影像上至SDR影像,且其等應(理想上)看起來如同該內容創作者所期望的。理想上,根據本原理的任何新的解碼器(其將稱為一SLHDR2PLUS 解碼器 )亦應針對IDR與SDR之間的所有影像(亦即,該IDR及SDR影像的至少一者較佳地不應與其將導致為該顏色分級者或通常內容創作者將如何想要或至少接受以看到其的該MDR影像偏離太多)精確地或至少近似地產生相同模樣。另一方面,非常重要的準則係主HDR可幾乎完美地重建(但可能對一些不知不覺中產生的次要修整錯誤,例如,當在用於傳輸之MPEG壓縮相位中DCT化IDR影像時,其將在接收側導致主HDR的非常次要的無異議的重建錯誤)。當然,可能有在主HDR之重建品質上具有一些放寬的一些系統(一些內容提供者可將IDR影像視為更重要,至少在涉及一些暫時態樣的情況中,像是廣播中或甚至單一投注至有限觀眾,而非,例如用於儲存,例如藍光光碟上),但通常涉及視訊處理鏈的至少一個主要當事人將發現主HDR影像的完美可重建性係重要的(其與從IDR開始建立某個更高動態範圍模樣的盲目嘗試區分開)。Many technical considerations were generated by the inventors. On the one hand, they wanted their system to be compatible with deployed ETSI2 decoders. Therefore, if an IDR image of, for example, 1500 nits is communicated (the ETSI2 decoder does not know anything about the principles of the IDR architecture, so it assumes that this is just an original HDR master image of the HDR scene), a F_L_IDR brightness mapping function (and all other color mapping information according to ETSI2) which is the F_I2s function of Figure 7 that does the correct display adaptation should be communicated together. Therefore, an ETSI2 (also called SLHDR2) decoder should be able to properly build all MDR images up to SDR images, regardless of whether an IDR additional technique was used, and they should (ideally) look as the content creator intended. Ideally, any new decoder according to the present principles (which will be called a SLHDR2PLUS decoder ) should also produce exactly or at least approximately the same look for all images between IDR and SDR (i.e. at least one of the IDR and SDR images should preferably not deviate too much from the MDR image which would result in how the color grader or generally content creator would want or at least accept to see it). On the other hand, a very important criterion is that the main HDR can be reconstructed almost perfectly (but there may be some minor trimming errors that are introduced unnoticed, e.g. when DCTing IDR images in the MPEG compression phase for transmission which would result in very minor unobtrusive reconstruction errors of the main HDR on the receiving side). Of course, there may be some systems that have some relaxation on the reconstruction quality of the master HDR (some content providers may consider IDR images to be more important, at least in situations involving some temporary aspect, like broadcast or even single investment to a limited audience, rather than, for example, for storage on e.g. Blu-ray disc), but usually at least one key party involved in the video processing chain will find perfect reconstruction of the master HDR image important (as distinguished from blind attempts to create some higher dynamic range appearance starting from IDR).
最後,雖然看到在不需要重新設計及重新部署ETSI2解碼器的情況下服務,必須共同傳達F_I2s函數(亦即,較佳地儘可能重新使用該SLHDR2系統的該編(解)碼電路系統,但至少包括其等之輝度及顏色映射函數的該等視訊信號仍應符合該標準化定義,使得尤其係除了一些它們不需要且可忽略的後設資料外,傳統的SLHDR2系統知道它們取得什麼),該內容分級者一般可想要指定他已建立之主HDR與其之某種對應的SDR(亦即,100尼特PB_C)版本(他可已例如使用圖4所示之一系統建立其)之間的他的輝度(及顏色)映射函數。使得F_Mt1函數(見圖10)既不係圖7的F_H2h也不係F_I2s函數,而係跨越主HDR與主SDR之間的該重新分級效果的總體性的函數(亦即,此F_Mt1將一HDR場景影像的該重新分級需求定義在該HDR場景的最不同的動態範圍表示之間)。所以需要一種技術以優雅地關連此等二種情況,特別係ETSI2框架原則中或圍繞其(例如,該SLHDR2PLUS的解碼方法產生與顯示適應該已接收IDR影像及D_I2s函數的ETSI2接收器相同的MDR影像模樣;針對時間上的各瞬間,一或多個函數部分地進行該瞬間的該輸入動態範圍影像與該瞬間的該期望輸出動態範圍影像之間的該重新分級)。Finally, although it is seen that to serve without the need to redesign and redeploy the ETSI2 decoder, the F_I2s functions must be communicated together (i.e., preferably re-using the encoding (de)coding circuitry of the SLHDR2 system as much as possible, but at least the video signals including their brightness and color mapping functions should still comply with the standardized definition so that, in particular, traditional SLHDR2 systems know what they are getting, except for some metadata that they do not need and can be ignored), the content grader may generally want to specify his brightness (and color) mapping functions between a master HDR that he has created and some corresponding SDR (i.e., 100 nits PB_C) version of it (which he may have created, for example, using one of the systems shown in Figure 4). The F_Mt1 function (see FIG. 10 ) is neither the F_H2h nor the F_I2s function of FIG. 7 , but rather a function that is global in the regrading effect between the primary HDR and primary SDR (i.e., this F_Mt1 defines the regrading requirements of an HDR scene image between the most different dynamic range representations of the HDR scene). Therefore, a technique is needed to elegantly relate these two cases, particularly in or around the ETSI2 framework principles (e.g., the SLHDR2PLUS decoding method produces the same MDR image look as an ETSI2 receiver that displays adapted to the received IDR image and the D_I2s function; for each instant in time, one or more functions partially perform the regrading between the input dynamic range image at that instant and the desired output dynamic range image at that instant).
如將於下文所見的,其可根據該等各種發明人的不同洞察而以數種方式完成,取決於所確切期望的哪個系統種類,及哪個所欲限制條件放寬得更多及哪個放寬得更少(亦將此類特定實際技術因子列入考量,像是例如,針對各種選擇將需要多少個循環或電晶體,何者可使一些選擇比其他者更係係期望的,但不必為此專利申請案鑽研該等細節)。As will be seen below, this can be done in several ways according to the different insights of the various inventors, depending on exactly which kind of system is desired, and which desired constraints are relaxed more and which are relaxed less (also taking into account such specific practical technical factors as, for example, how many loops or transistors will be needed for various choices, which may make some choices more desirable than others, but it is not necessary to delve into those details for the purpose of this patent application).
更有所有方法均使用的一些基礎的基本原則。可用圖 8 總結至少二種解決方式。接收器接收IDR影像(其應可以某種方式重建成Mster HDR影像,在PB_C = 4000尼特的此實例中),且其等亦具有函數F_I2s。但是它們必須以某種方式針對各IDR輝度發現函數F_??,以計算所需的對應經正規化且因此絕對的主HDR輝度(其將該主HDR精確重建成其原樣,仍絕不傳達其(多個)影像)。可建構可判定所需功能的新穎SLHDR2PLUS解碼器顏色變換系統(其仍至少根據其之如圖4所示的處理IC或軟體核心,具有輝度處理軌及其細節(採用子單元的至少一些者),及色度處理軌),或亦可嘗試將所有的智慧性置入編碼器中,使得可原樣使用標準的ETSI2編碼顏色變換方法(除了係其之新規劃以藉由接收此第二期望峰亮度PB_C_H50的後設資訊重建4000尼特的原始HDR的其新穎性外,其一般涉及適當的全部或部分輝度及色度處理LUT的載入),將其設定成外插而非用於比PB_IDR值更小之PB_D的顯示調適影像。二種方法及其等的實施例將需要一些通用的新技術成分,雖然落在通用的SLHDR2PLUS新編碼原則下。There are some basic fundamental principles that all methods use. At least two solutions can be summarized with Figure 8. The receiver receives IDR images (which should be reconstructed somehow into the Master HDR image, in this example with PB_C = 4000 nits), and they also have the function F_I2s. But they must somehow find the function F_?? for each IDR brightness to calculate the required corresponding normalized and therefore absolute Master HDR brightness (which reconstructs the Master HDR exactly as it was, still by no means conveying its image(s)). One could construct a novel SLHDR2PLUS decoder color conversion system that can determine the required functionality (it still has a brightness processing track and its details (using at least some of the sub-units), and a chrominance processing track, based at least on its processing IC or software core as shown in FIG. 4 ), or one could try to put all the intelligence into the encoder so that the standard ETSI2 encoded color conversion method can be used as is (except for its novelty of being newly planned to reconstruct the original HDR of 4000 nits by receiving meta-information of this second desired peak brightness PB_C_H50, which generally involves the loading of appropriate full or partial brightness and chrominance processing LUTs), setting it to interpolate rather than display adapted images for PB_D smaller than the PB_IDR value. Both methods and their implementations will require some common new technical components, although falling under the common SLHDR2PLUS new coding principles.
從圖 9 所示之該通用IDR編碼器的SLHDR2PLUS編碼器900種類的該基本構造(其將於下文更詳細地解釋),看見與正常HDR編碼的差異,特別係ETSI2 HDR視訊編碼:現在有二個峰亮度 共同編碼在該後設資料中,亦即,首先係「正常」峰亮度(其應稱為通道峰亮度PB_CH,亦即所接收之該IDR影像的該峰亮度,將任何技術用於其,亦即對該內容創作者、擁有者、或轉碼者看起來最佳的任何峰亮度位準,及用以計算IDR像素輝度的任何數學技術方法),該正常峰亮度指示所傳達及稍後接收之視訊(亦即,(多個)IDR影像)的最大可編碼輝度[此係正常ETSI2解碼器所將看到者,忽略所有其他新穎方法]。其次,但現在亦有該(多個)原始 主HDR影像的該峰亮度,亦即該內容峰亮度PB_C_H50(例如,5000尼特)。在一些實施例中,在IDR影像建立前的許多個月,該第二者的PB_C_H50可能已在該主HDR影像建立時(例如,基於攝影機擷取動作、在該電腦中等)指定,並可在通道編碼時藉由許多不同的可能方式將該PB_CH設定為至解碼器900內的一外部輸入(例如,一有線電視營運商可具有設定在記憶體中的一固定值,其可在一年度基礎上升級以反映他的消費者之HDR顯示器的該目前平均狀態,或一經最佳化PB_CH可將該視訊之至少一個影像的某種輝度或其他影像細節、或其之關聯後設資料、甚至可能特別包括用於引導稍後的IDR重新編碼的後設資料等列入考量而計算)。使一(單一)峰亮度共同傳達對HDR編碼,至少對ETSI2的該系統(其在當時僅已被視為係它們所需的唯一事物,亦即,所接收之影像的「該」峰亮度)係有用的,但鑑於傳統ETSI2顯示器的該完全透明可用性,其將如所述必須係PB_CH(否則它們不能作它們的正常顯示調適計算)。另一方面,PB_C_H50必須完全能夠計算圖8的F_??函數,且使用此函數,最終所期望的該主HDR重建影像來自該所接收的IDR影像。From the basic architecture of the SLHDR2PLUS codec 900 kind of the generic IDR codec shown in Figure 9 (which will be explained in more detail below), one sees the difference to normal HDR coding, and in particular ETSI2 HDR video coding: there are now two peak luminances encoded together in the metadata, namely first the "normal" peak luminance (which shall be called channel peak luminance PB_CH, i.e. the peak luminance of the received IDR image, using whatever technique is used for it, i.e. whatever peak luminance level looks best to the content creator, owner, or transcoder, and whatever mathematical technique is used to calculate IDR pixel luminance), which indicates the maximum encodable luminance of the transmitted and later received video (i.e. the IDR image(s)) [this is what a normal ETSI2 decoder will see, ignoring all other novel methods]. Secondly, but now also the peak brightness of the original master HDR image(s), which is the content peak brightness PB_C_H50 (e.g., 5000 nits). In some embodiments, the PB_C_H50 of the second party may have been specified when the master HDR image is created (e.g., based on camera capture action, in the computer, etc.), many months before the IDR image is created, and the PB_CH may be set as an external input to the decoder 900 during channel encoding in a number of different possible ways (e.g., a cable TV operator may have a fixed value set in memory that may be upgraded on an annual basis to reflect the current average state of his consumers' HDR displays, or once optimized the PB_CH may be calculated taking into account certain brightness or other image details of at least one image of the video, or its associated metadata, and may even specifically include metadata used to guide subsequent IDR re-encoding). Having a (single) peak brightness communicated together is useful for HDR coding, at least for the systems of ETSI2 (which at the time had been considered to be the only thing they needed, namely "the" peak brightness of the received image), but given the full transparency availability of conventional ETSI2 displays, they would have to be PB_CH as described (otherwise they could not do their normal display adaptation calculations). On the other hand, PB_C_H50 must be fully able to calculate the F_?? function of Figure 8, and using this function, the final desired main HDR reconstructed image comes from the received IDR image.
因此,立即顯示一傳統ETSI2視訊編碼資料串流之間的差異,且傳統ETSI2解碼器將不知道此額外後設資料並簡單地忽略其,因為ETSI2解碼器不必使用比它們在該後設資料中接收之PB_C_H更高的PB_C而判定任何影像,該後設資料指示它們所接收之該影像中的該最亮可能輝度(因為根據一純粹的ETSI2原則,所接收的該影像始終係最佳品質影像,事實上其係由該內容創作者創作之最高品質的主HDR影像)。但如圖11所示,一通用SLHDR2PLUS解碼器將不僅接收及讀取PB_C_H50值,且亦使用其以重建REC_M_HDR影像,其係由該內容創作者所創作之該主HDR影像的一近乎完美重建(事實上,此一解碼器將使用PB_C_H50值以從所接收的(多個)F_I2sCI函數計算所需的(多個)F_??函數)。此解碼器亦可有利地輸出較低的PB_C影像,像是例如,400尼特的PB_C MDR_300影像,但可針對比PB_CH更低之PB_C的此類影像選擇使用一標準ETSI2計算核心,或可在該新的SLHDR2PLUS計算核心的一實施例中進行該計算(但針對具有比PB_CH更高之PB之影像的準確重建,新的洞察肯定係需要的,因為不能使用ETSI2技術瑣細地完成)。Thus, the difference between a conventional ETSI2 video coded data stream is immediately apparent and conventional ETSI2 decoders will be unaware of this additional metadata and simply ignore it since ETSI2 decoders do not have to judge any image using a higher PB_C than the PB_C_H they receive in the metadata indicating the brightest possible brightness in the image they receive (since according to a pure ETSI2 principle the image received is always the best quality image and in fact it is the highest quality master HDR image created by the content creator). But as shown in Figure 11, a generic SLHDR2PLUS decoder will not only receive and read the PB_C_H50 value, but also use it to reconstruct the REC_M_HDR image, which is a near-perfect reconstruction of the master HDR image created by the content creator (in fact, such a decoder will use the PB_C_H50 value to calculate the required F_?? function(s) from the received F_I2sCI function(s)). This decoder can also advantageously output lower PB_C images, such as, for example, 400 nits PB_C MDR_300 images, but a standard ETSI2 compute core may be chosen for such images with a PB_C lower than PB_CH, or the calculation may be performed in an embodiment of the new SLHDR2PLUS compute core (but new insights are certainly needed for accurate reconstruction of images with higher PB_C than PB_CH, as this cannot be done in detail using ETSI2 techniques).
因此,設定成藉由該新技術解決的該等任務係藉由一種高動態範圍視訊編碼器(900)實現,該高動態範圍視訊編碼器經配置以經由一影像輸入(920)接收一輸入高動態範圍影像(MsterHDR),該輸入高動態範圍影像具有一第一最大像素輝度(PB_C_H50),該編碼器具有用於該第一最大像素輝度的一第一後設資料輸入(922),且該高動態範圍視訊編碼器經配置以經由一第二後設資料輸入(921)接收一主明亮度映射函數(FL_50t1),該明亮度映射函數定義該輸入高動態範圍影像的經正規化明亮度與對應的一低動態範圍影像(Im_LDR)的經正規化明亮度之間的該關係,該低動態範圍影像較佳地具有等於100尼特的一LDR最大像素輝度,其特徵在於該編碼器進一步包含一第三後設資料輸入(923)以接收一第二最大像素輝度(PB_CH),且該編碼器的進一步特徵在於其包含: - 一HDR函數產生單元(901),其經配置以施加一經標準化演算法以將該主明亮度映射函數(FL_50t1)變換成一經調適明亮度映射函數(F_H2hCI),該經調適明亮度映射函數使該輸入高動態範圍影像的經正規化明亮度相關於一中間動態範圍影像(IDR)的經正規化輝度,該中間動態範圍影像的特徵在於具有等於該第二最大像素輝度(PB_CH)的一最大可能輝度; - 一IDR影像計算單元(902),其經配置以施加該經調適明亮度映射函數(F_H2hCI)至該輸入高動態範圍影像(MsterHDR)之像素的明亮度,以獲得係此單元之輸出的該中間動態範圍影像(IDR)之像素的明亮度;及 - 一IDR映射函數產生器(903),其經配置以在該主明亮度映射函數(FL_50t1)及該經調適明亮度映射函數(F_H2hCI)的基礎上推導一通道明亮度映射函數(F_I2sCI),該通道明亮度映射函數在將該中間動態範圍影像(IDR)的該等各別經正規化明亮度給定作為輸入時將該低動態範圍影像(Im_LDR)的該等各別經正規化明亮度定義作為輸出,該等明亮度繼而對應於該輸入高動態範圍影像(MsterHDR)的各別明亮度;該編碼器的進一步特徵係具有: - 一影像輸出(930),以輸出該中間動態範圍影像(IDR); - 一第一後設資料輸出(932),以輸出該第二最大像素輝度(PB_CH); - 一第二後設資料輸出(931),以輸出該通道明亮度映射函數(F_I2sCI);及 - 一第三後設資料輸出(933),以輸出該第一最大像素輝度(PB_C_H50)。Therefore, the tasks set to be solved by the new technology are implemented by a high dynamic range video encoder (900), which is configured to receive an input high dynamic range image (MsterHDR) via an image input (920), the input high dynamic range image having a first maximum pixel brightness (PB_C_H50), the encoder having a first meta-data input (922) for the first maximum pixel brightness, and the high dynamic range video encoder is configured to receive a main brightness via a second meta-data input (921). A brightness mapping function (FL_50t1) defining the relationship between the normalized brightness of the input high dynamic range image and the normalized brightness of a corresponding low dynamic range image (Im_LDR), the low dynamic range image preferably having an LDR maximum pixel brightness equal to 100 nits, characterized in that the encoder further comprises a third meta-data input (923) for receiving a second maximum pixel brightness (PB_CH), and the encoder is further characterized in that it comprises: - an HDR function generation unit (901) configured to apply a normalized algorithm to transform the master brightness mapping function (FL_50t1) into an adapted brightness mapping function (F_H2hCI), the adapted brightness mapping function relating the normalized brightness of the input high dynamic range image to the normalized brightness of an intermediate dynamic range image (IDR), the intermediate dynamic range image being characterized by having a maximum possible brightness equal to the second maximum pixel brightness (PB_CH); - an IDR image calculation unit (902) configured to apply the adapted brightness mapping function (F_H2hCI) to the brightness of the pixels of the input high dynamic range image (MsterHDR) to obtain the brightness of the pixels of the intermediate dynamic range image (IDR) which is the output of this unit; and an IDR mapping function generator (903) configured to derive a channel brightness mapping function (F_I2sCI) based on the master brightness mapping function (FL_50t1) and the adapted brightness mapping function (F_H2hCI), the channel brightness mapping function defining as output the respective normalized brightnesses of the low dynamic range image (Im_LDR) when the respective normalized brightnesses of the intermediate dynamic range image (IDR) are given as input, the respective brightnesses in turn corresponding to the respective brightnesses of the input high dynamic range image (MsterHDR); the encoder is further characterized by having: - an image output (930) to output the intermediate dynamic range image (IDR); - a first meta-data output (932) to output the second maximum pixel brightness (PB_CH); - a second meta-data output (931) to output the channel brightness mapping function (F_I2sCI); and - a third meta-data output (933) to output the first maximum pixel brightness (PB_C_H50).
首先注意到雖然在概念上顯示用於本編碼器之各所需資料項的一分開輸入,實務上,具有通常知識的讀者理解此等輸入(且類似地對於輸出)的一或多者可係相同的,取決於一視訊輸入技術可處置什麼(例如,一些較早的HDMI影像輸入不能處置該動態變化-亦即,對於各時間上連續的視訊影像可能不同-的主明亮度映射函數(FL_50t1),在該情形中,資料可透過例如Wi-Fi連接等以一同步方式而傳達)。各種輸入資料如何輸入亦可取決於它們於何處產生,亦即,編碼器連接於其中或連接至的哪個其他系統(其可取決於在事件之攝影機擷取的同一時間是否期望即時編碼,或稍後針對某個視訊通訊系統編碼,像是例如,有線電視分配系統,其在任何稍後時間接收來自原始內容創作者的所有資料,以鑑於此特定有線電視系統的限制或需求最佳地分配其等)。First note that although conceptually a separate input is shown for each required data item for the present encoder, in practice the reader of ordinary skill understands that one or more of these inputs (and similarly for the outputs) may be the same, depending on what a video input technology can handle (e.g., some older HDMI video inputs cannot handle the dynamically changing - i.e., master brightness mapping function (FL_50t1) which may be different for each temporally consecutive video image, in which case the data may be communicated in a synchronized manner via, for example, a Wi-Fi connection). How the various input data are input may also depend on where they originate, i.e., which other system the encoder is connected to or into (which may depend on whether it is desired to encode in real time at the same time as the camera captures the event, or to encode later for some video communications system, such as, for example, a cable television distribution system which receives all the data from the original content creator at any later time in order to best distribute it given the limitations or requirements of this particular cable television system, etc.).
在沒有故意限制的情況下,可假設MsterHDR影像在此之前已由使用一電腦上的顏色分級軟體的一人類顏色分級者而分級,且他已定義FL_50t1函數,該函數定義一對應的較低動態範圍影像,一般係一100尼特的SDR影像(雖然該重新分級影像頻譜的目前最低端根據標準協議係一100尼特PB_C影像,所以似乎不可能改變,三者的所述最低影像,亦即該LDR影像,在未來實施例中可具有不恰係100尼特的一LDR最大輝度,但可能係100尼特的k倍數,例如較佳係k係至多3x,亦即在本系統之實施例實現中的該LDR最大輝度係300尼特),對應於該MsterHDR影像(將可觀的較低輝度動態範圍列入考量,該SDR影像較佳地看起來儘可能地類似於該MsterHDR影像),其依需要一般至少合理地傳達用於在視覺上最佳地訴說(例如,電影故事)的期望模樣(不同的視訊應用程式亦可具有不同需求,諸如不同的顏色準則,可能涉及FL_50t1函數上的不同技術限制)。Without intending to limit, it may be assumed that the MasterHDR image has previously been graded by a human color grader using color grading software on a computer, and that he has defined the FL_50t1 function which defines a corresponding lower dynamic range image, typically a 100 nit SDR image (although the current lowest end of the regraded image spectrum is a 100 nit PB_C image according to standard protocol, so this does not appear likely to change, and the lowest of the three, i.e. the LDR image, may in future embodiments have an LDR maximum luminance that is not exactly 100 nits, but perhaps 1 00 nits, for example preferably k is at most 3x, i.e. the maximum brightness of the LDR in the embodiment of the present system is 300 nits), corresponding to the MasterHDR image (taking into account the considerable lower brightness dynamic range, the SDR image preferably looks as similar to the MasterHDR image as possible), which generally conveys the desired look for best visually telling (e.g., a movie story) as needed (different video applications may also have different requirements, such as different color criteria, which may involve different technical limitations on the FL_50t1 function).
PB_CH值與其他後設資料有一些不同,在於其事實上係用於中間動態範圍編碼的設定。因此,其可能或可能不來自分級者。其可係例如用於特定視訊編碼系統(比方說,例如,衛星廣播系統)的固定值,其可從例如附接至編碼器或編碼器中的固定記憶體提取。在基於網際網路的遞送中,此PB_CH值可依該IDR影像係針對其產生的一最終消費者的需求傳達。例如,具有一不良品質行動顯示器的一消費者可僅請求一500尼特PB_IDR影像以由在該網際網路之其他側上的一伺服器計算,例如按需求視訊公司的伺服器,而一些其他消費者可要求一1000尼特PB_IDR版本,且在此一情形中,所請求的PB_CH=PB_IDR將輸入該編碼器中。The PB_CH value is somewhat different from other metadata in that it is in fact a setting for the intermediate dynamic range encoding. Therefore, it may or may not come from the grader. It may be, for example, a fixed value for a specific video coding system (such as, for example, a satellite broadcasting system), which may be extracted from, for example, a fixed memory attached to or in the encoder. In Internet-based delivery, this PB_CH value may be communicated on demand to an end consumer for whom the IDR image is produced. For example, a consumer with a poor quality mobile display may only request a 500 nit PB_IDR image to be calculated by a server on the other side of the Internet, such as a video on demand company's server, while some other consumer may request a 1000 nit PB_IDR version, and in this case the requested PB_CH=PB_IDR would be input into the encoder.
所以,而在該編碼側有一最高品質(事實上,最高PB_C)的MsterHDR影像,此並非接收器(互補解碼器)將接收之影像,而係該IDR影像(且它們將需要藉由計算REC_M_HDR影像而緊密地重建MsterHDR影像)。該等技術藉由將任何事物公式化成在0.0至1.0明亮度中正規化而最佳地實現。事實上,當談論一明亮度映射函數時,此實際上亦等效於一輝度映射函數(因為明亮度與它們對應輝度(例如,一般待顯示的輝度)之間的關係),但技術上嚴格地說,吾等計算使用明亮度映射函數運作較佳,且較佳地以經視覺心理均勻化的明亮度定義,如藉由飛利浦v函數所計算的(見方程式1以及方程式2)。So, while on the encoding side there is a highest quality (in fact, highest PB_C) MsterHDR image, this is not the image the receiver (complementary decoder) will receive, but the IDR image (and they will need to closely reconstruct the MsterHDR image by computing the REC_M_HDR image). These techniques are best implemented by formulating everything to be normalized in 0.0 to 1.0 luminance. In fact, when we talk about a brightness mapping function, this is actually also equivalent to a brightness mapping function (because of the relationship between brightness and their corresponding brightness (e.g., the brightness generally to be displayed)), but technically speaking, our calculations work better using a brightness mapping function, and are preferably defined in terms of visually psychologically uniform brightness, as calculated by the Philips v function (see Equation 1 and Equation 2).
如上文所解釋的,吾等之處置HDR視訊的方法(特別係不僅編碼在特定峰亮度之不同動態範圍中的單一、或二個不同重新分級影像,且係編碼對應的不同DR重新分級的整體頻譜)係有關相關各種可能的經正規化明亮度,此類至少二個可相關影像的像素可具有例如對應於image_2中的0.4之在image_1中的0.2等。此係該明亮度映射函數所定義的,在一種情況(亦即,一種重新分級)與任何其他經選擇的不同情況之間。As explained above, our approach to processing HDR video (specifically encoding not only a single, or two different re-graded images in different dynamic ranges of a particular peak brightness, but also encoding the entire spectrum of the corresponding different DR re-grades) is to correlate various possible normalized brightnesses that a pixel of at least two correlated images may have, for example, 0.2 in image_1 corresponding to 0.4 in image_2, etc. This is defined by the brightness mapping function, between one case (i.e., one re-grade) and any other chosen different case.
使用標準化演算法,意謂著必須有某種固定方式以將一第一組可能函數(其可具有許多不同形狀及定義)相關於一第二組對應函數。亦即,這僅意謂著一些通訊技術(或甚至所有技術)中,該編碼器及該解碼器之該設計者已定義獨特地指定如何將任何輸入函數的該形狀(一般在正規化至1.0的軸上)變換至該輸出函數之該形狀的一方法。可有各種此類演算法,因此原則上該編解碼器設計者可決定他可能想要傳達給解碼器等的任何此類演算法的順序數-例如,約定演算法數目3,但正常不需要如此複雜,因為吾等之方法將僅藉由預約定(pre-agree)一個固定標準化函數變換演算法而完美且最簡單地運作,例如在下文之支援數學中的一者。Using a standardized algorithm means that there must be some fixed way to relate a first set of possible functions (which can have many different shapes and definitions) to a second set of corresponding functions. That is, it simply means that in some communications technologies (or even all technologies), the designers of the encoder and the decoder have defined a method that uniquely specifies how to transform the shape of any input function (usually on the axis normalized to 1.0) to the shape of the output function. There may be a variety of such algorithms, so in principle the codec designer can decide the order number of any such algorithm he may want to communicate to the decoder etc - for example, agreeing on algorithm number 3, but normally there is no need to be so complicated as our method will work perfectly and most simply by pre-agreeing on a fixed normalized function transformation algorithm, such as the one in the supporting mathematics below.
對使讀者快速瞭解,下文將係此一演算法的一簡單實例。假設該輸入函數係冪函數:power(x_in; P),則演算法可推導對應函數power(x_in; P-1)。藉由反轉,當接收到該等對應函數(藉由該+1演算法)時,該等原始函數亦可再次重新推導。To give the reader a quick overview, here is a simple example of this algorithm. Assuming the input function is the power function: power(x_in; P), the algorithm can derive the corresponding function power(x_in; P-1). By inversion, when receiving the corresponding functions (via the +1 algorithm), the original functions can also be re-derived again.
不應誤解為該標準化演算法本身一般不傳達至接收器,僅係該所得輸出對應函數。此係為何將其標準化,或預約定,亦即必須係固定的,使得該解碼器可知道在已在該編碼側發生什麼。約定此的方式並不如此相關於瞭解所專利化的技術。例如,可有5種不同的固定演算法,且一有線電視營運商可決定以演算法3編碼,並將對應設定供應給他的消費者機上盒以解碼固定演算法3(即使該STB可在某些情況下針對一些其他視訊通訊重設至,例如演算法4;但演算法變化通常將不會係必要的,雖然在用於不同有線電視通道之PB_CH上的變化,例如,可係感興趣的)。It should not be misunderstood that the standardized algorithm itself is generally not communicated to the receiver, only the resulting output correspondence function. This is why it is standardized, or pre-agreed, i.e. must be fixed, so that the decoder can know what has happened on the encoding side. The way this is agreed upon is not so relevant to understanding the patented technology. For example, there may be 5 different fixed algorithms, and a cable TV operator may decide to encode with algorithm 3, and supply his consumer set-top boxes with corresponding settings to decode fixed algorithm 3 (even though the STB may in certain cases be reset to, for example, algorithm 4 for some other video communication; but algorithm changes will generally not be necessary, although changes on the PB_CH used for different cable TV channels, for example, may be of interest).
亦應小心注意一般不將該對應經調適明亮度映射函數F_H2hCI傳達至該等接收器的事實,但隨後傳達另一可進一步推導的通道明亮度映射函數(F_I2sCI),且該解碼器亦需要以某種方式反轉此雙重推導。事實上,將該總重新分級映射分成二個部分,因此若該第一部分經標準化,該第二部分亦可被定義,所以藉由解碼器之此IDR編碼的該反轉可視為係大概可能的(雖然困難)(使新的SLHDR2PLUS編解碼器的架構及正確作用變得可能)。Careful attention should also be paid to the fact that the corresponding adapted brightness mapping function F_H2hCI is not generally communicated to the receivers, but then another channel brightness mapping function (F_I2sCI) is communicated which can be further derived, and the decoder also needs to invert this double derivation in some way. In fact, the total re-rating map is split into two parts, so if the first part is standardized, the second part can also be defined, so the inversion of this IDR encoding by the decoder can be considered to be probably possible (although difficult) (making the architecture and correct functioning of the new SLHDR2PLUS codec possible).
已進一步以圖 24 說明經標準化可編碼峰亮度相依函數變化演算法的此概念。因此在左方看到由該分級者或內容創作側重新分級函數判定自動化設計的各種FL_50t1函數的發生。例如,FL_50t1_1可已針對有重要動作發生之在其中之有相當深之黑色的一HDR場景判定,其必須係可觀地明亮化以在低動態範圍顯示器上仍足夠可見,但該等最亮部分不係如此關鍵的-像是例如,街燈-並可以任何顯示器上的一單一幾乎最大白色輝度表示(或因此計算的影像,亦即恰含有如其等待在該顯示器上演現的該等絕對輝度,或更精確地說,通常係編碼該等輝度的該等明亮度碼)。對比地,FL_50t1_2係針對影像或含有另一類型的HDR場景的連續影像的鏡頭建立,其中有一重要的較低亮度物體(或更精確地說,區域)及一較高亮度物體二者,其已在「中間」的任一側上導致經特別調諧的重新分級曲線形狀。FL_50t1_3係藉由HDR函數產生單元901施加至該標準化演算法的又另一可能的輸入函數,其可例如針對具有非常明亮內容及一些局部較暗區域的一日間場景(及不過高PB_C的主影像)發生,像是例如,在印度的一室外場景中進入一寺廟。This concept of a normalized encodable peak brightness dependent function change algorithm has been further illustrated with FIG . 24 . Thus on the left one sees the occurrence of various FL_50t1 functions designed automatically by the grader or content author side regrading function decision. For example, FL_50t1_1 may have been decided for an HDR scene with significant action taking place in it with fairly deep blacks, which must be noticeably brightened to still be visible enough on a low dynamic range display, but where the brightest parts are not so critical - like e.g. street lights - and can be represented by a single nearly maximum white luminance on any display (or so the image is calculated, i.e. containing exactly those absolute luminances as it is waiting to be rendered on the display, or more precisely, the luminance codes that typically encode those luminances). In contrast, FL_50t1_2 is established for a shot of an image or succession of images containing another type of HDR scene, where there is both a significant lower brightness object (or more precisely, area) and a higher brightness object, which has resulted in specially tuned regrading curve shapes on either side of the "middle". FL_50t1_3 is yet another possible input function applied to the normalization algorithm by the HDR function generation unit 901, which may occur, for example, for a day scene (and a main image with no excessively high PB_C) with very bright content and some local darker areas, like, for example, entering a temple in an outdoor scene in India.
單元901將針對該等三種情況的任一者,及所有其他百萬種情況,判定一輸出函數。此演算法的性質係此函數將類似地成形,但更接近該對角線(因為若該原始函數表示在,例如,X尼特PB_C影像對應的(在能力允許的程度上合理地類似模樣)Y尼特PB_C2影像(姑且說100尼特的影像)之間的重新分級,然後以,例如,在X與Y之間一半的Z從X重新分級至Z尼特PB_C影像將涉及類似的重新分級,但程度較小;若從X映射至X,將具有對應於對角線的恆等變換)。Unit 901 will determine an output function for any of these three cases, and all the other million cases. The nature of the algorithm is that the function will be similarly shaped, but closer to the diagonal (because if the original function represented a re-grading between, say, an X nits PB_C image corresponding to a (reasonably similar to the extent capabilities allow) Y nits PB_C2 image (let's say an image of 100 nits), then re-grading from an X to a Z nits PB_C image with, say, Z halfway between X and Y will involve a similar re-grading, but to a lesser extent; if mapping from X to X, there will be a constant transformation corresponding to the diagonal).
有數個方式,其中一者可定義此一標準化演算法以獨特地獲得對應於該等各別輸入函數的輸出F_H2hCI_1、F_H2hCI_2、及F_H2hCI_3輝度映射函數,且其之細節不真的形成本發明的基本要素,除了必須具有如此行為之可得的一些經標準化演算法的事實。例如,一般可定義一些度量(在所選的PB_C_CH IDR影像可編碼最大輝度上量化PB_C_CH相依性),其可用於以某種方式(例如,大的等步伐、或非均勻地等)用於使任何經正規化輸入輝度之該輸入函數的點y(x)朝向對角線偏移。雖然亦可垂直地偏移,下文詳述之相當良好地運作的較佳實施例在正交於從[0,0]至[1,1]之該對角線的軌跡上偏移此類函數點。There are several ways in which one can define such a normalization algorithm to uniquely obtain the output F_H2hCI_1, F_H2hCI_2, and F_H2hCI_3 brightness mapping functions corresponding to the respective input functions, and the details of which do not really form an essential element of the invention, except for the fact that there must be some normalized algorithm that behaves in this way available. For example, some metric can generally be defined (quantifying the PB_C_CH dependency at the maximum brightness that can be encoded for the selected PB_C_CH IDR image) that can be used to shift the points y(x) of the input function towards the diagonal for any normalized input brightness in some way (e.g., large equal steps, or non-uniformly, etc.). Although vertical offsets are also possible, the preferred embodiment described below, which works quite well, offsets such function points on a trajectory orthogonal to the diagonal line from [0,0] to [1,1].
該高動態範圍視訊編碼器(900)的一個有利實施例的特徵在於該HDR函數產生單元(901)的該經標準化演算法朝向該主明亮度映射函數(FL_50t1)的該對角線施加一壓縮以獲得該經調適明亮度映射函數(F_H2hCI),該壓縮涉及以一比例因子按比例調整該函數的所有輸出明亮度值,該比例因子取決於該第一最大像素輝度(PB_C_H50)及該第二最大像素輝度(PB_CH)。An advantageous embodiment of the high dynamic range video encoder (900) is characterized in that the standardized algorithm of the HDR function generation unit (901) applies a compression towards the diagonal of the main brightness mapping function (FL_50t1) to obtain the adapted brightness mapping function (F_H2hCI), the compression involving scaling all output brightness values of the function by a scaling factor, the scaling factor being dependent on the first maximum pixel luminance (PB_C_H50) and the second maximum pixel luminance (PB_CH).
可有各種經定義F_L50t1函數(下文的para定義係一個實例),且其等可藉由該經標準化演算法以各種方式按比例調整,但一般將有按比例調整涉及,且此按比例調整取決於起始PB_C_H50及目標值PB_CH=PB_IDR。此可藉由不同度量完成,但申請人這些年已發現易於基於在視覺心理上均勻的值及藉由將其等發送通過該v函數的該等峰亮度的比率而定義該比例因子,亦即基於對應於該二個峰亮度(及可能地該SDR影像的第三PB_C)之v函數明亮度輸出而定義一比例因子。There may be various defined F_L50t1 functions (the para definition below is one example), and they may be scaled in various ways by the standardized algorithm, but generally there will be scaling involved, and this scaling depends on the starting PB_C_H50 and the target value PB_CH=PB_IDR. This may be done by different metrics, but the applicants have found over the years that it is easy to define the scaling factor based on a ratio of the peak luminances that are visually psycho-uniform values and by sending them through the v function, i.e. define a scaling factor based on the v function brightness output corresponding to the two peak luminances (and possibly the third PB_C for the SDR image).
該高動態範圍視訊編碼器(900)的一個有利實施例包含:一限制器(1804),其經配置以針對包含等於1.0之最亮的該經正規化明亮度的該等經正規化明亮度的一子範圍重新判定該通道明亮度映射函數(F_I2sCI)的一斜率。此對許多實施例並非必要的,但特別係處置在ETSI2中標準化之para的該等高亮度增益的編碼HG_COD之特定選擇的有用方式,以完全順應於此特定實施例有用的所有者。An advantageous embodiment of the high dynamic range video encoder (900) comprises: a limiter (1804) configured to redefine a slope of the channel brightness mapping function (F_I2sCI) for a sub-range of the normalized luminances including the brightest normalized luminance equal to 1.0. This is not essential for many embodiments, but is in particular a useful way of handling the particular choice of the coding HG_COD of the high brightness gains of para standardized in ETSI2 to fully comply with the useful proprietor of this particular embodiment.
對該編碼器的一種對應鏡像技術(事實上,藉由能夠重新推導所有需要的資訊(即使此類資料未實際傳達)而取消所有編碼處理)係一種高動態範圍視訊解碼器(1100),其具有一影像輸入(1110)以接收一中間動態範圍影像(IDR),該中間動態範圍影像具有藉由較佳係0.8或更小的一乘法因子而小於一主高動態範圍影像(MsterHDR)的一第一最大像素輝度(PB_C_H50)的一第二最大像素輝度(PB_CH),該第二最大像素輝度(PB_CH)經由一第二後設資料輸入(1112)接收,該解碼器具有一第一後設資料輸入(1111)以接收一明亮度映射函數(F_I2sCI),該明亮度映射函數定義該中間動態範圍影像(IDR)的所有可能的經正規化明亮度至一LDR最大像素輝度低動態範圍影像(Im_LDR)之對應經正規化明亮度的該變換,該解碼器的特徵在於其具有一第三後設資料輸入(1113)以接收該第一最大像素輝度(PB_C_H50),且該解碼器包含: - 一輝度函數判定單元(1104),其經配置以施加一經標準化演算法以將該明亮度映射函數(F_I2sCI)變換成一解碼明亮度映射函數(F_ENCINV_H2I),該解碼明亮度映射函數針對該中間動態範圍影像(IDR)的一像素之任何可能的輸入經正規化明亮度將該主高動態範圍影像(MsterHDR)的一對應經正規化HDR明亮度指定為輸出,該經標準化演算法使用該第一最大像素輝度(PB_C_H50)及該第二最大像素輝度(PB_CH)的該等值;及 - 一顏色變換器(1102),其經配置以將該解碼明亮度映射函數(F_ENCINV_H2I)連續地施加至該中間動態範圍影像(IDR)之經輸入的經正規化明亮度,以獲得一經重建主HDR影像(REC_M_HDR)之像素的經正規化經重建明亮度(L_RHDR);該解碼器進一步具有一影像輸出(1120)以輸出該經重建主HDR影像(REC_M_HDR)。該LDR最大輝度再次較佳地係該經標準化的100尼特SDR輝度,雖然可設想所部署之在其中該重新分級影像頻譜的該低(亦即,最低)動態範圍(亦即,最大輝度)影像及其通訊係例如200尼特影像之類似運作的未來變化。A corresponding mirroring technique for the encoder (in fact, eliminating all encoding processing by being able to re-derive all required information (even if such data is not actually communicated)) is a high dynamic range video decoder (1100) having an image input (1110) for receiving an intermediate dynamic range image (IDR) having a second maximum pixel brightness (PB_CH) that is smaller than a first maximum pixel brightness (PB_C_H50) of a main high dynamic range image (MsterHDR) by a multiplication factor preferably of 0.8 or less, the second maximum pixel brightness (PB_CH) being obtained by a second meta-data input (1112) receiving, the decoder having a first meta-data input (1111) receiving a brightness mapping function (F_I2sCI), the brightness mapping function defining the transformation of all possible normalized brightnesses of the intermediate dynamic range image (IDR) to the corresponding normalized brightness of an LDR maximum pixel brightness low dynamic range image (Im_LDR), the decoder is characterized in that it has a third meta-data input (1113) receiving the first maximum pixel brightness (PB_C_H50), and the decoder comprises: - a brightness function determination unit (1104) configured to apply a normalized algorithm to transform the brightness mapping function (F_I2sCI) into a decoded brightness mapping function (F_ENCINV_H2I), the decoded brightness mapping function specifies as output a corresponding normalized HDR brightness of the master high dynamic range image (MsterHDR) for any possible input normalized brightness of a pixel of the intermediate dynamic range image (IDR), the normalized algorithm using the values of the first maximum pixel brightness (PB_C_H50) and the second maximum pixel brightness (PB_CH); and - a color converter (1102) configured to continuously apply the decoded brightness mapping function (F_ENCINV_H2I) to the input normalized brightness of the intermediate dynamic range image (IDR) to obtain normalized reconstructed brightness (L_RHDR) of pixels of a reconstructed master HDR image (REC_M_HDR); the decoder further having an image output (1120) to output the reconstructed master HDR image (REC_M_HDR). The LDR maximum brightness is again preferably the standardized 100 nit SDR brightness, although future variations are envisioned where similar operation is deployed in which the low (i.e., lowest) dynamic range (i.e., maximum brightness) image of the regraded image spectrum and its communication is, for example, a 200 nit image.
所以,MsterHDR影像實際上並未作為一影像接收,但其仍由所 接收的該資料獨特地定義(所以,雖然形式上,此MsterHDR影像係存在於對應的匹配解碼器地點的對應主影像,且該解碼器僅從其接收的IDR影像重建幾乎相同的REC_M_HDR影像,各種函數甚至在任何解碼地點確實定義MsterHDR影像性質)。不同的消費者可選擇PB_C_H50及PB_IDR二者的各種值。第一者可由該內容創作者針對各種原因選擇,諸如,例如因為他購買4000尼特的分級監視器、或因為他喜歡給予他的主內容某種最佳品質(例如,以不少於10,000尼特的PB_C建立/定義所有事物)、或因為至少根據該創作者,某些類型的影像要求某種品質,亦即,PB_C_H50(例如,壯觀的煙火表演或燈光表演或流行演唱會可比,例如,合理地均勻發光的網球比賽或新聞閱讀值得更高的PB_C_H50)。So, the MsterHDR image is not actually received as an image, but it is still uniquely defined by the data received (so, although formally, this MsterHDR image exists in the corresponding matching decoder location with the corresponding host image, and the decoder just reconstructs a nearly identical REC_M_HDR image from the IDR image it receives, various functions do define the MsterHDR image properties even at any decoding location). Different consumers may choose various values for both PB_C_H50 and PB_IDR. The first may be chosen by the content creator for various reasons, e.g. because he buys a 4000 nits rated monitor, or because he likes to give a certain optimal quality to his main content (e.g. everything is established/defined with a PB_C of no less than 10,000 nits), or because, at least according to the creator, certain types of images require a certain quality, namely PB_C_H50 (e.g. a spectacular firework show or light show or a pop concert comparable to, for example, a reasonably evenly lit tennis match or news reading deserves a higher PB_C_H50).
PB_IDR值可基於不同的技術考量選擇,例如視訊通訊公司之一般消費者的計價,且如所述,該通訊公司通常可與該創作公司不同。The PB_IDR value may be chosen based on various technical considerations, such as the pricing of a typical consumer to a video communications company, and as mentioned, the communications company may typically be different than the creative company.
一般而言,產生在PB_C上的差異小於至少20%的重新分級IDR內含係不太合理的(亦即,因子0.8,雖然原則上PB_C之值可更接近,例如0.9),但通常更一般地在PB_C之間將有乘法因子2或更多(例如,以低於1000尼特(例如,800、700、或600尼特)且一般高於500尼特的某個PB_CH發送的2000尼特主材料)。在解碼地點的PB_C_H50一般類似於其他後設資料,且尤其係PB_CH值,所以一般將其接收為與視訊資料關聯的後設資料,例如,非限制性SEI訊息、或視訊通訊協定上的特殊封包等(無論係在一個邏輯資料結構或數個結構中,根據最佳地適於各經標準化或非標準視訊通訊協定的事物,此係本新技術的次要細節)。因為解碼器使用經標準化演算法以最終來到IDR影像及其之ETSI2順應後設資料,對應的經標準化演算法可針對其最終判定REC_M_HDR影像像素明亮度之重建所需的F_ENCINV_H2I明亮度映射函數的解碼器設計或設計在該解碼器中(然後進一步使用此影像完成任何事物,顯示其係典型應用,但例如儲存在硬碟記憶體上係另一應用)。In general, it is not reasonable to regrade IDR content that results in a difference in PB_C of less than at least 20% (i.e., a factor of 0.8, although in principle the value of PB_C could be closer, such as 0.9), but it is more common to have a multiplication factor of 2 or more between PB_Cs (e.g., 2000 nit main material sent at a PB_CH that is lower than 1000 nits (e.g., 800, 700, or 600 nits) and generally higher than 500 nits). The PB_C_H50 at the decoding location is generally similar to other metadata, and in particular the PB_CH value, so it is generally received as metadata associated with the video data, such as a non-restrictive SEI message, or a special packet on a video communication protocol, etc. (whether in one logical data structure or several structures, according to what is best suited to each standardized or non-standard video communication protocol, which is a minor detail of the new technology). Since the decoder uses a standardized algorithm to ultimately come to an IDR image and its ETSI2 compliant metadata, a corresponding standardized algorithm can be designed for or in the decoder which ultimately determines the F_ENCINV_H2I brightness mapping function needed for the reconstruction of the REC_M_HDR image pixel brightness (and then further doing anything with this image, displaying it is a typical application, but storing on hard disk memory for example is another application).
該高動態範圍視訊解碼器(1100)的一個感興趣的實施例的特徵在於該輝度函數判定單元(1104)的該經標準化演算法計算其取決於該第一最大像素輝度(PB_C_H50)及該第二最大像素輝度(PB_CH)的一比例因子。如所提及的,此可對應地以各種方式完成,但在視覺心理上均勻的基於v函數的比例因子實務上相當易用於受良好控制的HDR影像處置,並符合各種甚至關鍵的藝術要求,而同時保持技術複雜度受控制。An interesting embodiment of the high dynamic range video decoder (1100) is characterized in that the standardized algorithm of the brightness function decision unit (1104) calculates a scaling factor which depends on the first maximum pixel brightness (PB_C_H50) and the second maximum pixel brightness (PB_CH). As mentioned, this can be done in various ways accordingly, but a visually psychologically uniform v-function-based scaling factor is practically quite easy to use for well-controlled HDR image processing and meets various, even critical, artistic requirements, while keeping the technical complexity under control.
該高動態範圍視訊解碼器(1100)的一個有用實施例具有由一明亮度映射定義的該明亮度映射函數(F_I2sCI),該明亮度映射由具有用於一黑暗經正規化明亮度範圍的一第一斜率(SG_gr)的一第一線性區段、具有用於一明亮經正規化明亮度範圍的一第二斜率(HG_gr)的一第二線性區段、及用於在該二個範圍之間之明亮度的一拋物線區段組成。對應的數學除了其他事物之外涉及解二階方程式以獲得用於進行該重建所需要的該經通道調適高亮度增益。此係有用的一階HDR重新分級方法,其適用於具有非最高像素顏色控制需求的市場,諸如例如現實生活的電視廣播(如對比於有時涉及在例如賣座電影中的例如詳細顏色控制)。如下文所提及的,此在一些進一步劃分的實施例中可係完全定義F_L50t1函數及所有可推導函數(例如,與IDR影像一起傳達的函數:F_I2S)的單獨分量,但其亦可係該重新分級函數的一部分定義,例如,如圖4所說明地定義全部的重新分級連同一可客製化函數。A useful embodiment of the high dynamic range video decoder (1100) has the brightness map function (F_I2sCI) defined by a brightness map consisting of a first linear segment with a first slope (SG_gr) for a dark normalized brightness range, a second linear segment with a second slope (HG_gr) for a bright normalized brightness range, and a parabolic segment for brightnesses between the two ranges. The corresponding mathematics involves, among other things, solving second order equations to obtain the channel-adapted high brightness gains required for the reconstruction. This is a useful first order HDR re-grading method that is applicable to markets with non-highest pixel color control requirements, such as, for example, real life television broadcasts (as opposed to, for example, detailed color control sometimes involved in, for example, blockbuster movies). As mentioned below, this in some further partitioned embodiments may be a separate component that fully defines the F_L50t1 function and all derivable functions (e.g., the function communicated with the IDR image: F_I2S), but it may also be a partial definition of the rescaling function, e.g., defining the entire rescaling together with a customizable function as illustrated in FIG. 4 .
該高動態範圍視訊解碼器(1100)的一個有用實施例具有其之顏色變換器(1102),該顏色變換器經配置以計算具有一最大像素輝度(PB_MDR)的一中等動態範圍影像(MDR_300)的像素明亮度,該最大像素輝度不等於該等值100尼特、該第一最大像素輝度(PB_C_H50)、及該第二最大像素輝度(PB_CH),且該解碼器具有用於輸出該中等動態範圍影像(MDR_300)的一影像輸出(1122)。雖然REC_M_HDR影像的重建可係一些子市場中的一些設備所需的一切(有所有類型的其他變換施加在該重建影像上的可能),若吾等的SLHDR2PLUS解碼器除了僅重建主HDR影像外,亦可使用其他PB_C計算對應影像(例如,可直接顯示在具有任何PB_C的某種顯示器上的MDR影像)係有利的。此亦將例如以圖16說明的方式或任何等效方式使用本發明的數學原理。A useful embodiment of the high dynamic range video decoder (1100) has a color converter (1102) thereof, which is configured to calculate the pixel brightness of a medium dynamic range image (MDR_300) having a maximum pixel brightness (PB_MDR), the maximum pixel brightness being not equal to the equivalent value of 100 nits, the first maximum pixel brightness (PB_C_H50), and the second maximum pixel brightness (PB_CH), and the decoder has an image output (1122) for outputting the medium dynamic range image (MDR_300). While the reconstruction of a REC_M_HDR image may be all that is needed for some devices in some submarkets (with all kinds of other transformations possible applied to the reconstructed image), it would be advantageous if our SLHDR2PLUS decoder could, in addition to reconstructing only the main HDR image, also use other PB_Cs to calculate the corresponding image (e.g. an MDR image that can be directly displayed on some display with any PB_C). This would also use the mathematical principles of the invention, for example in the manner illustrated in FIG. 16 or in any equivalent manner.
該高動態範圍視訊解碼器(1100)的另一有用實施例具有一後設資料輸出(1121),該後設資料輸出用於輸出一明亮度映射函數(F_L_subsq),該明亮度映射函數針對該經重建主HDR影像(REC_M_HDR)或替代地該中等動態範圍影像(MDR_300)的所有經正規化明亮度定義具有另一最大像素輝度的一影像的對應明亮度,此另一最大像素輝度較佳地係100尼特,或高於或低於各別該經重建主HDR影像(REC_M_HDR)或替代地該中等動態範圍影像(MDR_300)的該最大輝度值的一值。其可能係將該已接收IDR影像重建成不直接顯示在一基本型監視顯示器上的REC_M_HDR影像,但發送至進行進一步色度計算的某種系統。然後該解碼器實施例亦可輸出一合適的明亮度映射函數係有用的,亦即,一般意謂著與其正被輸出之該影像關聯的一明亮度映射函數,例如REC_M_HDR影像(一般與所定義之該函數的該輸入經正規化明亮度係共同輸出之影像的經正規化明亮度的意義關聯,且該函數的輸出係某個參考影像(常係SDR影像)的經正規化明亮度,當將其標準化成具有PB_C= 100尼特時,其一般係HDR年代中所將希望的最低品質,此不排除某人可能想要以用於定義係,例如80或50尼特的共同傳達函數之輸出縱座標的PB_C施加本教示)。Another useful embodiment of the high dynamic range video decoder (1100) has a meta-data output (1121) for outputting a brightness mapping function (F_L_subsq) which defines, for all normalized brightnesses of the reconstructed main HDR image (REC_M_HDR) or, alternatively, the medium dynamic range image (MDR_300), a corresponding brightness of an image having another maximum pixel brightness, which is preferably 100 nits, or a value higher or lower than the maximum brightness value of the respective reconstructed main HDR image (REC_M_HDR) or, alternatively, the medium dynamic range image (MDR_300). It is possible to reconstruct the received IDR image into a REC_M_HDR image that is not directly displayed on a basic monitoring display, but sent to some system that performs further colorimetric calculations. It is then useful that the decoder embodiment may also output a suitable brightness mapping function, i.e., generally meaning a brightness mapping function associated with the image it is being output, such as a REC_M_HDR image (generally associated in the sense that the input normalized brightness of the function being defined is the normalized brightness of the image being output together, and the output of the function is the normalized brightness of a reference image (usually an SDR image) which, when normalized to have PB_C = 100 nits, is generally the minimum quality that would be desired in the HDR era, without excluding the fact that one might want to apply the present teachings with a PB_C for defining the output vertical coordinate of the commonly transmitted function to be, for example, 80 or 50 nits).
針對設備(或設備的一部分或聚集)制訂的任何事物可等效地制訂成信號、包含影像的記憶體產品(諸如,藍光光碟)、方法等,例如:Anything specified for a device (or a portion or collection of devices) may be equivalently specified as a signal, a memory product containing an image (e.g., a Blu-ray disc), a method, etc., for example:
一種具有一第一最大像素輝度(PB_C_H50)的一經接收輸入高動態範圍影像(MsterHDR)之高動態範圍視訊編碼的方法,其包含接收一主明亮度映射函數(FL_50t1),該明亮度映射函數定義該輸入高動態範圍影像的經正規化明亮度與對應的一低動態範圍影像(Im_LDR)的經正規化明亮度之間的一關係,該低動態範圍影像具有其較佳地具有等於100尼特的一值的一LDR最大像素輝度,該方法的特徵在於該編碼進一步包含接收一第二最大像素輝度(PB_CH),且該編碼包含: - 施加一經標準化演算法以將該主明亮度映射函數(FL_50t1)變換成一經調適明亮度映射函數(F_H2hCI),該經調適明亮度映射函數使該輸入高動態範圍影像的經正規化明亮度相關於一中間動態範圍影像(IDR)的經正規化輝度,該中間動態範圍影像的特徵在於具有等於該第二最大像素輝度(PB_CH)的一最大可能輝度; - 施加該經調適明亮度映射函數(F_H2hCI)至該輸入高動態範圍影像(MsterHDR)之像素的明亮度,以獲得該中間動態範圍影像(IDR)之像素的明亮度; - 在該主明亮度映射函數(FL_50t1)及該經調適明亮度映射函數(F_H2hCI)的基礎上推導一通道明亮度映射函數(F_I2sCI),該通道明亮度映射函數在該中間動態範圍影像(IDR)的該等各別經正規化明亮度係給定為輸入時將該低動態範圍影像(Im_LDR)的該等各別經正規化明亮度定義作為輸出,該等明亮度繼而對應於該輸入高動態範圍影像(MsterHDR)的各別明亮度; - 輸出該中間動態範圍影像(IDR);及 - 輸出該第二最大像素輝度(PB_CH)、該通道明亮度映射函數(F_I2sCI)、及該第一最大像素輝度(PB_C_H50)。A method for high dynamic range video encoding of a received input high dynamic range image (MsterHDR) having a first maximum pixel brightness (PB_C_H50), comprising receiving a master brightness mapping function (FL_50t1), the brightness mapping function defining a relationship between the normalized brightness of the input high dynamic range image and the normalized brightness of a corresponding low dynamic range image (Im_LDR), the low dynamic range image having an LDR maximum pixel brightness preferably having a value equal to 100 nits, the method being characterized in that the encoding further comprises receiving a second maximum pixel brightness (PB_CH), and the encoding comprises: - Applying a normalized algorithm to transform the master brightness mapping function (FL_50t1) into an adapted brightness mapping function (F_H2hCI), the adapted brightness mapping function causing the normalized brightness of the input high dynamic range image to be related to the normalized brightness of an intermediate dynamic range image (IDR), the intermediate dynamic range image being characterized by having a maximum possible brightness equal to the second maximum pixel brightness (PB_CH); - Applying the adapted brightness mapping function (F_H2hCI) to the brightness of the pixels of the input high dynamic range image (MsterHDR) to obtain the brightness of the pixels of the intermediate dynamic range image (IDR); - A channel brightness mapping function (F_I2sCI) is derived based on the master brightness mapping function (FL_50t1) and the adapted brightness mapping function (F_H2hCI), the channel brightness mapping function defining as output the respective normalized brightnesses of the low dynamic range image (Im_LDR) when the respective normalized brightnesses of the intermediate dynamic range image (IDR) are given as inputs, which brightnesses in turn correspond to the respective brightnesses of the input high dynamic range image (MsterHDR); - output the intermediate dynamic range image (IDR); and - output the second maximum pixel brightness (PB_CH), the channel brightness mapping function (F_I2sCI), and the first maximum pixel brightness (PB_C_H50).
或者,一種一經接收中間動態範圍影像(IDR)之高動態範圍視訊解碼的方法,該影像具有藉由較佳係0.8或更小的一乘法因子而低於一主高動態範圍影像(MsterHDR)的一第一最大像素輝度(PB_C_H50)的一第二最大像素輝度(PB_CH),該第二最大像素輝度(PB_CH)係接收為該中間動態範圍影像的後設資料,該解碼方法亦接收後設資料中的一明亮度映射函數(F_I2sCI),該明亮度映射函數定義該中間動態範圍影像(IDR)的所有可能的經正規化明亮度至一LDR最大像素輝度低動態範圍影像(Im_LDR)之對應經正規化明亮度的該變換,且該解碼方法的特徵在於其接收該第一最大像素輝度(PB_C_H50),且該解碼方法的特徵在於其包含: - 施加一經標準化演算法以將該明亮度映射函數(F_I2sCI)變換成一解碼明亮度映射函數(F_ENCINV_H2I),該解碼明亮度映射函數針對該中間動態範圍影像(IDR)的一像素之任何可能的輸入經正規化明亮度將該主高動態範圍影像(MsterHDR)的一對應經正規化HDR明亮度指定為輸出,該經標準化演算法使用該第一最大像素輝度(PB_C_H50)及該第二最大像素輝度(PB_CH)的該等值; - 將該解碼明亮度映射函數(F_ENCINV_H2I)施加至該中間動態範圍影像(IDR)之經正規化明亮度,以獲得一經重建主HDR影像(REC_M_HDR)之像素的經正規化經重建明亮度(L_RHDR);及 - 輸出該經重建主HDR影像(REC_M_HDR)。Alternatively, a method for decoding high dynamic range video upon receiving an intermediate dynamic range image (IDR), the image having a second maximum pixel brightness (PB_CH) lower than a first maximum pixel brightness (PB_C_H50) of a master high dynamic range image (MsterHDR) by a multiplication factor preferably 0.8 or less, the second maximum pixel brightness (PB_CH) being received as metadata of the intermediate dynamic range image, the decoding method also receiving the metadata a brightness mapping function (F_I2sCI) in which the brightness mapping function defines the transformation of all possible normalized brightnesses of the intermediate dynamic range image (IDR) to the corresponding normalized brightness of an LDR maximum pixel brightness low dynamic range image (Im_LDR), and the decoding method is characterized in that it receives the first maximum pixel brightness (PB_C_H50), and the decoding method is characterized in that it comprises: - Applying a normalized algorithm to transform the brightness mapping function (F_I2sCI) into a decoded brightness mapping function (F_ENCINV_H2I), the decoded brightness mapping function specifies as output a corresponding normalized HDR brightness of the master high dynamic range image (MsterHDR) for any possible input normalized brightness of a pixel of the intermediate dynamic range image (IDR), the normalized algorithm using the values of the first maximum pixel luminance (PB_C_H50) and the second maximum pixel luminance (PB_CH); - Applying the decoded brightness mapping function (F_ENCINV_H2I) to the normalized brightness of the intermediate dynamic range image (IDR) to obtain the normalized reconstructed brightness (L_RHDR) of the pixels of a reconstructed master HDR image (REC_M_HDR); and - outputting the reconstructed master HDR image (REC_M_HDR).
圖25不錯地繪示本案之經SLHDR2PLUS編碼HDR影像信號(2501),亦即,例如4kx2k的像素顏色矩陣2502(YCbCr,或可藉由已知色度方程式藉由預處理器[未圖示]計算成所需之YCbCr表示的任何顏色表示),及必要的後設資料:輝度映射函數F_I2sCI,及該二個PB_C值。假如將此HDR影像信號傳達至標準的SLHDR2解碼器2510並由該解碼器接收,因為F_I2sCI係正規函數,此解碼器將預期從其之 已接收影像峰亮度重新分級(PB_CH,在該實例中係800尼特至任何較低的峰亮度,其可藉由顯示最佳化計算Im350而針對例如經連接的350尼特中等動態範圍顯示器最佳化而顯示(其如所述,不係此申請案的關鍵態樣,其僅使作為來到新穎編解碼器框架的起始設計準則的一者變得可能,且可用於顯示器最佳化,例如揭示於US20160307602中的方法,或類似一者)。但現在變得可能的係具有SLHDR2PLUS解碼器2520(例如,由決定引入此服務的有線電視營運者新部署的,及類似者)的任何人可製造高於PB_CH值之PB_C的其他影像,例如,5000尼特的例示性主HDR影像的重建(亦即,輸出影像Im5000),或在PB_C_H50與PB_CH之間、或甚至可能高於PB_C_H50(像是Im1250經顯示調適影像)等的任何經顯示調適影像。FIG. 25 illustrates the SLHDR2PLUS encoded HDR image signal (2501) of the present case, i.e., for example, a 4kx2k pixel color matrix 2502 (YCbCr, or any color representation that can be calculated into the required YCbCr representation by a pre-processor [not shown] using known chromaticity equations), and the necessary metadata: the luminance mapping function F_I2sCI, and the two PB_C values. If this HDR image signal is communicated to and received by a standard SLHDR2 decoder 2510, since F_I2sCI is a normal function, the decoder will be expected to rescale from its received image peak brightness (PB_CH, in this example 800 nits to any lower peak brightness, which can be optimized for display for, e.g., a connected 350 nit medium dynamic range display by display optimization calculation Im350 (which, as mentioned, is not a critical aspect of this application, it only makes possible as one of the starting design criteria to the novel codec framework, and can be used for display optimization, e.g., to reveal 20160307602, or a similar one). But it now becomes possible that anyone with a SLHDR2PLUS decoder 2520 (e.g., newly deployed by a cable TV operator who decides to introduce this service, and the like) can produce other images above the PB_CH value of PB_C, such as a reconstruction of the exemplary master HDR image at 5000 nits (i.e., the output image Im5000), or any display-conditioned image between PB_C_H50 and PB_CH, or possibly even above PB_C_H50 (such as the Im1250 display-conditioned image), etc.
圖 9 大致顯示新的SLHDR2PLUS編碼器900。當輸入時,其得到主HDR影像(例如,5000尼特PB_C影像MsterHDR),其不想要失去讀者可假設已在由使用顏色分級軟體的人類顏色分級者編碼時或左右產生的通用性,例如藉由從未處理的攝影機擷取的HDR影像開始(MsterHDR影像針對,例如,一般夜色昏暗(亦即,其平均環繞輝度等)的電視觀看環境最佳化;本技術亦可與其他或可變環境運作,但其更確切地說係顯示調適而非建立或編碼HDR影像之新方法的問題)。該分級者亦已建立至少一個良好的輝度降級函數FL_50t1,以將5000尼特主HDR影像轉換成他已在他的SDR參考監視器上檢查之對應好看的SDR影像(亦即,通常100尼特PB_C的影像),且他已根據色度處理單元451藉由填充403、404、及405之部分重新分級態樣的一些者,及一些良好顏色調整F_C[Y]而完成此)(例如,在現實生活的事件廣播中,其他方法可在運作中計算可施加函數形狀,然後可能有某個導演粗略地觀看結果,或甚至沒有,但原理係出現良好函數FL_50t1,無論係來自部分單元的僅一者或所有單元一起的總函數等)。 Fig. 9 schematically shows the new SLHDR2PLUS encoder 900. When input, it gets a master HDR image (e.g., a 5000 nit PB_C image MsterHDR), which, without losing the commonality that the reader may assume has been generated at or around the time of encoding by a human color grader using color grading software, e.g. by starting with an HDR image captured from an unprocessed camera (the MsterHDR image is optimized for, e.g., a TV viewing environment that is typically dark at night (i.e., it has average ambient luminance, etc.); the present technique may also work with other or variable environments, but it is more a matter of display adaptation rather than a new method of creating or encoding HDR images). The grader has also established at least one good brightness degradation function FL_50t1 to convert the 5000 nit master HDR image into a corresponding good looking SDR image that he has checked on his SDR reference monitor (i.e., typically a 100 nit PB_C image), and he has done this by filling in some of the partial regraded states of 403, 404, and 405 according to the chroma processing unit 451, and some good color adjustments F_C[Y]) (for example, in real life event broadcasting, other methods may calculate the applicable function shape on the fly, and then there may be some director who roughly views the result, or even none, but the principle is that a good function FL_50t1 emerges, whether it is from just one of the partial units or the total function of all units together, etc.).
此函數FL_50t1亦必須輸入為新穎編碼器900的起始資訊。亦輸入峰亮度靜態(對於整體電影或廣播)後設資料PB_C_H50,因為其將會被使用,但亦由編碼器輸出為總IDR視訊編碼信號(IDR+F_I2sCI+PB_CH+PB_C_H50,其中該影像一般根據一些合適的視訊通訊標準(例如,HEVC)來壓縮或解壓縮,且其他後設資料可根據任何可用或可組態的後設資料通訊機制傳達,範圍從MPEG SEI訊息至專用網際網路封包等)。This function FL_50t1 must also be input as the start information of the novel encoder 900. The peak brightness static (for overall movie or broadcast) metadata PB_C_H50 is also input, as it will be used, but is also output by the encoder as the total IDR video coded signal (IDR+F_I2sCI+PB_CH+PB_C_H50, where the image is generally compressed or decompressed according to some suitable video communication standard (e.g., HEVC), and other metadata can be communicated according to any available or configurable metadata communication mechanism, ranging from MPEG SEI messages to dedicated Internet packets, etc.).
HDR函數產生單元901將計算HDR至IDR輝度映射函數F_H2hCI,其係從Mster HDR影像計算IDR影像之所需,且其將需要IDR的PB_CH之選擇,假設其得自某個其他輸入(例如,此可已由有線電視營運商選擇,並置於記憶體中的某處,待由組態軟體載入);將假設PB_CH等於1000尼特(僅用於說明目的;一般而言,此值將係SDR PB_C的數倍高(例如4x高),基於所選擇的值,技術態樣在實施例細節上有些不同)。The HDR function generation unit 901 will calculate the HDR to IDR brightness mapping function F_H2hCI, which is required to calculate the IDR image from the Mster HDR image, and it will require the selection of the PB_CH for the IDR, assuming it is obtained from some other input (for example, this may have been selected by the cable TV operator and placed somewhere in memory to be loaded by the configuration software); PB_CH will be assumed to be equal to 1000 nits (for illustrative purposes only; generally speaking, this value will be several times higher than the SDR PB_C (for example 4x higher), and the technical aspects are somewhat different in implementation details based on the selected value).
以圖 10 繪示此HDR函數產生單元901可如何作用。 FIG . 10 illustrates how the HDR function generating unit 901 may function.
假設分級者已定義某個函數(此處在說明實例中係線性-拋物線-線性函數(簡寫成para ),申請人根據ETSI標準化編解碼器原則使用其以進行主導影像區域之亮度的第一已大部分良好的重新平衡(亦即,其例如以最亮輝度區域之共同控制壓縮的成本在SDR影像中對深色提供足夠可見度)。Assuming that the grader has defined a certain function (here in the illustrated example a linear-parabolic-linear function (abbreviated to para ), the applicant uses it according to the ETSI standardized codec principles to perform a first and mostly good rebalancing of the brightness of the dominant image areas (i.e. it provides sufficient visibility for dark colors in SDR images, for example at the cost of co-controlled compression of the brightest brightness areas).
此一函數藉由具有由分級者針對此HDR影像最佳地選擇的受控斜率SG_gr的線性關係使明亮度之最暗子區域(L<Ld)的輸入明亮度(在藉由根據上述方程式1及方程式2變換像素輝度的視覺心理均等表示中)相關於所需輸出輝度: Ln_XDR= SG_gr *Ln_Mster_HDR if(Ln_Mster_HDR<Ld) [方程式4] (其中Ln_Mstr_HDR及Ln_XDR分別係當藉由分級者預分級為最佳開始影像時之輸入主HDR影像的明亮度(亦即,對應像素輝度在視覺心理上的均勻表示),且Ln_XDR係具有不同動態範圍且尤其係峰亮度PB_C之影像的數個輸出明亮度的總結,為解釋在本發明及其實施例後面的概念而全部顯示在相同的經正規化垂直軸上)。特別係,當分級者針對他的已最佳地分級的Mster_HDR影像而開始重新分級對應的最佳SDR影像時,XDR將係該種類SDR,並將對應的輝度映射函數形狀係顯示成F_Mt1[使用速記符號xty以指示函數將明亮度從該開始PB_C x映射至該結束PB_C y,且x及y任一者大致可指示影像的PB_C,像是M指示主(Master),或數值地指示實例值,其中將二個零丟掉,例如,50意指5000且1意指100尼特]。This function relates the input luminance of the darkest sub-region of luminance (L<Ld) (in a psycho-visually equivalent representation by transforming pixel luminance according to equations 1 and 2 above) to the desired output luminance by a linear relationship with a controlled slope SG_gr optimally chosen by the grader for this HDR image: Ln_XDR= SG_gr *Ln_Mster_HDR if(Ln_Mster_HDR<Ld) [Equation 4] (where Ln_Mstr_HDR and Ln_XDR are the brightness of the input master HDR image when pre-graded by the grader as the best starting image (i.e., the visually uniform representation of the corresponding pixel brightness), and Ln_XDR is the summary of several output brightnesses of images with different dynamic ranges and especially peak brightness PB_C, all displayed on the same normalized vertical axis for explaining the concepts behind the present invention and its embodiments). In particular, when the grader begins to regrade the corresponding best SDR image for his best graded Master_HDR image, the XDR will be that kind of SDR, and the corresponding brightness mapping function shape will be shown as F_Mt1 [the shorthand symbol xty is used to indicate that the function maps brightness from the starting PB_C x to the ending PB_C y, and either x and y may roughly indicate the PB_C of the image, such as M indicating Master, or numerically indicating an instance value with two zeros dropped, for example, 50 means 5000 and 1 means 100 nits].
類似地,對於高於Lb的輸入明亮度Ln_Mster_HDR,再次有可控制的線性關係: Ln_SDR=HG_gr* Ln_Mster_HDR +(1-HG_gr) if(Ln_Mster_HDR>Lb) [方程式5]Similarly, for input brightness Ln_Mster_HDR higher than Lb, there is again a controllable linear relationship: Ln_SDR=HG_gr* Ln_Mster_HDR +(1-HG_gr) if(Ln_Mster_HDR>Lb) [Equation 5]
para延伸在Ld=mx-WP與Lb=mx+WP之間的拋物線部分具有L_XDR=a*x^2+b*x+c的函數定義,其之係數a、b、及c可藉由計算該線正切於來自其之極點相交的該曲線之點及其之橫座標mx而計算(如ETSI1標準中所定義的;mx=(1-HG)/(SG-HG))。The parabolic portion of para extending between Ld=mx-WP and Lb=mx+WP has the functional definition of L_XDR=a*x^2+b*x+c, whose coefficients a, b, and c can be calculated by calculating the point where the line intersects the curve from its pole and its abscissa mx (as defined in the ETSI1 standard; mx=(1-HG)/(SG-HG)).
作為本發明之基礎的大致想法如下(且其可在乘法圖(multiplicative view)中解釋)。任何主HDR輝度可藉由施加恆等變換(對角線)而變換成本身。若在重新分級影像之光譜的終端(亦即,為建立對應的SDR輝度(XDR=SDR)),必須獲得輸出輝度L_SDR=F_Mt1(Ln_M),其中Ln_M係Ln_Mstr_HDR輝度的某特定值,則亦可將此視為輸入輝度L_SDR=b_SDR(Ln_M)*Ln_M的乘法提升。若現可定義某個中間函數F_Mt1_ca,則最終處理係二個函數的連續施加F_IDRt1(F_Mt1_ca(Ln_Mster_HDR)),其中F_IDRt1進行從已計算之IDR像素輝度(從主HDR輝度推導)開始朝向任何像素(或物體)之SDR輝度的最終輝度映射。採用乘法項形式,可說是L_SDR=b_IDR*b_ca*Ln_M,其中二個提升對應於中間函數(或通道調適函數)及其餘的相關功能(其恰巧係與IDR影像一起傳達以建立ETSI2順應HDR視訊編碼的該函數)。注意到此等提升因子本身係Ln_Mster_HDR(或事實上任何與之關聯的中間明亮度)的函數。The general idea underlying the invention is as follows (and it can be explained in a multiplicative view). Any master HDR luminance can be transformed into itself by applying a constant transformation (diagonal). If at the end of the regraded image's spectrum (i.e. to create the corresponding SDR luminance (XDR=SDR)) an output luminance L_SDR=F_Mt1(Ln_M) has to be obtained, where Ln_M is some specific value of the Ln_Mstr_HDR luminance, this can also be viewed as a multiplicative boost of the input luminance L_SDR=b_SDR(Ln_M)*Ln_M. If now some intermediate function F_Mt1_ca can be defined, the final processing is the successive application of two functions F_IDRt1(F_Mt1_ca(Ln_Mster_HDR)), where F_IDRt1 performs the final brightness mapping starting from the calculated IDR pixel brightness (derived from the main HDR brightness) towards the SDR brightness of any pixel (or object). In the form of multiplication terms, it can be said that L_SDR=b_IDR*b_ca*Ln_M, where the two boosts correspond to the intermediate function (or channel adaptation function) and the rest of the related function (which happens to be the function that is passed along with the IDR image to create ETSI2 compliant HDR video coding). Note that these boost factors are themselves functions of Ln_Mster_HDR (or in fact any intermediate brightness associated with it).
現在若不需要傳達任何額外函數則係便利的(若後設資料管理不完美等,其可能例如丟失)。Now this is convenient if there is no need to pass any additional functions (which might for example be lost if metadata management is not perfect etc).
所以,若SLHDR2PLUS原則使用預約定固定方式以將分級者的F_Mt1函數(亦即,用於他期望使用之任何函數形狀的機制)變換成與PB_IDR(根據ETSI2編碼方法,一般亦將該值作為PB_CH傳達至接收器)對應的通道調適函數可係有用的。可顯示向上分級函數F_H2h不需要在與IDR影像關聯的後設資料中共同傳達,因為其對解碼器係固定且已知的,所以反函數F_??或許可從已接收F_I2s函數計算,如所將確實顯示的(若PB_C_H50亦傳達至接收器)。解碼器的新穎性係推導PB_C > PB_IDR之影像的此新方法。理論上,可進行從主F_Mt1推導F_Mt1_ca函數的任何固定方法,條件係其在數學上可逆或至少根據需求係可解碼的,但期望選擇執行HDR至IDR重分級(亦即,推導F_Mt1_ca形狀)使得其用於推導MDR影像的進一步變形與ETSI2所會產生者相容的一種方法(理論上,僅將ETSI2影像標準化在PB_C與100尼特之間,所以可以對PB_IDR與100尼特之間的動態範圍的所有影像要求影像模樣(亦即,所有像素輝度及顏色)的接近相等性開始,但亦可嘗試將技術限制加諸在將獲得的解決方案上,該技術限制係從已接收IDR朝向主HDR影像升級的影像(亦即,使用待由SLHDR2PLUS解碼器計算的F_??)具有與由ETSI2的顯示調適所將獲得的相同的模樣,該ETSI2用以接收,例如5000尼特PB_C Mster_HDR影像,及總輝度重新映射函數F_Mt1。It may therefore be useful if the SLHDR2PLUS principle uses a predetermined fixed way to transform the grader's F_Mt1 function (i.e. a mechanism for whatever function shape he wishes to use) into a channel adaptation function corresponding to PB_IDR (which value is generally also communicated to the receiver as PB_CH according to the ETSI2 coding method). It can be shown that the up-grading function F_H2h does not need to be communicated together in the metadata associated with the IDR picture, since it is fixed and known to the decoder, so the inverse function F_?? may be calculated from the received F_I2s function, as will indeed be shown (if PB_C_H50 is also communicated to the receiver). The novelty of the decoder is this new way of deriving pictures for which PB_C > PB_IDR. In theory, any fixed method of deriving the F_Mt1_ca function from the main F_Mt1 could be done, provided that it is mathematically reversible or at least decodable on demand, but it is desirable to choose a method for performing HDR to IDR regrading (i.e. deriving the F_Mt1_ca shape) such that its use in deriving further deformations of the MDR image is compatible with what would result from ETSI2 (in theory, just normalize ETSI2 images between PB_C and 100 nits, so one can do the same for PB_IDR and 1 00 nits, but one may also try to impose technical constraints on the solution to be obtained, that is, the image upscaled from the received IDR towards the master HDR image (i.e. using F_?? to be calculated by the SLHDR2PLUS decoder) has the same look as would be obtained by display adaptation of ETSI2, which is used to receive, for example, 5000 nit PB_C Master_HDR images, and the overall brightness remapping function F_Mt1.
首先解釋此一較佳的通道調適 (亦即,F_Mt1_ca的計算、或於圖9中計算的F_H2hCI;及對應的(多個)IDR影像)可如何設計成其對SLHDR2PLUS的數個方法/實施例有用。It is first explained how this better channel adaptation (i.e., the calculation of F_Mt1_ca, or F_H2hCI calculated in Figure 9; and the corresponding (multiple) IDR image) can be designed to be useful for several methods/embodiments of SLHDR2PLUS.
圖12a顯示由分級者(或自動)最佳地選擇的白色位準偏移WLO_gr,且若可用,黑色位準偏移(BLO_gr)亦如此;對應於圖4中的單元403。FIG. 12 a shows the white level offset WLO_gr, and if available, the black level offset (BLO_gr), optimally chosen by the grader (or automatically); corresponding to cell 403 in FIG. 4 .
此時可假設此係唯一的動態範圍調整,亦即,輝度映射操作以從Mster_HDR起始影像獲得SDR影像(此白色在白色上及黑色在黑色上(black-on-black)更確切地說係給予不良品質LDR影像的基本型動態範圍轉換,該等不良品質LDR影像既不具有正確的平均亮度亦不具有平均視覺對比,更別說所期望之所得影像的更高影像品質描述符,但作為根據申請人之方法的重新分級鏈的第一步驟,其係不錯的步驟,且吾等首先必須解釋此步驟及其通道調適)。想法在於若(儘管至多PB_HDR=5000尼特之碼明亮度的可能性)實際上在目前待映射影像(或假如決定對所有的該等時間連續影像使用相同函數,在相同場景之視訊中的影像鏡頭)中沒有高於值MXH的像素輝度,則將最高MXH映射至SDR中的最大明亮度碼係合理的(亦即,例如1024,對應於100尼特的輝度)。任何其他映射方法(例如,HDR白色在SDR白色上映射(HDR-white-on-SDR-white mapping))將使所有的實際目前輝度變得甚至更暗,且其並非最佳的,鑑於SDR輝度範圍已經足夠小,仍需要最佳地含有HDR輝度的大範圍對應模擬。It can be assumed at this point that this is the only dynamic range adjustment, i.e. a luminance mapping operation to obtain an SDR image from the Mster_HDR starting image (this white-on-white and black-on-black is more precisely a basic type of dynamic range conversion giving poor quality LDR images which have neither the correct average brightness nor average visual contrast, let alone the desired higher image quality descriptors of the resulting image, but it is a good step as a first step in the re-grading chain according to the applicant's method and we must first explain this step and its channel adaptation). The idea is that if (despite the possibility of code brightnesses up to PB_HDR = 5000 nits) there are actually no pixel luminances higher than the value MXH in the current image to be mapped (or in the image shots in the video of the same scene if one decides to use the same function for all such time-series images), then it is reasonable to map the highest MXH to the maximum luminance code in SDR (i.e., e.g. 1024, corresponding to a brightness of 100 nits). Any other mapping method (e.g. HDR-white-on-SDR-white mapping) would make all actually current luminances even darker, and is not optimal, given that the SDR luminance range is already small enough, and a large range of correspondence simulations is still needed to optimally contain HDR luminances.
然後,問題係此WLO值是否應針對IDR影像調整(如可於圖12b中所見的,中間影像中的最亮明亮度可能已下降成更接近PB_IDR,仍將有用於SDR重新分級影像的最終偏移以映射在1.0上;該映射亦可等效地顯示在HDR 5000尼特影像的經正規化輝度範圍上,如藉由ON所指示的)。在第一種方法中,不需要其(因為在如何設計用於計算F_Mt1_ca函數的演算法上有一些自由),但假如按比例調整其,則可使用下列方法。The question then is whether this WLO value should be adjusted for the IDR image (as can be seen in Figure 12b, the brightest luminance in the intermediate image may have dropped to be closer to PB_IDR, which will still be useful for the final offset of the SDR regraded image to map on 1.0; the mapping can also be equivalently shown on the normalized luminance range of the HDR 5000 nit image, as indicated by ON). In the first approach, it is not needed (because there is some freedom in how to design the algorithm for calculating the F_Mt1_ca function), but if it is adjusted proportionally, the following method can be used.
用於此類水平按比例調整的比例因子需要判定,以能夠按比例調整輝度映射函數,其在此情形中係其之參數WLO_ca,及類似地經按比例調整BLO_gr(符號BLO_ca)。若期望此參數隨PB_IDR線性按比例調整,則限制係該動作完全進行,亦即,當PB_IDR=PB_SDR時,偏移具有其之最大程度BLO_gr。另一方面,對於HDR影像,BLO或WLO應係零,因為具有用於將5000尼特Mster_HDR映射至Mster_HDR的恆等變換,所以沒有需要校正的事物。The scaling factor used for such horizontal scaling needs to be determined in order to be able to scale the brightness mapping function, which in this case is its parameter WLO_ca, and similarly scale BLO_gr (symbol BLO_ca). If it is desired that this parameter scales linearly with PB_IDR, then the limitation is that this action is done exactly, i.e. when PB_IDR=PB_SDR, the offset has its maximum extent BLO_gr. On the other hand, for HDR images, BLO or WLO should be zero, since there is a constant transformation for mapping 5000 nits Mster_HDR to Mster_HDR, so there is nothing to correct for.
因此,可制訂參數的此一定義 WLO_ca=scaleHor*WLO_gr (0<=ScaleHor<=1) BLO_ca=scaleHor*BLO_gr [方程式6]Therefore, the following definition of parameters can be made: WLO_ca=scaleHor*WLO_gr (0<=ScaleHor<=1) BLO_ca=scaleHor*BLO_gr [Equation 6]
然後問題係如何定義ScaleHor。The question then is how to define ScaleHor.
圖 12b 顯示不同動態範圍的頻譜,更具體的,沿著水平軸而組織的不同PB_C影像。其等沿著各影像之峰亮度PB_C之感知位置而定位。因此,將其等置於係v(PB_C)的橫座標位置上,藉此v係函數方程式1,其中值PB_C用於參數L_in,並具有針對已分級之Mster_HDR影像的峰亮度計算之方程式2的值RHO(亦即,例如對於5000尼特PB_C Mster_HDR影像,RHO係25)。若縱座標軸亦使其明亮度L根據v函數(在垂直軸上)參數化,具有相同的RHO=25,則PB_C很好地遵循直線,且可在此框架中完成定義及計算。例如,可將任何中間影像的峰亮度PB_C的明亮度投射至主(5000尼特)明亮度軸上。所使用的符號係「P_I1oI2」,意謂著當表示在影像I2的明亮度範圍上時,經由影像I1的峰亮度(其係正常輝度)之v函數的施加所對應之明亮度的值。所以,例如,P_IoH係在Mster_HDR明亮度範圍上的經選擇IDR影像之峰亮度的明亮度,且P_SoH係100尼特的明亮度(注意到此範圍上的1.0對應於Mster_HDR影像的PB_C,使得,例如,100尼特的位置(例如,0.5)將取決於所選擇之Mster-HDR影像表示而改變,其係為何方程式1及方程式2係RHO參數化的曲線家族)。 Fig . 12b shows the spectrum of different dynamic ranges, more specifically, different PB_C images organized along the horizontal axis. They are located along the perceived position of the peak brightness PB_C of each image. Therefore, they are placed on the horizontal axis position v(PB_C), whereby v is the function of equation 1, with the value PB_C for the parameter L_in, and with the value RHO of equation 2 calculated for the peak brightness of the graded Master_HDR image (i.e., for example, RHO is 25 for a 5000 nit PB_C Master_HDR image). If the vertical axis also has its brightness L parameterized according to the v function (on the vertical axis), with the same RHO=25, then PB_C follows a straight line nicely and can be defined and calculated in this framework. For example, the brightness of the peak brightness PB_C of any intermediate image can be projected onto the primary (5000 nits) brightness axis. The notation used is "P_I1oI2", meaning the value of brightness corresponding to the application of the v-function of the peak brightness of image I1 (which is normal brightness) when represented over the brightness range of image I2. So, for example, P_IoH is the brightness of the peak brightness of the selected IDR image over the Master_HDR brightness range, and P_SoH is the brightness of 100 nits (note that 1.0 on this range corresponds to PB_C of the Master_HDR image, so that, for example, the position of 100 nits (e.g., 0.5) will change depending on the Master-HDR image representation chosen, which is why Equations 1 and 2 are a family of curves parameterized by RHO).
然後用於ScaleHor的合適函數將係從1-P_IoH開始。PB_IDR減少得越多,亦即,選擇更朝向右方,此函數將確實增加MsterHDR影像的吾等IDR影像表示。且假如P_IoH=1,其將產生0,其在選擇5000尼特的IDR影像時發生(純粹用於scaleHor方程式的理論解釋,因為技術上不合理)。然而,當IDR=SDR時,此方程式不等於1.0,因此需要使用因子k將其按比例調整。Then the appropriate function for ScaleHor would be to start with 1-P_IoH. The more PB_IDR decreases, i.e. the further to the right one is chosen, the more this function will actually increase our IDR image representation of the MasterHDR image. And if P_IoH=1, it will produce 0, which happens when a 5000 nits IDR image is chosen (purely for theoretical explanation of the scaleHor equation, as it is not technically sound). However, when IDR=SDR, this equation is not equal to 1.0, so it needs to be scaled using a factor k.
若k=1-P_SoH(其係與對應於各種IDR位置的變數P_IoH成對比的固定值),可證實正規化係正確的,因此: ScaleHor=(1-P_IoH)/(1-P_SoH) [方程式7]If k = 1-P_SoH (which is a fixed value that is compared to the variable P_IoH corresponding to various IDR positions), it can be proved that the regularization is correct, so: ScaleHor = (1-P_IoH)/(1-P_SoH) [Equation 7]
用於該通道轉換的正確para的判定(圖4,單元404)係更複雜的,並以圖 13 說明。The determination of the correct para for the channel conversion (FIG. 4, unit 404) is more complex and is illustrated in FIG . 13 .
在此情形中,發明人決定在正交於恆等對角線([0,0]-[1,1])的對角線方向上進行函數變換。此必須在所有函數重新分級的正常Mster_HDR/XDR座標系統表示中以等效參數化轉換。In this case, the inventors decided to perform the function transformation in a diagonal direction orthogonal to the constant diagonal ([0,0]-[1,1]). This must be transformed with equivalent parameterization in the normal Master_HDR/XDR coordinate system representation of all functions rescaled.
將基本按比例調整定義在將對角線改變成水平軸的45度旋轉的軸系統中(圖13a)。看到其係例如經旋轉para的函數Fx。以因子La/K對經旋轉對角線(亦即,新的x軸)上的點按比例調整任何值dY係合理的(該dX對應於某個橫座標,亦即原始軸系統中的L_Mster_HDR明亮度),藉此K係函數的完整動作,亦即完整dY值,且經按比例調整dY_ca值在此經旋轉系統中將係(La/K)*dY。The basic scaling is defined in an axis system that is a 45 degree rotation that changes the diagonal to the horizontal axis (Fig. 13a). This is seen as a function Fx of the rotated para, for example. It is reasonable to scale any value dY at a point on the rotated diagonal (i.e., the new x-axis) by a factor La/K (this dX corresponds to some horizontal coordinate, i.e., L_Mster_HDR brightness in the original axis system), whereby K is the full motion of the function, i.e., the full dY value, and the scaled dY_ca value in this rotated system will be (La/K)*dY.
定義sc_r=La/K,其中La= 1/P_IoH且K=1/P_SoH(應注意I2明亮度在I1軸上的值可係重新制訂成I1明亮度在I2軸上的值,特別係例如1/P_IoH=P_HoI;例如若P_IoH=0.7,此意謂著PB_Mstr_HDR將釘在PB_IDR上方的1/0.7)。Define sc_r=La/K, where La= 1/P_IoH and K=1/P_SoH (note that the values of I2 brightness on the I1 axis can be re-scaled to the values of I1 brightness on the I2 axis, in particular, for example 1/P_IoH=P_HoI; for example, if P_IoH=0.7, this means that PB_Mstr_HDR will be pegged 1/0.7 above PB_IDR).
現在需要計算對角線sc_r的等效垂直按比例調整sc*。Now we need to calculate the equivalent vertical scaling sc* of the diagonal sc_r.
此可藉由施加反向旋轉數學(實際上藉由首先將K及La界定為1.0而非1.4),將圖13a表示帶到圖13b的對角線上而完成。此藉由矩陣旋轉(旋轉至主表示之對角線系統中的任何x_r、y_r,例如1、dY)產生: [x1,y1]=[cos(pi/4) -sin(pi/4) ; sin(pi/4) cos(pi/4)]*[1, P_HoI= 1/La] [x2,y2]=[cos(pi/4) -sin(pi/4) ; sin(pi/4) cos(pi/4)]*[1, P_HoS= 1/K] [方程式8]This can be done by applying the reverse rotation math (actually by first defining K and La to be 1.0 instead of 1.4) to bring the representation of Figure 13a to the diagonal of Figure 13b. This is produced by a matrix rotation (rotating to any x_r, y_r in the diagonal system of the main representation, such as 1, dY): [x1,y1]=[cos(pi/4) -sin(pi/4); sin(pi/4) cos(pi/4)]*[1, P_HoI= 1/La] [x2,y2]=[cos(pi/4) -sin(pi/4); sin(pi/4) cos(pi/4)]*[1, P_HoS= 1/K] [Equation 8]
應注意因為對角按比例調整,x座標及y座標二者將改變,但無論如何將SG及HG(以及任何其他經按比例調整點改變)係定義成斜率而非角度。Note that because the diagonals are scaled, both the x- and y-coordinates will change, but in any case SG and HG (and any other scaled point changes) are defined as slopes, not angles.
從圖13b中之從(0,0)至表示明亮度映射函數之對角按比例調整點的正方形之線對從(0,0)至係原始輝度映射函數點的圓之線的旋轉(或反之亦然)可藉由任何固定橫座標值(例如,a)除斜率而找出(具有對應於經標準化比例因子sc*之垂直變化的角度變化): sc* = (y2/x2)/(y1/x1)=[(1+1/K)/(1-1/K)]/[(1+1/La)/(1-1/La)]=[(K+1)/(K-1)]/[(La+1)/(La-1)] =[(La-1)*(K+1)]/[(La+1)*(K-1)] [方程式8]The rotation of the line from (0,0) to the square representing the diagonally scaled point of the brightness mapping function in Figure 13b to the line from (0,0) to the circle representing the original brightness mapping function point (or vice versa) can be found by dividing the slope by any fixed abscissa value (e.g., a) (with an angular change corresponding to a vertical change in the normalized scale factor sc*): sc* = (y2/x2)/(y1/x1)=[(1+1/K)/(1-1/K)]/[(1+1/La)/(1-1/La)]=[(K+1)/(K-1)]/[(La+1)/(La-1)] =[(La-1)*(K+1)]/[(La+1)*(K-1)] [Equation 8]
隨後,必須計算對應於全部垂直按比例調整(sc*=1)的實際縱座標距離n,且此可藉由由於涉及在對角線中的45度角,使按比例調整mip係中點(具有在其下方至對角線及在其上方至para的二個線性區段之交點(mx, my)的距離Fd)而完成。因此,n=Fd等於在mx之差動斜率SG-1的一半,亦即,mx*(SG-1)/2。Then, the actual vertical coordinate distance n corresponding to the full vertical scaling (sc*=1) must be calculated, and this can be done by taking the distance Fd from the midpoint of the scaled mip (the intersection of the two linear segments (mx, my) below it to the diagonal and above it to para) due to the 45 degree angle involved in the diagonal. Therefore, n=Fd is equal to half the differential slope SG-1 at mx, that is, mx*(SG-1)/2.
隨後,必須將經偏移交點(mxca, myca)計算如下: mxca=mx+d= mx+ [mx*(SG-1)/2]*(1-sc*) myca=my-d=SG*mx-(mxca-mx)= -mxca+ mx*(SG+1) [方程式9]Then, the offset intersection point (mxca, myca) must be calculated as follows: mxca=mx+d= mx+ [mx*(SG-1)/2]*(1-sc*) myca=my-d=SG*mx-(mxca-mx)= -mxca+ mx*(SG+1) [Equation 9]
使用新點的位置,最終可計算經通道調適陰影增益(SG_ca,見圖10)及經通道調適高亮度增益HG_ca: SG_ca = myca/mxca HG_ca=(myca-1)/(mxca-1) [方程式10]Using the position of the new point, the channel-adjusted shadow gain (SG_ca, see Figure 10) and channel-adjusted highlight gain HG_ca can finally be calculated: SG_ca = myca/mxca HG_ca=(myca-1)/(mxca-1) [Equation 10]
最後,有數個用於拋物線中間區段的方法/實施例。Finally, there are several methods/implementations for the middle section of the parabola.
在產生實務上相當良好之視覺結果的一種方法中,取WP_ca=WP_gr,其中WP_gr係當由有關主HDR及主SDR影像之內容創作者的分級者或自動化而最佳化時拋物線區段的原始寬度,且WP_ca係經通道調適para函數的寬度。另一方法係定義WP_ca=v(abs(sc*), 100)*WP_gr,其中v函數再度藉由上述方程式1及方程式2定義。In one approach that produces quite good visual results in practice, WP_ca=WP_gr is taken, where WP_gr is the original width of the parabola segment when optimized by the content creator's grader or automatically for the master HDR and master SDR images, and WP_ca is the width of the channel-adapted para function. Another approach is to define WP_ca=v(abs(sc*), 100)*WP_gr, where the v function is again defined by Equations 1 and 2 above.
使其作為可用技術,其可用以針對SLHDR2PLUS定義合適的IDR定義。Making it an available technology, it can be used to define appropriate IDR definitions for SLHDR2PLUS.
回到圖 10,上述方程式定義可如何獨特地定義函數F_Mt1_ca,以用於從例如5000尼特主HDR影像開始之例如所選擇的1000尼特PB_IDR。若此函數係由HDR函數產生單元901判定,可將其輸出為F_H2hCI並發送為IDR影像計算單元902的輸入。此單元將會將此函數施加至接收為影像輸入[L_IDR=F_H2hCI(L_MsterHDR)= F_Mt1_ca(L_MsterHDR)]之MsterHDR影像的所有像素輝度,以獲得對應的IDR影像像素輝度,且將輸出IDR影像。Returning to FIG. 10 , the above equations define how a function F_Mt1_ca may be uniquely defined for a selected 1000 nit PB_IDR starting from a 5000 nit master HDR image, for example. If this function is determined by the HDR function generation unit 901, it may be output as F_H2hCI and sent as input to the IDR image calculation unit 902. This unit will apply this function to all pixel luminances of the MasterHDR image received as image input [L_IDR=F_H2hCI(L_MsterHDR)= F_Mt1_ca(L_MsterHDR)] to obtain the corresponding IDR image pixel luminances, and will output the IDR image.
現在問題仍係將哪個輝度映射函數加至到IDR的後設資料中,以使其彷彿正常的ETSI2影像一樣顯現(亦即,使得任何傳統的ETSI2解碼器可正常地將其解碼,產生如其應有之模樣的SDR影像或任何MDR影像)。Now the question remains which brightness mapping function to add to the IDR metadata so that it appears as a normal ETSI2 image (i.e., so that any conventional ETSI2 decoder can decode it normally, producing an SDR image or any MDR image that looks like it should).
可將此次級IDR輝度映射函數F_I2sCI(其亦將係para)定義如下(且將藉由IDR映射函數產生器903計算)。可將用於IDR影像SG_IDR的陰影增益視為在已從Mster_HDR去到IDR影像之後的剩餘乘法(或斜率)(亦即,從IDR影像開始的剩餘相對明亮化以獲得SDR影像): Y_out (x_in)=SG_gr*x_in; = F_I2sCI(L_IDR=SG_ca*x_in)This secondary IDR brightness mapping function F_I2sCI (which will also be para) can be defined as follows (and will be calculated by the IDR mapping function generator 903). The shadow gain for the IDR image SG_IDR can be considered as the remaining multiplication (or slope) after going from Master_HDR to the IDR image (i.e., the remaining relative brightening starting from the IDR image to obtain the SDR image): Y_out (x_in)=SG_gr*x_in; = F_I2sCI(L_IDR=SG_ca*x_in)
亦已知將用於最暗像素的相同para線性區段映射施加至新的IDR明亮度輸入: Y_out=SG_IDR*L_IDR 因此: SG_gr= SG_IDR*SG_ca [方程式11] (例如,取輸入x_in=L_Mster_HDR=0.2,其從對角線映射至L_IDR=0.3=(0.3/0.2)*x_in,其最後映射至Y_out=0.4=k*0.3,其中k=0.4/0.3;Y_out=SG_gr*0.2=(0.4)*0.2=(0.4/0.3)*(0.3/0.2)*0.2)。It is also known that the same para linear segment mapping used for the darkest pixel is applied to the new IDR brightness input: Y_out=SG_IDR*L_IDR Thus: SG_gr= SG_IDR*SG_ca [Equation 11] (For example, take the input x_in=L_Mster_HDR=0.2, which is mapped diagonally to L_IDR=0.3=(0.3/0.2)*x_in, which is finally mapped to Y_out=0.4=k*0.3, where k=0.4/0.3; Y_out=SG_gr*0.2=(0.4)*0.2=(0.4/0.3)*(0.3/0.2)*0.2).
因此,從方程式11,遵循該方式以計算所需的SG_IDR(鑑於如上文所述地使用固定方法以判定SG_ca): SG_IDR=SG_gr/SG_ca [方程式12] 類似地: HG_IDR=HG_gr/HG_ca [方程式13]Therefore, from Equation 11, the required SG_IDR is calculated in this way (given that the fixed method is used to determine SG_ca as described above): SG_IDR=SG_gr/SG_ca [Equation 12] Similarly: HG_IDR=HG_gr/HG_ca [Equation 13]
其中,HG_gr再次係由將主SDR影像模樣相關於主HDR影像模樣(亦即,其明亮度分布)的內容創作者判定的最佳高亮度增益,且HG_ca係對應於原始高亮度增益HG_gr的經通道調適高亮度增益。Wherein, HG_gr is again the optimal high brightness gain determined by the content creator relating the master SDR image sample to the master HDR image sample (i.e., its brightness distribution), and HG_ca is the channel-adapted high brightness gain corresponding to the original high brightness gain HG_gr.
須注意基本陰影增益調整可相關於來自SDR與IDR影像之間的峰亮度的差異之所預期的簡單陰影增益而判定為:ShadBst=SG_IDR/P_IoS。如所述,當表示在SDR影像的經正規化明亮度軸上時,P_IoS係IDR影像的最大可編碼輝度,亦即,例如7.0。Note that the basic shadow gain adjustment can be determined relative to the expected simple shadow gain from the difference in peak brightness between the SDR and IDR images as: ShadBst = SG_IDR / P_IoS. As described, when expressed on the normalized brightness axis of the SDR image, P_IoS is the maximum encodable brightness of the IDR image, i.e., 7.0, for example.
注意到,有高亮度增益無法大於預定義數目(以ETSI標準編成高亮度增益)的一些實際實施例,在此情形中,需要高亮度增益的更進一步重新計算,見下文,但此對所有實施例而言非必要的。例如,可將此實現為:Note that there are some practical embodiments where the high brightness gain cannot be larger than a predefined number (codified in the ETSI standard as high brightness gain), in which case a further recalculation of the high brightness gain is required, see below, but this is not necessary for all embodiments. For example, this can be implemented as:
If HG_IDR>KLIM then HG_IDR_adj=KLIM [方程式14],其中KLIM較佳地等於0.5。If HG_IDR>KLIM then HG_IDR_adj=KLIM [Equation 14], where KLIM is preferably equal to 0.5.
確實地,假設分級者已使HG_gr接近0.5之最大值,且對應的HG_ca(其作為更柔軟映射應具有更接近對角線的HG_ca,亦即,大於HG_gr)係例如0.75,則發現該除法係0.67,其高於當標準化時可根據純粹的ETSI2 HDR視訊信號傳達的最大值。一種解決方案係例如重新定義較小的HG_gr,使得HG_IDR將不高於0.5(經標準化最大值)。此再度需要將所有重新分級態樣列入考量的可觀計算,如將於下文所示者。另一選項係例如藉由將HG_IDR限制至0.5,以使IDR +後設資料信號順應,同時傳達為額外後設資料(完全不受限制的HG_IDR)。HG_gr一般將取決於Mster_HDR影像的PB_C,但亦取決於何種影像物體在影像中(例如,明亮多彩的物體,其足夠重要到不能將其等之輝度壓縮太多,一極端實例係接近強大太陽之明亮行星的影像,其使用許多非常高的L_Mster_HDR明亮度值及很少的暗者分級)。HG_ca一般除了其他事物之外將取決於所選擇的PB_IDR有多接近PB_Mster_HDR。Indeed, assuming that the grader has brought HG_gr close to a maximum value of 0.5, and the corresponding HG_ca (which as a softer mapping should have a HG_ca closer to the diagonal, i.e. larger than HG_gr) is, for example, 0.75, the division is found to be 0.67, which is above the maximum value that can be conveyed from a pure ETSI2 HDR video signal when normalized. One solution is, for example, to redefine a smaller HG_gr so that HG_IDR will not be higher than 0.5 (the normalized maximum value). This again requires considerable calculations taking all re-grading aspects into account, as will be shown below. Another option is, for example, to make the IDR+metadata signal conform by limiting HG_IDR to 0.5, while being conveyed as additional metadata (completely unrestricted HG_IDR). HG_gr will generally depend on the PB_C of the Master_HDR image, but will also depend on what kind of image objects are in the image (e.g. bright, colorful objects that are important enough not to have their brightness compressed too much, an extreme example being images of bright planets close to a powerful Sun, which use many very high L_Master_HDR brightness values and very few dim ones). HG_ca will generally depend on, among other things, how close the chosen PB_IDR is to the PB_Master_HDR.
此外,假設WP_IDR=WP_gr [方程式15]In addition, assume WP_IDR = WP_gr [Equation 15]
如所述,其他實施例係可能的,但以較容易方式說明該等原理,現在產生該假設。As stated, other embodiments are possible, but to illustrate the principles in an easier way, this assumption is made now.
使用方程式6計算黑色位準偏移與白色位準偏移的適當經通道調適值(若有任何此類偏移由內容創作者定義)。現在剩下的係如何計算(藉由IDR視訊編碼器)BLO_IDR及WLO_IDR的對應值。The appropriate channel-adjusted values for the black level offset and white level offset (if any such offsets are defined by the content creator) are calculated using Equation 6. Now all that remains is how to calculate (by the IDR video encoder) the corresponding values for BLO_IDR and WLO_IDR.
首先,以編碼值glim的較佳方式計算: glim= {log[1+(rhoSDR-1)*power((0.1/100);1/2.4)]/log(rhoSDR)}/{log[1+(rhoHDR-1)*power(1/PB_Mster_HDR;1/2.4)]/log(rhoHDR)} [方程式16] 其中rhoSDR=1+32*power(100/10000;1/2.4),且 rhoHDR=1+32*power(PB_Mster_HDR/10000; 1/ 2.4)First, the coding value glim is calculated in a better way: glim= {log[1+(rhoSDR-1)*power((0.1/100);1/2.4)]/log(rhoSDR)}/{log[1+(rhoHDR-1)*power(1/PB_Mster_HDR;1/2.4)]/log(rhoHDR)} [Equation 16] where rhoSDR=1+32*power(100/10000;1/2.4), and rhoHDR=1+32*power(PB_Mster_HDR/10000; 1/ 2.4)
此將導致調適BLO的簡單方式,因為實際上在HDR編碼的ETSI1及ETSI2標準方法中,亦有平行於輝度處理鏈(圖4中的單元402至406及圖15中的1502至1506)之將具有角度glim的線性曲線施加至所感知的Y’HP且與由所解釋之單元所計算的Y’GL值比較及取得平行計算之二個值的最大者(此除了其他事物之外對ETSI1的可逆性係重要,以允許最暗的HDR明亮度的重建)的線性增益限制器,該等圖式僅用於易於瞭解所說明之發明人之方法的部分循序重新分級步驟。This will lead to a simple way of adapting the BLO, since in practice in the ETSI1 and ETSI2 standard methods of HDR coding there are also linear gain limiters in parallel with the brightness processing chain (units 402 to 406 in Fig. 4 and 1502 to 1506 in Fig. 15) that apply a linear curve with the angle glim to the perceived Y'HP and compare it with the Y'GL value calculated by the explained unit and take the maximum of the two values calculated in parallel (this is important among other things for the reversibility of ETSI1 to allow reconstruction of the dimmest HDR brightnesses), the figures are only used to easily understand the partially sequential re-grading steps of the described inventor's method.
現在可顯示由於此限制器之動作,BLO值可容易地以下列方程式而通道調適: BLO_IDR=BLO_gr*glim [方程式17] glim如上文所示地取決於PB_Mster_HDR的特定選擇,且可係例如0.6。It can now be shown that due to the action of this limiter, the BLO value can be easily adapted to the channel with the following equation: BLO_IDR=BLO_gr*glim [Equation 17] glim depends on the specific choice of PB_Mster_HDR as shown above and can be, for example, 0.6.
此以圖 17 說明。圖17b顯示圖17a所示之全範圍明亮度映射的最暗明亮度的放大。將各種函數再次顯示在經標準化圖表上,該等函數對應於各種輸入PB_C及輸出PB_C。This is illustrated in Figure 17. Figure 17b shows a zoom-in of the darkest brightness of the full range brightness map shown in Figure 17a. The various functions are again shown on a normalized graph, corresponding to various input PB_C and output PB_C.
FL_gr係由內容創作者建立之用於將例如4000尼特Mster_HDR映射至SDR的函數。點虛(dotted)曲線FL_ca係用以從Mster_HDR產生例如500尼特IDR的通道調適。虛(dashed)曲線FL_IDR係將IDR明亮度映射至SDR明亮度的曲線。在圖17b的放大圖表中,看到FL_gr曲線在約0.03的輸入處具有尖銳扭轉,其係平行增益限制器起作用處(亦即,其線性輸出y=glim*Y’HP被選為較低明亮度輸入的函數輸出,而非來自圖4所示之該鏈中的所有單元之動作的Y’GL值(關於全部電路描述,見ETSI1標準,圖4))。FL_gr is a function created by the content creator to map, for example, 4000 nits Master_HDR to SDR. The dotted curve FL_ca is a channel adaptation used to produce, for example, 500 nits IDR from Master_HDR. The dashed curve FL_IDR is the curve that maps IDR brightness to SDR brightness. In the zoomed-in graph of Figure 17b, the FL_gr curve is seen to have a sharp twist at about 0.03 of the input, which is where the parallel gain limiter comes into play (i.e., its linear output y=glim*Y’HP is chosen as the function output for lower brightness inputs, rather than the Y’GL value from the action of all units in the chain shown in Figure 4 (see ETSI1 standard, Figure 4 for the full circuit description)).
任何曲線的BLO值係若沒有增益限制時將發生的與水平軸的相交,亦即,例如如點虛線所作地藉由將局部斜率延伸至高於FL_gr曲線的0.3所示的BLO_gr。The BLO value for any curve is the intersection with the horizontal axis that would occur if there was no gain limiting, i.e., BLO_gr by extending the local slope to above 0.3 of the FL_gr curve as shown by the dotted line, for example.
針對此施加,知道亦可延伸FL_IDR曲線以獲得BLO_IDR值(應注意有glim_IDR值,其係ETSI2標準所將使用的,其不同於glim_gr),及知道可將此較低BLO_IDR值發現為glim*BLO_gr(應注意此glim(唯一需要針對SLHDR2PLUS計算之glim)係在圖17b中顯示為glim_gr的事物)係足夠的。For this application, it is sufficient to know that the FL_IDR curve can also be extended to obtain the BLO_IDR value (note that there is a glim_IDR value, which is what the ETSI2 standard will use, which is different from glim_gr), and that this lower BLO_IDR value can be found as glim*BLO_gr (note that this glim (the only glim that needs to be calculated for SLHDR2PLUS) is the one shown as glim_gr in Figure 17b).
隨後,執行以下計算以獲得WLO_IDR。Subsequently, the following calculation is performed to obtain WLO_IDR.
圖17a亦顯示者係有三個不同的WLO,亦即,原本由分級者產生為他的主HDR至SDR映射策略的WLO_gr(亦為圖12b中的ON)、在FL_ca曲線橫跨上水平線處的經通道調適的WLO_ca、及係WLO_gr明亮度至IDR明亮度軸上之映射的該者(其可使用像圖12的表示來發想,其中MXH投影至MXI,且最後亦有WLO_IDR,其係剩下用於將IDR明亮度向下明亮度映射至SDR的WLO(當從用於WLO_gr及WLO_ca的關聯PB_C=5000(因為用於使用該等函數之重新分級的輸入影像係5000尼特Mster_HDR)開始至用於重新分級所需之IDR相關定義的PB_C=1000尼特時(因為在ETSI2順應觀點中,所接收之自其推導其他影像的起始影像係,例如,1000尼特PB_C IDR影像),其因為經正規化明亮度橫座標定義改變而與經按比例調整WLO_ca不同)。What is also shown in Figure 17a is that there are three different WLOs, namely the WLO_gr that was originally generated by the grader as his main HDR to SDR mapping strategy (also ON in Figure 12b), the channel-adapted WLO_ca where the FL_ca curve crosses the upper horizontal line, and the one that is the mapping of WLO_gr brightness to the IDR brightness axis (which can be imagined using a representation like Figure 12, where MXH is projected onto MXI, and finally there is also WLO_IDR, which is left to convert IDR luminance down-luminance mapping to SDR's WLO (which differs from the scaled WLO_ca due to the changed normalized luminance coordinate definition when starting from the associated PB_C=5000 for WLO_gr and WLO_ca (because the input image for regrading using these functions is 5000 nits Mster_HDR) to the IDR-related defined PB_C=1000 nits required for regrading (because in an ETSI2 compliant viewpoint the received starting image from which the other images are derived is, for example, a 1000 nits PB_C IDR image).
圖17c在函數圖表的上隅角(接近[1,1])上放大。如從(經正規化)縱座標位置至橫座標位置的圓形投影所示,WLO_IDR值遵循將作為輸入發送通過FL_ca曲線的WLO_gr值。在圖12b上看到MXI位置確實係在IDR明亮度軸上之被映射至1.0的SDR明亮度的經正規化位置,因此其係如WLO_IDR之定義之所需。Figure 17c zooms in on the upper corner of the function graph (close to [1,1]). As shown by the circular projection from the (normalized) ordinate position to the abscissa position, the WLO_IDR value follows the WLO_gr value that will be sent through the FL_ca curve as input. It can be seen in Figure 12b that the MXI position is indeed the normalized position on the IDR brightness axis that is mapped to an SDR brightness of 1.0, so it is as required by the definition of WLO_IDR.
可從表面上認為,若WLO值隨後在編碼側通過其的映射曲線係para(見圖4,單元404在單元403之後映射),其一般係將涉及之para的上線性區段。It can be assumed superficially that if the curve through which the WLO value is subsequently mapped on the encoding side is para (see FIG. 4 , unit 404 is mapped after unit 403 ), it will generally involve an upper linear section of para.
然而,由於para如何定義,可涉及其之任何部分(甚至有僅有para之SG的特殊值定義理論上移動至高於1.0之非常高的交點的設定,因此在至多最亮明亮度之情形中的行為僅由陰影增益斜率判定,導致對將大多含有非常亮的明亮度的HDR影像重新分級至SDR有用的線性曲線,像是例如,科幻電影中由5個太陽照明的沙漠行星)。因此,此變成有點涉及計算,其中其需要測試para的三個子部分的何者係可施加的,較佳的數學實現係: WLO_co=255*WLO_ca/510 BLO_co=255*BLO_ca/2040 Xh=(1-HG_ca)/(SG_ca-HG_ca)+WP_ca WW=(1-WLO_gr*255/510-BLO_co)/(1-WLO_co-BLO-co) IF WW>=Xh THEN WLO_IDR=HG_ca*(1-WW)*510/255 [該上線性區段] ELSE { Xs=(1-HG_ca)/(SG_ca-HG_ca)-WP_ca IF WW>Xs { [輸入,亦即WLO_gr必須通過經通道調適para的拋物線子部分映射] A= -0.5*(SG_ca-HG_ca/(2*WP_ca)) B=(1-HG_ca)/(2*WP_ca) + (SG_ca+HG_ca)/2 C= -[(SG_ca-HG_ca)*(2*WP_ca)-2*(1-HG_ca)]^2 / (8*(SG_ca-HG_ca)* 2*WP_ca) WLO_IDR=(1-(A*WW*WW+B*WW+C))*510/255 } ELSE [在施加para之陰影增益子部分的特殊情形中] WLO_IDR =(1-SG_ca*WW)*510/255 }However, due to how para is defined, any part of it can be involved (there are even settings where only special values of SG of para define a very high intersection point that theoretically moves above 1.0, so the behavior in cases up to the brightest luminances is determined only by the shadow gain slope, resulting in a linear curve that is useful for regrading to SDR mostly HDR images containing very bright luminances, like for example a desert planet illuminated by 5 suns in a sci-fi movie). So this becomes a bit of a calculation involving testing which of the three sub-parts of para are applicable, the best mathematical implementation is: WLO_co=255*WLO_ca/510 BLO_co=255*BLO_ca/2040 Xh=(1-HG_ca)/(SG_ca-HG_ca)+WP_ca WW=(1-WLO_gr*255/510-BLO_co)/(1-WLO_co-BLO-co) IF WW>=Xh THEN WLO_IDR=HG_ca*(1-WW)*510/255 [the upper linear segment] ELSE { Xs=(1-HG_ca)/(SG_ca-HG_ca)-WP_ca IF WW>Xs { [Input, i.e. WLO_gr must be mapped through the parabolic sub-portion of the channel-adapted para] A= -0.5*(SG_ca-HG_ca/(2*WP_ca)) B=(1-HG_ca)/(2*WP_ca) + (SG_ca+HG_ca)/2 C= -[(SG_ca-HG_ca)*(2*WP_ca)-2*(1-HG_ca)]^2 / (8*(SG_ca-HG_ca)* 2*WP_ca) WLO_IDR=(1-(A*WW*WW+B*WW+C))*510/255 } ELSE [In the special case of the shadow gain sub-portion of para applied] WLO_IDR =(1-SG_ca*WW)*510/255 }
這些參數SG_IDR、HG_IDR、WP_IDR、BLO_IDR、WLO_IDR(且若有需要,用於可客製化曲線的類似額外參數)係特徵化且因此作為函數F_I2sCI之輸出的參數(實際上是否輸出特徵化此所需曲線的形狀以進行顯示調適的此等參數,或是否輸出特徵化該函數的LUT僅係實施例選擇;主要者係在經正規化至1.0之軸系統中的正確輝度映射函數形狀F_I2sCI作為後設資料與(多個)IDR影像共同傳達)。These parameters SG_IDR, HG_IDR, WP_IDR, BLO_IDR, WLO_IDR (and similar additional parameters for customizable curves if desired) are the parameters that characterize and are therefore the output of the function F_I2sCI (whether these parameters characterizing the shape of this desired curve for display adaptation are actually output, or whether a LUT characterizing the function is output is merely an implementation choice; the main thing is that the correct brightness mapping function shape F_I2sCI in an axis system normalized to 1.0 is conveyed as meta-data together with the IDR image(s)).
編碼器現在根據新穎的SLHDR2PLUS方法特徵化。然後問題係應如何設計解碼器。必須瞭解此解碼器現在將僅得到F_I2sCI函數,所以其必須稍微計算從已接收IDR影像重建原始Mster_HDR影像所需的函數F_??。在此SLHDR2PLUS編碼方法中,此將係使用在編碼器中以產生IDR明亮度的F_H2hCI函數的反函數,但此類函數仍應係可計算的。The encoder is now characterized according to the novel SLHDR2PLUS method. The question then is how the decoder should be designed. It must be understood that the decoder will now only get the F_I2sCI function, so it must somewhat compute the function F_?? needed to reconstruct the original Master_HDR image from the received IDR image. In the SLHDR2PLUS encoding method this will be the inverse of the F_H2hCI function used in the encoder to generate the IDR luminance, but such a function should still be computable.
如圖 11 所大致說明的,SLHDR2PLUS視訊解碼器1100,輝度函數判定單元1104必須僅基於其接收的資訊計算F_??函數,亦即F_I2sCI及二個峰亮度PB_CH及PB_C_H50。一旦判定函數,可施加其以重建原始的Mster_HDR輝度,藉由施加其(在顏色變換器1102中)至所接收的IDR明亮度:L_REC_M_HDR=F_??(L_IDR),對應HDR輝度可藉由施加方程式1及方程式2的反函數至該等L_REC_M_HDR明亮度而從該明亮度計算。最後,經重建主HDR影像(REC_M_HDR)可依需要以任何格式由顏色變換器1102輸出,例如,基於PQ的YCbCr顏色配方等。解碼器1100在較佳實施例中亦可經組態以計算任何經顯示調適影像,例如,MDR_300,假如300尼特PB_D連接顯示器待以所接收之HDR影像的最佳等效物供應,且此可藉由SLHDR2PLUS數學、或僅由正規的ETSI2解碼完成,因為適當影像(IDR)及輝度映射函數(F_I2sCI)已可用為顏色變換器1102中的輸入)。 As generally illustrated in FIG . 11 , the SLHDR2PLUS video decoder 1100, the brightness function determination unit 1104 must calculate the F_?? function based only on the information it receives, namely F_I2sCI and the two peak brightnesses PB_CH and PB_C_H50. Once the function is determined, it can be applied to reconstruct the original Master_HDR brightness by applying it (in the color converter 1102) to the received IDR brightness: L_REC_M_HDR=F_??(L_IDR), and the corresponding HDR brightness can be calculated from the brightness by applying the inverse functions of Equations 1 and 2 to the L_REC_M_HDR brightness. Finally, the reconstructed master HDR image (REC_M_HDR) can be output by the color converter 1102 in any format as desired, such as a PQ-based YCbCr color recipe, etc. The decoder 1100 can also in a preferred embodiment be configured to calculate any displayed adapted image, for example MDR_300 if a 300 nit PB_D connected display is to be supplied with the best equivalent of the received HDR image, and this can be done by SLHDR2PLUS mathematics, or just by regular ETSI2 decoding, since the appropriate image (IDR) and brightness mapping function (F_I2sCI) are already available as inputs in the color converter 1102).
圖 14 顯示para所涉及者,以從所接收的IDR影像重建REC_M_HDR影像(類似計算將對WLO及BLO,及可客製化曲線形狀點完成,在可應用處(應注意如下文所討論的,一些實施例將不在Mster_HDR與IDR之間,而僅如同SDR降級技術般,亦即,在IDR與SDR之間施加可客製化曲線原則)。 Figure 14 shows what is involved in para to reconstruct a REC_M_HDR image from a received IDR image (similar calculations will be done for WLO and BLO, and customizable curve shape points where applicable (it should be noted that as discussed below, some embodiments will not be between Mster_HDR and IDR, but simply apply a customizable curve principle between IDR and SDR as in the SDR downgrade technique).
現在,需要計算新的主HDR重建陰影增益(SG_REC)及重建高亮度增益(HG_REC),且必須計算拋物線區段的反拋物線方程式以完成所需的重建para輝度映射函數形狀F_L_RHDR(注意僅用於說明目的,反SDR至Mster_HDR輝度映射函數亦已在此經正規化圖表上顯示為點虛線;應注意由於SDR至HDR映射的反函數性質,該曲線SG_RM的陰影增益等於1/SG_gr等)。Now, the new master HDR reconstructed shadow gain (SG_REC) and reconstructed high brightness gain (HG_REC) need to be calculated, and the inverse parabolic equation of the parabolic segment must be calculated to complete the required reconstructed para luminance mapping function shape F_L_RHDR (note that for illustrative purposes only, the inverse SDR to Master_HDR luminance mapping function has also been shown as a dotted line on this normalized graph; it should be noted that due to the inverse function nature of the SDR to HDR mapping, the shadow gain of this curve SG_RM is equal to 1/SG_gr, etc.).
圖 15 首先說明一般解碼器1502核心計算拓撲的一些態樣。如可見到的,其粗略地與編碼器的結構相同,儘管其在相反方向(從IDR重建REC_M_HDR)上執行重新分級,當可輕易地如需求所指定地重組態此一計算拓撲時,其係便利的。若輝度映射器1501得到(所有部分連續重新分級動作的)總LUT,其確實將以相似方式(像是解碼器)運作。 Figure 15 first illustrates some aspects of the core computational topology of a general decoder 1502. As can be seen, it is roughly the same structure as the encoder, although it performs rescaling in the opposite direction (reconstructing REC_M_HDR from IDR), which is convenient as this computational topology can be easily reconfigured as required. If the luminance mapper 1501 gets the overall LUT (of all partial successive rescaling actions), it will indeed operate in a similar way (like a decoder).
當然,一些差異需要經組態以使解碼器作正確的HDR重建重新分級。首先,L_in現在將係IDR經正規化輝度,且輸出輝度Lh將係正確地針對例如5000尼特PB_D顯示器演現按比例調整的經正規化輝度。也看見產生REC_M_HDR影像像素顏色(Rs, Gs, Bs)的最後乘數現在乘以在後設資料中接收的PB_C_H50值。事實上,由感知化器1502及線性化器1506執行的感知化外計算迴路分別將PB_CH及PB_C_H50值施加在方程式1及方程式2及此等方程式的反方程式中。亦應注意現在各種部分重新分級在其等存在的情況下的順序係相反的:首先感知IDR明亮度Y’IP藉由產生重新分級IDR明亮度Y’IPG的精細分級單元1503中的反可客製化曲線而精細分級。之後,至HDR明亮度軸的第一映射(亦即,用於對應的正確HDR模樣(事實上,5000尼特PB_C_H50 Mster_HDR模樣)的對應重新分布明亮度)係由粗略輝度映射單元1504執行,其施加圖14的反para,其仍需要正確地計算,且其將產生初始HDR明亮度Y’HC。最後,反黑色及白色偏移器1505將建立正確的經正規化REC_M_HDR明亮度(Y'HR),以與色度用於進一步的計算,以到達各像素的全三維顏色。如所解釋者,單元1504一般將得到所計算的SG_REC等(或待對應地施加至此等三個值之明亮度映射函數的LUT版本)。應注意若將各種PW值保持相同,則WP_REC再次係WP_gr。單元1505將類似地得到用於Mster_HDR之重建的黑色及白色偏移(WLO_REC, BLO_REC)。進行色度處理之核心單元的下部分(色度處理器1550)將類似於圖4的編碼器拓撲,除了在色度處理判定單元1551中載入正確的C_LUT F_C[Y](見下文解釋之其之計算)。Of course, some differences need to be configured to enable the decoder to do the correct HDR reconstruction re-grading. First, L_in will now be the IDR normalized luminance, and the output luminance Lh will be the normalized luminance correctly scaled for, e.g., a 5000 nit PB_D display presentation. Also see that the final multiplier that produces the REC_M_HDR image pixel color (Rs, Gs, Bs) is now multiplied by the PB_C_H50 value received in the metadata. In fact, the sensing outer calculation loops executed by the sensor 1502 and the linearizer 1506 apply the PB_CH and PB_C_H50 values in Equations 1 and 2 and the inverse of these equations, respectively. It should also be noted that now the order of the various partial regradings in their presence is reversed: first the perceived IDR brightness Y'IP is finely graded by the inverse customizable curve in the fine grading unit 1503 which generates the regraded IDR brightness Y'IPG. Afterwards, a first mapping to the HDR brightness axis, i.e. the corresponding redistributed brightness for the corresponding correct HDR sample (in fact, a 5000 nit PB_C_H50 Mster_HDR sample), is performed by the coarse brightness mapping unit 1504 which applies the inverse para of FIG. 14 , which still needs to be calculated correctly, and which will generate the initial HDR brightness Y'HC. Finally, the anti-black and white offset 1505 will establish the correct normalized REC_M_HDR brightness (Y'HR) for further calculations with the chrominance to arrive at the full three-dimensional color of each pixel. As explained, unit 1504 will generally obtain the calculated SG_REC, etc. (or a LUT version of the brightness mapping function to be applied to these three values accordingly). It should be noted that if the various PW values are kept the same, then WP_REC is again WP_gr. Unit 1505 will similarly obtain the black and white offsets (WLO_REC, BLO_REC) used for the reconstruction of Mster_HDR. The lower part of the core unit performing chroma processing (chroma processor 1550) will be similar to the encoder topology of Figure 4, except that the correct C_LUT F_C[Y] is loaded in the chroma processing decision unit 1551 (see its calculation explained below).
現在問題係是否及如何計算施加在經程式化以從IDR重建Mster_HDR的解碼器中之函數的參數(此係不在HDR視訊解碼之前發生的情況)。The question now is whether and how to calculate the parameters of the function applied in the decoder programmed to reconstruct Master_HDR from IDR (which is not the case before HDR video decoding).
例如,可見到用於陰影增益的方法。For example, one can see the method used for shadow gain.
在計算SG_REC之前,可詢問是否可判定從SDR至Mster_HDR的總陰影增益SG_RM,且然後可經由方程式12的除法自其判定SG_REC。 所以SG_IDR=SG_gr/SG_ca 亦可顯示SG_ca= (mx/mxca)*(SG_gr+1) -1Before calculating SG_REC, it can be asked whether the total shadow gain SG_RM from SDR to Master_HDR can be determined, and then SG_REC can be determined from it by division in Equation 12. So SG_IDR = SG_gr / SG_ca It can also be shown that SG_ca = (mx / mxca) * (SG_gr + 1) -1
此可看到因為myca=SG_ca*mxca(按照經通道調適之para的下線性區段的定義),且亦可看到myca=my-d = mx*SG_gr+(mx-mxca)。This can be seen because myca=SG_ca*mxca (by definition of the lower linear section of the channel-adapted para), and it can also be seen that myca=my-d = mx*SG_gr+(mx-mxca).
mxca/mx的第二關係藉由將方程式9的上方程式除以mx而遵循。The second relationship of mxca/mx follows by dividing the upper equation of Equation 9 by mx.
因為藉由將第一關係填充至第二者(移除mx/mxca部分)中,SG_ca可按照SG_gr寫出,現在最終關係可形成在SG_IDR與SG_gr之間: SG_ca= (SG_gr+1)/[(SG_gr-1)*(1-sc*)/2+1]-1 從此: SG_IDR=SG_gr/{(SG_gr+1)/[(SG_gr-1)*(1-sc*)/2+1]-1} [方程式18]Since SG_ca can be written in terms of SG_gr by filling the first relation into the second (removing the mx/mxca part), the final relation can now be formed between SG_IDR and SG_gr: SG_ca= (SG_gr+1)/[(SG_gr-1)*(1-sc*)/2+1]-1 From this: SG_IDR=SG_gr/{(SG_gr+1)/[(SG_gr-1)*(1-sc*)/2+1]-1} [Equation 18]
鑑於已知(已接收)的SG_IDR(且sc*已僅從峰亮度(其亦已知)計算,因為PB_CH(亦即,PB_IDR)及PB_C_H50二者已接收且PB_SDR通常係100尼特,但若非如此亦可被置入信號的後設資料中),此方程式現在可針對未知的SG_gr求解。Given the known (received) SG_IDR (and sc* has been calculated only from the peak brightness (which is also known) since both PB_CH (i.e. PB_IDR) and PB_C_H50 are received and PB_SDR is typically 100 nits, but can be built into the signal's metadata if this is not the case), this equation can now be solved for the unknown SG_gr.
將SG_IDR= y及SG_gr =x用於簡化符號,然後: y=[(x-1)*(1-sc*)*x/2+x]/[x-(x-1)*(1-sc*)/2] 因此:x^2+x*(y-1)*[(sc*+1)/(sc*-1)]-y=0 [方程式19] [其係y及sc*之函數的該等係數(下文中稱為A’、B’、C’)將於下文在用於重建Mster_HDR影像的明亮度之方程式的總系統中用以解出二次方程式]。Using SG_IDR = y and SG_gr =x to simplify notation, then: y=[(x-1)*(1-sc*)*x/2+x]/[x-(x-1)*(1-sc*)/2] Thus: x^2+x*(y-1)*[(sc*+1)/(sc*-1)]-y=0 [Equation 19] [These coefficients (hereinafter referred to as A’, B’, C’) which are functions of y and sc* will be used hereinafter to solve the quadratic equation in the overall system of equations for reconstructing the brightness of the Master_HDR image].
為判定給定重建輝度映射函數之形狀的所有參數,下列方程式一般可在實施例的一者中完成(此重建用以在編碼器側產生IDR影像的函數的反函數)。首先,判定正確的para,黑色與白色偏移係可隨後自其計算的。To determine all parameters given the shape of the reconstructed luminance mapping function, the following equations can generally be done in one of the embodiments (this reconstruction is the inverse of the function used to generate the IDR image at the encoder side). First, determine the correct para, from which the black and white offsets can then be calculated.
再次如上述地計算rhoSDR,並將rhoCH計算為: rhoCH=1+32*power(PB_CH/10000; 1/ 2.4) mu= log[1+(rhoSDR-1)*power(PB_CH/PB_SDR ; 1/2.4)]/log(rhoSDR) K及La及sc*如上述地計算,其中K=P_HoS且La=P_HoI A’=1 B’=(SG_IDR-1)*(sc*+1)/(sc*-1) C’=-SG_IDRCalculate rhoSDR again as above, and calculate rhoCH as: rhoCH=1+32*power(PB_CH/10000; 1/ 2.4) mu= log[1+(rhoSDR-1)*power(PB_CH/PB_SDR ; 1/2.4)]/log(rhoSDR) K, La and sc* are calculated as above, where K=P_HoS and La=P_HoI A’=1 B’=(SG_IDR-1)*(sc*+1)/(sc*-1) C’=-SG_IDR
一旦已能夠在解碼器側判定所有所需函數的必要參數(留意:從其他經接收可用參數SG_IDR等),解碼的其餘部分由於剛施加編碼的(多個)反曲線之可逆性(例如,像圖14中的para(藉由已計算其之適當定義的參數1/SG_REC等而合適地成形))而將取消如圖10所繪示之IDR編碼para的行動,亦即定義IDR至Mster_HDR明亮度的重新解碼等)。Once it has been possible to determine the necessary parameters of all required functions on the decoder side (note: from other received available parameters SG_IDR etc.), the rest of the decoding will cancel the action of the IDR encoding para as shown in FIG. 10 due to the reversibility of the inverse curve(s) of the just applied coding (e.g. like para in FIG. 14 (suitably shaped by calculating its appropriately defined parameters 1/SG_REC etc.) i.e. the re-decoding of the defined IDR to Master_HDR brightness etc.).
從此遵循 SG_gr=[-B’+SQRT(B’ ^2-4*A’*C’)]/2*A’From now on, follow SG_gr=[-B’+SQRT(B’ ^2-4*A’*C’)]/2*A’
其中^2指示平方。 SG_REC = SG_gr/SG_IDR [方程式20]Where ^2 indicates square. SG_REC = SG_gr/SG_IDR [Equation 20]
因此,反通道調適陰影增益(1/SG_REC)已係已知。Therefore, the inverse channel adaptation shadow gain (1/SG_REC) is already known.
類似地,可計算所需的高亮度增益。 A’’= (SG_REC*HG_IDR -SG_gr)*(SG_gr+1)/(SG_REC+1) B’’=SG_gr-HG_IDR-(SG_REC*HG_IDR-1)*(SG_gr+1)/(SG_REC+1) C’’=HG_IDR-1 MxRec=[-B’’+SQRT(B’’ ^2-4*A’’*C’’)]/2*A’’ IF MxRec =1 THEN HG_REC= 0 ELSE = HG_REC= max[0,(MxRec *SG_gr-1)/(MxRec -1)]Similarly, the required high brightness gain can be calculated. A’’= (SG_REC*HG_IDR -SG_gr)*(SG_gr+1)/(SG_REC+1) B’’=SG_gr-HG_IDR-(SG_REC*HG_IDR-1)*(SG_gr+1)/(SG_REC+1) C’’=HG_IDR-1 MxRec=[-B’’+SQRT(B’’ ^2-4*A’’*C’’)]/2*A’’ IF MxRec =1 THEN HG_REC= 0 ELSE = HG_REC= max[0,(MxRec *SG_gr-1)/(MxRec -1)]
當para函數係定義自其參數時,一旦計算出其等,所需para即被定義。When a para function is defined from its parameters, once its parameters are computed, the required para is defined.
為獲得BLO_REC及WLO_REC,執行下列方程式: mx=(1-HG_gr)/(SG_gr-HG_gr) mxca=mx*(SG_gr-1)*(1-sc*)/2+mx myca=mx*(SG_gr+1)-mxca SG_ca=myca/mxca IF mxca=1 THEN HG_ca=0 ELSE HG_ca=max[0, (myca-1)/(mxca-1)] ScaleHor=(1-1/La)/(1-1/K) RHO=1+32*power(PB_C_H50/10000; 1/2,4) glim = {log[1 + (rhoSDR-1) * (0.1/100)^(1/2.4)] / log(rhoSDR)}/{log[1 + (RHO-1) * (1/PB_C_H50)^(1/2.4)] / log(RHO)}; [如之前;因為在ETSI方法中的Im_PB_C_1 <> Im_PB_C_2機制的此一固定平行旁路,該glim與由編碼器使用的相同,該等二個影像被定義為從相同的PB_C_1開始重新分級,且在此特定SLHDR2PLUS方法中分別係Mster_HDR及IDR影像] BLO_gr=BLO_IDR/glim[方程式17的反方程式,因此,此相對容易判定而不需更高階方程式,且隨後僅需要施加固定的經通道調適機制以獲得所需的WLO_REC,其等於由編碼使用的WLO_ca,但現在將係反的,加法變成減法] BLO_REC=BLO_ca=BLO_REC*ScaleHorTo obtain BLO_REC and WLO_REC, execute the following equations: mx=(1-HG_gr)/(SG_gr-HG_gr) mxca=mx*(SG_gr-1)*(1-sc*)/2+mx myca=mx*(SG_gr+1)-mxca SG_ca=myca/mxca IF mxca=1 THEN HG_ca=0 ELSE HG_ca=max[0, (myca-1)/(mxca-1)] ScaleHor=(1-1/La)/(1-1/K) RHO=1+32*power(PB_C_H50/10000; 1/2,4) glim = {log[1 + (rhoSDR-1) * (0.1/100)^(1/2.4)] / log(rhoSDR)}/{log[1 + (RHO-1) * (1/PB_C_H50)^(1/2.4)] / log(RHO)}; [As before; because Im_PB_C_1 in the ETSI method <> This fixed parallel bypass of the Im_PB_C_2 mechanism, the glim is the same as used by the encoder, the two images are defined as re-graded starting from the same PB_C_1, and in this particular SLHDR2PLUS method are the Master_HDR and IDR images respectively] BLO_gr=BLO_IDR/glim[The inverse of Eq. 17, so this is relatively easy to determine without higher order equations, and then only a fixed per-channel adaptation mechanism needs to be applied to obtain the required WLO_REC, which is equal to the WLO_ca used by the encoder, but now it will be inverted, and addition becomes subtraction] BLO_REC=BLO_ca=BLO_REC*ScaleHor
隨後,WLO_REC係藉由將其投影通過待於隨後反轉的para(如編碼原理)而計算。 IF HG_ca=0 WLO_REC=0 ELSE { BLO_co=255*BLO_ca/2040 Xh=(1-HG_REC)/(SG_REC-HG_REC)+WP_REC Xh_REC=HG_REC*Xh+1-HG_REC WW_REC=1-WLO_IDR*255/510 IF WW_REC>=Xh_REC THEN WCA=1-(1-WW_REC)/HG_REC ELSE Xs=(1-HG_REC)/(SG_REC-HG_REC)-WP_REC Xsca=SG_REC*Xs IF WW_REC>Xsca { A’’’=-0.5*(SG_REC-HG_REC)/(2*WP_REC) B’’’=(1-HG_REC)/(2*WP_REC)+(SG_REC+HG_REC)/2 C’’’= - [(SG_REC-HG_REC)*(2*WP_REC)-2*(1-HG_REC)]^2 / [8*(SG_REC-HG_REC)*(2*WP_RE)] WCA=(-B’’’+SQRT(B’’’^2-4*A’’’*{C’’’-WW_REC})/(2*A’’’) WCA=min (WCA,1) } ELSE WCA=WW_REC/SG_REC WLO_REC=(1-WCA)*(1-BLO_co)/[(1-WCA*ScaleHor)*(510/255)]WLO_REC is then calculated by projecting it through para (as in the coding principle) to be inverted afterwards. IF HG_ca=0 WLO_REC=0 ELSE { BLO_co=255*BLO_ca/2040 Xh=(1-HG_REC)/(SG_REC-HG_REC)+WP_REC Xh_REC=HG_REC*Xh+1-HG_REC WW_REC=1-WLO_IDR*255/510 IF WW_REC>=Xh_REC THEN WCA=1-(1-WW_REC)/HG_REC ELSE Xs=(1-HG_REC)/(SG_REC-HG_REC)-WP_REC Xsca=SG_REC*Xs IF WW_REC>Xsca { A’’’=-0.5*(SG_REC-HG_REC)/(2*WP_REC) B’’’=(1-HG_REC)/(2*WP_REC)+(SG_REC+HG_REC)/2 C’’’= - [(SG_REC-HG_REC)*(2*WP_REC)-2*(1-HG_REC)]^2 / [8*(SG_REC-HG_REC)*(2*WP_RE)] WCA=(-B’’’+SQRT(B’’’^2-4*A’’’*{C’’’-WW_REC})/(2*A’’’) WCA=min (WCA,1) } ELSE WCA=WW_REC/SG_REC WLO_REC=(1-WCA)*(1-BLO_co)/[(1-WCA*ScaleHor)*(510/255)]
應注意雖然BLO就映射而言實際上係純加法貢獻,WLO轉換成對最大值的乘法按比例調整(例如,在圖4中): Y’HPS= (Y’HP-BLO)/(1-BLO-WLO) [方程式21]Note that while BLO is effectively a purely additive contribution with respect to mapping, WLO translates into a multiplicative scaling to the maximum value (e.g., in Figure 4): Y’HPS = (Y’HP-BLO)/(1-BLO-WLO) [Equation 21]
所有此資訊一般可填充至單一輝度處理LUT中,其將Y’IP(例如,在感知域中)相關於Y’HR(或更佳地,仍係其針對各L_in值定義Lh的總LUT)。此將重建REC_M_HDR影像。All this information can generally be populated into a single brightness processing LUT that relates Y'IP (e.g. in the perceptual domain) to Y'HR (or better still an overall LUT that defines Lh for each L_in value). This will reconstruct the REC_M_HDR image.
如上文所提及的,若解碼器可直接輸出經顯示調適影像,例如,MDR_300,其亦係有用的。As mentioned above, it is also useful if the decoder can directly output display adapted images, e.g., MDR_300.
後續技術可如圖 16 所說明地使用(其中使用二個部分LUT,實務上最有用的係僅載入稱為P_LUT的一個LUT,因為輝度計算上軌係在較佳的核心計算單元中,例如,專用解碼IC的各像素顏色處理器,一般簡單地具現為LUT。Y_IDR明亮度值係輸入(例如,一般經基於PQ的YCbCr編碼),且其等由線性化器1601轉換成經正規化輝度L_in。感知化器1602如上文解釋地運作(方程式1及方程式2),並使用用於IDR峰亮度PB_IDR的RHO值,例如,1000尼特。此產生經感知化IDR明亮度Y’IP。輝度映射單元1603如上文所解釋地重建主HDR影像,亦即,其得到定義IDR至MsterHDR重建輝度映射函數F_L_REC的所有經計算參數,或一般得到該函數形狀的LUT。此產生經重建Mster_HDR明亮度Y’HPR。此影像形成計算低動態範圍/峰亮度PB_C之影像的良好基礎。基本上,倘若施加正確的函數,此像ETSI2機制般操作。此等函數可從共同傳達為後設資料的F_L_IDR按比例調整,或從經重建F_50t1函數計算,該經重建函數係內容創作者在其側定義為最佳函數以從Mster_HDR影像計算主SDR影像之函數的重建。然後可根據定義在ETSI2標準中的原理(此細節讀者請參考該標準)將此F_50t1函數計算成用於,例如,300尼特PB_D的適當顯示調適函數F_L_DA。將此載入至HDR至MDR輝度映射器1604(假設有一者)。實務上,單一P_LUT將含有F_L_REC及後續F_L_DA的全部動作。The subsequent technique may be used as illustrated in FIG . 16 (where two partial LUTs are used, it is most useful in practice to load only one LUT, called P_LUT, since the luminance calculation is on track in a better core computing unit, e.g., each pixel color processor of a dedicated decoding IC, which is typically simply implemented as a LUT. The Y_IDR brightness values are input (e.g., typically PQ-based YCbCr encoding), and they are converted by the linearizer 1601 into normalized luminance L_ in. The sensor 1602 operates as explained above (Eq. 1 and Eq. 2) and uses the RHO value for the IDR peak brightness PB_IDR, e.g. 1000 nits. This produces a sensed IDR brightness Y'IP. The luminance mapping unit 1603 reconstructs the master HDR image as explained above, i.e. it obtains all calculated parameters defining the IDR to MasterHDR reconstruction luminance mapping function F_L_REC, or generally obtains the function of the form This produces a reconstructed Master_HDR brightness Y'HPR. This image forms a good basis for calculating the low dynamic range/peak brightness PB_C image. Basically, this operates like the ETSI2 mechanism if the right functions are applied. These functions can be scaled from the F_L_IDR commonly communicated as metadata, or calculated from a reconstructed F_50t1 function that the content creator defines on his side as the best function to get from Master_ The HDR image calculates a reconstruction of the function of the main SDR image. This F_50t1 function can then be calculated into an appropriate display adaptation function F_L_DA for, for example, 300 nits PB_D according to the principles defined in the ETSI2 standard (the reader is referred to the standard for this detail). This is loaded into the HDR to MDR brightness mapper 1604 (assuming there is one). In practice, a single P_LUT will contain all the actions of F_L_REC and subsequently F_L_DA.
最後,將所獲得的MDR相對輝度發送至圖4的第一乘法器454,以進行相同處理(亦使用正確伴隨的F_C[Y])。Finally, the obtained MDR relative brightness is sent to the first multiplier 454 of Figure 4 for the same processing (also using the correct accompanying F_C[Y]).
最後,需要計算適當的C_LUT(分別在圖4或圖15中的F_C[Y]),其給予輝度重新分級輸出顏色其等的適當色度(以具有儘可能接近Mster_HDR影像的模樣,亦即,輸出影像像素及Mster_HDR影像的色度應達到可能給予不同的較小動態範圍幾乎相同的程度)。Finally, a suitable C_LUT needs to be calculated (F_C[Y] in Figure 4 or Figure 15 respectively) which gives the appropriate chromaticity to the brightness regraded output colors (to have a look as close as possible to the Master_HDR image, i.e. the chromaticity of the output image pixels and the Master_HDR image should be almost the same, possibly giving a different smaller dynamic range).
用於Mster_HDR重建的C_LUT係如下(其他重新分級C-LUT運算遵循類似原理,例如,將ETSI2教示列入考量)。The C_LUT used for Master_HDR reconstruction is as follows (other re-grading C-LUT operations follow similar principles, e.g. taking ETSI2 teachings into account).
首先計算CP-LUT,其係上文提及之在編碼器施加以將Mster_HDR影像映射至IDR影像之P_LUT的反式(因此在解碼器中,此反色度校正將用以從所接收之IDR影像色度Cb及Cr反轉換至Mster_HDR經重建色度)。First the CP-LUT is calculated, which is the inverse of the P_LUT mentioned above applied at the encoder to map the Master_HDR image to the IDR image (so in the decoder this inverse chrominance correction will be used to inversely convert from the received IDR image chrominance Cb and Cr to the Master_HDR reconstructed chrominance).
然後可將用於Mster_HDR重建的C_LUT運算如下: XH=v(PB_M_HDR; 10000) XS=v(PB_SDR=100; 10000) XD=v(PB_D; 10000) XC=v(PB_CH; 10000)Then the C_LUT for Master_HDR reconstruction can be calculated as follows: XH=v(PB_M_HDR; 10000) XS=v(PB_SDR=100; 10000) XD=v(PB_D; 10000) XC=v(PB_CH; 10000)
其中v再度係如上述方程式1及方程式2所定義的函數v(x,RHO)。 CfactCH=1-(XC-XS)/(XH-XS) CfactCA=1-(XD-XS)/(XH-XS) C_LUT[Y}=[1+CfactCA*power(CP_LUT[Y] ;2.4)]/[Y*{1+CfactCH* power(CP_LUT[Y] ;2.4)}] [方程式22]where v is again the function v(x,RHO) defined in Equations 1 and 2 above. CfactCH=1-(XC-XS)/(XH-XS) CfactCA=1-(XD-XS)/(XH-XS) C_LUT[Y}=[1+CfactCA*power(CP_LUT[Y] ;2.4)]/[Y*{1+CfactCH* power(CP_LUT[Y] ;2.4)}] [Equation 22]
可將顯示器目標PB_D設定成PB_Mster_HDR以用於重建,在該情形中,僅有除法器保持為C_LUT判定器。在實際實施例中,2.4次方亦可在LUT中包括為,例如,CPP_LUT = power(CP_LUT[Y]; 2.4),其在一些實施例中可節省一些運算。The display target PB_D may be set to PB_Mster_HDR for reconstruction, in which case only the divider remains as the C_LUT determiner. In a practical embodiment, the 2.4th power may also be included in the LUT as, for example, CPP_LUT = power(CP_LUT[Y]; 2.4), which may save some operations in some embodiments.
上文說過SLHDR2PLUS編碼器的一些實際實施例(用於目前的ETSI2後設資料定義順應性)重計算HG_gr以用於順應HG_IDR值。此可如下文所述地完成。It was mentioned above that some practical embodiments of the SLHDR2PLUS encoder (for current ETSI2 metadata definition compliance) recalculate HG_gr for use in conforming HG_IDR values. This can be done as described below.
例如,後設資料可保留用於para之HG的8位元碼,亦即在此情形中,因為IDR影像+其後設資料應係ETSI2順應信號,問題係所需要的HG_IDR是否將適配在所分配的碼中。該標準一般使用碼分配函數以將實體需要HG_IDR變換成某個HG_COD : [0,255]中的HG_COD = F_COD[HG_IDR]。例如,FCOD可係128*(4*HG_IDR),其意謂著255的最大值對應於0.5的最大HG_IDRFor example, the metadata may reserve an 8-bit code for the HG of para, i.e. in this case, since the IDR picture + its metadata should be ETSI2 compliant signaling, the question is whether the required HG_IDR will fit in the allocated code. The standard generally uses a code allocation function to transform the actual required HG_IDR into a certain HG_COD: HG_COD = F_COD[HG_IDR] in [0,255]. For example, FCOD may be 128*(4*HG_IDR), which means that a maximum value of 255 corresponds to a maximum HG_IDR of 0.5.
想要確保將IDR影像產生成使得HG_IDR恰適配在碼範圍中,亦即,實用的實施例可藉由將分級者的HG_gr調適一些而實現此(使得具有固定通道調適且基於其之IDR後設資料判定恰好避免溢出)。It is desirable to ensure that the IDR image is generated such that HG_IDR fits exactly within the code range, i.e., a practical embodiment may achieve this by adapting the grader's HG_gr a little (so that the IDR meta-data decision with fixed channel adaptation and based thereon just avoids overflow).
用於此(可選的)實施例的計算可係,例如: 設定HG_IDR=(254*2)/(255*4); Exposure=shadow/4+0.5 [其中shadow係陰影增益SG_gr的ETSI2編碼] SG_gr=K*exposure A= SG_gr*(HG_IDR-1)-0.5*(SG_gr-1)*(1-sc*)*(HG_IDR+SG_gr) B=SG_gr-HG_IDR+1+0.5*(SG_gr-1)*(1-sc*)*(HG_IDR+1) C=HG_IDR-1 MxLM=[-B+sqrt(B*B-4*A*C)]/(2*A) IF MxLM= 1 THEN HG_gr_LM = 0 ELSE HG_gr_LM=max[0, (MxLM*SG_gr-1)/(MxLM-1)] 其中HG_gr_LM係經調整的HG_gr值。然後該演算法的其餘部分將如上文所述地運作,就好像分級者從開始就選擇最佳HG_gr_LM值。The calculations used for this (optional) embodiment may be, for example: Set HG_IDR=(254*2)/(255*4); Exposure=shadow/4+0.5 [where shadow is the ETSI2 encoding of the shadow gain SG_gr] SG_gr=K*exposure A= SG_gr*(HG_IDR-1)-0.5*(SG_gr-1)*(1-sc*)*(HG_IDR+SG_gr) B=SG_gr-HG_IDR+1+0.5*(SG_gr-1)*(1-sc*)*(HG_IDR+1) C=HG_IDR-1 MxLM=[-B+sqrt(B*B-4*A*C)]/(2*A) IF MxLM= 1 THEN HG_gr_LM = 0 ELSE HG_gr_LM=max[0, (MxLM*SG_gr-1)/(MxLM-1)] Where HG_gr_LM is the adjusted HG_gr value. The rest of the algorithm then works as described above, as if the grader had chosen the optimal HG_gr_LM value from the beginning.
此詳述處理SLHDR2PLUS新編解碼器設計問題的一種方法。取決於所作的技術選擇,特別係發現關鍵重要性的態樣之於可放寬的其他態樣,而有替代方式。This article details one approach to addressing the design issues of the new SLHDR2PLUS codec. Depending on the technical choices made, especially the aspects that are found to be of critical importance versus other aspects that can be relaxed, there are alternative approaches.
上述數學定義實作HDR解碼器的全新方式,其至少具有與ETSI1及ETSI2方法一致的核心計算方法:特別係雖然不同形狀的P-LUT及C_LUT函數將如其等於上文所說明地計算(儘管圖4及圖15詳述吾等的HDR編碼方法如何及為何運作背後的技術物理原則,實務上等效於分別在明亮度處理軌401及1501中之輝度處理的整體明亮度處理[在一維顏色態樣中,經由非線性相關的該等二者係影像相依函數變換]係藉由載入正確的總P_LUT明亮度映射函數形狀而執行,並類似地對分別在單元451及1551中之稱為F_C[Y]的C-LUT),該計算拓撲可重新使用,其對客戶係高度有用的性質(其等必須購買在例如STB中的IC一次,且其可藉由重新程式化後設資料處置但維持每像素顏色變換引擎而重組態成各種新的編碼原則)。The above mathematics define a completely new way of implementing an HDR decoder, which has at least the core calculation method consistent with the ETSI1 and ETSI2 methods: in particular, although the P-LUT and C_LUT functions of different shapes will be calculated as they are equivalent to those described above (although Figures 4 and 15 detail the technical physics behind how and why our HDR encoding method works, which is actually equivalent to the overall luminance processing of the luminance processing in the luminance processing tracks 401 and 1501 respectively [in a one-dimensional color state, These two are image dependent functional transforms via non-linear correlation] which are performed by loading the correct overall P_LUT brightness mapping function shape, and similarly for the C-LUT called F_C[Y] in units 451 and 1551 respectively), the computational topology can be reused, which is a highly useful property for customers (who have to buy the IC once, e.g. in a STB, and which can be reconfigured to various new encoding strategies by reprogramming the metadata handling but maintaining the per-pixel color transform engine).
亦可設計深入地重新使用相同的ETSI2解碼數學的IDR編碼技術(亦即,部分重新分級1503至1505的鏈),藉由僅指示ETSI2解碼器合適地外插而替代其之將所接收影像降級的正常作業,顯示器將其調適成PB_D < PB_IDR的顯示器。應強調此並非「盲目」外插法,其給出對應於IDR影像之模樣(亦即,尤其係IDR像素的相對明亮度或絕對輝度的統計分布)的「恰好任何」更高動態範圍影像,但實際上藉由編碼看起來儘可能地接近像是內容創作者側之原始Mster_HDR影像(其在此類實施例中亦仍未實際地接收,亦未接收其後設資料,例如,SG_gr)的HDR輸出影像的此方式「自動地」產生。此自動地當然不係如此簡單,並涉及在內容編碼側的正確方法。對於此原則之實施例中的解碼器,所接收的PB_C_H50次級峰亮度在核心每像素解碼器的程式化中等效地運行,彷彿其係所期望的顯示器亮度PB_D(然後其係例如5x高於PB_IDR)。It is also possible to design an IDR coding technique that deeply reuses the same ETSI2 decoding mathematics (i.e., partially reclassifying the chain 1503 to 1505), by simply instructing the ETSI2 decoder to appropriately interpolate instead of its normal operation of degrading the received image, and the display adapts it to a display with PB_D < PB_IDR. It should be emphasized that this is not a "blind" extrapolation method that gives "just any" higher dynamic range image that corresponds to the look of an IDR image (i.e. in particular the statistical distribution of the relative brightness or absolute luminance of the IDR pixels), but is actually generated "automatically" in this way by encoding an HDR output image that looks as close as possible like the original Master_HDR image on the content creator's side (which in such embodiments is still not actually received, nor its metadata, e.g. SG_gr). This automaticity is of course not so simple and involves the right approach on the content encoding side. For the decoder in an embodiment of this principle, the received PB_C_H50 secondary peak brightness is equivalently run in the core per-pixel decoder programming as if it were the desired display brightness PB_D (which is then, for example, 5x higher than PB_IDR).
圖 18 說明此方法(編碼器數學在概念上如何運作的方塊圖)。此外,為了簡單,將假設(雖然不必將該等選擇連結為需要對此實例如此)選擇固定通道調適演算法的選擇自由以僅進行連結Mster_HDR及IDR的para變換,而將任何的BLO及WLO(若已可施加於目前影像或影像鏡頭)及可客製化曲線留給次級變換,亦即IDR至SDR重新分級,並將屬於ETSI2順應IDR信號的後設資料傳達給接收器(無論係傳統ETSI2接收器或SLHDR2PLUS解碼接收器)。 Figure 18 illustrates this approach (block diagram of how the codec math conceptually works). Again, for simplicity, it will be assumed (although it is not necessary to link these choices as needed for this example) that the choice of fixed channel adaptation algorithm is free to be chosen to perform only the para transform linking Master_HDR and IDR, leaving any BLO and WLO (if already applicable to the current image or image lens) and customizable curves to the secondary transform, i.e. IDR to SDR re-scaling, and conveying the metadata pertaining to the ETSI2 compliant IDR signal to the receiver (whether a conventional ETSI2 receiver or a SLHDR2PLUS decoding receiver).
首先需要一些介紹性定義:First some introductory definitions:
如圖10所示之para曲線的反曲線(亦即,具有如在上述方程式4及方程式5中所制定之ETSI經標準化形狀定義及藉由a*x^2+b*x+c定義的拋物線中間部分)在此本文中為了簡明應稱為abcara 的曲線。根據ETSI1第7節(HDR信號重建),其定義為: L_out = 1/SG * L_in (if 0<= L_in <= xS) L_out = -b/2a+ sqrt(b^2-4*a*(c-L_in))/2a (if xS< L_in < xH) L_out = 1/HG *(L_in-1)+1 (if xH<= L_in) [方程式23]The inverse curve of the para curve shown in Figure 10 (i.e., with the ETSI standardized shape definition as formulated in Equations 4 and 5 above and the parabolic middle part defined by a*x^2+b*x+c) shall be referred to as the abcara curve in this document for simplicity. According to ETSI1 Section 7 (HDR Signal Reconstruction), it is defined as: L_out = 1/SG * L_in (if 0<= L_in <= xS) L_out = -b/2a+ sqrt(b^2-4*a*(c-L_in))/2a (if xS< L_in < xH) L_out = 1/HG *(L_in-1)+1 (if xH<= L_in) [Equation 23]
其中xS及xH係線性區段改變成拋物線中間區段的點,與para如何針對編碼(或任何其他用途)定義一致。where xS and xH are the points where the linear segment changes to the middle segment of the parabola, consistent with how para is defined for encoding (or any other purpose).
圖18之視訊編碼器實施例所嘗試達成之事物的基本原理在圖 20 中顯示(在此實例中,已選擇說明500尼特的PB_C IDR的實例,不想要說此方法有些受限於或更適合較低的PB_IDR)。The basic principle of what the video encoder embodiment of Figure 18 attempts to achieve is shown in Figure 20 (in this example, an example of a PB_C IDR of 500 nits has been chosen to illustrate, without wanting to say that this approach is somehow limited to or more suitable for lower PB_IDRs).
若具有固定機制(在ETSI2相容或ETSI2傳統解碼器中)以從IDR外插至比PB_IDR更高的PB_C(使用此類PB_C設定,彷彿其係顯示器峰亮度),則亦可設計反轉該處理的編碼器,亦即藉由使用經合適調適外插輝度映射函數F_E_I2S的反函數F_ENCINV_H2I(從藉由IDR信號(亦即,IDR影像+包括F_I2S函數的後設資料)的接收器接收之符合ETSI2規格的F_I2S函數調適)建立IDR影像,且隨後加入正確的後設資料,其如所述地將係F_I2S,其待從由內容創作者(例如,人類分級者)或在任何中間即時編碼程序中的自動化等建立的總輝度映射函數F_H2S(例如,F_50t1)推導。If there is a fixed mechanism (in an ETSI2 compliant or ETSI2 legacy decoder) to extrapolate from IDR to a PB_C higher than PB_IDR (using such PB_C setting as if it were the display peak luminance), then the encoder can also be designed to invert this process, i.e. by using the inverse function F_ENCINV_H2I (from the IDR signal (i.e., I The IDR image is created by receiving a receiver (DR image + F_I2S function adaptation compliant with ETSI2 specification) and then adding the correct metadata, which as described will be F_I2S, which is derived from the total brightness mapping function F_H2S (e.g., F_50t1) established by the content creator (e.g., human grader) or by automation in any intermediate real-time encoding process.
該關係亦可以乘法觀點制定: L_SDR=m_F_I2S*m_F_ENCINV_H2I*L_HDR= m_F_I2S* L_IDR L_HDR=m_F_E_I2S*L_IDRThis relationship can also be formulated from a multiplication point of view: L_SDR=m_F_I2S*m_F_ENCINV_H2I*L_HDR= m_F_I2S* L_IDR L_HDR=m_F_E_I2S*L_IDR
其中m_F_I2S或更確切地說m_F_I2S(L_HDR)係實現每個任何選擇的L_HDR值之輝度重新分級所需的對應乘數,對應於F_I2S輝度映射函數形狀,且類似地用於其他乘數。where m_F_I2S or more precisely m_F_I2S(L_HDR) is the corresponding multiplier needed to achieve the brightness re-grading for each any chosen L_HDR value, corresponding to the F_I2S brightness mapping function shape, and similarly for the other multipliers.
所以必須解決從HDR至IDR之para的反式(亦即,從IDR運作至HDR的abcara)具有與(在任何L_IDR上開始)外插至PB_HDR的某個para相同的效果。So it must be solved that the inverse of para from HDR to IDR (i.e. abcara running from IDR to HDR) has the same effect as extrapolating (starting at any L_IDR) to some para of PB_HDR.
為了更好一些地瞭解,使用圖 21 。從較高輸入影像PB_C(亦即,在經由高於經正規化輸出影像輝度之PB_D的PB_Ch而對應於實際輝度的任何經正規化輸入輝度L_in_X上運作)至較低PB_D的正常內插模式中,原始分級者的para F_H2S(如藉由標準的ESTI2編碼視訊通訊鏈在後設資料中接收)將遵循朝向對角線[0,0]-[1,1]的箭頭對角地按比例調整,產生F_ENCIV_H2I(其現在對應於PB_IDR/PB_HDR vs. PB_SDR/PB_HDR(亦即,例如v(100/5000)/v(500;5000)=0.54/0.72 [其中v(x;y)係具有橫座標x之方程式1的函數,且RHO經由方程式2對應於y])的視覺均勻化虛擬對數距離比)。可想像通過將PB_HDR映射至PB_HDR的恆等處理而繼續從任何較高至較低PB_D情況的重新分級行為將產生變成陡峭下降的曲線,事實上對於para種類的輝度映射曲線,其等將在數學上變成abcara。實際上,用於外插任何已接收IDR影像的所需函數(基於在後設資料中接收的起始輝度映射函數F_H2S,藉由使用ETSI2第7.3章顯示器調適機制)F_E_I2S將係藉由在F_ENCINV_H2I之對角線周圍鏡像而獲得的鏡像函數(且反之亦然)。To understand this a bit better, use Figure 21. In normal interpolation mode from a higher input image PB_C (i.e. operating on any normalized input brightness L_in_X corresponding to actual brightness via PB_Ch above the normalized output image brightness PB_D) to a lower PB_D, the original grader's para F_H2S (as received in the metadata via a standard ESTI2 encoded video link) will be scaled diagonally following the arrows towards the diagonal [0,0]-[1,1], yielding F_ENCIV_H2I (which now corresponds to PB_IDR/PB_HDR vs. PB_SDR/PB_HDR (i.e., for example v(100/5000)/v(500;5000)=0.54/0.72 [where v(x;y) is the function of Eq. 1 with abscissa x and RHO is the visually averaged virtual log distance ratio via Eq. 2 for y]). One can imagine that continuing the regrading behavior from any higher to lower PB_D case by the constant process of mapping PB_HDR to PB_HDR will produce a curve that becomes a steeply descending curve, in fact for the para kind of brightness mapping curve, which becomes mathematically abcara. In practice, the required function for extrapolating any received IDR image (based on the starting luminance mapping function F_H2S received in the metadata, by using the ETSI2 Chapter 7.3 display adaptation mechanism) F_E_I2S will be the mirroring function obtained by mirroring around the diagonal of F_ENCINV_H2I (and vice versa).
因此,鑑於想要重新利用標準的ETSI2運算機制以實作SLHDR2PLUS功能性,所剩下的係定義對應編碼器,如圖 18 所說明的。Therefore, given the desire to reuse the standard ETSI2 algorithm to implement the SLHDR2PLUS functionality, all that remains is to define the corresponding codec, as illustrated in Figure 18 .
例如,F_ENCINV_H2I的SG在abcara定義1/SG * L_in_X中。For example, the SG of F_ENCINV_H2I is defined in abcara as 1/SG * L_in_X.
依據SG_COD(亦即,上述物理數學陰影增益SG的ETSI經定義編碼),得到(SG_COD=SGC*255/2以及ETSI1方程式C23 exposure=SGC/4 +0.5以及C24 expgain=v(PB_HDR=5000/PB_target=500; PB_target)以及方程式C27 SG=expgain*exposure): 1/[(SGC/4+0.5)*v(5000/500;500)]=(X/4+0.5)* v(500/5000;500) [方程式24]According to SG_COD (i.e., the ETSI defined encoding of the above physical mathematical shadow gain SG), we get (SG_COD=SGC*255/2 and ETSI1 equation C23 exposure=SGC/4 +0.5 and C24 expgain=v(PB_HDR=5000/PB_target=500; PB_target) and equation C27 SG=expgain*exposure): 1/[(SGC/4+0.5)*v(5000/500;500)]=(X/4+0.5)* v(500/5000;500) [Equation 24]
待針對未知的para陰影增益控制X求解(亦即,X係F_ENCINV_H2I的SG)。The unknown para shadow gain control X is to be solved (ie, X is the SG of F_ENCINV_H2I).
亦即,解碼器針對任何分級者的F_H2S選擇定義F_E_I2S形狀將如何像什麼(使用ETSI2 7.3演算法),但需要將其解譯為ETSI1 abcara,使得可將abcara相關於對應的所需反para F_ENCINV_H2I,以最終在新的SLHDR2PLUS編碼器中使用對應的para,以計算IDR影像輝度(在通用SLHDR2PLUS方法之此特定種類方法的第一較佳實施例中,亦即,使用第二峰亮度之輝度映射函數的導數計算;白色與黑色偏移在此種類中(至少在HDR<>IDR子範圍中)將被忽略,因為其等將可施加至不同的PB_C影像光譜的HDR<>SDR子範圍中,如圖7所示)。That is, the decoder defines for any grader's F_H2S choice how the F_E_I2S shape will look like (using the ETSI2 7.3 algorithm), but it needs to be interpreted as an ETSI1 abcara so that the abcara can be related to the corresponding required inverse para F_ENCINV_H2I to ultimately use the corresponding para in the new SLHDR2PLUS encoder to calculate the IDR image brightness (in the first best embodiment of this particular class of methods of the generic SLHDR2PLUS method, i.e., calculated using the derivative of the brightness mapping function of the second peak brightness; white and black offsets will be ignored in this class (at least in the HDR<>IDR sub-range) as they will be applied to the HDR<>SDR sub-range of different PB_C image spectra, as shown in Figure 7).
編碼器現在實務上係以其他順序運作(但遵守相同關係,以保持系統ETSI2順應)。通道適配器1801(從所接收之F50t1函數形狀)計算所需的para以將L_HDR明亮度變換成例如500尼特PB_C L_IDR明亮度(可僅藉由施加para而使用上述先前實施例的通道調適數學,但之後忽略WLO及BLO調適,亦即para僅在二個0至1.0明亮度表示之間運作而不涉及任何偏移)。反相器1802使用方程式24的反方程式計算對應的abcara(亦即,具有鑑於方程式右側上的已知SGC而計算之在左方上的1/X)。此係將自所接收之L_IDR明亮度重建L_HDR像素明亮度的映射。假設例如在編解碼器定義鏈上保持恆定的WP,反相器1802將因此計算abcara的陰影增益SG_abc及高亮度增益HG_abc。進行後設資料管理的下軌最終將需要計算F_L_IDR (=F_I2S),所以適配器1803藉由在反方向上施加ETSI2 7.3的演算法(若部分輝度重新分級已藉由使用F_ENCINV_H2I而對IDR影像明亮度完成,實現總變換F_H2S的剩餘變換F_I2S)而判定所需的映射函數F_I2S(尤其係其之SG_IDR及HG_IDR)。The encoder now operates in practice in the other order (but obeying the same relationships to keep the system ETSI2 compliant). Channel adapter 1801 calculates (from the received F50t1 function shape) the required para to convert L_HDR brightness to, for example, 500 nits PB_C L_IDR brightness (the channel adaptation mathematics of the previous embodiment described above can be used by just applying para, but then ignoring the WLO and BLO adaptations, i.e. para only operates between two 0 to 1.0 brightness representations without any offset involved). Inverter 1802 calculates the corresponding abcara using the inverse of equation 24 (i.e., with 1/X on the left calculated in view of the known SGC on the right side of the equation). This is a mapping that will reconstruct the L_HDR pixel brightness from the received L_IDR brightness. Assuming for example that WP is kept constant on the codec definition chain, the inverter 1802 will therefore calculate the shadow gain SG_abc and the high brightness gain HG_abc of abcara. The down-track for meta data management will eventually require the calculation of F_L_IDR (=F_I2S), so the adapter 1803 determines the required mapping function F_I2S (especially its SG_IDR and HG_IDR) by applying the algorithm of ETSI2 7.3 in the reverse direction (if partial brightness re-grading has been done for the IDR image brightness by using F_ENCINV_H2I, the residual transform F_I2S of the total transform F_H2S is achieved).
如上文已提及的,在某些情景中,可發生HG_IDR值下降至在可ETSI2順應編碼為HG_COD的事物上方。在此一情景中可完成的係將HG_IDR的該值限制至其最大值,並通過鏈返回至所意指的事物,特別係不同的原始分級者的F_H2S函數所將對應者。然後所有計算可從該情況重新開始,且係以虛線顯示之可選單元在一個連續處理線中所執行者。As already mentioned above, in certain scenarios it can happen that the HG_IDR value drops above what can be ETSI2 compliantly encoded as HG_COD. What can be done in this scenario is to limit the value of HG_IDR to its maximum value and chain back to the intended one, in particular to which the F_H2S function of a different raw classifier will correspond. All calculations can then be restarted from this situation and are performed in one continuous processing line with the optional units shown in dashed lines.
圖 22 解釋當輝度映射曲線重新成形時限制器1804所執行者。以點虛線顯示起始F_H2S,及可如何使用固定通道調適演算法自此推導F_ENCINV_H2I函數,及(原始)剩餘部分重新分級函數F_I2S_or可如何推導(假如在目前制定的ETSI2中沒有額外的特定限制,原始F_IDR呼叫現在解釋之更詳細描述的特定實施例方法)。鑑於HDR視訊編碼上的此一全新方法,此函數的HG_IDR_or可能不適配於HG_COD定義,亦即需要高於可在ETSI2順應HDR視訊編碼信號中傳達之其之8位元最大值255的值。因此,HG_IDR_or必須下降至至多經限制之仍可編碼值HG_IDR_LIM(其在ETSI2的目前實施例中係2.0,但此並非本方法的基本限制)。此建立其具有更有些接近上水平邊限(L_out_X=1.0)的高亮度線性區段的para,其對應於更亮一些的IDR影像,但沒有基本問題(如上文提及的,在系統中有一些放寬可能性以設計各種變化)。將意謂著HDR場景影像中的最高輝度區域得到較小的對比IDR表示(雖然原始的主HDR係可完全回復的,且該SDR模樣及所有MDR重新分級看起來亦將係良好的),但因為係從較高的PB_C HDR主影像分級而沒有實際問題,且此對應於,例如,3000至5000尼特範圍中的事物,其一般係燈及類似者,其可遭受一點劣化(因為一些劣化映射係無論如何總是必需的,且對此類超明亮區域係有點預期的)。然後第二通道適配器1805將再度施加所有上述數學,但現在係在有限HG_IDR的情況下(所以首先可計算等效F_H2S,如所述,在此類別實施例中,其可藉由將有限F_I2S_LIM外插至PB_D=PB_Mster_HDR情況而執行,然後可再度施加通道調適)。 FIG22 illustrates what the limiter 1804 performs when the brightness mapping curve is reshaped. The starting F_H2S is shown as a dotted line, and how the F_ENCINV_H2I function can be derived therefrom using a fixed channel adaptation algorithm, and how the (original) residual re-scaling function F_I2S_or can be derived (assuming no additional specific restrictions in the current formulation of ETSI2, the original F_IDR calls the specific embodiment method described in more detail now explained). In view of this completely new approach to HDR video coding, the HG_IDR_or of this function may not fit into the HG_COD definition, i.e. requires values higher than its 8-bit maximum value of 255 that can be conveyed in an ETSI2 compliant HDR video coded signal. Therefore, HG_IDR_or must be reduced to at most the limited still codable value HG_IDR_LIM (which is 2.0 in the current implementation of ETSI2, but this is not a fundamental limitation of the method). This creates a para with a high-luminance linear segment somewhat closer to the upper horizontal margin (L_out_X=1.0), which corresponds to a brighter IDR picture, but no fundamental problem (as mentioned above, there is some slack possibility in the system to design various variations). It will mean that the highest brightness areas in the HDR scene imagery get a smaller contrast IDR representation (although the original master HDR is fully recoverable, and the SDR look and all MDR re-grades will look good too), but because it's graded from the higher PB_C HDR master imagery there's no real problem, and this corresponds to, for example, things in the 3000 to 5000 nit range, which are typically lamps and the like, which may suffer a bit of degradation (since some degradation mapping is always necessary anyway, and is somewhat expected for such super bright areas). The second channel adapter 1805 will then apply all of the above mathematics again, but now in the case of finite HG_IDR (so first the equivalent F_H2S can be calculated, as described, in this class of embodiments it can be performed by extrapolating the finite F_I2S_LIM to the case PB_D = PB_Mster_HDR, and then channel adaptation can be applied again).
此所得的F_H2I_LIM(亦即,將L_HDR明亮度映射至L_IDR明亮度)現在可藉由影像像素輝度映射器1806施加,以逐像素地判定所有IDR明亮度(或事實上,亦使用ETSI2的色度處理,亦即,對應於F_H2I_LIM明亮度映射函數形狀的已定義C_LUT,所有IDR YCbCr顏色)。最後IDR後設資料判定器1807計算完整的後設資料組以用於實現ETSI2順應的基於後設資料的重新分級,以將PB_C影像(針對任何顯示器PB_D)降低至低於其所係的PB_IDR(或藉由高於PB_IDR的外插)。所以,SG_IDR、HG_IDR、及WP_IDR再次根據形成如上文解釋之實施例的可能組合的任何者判定。現在亦判定BLO_IDR及WLO_IDR(如上文所解釋的,可將在Mster_HDR明亮度軸上的特定明亮度映射至SDR明亮度軸上的1.0,且此可係重新制定成經合適按比例調整的IDR明亮度的映射,亦即,定義WLO_IDR,並類似地用於BLO_IDR)。This resulting F_H2I_LIM (i.e. mapping L_HDR luminances to L_IDR luminances) can now be applied by the image pixel brightness mapper 1806 to determine all IDR luminances (or in fact, all IDR YCbCr colors, also using the chrominance processing of ETSI2, i.e. the defined C_LUT corresponding to the shape of the F_H2I_LIM luminance mapping function) on a pixel-by-pixel basis. Finally the IDR metadata determiner 1807 calculates the complete metadata set for implementing an ETSI2 compliant metadata-based regrading to reduce the PB_C image (for any display PB_D) to below the PB_IDR to which it belongs (or by extrapolation above the PB_IDR). So, SG_IDR, HG_IDR, and WP_IDR are again determined according to any of the possible combinations forming the embodiments explained above. BLO_IDR and WLO_IDR are now also determined (as explained above, a specific brightness on the Master_HDR brightness axis can be mapped to 1.0 on the SDR brightness axis, and this can be reformulated as a mapping of appropriately scaled IDR brightness, i.e., WLO_IDR is defined, and similarly for BLO_IDR).
最後,可客製化曲線可藉由可客製化曲線最佳化器1808針對新的IDR後設資料情況最佳化(假如使用可客製化曲線,因為一些次市場編解碼器技術實施例變化(諸如,現實生活廣播)可能已選擇決不使用可客製化曲線,然後施加前者的para+偏移數學)。Finally, the customizable curve can be optimized for the new IDR metadata case by the customizable curve optimizer 1808 (if the customizable curve is used, since some sub-market codec implementation variations (e.g., real life broadcasting) may have chosen not to use the customizable curve at all, and then apply the former's para+offset math).
圖 19 說明客製化曲線的調適如何運作。其始終包含二個概念成分(無論僅在單一方向(或反向)上直接施加)。第一成分可藉由將注意力聚焦在物體上而瞭解:假設一下多線性區段可客製化曲線之控制點的一者對應於一條褲子(所以特定L_in_S經正規化明亮度xo1I係,例如,所有褲子像素的平均明亮度)。使用變換以根據人類分級者(或自動化軟體),例如,使該等褲子像素變亮(周圍或特別係控制點的一者)以輸出對該等褲子係更佳明亮度的經正規化明亮度。亦在圖4中看到在ETSI方法中,此發生為解碼器(單元405)中的最末(可選的)精細分級編碼步驟,及對應地解碼器中的第一步驟。所以實際上,此明亮度變換實際上定義在SDR明亮度域中(在para+偏移的粗略HDR至SDR明亮度映射之後(若有的話))。 Fig. 19 illustrates how the adaptation of a customized curve works. It always consists of two conceptual components (regardless of whether it is applied directly in a single direction (or inversely)). The first component can be understood by focusing on the object: suppose that one of the control points of the following polylinear segment customizable curve corresponds to a pair of pants (so a particular L_in_S normalized brightness xo1I is, for example, the average brightness of all pants pixels). A transformation is used to classify the pants according to a human (or automated software), for example, to brighten the pants pixels (either around or specifically at one of the control points) to output a normalized brightness that is better for the pants. Also see in Figure 4 that in the ETSI approach this happens as the last (optional) fine level encoding step in the decoder (unit 405), and correspondingly the first step in the decoder. So in practice this brightness transform is actually defined in the SDR brightness domain (after a coarse HDR to SDR brightness mapping of para+offset (if any)).
所以,可推論任何明亮度均需要變換(針對該物體!),其可乘法地寫為L_out=m(L_in_SDR)*L_in_SDR。Therefore, it follows that any brightness needs to be transformed (for that object!), which can be written multiplicatively as L_out=m(L_in_SDR)*L_in_SDR.
所需的乘法明亮度變化(按百分比)可在任何其他影像(例如,IDR影像)上不同,但應能夠依靠的一個事物係精細分級的校正對應於需要重新分級的特定「物體」(即使將可客製化曲線用於其之效益的另一者,除了特定物體精細分級外,例如,粗略分級輝度映射曲線之形狀的改善,仍可在實體上將其解讀為此一基於物體的改善,其藉由定義對應於一些明亮度子範圍的一組虛擬物體)。所以,若追蹤該等物體至另一DR明亮度範圍,經正規化橫座標值可改變,但非物體的核心本質(例如,機車上的人具有與在SDR(亦即,5/100)中不同的HDR中之經正規化明亮度(亦即,5/5000))。所以,必須重新計算用於新的經正規化明亮度位置分布的函數(此可針對中間部分重新分級輝度映射函數的任何量完成,甚至無論各種向上及向下部分軌多麼複雜,將想要設計HDR視訊編碼實施例)。所以,圖19a大致地顯示此:原始SDR物體明亮度(例如,可客製化曲線之線性區段的線性區段端點)xo1I移動至xo1N(此將藉由施加,例如,係圖20之F_I2S的反式的abcara而發生)。相同者發生至其他點,例如,五邊形區段點(一般可將其假設成有足夠良好擴散的區段點(例如,16),若分級者,例如,施加粗略線性經客製化重新分級至較暗明亮度的相對大子區域,可藉由分級軟體自動地設定其(例如,10))。所以,使所有此等點偏移,現在可藉由施加原始CC_gr偏移(亦即,L_out_SDR=CC_gr[L_in_S])從主內容後設資料分級者的原始CC_gr曲線(在SDR明亮度範圍上具有CC的F_H2S)定義中間曲線CC_XRM,其中L_in_S值係原始值xo1I等(但現在將L_out值施加至xo1N經重新映射IDR明亮度位置(產生虛曲線)。當然,此將不係適當的HDR至IDR(或更準確地,IDR至IDR)映射乘數,使得校正在步驟2中執行,如圖19b所繪示的。The required multiplicative brightness change (in percentage) may be different on any other image (e.g., an IDR image), but one thing that should be relied upon is that the correction of the fine grading corresponds to a specific "object" that needs to be re-graded (even if another benefit of using customizable curves for this, in addition to specific object fine grading, e.g., an improvement in the shape of the coarse-graded brightness mapping curve, can still be physically interpreted as this object-based improvement by defining a set of virtual objects corresponding to some brightness sub-range). So, if tracking such objects to another DR brightness range, the normalized x-coordinate values may change, but not the core nature of the object (e.g., a person on a motorcycle has a different normalized brightness in HDR (i.e., 5/5000) than in SDR (i.e., 5/100)). So, the function for the new normalized luminance position distribution must be recalculated (this can be done for any amount of re-scaling of the luminance mapping function in the middle, even no matter how complex the various upward and downward tracks one would want to design an HDR video encoding implementation). So, Figure 19a shows this roughly: the original SDR object luminance (e.g., the linear segment endpoint of the linear segment of the customizable curve) xo1I is moved to xo1N (this would happen by applying, for example, an abcara which is the inverse of F_I2S of Figure 20). The same happens to other points, e.g. the pentagonal segment points (which can generally be assumed to have a sufficiently good diffusion of segment points (e.g. 16), if the grader, for example, applies a roughly linear customized regrading to relatively large sub-areas of darker brightness, which can be set automatically by the grading software (e.g. 10)). So, having offset all these points, an intermediate curve CC_XRM can now be defined from the original CC_gr curve of the master content metadata grader (F_H2S with CC over the SDR luminance range) by applying the original CC_gr offset (i.e., L_out_SDR = CC_gr[L_in_S]), where the L_in_S values are the original values xo1I etc. (but now with the L_out values applied to the xo1N remapped IDR luminance positions (producing the imaginary curve). Of course, this will not be a proper HDR to IDR (or more accurately, IDR to IDR) mapping multiplier, so the correction is performed in step 2, as shown in Figure 19b.
如可再次在圖19b中看到的,可將乘法精細校正解譯為在用於在重新分級影像的頻譜中(與Mster_HDR)最極端不同之PB_C影像的無校正(按照定義,Mster_HDR像素明亮度已係正確的,因為此影像由內容創作者最佳地分級以自其開始)至完全校正之間改變的可按比例調整處理,其在申請人的方法中一般係100尼特SDR影像(其中用於特定像素的完全校正係,例如,mso1,其可寫為絕對偏移,但亦可寫為乘法校正yio1=mso1*xso1(任何明亮度映射曲線形狀yio1=F_L(xso1)可制定為明亮度相依乘法值的曲線)。As can be seen again in FIG. 19b , the multiplicative fine correction can be interpreted as a scalable process varying between no correction (by definition, the Mster_HDR pixel brightness is already correct since this image was optimally graded by the content creator to start with) to full correction for the PB_C image that differs most extremely in the spectrum of the regraded image (from Mster_HDR), which in the applicant's approach is typically a 100 nit SDR image (where the full correction for a particular pixel is, for example, mso1, which can be written as an absolute offset, but can also be written as a multiplicative correction yio1=mso1*xso1 (any brightness mapping curve shape yio1=F_L(xso1) can be formulated as a curve of brightness-dependent multiplicative values).
因為可將乘法校正觀點制定為自對角線的偏移,其中yio1=xso1,可引入垂直比例因子: ScaleVer= max [(1-La)/(1-K); 0] [方程式25] 使用如上文定義的La及K。Since the multiplicative correction viewpoint can be formulated as an offset from the diagonal, where yio1=xso1, a vertical scale factor can be introduced: ScaleVer= max [(1-La)/(1-K); 0] [Equation 25] Using La and K as defined above.
然後將可客製化曲線的所需經調適值發現為: yiDA=Min[(yio1-xso1)*ScaleVer+xio1;1] [方程式26] 且此係對xso1的所有值計算。The required adjusted value of the customizable curve is then found to be: yiDA=Min[(yio1-xso1)*ScaleVer+xio1;1] [Equation 26] and this is calculated for all values of xso1.
圖 27 給出以用於解碼器之技術上優雅的方式判定可客製化精細分級曲線之區段端點的另一方式。已描述可如何重新計算粗略分級para曲線的參數(且若存在黑色及/或白色偏移,但將藉由聚焦在para而簡化解釋)。假設para從任何起始動態範圍進行粗略分級到最終動態範圍(例如,LDR動態範圍)。黑色及白色偏移可將經正規化範圍差異(若任何者係必要的)列入考量,所以可客製化曲線僅關於沿著經正規化軸重新定位特定區域的相對輝度。因此,曲線將分別在(0,0)開始且在(1,1)結束,且在實例中具有一些區段連接器點在2個曲線形狀判定點(例如,(x1,y1))其間。線性區段及點的數目在任何表示中及其之重新分級中相等亦係合理的,因為區域的本質不變(最暗區域,例如室內顏色在例如200尼特PB_C影像中可在與1500尼特PB_C影像中不同的(一般在感知上均勻)經正規化明亮度結束,但存在二個區域(室內及室外)的事實在重新分級時不改變。 FIG. 27 gives another way to determine the segment endpoints of a customizable fine-gradation curve in a technically elegant manner for a decoder. It has been described how the parameters of the coarse-graded para curve can be recalculated (and if there are black and/or white offsets, but the explanation will be simplified by focusing on para). Assume that para is coarsely graded from any starting dynamic range to a final dynamic range (e.g., an LDR dynamic range). The black and white offsets can take into account the normalized range differences (if any are necessary), so the customizable curve is just about relocating the relative brightness of specific regions along the normalized axes. Therefore, the curve will start at (0,0) and end at (1,1), respectively, and in the example have some segment connector points between 2 curve shape decision points (e.g., (x1,y1)). It is also reasonable that the number of linear segments and points is equal in any representation and its regrading, because the nature of the regions does not change (the darkest region, e.g. indoor colors may end up at a different (generally perceptually uniform) normalized brightness in e.g. a 200 nit PB_C image than in a 1500 nit PB_C image), but the fact that there are two regions (indoor and outdoor) does not change in the regrading.
因此,對於多重線性重新分級函數形狀重新判定,僅需要找到對應端點(xnew, ynew)。Therefore, for the reshape determination of the multilinear regrading function, it is only necessary to find the corresponding endpoints (xnew, ynew).
可利用待符合(理想上)的另一性質,亦即不論使用總跨度函數FL_50t1(其在此情形中將由二個連續待施加函數組成:總para 2710及總多重線性函數2711)直接重新分級主HDR影像,或在二個步驟中進行重新分級,首先從5000尼特主影像至700尼特IDR(再度藉由使用二個函數:IDR產生para 2701及IDR產生多重線性函數2702),然後自其降級至100尼特LDR影像(使用通道para 2703及通道多重線性函數2704),結果必須相同:相同的LDR影像,因為其係始終針對主HDR影像產生的LDR影像,亦即內容創作者已編碼及傳達的該者(具有降級輝度映射函數形狀)。亦即,選擇所有可能的輸入HDR經正規化明亮度x1_MH的任何者,最終LDR輸出明亮度均應相同。因此,此針對恰映射(經由先前映射)至通道多重線性之x座標:x1_CH_L的輸入明亮度將亦為真。此一者可用以重新計算線段,因為在縱座標y上具有相等性,僅需要針對其他動態範圍上的對應多重線性經客製化曲線的特定區段計算x_new。Another property to be met (ideally) can be exploited, namely that regardless of whether the master HDR image is regraded directly using the total stride function FL_50t1 (which in this case will consist of two consecutive functions to be applied: total para 2710 and total multilinear function 2711), or is regraded in two steps, first from a 5000 nit master image to a 700 nit IDR (again by using two functions: IDR generation para 2701 and IDR generation multilinear function 2702), and then downgraded from it to a 100 nit LDR image (using channel para 2703 and channel multilinear function 2704), the result must be the same: the same LDR image, because it is the LDR image always generated for the master HDR image, i.e. the one that the content creator has encoded and delivered (with the downgraded luminance mapping function shape). That is, choosing any of all possible input HDR normalized luminances x1_MH, the final LDR output luminance should be the same. Therefore, this will also be true for input luminances that map (via previous mapping) to the x-coordinate of the channel multilinear: x1_CH_L. This can be used to recalculate the line segments, since there is equality on the longitudinal coordinate y, and only x_new needs to be calculated for specific segments of the corresponding multilinear customized curve over the other dynamic range.
所以在編碼側,可藉由施加經按比例調整標準化演算法針對任何x1_MH輸入計算經通道調適Y_CHA。此值Y_CHA將形成次一方塊的對應輸入x座標,其進入通道PB_C經判定para(上文給出之方程式)。yi_CH值已為已知,因為其等於用於5000尼特至100尼特之總重新分級的y1_L值,其在編碼側(相對於解碼側)當然係直接已知(例如由人類分級者產生)。針對多重線性函數的所有點進行此,獲得待寫入視訊信號(作為F_I2sCI的一部分)中的所有其特徵化參數。So on the encoding side, the channel-adapted Y_CHA can be calculated for any x1_MH input by applying the scaled normalization algorithm. This value Y_CHA will form the corresponding input x-coordinate of the next block, which enters channel PB_C after determination para (the equation given above). The yi_CH value is already known, since it is equal to the y1_L value for the total re-grading from 5000 nits to 100 nits, which is of course directly known on the encoding side (as opposed to the decoding side) (e.g. generated by a human grader). Doing this for all points of the multilinear function, we obtain all its characterization parameters to be written into the video signal (as part of F_I2sCI).
在解碼器側,可再度使用相同的原理以得到有些不同的演算法,因為現在必須計算一些未知參數。所以現在必須計算對應於已接收而因此已知之x1_CH_L值的x1_ML值,因為第一步驟係恢復(多個)總重新分級函數。一般有函數的數位精確度,例如256個經量化x值(亦即,不係特定的,例如,二或三個區段間的點,而係所有點,所以在其間之線上的點亦如此),所以可簡單地在其係客製化時針對可客製化曲線之所有點數值地建構LUT表,亦即已知曲線的y1_L,對應於x1_CH_L的所需x1_ML。On the decoder side, the same principle can be used again to obtain a somewhat different algorithm, since now some unknown parameters must be calculated. So now the x1_ML values corresponding to the received and therefore known x1_CH_L values must be calculated, since the first step is to recover the total re-grading function(s). There is generally a numerical accuracy of the function, for example 256 quantized x values (i.e. not specific, for example, points between two or three segments, but all points, so also points on the line in between), so it is simple to construct the LUT table for all the points of the customizable curve when it is customized, i.e. y1_L of the known curve, corresponding to the required x1_ML for x1_CH_L.
從LDR映射至IDR明亮度,對任何yi_CH得到x1_CH,且該值可經由para 2703反映射。若知道para 2701及多重線性2702,亦可判定所有可能的x1_MH值的何者映射至此Y_CHA值。從上文知道如何從解碼器側接收的函數後設資料計算para 2701,如上文所解釋的。(仍)不知道多重線性2702,但目前不需要。因為知道已客製化曲線2702亦遵循經標準化演算法的垂直按比例調整方程式。任何經測試的X1_MH可轉換成對應的X_CHA,且對應於其(及需要)的Y_CHA值遵循:Y_CHA=(y1_L - x1_ML)*scaleVer + X_CHA,且x1_ML可藉由施加總para 2710而自x1_MH計算。Mapping from LDR to IDR brightness, for any yi_CH we get x1_CH, and this value can be demapped via para 2703. If we know para 2701 and multilinearity 2702, we can also determine which of all possible x1_MH values maps to this Y_CHA value. From above we know how to calculate para 2701 from the function metadata received on the decoder side, as explained above. Multilinearity 2702 is not (still) known, but it is not needed for now. Because we know that the customized curve 2702 also follows the vertical scaling equation of the standardized algorithm. Any tested X1_MH can be converted to a corresponding X_CHA, and the corresponding (and required) Y_CHA value follows: Y_CHA=(y1_L - x1_ML)*scaleVer + X_CHA, and x1_ML can be calculated from x1_MH by applying total para 2710.
因此,將恰發現分別對應於x1_MH及x1_ML值的一者,其將恢復總多重線性函數2711。因為之後知道總重新分級及通道部分重新分級,亦可判定剩餘的重新分級(亦即,在5000尼特主影像及700尼特IDR之間),因此解碼所有事物,亦即判定函數,且所有IDR影像像素顏色的處理可開始,如使用圖26所解釋的。Thus, exactly one corresponding to the x1_MH and x1_ML values, respectively, will be found which will recover the total multilinear function 2711. Since the total reclassification and the channel partial reclassifications are then known, the remaining reclassifications can also be determined (i.e., between 5000 nits main image and 700 nits IDR), so decoding everything, i.e. determining the function, and processing of all IDR image pixel colors can begin, as explained using FIG.
為確保讀者更佳程度的瞭解,圖26再度以總結方式概念地說明,具有通常知識的讀者可已於上文的詳細解釋中發現的全部內容。上軌格子係有關後設資料重新計算,亦即,各種輝度映射函數判定的各種步驟(下單元2650等係執行實際像素顏色處理者)。現在良好地參見二步驟計算,分別對應於編碼器(但現在來自SLHDR2PLUS視訊解碼器側)中之HDR函數單元901中的經標準化演算法施加及IDR映射函數產生器903的函數判定。如所解釋者,解碼器得到函數F_I2sCI,該函數指定具有其之通道峰亮度PB_CH的已接收經選擇IDR影像及100尼特分級之間的輝度重新分級行為。但需要判定原始函數計算器2601將進行的較大跨度函數(在100尼特與例如PB_C_H50=6000尼特的主HDR峰亮度之間),亦即,FL_50t1函數(或更精準地說,在編碼側使用之該者的反成形函數)。但仍未到達,想要將IDR經正規化輝度(或更精準地說,在吾等的一般解碼拓撲中,感知上均勻的經正規化像素明亮度)解碼成主HDR經重建輝度。所以,原始接收的F_I2sCI,及函數FL_50t1皆無法判定PB_C_H50尼特主影像與100尼特(不係PB_CH尼特IDR影像及其他二者的任一者)之間的重新分級,所以需要判定施加至所接收的IDR像素明亮度的函數F_IDRt50,以獲得(感知上均勻的)經重建主HDR影像像素明亮度YpMstr,其係重建函數判定器2602所將進行的。已將顯示調適可能性顯示為虛線顯示最佳化函數計算單元2603,因為如所述,雖然其一般亦將在吾等的全功能SLHDR2PLUS解碼IC中實現,但其原理上針對SLHDR2PLUS解碼係可選的。通道峰亮度PB_CH將用以轉換經正常編碼(例如,10位元YCbCr)IDR像素輝度至感知均勻IDR像素明亮度YpIDR上,一般將在吾等的較佳SLHDR2PLUS IC中於其上進行吾等的重建輝度映射(雖然具有通常知識者瞭解可如何在不施加感知均勻化、或其他方法等的替代電路或軟體中具現本發明原理)。感知均勻化器2650以PB_C_H= PB_CH對其施加方程式1及方程式2。輝度上映射器2651藉由簡單地施加經判定函數,亦即YpMstr=F_IDRt50(YpIDR),而重建主HDR影像明亮度。假如需要顯示調適以建立例如350尼特PB_C影像,顯示最佳化器2652僅針對其施加經判定顯示最佳化函數,產生經顯示最佳化像素明亮度:Yglim= F_DO(YpMstr)。可藉由線性化器2653將該等轉換成實際經正規化像素輝度L,該線性化器施加反方程式1及2,但現在使用顯示最佳化的例如350尼特PB_C_DO而非PB_CH。最後,一般可選地有進一步明亮度碼產生器2654,其施加SMPTE 2084的感知量化器EOTF以普及的HDR10格式給予輸出明亮度YPQ。To ensure a better understanding of the reader, FIG. 26 again summarizes conceptually all that a reader with general knowledge can already find in the detailed explanation above. The upper track grid is about the meta-data recalculation, that is, the various steps of the determination of the various brightness mapping functions (the lower unit 2650 etc. is the one that performs the actual pixel color processing). Now we can see the two-step calculation, which corresponds to the standardized algorithm application in the HDR function unit 901 in the encoder (but now from the SLHDR2PLUS video decoder side) and the function determination of the IDR mapping function generator 903. As explained, the decoder gets a function F_I2sCI that specifies the luminance re-grading behavior between the received selected IDR image with its channel peak luminance PB_CH and a grading of 100 nits. But it needs to decide the larger span function (between 100 nits and the main HDR peak luminance of, for example, PB_C_H50=6000 nits) that the original function calculator 2601 will perform, namely the FL_50t1 function (or more precisely, the inverse shaping function of that used on the encoding side). But it is still not reached, and it is desired to decode the IDR normalized luminance (or more precisely, the perceptually uniform normalized pixel brightness in our general decoding topology) into the main HDR reconstructed luminance. Therefore, the originally received F_I2sCI, and function FL_50t1, cannot determine the re-gradation between the PB_C_H 50 nit main image and 100 nit (not the PB_CH nit IDR image or either of the other two), so it is necessary to determine the function F_IDRt50 applied to the received IDR pixel brightness to obtain the (perceptually uniform) reconstructed main HDR image pixel brightness YpMstr, which is what the reconstruction function determiner 2602 will do. The display adaptation possibility has been shown as a dashed display optimization function calculation unit 2603, because as mentioned, although it will generally also be implemented in our full-featured SLHDR2PLUS decoding IC, it is in principle optional for SLHDR2PLUS decoding. The channel peak brightness PB_CH will be used to convert the normally encoded (e.g., 10-bit YCbCr) IDR pixel brightness to the perceptually uniform IDR pixel brightness YpIDR, on which our reconstructed brightness mapping will generally be performed in our preferred SLHDR2PLUS IC (although one of ordinary skill will understand how the principles of the invention can be implemented in alternative circuits or software that do not apply perceptual equalization, or other methods, etc.). The perceptual equalizer 2650 applies Equations 1 and 2 to it with PB_C_H = PB_CH. The brightness upmapper 2651 reconstructs the main HDR image brightness by simply applying the determined function, i.e., YpMstr = F_IDRt50(YpIDR). If display adaptation is required to create e.g. a 350 nit PB_C image, the display optimizer 2652 simply applies the determined display optimization function to it, resulting in a display optimized pixel brightness: Yglim = F_DO(YpMstr). These can be converted to actual normalized pixel brightness L by a linearizer 2653, which applies the inverse equations 1 and 2, but now using the display optimized e.g. 350 nit PB_C_DO instead of PB_CH. Finally, there is typically optionally a further brightness code generator 2654, which applies the perceptual quantizer EOTF of SMPTE 2084 to give an output brightness YPQ in the popular HDR10 format.
雖然提出一些實施例/教示以說明可單獨或組合地變化的一些態樣,但可瞭解數種進一步變化可沿著相同的基本原理形成:符合ETSI2 HDR視訊通訊或類似者從接收之不同的中間動態範圍影像後設資料重新推導明亮度映射方程式,以重建已在內容創作地點處最佳地分級的主HDR影像。在本文中揭示的演算法成分可(全部或部分地)實務上實現為硬體(例如,特定應用IC的一部分),或實現為在特殊數位信號處理器或通用處理器上運行的軟體等。While some embodiments/teachings are presented to illustrate some aspects that may vary individually or in combination, it is understood that several further variations may be formed along the same basic principle: re-derive the brightness mapping equation from different intermediate dynamic range image metadata received in accordance with ETSI2 HDR video communication or similar to reconstruct the master HDR image that has been optimally graded at the content creation site. The algorithm components disclosed herein may (in whole or in part) be implemented in practice as hardware (e.g., as part of a specific application IC), or as software running on a special digital signal processor or a general purpose processor, etc.
所屬技術領域中具有通常知識者應可從吾等的呈現理解該等成分可係可選的改善,且可與其他成分組合而實現,且方法的(可選的)步驟如何對應於設備的各別構件,且反之亦然。本申請案中的用字「設備(apparatus)」係以其最廣義的意義使用,亦即,允許特定目的之實現的構件群組,且可因此可係例如IC(的小電路部件)、或專用器具(諸如,具有顯示器的器具)、或網路系統的一部分等。「配置(arrangement)」亦意圖以最廣義的意義使用,所以其可除了其他事物之外包含單一設備、設備的一部分、協作設備的(一部分的)集合等。A person of ordinary skill in the art should be able to understand from our presentation that these components may be optional improvements and may be implemented in combination with other components, and how the (optional) steps of the method correspond to individual components of the apparatus, and vice versa. The word "apparatus" in this application is used in its broadest sense, i.e. a group of components that allows the implementation of a specific purpose, and may thus be, for example, an IC (a small circuit component), or a dedicated appliance (e.g., an appliance with a display), or a part of a network system, etc. "Arrangement" is also intended to be used in the broadest sense, so it may include, among other things, a single device, a part of a device, a collection of (parts of) cooperating devices, etc.
應將電腦程式產品標誌理解為涵蓋實現通用或特殊目的處理器、在一系列載入步驟(其可包括中間轉換步驟,諸如,轉譯至中間語言,及最後的處理器語言)以將命令輸入至處理器中之後、以執行發明之特徵功能之任何者的命令集合的任何實體實現。尤其是電腦程式產品可實現為載體(諸如,例如磁碟或磁帶)上的資料、存在於記憶體中的資料、經由網路連接(有線或無線)的資料行進、或紙上的程式碼。除了程式碼外,程式所需要的特徵資料亦可具現為電腦程式產品。The computer program product designation should be understood to cover any physical implementation of a general or special purpose processor, a set of commands that, after a series of loading steps (which may include intermediate conversion steps, such as translation into an intermediate language and a final processor language) to input the commands into the processor, performs any of the characteristic functions of the invention. In particular, a computer program product may be implemented as data on a carrier (such as, for example, a disk or tape), data stored in a memory, data traveling via a network connection (wired or wireless), or program code on paper. In addition to program code, characteristic data required by the program may also be embodied as a computer program product.
該方法之操作所需的一些步驟可能已經存在於處理器的功能中,而非在電腦程式產品中描述,諸如,資料輸入與輸出步驟。Some steps required for the operation of the method may already exist within the functionality of the processor rather than being described in the computer program product, for example, data input and output steps.
應注意以上提及的實施例說明而非限制本發明。當具有通常知識者可輕易地實現所呈現實例至申請專利範圍之其他區域的映射時,吾等為了簡明而不深入提及所有此等選項。除了在申請專利範圍中組合之本發明之元件的組合外,元件的其他組合係可能的。元件的任何組合可實現在單一專用元件中。It should be noted that the embodiments mentioned above illustrate rather than limit the present invention. While a person of ordinary skill can easily implement a mapping of the presented embodiments to other areas of the claimed scope, we do not mention all such options in depth for the sake of brevity. In addition to the combination of elements of the present invention combined in the claimed scope, other combinations of elements are possible. Any combination of elements can be implemented in a single dedicated element.
在申請專利範圍中的圓括號之間的任何參考符號未意圖限制申請專利範圍。用字「包含(comprising)」不排除未列於申請專利範圍中之元件或態樣的存在。元件前的用字「一(a/an)」不排除複數個此類元件的存在。 [所使用的術語及縮寫]:Any reference signs between parentheses in the claims are not intended to limit the claims. The word "comprising" does not exclude the presence of elements or aspects not listed in the claims. The word "a" or "an" before an element does not exclude the presence of a plurality of such elements. [Terms and abbreviations used]:
PB_C :影像的最大可編碼輝度,通常針對任何情況指示,C表示編碼(不與位元深度混淆),例如,HDR影像可具有PB_C_HDR = 4000尼特(其亦定義以下的所有相對輝度,因為L_norm = L/PB_C,其中L_norm(經正規化輝度)位於0.0與1.0之間 PB_C : Maximum encodable brightness of the image, usually indicated for any case, C stands for encoding (not to be confused with bit depth), e.g. an HDR image may have PB_C_HDR = 4000 nits (which also defines all relative brightness below, since L_norm = L/PB_C, where L_norm (normalized brightness) lies between 0.0 and 1.0
PB_D :任何顯示器的最大可顯示輝度(亦稱為,峰亮度),例如,目前的HDR顯示器一般具有1000尼特的PB_D(但降至600尼特或至多2000尼特且甚至4000尼特的值亦係目前可購買的,且在未來可能有更高的PB_D)。 PB_D : The maximum displayable brightness of any display (also called, peak brightness), e.g., current HDR displays typically have a PB_D of 1000 nits (but values down to 600 nits or up to 2000 nits and even 4000 nits are currently commercially available, and higher PB_Ds are possible in the future).
IDR (中間動態範圍):將原本以PB_C1(例如,10,000尼特)定義的影像(亦即,主影像)實際表示為具有PB_C2 < PB_C1(例如,一般係因子2或更低,且PB_C2一般>=500尼特)之次級HDR影像的機制。 IDR (Intermediate Dynamic Range): A mechanism by which an image originally defined with PB_C1 (e.g., 10,000 nits) (i.e., primary image) is actually represented as a secondary HDR image with PB_C2 < PB_C1 (e.g., typically a factor of 2 or less, and PB_C2 is typically >= 500 nits).
MDR (中等動態範圍;一定不要與IDR混淆):具有PB_C_MDR的影像一般位於所接收之HDR影像的PB_C (PB_C_H)與PB_C_SDR=100尼特(藉由視訊領域中的約定定義)之間,將該PB_C_MDR值設定成等於任何顯示器的PB_D(以此方式,具有錯誤的動態範圍,且因而更重要地具有相對於彼此的經正規化明亮度的錯誤相對統計分布的傳入HDR影像可針對較低動態範圍能力的特定可用顯示器最佳地重新分級,亦即PB_D< PB_C_H) MDR (Medium Dynamic Range; must not be confused with IDR): images with PB_C_MDR are generally between the PB_C (PB_C_H) of the received HDR image and PB_C_SDR=100 nits (by convention in the video field), setting this PB_C_MDR value equal to the PB_D of any display (in this way, incoming HDR images with incorrect dynamic range, and thus more importantly with incorrect relative statistical distribution of normalized luminance with respect to each other, can be optimally regraded for a particular available display of lower dynamic range capability, i.e. PB_D < PB_C_H)
Para :特定在程式上高度有用的函數,以將在對應於PB_C1之第一經正規化明亮度範圍上定義的明亮度映射成由PB_C2正規化,且該函數在上文係由方程式4及方程式5以及拋物線在其間的區段定義,或正式地以ETSI TS 103 433-1 V1.2.1 (2017-08) [簡而言之ETSI1] p. 70方程式C-20定義。 Para : A specific highly programmatically useful function to map brightness defined over a first normalized brightness range corresponding to PB_C1 to normalized by PB_C2, and which is defined above by Equations 4 and 5 and the segment of the parabola therebetween, or formally by ETSI TS 103 433-1 V1.2.1 (2017-08) [ETSI1 in Brief] p. 70 Equation C-20.
Abcara :任何para的反函數(亦即,具有獨特地定義其形狀的參數),該相反形狀亦可藉由交換軸而直觀地發現(但有時需要數學地計算)。 Abcara : For any para function (i.e., one with parameters that uniquely define its shape), the inverse shape can also be found intuitively (but sometimes needs to be calculated mathematically) by swapping the axes.
WLO (white level offset,白位準偏移):在被映射至第二經正規化明亮度範圍上的1.0之第一影像(im1)之經正規化明亮度範圍中的經正規化明亮度,藉此PB_C_im1>PB_C_im2。在此施加中,有沿著編碼程序定義用於不同PB_C之各種影像的數個不同的WLO,彼等給定後標,像是例如WLO_gr,因此可輕易區別其等。 WLO (white level offset): The normalized luminance in the normalized luminance range of the first image (im1) mapped to 1.0 on the second normalized luminance range, such that PB_C_im1>PB_C_im2. In this application, there are several different WLOs defined along the encoding process for various images with different PB_Cs, which are given suffixes such as WLO_gr so they can be easily distinguished.
BLO (black level offset,黑位準偏移):在被映射至第二經正規化明亮度範圍上的0.0之第一影像之經正規化明亮度範圍中的經正規化明亮度,藉此PB_C_im1>PB_C_im2。在此施加中,有沿著編碼處理定義用於不同PB_C之各種影像的數個不同的BLO,彼等給定後標,像是例如BLO_IDR,因此可輕易區別其等。 BLO (black level offset): the normalized luminance in the normalized luminance range of the first image mapped to 0.0 on the second normalized luminance range, such that PB_C_im1>PB_C_im2. In this application, there are several different BLOs defined for various images with different PB_Cs along the encoding process, which are given suffixes such as BLO_IDR so they can be easily distinguished.
P_LUT :將第一影像的任何可能經正規化明亮度轉換成第二影像的對應經正規化明亮度所需的總映射(包含吾等編解碼器方法中的部分重新分級,如圖4所解釋的),藉此PB_C_im1 != PB_C_im2(一般至少在乘法因子1.02不同)。因為P_LUT[L](其一般係影像內容相依的,且例如藉由智慧影像分析自動化或人類而最佳化)改變經正規化明亮度的相對分布,亦即直方圖,其係動態範圍變換的關鍵態樣,例如涉及在其係本新穎HDR編解碼器原則中之關鍵的IDR影像定義中的該者 P_LUT : The total mapping (including partial rescaling in our codec approach as explained in Figure 4) needed to transform any possible normalized luminance of the first image into the corresponding normalized luminance of the second image, whereby PB_C_im1 != PB_C_im2 (typically differing by at least a multiplication factor of 1.02). Because the P_LUT[L] (which is typically image content dependent and optimized e.g. automatically by intelligent image analysis or by humans) changes the relative distribution of normalized luminances, i.e. the histogram, which is a key aspect of dynamic range conversion, e.g. involved in the definition of IDR images which is key in the present novel HDR codec principles.
C_LUT :像素顏色之色度的像素明亮度相依映射,連同P_LUT完成顏色變換(YCbCr_out=T[Y_cbCr_in]) C_LUT : Pixel brightness dependent mapping of the chromaticity of the pixel color, together with P_LUT to complete the color transformation (YCbCr_out=T[Y_cbCr_in])
201:攝影機 202:OOTF映射 203:分級單元 204:影像通訊連接 205:顯示器 206:使用者介面控制單元 301:影像源 302:顏色變換器 303:視訊壓縮器 304:格式器 305:通訊媒體 306:未格式器 307:視訊解壓縮器 308:顏色變換器 309:顯示調適單元 310:顯示器 320:視訊接收器 321:廣播台側編碼器 401:EOTF施加單元/輝度處理器/輝度處理軌/明亮度處理軌 402:單元/輝度映射 403:黑色-白色位準偏移器/單元/輝度映射 404:粗略動態範圍變換器/單元 405:曲線施加器/單元 406:線性化器/單元 450:色度處理軌 451:飽和處理判定器/單元 452:乘法器 453:矩陣施加單元 454:第一乘法器 455:乘法器 601:陰影人 602:機器人 900:編碼器 901:HDR函數產生單元/單元 902:IDR影像計算單元 903:IDR映射函數產生器 920:影像輸入 921:第二後設資料輸入 922:第一後設資料輸入 923:第三後設資料輸入 930:影像輸出 931:第二後設資料輸出 932:第一後設資料輸出 933:第三後設資料輸出 1100:高動態範圍視訊解碼器/解碼器 1102:顏色變換器 1104:輝度函數判定單元 1110:影像輸入 1111:第一後設資料輸入 1112:第二後設資料輸入 1113:第三後設資料輸入 1120:影像輸出 1121:後設資料輸出 1122:影像輸出 1501:輝度映射器/明亮度處理軌 1502:感知化器/解碼器/單元 1503:精細分級單元/單元 1504:粗略輝度映射單元/單元 1505:反黑色及白色偏移器/單元 1506:線性化器/單元 1550:色度處理器 1551:色度處理判定單元/單元 1601:線性化器 1602:感知化器 1603:輝度映射單元 1604:HDR至MDR輝度映射器 1801:通道適配器 1802:反相器 1803:適配器 1804:限制器 1805:第二通道適配器 1806:影像像素輝度映射器 1807:IDR後設資料判定器 1808:可客製化曲線最佳化器 2301:內容創作側 2302:攝影機 2303:分級設備 2304:分級者 2305:視訊編碼器 2306:衛星天線 2340:通訊衛星 2351:本地衛星接收碟 2352:最終處理設備 2353:HDR顯示器 2370:視訊編碼器 2381:解碼器 2382:顯示最佳化器 2501:經SLHDR2PLUS編碼HDR影像信號 2502:像素顏色矩陣 2510:SLHDR2解碼器 2520:SLHDR2PLUS解碼器 2601:原始函數計算器 2602:重建函數判定器 2603:顯示最佳化函數計算單元 2650:感知均勻化器/單元 2651:輝度上映射器 2652:顯示最佳化器 2653:線性化器 2654:明亮度碼產生器 2701:para 2702:IDR產生多重線性函數/多重線性/曲線 2703:para 2704:通道多重線性函數 2710:總para 2711:總多重線性函數 BE:主HDR影像輝度範圍 Be2:IDR輝度範圍 BLO_gr:黑色位準偏移 BLO_IDR:參數 BLO_REC:偏移 BN:經正規化輝度 Bs:顏色分量 Cb:色度 Cb*:輸出色度 CC_XRM:中間曲線 ColHDR:明亮度 ColSDR:明亮度 Cr:色度 Cr*:輸出色度 DR_1:LDR輝度動態範圍/第一動態範圍 DR_2:HDR輝度範圍/第二動態範圍 DRKSPST:黑暗太空站 dX:橫座標 EXP:曝光 F_??:函數 F_C:粗略輝度擠壓函數/粗略映射函數 F_ct:顏色變換函數/顏色映射函數 F_E_I2S:輝度映射函數 F_ENCINV_H2I:解碼明亮度映射函數 F_H2h:輝度映射函數/向上分級函數 F_H2hCI:經調適明亮度映射函數/輝度映射函數 F_H2hCI_1:輸出 F_H2hCI_2:輸出 F_H2hCI_3:輸出 F_H2I_LIM:明亮度映射函數形狀 F_H2S:總輝度映射函數/起始輝度映射函數 F_I2s:函數 F_I2S:映射函數 F_I2S_or:剩餘部分重分級函數 F_I2sCI:通道明亮度映射函數/明亮度映射函數/輝度映射函數 F_IDRt50:函數 F_L_CU:精細分級輝度映射函數/函數/曲線 F_L_DA:顯示調適函數/輝度映射函數 F_L_REC:MsterHDR重建輝度映射函數 F_L_RHDR:重建para輝度映射函數形狀 F_L_subsq:明亮度映射函數 F_Mt1:函數/總輝度重新映射函數 F_Mt1_ca:中間函數 F50t1:函數形狀 Fd:距離 FL_50t1:主明亮度映射函數/總跨度函數/輝度降級函數 FL_ca:曲線 FL_gr:曲線 FL_IDR:曲線 Fx:函數 GN:經正規化輝度 Gs:顏色分量 HG:斜率 HG_abc:高亮度增益 HG_ca:高亮度增益 HG_gr:第二斜率/原始高亮度增益 HG_IDR:參數 HG_IDR_LIM:可編碼值 HG_REC:重建高亮度增益 IDR:中間動態範圍 Im1250:經顯示調適影像 Im5000:輸出影像 Im_COD:經壓縮SDR影像 Im_LDR:輸出影像/低動態範圍影像/標準態範圍影像 Im_RHDR:經重建HDR影像/原始主影像 Im_RLDR:SDR影像 ImDA:PB_C影像 ImSCN1:晴朗室外影像 ImSCN2:夜間場景 ImSCN3:場景影像 K:函數的完整動作 L_HDR:輝度 L_IDR:明亮度 L_in:輸入/參數/經正規化輝度 L_in_X:經正規化輸入輝度 L_Mster_HDR:明亮度 L_REC_M_HDR:明亮度 L_RHDR:經正規化經重建明亮度 L_SDR:輝度 Lh:輸出輝度 LIN_HDR:線性輝度影像 Ln_Mster_HDR:輸入明亮度 Ln_XDR:明亮度 Ls:SDR明亮度/輝度 MAST_HDR:主HDR影像 MB:最小黑色位準/下端點 MDR:中等動態範圍 MDR_300:中等動態範圍影像 mip:中點 mso1:絕對偏移 Mster_HDR:影像 Mster HDR:影像 MsterHDR:主HDR視訊/輸入高動態範圍影像/主高動態範圍影像 mx:橫座標 n:距離 OOTF:光-光學方法 P_IoH:明亮度 P_SoH:明亮度 PB_C:影像編碼峰亮度/最大可編碼輝度 PB_C_H:峰亮度 PB_C_H50:第一最大像素輝度/第二期望峰亮度/內容峰亮度 PB_CH:通道峰亮度/第二最大像素輝度 PB_MDR:最大像素輝度 PQ:感知量化器 REC_M_HDR:影像/經重建主HDR影像 RecHDR:HDR影像 RN:經正規化輝度 Rs:顏色分量 RS:SDR輝度子範圍 sc*:經標準化比例因子 SDR:標準動態範圍 SG:陰影增益 SG_abc:陰影增益 SG_ca:陰影增益 SG_gr:第一斜率 SG_IDR:IDR影像/參數 SG_REC:陰影增益 SG_RM:曲線/總陰影增益 WLO_ca:參數 WLO_gr:白色位準偏移 WLO_IDR:參數 WLO_REC:偏移 WP_IDR:參數 xo1I:明亮度 Y:明亮度 Y’CL:輸出明亮度 Y’GL:經分級LDR明亮度 Y’HC:初始HDR明亮度 Y’HP:灰色明亮度 Y’HPR:經重建Mster_HDR明亮度 Y’HPS:經按比例調整HDR明亮度/經正規化HDR輸入輝度 Y’IP:IDR明亮度 Y’IPG:重新分級IDR明亮度 Y_IDR:明亮度值 Yglim:經顯示最佳化像素明亮度 yio1:明亮度映射曲線形狀 YpIDR:感知均勻IDR像素明亮度 YpMstr:經重建主HDR影像像素明亮度 YPQ:輸出明亮度201: Camera 202: OOTF Mapping 203: Classification Unit 204: Image Communication Connection 205: Display 206: User Interface Control Unit 301: Image Source 302: Color Converter 303: Video Compressor 304: Formatter 305: Communication Media 306: Unformatter 307: Video Decompressor 308: Color Converter 309: Display Adapter 310: Display 320: Video Receiver 321: Broadcast station side encoder 401: EOTF application unit/brightness processor/brightness processing track/brightness processing track 402: unit/brightness mapping 403: black-white level shifter/unit/brightness mapping 404: coarse dynamic range converter/unit 405: curve applicator/unit 406: linearizer/unit 450: chroma processing track 451: saturation processing determiner/unit 452: multiplier 453: matrix applicator Adding unit 454: First multiplier 455: Multiplier 601: Shadow man 602: Robot 900: Encoder 901: HDR function generation unit/unit 902: IDR image calculation unit 903: IDR mapping function generator 920: Image input 921: Second meta-data input 922: First meta-data input 923: Third meta-data input 930: Image output 931: Second meta-data output 9 32: First meta-data output 933: Third meta-data output 1100: High dynamic range video decoder/decoder 1102: Color converter 1104: Brightness function determination unit 1110: Image input 1111: First meta-data input 1112: Second meta-data input 1113: Third meta-data input 1120: Image output 1121: Meta-data output 1122: Image output 1501: Brightness mapper / Luminance Processing Track 1502: Sensor/Decoder/Unit 1503: Fine Grading Unit/Unit 1504: Coarse Brightness Mapping Unit/Unit 1505: Anti-Black and White Shifter/Unit 1506: Linearizer/Unit 1550: Chroma Processor 1551: Chroma Processing Decision Unit/Unit 1601: Linearizer 1602: Sensor 1603: Brightness Mapping Unit 1604: HDR to MDR Brightness Mapping Transmitter 1801: Channel adapter 1802: Inverter 1803: Adapter 1804: Limiter 1805: Second channel adapter 1806: Image pixel brightness mapper 1807: IDR metadata determiner 1808: Customizable curve optimizer 2301: Content creation side 2302: Camera 2303: Grading equipment 2304: Grader 2305: Video encoder 2306: Satellite antenna 23 40: Communication satellite 2351: Local satellite receiving dish 2352: Final processing equipment 2353: HDR display 2370: Video encoder 2381: Decoder 2382: Display optimizer 2501: HDR image signal encoded by SLHDR2PLUS 2502: Pixel color matrix 2510: SLHDR2 decoder 2520: SLHDR2PLUS decoder 2601: Raw function calculator 2 602: Reconstruction function determiner 2603: Display optimization function calculation unit 2650: Perceptual equalizer/unit 2651: Brightness mapper 2652: Display optimizer 2653: Linearizer 2654: Brightness code generator 2701: para 2702: IDR generates multilinear function/multilinear/curve 2703: para 2704: Channel multilinear function 2710: Total para 271 1: Total multilinear function BE: Main HDR image luminance range Be2: IDR luminance range BLO_gr: Black level offset BLO_IDR: Parameter BLO_REC: Offset BN: Normalized luminance Bs: Color component Cb: Chroma Cb*: Output chroma CC_XRM: Midrange curve ColHDR: Brightness ColSDR: Brightness Cr: Chroma Cr*: Output chroma DR_1: LDR luminance =DYNAMIC RANGE/1ST DYNAMIC RANGE DR_2:HDR BRIGHTNESS RANGE/2ND DYNAMIC RANGE DRKSPST:Dark Space Station dX:Horizontal Axis EXP:Exposure F_??:Function F_C:Coarse BRIGHTNESS CRUSHING FUNCTION/Coarse Mapping Function F_ct:Color Conversion Function/Color Mapping Function F_E_I2S:Brightness Mapping Function F_ENCINV_H2I:Decoded Brightness Mapping Function F_H2h:Brightness Mapping Function/Upward Grading Function F_H2hCI: Adapted brightness mapping function/brightness mapping function F_H2hCI_1: Output F_H2hCI_2: Output F_H2hCI_3: Output F_H2I_LIM: Brightness mapping function shape F_H2S: Total brightness mapping function/Starting brightness mapping function F_I2s: Function F_I2S: Mapping function F_I2S_or: Residual re-grading function F_I2sCI: Channel brightness Mapping function/brightness mapping function/luminance mapping function F_IDRt50: function F_L_CU: fine-graded luminance mapping function/function/curve F_L_DA: display adaptation function/luminance mapping function F_L_REC: MasterHDR reconstruction luminance mapping function F_L_RHDR: reconstruction para luminance mapping function shape F_L_subsq: luminance mapping function F_Mt1: function/total luminance remapping function F _Mt1_ca: Intermediate function F50t1: Function shape Fd: Distance FL_50t1: Main brightness mapping function/Total span function/Brightness degradation function FL_ca: Curve FL_gr: Curve FL_IDR: Curve Fx: Function GN: Normalized brightness Gs: Color component HG: Slope HG_abc: High brightness gain HG_ca: High brightness gain HG_gr: Second slope/Original high brightness gain H G_IDR: parameter HG_IDR_LIM: encodable value HG_REC: reconstructed high brightness gain IDR: intermediate dynamic range Im1250: display-adapted image Im5000: output image Im_COD: compressed SDR image Im_LDR: output image/low dynamic range image/standard range image Im_RHDR: reconstructed HDR image/original main image Im_RLDR: SDR image ImDA :PB_C image ImSCN1: Clear outdoor image ImSCN2: Night scene ImSCN3: Scene image K: Full action of the function L_HDR: Luminance L_IDR: Brightness L_in: Input/parameter/Normalized luminance L_in_X: Normalized input luminance L_Mster_HDR: Brightness L_REC_M_HDR: Brightness L_RHDR: Normalized reconstructed luminance L_SD R: Luminance Lh: Output Luminance LIN_HDR: Linear Luminance Image Ln_Mster_HDR: Input Luminance Ln_XDR: Luminance Ls: SDR Luminance/Luminance MAST_HDR: Master HDR Image MB: Minimum Black Level/Lower Endpoint MDR: Medium Dynamic Range MDR_300: Medium Dynamic Range Image mip: Midpoint mso1: Absolute Offset Mster_HDR: Image Mster HDR: Image MsterHDR: Master HDR video/Input high dynamic range image/Master high dynamic range image mx: Horizontal coordinate n: Distance OOTF: Optical-Optical Method P_IoH: Brightness P_SoH: Brightness PB_C: Image coding peak brightness/Maximum encodable brightness PB_C_H: Peak brightness PB_C_H50: First maximum pixel brightness/Second expected peak brightness/Content peak brightness PB_CH: Channel peak brightness/Second maximum Pixel brightness PB_MDR: Maximum pixel brightness PQ: Perceptual quantizer REC_M_HDR: Image/Reconstructed master HDR image RecHDR: HDR image RN: Normalized brightness Rs: Color component RS: SDR brightness subrange sc*: Normalized scale factor SDR: Standard dynamic range SG: Shadow gain SG_abc: Shadow gain SG_ca: Shadow gain SG_gr: First slope SG_IDR: IDR image/parameters SG_REC:Shadow gain SG_RM:Curve/total shadow gain WLO_ca:parameters WLO_gr:White level offset WLO_IDR:parameters WLO_REC:Offset WP_IDR:parameters xo1I:Brightness Y:Brightness Y’CL:Output brightness Y’GL:Graded LDR brightness Y’HC:Original HDR brightness Y’HP:Gray brightness Y’HPR:Reconstructed Master_HDR brightness Y’HPS: scaled HDR brightness/normalized HDR input brightness Y’IP: IDR brightness Y’IPG: regraded IDR brightness Y_IDR: brightness value Yglim: display optimized pixel brightness yio1: brightness mapping curve shape YpIDR: perceptually uniform IDR pixel brightness YpMstr: reconstructed master HDR image pixel brightness YPQ: output brightness
根據本發明之方法及設備的此等及其他態樣將參考下文描述的實施方案及實施例並參考附圖說明而變得顯而易見,該等附圖僅作為例示更一般性之概念的非限制性具體圖示,且其中虛線用以指示組件係可選的,非虛線組件不必然係必要的。虛線亦可用於指示被解釋為必要的,但隱藏在物體的內部中,或用於無形事物,諸如,例如物體/區域的選擇(及它們可如何顯示在顯示器上)的元件。These and other aspects of the method and apparatus according to the present invention will become apparent with reference to the embodiments and examples described below and with reference to the accompanying drawings, which are intended only as non-limiting, specific illustrations of the more general concepts, and in which dashed lines are used to indicate that components are optional, and non-dashed line components are not necessarily required. Dashed lines may also be used to indicate elements that are interpreted as being required, but are hidden in the interior of an object, or used for intangible things, such as, for example, the selection of objects/areas (and how they can be displayed on a display).
在圖式中: [圖1]示意地繪示在將高動態範圍影像最佳地映射至對應的經最佳地顏色分級及類似模樣(鑑於各別第一動態範圍DR_1與第二動態範圍DR_2中的差異,如所期望及可行者般類似)的較低動態範圍影像(例如,100尼特峰亮度的標準動態範圍影像)時發生的許多一般顏色變換,其在可逆性(模式2)的情形中亦將對應於由接收器(解碼器)接收之SDR影像的映射,該SDR影像實際編碼HDR場景; [圖2]顯示用於HDR影像之擷取系統可看起來似者; [圖3]說明將HDR影像傳達為特定(不同)峰亮度的一些影像的可能方式及在後設資料中共同傳達之一般定義為明亮度映射函數(例如,定義為可傳統使用的SDR影像Im_LDR)的輝度映射函數,例如根據申請人的較佳方法,以高概念程度開始針對本技術之新讀者的說明; [圖4]顯示根據申請人之在ETSI2中標準化的特定方法編碼HDR影像的各種進一步細節,其等態樣係瞭解下文所撰寫之新穎SLHDR2PLUS編解碼器方法的各種教示的各種細節之所需; [圖5]說明顏色如何變換且特別係重新分級運作在YCbCr色域中的明亮度變換; [圖6]藉由解釋有用的應用以一些更多細節說明吾等的可客製化曲線的概念; [圖7]解釋在原始主HDR影像的中間動態範圍(IDR)編碼及通訊上的基本觀點,以及中等動態範圍影像(MDR)的不待混淆的概念,其一般從任何已接收影像計算以針對在可用之顯示峰亮度PB_D的任何特定顯示器上的顯示來將其最佳化,例如由企圖觀看所接收之HDR視訊的任何終端消費者購買的特定HDR tv; [圖8]進一步說明如何開始處理IDR問題,特別係藉由解碼器將其以特定可自動計算方式解決,且尤其係若可能接收內容的至少一些解碼器係已在市場上的ETSI2順應解碼器,且可能不可輕易地以新的SLHDR2PLUS技術升級(例如,因為tv或STB的擁有者不將其升級); [圖9]顯示本申請案之新穎SLHDR2PLUS編碼器一般需要之技術部件的通用構造; [圖10]說明在藉由編碼器連續推導各種對應明亮度映射函數時所涉及的一些基本技術態樣,特別使用para明亮度映射函數的實例說明; [圖11]顯示新穎的SLHDR2PLUS解碼器的可能典型高階構造,其遵循下文描述之各種可能性的一些實施例教示,以實現SLHDR2PLUS視訊通訊; [圖12]進一步解釋當根據內容創作者的觀點由他在他之定義他的視訊內容的重新分級需求的主明亮度映射函數中選擇時,黑色及白色偏移的一些態樣; [圖13]描述根據對角線按比例調整原理用於推導para之通道調適版本的固定演算法之較佳方法的技術原理; [圖14]說明para的反曲線(所謂的abcara)的一些態樣; [圖15]詳細說明SLHDR2PLUS解碼器的一些態樣; [圖16]說明實作顯示調適以計算用於與本申請案之新技術的SLHDR2PLUS編碼原理整合之任何特定PB_D顯示器的MDR影像的有用方式; [圖17]說明黑色與白色偏移(BLO & WLO)的通道調適以伴隨及簡化以下數學,並給予通用的物理技術基本原理的一些進一步態樣; [圖18]顯示SLHDR2PLUS編碼的另一實施例(或實際以一個圖式說明之各種實施例組合的數個教示),其係特別有用的,因為經編碼影像可由標準的已部署的ETSI2解碼器直接解碼; [圖19]說明如何針對各種相依峰亮度影像表示(無論作為輸入或輸出影像)判定原始主客製化曲線的對應版本,例如具有客製化曲線的共同規範的IDR影像,以依例如電影創作者之分級者的需要將粗略映射IDR明亮度精細調諧至精確的最終SDR明亮度; [圖20]在可重新分級影像的頻譜上顯示圖18之方法的基本技術原理; [圖21]說明超越起始影像之最高值(對應於一致變換或圖表中的對角線)的明亮度映射函數(例如,para)之調適的外插法(extrapolation),及亦以其說明在特定選擇的para函數形狀與對應的abcara之間的關係; [圖22]示意地說明para的高亮度增益的限制的具體實施例如何在輸入經正規化明亮度與輸出經正規化明亮度之間的輝度重新分級上運作; [圖23]示意地繪示總HDR通訊及處置鏈,以清楚地指出視訊編碼與顯示最佳化之間的差異(前者係關於影像本身的真正定義,而後者僅關於針對在特定動態範圍能力的任何特定顯示器上的最佳化,寬鬆地說可客製化,較佳地將各種可能相當不同且相當有挑戰性的HDR場景影像的重新分級需求列入考量); [圖24]進一步說明經標準化函數改變演算法將如何運作,其將採用由,例如,內容創作者的分級者產生的任何輸入函數形狀,以指定第一峰亮度動態範圍的經正規化輝度應如何映射至第二峰亮度影像表示的經正規化輝度,及如何可從此輸入函數針對任何給定輸入函數獨特地計算在不同的特別低的第三峰亮度表示及該第二者之間映射的函數; [圖25]示意地繪示目前已教示之SLHDR2PLUS視訊編碼機制及信號如何不錯地與傳統SLHDR2解碼器相容,但仍為已部署的SLHDR2PLUS解碼器提供額外可能性; [圖26]示意地在方塊中總結所有以下的解碼數學,以針對進行所有需要的解碼輝度處理恢復所有需要的輝度映射函數,以從已接收IDR影像獲得所需影像;及 [圖27]示意地說明不錯的實用方法,以數值地具體地獲得可客製化曲線的客製化版本(若存在)。In the diagram: [Figure 1] schematically illustrates many of the common color transformations that occur when optimally mapping a high dynamic range image to a corresponding optimally color graded and similar looking (as similar as desired and feasible given the differences in the respective first dynamic range DR_1 and second dynamic range DR_2) lower dynamic range image (e.g., a standard dynamic range image of 100 nits peak brightness), which in the case of reversibility (Mode 2) will also correspond to the color transformations received by the receiver (decoder) [Figure 2] shows what a capture system for HDR images may look like; [Figure 3] illustrates possible ways of conveying HDR images as some images of specific (different) peak brightness and a brightness mapping function generally defined as a brightness mapping function (e.g., defined as a conventionally usable SDR image Im_LDR) commonly conveyed in the metadata, such as according to the applicant's preferred method, with high concept [Figure 4] shows various further details of encoding HDR images according to the applicant's specific method standardized in ETSI2, which are required to understand the various details of the various teachings of the novel SLHDR2PLUS codec method written below; [Figure 5] illustrates how color transformations and in particular brightness transformations in the YCbCr color space are performed by re-grading; [Figure 6] explains some useful applications by explaining useful applications Explaining our concept of customizable curves in more detail; [Figure 7] explains the basic idea in the encoding and communication of the intermediate dynamic range (IDR) of the original master HDR image, and the unmistakable concept of the medium dynamic range image (MDR), which is generally calculated from any received image to optimize it for display on any specific display with available display peak brightness PB_D, such as a specific HDR purchased by any end consumer who intends to view the received HDR video. tv; [FIG. 8] further illustrates how to start dealing with the IDR problem, in particular by solving it in a specific automatically computable way by the decoder, and in particular if at least some of the decoders that may receive the content are ETSI2 compliant decoders that are already on the market and may not be easily upgraded with the new SLHDR2PLUS technology (for example, because the owner of the tv or STB does not upgrade it); [FIG. 9] shows the general structure of the technical components generally required for the novel SLHDR2PLUS encoder of this application; [FIG. 10] illustrates some basic technical aspects involved in the continuous derivation of various corresponding brightness mapping functions by the encoder, in particular using the example of the para brightness mapping function; [FIG. 11] shows a possible typical high-level structure of the novel SLHDR2PLUS decoder, which follows some embodiments of the various possibilities described below Instructions to implement SLHDR2PLUS video communication; [FIG. 12] Further explanation of some aspects of black and white offsets when selected by the content creator from his point of view in his master brightness mapping function that defines his video content re-grading needs; [FIG. 13] Describing the technical principle of the preferred method for deriving a channel-adapted version of para based on the principle of diagonal scaling; [FIG. 14] Illustrate some aspects of the inverse curve of para (the so-called abcara); [FIG. 15] Detailed description of some aspects of the SLHDR2PLUS decoder; [FIG. 16] Illustrate a useful way to implement display adaptation to calculate MDR images for any specific PB_D display integrated with the new technology of the SLHDR2PLUS encoding principle of this application; [FIG. 17] Illustrate black and white offsets (BLO & WLO) to accompany and simplify the following mathematics and give some further aspects of the general physical technical basic principles; [Figure 18] shows another embodiment of SLHDR2PLUS encoding (or actually several teachings of various embodiments combined in one diagram), which is particularly useful because the encoded image can be directly decoded by a standard deployed ETSI2 decoder; [Figure 19] illustrates how to determine the corresponding version of the original master customized curve for various dependent peak brightness image representations (whether as input or output images), such as a common standard IDR image with a customized curve, to fine-tune the coarse mapped IDR brightness to a fine tuned according to the needs of, for example, a film creator's grader =The correct final SDR brightness; [Figure 20] shows the basic technical principle of the method of Figure 18 on the spectrum of the regradable image; [Figure 21] illustrates the extrapolation of the adaptation of the brightness mapping function (e.g., para) beyond the highest value of the starting image (corresponding to the uniform transformation or the diagonal line in the diagram), and also illustrates the relationship between the shape of the para function and the corresponding abcara for a particular choice; [Figure 22] schematically illustrates how a specific embodiment of the limitation of the high brightness gain of para works on the brightness regradation between the input normalized brightness and the output normalized brightness; [Figure 23] schematically shows the total HDR communication and chain to clearly indicate the difference between video encoding and display optimization (the former is about the actual definition of the image itself, while the latter is only about optimizing for any specific display of a specific dynamic range capability, loosely speaking customizable, preferably taking into account the re-grading needs of various HDR scene images that may be quite different and quite challenging); [Figure 24] further illustrates how the normalized function change algorithm will operate, which will take any input function shape generated by, for example, a content creator's grader to specify how the normalized luminance of a first peak luminance dynamic range should be mapped to the normalized luminance of a second peak luminance image representation, and how from this input function can be optimized for any given input The input function specifically calculates the function mapping between different particularly low third peak brightness representations and the second one; [FIG. 25] schematically illustrates how the currently taught SLHDR2PLUS video coding mechanism and signal are well compatible with traditional SLHDR2 decoders, but still provide additional possibilities for deployed SLHDR2PLUS decoders; [FIG. 26] schematically summarizes all the following decoding mathematics in a block to recover all the required brightness mapping functions for all the required decoding brightness processing to obtain the desired image from the received IDR image; and [FIG. 27] schematically illustrates a good practical method to numerically specifically obtain a customized version of the customizable curve (if any).
900:編碼器 900: Encoder
901:HDR函數產生單元/單元 901: HDR function generation unit/unit
902:IDR影像計算單元 902: IDR image calculation unit
903:IDR映射函數產生器 903:IDR mapping function generator
920:影像輸入 920: Image input
921:第二後設資料輸入 921: Second metadata input
922:第一後設資料輸入 922: First meta-data input
923:第三後設資料輸入 923: Third meta-data input
930:影像輸出 930: Image output
931:第二後設資料輸出 931: Second metadata output
932:第一後設資料輸出 932: First metadata output
933:第三後設資料輸出 933: Third metadata output
F_H2hCI:經調適明亮度映射函數 F_H2hCI: Adaptive brightness mapping function
F_I2sCI:通道明亮度映射函數 F_I2sCI: Channel brightness mapping function
FL_50t1:主明亮度映射函數 FL_50t1: Main brightness mapping function
IDR:中間動態範圍 IDR: Intermediate Dynamic Range
MsterHDR:主HDR視訊/輸入高動態範圍影像 MsterHDR: Master HDR video/input high dynamic range image
PB_C_H50:第一最大像素輝度 PB_C_H50: First maximum pixel brightness
PB_CH:通道峰亮度/第二最大像素輝度 PB_CH: Channel peak brightness/second largest pixel brightness
Claims (10)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP18192636.1 | 2018-09-05 | ||
| EP18192636.1A EP3621307A1 (en) | 2018-09-05 | 2018-09-05 | Multi-range hdr video coding |
| EP19176732 | 2019-05-27 | ||
| EP19176732.6 | 2019-05-27 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW202034692A TW202034692A (en) | 2020-09-16 |
| TWI843747B true TWI843747B (en) | 2024-06-01 |
Family
ID=73643467
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW108131502A TWI843747B (en) | 2018-09-05 | 2019-09-02 | Multi-range hdr video coding |
Country Status (1)
| Country | Link |
|---|---|
| TW (1) | TWI843747B (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2689392B1 (en) * | 2011-03-24 | 2015-09-02 | Koninklijke Philips N.V. | Apparatus and method for analyzing image gradings |
| TW201543867A (en) * | 2014-02-26 | 2015-11-16 | Thomson Licensing | Method and apparatus for encoding and decoding HDR images |
| WO2016020189A1 (en) * | 2014-08-08 | 2016-02-11 | Koninklijke Philips N.V. | Methods and apparatuses for encoding hdr images |
| US20160307602A1 (en) * | 2010-03-03 | 2016-10-20 | Koninklijke Philips N.V. | Methods and apparatuses for processing or defining luminance/color regimes |
-
2019
- 2019-09-02 TW TW108131502A patent/TWI843747B/en active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160307602A1 (en) * | 2010-03-03 | 2016-10-20 | Koninklijke Philips N.V. | Methods and apparatuses for processing or defining luminance/color regimes |
| EP2689392B1 (en) * | 2011-03-24 | 2015-09-02 | Koninklijke Philips N.V. | Apparatus and method for analyzing image gradings |
| TW201543867A (en) * | 2014-02-26 | 2015-11-16 | Thomson Licensing | Method and apparatus for encoding and decoding HDR images |
| WO2016020189A1 (en) * | 2014-08-08 | 2016-02-11 | Koninklijke Philips N.V. | Methods and apparatuses for encoding hdr images |
Also Published As
| Publication number | Publication date |
|---|---|
| TW202034692A (en) | 2020-09-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7453214B2 (en) | Multi-range HDR video encoding | |
| JP6596125B2 (en) | Method and apparatus for creating a code mapping function for encoding of HDR images, and method and apparatus for use of such encoded images | |
| US11195492B2 (en) | Optimizing high dynamic range images for particular displays | |
| JP6615251B2 (en) | Method and apparatus for encoding HDR images, and method and apparatus for use of such encoded images | |
| CN108431886B (en) | Optimize high dynamic range images for specific displays | |
| JP7313285B2 (en) | Optimizing the saturation of decoded high dynamic range images | |
| CN108521859B (en) | Apparatus and method for processing multiple HDR image sources | |
| JP6495552B2 (en) | Dynamic range coding for images and video | |
| US10366672B2 (en) | Optimizing high dynamic range images for particular displays | |
| JP2019512954A (en) | HDR video encoding and decoding | |
| JP7705843B2 (en) | Adjusting Display Optimization Behavior for HDR Images | |
| JP2019506817A (en) | Processing multiple HDR image sources | |
| JP2024517241A (en) | Display-optimized HDR video contrast adaptation | |
| EP3621307A1 (en) | Multi-range hdr video coding | |
| JP7695482B2 (en) | Optimizing the display of HDR images | |
| TWI843747B (en) | Multi-range hdr video coding | |
| RU2790178C2 (en) | Multirange encoding of video with expanded dynamic range | |
| KR102279842B1 (en) | Methods and apparatuses for encoding hdr images | |
| JP2024519606A (en) | Display-optimized HDR video contrast adaptation |