[go: up one dir, main page]

HK1228623B - Signaling three-dimensional video information in communication networks - Google Patents

Signaling three-dimensional video information in communication networks Download PDF

Info

Publication number
HK1228623B
HK1228623B HK17102052.2A HK17102052A HK1228623B HK 1228623 B HK1228623 B HK 1228623B HK 17102052 A HK17102052 A HK 17102052A HK 1228623 B HK1228623 B HK 1228623B
Authority
HK
Hong Kong
Prior art keywords
dash
frame
profile
representation
media
Prior art date
Application number
HK17102052.2A
Other languages
Chinese (zh)
Other versions
HK1228623A1 (en
Inventor
Ozgur Oyman
Original Assignee
苹果公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苹果公司 filed Critical 苹果公司
Publication of HK1228623A1 publication Critical patent/HK1228623A1/en
Publication of HK1228623B publication Critical patent/HK1228623B/en

Links

Description

在通信网络中信号传递三维视频信息Signaling 3D video information in communication networks

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本申请要求2012年4月9日提交的名为“先进无线通信系统和技术”的美国临时专利申请号61/621,939的优先权益,和2012年8月3日提交的名为“先进无线通信系统和技术”的美国临时专利申请号61/679,627的优先权益,这些文档的全部公开通过引用结合于此。This application claims the benefit of priority to U.S. Provisional Patent Application No. 61/621,939, filed on April 9, 2012, entitled “ADVANCED WIRELESS COMMUNICATION SYSTEMS AND TECHNIQUES,” and to U.S. Provisional Patent Application No. 61/679,627, filed on August 3, 2012, entitled “ADVANCED WIRELESS COMMUNICATION SYSTEMS AND TECHNIQUES,” the entire disclosures of which are incorporated herein by reference.

技术领域Technical Field

本发明的实施例大体上涉及通信的领域,并且更具体地涉及在通信网络中信号传递三维视频信息。Embodiments of the present invention generally relate to the field of communications, and more particularly to signaling three-dimensional video information in a communications network.

背景技术Background Art

三维(3-D)视频提供高质量和拟真多媒体体验,其只是最近通过显示技术、信号处理、传输技术和电路设计中的进步才在消费电子产品和移动平台上变为可行。这在当前通过各种渠道(包括Blu-ray DiscTM、电缆和卫星传输等)被引入家庭,以及通过支持3-D的智能电话等被引入移动网络。涉及通过无线网络分发这样的内容的概念正在开发中。Three-dimensional (3-D) video offers a high-quality and immersive multimedia experience that has only recently become feasible on consumer electronics and mobile platforms through advances in display technology, signal processing, transmission technology, and circuit design. It is currently being introduced into homes through various channels, including Blu-ray Disc , cable, and satellite transmission, as well as into mobile networks through 3-D-enabled smartphones. Concepts involving the distribution of such content over wireless networks are under development.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

实施例将通过下列详细说明连同附图而容易理解。为了便于该描述,类似的标号指代类似的结构元件。实施例通过示例而非限制的方式在附图的图中图示。The embodiments will be readily understood by the following detailed description taken in conjunction with the accompanying drawings. To facilitate this description, similar reference numerals refer to similar structural elements. The embodiments are illustrated in the figures of the accompanying drawings by way of example and not limitation.

图1示意图示根据各种实施例的无线通信网络。FIG1 schematically illustrates a wireless communication network according to various embodiments.

图2a-b图示根据各种实施例的流内容和/或关联的会话描述和元数据文件的适配(adaption)。2a-b illustrate the adaptation of streaming content and/or associated session description and metadata files according to various embodiments.

图3图示根据实施例的流会话的设立。FIG3 illustrates the setup of a streaming session according to an embodiment.

图4图示根据各种实施例的帧兼容封装格式(packing format)。FIG4 illustrates a frame-compatible packing format according to various embodiments.

图5图示根据各种实施例的信号传递3-D视频设备能力的方法。FIG5 illustrates a method of signaling 3-D video device capabilities according to various embodiments.

图6图示根据各种实施例的信号传递3-D视频内容的方法。FIG6 illustrates a method of signaling 3-D video content according to various embodiments.

图7示意描绘根据各种实施例的示例系统。FIG7 schematically depicts an example system in accordance with various embodiments.

具体实施方式DETAILED DESCRIPTION

本公开的说明性实施例包括但不限于用于在通信网络中客户端设备的信号传递立体三维视频内容能力的方法、系统、计算机可读介质和装置。本发明的一些实施例在本上下文中可以关于在无线通信网络中移动设备的信号传递立体三维视频内容能力的方法、系统、计算机可读介质和装置。Illustrative embodiments of the present disclosure include, but are not limited to, methods, systems, computer-readable media, and apparatus for signaling stereoscopic 3D video content capabilities of a client device in a communication network. Some embodiments of the present invention may relate, in this context, to methods, systems, computer-readable media, and apparatus for signaling stereoscopic 3D video content capabilities of a mobile device in a wireless communication network.

说明性实施例的各种方面将使用通常由本领域内技术人员采用的术语来描述以向本领域内其他技术人员传达他们的工作的实质。然而,替代实施例可仅用描述的方面中的一些实践,这对于本领域内技术人员将是明显的。为了说明的目的,阐述具体数字、材料和配置以便提供对说明性实施例的全面理解。然而,替代实施例可在没有具体细节的情况下实践,这对于本领域内技术人员将是明显的。在其他实例中,省略或简化众所周知的特征以便不掩盖说明性实施例。Various aspects of the illustrative embodiments will be described using terms commonly employed by those skilled in the art to convey the essence of their work to others skilled in the art. However, alternative embodiments may be practiced using only some of the described aspects, as will be apparent to those skilled in the art. For illustrative purposes, specific numbers, materials, and configurations are set forth to provide a comprehensive understanding of the illustrative embodiments. However, alternative embodiments may be practiced without the specific details, as will be apparent to those skilled in the art. In other instances, well-known features are omitted or simplified so as not to obscure the illustrative embodiments.

]此外,各种操作将进而采用对于理解说明性实施例最有帮助的方式描述为多个分立操作;然而,描述的顺序不应该解释为暗示这些操作必定依赖于顺序。特别地,这些操作不需要按呈现的顺序进行。Furthermore, various operations will be described as multiple discrete operations in a manner that is most helpful for understanding the illustrative embodiments; however, the order of description should not be construed as to imply that these operations are necessarily order dependent. In particular, these operations do not need to be performed in the order presented.

重复使用短语“在一些实施例中”。该短语一般不指相同的实施例;然而,它可指相同的实施例。术语“包括”、“具有”和“包含”是同义的,除非上下文另外指示。短语“A和/或B”意思是(A)、(B)或(A和B)。短语“A/B”和“A或B”意思是(A)、(B)或(A和B),与短语“A和/或B”相似。短语“A、B和C中的至少一个”意思是(A)、(B)、(C)、(A和B)、(A和C)、(B和C)或(A、B和C)。短语“(A)B”意思是(B)或(A和B),即A是可选的。The phrase "in some embodiments" is used repeatedly. This phrase generally does not refer to the same embodiment; however, it may refer to the same embodiment. The terms "including," "having," and "comprising" are synonymous unless the context indicates otherwise. The phrase "A and/or B" means (A), (B), or (A and B). The phrases "A/B" and "A or B" mean (A), (B), or (A and B), similar to the phrase "A and/or B." The phrase "at least one of A, B, and C" means (A), (B), (C), (A and B), (A and C), (B and C), or (A, B, and C). The phrase "(A)B" means (B) or (A and B), i.e., A is optional.

尽管已经在本文说明和描述具体实施例,本领域内普通技术人员将意识到很多种替代和/或等同实现可代替示出和描述的具体实施例,而不偏离本公开的实施例的范围。本申请意在涵盖本文论述的实施例的任何适配或变化。因此,明显地规定本公开的实施例仅由权利要求和其等同物所限制。Although specific embodiments have been illustrated and described herein, those skilled in the art will appreciate that many alternative and/or equivalent implementations may be substituted for the specific embodiments shown and described without departing from the scope of the embodiments of the present disclosure. This application is intended to cover any adaptation or variation of the embodiments discussed herein. Therefore, it is expressly intended that the embodiments of the present disclosure be limited only by the claims and their equivalents.

如本文使用的,“模块”可指代以下各项、是以下各项的一部分或包括以下各项:专用集成电路(ASIC)、电子电路、执行一个或多个软件或固件程序的处理器(共享、专用或群组)和/或存储器(共享、专用或群组)、组合逻辑电路和/或提供描述的功能性的其他适合部件。As used herein, a "module" may refer to, be a part of, or include an application specific integrated circuit (ASIC), an electronic circuit, a processor (shared, dedicated, or group) and/or memory (shared, dedicated, or group) that executes one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the described functionality.

随着H.264/MPEG-4先进视频编码(AVC)标准的引入已经证明视频压缩能力的明显提高。因为开发该标准,ITU-T视频编码专家组(VCEG)和国际标准化组织(ISO)/国际电工委员会(IEC)移动图像专家组(MPEG)的联合视频小组还定出AVC扩展(称为多视点视频编码(MVC))的标准。MVC提供例如多个同步视频拍摄设备等视频场景的多个视图的紧凑表示。The introduction of the H.264/MPEG-4 Advanced Video Coding (AVC) standard has demonstrated significant improvements in video compression capabilities. Since developing this standard, the joint video group of the ITU-T Video Coding Experts Group (VCEG) and the International Organization for Standardization (ISO)/International Electrotechnical Commission (IEC) Moving Picture Experts Group (MPEG) has also standardized an AVC extension known as Multiview Video Coding (MVC). MVC provides a compact representation of multiple views of a video scene, such as from multiple synchronized video capture devices.

在立体3D视频应用中,显示两个视图。一个针对左眼并且一个针对右眼。存在用于使立体3D视频内容的视图格式化的各种方法。在一个实施例中,立体成对3D视频的编码可以是MVC的特殊情况,其中左眼和右眼视图经由MVC产生。产生3D视频内容的其他编码格式也是可能的。关于对这些不同的格式解码和渲染,各种设备可具有不同的能力。本文描述的实施例提供设备能力交换的各种参数,其可便于在例如无线网络(例如演进的通用地面无线电接入网络(EUTRAN))等通信网络中分发和查看3D视频内容。In stereoscopic 3D video applications, two views are displayed. One for the left eye and one for the right eye. Various methods exist for formatting views of stereoscopic 3D video content. In one embodiment, the encoding of stereoscopic paired 3D video can be a special case of MVC, where the left-eye and right-eye views are generated via MVC. Other encoding formats for generating 3D video content are also possible. Various devices may have different capabilities for decoding and rendering these different formats. The embodiments described herein provide various parameters for device capability exchange, which can facilitate the distribution and viewing of 3D video content in communication networks such as wireless networks (e.g., the Evolved Universal Terrestrial Radio Access Network (EUTRAN)).

图1示意地图示根据各种实施例的网络环境100。该网络环境100包括与无线电接入网络(RAN)108无线耦合的用户装置(UE)104,其还可称为客户终端或移动设备。该RAN108可包括增强节点基站(eNB)112,其配置成经由空中(OTA)接口与UE 104通信。该RAN 108可以是第三代合作伙伴计划(3GPP)长期演进(LTE)先进网络的一部分,并且可称为EUTRAN。在其他实施例中,可利用其他的无线电接入网络技术。FIG1 schematically illustrates a network environment 100 according to various embodiments. The network environment 100 includes a user equipment (UE) 104, which may also be referred to as a client terminal or mobile device, wirelessly coupled to a radio access network (RAN) 108. The RAN 108 may include an enhanced node base station (eNB) 112 configured to communicate with the UE 104 via an over-the-air (OTA) interface. The RAN 108 may be part of a 3rd Generation Partnership Project (3GPP) Long Term Evolution (LTE) Advanced network and may be referred to as EUTRAN. In other embodiments, other radio access network technologies may be utilized.

UE 104可通过RAN 108与远程媒体服务器116通信。尽管示出eNB 112与该媒体服务器直接通信,将理解在各种实施例中通信可流过许多中间联网部件,例如交换机、路由器、网关等。例如,在一些实施例中,RAN 108可与核心服务网络(CSN)耦合,该CSN使RAN 108与更大的网络(媒体服务器116可认为是其的一部分)、例如广域网通信耦合。The UE 104 may communicate with a remote media server 116 via the RAN 108. Although the eNB 112 is shown communicating directly with the media server, it will be appreciated that in various embodiments communications may flow through numerous intermediate networking components, such as switches, routers, gateways, etc. For example, in some embodiments, the RAN 108 may be coupled to a core serving network (CSN) that communicatively couples the RAN 108 to a larger network (of which the media server 116 may be considered a part), such as a wide area network.

尽管图1将网络环境描述为无线通信网络,其他实施例可在其他类型的网络中使用,例如有线网络。可理解可采用本发明的实施例的其他网络环境与在图1中描绘的示例中明确示出的那些相比可包括额外的、更少的或不同的部件。例如,在有线网络中采用的本发明的实施例可具有媒体服务器116和 UE 104,其在没有RAN 108的情况下互相通信。Although FIG1 depicts the network environment as a wireless communication network, other embodiments may be used in other types of networks, such as a wired network. It will be appreciated that other network environments in which embodiments of the present invention may be employed may include additional, fewer, or different components than those explicitly shown in the example depicted in FIG1 . For example, an embodiment of the present invention employed in a wired network may have a media server 116 and a UE 104 communicating with each other without a RAN 108.

UE 104和媒体服务器116可具有许多配置成便于3D视频内容的访问、存储、传输和显示的部件。例如,UE 104可包括内容管理模块120、具有流应用126的媒体播放器124和显示器128。流应用126可具有足够的功能性来接收3D视频内容和关联的信息;解码、拆解以及用别的方式重新组装3D视频;以及在显示器128上渲染3D视频。在各种实施例中,在采用的流技术的背景下可引用流应用126。例如,在内容由分组交换流服务(PSS)来流播的实施例中,流应用126可称为PSS应用。内容管理模块120可协商或用别的方式传送流参数,其包括例如设备能力参数,以能够以便于媒体播放器124操作的方式接收数据。UE 104 and media server 116 may include a number of components configured to facilitate access, storage, transmission, and display of 3D video content. For example, UE 104 may include a content management module 120, a media player 124 having a streaming application 126, and a display 128. Streaming application 126 may include sufficient functionality to receive 3D video content and associated information; decode, disassemble, and otherwise reassemble the 3D video; and render the 3D video on display 128. In various embodiments, streaming application 126 may be referenced in the context of the streaming technology employed. For example, in embodiments where content is streamed via a packet-switched streaming service (PSS), streaming application 126 may be referred to as a PSS application. Content management module 120 may negotiate or otherwise communicate streaming parameters, including, for example, device capability parameters, to enable data to be received in a manner that facilitates operation of media player 124.

媒体服务器116可包括具有流应用134的内容分发模块132、内容管理模块136和内容存储140。内容分发模块132可对存储在内容存储140中的3D视频内容编码、封装或用别的方式组装,用于传输到一个或多个UE,例如UE 104。内容管理模块136可协商或用别的方式传送流参数(其包括例如设备能力参数)并且以便于分发3D内容的方式控制内容分发模块132。The media server 116 may include a content distribution module 132 having a streaming application 134, a content management module 136, and a content store 140. The content distribution module 132 may encode, package, or otherwise assemble 3D video content stored in the content store 140 for transmission to one or more UEs, such as the UE 104. The content management module 136 may negotiate or otherwise communicate streaming parameters (including, for example, device capability parameters) and control the content distribution module 132 in a manner that facilitates the distribution of 3D content.

在一些实施例中,示出为是媒体服务器116的一部分的部件中的一个或多个可设置成与媒体服务器116分开并且通过通信链路与媒体服务器通信耦合。例如,在一些实施例中,内容存储140可设置成远离内容分发模块132和内容管理模块136。In some embodiments, one or more of the components shown as part of the media server 116 may be located separately from the media server 116 and communicatively coupled to the media server via a communication link. For example, in some embodiments, the content store 140 may be located remotely from the content distribution module 132 and the content management module 136.

在一些实施例中,内容分发模块132可根据3GPP流标准在一个示例中通过eNB112将3D视频内容分发给UE 104。例如,可根据PSS标准(例如3GPP TS 26.234 V11.0.0(2012年3月16日))、通过HTTP的动态自适应流(DASH)标准(例如3GPP TS 26.247 V.11.0.0(2012年3月16日))、多媒体广播和多播服务(MBMS)标准(例如,TS 26.346 V11.1.0(2012年6月29日))和/或基于IMS的PSS和MBMS服务(IMS_PSS_MBMS)标准(例如,TS 26.237 V.11.0.0(2012年6月29日))传输3D视频内容。流应用126可配置成通过许多传输协议中的任意传输协议接收3D视频内容,例如实时传输协议(RTP)、超文本传输协议(HTTP)等。In some embodiments, the content distribution module 132 can distribute the 3D video content to the UE 104 according to a 3GPP streaming standard, in one example, via the eNB 112. For example, the 3D video content can be transmitted according to the PSS standard (e.g., 3GPP TS 26.234 V11.0.0 (March 16, 2012)), the Dynamic Adaptive Streaming over HTTP (DASH) standard (e.g., 3GPP TS 26.247 V.11.0.0 (March 16, 2012)), the Multimedia Broadcast and Multicast Service (MBMS) standard (e.g., TS 26.346 V11.1.0 (June 29, 2012)), and/or the IMS-based PSS and MBMS Service (IMS_PSS_MBMS) standard (e.g., TS 26.237 V.11.0.0 (June 29, 2012)). The streaming application 126 may be configured to receive 3D video content via any of a number of transport protocols, such as Real-time Transport Protocol (RTP), Hypertext Transfer Protocol (HTTP), and the like.

能力交换使例如媒体服务器116等媒体流服务器能够向广泛设备提供适合于考虑中的特定设备的视频内容。为了便于服务器端内容协商以用于流,媒体服务器116可确定UE104的具体能力。Capability exchange enables a media streaming server, such as media server 116, to provide video content suitable for the particular device in question to a wide range of devices.To facilitate server-side content negotiation for streaming, media server 116 may determine the specific capabilities of UE 104.

内容管理模块120和内容管理模块136可协商或用别的方式传送3D视频内容流会话的参数。该协商可经由RAN 108通过会话级信令而发生。在一些实施例中,该会话级信令可包括与设备能力信息(其包括媒体播放器124的立体3D视频解码和渲染能力)有关的传输。在各种实施例中,设备能力信息可进一步包括预解码器缓冲区大小、初始缓冲、解码器能力、显示器特性(屏幕大小、分辨率、位深度等)、流方法(实时流协议(RTSP)、HTTP等)、自适应支持、体验质量(QoE)支持、扩展实时传输协议(RTCP)报告支持、快速内容切换支持、支持的RTP概况(RTP profile)、会话描述协议(SDP)属性,等。Content management module 120 and content management module 136 may negotiate or otherwise communicate parameters for a 3D video content streaming session. This negotiation may occur via session-level signaling via RAN 108. In some embodiments, this session-level signaling may include the transmission of device capability information, including stereoscopic 3D video decoding and rendering capabilities of media player 124. In various embodiments, the device capability information may further include pre-decoder buffer size, initial buffering, decoder capabilities, display characteristics (screen size, resolution, bit depth, etc.), streaming method (Real Time Streaming Protocol (RTSP), HTTP, etc.), adaptation support, Quality of Experience (QoE) support, extended Real-time Transport Protocol (RTCP) reporting support, fast content switching support, supported RTP profiles, Session Description Protocol (SDP) attributes, etc.

在流会话的设立期间,内容管理模块136可使用设备能力信息以采用向UE 104提供正确类型的多媒体内容的方式控制内容分发模块132。例如,媒体服务器116可基于UE104的实际能力确定视频流的多个可用变化形式中的哪些变化形式是期望的以确定对该终端最适合的流。这可允许提高3D视频内容以及关联的会话描述和元数据文件(例如SDP文件或媒体呈现描述(MPD)文件)到UE 104的分发。During the setup of a streaming session, the content management module 136 can use the device capability information to control the content distribution module 132 in a manner that provides the correct type of multimedia content to the UE 104. For example, the media server 116 can determine which of multiple available variations of the video stream is desirable based on the actual capabilities of the UE 104 to determine the most appropriate stream for the terminal. This can improve the delivery of 3D video content and associated session description and metadata files (e.g., SDP files or media presentation description (MPD) files) to the UE 104.

内容分发模块132可访问内容存储140中的内容并且在分发内容/关联文件之前根据协商会话参数使内容和/或关联的会话描述和元数据文件(例如SDP/MPD文件)适配。内容在被分发到UE 104时可由媒体播放器124解码并且在显示器128上渲染。The content delivery module 132 can access the content in the content storage 140 and adapt the content and/or associated session description and metadata files (e.g., SDP/MPD files) according to the negotiated session parameters before delivering the content/associated files. The content can be decoded by the media player 124 and rendered on the display 128 when delivered to the UE 104.

根据参考图2a-b的一些具体示例示出内容和/或关联会话描述和元数据文件的适配,而根据参考图3的具体示例示出流会话的设立。The adaptation of the content and/or associated session description and metadata files is shown with reference to some specific examples in FIG. 2 a - b , whereas the setup of a streaming session is shown with reference to a specific example in FIG. 3 .

图2a图示根据一些实施例的具有3D视频格式适配的基于DASH的流实施例。特别地,图2a图示与DASH客户端208通信并且实现推式流实施例的HTTP服务器204,其中流控制由客户端而不是服务器来维持,其中客户端在检查MPD后通过一系列基于HTTP的请求-应答事务从服务器下载内容。在基于DASH的流中,MPD元数据文件提供关于存储在HTTP服务器204中的媒体内容表示的结构和不同版本的信息(其包括不同的比特率、帧率、分辨率、编解码类型等)。基于描述片段的关系以及它们如何形成媒体呈现的该MPD元数据信息,DASH客户端208可使用HTTP GET或部分GET方法来请求媒体片段。HTTP服务器204和DASH客户端208可分别与媒体服务器116和UE 104相似并且大致上与其能互换。Figure 2a illustrates a DASH-based streaming embodiment with 3D video format adaptation in accordance with some embodiments. Specifically, Figure 2a illustrates an HTTP server 204 communicating with a DASH client 208 and implementing a push-based streaming embodiment, in which streaming control is maintained by the client rather than the server, and the client downloads content from the server through a series of HTTP-based request-response transactions after inspecting the MPD. In DASH-based streaming, the MPD metadata file provides information about the structure and different versions of the media content representations stored in the HTTP server 204 (including different bitrates, frame rates, resolutions, codec types, etc.). Based on this MPD metadata information, which describes the relationships between the segments and how they form the media presentation, the DASH client 208 can request media segments using HTTP GET or partial GET methods. The HTTP server 204 and the DASH client 208 can be similar to and generally interchangeable with the media server 116 and the UE 104, respectively.

在DASH中,可在MPD中信号传递3D视频格式和对应的内容信息的集合到DASH客户端208。根据DASH客户端208的能力概况和它支持的3D格式,HTTP服务器204可提供不同的格式化内容,例如HTTP服务器204可在MPD中排除DASH客户端208不支持的3D格式并且只包括DASH客户端208所支持的那些3D格式。在该背景下,HTTP服务器204可将对不同3D视频格式优化的内容提供给DASH客户端208。在进行此的时候,HTTP服务器204可使用来自DASH客户端108的描述各种支持的3D视频格式的设备能力交换信令。DASH客户端208然后可请求由DASH客户端208支持的3D视频内容的对应版本。此外,当用HTTP检索MPD时,DASH客户端208可在GET请求中包括3D视频编解码(codec)和格式信息,其包括基于概况差异(ProfDiff)对3D视频格式的任何临时调整。在示例中,该差异可配置成临时修改内容呈现会话的一个或多个MPD参数。例如,该差异可配置成修改MPD直到内容呈现会话结束或后续差异(其对应于首先传送的差异)被传送到HTTP服务器204。这样,HTTP服务器204可将优化的MPD分发给DASH客户端208。In DASH, a set of 3D video formats and corresponding content information can be signaled to the DASH client 208 in an MPD. Depending on the capability profile of the DASH client 208 and the 3D formats it supports, the HTTP server 204 can provide differently formatted content. For example, the HTTP server 204 can exclude 3D formats not supported by the DASH client 208 from the MPD and only include those 3D formats supported by the DASH client 208. In this context, the HTTP server 204 can provide content optimized for different 3D video formats to the DASH client 208. To do so, the HTTP server 204 can exchange signaling using device capabilities from the DASH client 108 that describe the various supported 3D video formats. The DASH client 208 can then request the corresponding version of the 3D video content supported by the DASH client 208. Furthermore, when retrieving the MPD using HTTP, the DASH client 208 can include 3D video codec and format information in the GET request, including any temporary adjustments to the 3D video format based on the profile difference (ProfDiff). In an example, the difference can be configured to temporarily modify one or more MPD parameters for the content rendering session. For example, the difference can be configured to modify the MPD until the content rendering session ends or a subsequent difference (which corresponds to the difference first transmitted) is transmitted to the HTTP server 204. In this way, the HTTP server 204 can distribute the optimized MPD to the DASH client 208.

图2b图示根据一些实施例的具有3D视频格式适配的基于RTSP的流的实施例。特别地,图2b图示实现推式流方法的服务器212和客户端216,其中流和会话控制由服务器212而不是客户端216来维持。服务器212和客户端216可分别与媒体服务器116和UE 104相似并且大致上与其能互换。FIG2 b illustrates an embodiment of RTSP-based streaming with 3D video format adaptation according to some embodiments. In particular, FIG2 b illustrates a server 212 and a client 216 implementing a push streaming approach, where streaming and session control are maintained by the server 212 rather than the client 216. The server 212 and the client 216 can be similar to and generally interchangeable with the media server 116 and the UE 104, respectively.

推式流的示例包括分别基于RTSP和会话发起协议(SIP)的PSS和IMS_PSS_MBMS服务。在该背景下,服务器212接收来自客户端216的支持的3D视频编解码和格式的集合并且基于该信息使内容适配,例如服务器212基于支持的3D视频格式在存储的内容版本之间选择最适合的内容版本或者动态转码该内容,以及将内容流播到客户端216。SDP中携带的会话相关元数据可携带流内容的3D视频格式信息。Examples of push streaming include PSS and IMS_PSS_MBMS services based on RTSP and Session Initiation Protocol (SIP), respectively. In this context, server 212 receives a set of supported 3D video codecs and formats from client 216 and adapts the content based on this information. For example, server 212 selects the most suitable content version from among the stored content versions based on the supported 3D video formats or dynamically transcodes the content, and streams the content to client 216. Session-related metadata carried in SDP can carry 3D video format information for the streamed content.

图3图示根据一些实施例的具有预订/通知IMS_PSS_MBMS服务的服务发现。特别地,图3图示UE 304、IP多媒体(IM)核心网(CN)子系统308和服务发现功能(SDF)312之间的交互。UE 304可与UE 104相似并且大致上与其能互换。IM CN子系统308和SDF 312可以是与接入网域(例如,RAN 108)接口的核心网域的一部分。FIG3 illustrates service discovery with subscription/notification of IMS_PSS_MBMS services, according to some embodiments. Specifically, FIG3 illustrates the interaction between a UE 304, an IP Multimedia (IM) Core Network (CN) subsystem 308, and a Service Discovery Function (SDF) 312. UE 304 may be similar to and generally interchangeable with UE 104. IM CN subsystem 308 and SDF 312 may be part of a core network domain that interfaces with an access network domain (e.g., RAN 108).

在IMS_PSS_MBMS服务中,UE 304可在服务发现期间在SIP SUBSCRIBE消息中将设备能力信息(例如,支持的3D视频编解码和格式)发送给IM CN子系统308。IM CN子系统308然后可将该消息转发给SDF 312。SDF 312例如根据如在用户概况(个性化服务发现)中描述的UE 304的能力来确定正确的服务发现信息。SDF 312然后可将SIP 200 OK消息发送给IMCN子系统308,该消息被转送到UE 304以基于发送的设备能力信息(其也包括支持的3D视频编解码和格式)来确认会话发起。之后,SDF 132可发送SIP NOTIFY消息(具有服务发现信息)给IM CN 子系统308,其将SIP NOTIFY消息转送回到UE 304。UE 304然后可通过将SIP200 OK消息发送到IM CN 子系统308而作出应答,该SIP 200 OK消息然后被转送到SDF312。In the IMS_PSS_MBMS service, UE 304 may send device capability information (e.g., supported 3D video codecs and formats) to IM CN subsystem 308 in a SIP SUBSCRIBE message during service discovery. IM CN subsystem 308 may then forward this message to SDF 312. SDF 312 determines the correct service discovery information based on, for example, UE 304's capabilities as described in the user profile (personalized service discovery). SDF 312 may then send a SIP 200 OK message to IM CN subsystem 308, which is forwarded to UE 304 to confirm session initiation based on the sent device capability information (which also includes supported 3D video codecs and formats). SDF 132 may then send a SIP NOTIFY message (with the service discovery information) to IM CN subsystem 308, which forwards the SIP NOTIFY message back to UE 304. UE 304 may then respond by sending a SIP 200 OK message to IM CN subsystem 308 , which is then forwarded to SDF 312 .

这样的框架在基于IMS的PSS和MBMS用户服务中使用支持的3D视频格式而实现优化的服务发现。稍后在IMS会话期间,UE 304还可使用SIP信令来指示更新,其包括基于ProfDiff(例如,如果当前设备取向与默认设备取向不同)对支持的3D视频编解码和格式的集合的任何临时调整。这可通过另外的SIP SUBSCRIBE消息(其包括关于对3D视频格式信息更新的信息)刷新预订而进行。Such a framework enables optimized service discovery using supported 3D video formats in IMS-based PSS and MBMS user services. Later during the IMS session, UE 304 can also use SIP signaling to indicate updates, including any temporary adjustments to the set of supported 3D video codecs and formats based on ProfDiff (for example, if the current device orientation differs from the default device orientation). This can be done by refreshing the subscription via another SIP SUBSCRIBE message (which includes information about updates to the 3D video format information).

再次参考图1,在一些实施例中,媒体服务器116可与具有UE 104的概况信息的设备概况服务器144耦合。该概况信息可包括设备能力信息中的一些或全部。在这样的实施例中,媒体服务器116可接收来自UE 104的标识信息并且然后从设备概况服务器144检索概况信息。这可作为会话级信令的一部分而进行。Referring again to FIG. 1 , in some embodiments, the media server 116 may be coupled to a device profile server 144 having profile information for the UE 104. This profile information may include some or all of the device capability information. In such embodiments, the media server 116 may receive identification information from the UE 104 and then retrieve the profile information from the device profile server 144. This may be done as part of session-level signaling.

在一些实施例中,UE 104可基于ProfDiff信令用附加属性补充从设备概况服务器144检索的概况信息或覆盖在它的设备能力概况中已经限定的属性。在一个示例中,这样的临时调整可通过用户偏好而触发,例如如果特定会话的用户仅想接收二维(2D)视频(即使终端能够渲染3D视频)。In some embodiments, UE 104 may supplement the profile information retrieved from device profile server 144 with additional attributes based on ProfDiff signaling or override attributes already defined in its device capability profile. In one example, such temporary adjustments may be triggered by user preferences, such as if the user of a particular session only wants to receive two-dimensional (2D) video (even if the terminal is capable of rendering 3D video).

流应用134可根据许多不同的流类型对3D视频内容编码以供在网络环境100中传输,其中每个流类型具有关联的帧类型。帧类型可以包括帧封装、联播(simulcast)或2D加辅助帧类型。The streaming application 134 may encode the 3D video content for transmission in the network environment 100 according to a number of different stream types, each of which has an associated frame type. The frame types may include frame packing, simulcast, or 2D plus auxiliary frame types.

帧封装可包括帧兼容封装格式和每视图全分辨率(FRPV)封装格式。在帧兼容分组格式(packet format)中,流应用134可在空间上将立体对的组成帧封装到单个帧内并且对该单个帧编码。由流应用126产生的输出帧包含立体对的组成帧。每个视图的原始帧的空间分辨率和封装的单个帧可以是相同的。在该情况下,流应用134可在封装操作之前对两个组成帧降采样。帧兼容封装格式可使用如分别在图4a-e中图示的垂直交错、水平交错、并排、上下排列或棋盘格式,并且可相应地进行降采样。Frame packing may include a frame-compatible packing format and a full-resolution per-view (FRPV) packing format. In the frame-compatible packet format, the streaming application 134 may spatially pack the constituent frames of a stereo pair into a single frame and encode the single frame. The output frame generated by the streaming application 126 contains the constituent frames of the stereo pair. The spatial resolution of the original frame of each view and the packed single frame may be the same. In this case, the streaming application 134 may downsample the two constituent frames before the packing operation. The frame-compatible packing format may use vertical interlaced, horizontal interlaced, side-by-side, top-and-bottom, or checkerboard formats as illustrated in Figures 4a-e, respectively, and may be downsampled accordingly.

在一些实施例中,流应用134可通过将如在H.264/AVC标准中规定的一个或多个帧封装设置补充增强信息(SEI)消息包括到位流中而指示使用的帧封装格式。流应用126可对帧解码、将来自解码器的输出帧的两个组成帧拆解、对帧过采样以恢复编码器侧降采样过程,并且在显示器128上渲染组成帧。In some embodiments, the streaming application 134 may indicate the frame packing format used by including one or more frame packing setup supplemental enhancement information (SEI) messages as specified in the H.264/AVC standard into the bitstream. The streaming application 126 may decode the frame, unpack the two component frames of the output frame from the decoder, upsample the frame to recover the encoder-side downsampling process, and render the component frames on the display 128.

FRPV封装格式可包括时间交错。在时间交错中,可以原始视频的帧率的两倍对3D视频编码,其中每个母图像和随后的图像构成立体对(左和右视图)。可典型地以高帧率进行时间交错的立体视频的渲染,其中主动(快门)眼镜用于使每只眼睛中的不正确视图混合。这可依靠眼镜与屏幕之间的准确同步。The FRPV packaging format can include temporal interleaving. In temporal interleaving, 3D video can be encoded at twice the frame rate of the original video, where each parent image and subsequent image form a stereo pair (left and right views). Rendering of temporally interleaved stereo video can typically be performed at high frame rates, with active (shutter) glasses used to blend the incorrect views in each eye. This relies on accurate synchronization between the glasses and the screen.

在使用联播帧类型的实施例中,左和右视图可在单独的联播流中传输。单独传输的流可通过流应用126结合并且被联合解码。In embodiments using the simulcast frame type, the left and right views may be transmitted in separate simulcast streams. The separately transmitted streams may be combined by the streaming application 126 and jointly decoded.

在使用2D加辅助帧类型的实施例中,2D视频内容可连同可由流应用126使用以在显示器128上渲染3D视频的辅助信息一起由流应用134发送。该辅助信息可以是,例如是2D图的深度/视差图,其中每个像素限定关联的2D视频帧中的一个或多个像素的深度/视差。In embodiments using a 2D plus auxiliary frame type, the 2D video content may be sent by the streaming application 134 along with auxiliary information that may be used by the streaming application 126 to render the 3D video on the display 128. The auxiliary information may be, for example, a depth/disparity map that is a 2D map, where each pixel defines the depth/disparity of one or more pixels in the associated 2D video frame.

在一些实施例中,可使用其他的帧类型。例如,在一些实施例中,流应用134可能够将立体视图编码为可在相同或不同的流中传输的基本视图流和非基本视图流。在一些实施例中,这可称为基于MVC的立体视频。非基本视图流可包括视图间预测帧,其提供空间/时间预测信息。基本视图流对于单视图(例如,2D)解码器可足以将基本视图渲染为2D视频,而非基本视图流可对3D解码器、例如流应用126提供足够信息来渲染3D视频。如果媒体服务器116已知UE能力,它可以省略将非基本视图流发送到不支持3D视频或不具有足够的比特率来支持3D视频的设备。In some embodiments, other frame types may be used. For example, in some embodiments, the streaming application 134 may be able to encode stereoscopic views into a base view stream and a non-base view stream that can be transmitted in the same or different streams. In some embodiments, this may be referred to as MVC-based stereoscopic video. The non-base view stream may include inter-view prediction frames that provide spatial/temporal prediction information. The base view stream may be sufficient for a single-view (e.g., 2D) decoder to render the base view as 2D video, while the non-base view stream may provide enough information for a 3D decoder, such as the streaming application 126, to render 3D video. If the media server 116 is aware of the UE capabilities, it can omit sending the non-base view stream to devices that do not support 3D video or do not have sufficient bit rate to support 3D video.

在各种实施例中,从内容管理模块120和/或设备概况服务器144传输到内容管理模块136的设备能力信息可包括3D格式属性,其包括与通过相关传输协议(例如RTP或HTTP,由流应用126支持)的立体3D视频的流相关的一个或多个格式的列表。在一些实施例中,3D格式属性可以是RTP或HTTP的流帧封装格式,其对于垂直交错具有整数值“1”、对于水平交错具有“2”、对于并排具有“3”、对于上下排列具有“4”、对于棋盘具有“0”或对于时间交错具有“5”。在一些实施例中,相同的3D格式属性可用于指示在具体文件或容器格式中支持的帧封装格式。在一些实施例中,3D格式属性可包括更广义的值,例如对于帧封装是“FP”。In various embodiments, the device capability information transmitted from the content management module 120 and/or the device profile server 144 to the content management module 136 may include a 3D format attribute that includes a list of one or more formats associated with streaming stereoscopic 3D video over a relevant transport protocol (e.g., RTP or HTTP, as supported by the streaming application 126). In some embodiments, the 3D format attribute may be a stream frame packing format for RTP or HTTP, having an integer value of "1" for vertical interleaving, "2" for horizontal interleaving, "3" for side-by-side, "4" for top-and-bottom, "0" for checkerboard, or "5" for temporal interleaving. In some embodiments, the same 3D format attribute may be used to indicate the frame packing formats supported in a specific file or container format. In some embodiments, the 3D format attribute may include a more general value, such as "FP" for frame packing.

在一些实施例中,3D格式属性可以是对联播具有值“SC”或对于2D视频加辅助信息具有值“2DA”的另一个流格式。In some embodiments, the 3D format attribute may be another stream format with a value of "SC" for simulcast or "2DA" for 2D video plus auxiliary information.

在UE 104支持超过一个格式类型的实施例中,它可进一步指示一个或多个优选的格式类型。这可以通过将格式类型按偏好的顺序列表、使偏好指示符与选择的格式类型关联等而进行。In embodiments where the UE 104 supports more than one format type, it may further indicate one or more preferred format types. This may be done by listing the format types in order of preference, associating a preference indicator with a selected format type, and the like.

在一些实施例中,除提供帧类型属性外,内容管理模块120和/或设备概况服务器144可提供一个或多个组元类型属性。这些组元类型属性可提供关于视频组元的具体类型的额外细节,这些视频组元是立体3D视频的组成元素、由流应用126支持和/或偏好。In some embodiments, in addition to providing a frame type attribute, content management module 120 and/or device profile server 144 may provide one or more component type attributes. These component type attributes may provide additional details about the specific types of video components that are components of stereoscopic 3D video and that are supported and/or preferred by streaming application 126.

组元类型属性可具有用于指示中心视点流(center-view stream)的值“C”、用于指示中心视点流和深度图的值“CD”、用于指示中心视点流和视差图的值“CP”、用于指示深度图的值“D”、用于指示视差图的值“P”、用于指示左视图流的值“L”、用于指示左视图流和深度图的值“LD”、用于指示视频帧(其包括来自左和右视图的交替扫描线)的值“LIL”、用于指示左视图流和视差图的值“LP”、用于指示右视图流的值“R”、用于指示帧顺序(例如,包括来自左和右流的交替帧-额外信令(例如AVC SEI消息)的视频流对于信号通知哪些帧包含左和右视图可以是必须的)的值“Seq”、用于指示并排的值“SbS”以及用于指示上下排列的值“TaB”。The component type attribute may have a value of "C" for indicating a center-view stream, a value of "CD" for indicating a center-view stream and a depth map, a value of "CP" for indicating a center-view stream and a disparity map, a value of "D" for indicating a depth map, a value of "P" for indicating a disparity map, a value of "L" for indicating a left-view stream, a value of "LD" for indicating a left-view stream and a depth map, a value of "LIL" for indicating a video frame (which includes alternating scan lines from left and right views), a value of "LP" for indicating a left-view stream and a disparity map, a value of "R" for indicating a right-view stream, a value of "Seq" for indicating a frame sequence (e.g., a video stream includes alternating frames from left and right streams - additional signaling (e.g., AVC SEI messages) may be necessary to signal which frames contain left and right views), a value of "SbS" for indicating side-by-side, and a value of "TaB" for indicating top-bottom.

每个格式类型属性可与组元类型属性的相应集合关联。例如,如果格式类型是SC,关联的组元类型可以是L或R,分别用于指示左和右视图。Each format type attribute can be associated with a corresponding set of component type attributes. For example, if the format type is SC, the associated component type can be L or R, indicating left and right views, respectively.

PSS规范3GPP TS 24.234中的设备能力交换信令能力使服务器能够向广泛设备提供适合于讨论中的特定设备的内容。为了提高立体3D视频内容到客户终端的分发,本公开描述新的属性集合,其可包括在PSS词汇中用于设备能力交换信令。这些提出的属性可描述客户终端的3D解码和渲染能力,其包括客户端支持哪些3D视频帧封装格式。这例如可允许服务器和网络向客户终端提供优化的RTSP SDP或DASH MPD,以及进行适当的转码和3D格式转换以便使传输的3D视频内容与客户端设备的能力匹配。The device capability exchange signaling capability in the PSS specification 3GPP TS 24.234 enables servers to deliver content tailored to the specific device in question to a wide range of devices. To improve the delivery of stereoscopic 3D video content to client terminals, this disclosure describes a new set of attributes that can be included in the PSS vocabulary for device capability exchange signaling. These proposed attributes can describe the 3D decoding and rendering capabilities of a client terminal, including which 3D video frame encapsulation formats the client supports. This can, for example, allow servers and networks to deliver optimized RTSP SDP or DASH MPD to client terminals, as well as perform appropriate transcoding and 3D format conversion to match the transmitted 3D video content to the capabilities of the client device.

支持的3D视频编解码和格式的设备能力交换信令可在3GPP TS 26.234中实现,其中PSS词汇中包括三个新属性:(1)对于流组元(streaming component),两个属性分别指示与通过RTP和HTTP的立体3D视频流相关的支持的帧封装格式的列表,以及(2)对于ThreeGPFileFormat组元,一个属性指示与立体3D视频相关的可以包括在3GPP文件格式(3GP)文件中的支持的帧封装格式的列表,其是通常用于基于3GPP的多媒体服务的多媒体容器格式。根据一些实施例,属性定义的细节在下文呈现。Device capability exchange signaling of supported 3D video codecs and formats may be implemented in 3GPP TS 26.234, where three new attributes are included in the PSS vocabulary: (1) for the streaming component, two attributes indicating a list of supported frame packing formats associated with stereoscopic 3D video streaming over RTP and HTTP, respectively, and (2) for the ThreeGPFileFormat component, one attribute indicating a list of supported frame packing formats that may be included in a 3GPP file format (3GP) file associated with stereoscopic 3D video, which is a multimedia container format commonly used for 3GPP-based multimedia services. Details of the attribute definitions are presented below, according to some embodiments.

属性名称:StreamingFramePackingFormatsRTPProperty Name: StreamingFramePackingFormatsRTP

属性定义:与PSS应用支持的通过RTP的立体3D视频流相关的支持的帧封装格式列表。在立体3D视频的范围内的帧封装格式包括:Property Definition: List of supported frame packing formats related to stereoscopic 3D video streaming over RTP supported by PSS applications. Frame packing formats within the scope of stereoscopic 3D video include:

帧兼容封装格式:1=垂直交错,2=水平交错,3=并排,4=上下排列,0=棋盘Frame compatible packing format: 1 = vertical interlaced, 2 = horizontal interlaced, 3 = side by side, 4 = top and bottom, 0 = checkerboard

每视图全分辨率的封装格式:5=时间交错Packing format for full resolution per view: 5 = Time Interleaved

组元:流Component: Flow

类型:文字(袋)Type: Text (bag)

合法值:对应于支持的帧封装格式的整数值列表Legal values: A list of integer values corresponding to supported frame encapsulation formats

分辨率规则: 附上Resolution Rules: Attached

示例:Example:

属性名称:StreamingFramePackingFormatsHTTPProperty name: StreamingFramePackingFormatsHTTP

属性定义:与PSS应用支持的通过HTTP的立体3D视频流相关的支持的帧封装格式列表。在立体3D视频的范围内的帧封装格式包括:Property Definition: List of supported frame packing formats related to stereoscopic 3D video streaming over HTTP supported by PSS applications. Frame packing formats in the scope of stereoscopic 3D video include:

帧兼容封装格式:1=垂直交错,2=水平交错,3=并排,4=上下排列,0=棋盘Frame compatible packing format: 1 = vertical interlaced, 2 = horizontal interlaced, 3 = side by side, 4 = top and bottom, 0 = checkerboard

每视图全分辨率的封装格式:5=时间交错Packing format for full resolution per view: 5 = Time Interleaved

组元:流Component: Flow

类型:文字(袋)Type: Text (bag)

合法值:对应于支持的帧封装格式的整数值列表Legal values: A list of integer values corresponding to supported frame encapsulation formats

分辨率规则: 附上Resolution Rules: Attached

示例:Example:

属性名称:ThreeGPFramePackingFormatsProperty Name: ThreeGPFramePackingFormats

属性定义:与立体3D视频相关的支持的帧封装格式的列表可以包括在3GP文件中并且由PSS应用所处理。Property Definition: A list of supported frame packing formats related to stereoscopic 3D video may be included in a 3GP file and processed by a PSS application.

组元:ThreeGPFileFormatComponent: ThreeGPFileFormat

类型:文字(袋)Type: Text (bag)

合法值:对应于支持的帧封装格式的整数值列表。整数值应该是3或4,其分别对应于并排和上下排列帧封装格式。Legal values: A list of integer values corresponding to supported frame packing formats. The integer value should be 3 or 4, which correspond to side-by-side and top-and-bottom frame packing formats respectively.

分辨率规则: 附上Resolution Rules: Attached

示例:Example:

在一些实施例中,例如如在MPD中描述的媒体呈现可包括适配集合、表示和子表示所共有的属性和元素。一个这样的元素可以是FramePacking元素。FramePacking元素可规定视频媒体组元类型的帧封装设置信息。当对视频组元未提供FramePacking元素时,帧封装可不用于视频媒体组元。In some embodiments, a media presentation, such as that described in an MPD, may include attributes and elements common to adaptation sets, representations, and sub-representations. One such element may be a FramePacking element. The FramePacking element may specify frame packing settings for a video media component type. When a FramePacking element is not provided for a video component, frame packing may not be used for the video media component.

FramePacking元素可包括@shcemeIdUri属性,其包括统一资源标识符(URI),用于识别采用的帧封装配置方案。在一些实施例中,FramePacking元素可进一步包括@value属性,用于对描述符元素提供值。The FramePacking element may include a @shcemeIdUri attribute including a Uniform Resource Identifier (URI) for identifying the frame packing configuration scheme employed. In some embodiments, the FramePacking element may further include a @value attribute for providing a value to the descriptor element.

在一些实施例中,可存在多个FramePacking元素。如果这样的话,每个元素可包含足够的信息来选择或拒绝描述的表示。In some embodiments, there may be multiple FramePacking elements. If so, each element may contain sufficient information to select or reject the described representation.

如果方案或者所有FramePacking元素的值未被识别,客户端可忽略描述的表示。客户端可在观察FramePacking元素的基础上拒绝适配集合。If the scheme or the values of all FramePacking elements are not recognized, the client may ignore the described representation. The client may refuse to adapt the set based on observing the FramePacking elements.

对于包含符合ISO/IEC信息技术-视听对象的编码-部分10:先进视频编码(ISO/IEC 14496-10:2012)的视频组元的适配集合或表示,FramePackin@schemeIdUri的统一资源号可以是For an adaptation set or representation containing video components conforming to ISO/IEC Information technology — Coding of audiovisual objects — Part 10: Advanced video coding (ISO/IEC 14496-10:2012), the URI of FramePackin@schemeIdUri can be

urn:mpeg:dash:14496:10:frame_pcking_arrangement_type:2011,其可定义成指示如由ISO/IEC 14496-10:2012(‘frame_packing_arrangement_type的定义’)的表D-8定义的帧封装设置包含在FramePacking元素中。@value可以是如在ISO/IEC 14496-10:2012的表D-8中规定的“Value”栏并且可以根据相同表格中的‘Interpretation’栏来解释。urn:mpeg:dash:14496:10:frame_pcking_arrangement_type:2011, which may be defined to indicate that the frame packing arrangement as defined by Table D-8 of ISO/IEC 14496-10:2012 ('Definition of frame_packing_arrangement_type') is contained in the FramePacking element. @value may be the 'Value' column as specified in Table D-8 of ISO/IEC 14496-10:2012 and may be interpreted according to the 'Interpretation' column in the same table.

图5图示根据一些实施例信号传递3D视频设备能力的方法500。方法500可由UE(例如,UE 104)的部件进行。在一些实施例中,UE可包括和/或访问具有存储在其上的指令的一个或多个计算机可读介质,这些指令在被执行时使UE或其部件进行方法500。FIG5 illustrates a method 500 for signaling 3D video device capabilities according to some embodiments. Method 500 may be performed by components of a UE (e.g., UE 104). In some embodiments, the UE may include and/or have access to one or more computer-readable media having instructions stored thereon that, when executed, cause the UE or its components to perform method 500.

在504,UE可确定设备能力信息。如上文描述的,该设备能力信息可包括关于媒体播放器的解码和渲染能力的信息。在一些实施例中,位于UE或别处上的内容管理模块可通过在UE上运行一个或多个脚本来直接测试能力而确定该信息。在其他实施例中,内容管理模块可访问包含相关信息的一个或多个存储文件。At 504, the UE may determine device capability information. As described above, this device capability information may include information about the decoding and rendering capabilities of the media player. In some embodiments, a content management module located on the UE or elsewhere may determine this information by directly testing the capabilities by running one or more scripts on the UE. In other embodiments, the content management module may access one or more stored files containing relevant information.

在508,UE可向媒体服务器116或设备概况服务器144提供设备能力信息,其包括在UE处的媒体播放器的立体3D视频解码和渲染能力。如上文描述的,设备能力信息可包括一个或多个格式类型属性,其代表由UE的流应用所支持的帧类型列表。在一些实施例中,可在512处的请求之前或之后提供设备能力信息。At 508, the UE may provide device capability information, including stereoscopic 3D video decoding and rendering capabilities of the media player at the UE, to the media server 116 or the device profile server 144. As described above, the device capability information may include one or more format type attributes representing a list of frame types supported by the UE's streaming application. In some embodiments, the device capability information may be provided before or after the request at 512.

在一些实施例中,设备能力信息中的一些或全部可由另一个实体(例如,设备概况服务器)提供给媒体服务器。In some embodiments, some or all of the device capability information may be provided to the media server by another entity (eg, a device profile server).

在512,UE可请求3D视频内容。在一些实施例中,请求可根据适当的流/传输协议,例如HTTP、RTP、RTSP、DASH、MBMS、PSS、IMS_PSS_MBMS等。请求可针对媒体服务器并且可包括统一资源定位符(URL)或请求的内容或其部分的一些其他指示符。在一些实施例中,对设备能力信息的临时调整(例如,经由ProfDiff信令)还可连同508处的请求一起被提供。相应地,UE可基于ProfDiff信令用附加属性补充从设备概况服务器检索的概况信息或覆盖在它的设备能力概况中已经限定的属性。在一个示例中,这样的临时调整可通过用户偏好而触发,例如如果特定会话的用户仅想接收二维(2D)视频(即使终端能够渲染3D视频)。At 512, the UE may request 3D video content. In some embodiments, the request may be based on an appropriate streaming/transport protocol, such as HTTP, RTP, RTSP, DASH, MBMS, PSS, IMS_PSS_MBMS, etc. The request may be directed to a media server and may include a uniform resource locator (URL) or some other indicator of the requested content or portion thereof. In some embodiments, temporary adjustments to the device capability information (e.g., via ProfDiff signaling) may also be provided along with the request at 508. Accordingly, the UE may supplement the profile information retrieved from the device profile server with additional attributes based on the ProfDiff signaling or override attributes already defined in its device capability profile. In one example, such temporary adjustments may be triggered by user preferences, such as if the user of a particular session only wants to receive two-dimensional (2D) video (even if the terminal is capable of rendering 3D video).

在516,UE可接收请求的3D视频内容并且在UE的显示器上渲染内容。内容的渲染可包括多种过程,例如但不限于,解码、过转换、拆解、排序等。At 516, the UE may receive the requested 3D video content and render the content on the UE's display. Rendering of the content may include various processes, such as, but not limited to, decoding, over-conversion, disassembly, sorting, and the like.

图6图示根据一些实施例信号传递3D视频内容的方法600。方法600可由媒体服务器(例如,媒体服务器116)的部件进行。在一些实施例中,媒体服务器可包括和/或访问具有存储在其上的指令的一个或多个计算机可读介质,这些指令在被执行时使媒体服务器或其部件进行方法600。FIG6 illustrates a method 600 for signaling 3D video content according to some embodiments. Method 600 may be performed by components of a media server (e.g., media server 116). In some embodiments, the media server may include and/or have access to one or more computer-readable media having instructions stored thereon that, when executed, cause the media server or components thereof to perform method 600.

在604,媒体服务器可确定设备能力信息。在一些实施例中,媒体服务器可通过接收作为会话级信令的一部分的来自UE或设备概况服务器的信息而确定设备能力信息。The media server may determine device capability information at 604. In some embodiments, the media server may determine the device capability information by receiving information from the UE or a device profile server as part of session level signaling.

在608,媒体服务器可接收对3D视频内容的请求。在一些实施例中,该请求可根据适当的流/传输协议,例如HTTP、RTP、RTSP、DASH、MBMS、PSS、IMS_PSS_MBMS等。该请求可来自UE并且可包括统一资源定位符(URL)或请求的内容或其部分的一些其他指示符。在一些实施例中,在608处接收的请求可与设备能力信息604的确定同时、在确定之前或在确定之后发生。在一些实施例中,对设备能力信息的临时调整(例如,经由ProfDiff信令)还可连同608处的请求一起被接收。相应地,媒体服务器可基于ProfDiff信令用附加属性补充从设备概况服务器检索的概况信息或覆盖在它的设备能力概况中已经限定的属性。At 608, the media server may receive a request for 3D video content. In some embodiments, the request may be in accordance with an appropriate streaming/transport protocol, such as HTTP, RTP, RTSP, DASH, MBMS, PSS, IMS_PSS_MBMS, etc. The request may be from a UE and may include a uniform resource locator (URL) or some other indicator of the requested content or portion thereof. In some embodiments, the request received at 608 may occur concurrently with, before, or after the determination of the device capability information 604. In some embodiments, temporary adjustments to the device capability information (e.g., via ProfDiff signaling) may also be received along with the request at 608. Accordingly, the media server may supplement the profile information retrieved from the device profile server with additional attributes based on the ProfDiff signaling or override attributes already defined in its device capability profile.

在612,媒体服务器可基于说明UE处的媒体播放器的立体3D视频解码和渲染能力的设备能力信息而生成会话描述和/或元数据文件以建立流会话,例如SDP文件或媒体呈现描述(MPD)。At 612 , the media server may generate a session description and/or metadata file to establish a streaming session, such as an SDP file or a media presentation description (MPD), based on device capability information describing stereoscopic 3D video decoding and rendering capabilities of a media player at the UE.

在616,媒体服务器可采用设备能力信息中如指示为由UE所支持的格式类型对3D视频内容编码。3D视频内容然后可流播到移动设备。At 616, the media server may encode the 3D video content using a format type indicated in the device capability information as supported by the UE. The 3D video content may then be streamed to the mobile device.

在本文描述的部件,例如UE 104、媒体服务器116和/或设备概况服务器144可使用任何适合的硬件和/或软件实现在系统中以根据期望配置。图7对于一个实施例图示示例系统700,其包括一个或多个处理器704、与处理器704中的至少一个耦合的系统控制逻辑708、与系统控制逻辑708耦合的系统存储器712、与系统控制逻辑708耦合的非易失性存储器(NVM)/存储716、与系统控制逻辑708耦合的网络接口720和/或与系统控制逻辑708耦合的输入/输出(I/O)设备732。Components described herein, such as UE 104, media server 116, and/or device profile server 144, can be implemented in a system using any suitable hardware and/or software to configure the system as desired. FIG7 illustrates, for one embodiment, an example system 700 that includes one or more processors 704, system control logic 708 coupled to at least one of the processors 704, system memory 712 coupled to the system control logic 708, non-volatile memory (NVM)/storage 716 coupled to the system control logic 708, a network interface 720 coupled to the system control logic 708, and/or an input/output (I/O) device 732 coupled to the system control logic 708.

处理器704可包括一个或多个单核或多核处理器。处理器704可包括通用处理器和专用处理器(例如,图形处理器、应用处理器、基带处理器等)的任何组合。The processor 704 may include one or more single-core or multi-core processors. The processor 704 may include any combination of general-purpose processors and special-purpose processors (eg, graphics processors, application processors, baseband processors, etc.).

对于一个实施例的系统控制逻辑708可包括任何适合的接口控制器,用于提供任何适合的接口给处理器704中的至少一个和/或与系统控制逻辑708通信的任何适合的设备或部件。System control logic 708 for one embodiment may include any suitable interface controller for providing any suitable interface to at least one of processors 704 and/or any suitable device or component in communication with system control logic 708 .

对于一个实施例的系统控制逻辑708可包括一个或多个存储器控制器,用于提供接口给系统存储器712。系统存储器712可用于加载和存储数据和/或指令,例如逻辑724。对于一个实施例的系统存储器712可包括任何适合的易失性存储器,例如适合的动态随机存取存储器(DRAM)等。System control logic 708 for one embodiment may include one or more memory controllers for providing an interface to system memory 712. System memory 712 may be used to load and store data and/or instructions, such as logic 724. System memory 712 for one embodiment may include any suitable volatile memory, such as a suitable dynamic random access memory (DRAM).

NVM/存储716可包括一个或多个有形的非暂时性计算机可读介质,用于存储数据和/或指令,例如逻辑724。NVM/存储716可包括任何适合的非易失性存储器,例如闪速存储器等,和/或可包括任何适合的非易失性存储设备,例如一个或多个硬盘驱动器(HDD)、一个或多个压缩盘(CD)驱动器和/或一个或多个数字多用途盘(DVD)驱动器等。NVM/storage 716 may include one or more tangible, non-transitory computer-readable media for storing data and/or instructions, such as logic 724. NVM/storage 716 may include any suitable non-volatile memory, such as flash memory, etc., and/or may include any suitable non-volatile storage devices, such as one or more hard disk drives (HDDs), one or more compact disk (CD) drives, and/or one or more digital versatile disk (DVD) drives, etc.

NVM/存储716可包括这样的存储资源,其在物理上是系统700安装在其上的设备的一部分,或者它可以能够由该设备访问但不一定是该设备的一部分。例如,可经由网络接口720通过网络访问和/或通过输入/输出(I/O)设备732访问NVM/存储716。NVM/storage 716 may include storage resources that are physically part of the device on which system 700 is installed, or that may be accessible by the device but not necessarily part of the device. For example, NVM/storage 716 may be accessible over a network via network interface 720 and/or through input/output (I/O) devices 732.

逻辑724在由处理器704中的至少一个执行时使系统进行在本文关于UE 104、媒体服务器116和/或设备概况服务器144描述的操作。逻辑724可另外/备选地设置在系统的其他部件中,例如在系统控制逻辑708中,并且可包括硬件、软件和/或固件部件的任何组合。The logic 724, when executed by at least one of the processors 704, causes the system to perform the operations described herein with respect to the UE 104, the media server 116, and/or the device profile server 144. The logic 724 may additionally/alternatively be disposed in other components of the system, such as in the system control logic 708, and may include any combination of hardware, software, and/or firmware components.

网络接口720可具有收发器722,用于提供无线电接口给系统700以通过一个或多个网络通信和/或与任何其他适合的设备通信。在各种实施例中,收发器722可与系统700的其他部件集成。例如,收发器722可包括处理器704的处理器、系统存储器712的存储器以及NVM/存储716的NVM/存储。网络接口720可包括任何适合的硬件和/或固件。网络接口720可包括多个天线,用于提供多输入多输出无线电接口。对于一个实施例的网络接口720可包括例如有线网络适配器、无线网络适配器、电话调制解调器和/或无线调制解调器。The network interface 720 may include a transceiver 722 for providing a radio interface to the system 700 for communicating over one or more networks and/or with any other suitable device. In various embodiments, the transceiver 722 may be integrated with other components of the system 700. For example, the transceiver 722 may include the processor of the processor 704, the memory of the system memory 712, and the NVM/storage of the NVM/storage 716. The network interface 720 may include any suitable hardware and/or firmware. The network interface 720 may include multiple antennas to provide a multiple-input, multiple-output radio interface. For one embodiment, the network interface 720 may include, for example, a wired network adapter, a wireless network adapter, a telephone modem, and/or a wireless modem.

对于一个实施例,处理器704中的至少一个可与系统控制逻辑708的一个或多个控制器的逻辑封装在一起。对于一个实施例,处理器704中的至少一个可与系统控制逻辑708的一个或多个控制器的逻辑封装在一起来形成系统级封装(SiP)。对于一个实施例,处理器704中的至少一个可集成在与系统控制逻辑708的一个或多个控制器的逻辑相同的芯片上。对于一个实施例,处理器704中的至少一个可集成在与系统控制逻辑708的一个或多个控制器的逻辑相同的芯片上来形成芯片上系统(SoC)。For one embodiment, at least one of the processors 704 may be packaged together with the logic of one or more controllers of the system control logic 708. For one embodiment, at least one of the processors 704 may be packaged together with the logic of one or more controllers of the system control logic 708 to form a system-in-package (SiP). For one embodiment, at least one of the processors 704 may be integrated on the same chip as the logic of one or more controllers of the system control logic 708. For one embodiment, at least one of the processors 704 may be integrated on the same chip as the logic of one or more controllers of the system control logic 708 to form a system on a chip (SoC).

在各种实施例中,I/O设备732可包括设计成使用户能够与系统700交互的用户界面、设计成使外围部件能够与系统700交互的外围部件接口和/或设计成确定涉及系统700的环境条件和/或位置信息的传感器。In various embodiments, I/O device 732 may include a user interface designed to enable a user to interact with system 700, a peripheral component interface designed to enable peripheral components to interact with system 700, and/or a sensor designed to determine environmental conditions and/or location information related to system 700.

在各种实施例中,用户界面可包括但不限于用于渲染3D视频的显示器(例如,液晶显示器、触摸屏显示器、自动立体显示器等)、扬声器、麦克风、一个或多个拍摄设备(例如,静物拍摄设备和/或视频拍摄设备)、闪光灯(例如,发光二极管闪光灯)和键盘。In various embodiments, the user interface may include, but is not limited to, a display for rendering 3D video (e.g., an LCD display, a touch screen display, an autostereoscopic display, etc.), a speaker, a microphone, one or more cameras (e.g., a still camera and/or a video camera), a flash (e.g., an LED flash), and a keyboard.

在各种实施例中,外围部件接口可包括但不限于非易失性存储器端口、通用串行总线(USB)端口、音频插口和电力供应接口。In various embodiments, the peripheral component interface may include, but is not limited to, a non-volatile memory port, a Universal Serial Bus (USB) port, an audio jack, and a power supply interface.

在各种实施例中,传感器可包括但不限于陀螺仪传感器、加速计、接近传感器、环境光传感器和定位单元。该定位单元也可以是网络接口720的一部分或与网络接口720交互来与定位网络的部件、例如全球定位系统(GPS)卫星通信。In various embodiments, sensors may include, but are not limited to, gyroscope sensors, accelerometers, proximity sensors, ambient light sensors, and positioning units. The positioning unit may also be part of or interact with network interface 720 to communicate with components of a positioning network, such as a Global Positioning System (GPS) satellite.

在各种实施例中,系统700可以是移动计算设备,例如但不限于膝上型计算设备、平板计算设备、上网本、智能电话等。在各种实施例中,系统700可具有更多或更少部件和/或不同的架构。In various embodiments, the system 700 may be a mobile computing device such as, but not limited to, a laptop computing device, a tablet computing device, a netbook, a smartphone, etc. In various embodiments, the system 700 may have more or fewer components and/or a different architecture.

根据本公开的一个方面,提供一种用于提供3D视频的媒体服务器,所述媒体服务器包括:用于从网络实体获得流帧封装格式属性的部件,所述流帧封装格式属性包括由无线通信网络的客户终端所支持的与通过所述客户终端上的分组交换流服务(PSS)应用所支持的传输协议的立体3D视频的流相关的帧封装格式的列表,其中传输协议是实时传输协议(RTP)或超文本传输协议(HTTP);用于基于所述流帧封装格式属性使内容适配的部件;用于基于所述流帧封装格式属性生成会话描述或元数据文件以建立流会话的部件;以及用于将适配的内容和生成的会话描述或元数据文件传送给所述客户终端的部件。According to one aspect of the present disclosure, a media server for providing 3D video is provided, the media server comprising: a component for obtaining stream frame encapsulation format attributes from a network entity, the stream frame encapsulation format attributes comprising a list of frame encapsulation formats supported by a client terminal of a wireless communication network and related to streaming of stereoscopic 3D video via a transport protocol supported by a packet switched streaming service (PSS) application on the client terminal, wherein the transport protocol is a real-time transport protocol (RTP) or a hypertext transfer protocol (HTTP); a component for adapting content based on the stream frame encapsulation format attributes; a component for generating a session description or metadata file based on the stream frame encapsulation format attributes to establish a streaming session; and a component for transmitting the adapted content and the generated session description or metadata file to the client terminal.

所述帧封装格式的列表包括垂直交错帧兼容封装格式、水平交错帧兼容封装格式、并排帧兼容封装格式、上下排列帧兼容封装格式、棋盘帧兼容封装格式或时间交错的每视图全分辨率封装格式的指示。The list of frame packing formats includes an indication of a vertically interlaced frame compatible packing format, a horizontally interlaced frame compatible packing format, a side-by-side frame compatible packing format, an over-under frame compatible packing format, a checkerboard frame compatible packing format, or a temporally interlaced per-view full-resolution packing format.

所述列表包括分别对应于一个或多个支持的帧封装格式的一个或多个整数值的列表。The list includes a list of one or more integer values corresponding to one or more supported frame packing formats, respectively.

所述一个或多个整数值的列表包括:1,对应于垂直交错帧兼容封装格式;2,对应于水平交错帧兼容封装格式;3,对应于并排帧兼容封装格式;4,对应于上下排列帧兼容封装格式;0,对应于棋盘帧兼容封装格式;或5,对应于时间交错的每视图全分辨率封装格式。The list of one or more integer values includes: 1, corresponding to a vertically interlaced frame-compatible packing format; 2, corresponding to a horizontally interlaced frame-compatible packing format; 3, corresponding to a side-by-side frame-compatible packing format; 4, corresponding to an over-under frame-compatible packing format; 0, corresponding to a checkerboard frame-compatible packing format; or 5, corresponding to a time-interlaced full-resolution per-view packing format.

所述传输协议包括RTP。The transmission protocol includes RTP.

所述传输协议包括HTTP。The transmission protocol includes HTTP.

所述会话描述或元数据文件是实时流协议(RTSP)会话描述协议(SDP)文件或通过超文本传输协议的动态自适应流(DASH)媒体呈现描述(MPD)文件。The session description or metadata file is a Real Time Streaming Protocol (RTSP) Session Description Protocol (SDP) file or a Dynamic Adaptive Streaming over Hypertext Transfer Protocol (DASH) Media Presentation Description (MPD) file.

内容是3D视频内容并且所述媒体服务器进一步包括:The content is 3D video content and the media server further comprises:

基于所述流帧封装格式属性将所述3D视频内容转码或转换所述3D视频内容的格式的部件。A component transcodes the 3D video content or converts the format of the 3D video content based on the stream frame encapsulation format attributes.

所述网络实体是设备概况服务器或包括所述客户终端的用户装置。The network entity is a device profile server or a user device including the client terminal.

根据本公开另一方面,提供一种用于提供3D视频的媒体服务器,所述媒体服务器包括:用于从网络实体获得帧封装格式属性的部件,所述帧封装格式属性包括由用户装置支持的与能够包括在第三代合作伙伴计划文件格式(3GP)文件中并且由所述用户装置上的分组交换流服务(PSS)应用所处理的立体3D视频相关的一个或多个帧封装格式的列表;用于基于所述帧封装格式属性将内容传送给所述用户装置的部件。According to another aspect of the present disclosure, a media server for providing 3D video is provided, the media server comprising: a component for obtaining a frame packing format attribute from a network entity, the frame packing format attribute comprising a list of one or more frame packing formats supported by a user device and related to stereoscopic 3D video that can be included in a Third Generation Partnership Project File Format (3GP) file and processed by a Packet Switched Streaming Service (PSS) application on the user device; and a component for transmitting content to the user device based on the frame packing format attribute.

所述帧封装格式的列表包括并排帧封装格式或上下排列帧封装格式的指示。The list of frame packing formats includes an indication of a side-by-side frame packing format or an over-under frame packing format.

所述列表包括分别对应于一个或多个帧封装格式的一个或多个整数值的列表。The list includes a list of one or more integer values corresponding to one or more frame packing formats, respectively.

所述一个或多个整数值的列表包括对应于并排帧封装格式的3或对应于上下排列帧封装格式的4。The list of one or more integer values includes 3 corresponding to a side-by-side frame packing format or 4 corresponding to an over-under frame packing format.

根据本公开再另一方面,提供一种用于查看3D视频的用户装置UE,所述UE包括:媒体播放器,用于对由UE无线接收的立体三维(3D)视频内容解码并且在所述UE的显示器上渲染;以及内容管理模块,用于:确定设备能力信息,其包括所述媒体播放器支持的3D视频编解码和格式;将一个或多个消息传送到媒体服务器或设备概况服务器,所述一个或多个消息包括所述设备能力信息;将至少一个消息传送到媒体服务器,所述至少一个消息包括对立体3D视频内容的请求以及关于所述设备能力信息的任何临时调整。According to yet another aspect of the present disclosure, a user equipment (UE) for viewing 3D videos is provided, the UE comprising: a media player for decoding stereoscopic three-dimensional (3D) video content wirelessly received by the UE and rendering it on a display of the UE; and a content management module for: determining device capability information including 3D video codecs and formats supported by the media player; transmitting one or more messages including the device capability information to a media server or a device profile server; and transmitting at least one message to the media server, the at least one message including a request for stereoscopic 3D video content and any temporary adjustments to the device capability information.

所述设备能力信息具有3D格式属性,其包括媒体播放器电路所支持的格式类型。The device capability information has a 3D format attribute, which includes the format types supported by the media player circuit.

所述格式类型是对应于帧兼容封装格式或每视图全分辨率封装格式的帧封装格式类型、联播格式类型或二维加辅助格式类型。The format type is a frame packing format type corresponding to a frame-compatible packing format or a per-view full-resolution packing format, a simulcast format type, or a 2D plus auxiliary format type.

所述格式类型是帧封装格式类型,其是具有用于指示垂直交错、水平交错、并排、上下排列或棋盘的值的帧兼容封装格式。The format type is a frame packing format type, which is a frame compatible packing format having a value indicating vertical interlace, horizontal interlace, side by side, top and bottom, or checkerboard.

所述3D视频内容经由分组交换流服务由移动设备无线接收。The 3D video content is wirelessly received by a mobile device via a packet-switched streaming service.

所述内容管理模块在服务发现期间在会话发起协议(SIP)SUBSCRIBE消息中传送设备能力信息到互联网协议多媒体核心网络子系统。The content management module transmits device capability information in a Session Initiation Protocol (SIP) SUBSCRIBE message to an Internet Protocol Multimedia Core Network Subsystem during service discovery.

在服务发现之后,所述内容管理模块使用SIP信令来更新支持的3D视频编解码和格式的集合。After service discovery, the content management module uses SIP signaling to update the set of supported 3D video codecs and formats.

所述媒体播放器进一步配置成:接收包括请求的立体3D视频内容的流,其中所述媒体播放器进一步配置成根据通过超文本传输协议的动态自适应流(DASH)协议、分组交换流(PSS)协议或基于互联网协议多媒体子系统(IMS)的PSS和多媒体广播/多播(MBMS)服务协议来接收所述流。The media player is further configured to receive a stream comprising the requested stereoscopic 3D video content, wherein the media player is further configured to receive the stream according to a Dynamic Adaptive Streaming over Hypertext Transfer Protocol (DASH) protocol, a Packet Switched Streaming (PSS) protocol, or an Internet Protocol Multimedia Subsystem (IMS)-based PSS and Multimedia Broadcast/Multicast Service (MBMS) protocol.

所述媒体播放器进一步配置成接收与所述流关联的会话描述协议(SDP)文件或媒体呈现描述(MPD)元数据文件。The media player is further configured to receive a Session Description Protocol (SDP) file or a Media Presentation Description (MPD) metadata file associated with the stream.

UE进一步包括:自动立体显示器,用于在所述媒体播放器的控制下渲染所述3D视频内容。The UE further includes an autostereoscopic display configured to render the 3D video content under control of the media player.

根据本公开再另一方面,提供一种用于便于呈现3D视频的设备概况服务器,所述设备概况服务器用于:获得对涉及用户装置的3D能力信息的请求;以及提供流帧封装格式属性,其包括由所述用户装置支持的与通过所述用户装置上的分组交换流服务(PSS)应用所支持的传输协议的立体3D视频的流相关的帧封装格式的列表,其中所述传输协议是实时传输协议(RTP)或超文本传输协议(HTTP)。According to yet another aspect of the present disclosure, a device profile server for facilitating presentation of 3D video is provided, the device profile server being configured to: obtain a request for 3D capability information relating to a user device; and provide a stream frame packing format attribute comprising a list of frame packing formats supported by the user device associated with streaming of stereoscopic 3D video over a transport protocol supported by a packet-switched streaming service (PSS) application on the user device, wherein the transport protocol is a real-time transport protocol (RTP) or a hypertext transfer protocol (HTTP).

所述帧封装格式的列表包括垂直交错帧兼容封装格式、水平交错帧兼容封装格式、并排帧兼容封装格式、上下排列帧兼容封装格式、棋盘帧兼容封装格式或时间交错的每视图全分辨率封装格式的指示。The list of frame packing formats includes an indication of a vertically interlaced frame compatible packing format, a horizontally interlaced frame compatible packing format, a side-by-side frame compatible packing format, an over-under frame compatible packing format, a checkerboard frame compatible packing format, or a temporally interlaced per-view full-resolution packing format.

所述传输协议包括RTP。The transmission protocol includes RTP.

所述传输协议包括HTTP。The transmission protocol includes HTTP.

所述列表包括分别对应于一个或多个支持的帧封装格式的一个或多个整数值的列表。The list includes a list of one or more integer values corresponding to one or more supported frame packing formats, respectively.

所述一个或多个整数值的列表包括:1,对应于垂直交错帧兼容封装格式;2,对应于水平交错帧兼容封装格式;3,对应于并排帧兼容封装格式;4,对应于上下排列帧兼容封装格式;0,对应于棋盘帧兼容封装格式;或5,对应于时间交错的每视图全分辨率封装格式。The list of one or more integer values includes: 1, corresponding to a vertically interlaced frame-compatible packing format; 2, corresponding to a horizontally interlaced frame-compatible packing format; 3, corresponding to a side-by-side frame-compatible packing format; 4, corresponding to an over-under frame-compatible packing format; 0, corresponding to a checkerboard frame-compatible packing format; or 5, corresponding to a time-interlaced full-resolution per-view packing format.

尽管已经在本文中为了描述的目的说明和描述某些实施例,用于实现相同目的所计算的许多种替代和/或等同实施例或实现可代替示出并且描述的实施例,而不偏离本公开的范围。该申请意在涵盖本文论述的实施例的任何适配或变化。因此,明确地规定本文描述的实施例仅由权利要求和其的等同物限制。Although certain embodiments have been illustrated and described herein for purposes of illustration, numerous alternative and/or equivalent embodiments or implementations calculated to achieve the same purpose may be substituted for the embodiments shown and described without departing from the scope of the present disclosure. This application is intended to cover any adaptation or variation of the embodiments discussed herein. Therefore, it is expressly stated that the embodiments described herein are limited only by the claims and their equivalents.

Claims (28)

1.一种要由媒体服务器采用的装置,所述装置包括:1. An apparatus for use by a media server, the apparatus comprising: 用于获得与无线通信网络的用户设备UE关联的第三代合作伙伴3GP-通过超文本传输协议的动态自适应流DASH简档的部件,所述3GP-DASH 简档用于指示与所述UE支持的立体三维3-D视频内容关联的一个或多个约束;A component for obtaining a 3G GP-DASH profile of a user equipment (UE) associated with a wireless communication network, the 3G GP-DASH profile indicating one or more constraints associated with stereoscopic 3D video content supported by the UE. 用于识别符合所获得的3GP-DASH简档并且能够用于传递至所述UE的第一媒体表示的部件;Components used to identify a first media representation that conforms to the obtained 3GP-DASH profile and can be used to transmit to the UE; 用于获得媒体表示描述MPD的部件,所述媒体表示描述MPD包括与所识别的第一媒体表示关联的信息和不符合所获得的3GP-DASH简档的第二媒体表示所关联的信息;Components for obtaining a Media Representation Description (MPD), the MPD including information associated with an identified first media representation and information associated with a second media representation that does not conform to the obtained 3GP-DASH profile; 用于基于所获得的3GP-DASH 简档来修改所述MPD以排除不符合所获得的3GP-DASH简档的所述第二媒体表示所关联的信息的部件;以及A component for modifying the MPD based on the obtained 3GP-DASH profile to exclude information associated with the second media representation that does not conform to the obtained 3GP-DASH profile; and 用于传送所修改的MPD至所述UE的部件。A component used to transmit the modified MPD to the UE. 2.如权利要求1所述的装置,其中所述3GP-DASH简档是多视图立体3D视频简档,用于指示所述UE支持多视图立体3D视频内容,其包括暂时交织的基本视图和非基本视图。2. The apparatus of claim 1, wherein the 3GP-DASH profile is a multi-view stereoscopic 3D video profile, used to indicate that the UE supports multi-view stereoscopic 3D video content, which includes temporarily interleaved basic views and non-basic views. 3.如权利要求1所述的装置,其中所述3GP-DASH简档是帧封装立体3D视频简档,用于指示所述UE支持帧封装3D视频内容,其包括在相同帧中封装的基本视图和非基本视图。3. The apparatus of claim 1, wherein the 3GP-DASH profile is a frame-encapsulated stereoscopic 3D video profile, used to indicate that the UE supports frame-encapsulated 3D video content, which includes a basic view and non-basic views encapsulated in the same frame. 4.如权利要求3所述的装置,进一步包括用于在所修改的MPD中包括帧封装元素以指示用于所述媒体表示的DASH传递的帧封装格式的类型的部件。4. The apparatus of claim 3, further comprising a component for including a frame encapsulation element in the modified MPD to indicate the type of frame encapsulation format for DASH transmission of the media representation. 5.如权利要求4所述的装置,其中所述帧封装元素指示所使用的帧封装格式的类型是垂直交织帧兼容封装格式、水平交织帧兼容封装格式、并排帧兼容封装格式、上下帧兼容封装格式或者棋盘帧兼容封装格式。5. The apparatus of claim 4, wherein the frame encapsulation element indicates that the type of frame encapsulation format used is a vertically interleaved frame compatible encapsulation format, a horizontally interleaved frame compatible encapsulation format, a side-by-side frame compatible encapsulation format, a top-bottom frame compatible encapsulation format, or a checkerboard frame compatible encapsulation format. 6.如权利要求1所述的装置,其中所修改的MPD包括与所述第一媒体表示的个体DASH表示关联的一个或多个属性,其中所述个体DASH表示包括与所述第一媒体表示的不同时间段关联的DASH表示。6. The apparatus of claim 1, wherein the modified MPD includes one or more attributes associated with an individual DASH representation of the first media representation, wherein the individual DASH representation includes DASH representations associated with different time periods of the first media representation. 7.如权利要求6所述的装置,其中所修改的MPD包括符合所述3GP-DASH简档的第一媒体表示的DASH表示所关联的属性,并且排除不符合所述3GP-DASH简档的所述第一媒体表示的一个或多个表示所关联的属性。7. The apparatus of claim 6, wherein the modified MPD includes attributes associated with a DASH representation that conforms to the first media representation of the 3GP-DASH profile, and excludes attributes associated with one or more representations that do not conform to the first media representation of the 3GP-DASH profile. 8.如权利要求1-7中任一项所述的装置,进一步包括:8. The apparatus of any one of claims 1-7, further comprising: 用于获得由所述UE传送的对于与所述第一媒体表示关联的DASH表示的超文本传输协议HTTP GET或部分GET请求的部件,以及Components for obtaining a Hypertext Transfer Protocol HTTP GET or partial GET request transmitted by the UE for a DASH representation associated with the first media representation, and 用于响应于所述HTTP GET 或部分GET请求来发送所述DASH表示给所述UE的部件。A component for sending the DASH representation to the UE in response to the HTTP GET or partial GET request. 9.如权利要求8所述的装置,其中所述DASH表示经由多媒体广播和多播服务(MBMS)被发送到所述UE。9. The apparatus of claim 8, wherein the DASH represents being transmitted to the UE via Multimedia Broadcast and Multicast Service (MBMS). 10.如权利要求1-7中任一项所述的装置,其中所述3GP-DASH简档从设备简档服务器来获得。10. The apparatus of any one of claims 1-7, wherein the 3GP-DASH profile is obtained from a device profile server. 11.一种要由用户设备UE采用的装置,所述装置包括:11. An apparatus to be used by a user equipment (UE), the apparatus comprising: 内容管理模块,用于:The content management module is used for: 经由长期演进LTE无线通信网络传送与所述UE关联的第三代合作伙伴3GP-通过超文本传输协议DASH的动态自适应流简档的标识符,所述3GP-DASH简档用于指示与所述UE支持的立体三维3-D视频内容关联的一个或多个约束;The identifier of the 3G Partner 3GP-DASH dynamic adaptive stream profile associated with the UE is transmitted via the Long Term Evolution (LTE) wireless communication network. The 3GP-DASH profile is used to indicate one or more constraints associated with stereoscopic 3D video content supported by the UE. 接收媒体表示描述MPD,所述媒体表示描述MPD包括符合所述3GP-DASH简档的第一媒体表示所关联的信息,并且排除不符合所述3GP-DASH简档的一个或多个其他媒体表示所关联的信息;以及The media representation description (MPD) is received, the MPD including information associated with a first media representation conforming to the 3GP-DASH profile, and excluding information associated with one or more other media representations that do not conform to the 3GP-DASH profile; and 传送对于与所述第一媒体表示关联的DASH表示的超文本传输协议HTTP GET 或部分GET请求;以及Transmit a Hypertext Transfer Protocol HTTP GET or partial GET request for the DASH representation associated with the first media representation; and 媒体播放器,耦合至所述内容管理模块,所述媒体播放器用于接收并且渲染所述DASH表示。A media player, coupled to the content management module, is used to receive and render the DASH representation. 12.如权利要求11所述的装置,其中所述3GP-DASH简档是多视图立体3D视频简档,用于指示所述UE支持多视图立体3D视频内容,其包括暂时交织的基本视图和非基本视图。12. The apparatus of claim 11, wherein the 3GP-DASH profile is a multi-view stereoscopic 3D video profile used to indicate that the UE supports multi-view stereoscopic 3D video content, which includes temporarily interleaved basic views and non-basic views. 13.如权利要求11所述的装置,其中所述3GP-DASH简档是帧封装立体3D视频简档,用于指示所述UE支持帧封装3D视频内容,其包括在相同帧中封装的基本视图和非基本视图。13. The apparatus of claim 11, wherein the 3GP-DASH profile is a frame-encapsulated stereoscopic 3D video profile, used to indicate that the UE supports frame-encapsulated 3D video content, which includes a basic view and non-basic views encapsulated in the same frame. 14.如权利要求13所述的装置,其中所述MPD包括帧封装元素,用于指示用于所述第一媒体表示的帧封装格式的类型。14. The apparatus of claim 13, wherein the MPD includes a frame encapsulation element for indicating the type of frame encapsulation format used for the first media representation. 15.如权利要求14所述的装置,其中所述帧封装元素指示所使用的帧封装格式的类型是垂直交织帧兼容封装格式、水平交织帧兼容封装格式、并排帧兼容封装格式、上下帧兼容封装格式或棋盘帧兼容封装格式。15. The apparatus of claim 14, wherein the frame encapsulation element indicates that the type of frame encapsulation format used is a vertically interleaved frame compatible encapsulation format, a horizontally interleaved frame compatible encapsulation format, a side-by-side frame compatible encapsulation format, a top-bottom frame compatible encapsulation format, or a checkerboard frame compatible encapsulation format. 16.如权利要求11所述的装置,其中所述MPD包括与所述第一媒体表示的个体DASH表示关联的一个或多个属性,其中所述个体表示包括与所述媒体表示的不同时间段关联的表示。16. The apparatus of claim 11, wherein the MPD includes one or more attributes associated with an individual DASH representation of the first media representation, wherein the individual representation includes representations associated with different time periods of the media representation. 17.如权利要求11所述的装置,其中所述UE用于经由多媒体广播和多播服务(MBMS)接收所述DASH表示。17. The apparatus of claim 11, wherein the UE is configured to receive the DASH representation via Multimedia Broadcast and Multicast Service (MBMS). 18.一种用户设备,包括如权利要求11-17中任一项所述的装置,进一步包括自动立体显示器,耦合到所述媒体播放器,用于显示所渲染的DASH表示。18. A user equipment comprising the means of any one of claims 11-17, further comprising an autostereoscopic display coupled to the media player for displaying a rendered DASH representation. 19.一种媒体服务器,包括:19. A media server, comprising: 内容管理电路,用于:Content management circuitry, used for: 获得与无线通信网络的用户设备UE关联的第三代合作伙伴3GP-通过超文本传输协议DASH的动态自适应流简档,其中所述3GP-DASH简档是多视图立体三维3D视频简档,用于指示所述UE支持多视图立体3D视频内容,其包括暂时交织的基本视图和非基本视图,或者所述3GP-DASH简档是帧封装立体3D视频简档,用于指示所述UE支持帧封装3D视频内容,其包括在相同帧中封装的基本视图和非基本视图;以及Obtain a third-generation partner 3GP-DASH dynamically adaptive streaming profile associated with a user equipment (UE) in a wireless communication network, wherein the 3GP-DASH profile is a multi-view stereoscopic 3D video profile indicating that the UE supports multi-view stereoscopic 3D video content, including temporarily interleaved basic and non-basic views, or the 3GP-DASH profile is a frame-encapsulated stereoscopic 3D video profile indicating that the UE supports frame-encapsulated 3D video content, including basic and non-basic views encapsulated within the same frame; and 基于所获得的3GP-DASH简档生成媒体表示描述MPD,所述媒体表示描述MPD包括符合所述3GP-DASH简档的媒体表示的第一DASH表示所关联的一个或多个属性,并且排除不符合所述3GP-DASH简档的媒体表示的第二DASH表示所关联的属性;以及A Media Representation Description (MPD) is generated based on the obtained 3GP-DASH profile. The MPD includes one or more attributes associated with a first DASH representation that conforms to the 3GP-DASH profile, and excludes attributes associated with a second DASH representation that does not conform to the 3GP-DASH profile. 传送所生成的MPD至所述UE,以及The generated MPD is transmitted to the UE, and 内容传递电路,耦合到所述内容管理电路,用于传递与所述第一DASH表示关联的3D视频内容至所述UE。A content delivery circuit, coupled to the content management circuit, is used to deliver 3D video content associated with the first DASH representation to the UE. 20.如权利要求19所述的媒体服务器,其中所述MPD包括帧封装元素,用于指示用于所述第一DASH表示的帧封装格式的类型。20. The media server of claim 19, wherein the MPD includes a frame encapsulation element for indicating the type of frame encapsulation format used for the first DASH representation. 21.如权利要求20所述的媒体服务器,其中所述帧封装元素指示所使用的帧封装格式的类型是垂直交织帧兼容封装格式、水平交织帧兼容封装格式、并排帧兼容封装格式、上下帧兼容封装格式或棋盘帧兼容封装格式。21. The media server of claim 20, wherein the frame encapsulation element indicates that the type of frame encapsulation format used is a vertically interlaced frame compatible encapsulation format, a horizontally interlaced frame compatible encapsulation format, a side-by-side frame compatible encapsulation format, a top-bottom frame compatible encapsulation format, or a checkerboard frame compatible encapsulation format. 22.如权利要求19-21中任一项所述的媒体服务器,其中所述内容管理电路用于从所述UE接收标识符,其中所述内容管理电路用于基于所述标识符从设备简档服务器获得所述3GP-DASH简档。22. The media server of any one of claims 19-21, wherein the content management circuitry is configured to receive an identifier from the UE, wherein the content management circuitry is configured to obtain the 3GP-DASH profile from the device profile server based on the identifier. 23.一种要由用户设备UE采用的装置,所述装置包括:23. An apparatus for use by a user equipment (UE), the apparatus comprising: 用于经由长期演进LTE无线通信网络传送与所述UE关联的第三代合作伙伴3GP-通过超文本传输协议DASH的动态自适应流简档的标识符的部件,所述3GP-DASH简档用于指示与所述UE支持的立体三维3-D视频内容关联的一个或多个约束;A component for transmitting, via a Long Term Evolution (LTE) wireless communication network, an identifier of a 3G Partner 3GP-DASH dynamic adaptive stream profile associated with the UE, the 3GP-DASH profile indicating one or more constraints associated with stereoscopic 3D video content supported by the UE. 用于接收媒体表示描述MPD的部件,所述媒体表示描述MPD包括符合所述3GP-DASH简档的媒体表示的第一个体DASH表示所关联的一个或多个属性并且排除不符合所述3GP-DASH简档的所述媒体表示的第二个体DASH表示所关联的一个或多个属性;A component for receiving a Media Representation Description (MPD), the MPD including one or more attributes associated with a first individual DASH representation that conforms to the 3GP-DASH profile and excluding one or more attributes associated with a second individual DASH representation that does not conform to the 3GP-DASH profile. 用于传送对于所述媒体表示的所述第一个体DASH表示的超文本传输协议HTTP GET或部分GET请求的部件;Components for transmitting a Hypertext Transfer Protocol HTTP GET or partial GET request for the first individual DASH representation of the media representation; 用于获得所述第一个体DASH表示的部件;以及Components for obtaining the DASH representation of the first individual; and 用于渲染所获得的所述第一个体DASH表示的部件。Used to render the component represented by the first individual DASH obtained. 24.如权利要求23所述的装置,其中所述3GP-DASH简档是多视图立体3D视频简档,用于指示所述UE支持多视图立体3D视频内容,其包括暂时交织的基本视图和非基本视图。24. The apparatus of claim 23, wherein the 3GP-DASH profile is a multi-view stereoscopic 3D video profile used to indicate that the UE supports multi-view stereoscopic 3D video content, which includes temporarily interleaved basic views and non-basic views. 25.如权利要求23所述的装置,其中所述3GP-DASH简档是帧封装立体3D视频简档,用于指示所述UE支持帧封装3D视频内容,其包括在相同帧中封装的基本视图和非基本视图。25. The apparatus of claim 23, wherein the 3GP-DASH profile is a frame-encapsulated stereoscopic 3D video profile, used to indicate that the UE supports frame-encapsulated 3D video content, which includes a basic view and non-basic views encapsulated in the same frame. 26.如权利要求25所述的装置,其中所述MPD包括帧封装元素,用于指示用于所述媒体表示的帧封装格式的类型。26. The apparatus of claim 25, wherein the MPD includes a frame encapsulation element for indicating the type of frame encapsulation format used for the media representation. 27.如权利要求26所述的装置,其中所述帧封装元素指示所使用的帧封装格式的类型是垂直交织帧兼容封装格式、水平交织帧兼容封装格式、并排帧兼容封装格式、上下帧兼容封装格式或棋盘帧兼容封装格式。27. The apparatus of claim 26, wherein the frame encapsulation element indicates that the type of frame encapsulation format used is a vertically interleaved frame compatible encapsulation format, a horizontally interleaved frame compatible encapsulation format, a side-by-side frame compatible encapsulation format, a top-bottom frame compatible encapsulation format, or a checkerboard frame compatible encapsulation format. 28.如权利要求23所述的装置,其中所述MPD排除不符合所述3GP-DASH简档的一个或多个其他媒体表示所关联的信息。28. The apparatus of claim 23, wherein the MPD excludes information associated with one or more other media representations that do not conform to the 3GP-DASH profile.
HK17102052.2A 2012-04-09 2017-02-24 Signaling three-dimensional video information in communication networks HK1228623B (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US61/621939 2012-04-09
US61/679627 2012-08-03
US13/626767 2012-09-25

Publications (2)

Publication Number Publication Date
HK1228623A1 HK1228623A1 (en) 2017-11-03
HK1228623B true HK1228623B (en) 2020-07-03

Family

ID=

Similar Documents

Publication Publication Date Title
US10194134B2 (en) Signaling three-dimensional video information in communication networks
AU2013246041B2 (en) Signaling three dimensional video information in communication networks
HK1228623B (en) Signaling three-dimensional video information in communication networks
HK1228623A1 (en) Signaling three-dimensional video information in communication networks
HK1187769B (en) Signaling three-dimensional video information in communication networks
FI20225190A1 (en) Signaling three-dimensional video information in communication networks
BR112014024061B1 (en) COMPUTER READABLE MEDIUM, METHOD, DEVICE AND SYSTEM FOR SIGNING THREE-DIMENSIONAL VIDEO INFORMATION ON COMMUNICATION NETWORKS