CN109102815B - Encoding device and method, transcoding method and transcoder, and non-transitory medium - Google Patents
Encoding device and method, transcoding method and transcoder, and non-transitory medium Download PDFInfo
- Publication number
- CN109102815B CN109102815B CN201810949078.6A CN201810949078A CN109102815B CN 109102815 B CN109102815 B CN 109102815B CN 201810949078 A CN201810949078 A CN 201810949078A CN 109102815 B CN109102815 B CN 109102815B
- Authority
- CN
- China
- Prior art keywords
- metadata
- frame
- inbound
- outbound
- encoded
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 97
- 238000012545 processing Methods 0.000 description 78
- 230000003111 delayed effect Effects 0.000 description 32
- 238000012986 modification Methods 0.000 description 25
- 230000004048 modification Effects 0.000 description 25
- 230000010076 replication Effects 0.000 description 14
- 238000009826 distribution Methods 0.000 description 12
- 238000009877 rendering Methods 0.000 description 11
- 230000005236 sound signal Effects 0.000 description 10
- 238000012795 verification Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 8
- 230000005540 biological transmission Effects 0.000 description 5
- 238000003860 storage Methods 0.000 description 5
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000010606 normalization Methods 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000009432 framing Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000013475 authorization Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/68—Systems specially adapted for using specific information, e.g. geographical or meteorological information
- H04H60/73—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L9/00—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
- H04L9/32—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials
- H04L9/3236—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials using cryptographic hash functions
- H04L9/3242—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials using cryptographic hash functions involving keyed hash functions, e.g. message authentication codes [MACs], CBC-MAC or HMAC
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/12—Applying verification of the received information
- H04L63/123—Applying verification of the received information received data contents, e.g. message integrity
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/61—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
- H04L65/612—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for unicast
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/70—Media network packetisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/75—Media network packet handling
- H04L65/765—Media network packet handling intermediate
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/40—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2355—Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/4355—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Computer Security & Cryptography (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Power Engineering (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Theoretical Computer Science (AREA)
- Library & Information Science (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
本发明申请为申请日为2014年1月15日并于2015年7月17日进入中国国家阶段的发明名称为“元数据转码”的第201480005287.5号发明专利申请的分案申请。This invention application is a divisional application of the invention patent application No. 201480005287.5, titled "Metadata Transcoding", which was filed on January 15, 2014 and entered the Chinese national phase on July 17, 2015.
相关申请的交叉引用Cross-references to related applications
本申请要求于2013年1月21日提交的美国临时专利申请第61/754,893号的优先权,上述申请的全部内容通过引用合并到本文中。This application claims priority from U.S. Provisional Patent Application No. 61/754,893, filed on January 21, 2013, the entire contents of which are incorporated herein by reference.
技术领域Technical field
本文献涉及对元数据进行转码。具体地,本文献涉及用于以降低的计算复杂度对元数据进行转码的方法及系统。This document deals with transcoding metadata. In particular, this document relates to methods and systems for transcoding metadata with reduced computational complexity.
背景技术Background technique
当前,使用各种单通道和/或多通道音频渲染系统如5.1多声道音频渲染系统、7.1多声道音频渲染系统或9.1多声道音频渲染系统。音频渲染系统使得能够例如生成分别源于5+1扬声器位置、7+1扬声器位置或9+1扬声器位置的环绕声。为了高效传输或者为了高效存储相对应的单通道音频信号或多通道音频信号,使用音频编解码器(编码器/解码器)系统如杜比数字(DD)或杜比数字+(DD+)。Currently, various single-channel and/or multi-channel audio rendering systems are used such as 5.1 multi-channel audio rendering systems, 7.1 multi-channel audio rendering systems or 9.1 multi-channel audio rendering systems. The audio rendering system enables, for example, the generation of surround sound originating from 5+1 speaker positions, 7+1 speaker positions or 9+1 speaker positions respectively. For efficient transmission or for efficient storage of corresponding single-channel audio signals or multi-channel audio signals, audio codec (encoder/decoder) systems such as Dolby Digital (DD) or Dolby Digital Plus (DD+) are used.
可能存在音频渲染装置的重要安装基础,该音频渲染装置被配置成使用特定音频编解码器系统(例如,杜比数字)对已编码的音频信号进行解码。该特定音频编解码器系统可以例如称为第二音频编解码器。另一方面,音频编解码器系统的演变可以产生更新的音频编解码器系统(例如,杜比数字+),该更新的音频编解码器系统可以例如称为第一音频编解码器系统。更新的音频编解码器系统可以提供另外的特征(例如,数量增加的通道)和/或提高的编码质量。这样,内容提供者可以倾向于根据更新的音频编解码器系统来提供他们的内容。There may be a significant installed base of audio rendering devices configured to decode encoded audio signals using a specific audio codec system (eg, Dolby Digital). This particular audio codec system may for example be called a second audio codec. On the other hand, the evolution of the audio codec system may result in a newer audio codec system (eg, Dolby Digital Plus), which may, for example, be referred to as a first audio codec system. Newer audio codec systems may provide additional features (eg, increased number of channels) and/or improved encoding quality. In this way, content providers can tend to serve their content according to newer audio codec systems.
尽管如此,具有采用第二音频编解码器系统的解码器的音频渲染装置的用户应该仍然能够渲染已经根据第一音频编解码器系统编码的音频内容。这可以通过如下所谓的转码器或转换器来实现:该所谓的转码器或转换器被配置成将根据第一音频编解码器系统编码的音频内容转换成根据第二音频编解码器系统编码的经修改的音频内容。Nonetheless, users with audio rendering devices that employ decoders of the second audio codec system should still be able to render audio content that has been encoded according to the first audio codec system. This may be achieved by a so-called transcoder or converter configured to convert audio content encoded according to a first audio codec system into a second audio codec system. Encoded modified audio content.
沿着音频内容的分发链,可能产生对转码的另外的需求。内容提供者可以使用非常适用于音频内容的生产和广播的音频编解码器(如杜比E音频编解码器)来对音频内容进行编码。可以使用该面向生产的音频编解码器来分发音频内容,并且可以根据第二音频编解码器(如无损编解码器杜比TrueHD或如杜比数字+或杜比数字编解码器)对音频内容进行转码。Along the distribution chain of audio content, additional needs for transcoding may arise. Content providers can encode audio content using audio codecs that are well suited for the production and broadcast of audio content, such as the Dolby E audio codec. Audio content can be distributed using this production-oriented audio codec and can be modified based on a second audio codec such as a lossless codec such as Dolby TrueHD or a codec such as Dolby Digital Plus or Dolby Digital Transcode.
通常,音频内容与比特流中编码的表示音频内容的元数据相关联。通常,将音频内容分离成帧序列,其中,音频内容的每个帧包括预定数量的样本(例如,1024个样本)。所述帧序列中的帧可以与相应的元数据容器(container)或元数据帧相关联。元数据容器可以表示描述与该容器相关联的音频内容的帧的信息。这样的描述帧的信息的示例可以是关于帧的一些样本或所有样本的响度数据。替代地或附加地,元数据容器可以用于传输可以与相对应的音频内容的帧不直接相关联的辅助数据。这样的辅助数据可以例如用于以固件升级来提供音频编解码器系统的解码器。Typically, audio content is associated with metadata representing the audio content encoded in the bitstream. Typically, the audio content is separated into a sequence of frames, where each frame of the audio content includes a predetermined number of samples (eg, 1024 samples). Frames in the sequence of frames may be associated with corresponding metadata containers or metadata frames. A metadata container may represent information describing frames of audio content associated with the container. An example of such information describing a frame may be loudness data about some or all samples of the frame. Alternatively or additionally, a metadata container may be used to transport auxiliary data that may not be directly associated with the corresponding frame of audio content. Such auxiliary data may be used, for example, to provide a decoder of an audio codec system with a firmware upgrade.
除了将音频内容从第一音频编解码器系统转码至第二音频编解码器系统以外,转码器通常还需要对相关联的元数据进行转码。为了降低转码器/转换器(其例如在机顶盒内实现)的成本,第一音频编解码器系统与第二音频编解码器系统之间转换的计算复杂度应当较低。在对元数据进行转码的情况下,第一音频编解码器系统与第二音频编解码器系统之间转换的计算复杂度也应当较低。在本文献中,描述了用于转码的方法和系统,该方法和系统能够使得以降低的计算复杂度对元数据进行转码。In addition to transcoding audio content from a first audio codec system to a second audio codec system, the transcoder typically also needs to transcode associated metadata. In order to reduce the cost of the transcoder/converter (which is implemented within a set-top box for example), the computational complexity of the conversion between the first audio codec system and the second audio codec system should be low. In the case of transcoding metadata, the computational complexity of the conversion between the first audio codec system and the second audio codec system should also be low. In this document, methods and systems for transcoding are described which enable metadata to be transcoded with reduced computational complexity.
发明内容Contents of the invention
根据一个方面,描述了一种转码器,其被配置成将入站比特流转码成出站比特流。入站比特流可以包括入站内容帧和相关联的入站元数据帧。相关联的入站元数据帧可以包括在紧接着入站内容帧或在入站内容帧紧前的入站比特流中。这样,术语“相关联的”可以表示内容帧和元数据帧之间的时间关系(例如,该术语可以表示内容帧在元数据帧紧前或者反之)。应当注意,在一些实施方式中,相关联的入站元数据帧可以包括在入站内容帧中。通常,内容帧包括第一元素(例如,同步字段)和最后元素(例如,错误纠正字段如CRC字段)。相关联的元数据帧可以位于内容帧的如下字段中,该字段布置在内容帧的第一元素之后并且在内容帧的最后元素之前(例如,在内容帧的辅助数据字段中)。According to one aspect, a transcoder is described that is configured to transcode an inbound bitstream into an outbound bitstream. The inbound bitstream may include inbound content frames and associated inbound metadata frames. The associated inbound metadata frame may be included in the inbound bitstream immediately following or immediately preceding the inbound content frame. As such, the term "associated" may mean a temporal relationship between a content frame and a metadata frame (eg, the term may mean that a content frame immediately precedes a metadata frame or vice versa). It should be noted that in some implementations, the associated inbound metadata frame may be included in the inbound content frame. Typically, a content frame includes a first element (eg, a synchronization field) and a last element (eg, an error correction field such as a CRC field). The associated metadata frame may be located in a field of the content frame arranged after the first element of the content frame and before the last element of the content frame (eg, in an ancillary data field of the content frame).
元数据帧可以是所谓的演化帧。通常,入站比特流包括入站内容帧的序列和相关联的入站元数据帧的序列。通常,入站元数据帧与入站内容帧交替,以使得特定的入站内容帧的相关联的元数据帧直接跟着该特定的入站内容帧。以与入站比特流(也称为编码入站比特流)类似的方式,出站比特流(或编码出站比特流)可以包括出站内容帧和相关联的出站元数据帧。具体地,出站比特流可以包括交替的出站内容帧序列和出站元数据帧序列。Metadata frames can be so-called evolved frames. Typically, an inbound bitstream includes a sequence of inbound content frames and a sequence of associated inbound metadata frames. Typically, inbound metadata frames alternate with inbound content frames such that a particular inbound content frame's associated metadata frame directly follows that particular inbound content frame. In a similar manner to the inbound bitstream (also called encoded inbound bitstream), the outbound bitstream (or encoded outbound bitstream) may include outbound content frames and associated outbound metadata frames. Specifically, the outbound bitstream may include an alternating sequence of outbound content frames and a sequence of outbound metadata frames.
内容帧可以表示根据特定编解码器方案编码的信号。具体地,入站内容帧可以表示根据第一编解码器系统编码的信号,而出站内容帧可以表示根据第二编解码器系统编码的信号。第一音频编解码器系统和第二音频编解码器系统可以相同(在这种情况下,转码器可以被配置成提供比特率转换)或者第一音频编解码器系统和第二音频编解码器系统可以不同(在这种情况下,转码器可以被配置成提供编解码转换)。信号可以包括音频信号。第一编解码器系统和第二编解码器系统的示例有杜比E、杜比数字+、杜比数字、杜比TrueHD、杜比睿波(Dolby Pulse)、AAC(高级音频编码)和/或HE-AAC(高效率AAC)。在第一编解码器系统和第二编解码器系统不同的情况下,转码器可以被配置成:将信号内容从第一编解码器系统转码至第二编解码器系统。替代地或附加地,出站比特流的比特率可以与入站比特流的比特率不同,并且转码器可以被配置成执行编码信号内容从第一比特率转码至第二(不同的)比特率的转码。Content frames may represent signals encoded according to a specific codec scheme. In particular, inbound content frames may represent signals encoded according to a first codec system, and outbound content frames may represent signals encoded according to a second codec system. The first audio codec system and the second audio codec system may be the same (in which case the transcoder may be configured to provide bitrate conversion) or the first audio codec system and the second audio codec system The transcoder system may be different (in which case the transcoder may be configured to provide codec conversion). The signal may include an audio signal. Examples of first and second codec systems are Dolby E, Dolby Digital Plus, Dolby Digital, Dolby TrueHD, Dolby Pulse, AAC (Advanced Audio Coding) and/or HE -AAC (High Efficiency AAC). Where the first codec system and the second codec system are different, the transcoder may be configured to transcode the signal content from the first codec system to the second codec system. Alternatively or additionally, the bitrate of the outbound bitstream may be different from the bitrate of the inbound bitstream, and the transcoder may be configured to perform transcoding of the encoded signal content from a first bitrate to a second (different) Bitrate transcoding.
通常,将信号表示为包括预定数量的信号的样本(例如,512或1024个信号的样本)的帧序列。这样,入站内容帧可以表示信号的帧的一些或所有样本。出站内容帧可以表示信号的同一帧的一些或所有样本。这样,转码器可以被配置成生成出站内容帧,该出站内容帧表示相对应的入站内容帧的至少一些样本。Typically, a signal is represented as a sequence of frames including a predetermined number of samples of the signal (eg, 512 or 1024 samples of the signal). In this way, the inbound content frame may represent some or all samples of the signal's frames. Outbound content frames can represent some or all samples of the same frame of the signal. As such, the transcoder may be configured to generate outbound content frames that represent at least some samples of corresponding inbound content frames.
为了将入站比特流转码成出站比特流,转码器可以包括解码器,其被配置成根据第一编解码器系统对入站比特流进行解码。作为解码的结果,解码器可以提供每个内容帧的一组PCM样本。此外,解码器可以被配置成从元数据帧中提取元数据。可以将经解码的入站比特流(例如,一组PCM样本和提取的元数据)提供给被配置成根据第二编解码器系统对信号进行编码的编码器,从而提供出站比特流。这样,转码器可以被配置成:使用第一编解码器系统的解码器和第二编解码器系统的编码器,根据入站内容帧生成出站内容帧。转码器可以包括所谓的PCM连接转码器,其中,解码器将PCM样本组传送至转码器的编码器。这样,本文中描述的转码器可以包括在PCM连接转码器的背景中描述的特征。To transcode the inbound bitstream into the outbound bitstream, the transcoder may include a decoder configured to decode the inbound bitstream according to the first codec system. As a result of decoding, the decoder can provide a set of PCM samples for each content frame. Additionally, the decoder may be configured to extract metadata from the metadata frame. The decoded inbound bitstream (eg, a set of PCM samples and extracted metadata) may be provided to an encoder configured to encode the signal according to the second codec system, thereby providing an outbound bitstream. As such, the transcoder may be configured to generate outbound content frames from the inbound content frames using the decoder of the first codec system and the encoder of the second codec system. The transcoder may comprise a so-called PCM connected transcoder, in which a decoder passes groups of PCM samples to the encoder of the transcoder. As such, the transcoders described herein may include features described in the context of PCM connected transcoders.
应当注意,内容帧也可以表示根据隐含的编解码器系统的元数据。换言之,内容帧可以包括与内容帧中包括的信号相关联的元数据,其中,内容帧中包括的元数据由隐含的编解码器系统(即,第一编解码器系统或第二编解码器系统)来限定。与此形成对比,元数据帧能够使得对附加元数据(除了由编解码器系统指定的元数据以外)进行传输。这样的元数据的示例有响度或对白标准化参数或辅助数据如音频内容分发链中的解码器的固件升级。It should be noted that content frames may also represent metadata according to the underlying codec system. In other words, the content frame may include metadata associated with a signal included in the content frame, wherein the metadata included in the content frame is configured by an implicit codec system (i.e., a first codec system or a second codec system). system). In contrast, metadata frames enable the transmission of additional metadata (in addition to the metadata specified by the codec system). Examples of such metadata are loudness or dialogue normalization parameters or ancillary data such as firmware upgrades for decoders in the audio content distribution chain.
元数据帧可以遵循预定的语法。具体地,入站元数据帧和出站元数据帧可以遵循共同的语法。元数据帧的语法可以使得元数据帧能够包括零个元数据块、一个或更多个元数据块。每个元数据块可以包括特定类型的元数据。这样,元数据帧可以具体可变的大小,取决于包括在元数据帧中的元数据的量和/或元数据块的数量。每个元数据块可以表示(或可以包括)相对应的描述符,该描述符表示相对应的元数据块中包括的元数据的一个或更多个属性。具体地,描述符可以描述表示可以或应该如何操纵元数据块的属性。这样,转码器可以使用块的描述符来以计算高效的方式对入站元数据帧中包括的块进行转码。Metadata frames can follow a predetermined syntax. Specifically, inbound metadata frames and outbound metadata frames may follow a common syntax. The syntax of the metadata frame may be such that the metadata frame can include zero metadata blocks, one or more metadata blocks. Each metadata block can include a specific type of metadata. As such, the metadata frame may be of variable size, depending on the amount of metadata and/or the number of metadata blocks included in the metadata frame. Each metadata block may represent (or may include) a corresponding descriptor that represents one or more attributes of the metadata included in the corresponding metadata block. Specifically, the descriptor may describe attributes that represent how the metadata block can or should be manipulated. This way, the transcoder can use the chunk's descriptor to transcode the chunks included in the inbound metadata frame in a computationally efficient manner.
为了对元数据帧进行转码,转码器可以被配置成:从入站元数据帧中识别元数据入站块。可以使用块标识符来标识入站块。作为示例,可以使用块标识符来标识元数据帧的每个块。此外,元数据帧可以包括特定的块标识符,该特定的块标识符表示以下事实:元数据帧不包括任何另外的块(例如称为结束标识符)。转码器可以使用结束标识符来确定元数据帧不包括任何另外的元数据块。To transcode a metadata frame, the transcoder may be configured to identify an inbound block of metadata from the inbound metadata frame. Inbound blocks can be identified using block identifiers. As an example, each chunk of the metadata frame can be identified using a chunk identifier. Furthermore, the metadata frame may include a specific chunk identifier that represents the fact that the metadata frame does not include any further chunks (eg, called an end identifier). The transcoder can use the end identifier to determine that the metadata frame does not include any additional metadata chunks.
如上所述,元数据入站块可以与被称为入站描述符的描述符相关联。入站描述符可以表示元数据入站块中包括的元数据的一个或更多个属性。可以将描述符写入元数据块的数据字段中。描述符内包括的示例属性是表示信号的样本的时间戳参数。具体地,时间戳参数可以表示入站块的元数据与信号的由时间戳参数标识的样本相关联(例如,要被应用于该样本)。时间戳参数可以通过表示内容帧中的样本相对于内容帧的结束或相对于内容帧的开始的位置来标识样本。另外的示例是表示信号的数个样本的持续时间参数。持续时间参数可以表示:入站块的元数据与由持续时间参数表示的信号的数个样本(开始于由时间戳参数表示的样本)相关联。具体地,持续时间参数可以表示:元数据要被应用于在由时间戳参数表示的样本之后的数个样本,其中,所述数个样本由持续时间参数来表示。时间戳参数和/或持续时间参数可以用来例如表示:入站块的元数据(例如,响度值)可应用于在相关联的入站内容帧中编码的信号的哪些样本。作为示例,入站元数据帧可以包括多个入站块,该多个入站块表示在入站内容帧中被编码的不同组的信号样本的不同的响度值。As mentioned above, metadata inbound blocks may be associated with descriptors called inbound descriptors. An inbound descriptor may represent one or more properties of metadata included in the metadata inbound block. The descriptor can be written into the data field of the metadata block. An example attribute included within the descriptor is a timestamp parameter representing a sample of the signal. In particular, the timestamp parameter may represent the metadata of the inbound block associated with (eg, to be applied to) a sample of the signal identified by the timestamp parameter. The timestamp parameter may identify the sample by representing the position of the sample in the content frame relative to the end of the content frame or relative to the beginning of the content frame. A further example is a duration parameter representing several samples of a signal. The duration parameter may represent that the metadata of the inbound block is associated with a number of samples of the signal represented by the duration parameter (starting from the sample represented by the timestamp parameter). Specifically, the duration parameter may represent that the metadata is to be applied to a number of samples following the sample represented by the timestamp parameter, wherein the number of samples is represented by the duration parameter. The timestamp parameter and/or the duration parameter may be used, for example, to indicate to which samples of the signal encoded in the associated inbound content frame the metadata (eg, loudness value) of the inbound block may be applied. As an example, an inbound metadata frame may include multiple inbound blocks representing different loudness values for different sets of signal samples that are encoded in the inbound content frame.
描述符内表示(或包括)的属性的另一示例是转码器参数,该转码器参数表示是否要将入站块转码成出站比特流。作为示例,可以使用转码器参数来表示:入站块中包括的元数据仅适用于第一编解码器系统。这样,转码器可以被配置成:如果出站比特流根据与第一编解码器系统不同的第二编解码器系统来编码,则丢弃入站块中包括的元数据。Another example of an attribute represented (or included) within the descriptor is a transcoder parameter that indicates whether the inbound chunks are to be transcoded into an outbound bitstream. As an example, transcoder parameters can be used to indicate that the metadata included in the inbound block is only applicable to the first codec system. In this way, the transcoder may be configured to discard metadata included in the inbound chunks if the outbound bitstream is encoded according to a second codec system different from the first codec system.
描述符中包括的属性的又一示例是复制参数,该复制参数表示是否要将入站块的元数据包括在根据入站元数据帧生成的每个出站元数据帧中。以类似的方式,可以使用去复制参数作为如下属性:该属性表示如果根据多个入站元数据帧生成出站元数据帧,则入站块的元数据是否要被转码器丢弃。在入站比特流和出站比特流的成帧不同的情况下,转码器可以使用复制参数和/或去复制参数。Yet another example of an attribute included in the descriptor is the replication parameter, which indicates whether the inbound block's metadata is to be included in each outbound metadata frame generated from the inbound metadata frame. In a similar way, the deduplication parameter can be used as an attribute that indicates whether the metadata for the inbound chunk is to be discarded by the transcoder if an outbound metadata frame is generated from multiple inbound metadata frames. In cases where the inbound and outbound bitstreams are framed differently, the transcoder may use duplication parameters and/or deduplication parameters.
属性的再一示例是优先级参数,该优先级参数表示入站块的元数据相对于元数据的一个或更多个其他入站块的重要性。与入站比特流相比,在可以仅将减少的元数据量插入至出站比特流中的情况下,转码器可以使用优先级参数。属性的另外的示例是关联参数,该关联参数表示是否可以将入站块的元数据插入至在出站元数据帧之后的延迟的出站元数据帧中。这样,关联参数向转码器提供转码处理中的附加灵活性,原因是:转码器可以以高效的方式确定可以延迟哪个入站块以及必须保留与相关联的内容帧相关联的哪些入站块。Yet another example of an attribute is a priority parameter that represents the importance of an inbound block of metadata relative to one or more other inbound blocks of metadata. The transcoder can use priority parameters where only a reduced amount of metadata can be inserted into the outbound bitstream compared to the inbound bitstream. A further example of a property is an association parameter indicating whether the inbound block's metadata can be inserted into a delayed outbound metadata frame following the outbound metadata frame. In this way, the correlation parameters provide the transcoder with additional flexibility in the transcoding process because the transcoder can determine in an efficient manner which inbound chunks can be delayed and which inbound blocks associated with the associated content frame must be retained. Station blocks.
属性的另一个示例是PCM处理参数,该PCM处理参数表示在进行了对入站内容帧中包括的数据的修改的情况下是否要由转码器丢弃入站块的元数据。具体地,PCM处理参数可以向转码器指示:即使入站内容帧的数据(例如,入站内容帧中包括的信号样本)已经被修改,也要将入站块的元数据包括在出站元数据帧中。这可以是下述情况,例如当入站块包括与入站内容帧中包括的数据不相关的有效载荷如二进制数据或如附加比特流时。PCM处理参数与所谓的PCM连接转码器特别有关。Another example of an attribute is a PCM processing parameter that indicates whether the metadata of an inbound chunk is to be discarded by the transcoder if modifications to the data included in the inbound content frame are made. Specifically, the PCM processing parameters may indicate to the transcoder that the metadata of the inbound block is to be included in the outbound content frame even if the data of the inbound content frame (e.g., the signal samples included in the inbound content frame) have been modified. metadata frame. This may be the case, for example, when the inbound block includes payload such as binary data or like an additional bitstream that is not related to the data included in the inbound content frame. PCM processing parameters are particularly relevant for so-called PCM connection transcoders.
优选的入站描述符包括至少对以下的表示:时间戳参数和/或持续时间参数是否被包括在描述符中。此外,优选的入站描述符包括复制参数和去复制参数。Preferred inbound descriptors include at least an indication of whether a timestamp parameter and/or a duration parameter are included in the descriptor. Additionally, preferred inbound descriptors include replication parameters and de-replication parameters.
转码器可以被配置成基于入站描述符,根据入站元数据帧生成出站元数据帧。具体地,转码器可以被配置成:仅基于由入站描述符表示的一个或更多个属性,根据入站元数据帧生成出站元数据帧。更具体地,转码器可以被配置成:在不对入站块中包括的元数据进行分析的情况下根据入站元数据帧生成出站元数据帧。这样,转码器可以仅基于元数据块的描述符来执行对元数据帧中包括的元数据的转码,而无需分析和/或解析由元数据块携带的元数据。这使得转码器具有显著降低的计算复杂度。The transcoder can be configured to generate outbound metadata frames from inbound metadata frames based on the inbound descriptor. Specifically, the transcoder may be configured to generate an outbound metadata frame from the inbound metadata frame based solely on one or more attributes represented by the inbound descriptor. More specifically, the transcoder may be configured to generate an outbound metadata frame from the inbound metadata frame without analyzing the metadata included in the inbound block. In this way, the transcoder can perform transcoding of the metadata included in the metadata frame based solely on the descriptor of the metadata chunk, without the need to analyze and/or parse the metadata carried by the metadata chunk. This allows the transcoder to have significantly reduced computational complexity.
转码器可以被配置成:通过将元数据从入站元数据帧的一个或更多个入站块拷贝至相对应的一个或更多个出站块,来根据入站元数据帧生成出站元数据帧。可以将一个或更多个出站块插入至出站元数据帧中。拷贝和插入可以受制于由一个或更多个入站块的入站描述符表示的一个或更多个属性。作为示例,关联参数可以向转码器指示:特定的入站块要被插入至出站元数据帧中。另一方面,转码器参数可以向转码器指示:如果第二编解码器系统与第一编解码器系统不同,则应该丢弃特定的入站块。The transcoder may be configured to generate outbound metadata from an inbound metadata frame by copying metadata from one or more inbound chunks of the inbound metadata frame to a corresponding one or more outbound chunks. Station metadata frame. One or more outbound blocks can be inserted into the outbound metadata frame. Copies and inserts may be subject to one or more attributes represented by the inbound descriptor of one or more inbound blocks. As an example, correlation parameters may indicate to the transcoder that specific inbound chunks are to be inserted into outbound metadata frames. On the other hand, the transcoder parameters may indicate to the transcoder that specific inbound blocks should be discarded if the second codec system is different from the first codec system.
转码器可以被配置成:通过基于入站块的入站描述符生成出站块的出站描述符来生成出站元数据帧。具体地,出站描述符可以包括或可以表示由入站描述符表示的一些或所有属性。可以将入站描述符的一些或所有属性拷贝至出站描述符。另一方面,转码器可以被配置成:对由入站描述符表示的一个或更多个属性进行修改以生成出站描述符,其中,出站描述符表示一个或更多个修改的属性。作为示例,入站描述符可以表示时间戳参数。转码器可以修改时间戳参数,以使得即使转码器可能已经针对入站比特流对出站比特流进行了重新成帧(re-framing),修改的时间戳参数仍然表示与原时间戳参数相同的信号样本。The transcoder may be configured to generate an outbound metadata frame by generating an outbound descriptor for an outbound block based on the inbound descriptor of the inbound block. Specifically, an outbound descriptor may include or may represent some or all of the attributes represented by an inbound descriptor. Some or all properties of the inbound descriptor can be copied to the outbound descriptor. On the other hand, the transcoder may be configured to modify one or more attributes represented by the inbound descriptor to generate an outbound descriptor, where the outbound descriptor represents the one or more modified attributes. . As an example, an inbound descriptor can represent a timestamp parameter. The transcoder may modify the timestamp parameters such that even though the transcoder may have re-framed the outbound bitstream for the inbound bitstream, the modified timestamp parameters still represent the same timestamp parameters as the original timestamp parameters same signal sample.
如上所述,入站描述符的一个或更多个属性可以包括表示信号样本的时间戳参数,入站块的元数据与信号样本相关联。入站描述符的时间戳参数通常表示与入站内容帧有关的信号样本。转码器可以被配置成根据入站块生成出站块。此外,转码器可以被配置成:通过对入站描述符的时间戳参数进行修改来生成出站块的出站描述符,以使得出站描述符的相对应的时间戳参数表示与出站内容帧(与入站内容帧相比,其可以具有不同的成帧)有关的信号样本。这样,转码器可以被配置成:即使当入站比特流进行了重新成帧时,也确保由入站描述符表示的一个或更多个属性仍然有效。As mentioned above, one or more properties of the inbound descriptor may include a timestamp parameter representing a signal sample with which the metadata of the inbound block is associated. The timestamp parameter of the inbound descriptor typically represents a signal sample associated with the inbound content frame. The transcoder can be configured to generate outbound chunks based on inbound chunks. Additionally, the transcoder may be configured to generate an outbound descriptor for an outbound block by modifying the timestamp parameter of the inbound descriptor such that the corresponding timestamp parameter of the outbound descriptor represents the same representation as the outbound Signal samples related to content frames (which may have different framing compared to inbound content frames). In this way, the transcoder can be configured to ensure that one or more attributes represented by the inbound descriptor remain valid even when the inbound bitstream is reframed.
转码器可以被配置成将出站块(根据入站元数据帧的入站块来生成)插入至延迟的出站元数据帧中。作为示例,入站描述符的关联参数可以向转码器指示可以延迟入站块。转码器可以选择将元数据插入至延迟的出站元数据帧中(例如,由于第二比特流的有限的比特率)。延迟的出站元数据帧可能与下述延迟的出站内容帧相关联,该延迟的出站内容帧不包括由入站块的时间戳参数表示的信号样本。尽管如此,为了确保出站块的时间戳参数标识正确的信号样本,转码器可以被配置成:通过对入站块的时间戳参数进行修改来生成出站块的出站描述符,以使得出站描述符的时间戳参数表示与延迟的出站内容帧有关的信号样本。作为示例,修改的时间戳参数可以表示超过延迟的内容帧的样本数量的样本数量,从而表示信号的样本存在于延迟的内容帧的外部。The transcoder may be configured to insert outbound blocks (generated from the inbound blocks of the inbound metadata frame) into the delayed outbound metadata frame. As an example, the associated parameters of the inbound descriptor may indicate to the transcoder that the inbound block may be delayed. The transcoder may choose to insert metadata into delayed outbound metadata frames (eg, due to the limited bitrate of the second bitstream). A delayed outbound metadata frame may be associated with a delayed outbound content frame that does not include the signal sample represented by the timestamp parameter of the inbound block. Nonetheless, to ensure that the timestamp parameter of the outbound block identifies the correct signal sample, the transcoder can be configured to generate the outbound descriptor of the outbound block by modifying the timestamp parameter of the inbound block such that The timestamp parameter of the outbound descriptor represents a signal sample associated with a delayed outbound content frame. As an example, the modified timestamp parameter may represent a number of samples that exceeds the number of samples of the delayed content frame, thereby indicating that samples of the signal are present outside the delayed content frame.
如上所述,入站指示符的一个或更多个属性可以包括复制参数,该复制参数表示相对应的入站块的元数据是否要包括在根据入站元数据帧生成的每个出站元数据帧中。转码器可以被配置成:通过考虑复制参数,根据入站元数据帧生成多个出站元数据帧。具体地,转码器可以被配置成确定复制参数表示入站块的元数据要被包括在根据入站元数据帧生成的每个出站元数据帧中。在这样的情况下,转码器可以被配置成:将入站块的元数据插入至多个出站元数据帧的每个出站元数据帧中。具体地,转码器可以被配置成:对于多个出站元数据帧的每个出站元数据帧,根据入站块生成出站块。除了生成多个出站元数据帧以外,转码器还可以被配置成:根据入站内容帧生成多个出站内容帧,其中,多个出站内容帧可以分别与多个出站元数据帧相关联。As described above, one or more attributes of the inbound indicator may include a replication parameter that indicates whether the metadata for the corresponding inbound block is to be included in each outbound element generated from the inbound metadata frame. in the data frame. The transcoder can be configured to generate multiple outbound metadata frames based on the inbound metadata frame by taking into account replication parameters. In particular, the transcoder may be configured to determine that the replication parameter represents an inbound block of metadata to be included in each outbound metadata frame generated from the inbound metadata frame. In such a case, the transcoder may be configured to insert the metadata of the inbound chunk into each of the plurality of outbound metadata frames. Specifically, the transcoder may be configured to generate an outbound block based on the inbound block for each of the plurality of outbound metadata frames. In addition to generating multiple outbound metadata frames, the transcoder can be configured to generate multiple outbound content frames based on the inbound content frame, where the multiple outbound content frames can be individually associated with multiple outbound metadata frames. frame associated.
复制参数可以包括标记,该标记可以被设置成表示:入站块的元数据要被包括在根据入站元数据帧生成的每个出站元数据帧中,或者入站块的元数据不要被包括在根据入站元数据帧生成的每个出站元数据帧中(即,替代地,标记可以被设置成表示相反的情况)。The replication parameters may include a flag that may be set to indicate that the inbound chunk's metadata is to be included in each outbound metadata frame generated from the inbound metadata frame, or that the inbound chunk's metadata is not to be Included in every outbound metadata frame generated from an inbound metadata frame (i.e., the flag may instead be set to indicate the opposite).
如上所述,入站描述符的一个或更多个属性可以包括去复制参数,该去复制参数表示:如果根据多个入站元数据帧生成出站元数据帧,则入站块的元数据是否可以被(或要被)转码器丢弃。转码器可以被配置成:通过考虑去复制参数,根据入站比特流的多个入站元数据帧生成出站元数据帧。具体地,多个入站元数据帧可以包括多个元数据入站块,每个入站块与表示入站块的元数据可以被转码器丢弃的相应的去复制参数相关联。转码器可以被配置成:对于除了多个入站元数据帧中的一个入站元数据帧之外的所有入站元数据帧(例如,除了多个入站元数据帧的第一个入站元数据帧以外的所有入站元数据帧),丢弃多个入站块的元数据,以生成出站元数据帧。除了根据多个入站元数据帧生成出站元数据帧以外,转码器可以被配置成根据多个入站内容帧生成出站内容帧,其中,多个入站内容帧分别与多个入站元数据帧相关联。As mentioned above, one or more properties of the inbound descriptor may include a deduplication parameter that represents the metadata of the inbound block if an outbound metadata frame is generated from multiple inbound metadata frames. Whether it can be (or is to be) discarded by the transcoder. The transcoder may be configured to generate outbound metadata frames from multiple inbound metadata frames of the inbound bitstream by taking into account deduplication parameters. Specifically, a plurality of inbound metadata frames may include a plurality of inbound metadata blocks, each inbound block being associated with a corresponding deduplication parameter indicating that the inbound block's metadata may be discarded by the transcoder. The transcoder may be configured to: all inbound metadata frames except inbound metadata frames), discarding multiple inbound chunks of metadata to generate outbound metadata frames. In addition to generating outbound metadata frames from a plurality of inbound metadata frames, the transcoder may be configured to generate an outbound content frame from a plurality of inbound content frames, wherein the plurality of inbound content frames are respectively associated with the plurality of inbound metadata frames. Station metadata frame associated.
去复制参数可以包括标记,该标记可以被设置成表示:如果根据多个入站元数据帧生成出站元数据帧,则入站块的元数据可以被(或要被)转码器丢弃;或者如果根据多个入站元数据帧生成出站元数据帧,则入站块的元数据可以不被(或不要被)转码器丢弃(即,替代地,该标记可以被设置成表示相反的情况)。The deduplication parameters may include a flag that may be set to indicate that if an outbound metadata frame is generated from multiple inbound metadata frames, the metadata for the inbound block may be (or is to be) discarded by the transcoder; Or if an outbound metadata frame is generated from multiple inbound metadata frames, the inbound chunk's metadata may not be (or should not be) discarded by the transcoder (i.e., instead, the flag may be set to indicate the opposite Case).
如上所述,入站描述符的一个或更多个属性可以包括优先级参数,该优先级参数表示入站块的元数据相对于元数据的一个或更多个其他入站块的相对重要性。在转码器处接收的入站元数据帧可以包括具有表示优先级参数的不同值的描述符的多个入站块。转码器可以被配置成:按照多个入站块的优先级参数,根据多个入站块生成出站元数据帧。具体地,如果对于出站比特流而言可以获得足够的比特率,则转码器可以首先选择具有最高的相对优先级的一个或更多个入站块并且仅插入较低优先级的入站块。As discussed above, one or more attributes of the inbound descriptor may include a priority parameter that represents the relative importance of the inbound block's metadata relative to one or more other inbound blocks of metadata. . The inbound metadata frame received at the transcoder may include multiple inbound blocks with descriptors representing different values of the priority parameter. The transcoder can be configured to generate outbound metadata frames based on multiple inbound chunks according to their priority parameters. Specifically, if a sufficient bitrate is available for the outbound bitstream, the transcoder may first select the one or more inbound chunks with the highest relative priority and only insert the lower priority inbound chunks piece.
多个入站块可以与表示增量优先级的增量优先级参数相关联。多个入站块可以包括增量元数据,以使得多个入站块的组合元数据提供高质量元数据并且使得多个入站块中具有最高相对优先级的入站块的元数据提供质量降低的元数据(即,与由组合元数据提供的高质量元数据相比,提供质量降低的元数据)。具有下一个较低的优先级的入站块可以提供质量提高的元数据等,直到当对多个入站块全部进行组合时提供最高质量元数据为止。转码器可以被配置成:基于多个入站块中的至少一个或更多个入站块生成出站元数据帧,从而允许出站元数据帧中包括的元数据的质量的可伸缩(scalable)劣化。劣化的程度可以例如基于出站比特流的可用比特率。Multiple inbound blocks can be associated with a delta priority parameter that represents delta priority. Multiple inbound chunks may include incremental metadata such that the combined metadata of the multiple inbound chunks provides high quality metadata and such that the metadata of the inbound chunk with the highest relative priority among the multiple inbound chunks provides quality Degraded metadata (i.e., providing metadata of reduced quality compared to the high-quality metadata provided by combined metadata). The inbound block with the next lower priority may provide metadata of increasing quality, and so on, until the highest quality metadata is provided when multiple inbound blocks are all combined. The transcoder may be configured to generate an outbound metadata frame based on at least one or more of the plurality of inbound blocks, thereby allowing scalability of the quality of metadata included in the outbound metadata frame ( scalable) degradation. The degree of degradation may be based, for example, on the available bitrate of the outbound bitstream.
如上所述,入站描述符的一个或更多个属性可以包括关联参数,该关联参数表示:是否可以将入站块的元数据插入至在出站元数据帧之后的延迟的出站元数据帧中。转码器可以被配置成:基于关联参数和/或基于对出站比特流的比特率限制,将来自入站块的元数据插入至出站元数据帧中。具体地,转码器可以被配置成:如果关联参数表示可以延迟入站块的元数据,则将来自入站块的元数据插入至在出站元数据帧之后的延迟的出站元数据帧中。As mentioned above, one or more properties of the inbound descriptor may include an association parameter indicating whether metadata for the inbound block may be inserted into delayed outbound metadata following the outbound metadata frame. frame. The transcoder may be configured to insert metadata from the inbound chunk into the outbound metadata frame based on correlation parameters and/or based on bitrate limits on the outbound bitstream. Specifically, the transcoder may be configured to insert metadata from the inbound block into a delayed outbound metadata frame following the outbound metadata frame if the associated parameter indicates that the inbound block's metadata may be delayed. middle.
根据另一个方面,描述了一种用于将包括入站内容帧和相关联的入站元数据帧的入站比特流转码成出站比特流的方法。出站比特流可以包括出站内容帧和/或相关联的出站元数据帧。入站内容帧可以表示根据第一编解码器系统编码的信号,而出站内容帧可以表示根据第二编解码器系统编码的信号。如上所述,第一编解码器系统和第二编解码器系统可以相同或可以不同。该方法可以包括从入站元数据帧中识别元数据入站块。元数据入站块可以与表示元数据入站块中包括的元数据的一个或更多个属性的入站描述符相关联。此外,该方法可以包括:基于入站描述符,根据入站元数据帧生成出站元数据帧。换言之,可以通过考虑入站描述符来确定出站元数据帧,而通常无需进一步分析入站元数据帧中包括的元数据。According to another aspect, a method for transcoding an inbound bitstream including an inbound content frame and an associated inbound metadata frame into an outbound bitstream is described. The outbound bitstream may include outbound content frames and/or associated outbound metadata frames. Inbound content frames may represent signals encoded according to a first codec system, and outbound content frames may represent signals encoded according to a second codec system. As mentioned above, the first codec system and the second codec system may be the same or may be different. The method may include identifying an inbound block of metadata from an inbound metadata frame. The metadata inbound block may be associated with an inbound descriptor representing one or more properties of the metadata included in the metadata inbound block. Additionally, the method may include generating an outbound metadata frame from the inbound metadata frame based on the inbound descriptor. In other words, the outbound metadata frame can be determined by considering the inbound descriptor, often without further analysis of the metadata included in the inbound metadata frame.
根据另一个方面,描述了包括内容帧和相关联的元数据帧的编码比特流。内容帧可以表示根据第一编解码器系统编码的信号。元数据帧可以包括元数据块,并且元数据块可以与(或可以包括)表示元数据块中包括的元数据的一个或更多个属性的描述符相关联。According to another aspect, an encoded bitstream is described that includes content frames and associated metadata frames. The content frame may represent a signal encoded according to the first codec system. A metadata frame may include a metadata block, and the metadata block may be associated with (or may include) a descriptor representing one or more attributes of the metadata included in the metadata block.
根据另一个方面,描述了一种编码器,被配置成:生成包括内容帧和相关联的元数据帧的编码比特流。内容帧可以表示根据编解码器系统编码的信号。编码器可以被配置成生成元数据块。此外,编码器可以被配置成确定与元数据块相关联的描述符。描述符可以表示元数据块中包括的元数据的一个或更多个属性。此外,编码器可以被配置成将元数据块插入至元数据帧中。应当注意,本文献中在转码器的背景下描述的特征也适用于相对应的编码器。According to another aspect, an encoder is described configured to generate an encoded bitstream including a content frame and an associated metadata frame. Content frames may represent signals encoded according to a codec system. Encoders can be configured to generate chunks of metadata. Additionally, the encoder may be configured to determine descriptors associated with metadata chunks. A descriptor may represent one or more properties of the metadata included in the metadata block. Additionally, the encoder may be configured to insert metadata blocks into metadata frames. It should be noted that the features described in this document in the context of transcoders also apply to the corresponding encoders.
具体地,一个或更多个属性可以包括表示信号样本的时间戳参数,元数据块中包括的元数据与该信号样本相关联。信号样本可以包括在内容帧中。编码器可以被配置成将块插入至延迟的元数据帧中,其中,延迟的元数据帧与不包括信号样本的延迟的内容帧相关联。此外,编码器可以被配置成生成元数据块的描述符,以使得描述符的时间戳参数表示与延迟的内容帧有关的信号样本。这样,编码器可以被配置成:延迟元数据的传输并且相应地修改时间戳参数,从而使由编码器生成的比特流的比特率平滑。Specifically, one or more attributes may include a timestamp parameter representing a signal sample to which metadata included in the metadata block is associated. Signal samples may be included in content frames. The encoder may be configured to insert blocks into delayed metadata frames, where the delayed metadata frames are associated with delayed content frames that do not include signal samples. Furthermore, the encoder may be configured to generate descriptors of metadata chunks such that the timestamp parameters of the descriptors represent signal samples related to delayed content frames. In this way, the encoder can be configured to delay the transmission of metadata and modify the timestamp parameters accordingly, thereby smoothing the bitrate of the bitstream generated by the encoder.
根据一个方面,描述了相对应的解码器。解码器可以包括本文献中描述的与解码器相关的特征中的任何特征。解码器可以被配置成:对包括内容帧和相关联的元数据帧的编码比特流进行解码。如上所述,内容帧表示根据第一编解码器系统编码的信号。元数据帧可以包括元数据块,其中,元数据块与(或包括)表示元数据块中包括的元数据的一个或更多个属性的描述符相关联。解码器可以被配置成对内容帧中包括的编码信号进行解码。具体地,解码器可以包括第一编解码器系统的解码器,以对编码信号进行解码。因此,解码器可以被配置成提供编码信号的一组PCM样本。According to one aspect, a corresponding decoder is described. The decoder may include any of the decoder-related features described in this document. The decoder may be configured to decode an encoded bitstream including content frames and associated metadata frames. As mentioned above, the content frame represents a signal encoded according to the first codec system. A metadata frame may include a metadata block, wherein the metadata block is associated with (or includes) a descriptor representing one or more attributes of the metadata included in the metadata block. The decoder may be configured to decode the encoded signal included in the content frame. In particular, the decoder may comprise a decoder of the first codec system to decode the encoded signal. Therefore, the decoder may be configured to provide a set of PCM samples of the encoded signal.
此外,解码器可以被配置成:从元数据帧中识别元数据块并且从元数据块提取描述符。另外,解码器可以被配置成:根据由描述符表示的一个或更多个属性对元数据块中包括的元数据进行处理。一个或更多个属性可以与本文献中描述的任何一个或更多个属性相对应。解码器可以被配置成:将元数据的特定属性与对元数据的相对应的处理关联。作为示例,描述符可以表示时间戳参数,从而通知解码器:元数据块的元数据要被应用于特定的信号样本。这样,解码器可以被配置成:将元数据应用于由时间戳参数表示的样本。作为另一个示例,描述符可以表示关联参数。如果关联参数表示元数据块与内容帧无关,则解码器可以被配置成:将元数据块中包括的元数据传送至另一处理单元(其处理例如元数据块中包括的辅助数据)。Furthermore, the decoder may be configured to identify metadata blocks from the metadata frame and extract descriptors from the metadata blocks. Additionally, the decoder may be configured to process metadata included in the metadata block based on one or more attributes represented by the descriptor. One or more attributes may correspond to any one or more attributes described in this document. The decoder may be configured to associate specific attributes of the metadata with corresponding processing of the metadata. As an example, the descriptor may represent a timestamp parameter, thereby informing the decoder that the metadata of the metadata block is to be applied to a specific signal sample. In this way, the decoder can be configured to apply metadata to the samples represented by the timestamp parameter. As another example, a descriptor may represent an associated parameter. If the association parameter indicates that the metadata block is not related to the content frame, the decoder may be configured to pass the metadata included in the metadata block to another processing unit (which processes eg ancillary data included in the metadata block).
根据另一个方面,描述了一种用于对包括内容帧和相关联的元数据帧的编码比特流进行解码的方法。内容帧可以表示根据第一编解码器系统编码的信号。元数据帧可以包括元数据块,其中,元数据块可以与表示元数据块中包括的元数据的一个或更多个属性的描述符相关联。该方法可以包括对内容帧中包括的编码信号进行解码。此外,该方法可以包括从元数据帧中识别元数据块并且从元数据块中提取描述符。另外,该方法可以包括基于由描述符表示的一个或更多个属性对元数据块中包括的元数据进行处理。According to another aspect, a method for decoding an encoded bitstream including a content frame and associated metadata frame is described. The content frame may represent a signal encoded according to the first codec system. A metadata frame may include a metadata block, where the metadata block may be associated with a descriptor representing one or more attributes of the metadata included in the metadata block. The method may include decoding an encoded signal included in the content frame. Additionally, the method may include identifying metadata chunks from the metadata frame and extracting descriptors from the metadata chunks. Additionally, the method may include processing metadata included in the metadata block based on one or more attributes represented by the descriptor.
根据另一个方面,描述了一种用于生成包括内容帧和相关联的元数据帧的编码比特流的方法。内容帧可以表示根据编解码器系统编码的信号。该方法可以包括生成元数据块。此外,该方法可以包括确定与元数据块相关联的描述符,其中,描述符表示元数据块中包括的元数据的一个或更多个属性。另外,该方法可以包括将元数据块插入至元数据帧中。According to another aspect, a method for generating an encoded bitstream including a content frame and associated metadata frame is described. Content frames may represent signals encoded according to a codec system. The method may include generating a metadata block. Additionally, the method may include determining a descriptor associated with the metadata block, wherein the descriptor represents one or more attributes of metadata included in the metadata block. Additionally, the method may include inserting the metadata block into the metadata frame.
根据另一个方面,描述了一种编码器,其被配置成生成包括内容帧和相关联的元数据帧的编码比特流。内容帧可以表示根据第一编解码器系统编码的信号。编码器可以被配置成生成元数据块。在优选实施方式中,元数据块包括如本文献中描述的描述符。描述符可以表示元数据块中包括的元数据的一个或更多个属性。According to another aspect, an encoder is described that is configured to generate an encoded bitstream that includes content frames and associated metadata frames. The content frame may represent a signal encoded according to the first codec system. Encoders can be configured to generate chunks of metadata. In a preferred embodiment, the metadata block includes descriptors as described in this document. A descriptor may represent one or more properties of the metadata included in the metadata block.
编码器可以被配置成将元数据块插入至元数据帧中。此外,编码器可以被配置成从多个预定的安全秘钥中选择安全秘钥。多个预定的安全秘钥可以被配置成使得提供不同的信任等级。具体地,多个预定的安全秘钥可以包括仅编码器的开发人员(或相对应的解码器的开发人员或包括解码器和编码器的相对应的转码器的开发人员)知道的高度安全秘钥。此外,多个预定的安全秘钥可以包括编码器的操作人员(或相对应的解码器的操作人员或包括解码器和编码器的相对应的转码器的操作人员)知道的中等安全秘钥。The encoder may be configured to insert metadata blocks into metadata frames. Additionally, the encoder may be configured to select a security key from a plurality of predetermined security keys. Multiple predetermined security keys can be configured so as to provide different levels of trust. In particular, the plurality of predetermined security keys may include a high level of security known only to the developer of the encoder (or the developer of the corresponding decoder or the corresponding transcoder including the decoder and the encoder) Secret key. Furthermore, the plurality of predetermined security keys may include medium security keys known to the operator of the encoder (or the operator of the corresponding decoder or the corresponding transcoder including the decoder and the encoder) .
编码器可以被配置成:至少基于内容帧、相关联的元数据帧和所选择的安全秘钥生成加密值。具体地,编码器可以被配置成:计算HMAC-MD5值或HMAC-SHA256值(如在联邦信息处理标准FIPS PUB 180-2中指定的安全散列算法)以生成加密值。另外,编码器可以被配置成:将HMAC-MD5值或HMAC-SHA256值截短以产生加密值。通过将HMAC值截短,可以减少加密值所需的开销。编码器可以被配置成将生成的加密值插入至元数据帧中,从而在不被检测到的情况下确保内容帧和/或元数据帧不能被未授权方修改。The encoder may be configured to generate an encrypted value based on at least the content frame, the associated metadata frame, and the selected security key. Specifically, the encoder may be configured to calculate a HMAC-MD5 value or a HMAC-SHA256 value (such as the secure hash algorithm specified in Federal Information Processing Standard FIPS PUB 180-2) to generate an encrypted value. Additionally, the encoder may be configured to truncate the HMAC-MD5 value or the HMAC-SHA256 value to produce an encrypted value. By truncating the HMAC value, you reduce the overhead required to encrypt the value. The encoder may be configured to insert the generated encrypted value into the metadata frame, thereby ensuring that the content frame and/or the metadata frame cannot be modified by unauthorized parties without detection.
使用提供不同的信任等级的不同安全秘钥确保了相对应的解码器(或包括解码器的转码器)可以验证接收的比特流是否已经被修改,以及如果已经被修改,则可以验证哪一方修改了接收的比特流。作为示例,编码器可能使用高度安全秘钥最初生成了比特流。中间方可能已经修改了比特流并且可能已经使用中等安全秘钥来生成经修改的加密值。这样,解码器了解到接收的比特流已经被可以获得中等安全秘钥的一方修改。应当注意,多个预定的安全秘钥可以包括多于两级的信任,从而为解码器提供关于接收的比特流的信任度的更多细节。The use of different security keys providing different levels of trust ensures that the corresponding decoder (or transcoder including the decoder) can verify whether the received bitstream has been modified, and if so, which party Modified received bitstream. As an example, the encoder might have originally generated the bitstream using a highly secure key. The intermediary party may have modified the bitstream and may have used a medium security key to generate the modified encrypted value. In this way, the decoder learns that the received bitstream has been modified by a party with access to a moderately secure key. It should be noted that multiple predetermined security keys may include more than two levels of trust, thereby providing the decoder with more details about the trust level of the received bitstream.
编码器可以被配置成将对所选择的安全秘钥的表示插入至元数据帧中,从而使得相对应的解码器能够容易地验证接收的比特流是否已经被修改。另一方面,解码器可以被配置成使用多个预定的安全秘钥中的所有安全秘钥来验证所接收的比特流的可靠性,而不需要将所选择的安全秘钥的表示插入至元数据帧中。The encoder may be configured to insert a representation of the selected security key into the metadata frame so that the corresponding decoder can easily verify whether the received bitstream has been modified. On the other hand, the decoder may be configured to use all of the plurality of predetermined security keys to verify the authenticity of the received bitstream without inserting a representation of the selected security key into the element. in the data frame.
编码器可以被配置成:对于编码比特流生成多个连续的内容帧和相关联的元数据帧。此外,编码器可以被配置成:基于单个内容帧及其相关联的元数据帧并且基于所选择的安全秘钥生成帧加密值。可以将帧加密值插入至相关联的元数据帧中,并且相对应的解码器(或转码器)可以使用帧加密值来验证单独的内容帧/元数据帧的可靠性。此外,编码器可以被配置成:基于多个连续的内容帧及其相关联的元数据帧中的至少一些并且基于所选择的安全值来生成历史加密值。可以将历史加密值插入至多个连续的元数据帧中的一个元数据帧中,并且相对应的解码器(或转码器)可以使用历史加密值来验证多个连续的内容帧和元数据帧的正确的顺序次序。The encoder may be configured to generate a plurality of consecutive content frames and associated metadata frames for the encoded bitstream. Additionally, the encoder may be configured to generate a frame encryption value based on a single content frame and its associated metadata frame and based on a selected security key. The frame encryption value can be inserted into the associated metadata frame, and the corresponding decoder (or transcoder) can use the frame encryption value to verify the authenticity of the individual content frame/metadata frame. Furthermore, the encoder may be configured to generate historical encryption values based on at least some of the plurality of consecutive content frames and their associated metadata frames and based on the selected security value. The historical encryption value can be inserted into one of multiple consecutive metadata frames, and the corresponding decoder (or transcoder) can use the historical encryption value to verify multiple consecutive content frames and metadata frames the correct sequential order.
根据另一方面,描述了一种用于生成包括内容帧和相关联的元数据帧的编码比特流的方法。内容帧可以表示根据第一编解码器系统编码的信号。该方法可以包括:生成元数据块并且将元数据块插入至元数据帧中。此外,该方法可以包括从多个预定的安全秘钥中选择安全秘钥,其中,多个预定的安全秘钥提供不同的信任等级。另外,该方法可以包括:至少基于内容帧、相关联的元数据帧和所选择的安全秘钥来生成加密值。然后,可以将生成的加密值插入至元数据帧中。According to another aspect, a method for generating an encoded bitstream including a content frame and associated metadata frame is described. The content frame may represent a signal encoded according to the first codec system. The method may include generating a metadata block and inserting the metadata block into a metadata frame. Additionally, the method may include selecting a security key from a plurality of predetermined security keys, wherein the plurality of predetermined security keys provide different levels of trust. Additionally, the method may include generating an encrypted value based on at least the content frame, the associated metadata frame, and the selected security key. The resulting encrypted value can then be inserted into the metadata frame.
根据另一个方面,描述了相对应的解码器。解码器可以被配置成:接收包括内容帧和相关联的元数据帧的编码比特流。编码比特流可以具有本文献中描述的任意一个或更多个属性。具体地,内容帧可以表示根据第一编解码器系统(例如,如在本文献中提到的编解码器系统)编码的信号。解码器可以被配置成从元数据帧中提取加密值。如本文献中所述,相对应的编码器可能已经将加密值插入至元数据帧中。具体地,可以使用多个预定的安全秘钥中的一个安全秘钥来确定加密值。如上所述,多个预定的安全秘钥可以提供不同的信任等级。作为示例,多个预定的安全秘钥可以包括高度安全秘钥和中等安全秘钥。According to another aspect, a corresponding decoder is described. The decoder may be configured to receive an encoded bitstream including content frames and associated metadata frames. An encoded bitstream may have any one or more of the properties described in this document. In particular, the content frame may represent a signal encoded according to a first codec system (eg a codec system as mentioned in this document). The decoder can be configured to extract encrypted values from the metadata frame. The corresponding encoder may have inserted the encrypted value into the metadata frame as described in this document. In particular, the encrypted value may be determined using one of a plurality of predetermined security keys. As mentioned above, multiple predetermined security keys can provide different levels of trust. As an example, the plurality of predetermined security keys may include a high security key and a medium security key.
解码器可以被配置成:从多个预定的安全秘钥中确定安全秘钥。具体地,解码器可以被配置成:通过从元数据帧(例如,从元数据帧的特定字段)中提取安全秘钥来确定安全秘钥。此外,解码器可以被配置成:至少基于接收的内容帧、接收的相关联的元数据帧和确定的安全秘钥来生成验证加密值。另外,解码器可以被配置成将提取的加密值与验证加密值进行比较,以确定是否可以信任接收的编码比特流。作为示例,如果提取的加密值和验证加密值匹配,则解码器可以确定接收的编码比特流可以被信任。此外,用于生成验证加密值的安全秘钥可以向解码器指示与接收的编码比特流相关联的信任等级。作为示例,与中等安全秘钥相比,高度安全秘钥可以表示较高等级的信任。The decoder may be configured to determine the security key from a plurality of predetermined security keys. Specifically, the decoder may be configured to determine the security key by extracting the security key from the metadata frame (eg, from a specific field of the metadata frame). Furthermore, the decoder may be configured to generate a verification encrypted value based at least on the received content frame, the received associated metadata frame, and the determined security key. Additionally, the decoder may be configured to compare the extracted encrypted value with the verified encrypted value to determine whether the received encoded bitstream can be trusted. As an example, if the extracted encrypted value and the verified encrypted value match, the decoder may determine that the received encoded bitstream can be trusted. Additionally, the security key used to generate the authenticated encrypted value may indicate to the decoder the level of trust associated with the received encoded bitstream. As an example, a high security key can represent a higher level of trust than a medium security key.
解码器可以被配置成确定多个预定的安全秘钥中哪个安全秘钥已经用于生成提取的加密值。如上所述,已经用于生成提取的加密值的安全秘钥可以提供对接收的编码比特流的信任等级的表示。具体地,解码器可以被配置成:对于多个预定的安全秘钥分别生成多个验证加密值。此外,解码器可以被配置成将多个验证加密值中的每个验证加密值与提取的加密值进行比较。另外,解码器可以被配置成:如果比较显示多个验证加密值中的一个验证加密值与提取的加密值匹配,则确定多个预定的安全秘钥中的一个安全秘钥已经用于生成提取的加密值。The decoder may be configured to determine which of a plurality of predetermined security keys has been used to generate the extracted encrypted value. As mentioned above, the security key that has been used to generate the extracted encrypted value can provide an indication of the level of trust in the received encoded bitstream. Specifically, the decoder may be configured to generate a plurality of verification encryption values for a plurality of predetermined security keys respectively. Additionally, the decoder may be configured to compare each of the plurality of verified encrypted values to the extracted encrypted value. Additionally, the decoder may be configured to determine that one of the plurality of predetermined security keys has been used to generate the extraction if the comparison shows that one of the plurality of verified encrypted values matches the extracted encrypted value. encrypted value.
根据另一个方面,描述了一种用于确定接收的包括内容帧和相关联的元数据帧的编码比特流的信任等级的方法。内容帧可以表示根据第一编解码器系统编码的信号。该方法可以包括从元数据帧中提取加密值。此外,该方法可以包括根据多个预定的安全秘钥来确定安全秘钥,其中,多个预定的安全秘钥提供不同的信任等级。另外,该方法可以包括至少基于内容帧、相关联的元数据帧和确定的安全秘钥来生成验证加密值。该方法可以进行将提取的加密值与验证加密值进行比较以确定接收的编码比特流的信任等级,其中,信任等级可以由确定的安全秘钥来表示。According to another aspect, a method for determining a trust level of a received encoded bitstream including a content frame and associated metadata frame is described. The content frame may represent a signal encoded according to the first codec system. The method may include extracting the encrypted value from the metadata frame. Additionally, the method may include determining a security key based on a plurality of predetermined security keys, wherein the plurality of predetermined security keys provide different levels of trust. Additionally, the method may include generating a verification encrypted value based at least on the content frame, the associated metadata frame, and the determined security key. The method may perform a comparison of the extracted encrypted value with the verified encrypted value to determine a trust level of the received encoded bitstream, where the trust level may be represented by a determined security key.
根据另一个方面,描述了一种转码器,其被配置成:将包括入站内容帧和相关联的入站元数据帧的入站比特流转码成出站比特流。转码器可以包括本文献中描述的与转码器相关的特征中的任何特征。如上所述,入站比特流可以表示一组信号样本,例如表示信号的帧的样本。转码器可以包括解码器(用于对接收的入站比特流进行解码)和编码器(用于对经解码的入站比特流重新编码,以提供经转码的出站比特流)。转码器可以包括所谓的PCM连接转码器。According to another aspect, a transcoder is described that is configured to transcode an inbound bitstream including an inbound content frame and an associated inbound metadata frame into an outbound bitstream. A transcoder may include any of the features described in this document related to transcoders. As mentioned above, the inbound bitstream may represent a set of signal samples, such as samples representing frames of a signal. The transcoder may include a decoder for decoding the received inbound bitstream and an encoder for re-encoding the decoded inbound bitstream to provide a transcoded outbound bitstream. Transcoders may include so-called PCM connection transcoders.
转码器的解码器可以被配置成将入站内容帧转换成信号的一组经解码的PCM样本。此外,解码器可以被配置成从入站元数据帧提取元数据。这样,解码器可以被配置成:将入站比特流转换成经解码的PCM样本和相关联的元数据的序列。转码器的编码器可以使用经解码的PCM样本和相关联的元数据的序列来生成出站比特流(根据第二编解码器系统)。解码器可以被配置成:针对该组经解码的PCM样本和提取的元数据,使用解码器安全秘钥生成签名值。可以使用HMAC-MD5散列函数或HMAC-SHA256散列函数来生成签名值。可以将得到的值截短以提供签名值。这样,解码器可以被配置成提供签名值,从而使得编码器能够验证:在转码器的解码器和编码器之间经解码的PCM样本和/或元数据是否已经被未经授权的实体(其不能获得解码器安全秘钥)修改。The decoder of the transcoder may be configured to convert the inbound content frame into a set of decoded PCM samples of the signal. Additionally, the decoder may be configured to extract metadata from inbound metadata frames. As such, the decoder may be configured to convert the inbound bitstream into a sequence of decoded PCM samples and associated metadata. The sequence of decoded PCM samples and associated metadata may be used by the transcoder's encoder to generate an outbound bitstream (according to the second codec system). The decoder may be configured to generate a signature value using the decoder security key for the set of decoded PCM samples and extracted metadata. The signature value can be generated using the HMAC-MD5 hash function or the HMAC-SHA256 hash function. The resulting value can be truncated to provide a signed value. In this way, the decoder can be configured to provide a signature value, thereby enabling the encoder to verify whether the decoded PCM samples and/or metadata between the transcoder's decoder and the encoder have been modified by an unauthorized entity ( It cannot obtain the decoder security key) modification.
转码器的编码器可以被配置成接收一组PCM样本和相关联的元数据。所接收的一组PCM样本通常与由解码器提供的一组经解码的PCM样本相对应,并且接收的元数据通常与来自解码器的提取的元数据相对应。然而,PCM样本和/或元数据可能已经被修改,以使得所接收的一组PCM样本和/或接收的元数据可能与一组经解码的PCM样本和/或提取的元数据不同。The transcoder's encoder may be configured to receive a set of PCM samples and associated metadata. The received set of PCM samples typically corresponds to a decoded set of PCM samples provided by the decoder, and the received metadata typically corresponds to extracted metadata from the decoder. However, the PCM samples and/or metadata may have been modified such that the received set of PCM samples and/or received metadata may be different from the set of decoded PCM samples and/or extracted metadata.
编码器可以被配置成接收签名值。接收的签名值可以对应于或等同于由解码器生成的签名值。另一方面,接收的签名值可以与由解码器生成的签名值不同(例如,如果由经授权的实体修改,则进行了对PCM样本和/或元数据的修改)。编码器可以被配置成:使用编码器安全秘钥验证接收的签名值对于接收的一组PCM样本和相关联的元数据是否有效。此外,编码器可以被配置成:如果接收的签名有效,则根据接收的一组PCM样本生成出站比特流的出站内容帧并且根据接收的元数据生成出站比特流的相关联的出站元数据帧。另一方面,编码器可以被配置成:如果接收的签名无效,则阻止将接收的元数据插入至出站比特流中。这样,转码器的编码器可以被配置成:如果元数据或PCM样本已经被未经授权的实体修改,则阻止将元数据帧插入至出站比特流中。The encoder can be configured to receive signed values. The received signature value may correspond to or be identical to the signature value generated by the decoder. On the other hand, the received signature value may differ from the signature value generated by the decoder (eg, if modified by an authorized entity, modifications to the PCM sample and/or metadata were made). The encoder may be configured to use the encoder security key to verify that the received signature value is valid for a received set of PCM samples and associated metadata. Furthermore, the encoder may be configured to, if the received signature is valid, generate an outbound content frame of the outbound bitstream based on the received set of PCM samples and generate an associated outbound content frame of the outbound bitstream based on the received metadata. Metadata frame. On the other hand, the encoder may be configured to prevent insertion of received metadata into the outbound bitstream if the received signature is invalid. In this way, the transcoder's encoder can be configured to prevent metadata frames from being inserted into the outbound bitstream if the metadata or PCM samples have been modified by an unauthorized entity.
应当注意,通常,转码器中使用的解码器和编码器由相应的音频编解码器系统的开发人员提供。这样,解码器和编码器的功能可以由开发人员来控制,从而确保了高质量的音频内容和相关联的元数据。另一方面,未经授权的实体可以对解码器和编码器之间的PCM样本和/或元数据进行修改,从而产生音频内容和/或元数据的质量降低的风险。通过提供生成签名值的解码器并且通过提供验证签名值的编码器,可以确保能够检测出对PCM样本和/或元数据的未经授权的修改。It should be noted that typically, the decoders and encoders used in transcoders are provided by the developers of the corresponding audio codec systems. This way, decoder and encoder functionality can be controlled by developers, ensuring high-quality audio content and associated metadata. On the other hand, unauthorized entities can make modifications to PCM samples and/or metadata between the decoder and encoder, creating a risk of reduced quality of the audio content and/or metadata. By providing a decoder that generates a signature value and by providing an encoder that verifies the signature value, it is ensured that unauthorized modifications to PCM samples and/or metadata can be detected.
编码器可以被配置成将解码器安全秘钥用作编码器安全秘钥。这样,确保编码器可以验证接收的PCM样本和元数据是否与由相对应的解码器提供的PCM样本和元数据相对应。The encoder can be configured to use the decoder security key as the encoder security key. This ensures that the encoder can verify that the received PCM samples and metadata correspond to those provided by the corresponding decoder.
转码器还可以包括PCM处理级,其被配置成对一组经解码的PCM样本和/或提取的元数据进行修改,从而产生一组第二PCM样本和相关联的第二元数据。该组第二PCM样本可以与一组经解码的PCM样本或一组经修改的PCM样本相对应。以类似的方式,第二元数据可以与提取的元数据或经修改的提取的元数据相对应。此外,PCM处理级可以被配置成将一组第二PCM样本和相关联的第二元数据传送至编码器。通过使用接收的签名值和编码器安全秘钥,编码器可以被配置成检测经解码的PCM样本和/或提取的元数据已经被PCM处理级修改。换言之,编码器可以被配置成检测第二PCM样本(由编码器接收)和第二元数据(由编码器接收)不与经解码的PCM样本和提取的元数据(由解码器提供)相对应。The transcoder may further include a PCM processing stage configured to modify a set of decoded PCM samples and/or extracted metadata to produce a set of second PCM samples and associated second metadata. The set of second PCM samples may correspond to a set of decoded PCM samples or a set of modified PCM samples. In a similar manner, the second metadata may correspond to extracted metadata or modified extracted metadata. Additionally, the PCM processing stage may be configured to communicate a second set of PCM samples and associated second metadata to the encoder. By using the received signature value and the encoder security key, the encoder may be configured to detect that the decoded PCM samples and/or the extracted metadata have been modified by the PCM processing stage. In other words, the encoder may be configured to detect that the second PCM sample (received by the encoder) and the second metadata (received by the encoder) do not correspond to the decoded PCM sample and extracted metadata (provided by the decoder) .
转码器还可以包括重新签名单元,其被配置成:针对一组第二PCM样本和相关联的第二元数据,使用重新签名的安全秘钥来确定更新的签名值。此外,重新签名单元可以被配置成将更新的签名值传送至编码器。重新签名的安全秘钥可以与解码器安全秘钥不同。编码器可以被配置成将重新签名的安全秘钥用作编码器安全秘钥。这样,编码器可以被配置成检测PCM样本和/或相关联的元数据已经被经授权的PCM处理级修改。以与本文献中描述的加密值类似的方式,可以从多个预定的安全秘钥中选择用于签名值的安全秘钥。作为示例,解码器安全秘钥可以是高度安全秘钥,而重新签名的安全秘钥可以是中等安全秘钥,从而针对在转码器的编码器处接收的PCM样本和/或相关联的元数据提供不同的信任等级。The transcoder may further include a re-signing unit configured to use the re-signed security key to determine an updated signature value for a second set of PCM samples and associated second metadata. Furthermore, the re-signing unit may be configured to communicate updated signature values to the encoder. The re-signing security key can be different from the decoder security key. The encoder can be configured to use the re-signed security key as the encoder security key. In this way, the encoder may be configured to detect that PCM samples and/or associated metadata have been modified by an authorized PCM processing level. In a similar manner to the encrypted values described in this document, the security key used to sign the value may be selected from a plurality of predetermined security keys. As an example, the decoder security key may be a high security key and the re-signed security key may be a medium security key, thereby targeting the PCM samples and/or associated elements received at the encoder of the transcoder. Data offers different levels of trust.
转码器的编码器可以包括PCM处理级,其被配置成对一组接收的PCM样本和/或接收的元数据进行修改。一组接收的PCM样本可以与一组经解码的PCM样本或一组第二PCM样本相对应。以类似的方式,接收的元数据可以与提取的元数据或第二元数据相对应。编码器可以被配置成:基于已经被编码器的PCM处理级修改的所修改的所接收的一组PCM样本和/或所修改的所接收的元数据,来生成出站内容帧和/或出站元数据帧。通过为转码器的编码器提供PCM处理级,可以确保在转码器中保持信任链(因为PCM处理在由编码器的开发人员提供的编码器中执行)。The encoder of the transcoder may include a PCM processing stage configured to modify a set of received PCM samples and/or received metadata. A set of received PCM samples may correspond to a set of decoded PCM samples or a set of second PCM samples. In a similar manner, received metadata may correspond to extracted metadata or secondary metadata. The encoder may be configured to generate outbound content frames and/or outbound content frames based on a modified set of received PCM samples and/or modified received metadata that have been modified by a PCM processing stage of the encoder. Station metadata frame. By providing the transcoder's encoder with a PCM processing stage, you ensure that the chain of trust is maintained within the transcoder (because the PCM processing is performed in the encoder provided by the encoder's developer).
如上所述,PCM连接转码器可以包括本文献中描述的特征中的任何特征。具体地,转码器的解码器可以被配置成:从入站元数据帧中识别元数据入站块。如本文献中所述,元数据入站块可以与表示元数据入站块中包括的元数据的一个或更多个属性的入站描述符相关联。该一个或更多个属性可以是本文献中描述的属性中的任意一个或更多个属性。转码器的编码器可以被配置成:至少基于入站描述符,根据入站元数据帧生成出站元数据帧。As mentioned above, the PCM connection transcoder may include any of the features described in this document. Specifically, the decoder of the transcoder may be configured to identify metadata inbound chunks from inbound metadata frames. As described in this document, a metadata inbound block may be associated with an inbound descriptor representing one or more attributes of the metadata included in the metadata inbound block. The one or more attributes may be any one or more of the attributes described in this document. The transcoder's encoder may be configured to generate outbound metadata frames from inbound metadata frames based at least on the inbound descriptor.
具体地,一个或更多个属性可以包括PCM处理参数,该PCM处理参数表示在进行了对一组PCM样本的修改和/或对提取的元数据的修改的情况下是否要由编码器丢弃入站块的元数据。在这样的情况下,转码器的编码器可以被配置成:基于PCM处理参数的值,将入站块包括或不包括在出站元数据帧中。具体地,编码器可以被配置成:即使一组PCM样本和/或提取的元数据已经被修改,但是如果PCM处理参数表示入站块的元数据不应当被丢弃,则将入站块的元数据包括在出站元数据帧中。这例如在入站块中包括的元数据独立于一组PCM样本的情形下(如在例如关于辅助数据或二进制数据的情况下)可以是有用的。Specifically, one or more attributes may include a PCM processing parameter indicating whether the incoming data is to be discarded by the encoder if a modification of a set of PCM samples and/or a modification of the extracted metadata is made. Metadata for the site block. In such cases, the transcoder's encoder may be configured to include or exclude the inbound block in the outbound metadata frame based on the value of the PCM processing parameter. Specifically, the encoder may be configured to: even if a set of PCM samples and/or extracted metadata has been modified, if the PCM processing parameters indicate that the metadata of the inbound block should not be discarded, then the metadata of the inbound block should not be discarded. The data is included in the outbound metadata frame. This may be useful for example where the metadata included in the inbound block is independent of a set of PCM samples (as in the case for example regarding ancillary data or binary data).
转码器的PCM处理级可以被配置成:向转码器的编码器提供对一个或更多个PCM处理条件的表示。该一个或更多个PCM处理条件可以表示PCM处理级如何处理了一组PCM样本和/或提取的元数据。作为示例,一个或更多个PCM处理条件可以包括以下中的一个或更多个:一组PCM样本的采样率的转换、PCM样本与系统声音的混合、提取的元数据的修改、一组PCM样本的通道配置的修改(在音频信号的情况下)、对一组PCM样本的响度的调平。然后,编码器可以被配置成:还基于一个或更多个PCM处理条件,根据入站元数据帧来生成出站元数据帧。具体地,编码器可以被配置成:基于PCM处理参数的值并且基于一个或更多个PCM处理条件,来确定是否要将入站块包括在出站元数据帧中或者不将入站块包括在出站元数据帧中。具体地,PCM处理参数可以表示:受制于一个或更多个PCM处理条件而如何对入站块进行处理。The PCM processing stage of the transcoder may be configured to provide a representation of one or more PCM processing conditions to the encoder of the transcoder. The one or more PCM processing conditions may represent how the PCM processing stage processed a set of PCM samples and/or extracted metadata. As an example, the one or more PCM processing conditions may include one or more of the following: conversion of the sample rate of a set of PCM samples, mixing of PCM samples with system sounds, modification of extracted metadata, a set of PCM Modification of the channel configuration of the samples (in the case of audio signals), leveling of the loudness of a set of PCM samples. The encoder may then be configured to generate outbound metadata frames from the inbound metadata frames based also on one or more PCM processing conditions. Specifically, the encoder may be configured to determine whether to include the inbound block in the outbound metadata frame or not to include the inbound block based on the value of the PCM processing parameter and based on one or more PCM processing conditions. In the outbound metadata frame. Specifically, PCM processing parameters may represent how inbound blocks are processed subject to one or more PCM processing conditions.
根据另一个方面,描述了一种用于将包括入站内容帧和相关联的入站元数据帧的入站比特流转码成出站比特流的方法。入站比特流可以表示信号的一组样本。该方法可以包括在解码器处将入站内容帧转换成信号的一组经解码的PCM样本。此外,该方法可以包括在解码器处从入站元数据帧中提取元数据。此外,针对一组经解码的PCM样本和提取的元数据,可以使用解码器安全秘钥来生成签名值。可以将一组经解码的PCM样本、提取的元数据和生成的签名值传送至相对应的编码器。另外,该方法可以包括在编码器处接收一组PCM样本和相关联的元数据并且接收签名值。该方法可以进行:使用编码器安全秘钥确定接收的签名值对于接收的一组PCM样本和相关联的元数据是否有效。此后,如果接收的签名有效,则可以根据接收的一组PCM样本生成出站比特流的出站内容帧并且可以根据接收的元数据来生成出站比特流的相关联的出站元数据帧。According to another aspect, a method for transcoding an inbound bitstream including an inbound content frame and an associated inbound metadata frame into an outbound bitstream is described. An inbound bitstream can represent a set of samples of a signal. The method may include converting the inbound content frame into a set of decoded PCM samples of the signal at the decoder. Additionally, the method may include extracting metadata from the inbound metadata frame at the decoder. Additionally, a decoder security key can be used to generate a signature value against a set of decoded PCM samples and extracted metadata. A set of decoded PCM samples, extracted metadata and generated signature values can be passed to the corresponding encoder. Additionally, the method may include receiving at the encoder a set of PCM samples and associated metadata and receiving a signature value. The method proceeds by using the encoder security key to determine whether the received signature value is valid for a received set of PCM samples and associated metadata. Thereafter, if the received signature is valid, an outbound content frame for the outbound bitstream may be generated based on the received set of PCM samples and an associated outbound metadata frame for the outbound bitstream may be generated based on the received metadata.
根据其他方面,单独描述了上述PCM连接转码器的解码器和编码器。解码器和/或编码器分别可以包括本文献中描述的与解码器和/或编码器相关的特征中的任何一个或更多个特征。在转码器(如上所述)中可以使用解码器和/或编码器。替代地或附加地,可以单独使用解码器和/或编码器。这样,根据另一个方面,描述了一种解码器,其被配置成:对包括入站内容帧和相关联的入站元数据帧的入站比特流进行解码。入站比特流可以表示信号的一组样本。解码器可以被配置成将入站内容帧转换成信号的一组经解码的PCM样本。此外,解码器可以被配置成从入站元数据帧中提取元数据。另外,解码器可以被配置成:针对一组经解码的PCM样本并且针对提取的元数据,使用解码器安全秘钥生成签名值。这样,可以使用签名值保护一组经解码的PCM样本和相关联的提取的元数据。一组经解码的PCM样本的接收方和相关联的提取的元数据的接收方可以使用签名值,来验证一组经解码的PCM样本和/或相关联的提取的元数据是否已经以未经授权的方式被修改。接收方可以是被配置成将一组经解码的PCM样本和相关联的提取的元数据重新编码成出站比特流的编码器。因此,解码器可以被配置成将一组经解码的PCM样本、提取的元数据和生成的签名值发送至编码器以用于重新编码。According to other aspects, the decoder and encoder of the above-mentioned PCM connection transcoder are described separately. The decoder and/or encoder respectively may comprise any one or more of the features described in this document in relation to decoders and/or encoders. Decoders and/or encoders may be used in transcoders (as described above). Alternatively or additionally, the decoder and/or encoder may be used separately. Thus, according to another aspect, a decoder is described that is configured to decode an inbound bitstream including an inbound content frame and an associated inbound metadata frame. An inbound bitstream can represent a set of samples of a signal. The decoder may be configured to convert the inbound content frame into a set of decoded PCM samples of the signal. Additionally, the decoder may be configured to extract metadata from inbound metadata frames. Additionally, the decoder may be configured to generate a signature value using the decoder security key for the set of decoded PCM samples and for the extracted metadata. In this way, a set of decoded PCM samples and associated extracted metadata can be protected using the signature value. Recipients of a set of decoded PCM samples and associated extracted metadata may use the signature value to verify that a set of decoded PCM samples and/or associated extracted metadata have been processed in an unspecified format. The authorization method has been modified. The receiver may be an encoder configured to re-encode a set of decoded PCM samples and associated extracted metadata into an outbound bitstream. Therefore, the decoder may be configured to send a set of decoded PCM samples, extracted metadata and generated signature values to the encoder for re-encoding.
根据另一个方面,描述了一种编码器,其被配置成:对包括出站内容帧和相关联的出站元数据帧的出站比特流进行编码。编码器可以被配置成:接收一组PCM样本和相关联的元数据并且接收该组PCM样本和相关联的元数据的签名值。所接收的一组PCM样本可以与上述一组经解码的PCM样本相对应(或可以从其得到)。以类似的方式,所接收的一组相关联的元数据可以与上述一组提取的元数据相对应(或可以从其得到)。使用上述一组经解码的PCM样本和提取的元数据可以确定(例如,在解码器处)接收的签名值。According to another aspect, an encoder is described that is configured to encode an outbound bitstream including an outbound content frame and an associated outbound metadata frame. The encoder may be configured to receive a set of PCM samples and associated metadata and receive a signature value for the set of PCM samples and associated metadata. The received set of PCM samples may correspond to (or may be derived from) the set of decoded PCM samples described above. In a similar manner, the received set of associated metadata may correspond to (or may be derived from) the set of extracted metadata described above. The received signature value can be determined (eg, at the decoder) using the set of decoded PCM samples and extracted metadata described above.
编码器可以被配置成:使用编码器安全秘钥来验证接收的签名值对于接收的一组PCM样本和相关联的元数据是否有效。此外,编码器可以被配置成:如果接收的签名有效,则根据接收的一组PCM样本生成出站比特流的出站内容帧并且根据接收的元数据生成出站比特流的相关联的出站元数据帧。The encoder may be configured to use the encoder security key to verify that the received signature value is valid for a received set of PCM samples and associated metadata. Furthermore, the encoder may be configured to, if the received signature is valid, generate an outbound content frame of the outbound bitstream based on the received set of PCM samples and generate an associated outbound content frame of the outbound bitstream based on the received metadata. Metadata frame.
根据另一个方面,描述了一种用于对包括入站内容帧和相关联的入站元数据帧的入站比特流进行解码的方法。入站比特流可以表示信号的一组样本。该方法可以包括:将入站内容帧转换成信号的一组经解码的PCM样本,并且从入站元数据帧提取元数据。此外,该方法可以包括:针对一组经解码的PCM样本并且针对提取的元数据,使用解码器安全秘钥生成签名值。另外,该方法可以包括:将一组经解码的PCM样本、提取的元数据和生成的签名值提供至编码器以用于重新编码。According to another aspect, a method for decoding an inbound bitstream including an inbound content frame and an associated inbound metadata frame is described. An inbound bitstream can represent a set of samples of a signal. The method may include converting the inbound content frame into a set of decoded PCM samples of the signal and extracting metadata from the inbound metadata frame. Furthermore, the method may include generating a signature value using a decoder security key for the set of decoded PCM samples and for the extracted metadata. Additionally, the method may include providing the set of decoded PCM samples, extracted metadata and generated signature values to the encoder for re-encoding.
根据另一个方面,描述了一种用于对包括出站内容帧和相关联的出站元数据帧的出站比特流进行编码的方法。该方法可以包括:接收一组PCM样本和相关联的元数据并且接收该组PCM样本和相关联的元数据的签名值。此外,该方法可以包括:使用编码器安全秘钥来验证接收的签名值对于接收的一组PCM样本和相关联的元数据是否有效。该方法可以进行:如果接收的签名有效,则根据接收的一组PCM样本生成出站比特流的出站内容帧并且根据接收的元数据生成出站比特流的相关联的出站元数据帧。According to another aspect, a method for encoding an outbound bitstream including an outbound content frame and an associated outbound metadata frame is described. The method may include receiving a set of PCM samples and associated metadata and receiving a signature value for the set of PCM samples and associated metadata. Additionally, the method may include using the encoder security key to verify that the received signature value is valid for the received set of PCM samples and associated metadata. The method may proceed by, if the received signature is valid, generating an outbound content frame of the outbound bitstream based on the received set of PCM samples and generating an associated outbound metadata frame of the outbound bitstream based on the received metadata.
根据本申请的一个方面,提供了一种编码方法,包括:对当前内容帧编码;生成针对相关联的元数据帧的保护字段,相关联的元数据帧与当前内容帧相关联;对相关联的元数据帧编码,包括保护字段;以及将当前内容帧和相关联的元数据帧包括在输出比特流中,其中:生成保护字段涉及生成一个或更多个加密值;一个或更多个加密值中的至少一个是表示当前内容帧的真实性的帧加密值;以及通过将单向函数应用于包括当前内容帧和相关联的元数据帧的一组帧而生成帧加密值。According to one aspect of the present application, an encoding method is provided, including: encoding the current content frame; generating a protection field for an associated metadata frame, the associated metadata frame being associated with the current content frame; encoding the metadata frame, including the protection field; and including the current content frame and the associated metadata frame in the output bitstream, wherein: generating the protection field involves generating one or more encrypted values; one or more encrypted values At least one of the values is a frame encryption value that represents authenticity of the current content frame; and the frame encryption value is generated by applying a one-way function to a set of frames including the current content frame and associated metadata frames.
根据本申请的另一个方面,提供了一种包括一个或更多个硬件元件的编码装置,该编码装置被配置用于:对当前内容帧编码;生成针对相关联的元数据帧的保护字段,相关联的元数据帧与当前内容帧相关联;对相关联的元数据帧编码,包括保护字段;以及将当前内容帧和相关联的元数据帧包括在输出比特流中,其中:生成保护字段涉及生成一个或更多个加密值;一个或更多个加密值中的至少一个是表示当前内容帧的真实性的帧加密值;以及通过将单向函数应用于包括当前内容帧和相关联的元数据帧的一组帧而生成帧加密值。According to another aspect of the present application, there is provided an encoding device comprising one or more hardware elements, the encoding device being configured to: encode a current content frame; generate a protection field for an associated metadata frame, associating the associated metadata frame with the current content frame; encoding the associated metadata frame, including a guard field; and including the current content frame and the associated metadata frame in the output bitstream, wherein: generating the guard field Involves generating one or more encrypted values; at least one of the one or more encrypted values being a frame encrypted value representing the authenticity of a current content frame; and by applying a one-way function to a frame including the current content frame and an associated A frame encryption value is generated from a set of frames of metadata frames.
根据本申请的另一个方面,提供了一种用于将入站比特流转码成出站比特流的方法,入站比特流包括第一入站内容帧和相关联的第一入站元数据帧,该方法包括:在解码器处:将第一入站内容帧转换成第一组经解码的脉冲编码调制(PCM)样本;从第一入站元数据帧中提取第一元数据;在第一元数据中识别第一元数据入站块,第一元数据入站块与表示第一元数据的一个或更多个属性的第一入站描述符相关联,一个或更多个属性包括PCM处理参数,PCM处理参数表示在进行了对所述第一组经解码的PCM样本的修改、对提取的第一元数据的修改、或者对所述第一组经解码的PCM样本和提取的第一元数据二者的修改的情况下,第一元数据入站块的元数据是否要被编码器丢弃;基于第一组经解码的PCM样本和第一元数据来生成帧签名值;以及将第一组经解码的PCM样本、第一元数据和帧签名值传送至对应的编码器;以及在编码器处:接收第一组经解码的PCM样本、第一元数据和帧签名值;确定帧签名值对于第一组经解码的PCM样本和第一元数据是否有效;以及至少部分地基于帧签名值是否有效来确定:是否根据第一组经解码的PCM样本生成出站比特流的第一出站内容帧;以及是否至少部分地基于第一入站描述符、根据第一入站元数据帧生成出站比特流的相关联的第一出站元数据帧。According to another aspect of the present application, a method for transcoding an inbound bitstream into an outbound bitstream is provided, the inbound bitstream comprising a first inbound content frame and an associated first inbound metadata frame , the method includes: at the decoder: converting the first inbound content frame into a first set of decoded pulse code modulation (PCM) samples; extracting first metadata from the first inbound metadata frame; A first inbound block of metadata is identified in the metadata, the first inbound block of metadata being associated with a first inbound descriptor representing one or more attributes of the first metadata, the one or more attributes including PCM processing parameters, the PCM processing parameters represent the modification of the first group of decoded PCM samples, the modification of the extracted first metadata, or the modification of the first group of decoded PCM samples and the extracted whether the metadata of the first metadata inbound block is to be discarded by the encoder if both the first metadata and the first metadata are modified; generating a frame signature value based on the first set of decoded PCM samples and the first metadata; and transmitting the first set of decoded PCM samples, the first metadata and the frame signature value to the corresponding encoder; and at the encoder: receiving the first set of decoded PCM samples, the first metadata and the frame signature value; Determining whether the frame signature value is valid for the first set of decoded PCM samples and the first metadata; and determining, based at least in part on whether the frame signature value is valid: whether to generate an outbound bitstream based on the first set of decoded PCM samples. a first outbound content frame; and whether an associated first outbound metadata frame of the outbound bitstream is generated from the first inbound metadata frame based at least in part on the first inbound descriptor.
根据本申请的另一个方面,提供了一种转码器,被配置成:将包括入站内容帧和相关联的入站元数据帧的入站比特流转码成包括出站内容帧和相关联的出站元数据帧的出站比特流;其中,入站内容帧表示根据第一编解码器系统编码的信号,并且其中,出站内容帧表示根据第二编解码器系统编码的信号;其中,转码器被配置成:从入站元数据帧中识别元数据入站块;其中,元数据入站块与表示元数据入站块中包括的元数据的一个或更多个属性的入站描述符相关联;以及基于入站描述符根据入站元数据帧生成出站元数据帧。According to another aspect of the present application, a transcoder is provided, configured to: transcode an inbound bitstream including an inbound content frame and an associated inbound metadata frame to include an outbound content frame and an associated an outbound bitstream of outbound metadata frames; wherein the inbound content frame represents a signal encoded in accordance with a first codec system, and wherein the outbound content frame represents a signal encoded in accordance with a second codec system; wherein , the transcoder is configured to: identify a metadata inbound block from the inbound metadata frame; wherein the metadata inbound block is associated with an inbound block representing one or more attributes of the metadata included in the metadata inbound block. the inbound descriptor is associated; and the outbound metadata frame is generated from the inbound metadata frame based on the inbound descriptor.
根据另一个方面,描述了一种软件程序。该软件程序可能适用于在处理器上执行并且适用于当在处理器上执行时执行本文献中描述的方法步骤。According to another aspect, a software program is described. The software program may be adapted for execution on a processor and may be adapted to perform the method steps described in this document when executed on the processor.
根据另一个方面,描述了一种存储介质。该存储介质可以包括软件程序,该软件程序适用于在处理器上执行并且适用于当在处理器上执行时执行本文献中描述的方法步骤。According to another aspect, a storage medium is described. The storage medium may comprise a software program adapted for execution on a processor and adapted to perform the method steps described in this document when executed on the processor.
根据另一个方面,描述了一种计算机程序产品。该计算机程序可以包括可执行的指令,该可执行的指令用于当在计算机上执行时执行本文献中描述的方法步骤。According to another aspect, a computer program product is described. The computer program may comprise executable instructions for performing the method steps described in this document when executed on a computer.
应当注意,如本专利申请中描述的包括其优选实施方式的方法和系统可以单独使用或者与本文献中公开的其他方法和系统结合使用。此外,可以对本专利申请中描述的方法和系统的所有方面进行任意组合。具体地,可以以任意方式将本申请中的特征彼此组合。It should be noted that the methods and systems as described in this patent application, including preferred embodiments thereof, may be used alone or in combination with other methods and systems disclosed in this document. Furthermore, all aspects of the methods and systems described in this patent application may be combined in any way. In particular, the features of the present application can be combined with each other in any way.
附图说明Description of drawings
下面参照附图以示例性方式对本发明进行了说明,在附图中:The invention is described below by way of example with reference to the accompanying drawing, in which:
图1a示出了包括转码器的示例音频内容分发链的框图;Figure 1a shows a block diagram of an example audio content distribution chain including a transcoder;
图1b示出了元数据帧的示例结构;Figure 1b shows an example structure of a metadata frame;
图2a和图2b示出了元数据帧中使用的时间戳属性的示例;Figures 2a and 2b show examples of timestamp attributes used in metadata frames;
图3a和图3b示出了元数据帧中使用的去复制属性的示例;Figures 3a and 3b show examples of deduplication attributes used in metadata frames;
图4a和图4b示出了元数据帧中使用的复制属性的示例;以及Figures 4a and 4b show examples of copy attributes used in metadata frames; and
图5a至图5d和图6示出了示例PCM连接转码器。Figures 5a-5d and 6 illustrate an example PCM connected transcoder.
具体实施方式Detailed ways
如背景部分中所描述的,音频内容通常与元数据相关联,并且将音频内容编码至包括编码内容帧的序列和相关联的元数据容器(也称为元数据帧)的序列的联合比特流中。图1示出了音频内容的示例分发系统100的框图。本文献中描述的方法和系统是在音频内容的背景下概述的。然而,应当注意,该方法和系统适用于其他类型的内容如视频内容。从更一般的意义上来说,本文献中描述的方法和系统适用于多媒体内容如音频和/或视频,其中,多媒体内容与元数据相关联。As described in the background section, audio content is typically associated with metadata, and the audio content is encoded into a joint bitstream that includes a sequence of encoded content frames and a sequence of associated metadata containers (also called metadata frames) middle. Figure 1 illustrates a block diagram of an example distribution system 100 for audio content. The methods and systems described in this document are outlined in the context of audio content. However, it should be noted that the method and system are applicable to other types of content such as video content. In a more general sense, the methods and systems described in this document are suitable for multimedia content such as audio and/or video, where the multimedia content is associated with metadata.
分发系统100包括编码器101,其被配置成对音频内容进行编码并且提供编码比特流110(也称为第一编码比特流110或入站编码比特流110)。通常,第一编码比特流110包括编码内容帧111序列,其中,编码内容帧111可以与相对应的元数据帧112相关联。编码器101被配置成:提供根据第一音频编解码器系统编码的第一编码比特流110。第一音频编解码器系统可以是例如以下中之一:杜比E、杜比数字+、杜比数字、杜比真正高清(Dolby TrueHD)、杜比睿波、AAC和/或HE-AAC。内容帧111可以表示或者可以包括预定数量的音频内容的样本,例如1536、1024或512个音频内容的样本。Distribution system 100 includes an encoder 101 configured to encode audio content and provide an encoded bitstream 110 (also referred to as first encoded bitstream 110 or inbound encoded bitstream 110). Typically, the first encoded bitstream 110 includes a sequence of encoded content frames 111 , where the encoded content frames 111 may be associated with corresponding metadata frames 112 . The encoder 101 is configured to provide a first encoded bitstream 110 encoded according to a first audio codec system. The first audio codec system may be, for example, one of: Dolby E, Dolby Digital Plus, Dolby Digital, Dolby TrueHD, Dolby Pulse, AAC and/or HE-AAC. Content frame 111 may represent or may include a predetermined number of samples of audio content, such as 1536, 1024, or 512 samples of audio content.
第一编码比特流110经由传输介质或经由存储介质102提供至转码器103。转码器103被配置成:将第一编码比特流110转码或者转换成第二编码比特流120(也称为出站比特流120),其中,第二编码比特流120是根据第二音频编解码器系统编码的。第二音频编解码器系统可以与第一音频编解码器系统不同。另一方面,第二音频编解码器系统可以与第一音频编解码器相同,但是使用不同的配置,例如不同的比特率、不同的帧速率和/或不同的通道配置。第二音频编解码器系统可以是例如以下中之一:杜比E、杜比数字+、杜比数字、杜比真正高清(Dolby True HD)、杜比睿波、AAC和/或HE-AAC。以与第一编码比特流110类似的方式,第二编码比特流120包括内容帧121序列和相对应的元数据帧122序列。第二编码比特流120的内容帧121可以具有与第一编码比特流110的内容帧111的帧大小不同的帧大小。可以将第二编码比特流120提供至解码器104以用于渲染音频内容。The first encoded bitstream 110 is provided to the transcoder 103 via a transmission medium or via a storage medium 102 . The transcoder 103 is configured to transcode or convert the first encoded bitstream 110 into a second encoded bitstream 120 (also referred to as the outbound bitstream 120), wherein the second encoded bitstream 120 is based on the second audio Encoded by the codec system. The second audio codec system may be different from the first audio codec system. On the other hand, the second audio codec system may be the same as the first audio codec, but use a different configuration, such as a different bit rate, a different frame rate, and/or a different channel configuration. The second audio codec system may be, for example, one of: Dolby E, Dolby Digital Plus, Dolby Digital, Dolby True HD, Dolby Pulse, AAC, and/or HE-AAC. In a similar manner to the first encoded bitstream 110, the second encoded bitstream 120 includes a sequence of content frames 121 and a corresponding sequence of metadata frames 122. The content frame 121 of the second encoded bitstream 120 may have a different frame size than the frame size of the content frame 111 of the first encoded bitstream 110 . The second encoded bitstream 120 may be provided to the decoder 104 for rendering audio content.
元数据帧112、122可以具有预定的结构。换言之,元数据帧112、122可以遵循预定的语法。作为示例,元数据帧112、122可以遵循表1中所示的所谓的演化帧语法。可以例如在标准多媒体内容编解码器系统如杜比视频广播(DVB)系统和/或运动图像专家组(MPEG)编解码器系统的背景下使用演化帧语法。应当注意,表1中所示的元数据帧语法和后续的表仅仅是示例。可以对语法进行变型或修改。具体地,本文献中所示的语法可以通过例如用于提供附加功能的附加字段进行扩展。Metadata frames 112, 122 may have a predetermined structure. In other words, the metadata frames 112, 122 may follow a predetermined syntax. As an example, the metadata frames 112, 122 may follow the so-called evolved frame syntax shown in Table 1. The evolved frame syntax may be used, for example, in the context of standard multimedia content codec systems such as the Dolby Video Broadcast (DVB) system and/or the Moving Picture Experts Group (MPEG) codec system. It should be noted that the metadata frame syntax shown in Table 1 and the subsequent tables are only examples. Variations or modifications can be made to the grammar. In particular, the syntax shown in this document can be extended by, for example, additional fields to provide additional functionality.
表1Table 1
表1中所示的演化帧的参数的语义可以如下:The semantics of the parameters of the evolving frame shown in Table 1 can be as follows:
●key_id可以是用于散列(即,用于计算protection()字段的protection_bits)的加密密钥的标识符。• key_id may be the identifier of the encryption key used for hashing (i.e., used to calculate the protection_bits of the protection() field).
●payload_id可以是跟随的应用的有效载荷的标识符;payload_idEND=“0000b”可以具有如下意义:在该evo_frame()中不包含任何另外的有效载荷;● payload_id can be the identifier of the payload of the following application; payload_idEND="0000b" can have the following meaning: no additional payload is included in this evo_frame();
●payload_size可以表示跟随的有效载荷字段中的字节数。●payload_size can represent the number of bytes in the following payload field.
演化帧语法指定可以包括多个元数据块的元数据帧112、122,其中,元数据块也称为有效载荷。这样,元数据帧112、122可以包括0个元数据块、一个或更多个元数据块,其中,每个元数据块表示特定元数据类型和/或特定元数据方面。元数据类型的示例有:The evolved frame syntax specifies a metadata frame 112, 122 that may include multiple metadata chunks, also referred to as payloads. As such, metadata frames 112, 122 may include zero metadata blocks, one or more metadata blocks, where each metadata block represents a specific metadata type and/or a specific metadata aspect. Examples of metadata types are:
●描述性元数据,其描述与元数据帧112相关联(例如,拍子信息和/或和声信息)的内容帧111的特定方面;• Descriptive metadata that describes specific aspects of the content frame 111 associated with the metadata frame 112 (e.g., tempo information and/or harmony information);
●不相关元数据,该不相关元数据包括不直接与内容帧111相关的辅助数据(如编码音频内容的目标解码器的固件升级);● Irrelevant metadata, which includes ancillary data not directly related to the content frame 111 (such as firmware upgrades of the target decoder encoding the audio content);
●控制元数据,可以用于对与元数据帧112相关联的内容帧111的一个或更多个样本的渲染进行控制(例如,内容帧111的一个或更多个样本的响度值)。- Control metadata that may be used to control the rendering of one or more samples of content frame 111 associated with metadata frame 112 (eg, the loudness value of one or more samples of content frame 111).
这样,当需要时,元数据帧112提供可以通过附加元数据块进行扩展的灵活结构,以描述编码音频内容的附加特性或者以便传送比特流110中的附加辅助数据。在无元数据要连同内容帧111一起传输的情况下,元数据帧112可以不包括元数据块,在表1的演化帧的语法中,这可以由与预定的“END”ID(标识符)相对应的payload_id来表示。In this way, the metadata frame 112 provides a flexible structure that can be extended by additional metadata blocks when needed to describe additional characteristics of the encoded audio content or to convey additional ancillary data in the bitstream 110. In the case that no metadata is to be transmitted along with the content frame 111, the metadata frame 112 may not include a metadata block. In the syntax of the evolved frame of Table 1, this may be represented by a predetermined "END" ID (identifier). Represented by the corresponding payload_id.
在本文献中,提出了将描述符添加至元数据块,其中,描述符描述相关联的元数据块中包括的元数据的一个或更多个特性或属性。在表1中所示的演化帧的语法中,将该描述符称为“payload_config()”。然后,转码器可以使用描述符来执行对元数据块的高效转码,而无需分析相关联的元数据块中包括的元数据。因此,可以显著降低元数据转码的复杂度。In this document, it is proposed to add descriptors to metadata blocks, wherein the descriptors describe one or more characteristics or attributes of the metadata included in the associated metadata block. In the syntax of the evolved frame shown in Table 1, this descriptor is called "payload_config()". The transcoder can then use the descriptor to perform efficient transcoding of the metadata chunk without parsing the metadata included in the associated metadata chunk. Therefore, the complexity of metadata transcoding can be significantly reduced.
换言之,本文献描述了如下方法:将元数据帧112内(例如,在表1中所示的演化帧内)的元数据块(也称为有效载荷)从一种编码比特流转码成另一种比特流。转码操作可以由每个有效载荷内的特定字段(例如,如表1中所示,元数据块的字段“payload_config()”)引导。然后,可以指定转码操作,以使得可以将各自的有效载荷适当地从一种编码流转码至另一种编码流,而无需提取或解析元数据块的隐含的元数据参数的本质(即,无需提取或解析隐含的有效载荷的本质)。In other words, this document describes a method of transcoding a block of metadata (also called a payload) within a metadata frame 112 (eg, within the evolved frame shown in Table 1) from one encoded bitstream to another. kind of bit stream. Transcoding operations may be directed by specific fields within each payload (for example, the field "payload_config()" of the metadata block as shown in Table 1). Transcoding operations can then be specified such that the respective payloads can be appropriately transcoded from one encoding stream to another without having to extract or parse the nature of the metadata chunk's implicit metadata parameters (i.e. , without extracting or parsing the underlying payload's essence).
图1b示出了元数据帧130的示例结构(例如,元数据帧112)。元数据帧130可以包括帧头部131,帧头部131表示关于元数据帧130的结构和元数据帧130与编码比特流110的内容帧111之间的关联的一般信息。帧头部131可以包括表1的演化帧中的与帧的有效载荷不相关的一些或所有字段。此外,元数据帧130可以包括一个或更多个元数据块140(也称为元数据有效载荷140)。元数据块140可以包括块头部141,块头部141可以表示元数据块140的大小(在表1中称为payload_size)。此外,元数据块140可以包括描述符142(在表1中称为payload_config()),其中,描述符142可以表示元数据的类型和/或元数据的一个或更多个属性,元数据包括在元数据块140的数据字段143(即,表1中所示的payload())中。Figure Ib shows an example structure of metadata frame 130 (eg, metadata frame 112). The metadata frame 130 may include a frame header 131 representing general information regarding the structure of the metadata frame 130 and the association between the metadata frame 130 and the content frame 111 of the encoded bitstream 110 . The frame header 131 may include some or all fields in the evolved frame of Table 1 that are not related to the payload of the frame. Additionally, metadata frame 130 may include one or more metadata blocks 140 (also referred to as metadata payload 140). The metadata block 140 may include a block header 141, which may represent the size of the metadata block 140 (referred to as payload_size in Table 1). Additionally, the metadata block 140 may include a descriptor 142 (referred to as payload_config() in Table 1), where the descriptor 142 may represent a type of metadata and/or one or more attributes of the metadata, the metadata including In data field 143 of metadata block 140 (ie, payload() shown in Table 1).
表2中示出了演化帧的元数据块140的示例描述符142,即示例“payload_config()”字段。能够看出描述符142可以包括或者可以表示块140中包括的元数据的一个或更多个属性。在表2的示例中,属性有:An example descriptor 142, ie, an example "payload_config()" field, for the metadata block 140 of the evolved frame is shown in Table 2. It can be seen that descriptor 142 may include or may represent one or more attributes of the metadata included in block 140 . In the example in Table 2, the attributes are:
●块140的元数据能够适用于的表示音频内容的样本的时间戳参数。时间戳可以表示与块140的元数据帧112相关联的内容帧111中包括的样本。替代地或附加地,时间戳可以被配置成具有足够大的值,以表示与块140的元数据帧112相关联的内容帧111之后的内容帧中包括的样本。• The metadata of block 140 can be adapted to timestamp parameters representing samples of audio content. The timestamp may represent a sample included in the content frame 111 associated with the metadata frame 112 of block 140 . Alternatively or additionally, the timestamp may be configured to have a value large enough to represent samples included in the content frame subsequent to the content frame 111 associated with the metadata frame 112 of block 140 .
●块140的元数据能够适用于的表示数个样本(开始于由时间戳表示的样本)的持续时间参数。• The metadata of block 140 can be adapted to a duration parameter representing a number of samples (starting from a sample represented by a timestamp).
●转码标记(在表2中称为“不转码”标记),该转码标记向转码器提供如下指令:是否要对元数据块140进行转码。如果设置了“不转码”标记,则当对入站比特流110进行转码时,转码器可以简单地忽略或移除元数据块140。这在元数据仅与入站比特流110的第一编解码器系统有关的情况下可以有用,而对比特流110可以被转码至的任何其他编解码器系统毫无意义(如在例如对入站比特流110中包括的数据生成循环冗余校验(CRC)的情况下。通常,只有当编码数据未被修改时CRC才有意义,所以无需对CRC进行转码)。从更一般的意义上而言,可以使用转码标记来识别仅在对转码器内的入站比特流的解码处理期间有用(从而不要求用于生成出站比特流的随后的重新编码处理)的元数据。• A transcoding flag (referred to as the "do not transcode" flag in Table 2) that provides instructions to the transcoder as to whether the metadata block 140 is to be transcoded. If the "do not transcode" flag is set, the transcoder may simply ignore or remove the metadata chunk 140 when transcoding the inbound bitstream 110. This may be useful where the metadata is relevant only to the first codec system of the inbound bitstream 110 and is meaningless to any other codec system to which the bitstream 110 may be transcoded (as in e.g. The data included in the inbound bitstream 110 generates a cyclic redundancy check (CRC). Typically, the CRC is only meaningful if the encoded data has not been modified, so there is no need to transcode the CRC). In a more general sense, transcoding flags can be used to identify objects that are only useful during the decoding process of the inbound bitstream within a transcoder (thus not requiring a subsequent recoding process for generating the outbound bitstream. ) metadata.
●复制标记,该复制标记向转码器提供如下指令:当内容帧111的大小在转码之前和转码之后不同时,是否复制块140中包括的元数据。• Copy flag that provides instructions to the transcoder whether to copy the metadata included in block 140 when the size of the content frame 111 is different before and after transcoding.
●去复制标记,该去复制标记向转码器提供如下指令:当内容帧111的大小在转码之前和转码之后不同时,是否将块140中包括的元数据的副本移除。• Deduplication flag that provides instructions to the transcoder whether to remove copies of the metadata included in block 140 when the size of the content frame 111 is different before and after transcoding.
●优先级参数,该优先级参数提供对块140中包括的元数据的相对重要性的表示。例如,如果相对于第一比特流110的比特率而言,经转码的第二比特流120的允许的比特率降低,则转码器可以使用优先级参数从元数据帧130中选择一个或更多个块140。• A priority parameter that provides an indication of the relative importance of the metadata included in block 140. For example, if the allowed bitrate of the transcoded second bitstream 120 is reduced relative to the bitrate of the first bitstream 110 , the transcoder may use the priority parameter to select one or more from the metadata frame 130 More blocks 140.
●关联标记(在表2中称为“now_or_never”标记),该关联标记向转码器提供如下指示:块140中包括的元数据是否与相对应的内容帧111相关联。这样,如果设置了“now_or_never”标记,则转码器了解到如下事实:块140中包括的元数据应该立即被转码还是应该被丢弃(因为“now_or_never”标记表示:如果元数据被延迟,则解码器不能使用元数据)。• Association flag (referred to as the "now_or_never" flag in Table 2) that provides an indication to the transcoder whether the metadata included in block 140 is associated with the corresponding content frame 111. This way, if the "now_or_never" flag is set, the transcoder learns the fact that the metadata included in block 140 should be transcoded immediately or should be discarded (because the "now_or_never" flag indicates that if the metadata is delayed, then The decoder cannot use metadata).
表2Table 2
换言之,表2中所示的描述符142的属性参数的语义可以如下:In other words, the semantics of the attribute parameters of descriptor 142 shown in Table 2 can be as follows:
●时间戳参数,表示样本从所讨论的有效载荷143所属的内容帧111的开始起的偏移;• a timestamp parameter representing the offset of the sample from the beginning of the content frame 111 to which the payload 143 in question belongs;
●持续时间参数,表示所讨论的有效载荷143对于其仍然有效的样本中的时间;● a duration parameter, indicating the time in the sample for which the payload 143 in question is still valid;
●dont_transcode标记,该标记标志着:当转码(标记=1)时,是否必须丢弃所讨论的有效载荷143或者是否可以发生转码(标记=0);● dont_transcode flag, which marks whether the payload in question 143 must be discarded when transcoded (flag=1) or whether transcoding can occur (flag=0);
●复制标记,当被设置成1时,该标记标志着:在转码期间需要复制所讨论的有效载荷143,以使得有效载荷143出现在时间戳与时间戳加持续时间之间的经转码的块140中。可以例如针对响度数据来设置复制标记以表示帧具有相同的对白标准化。从更一般的意义上来说,可以针对不具有时间观念的元数据来设置复制标记。通常,不针对自身支持时间概念的数据像例如编解码器的比特流来设置复制标记。换言之,可以不为内部地被定时的元数据提供所设置的复制标记和/或去复制标记,其中,术语“内部地被定时”意味着仅元数据块的确切序列是有意义的,即,复制或去复制将使元数据无效。内部地被定时的元数据的示例是被嵌入至元数据帧序列的元数据块序列中的不同的比特流(与内容帧中包括的内容不同)。绝不应该对这样的比特流的有效载荷进行复制或去复制。否则,比特流将被部分地重复或被部分地切割。内部地被定时的数据的另一示例是二进制数据如可执行程序。如果在多个元数据帧的多个元数据块中传输这样的二进制数据,则元数据块的复制或去复制将使二进制数据的意义无效。● Copy flag, when set to 1, this flag signals that the payload 143 in question needs to be copied during transcoding such that the payload 143 appears in the transcoded time between timestamp and timestamp plus duration in block 140. A copy flag may be set, for example, for loudness data to indicate that the frames have the same dialogue normalization. In a more general sense, replication flags can be set for time-agnostic metadata. Typically, the copy flag is not set for data that itself supports the concept of time, like the bitstream of a codec for example. In other words, metadata that is internally timed may not be provided with a set replication flag and/or deduplication flag, where the term "internally timed" means that only the exact sequence of metadata blocks is meaningful, i.e. Copying or deduplication will invalidate the metadata. An example of internally timed metadata is a different bitstream (different from the content included in the content frame) embedded in a sequence of metadata blocks of a sequence of metadata frames. The payload of such a bitstream should never be copied or decopied. Otherwise, the bitstream will be partially repeated or partially cut. Another example of internally timed data is binary data such as executable programs. If such binary data is transmitted in multiple metadata chunks across multiple metadata frames, copying or deduplication of the metadata chunks will invalidate the meaning of the binary data.
●去复制标记:去复制标记可以确保:在转码期间,除了使该标记被设置成1的第一元数据块以外,可以删除同一出站元数据帧中的具有特定id的每个元数据块。可以例如针对无需对每个出站元数据帧122都呈现多次的响度数据如对白标准化来设置去复制标记。● Deduplication flag: The deduplication flag ensures that during transcoding, every metadata with a specific id in the same outbound metadata frame is deleted except for the first metadata block where the flag is set to 1. piece. Deduplication flags may be set, for example, for loudness data such as dialogue normalization that does not need to be presented multiple times for each outbound metadata frame 122 .
●“now_or_never”标记,其表示:在转码时不必对有效载荷进行延迟。● The "now_or_never" flag, which indicates that the payload does not need to be delayed while transcoding.
●PCM处理参数,在表2中将其称为“tight_coupling”参数。在对内容帧中包括的信号的样本进行修改的情况下,例如在如下所述的PCM连接转码器的背景下可以使用PCM处理参数,以向PCM连接转码器通知:如何处理与特定内容帧相关联的特定元数据帧的元数据。下面在描述PCM连接转码器的功能时进一步详细描述了PCM处理参数的功能。● PCM processing parameters, referred to as "tight_coupling" parameters in Table 2. In the case of modifications to the samples of the signal included in the content frame, for example in the context of a PCM connection transcoder as described below, PCM processing parameters may be used to inform the PCM connection transcoder how to process information related to a specific content Metadata for the specific metadata frame associated with the frame. The function of PCM processing parameters is described in further detail below when describing the function of PCM connecting to the transcoder.
表3table 3
表3示出了元数据块140的示例数据字段143的语法。Table 3 shows the syntax of example data fields 143 of metadata block 140.
如上所述,用于携带元数据(即,包括元数据块140的元数据帧130)的比特流语法可以限定一般元数据属性(例如,包括在描述符142中,即在表2中所示的payload_config()字段中)。即使第一编解码器(用于对入站比特流110进行编码)和第二编解码器(用于对出站比特流120进行编码)使用不同的成帧,这些属性仍然使得能够将元数据从一个入站(即第一)比特流110简单拷贝至出站(即第二)比特流120。执行元数据的拷贝的该方式由描述符142中包括的属性来引导。在转码处理期间,可能需要改变的唯一事物可能是属性本身。然而,对描述符142中包括的属性的修改并不要求关于块140的数据字段143中包括的元数据的实际意义的知识。As discussed above, the bitstream syntax for carrying metadata (i.e., metadata frame 130 including metadata block 140) may define general metadata attributes (e.g., included in descriptor 142, i.e., as shown in Table 2 payload_config() field). These properties enable metadata to be encoded even if the first codec (used to encode the inbound bitstream 110) and the second codec (used to encode the outbound bitstream 120) use different framing. A simple copy is made from an inbound (i.e. first) bitstream 110 to an outbound (i.e. second) bitstream 120. The manner in which the copy of metadata is performed is guided by the attributes included in the descriptor 142 . The only thing that may need to change during the transcoding process may be the properties themselves. However, modification of the attributes included in the descriptor 142 does not require knowledge about the actual meaning of the metadata included in the data field 143 of the block 140 .
在下面,更详细地描述表2中所示的示例属性。具体地,描述了转码器103可以如何使用由描述符142表示的一个或更多个属性来执行对元数据块140中包括的元数据的高效转码。Below, the example properties shown in Table 2 are described in more detail. Specifically, how transcoder 103 may use one or more attributes represented by descriptor 142 to perform efficient transcoding of metadata included in metadata block 140 is described.
图2a和图2b示出了元数据块140的描述符142中包括的时间戳参数的使用。在图2a中,示出了:当将元数据从第一比特流110转码成第二比特流120时,转码器103可以如何更新时间戳参数201。在示出的示例中,时间戳参数201表示特定样本202相对于相关联的内容帧111(即,相对于最近的样本)的末端的位置。这样,时间戳参数201表示样本202相对于内容帧111中包括的最近的样本的“延迟”。在图2所示的示例中,第二比特流120的内容帧121具有与第一比特流110的内容帧111不同的大小,具体地,第二比特流120的内容帧121的大小大于第一比特流110的内容帧111的大小。因此,与第一比特流110的内容帧111内的相对位置相比,特定样本202可以位于第二比特流120的内容帧121内的不同的相对位置处。具体地,与相对于入站内容帧111中包括的最近的样本相比较,特定样本202可以表现出相对于出站内容帧121中包括的最近的样本不同的“延迟”。因此,当被插入至第二比特流120的元数据帧122中时,可能需要修改第一比特流110的元数据帧112中包括的时间戳参数201,从而产生经转码的时间戳参数203。Figures 2a and 2b illustrate the use of the timestamp parameter included in the descriptor 142 of the metadata block 140. In Figure 2a, it is shown how the transcoder 103 may update the timestamp parameter 201 when transcoding metadata from the first bitstream 110 to the second bitstream 120. In the example shown, the timestamp parameter 201 represents the position of a particular sample 202 relative to the end of the associated content frame 111 (ie, relative to the most recent sample). As such, timestamp parameter 201 represents the "delay" of sample 202 relative to the most recent sample included in content frame 111. In the example shown in FIG. 2 , the content frame 121 of the second bitstream 120 has a different size from the content frame 111 of the first bitstream 110 . Specifically, the size of the content frame 121 of the second bitstream 120 is larger than that of the first bitstream 120 . The size of the content frame 111 of the bitstream 110. Therefore, the particular sample 202 may be located at a different relative position within the content frame 121 of the second bitstream 120 compared to the relative position within the content frame 111 of the first bitstream 110 . Specifically, a particular sample 202 may exhibit a different "latency" relative to the most recent sample included in the outbound content frame 121 than relative to the most recent sample included in the inbound content frame 111 . Therefore, when inserted into the metadata frame 122 of the second bitstream 120 , the timestamp parameter 201 included in the metadata frame 112 of the first bitstream 110 may need to be modified, thereby producing a transcoded timestamp parameter 203 .
图2b示出了在比特流110、120中移动元数据块140的可能性。这可以用于在转码之后使比特流120的比特率平滑。作为示例,元数据帧112中的特定块140的元数据可以与内容帧111中的特定样本202(由时间戳参数211表示)相关联。如上所述,可以相对于最后的、即最近的入站内容帧111的样本来表示特定样本202的位置。如果特定块140的元数据紧接着在包括样本202的内容帧121之后到达不是必须的(如可以由关联标记(在表2中称为“now_or_never”标记)来表示),则转码器可以将特定块140移动至在包括样本202的内容帧121之后的内容帧221的元数据帧222。转码器103可以更新时间戳参数213,以使得该时间戳参数213指向正确的样本202。Figure 2b shows the possibility of moving metadata blocks 140 in the bitstreams 110, 120. This can be used to smooth the bitrate of the bitstream 120 after transcoding. As an example, metadata for a particular block 140 in the metadata frame 112 may be associated with a particular sample 202 in the content frame 111 (represented by a timestamp parameter 211). As mentioned above, the location of a particular sample 202 may be expressed relative to the sample of the last, most recent inbound content frame 111 . If it is not necessary that the metadata for a particular chunk 140 arrives immediately after the content frame 121 including the sample 202 (as may be indicated by an association flag (referred to as the "now_or_never" flag in Table 2)), the transcoder may Specific block 140 moves to metadata frame 222 of content frame 221 following content frame 121 including sample 202 . The transcoder 103 may update the timestamp parameter 213 so that the timestamp parameter 213 points to the correct sample 202 .
具体地,时间戳参数213可以表示样本202相对于最后的、即包括时间戳参数213的出站元数据帧222与其相关联的最近的出站内容帧221的样本的位置。出于此目的,时间戳参数213可以具有超过内容帧221中包括的样本的数量的值。以类似的方式,时间戳参数213可以被配置成具有负值。这样的负值可以用来表示将来的内容帧中包括的样本202,即,在与包括时间戳参数213的元数据帧222相关联的内容帧221之后的内容帧中包括的样本202。通过这样做,可以在与元数据相关联(例如,元数据要被应用于)的一个或更多个样本之前传输该元数据。Specifically, the timestamp parameter 213 may represent the position of the sample 202 relative to the last, ie, the most recent outbound content frame 221 sample with which the outbound metadata frame 222 that includes the timestamp parameter 213 is associated. For this purpose, the timestamp parameter 213 may have a value that exceeds the number of samples included in the content frame 221 . In a similar manner, the timestamp parameter 213 may be configured to have a negative value. Such negative values may be used to represent samples 202 included in future content frames, ie, samples 202 included in content frames subsequent to the content frame 221 associated with the metadata frame 222 including the timestamp parameter 213 . By doing so, the metadata may be transmitted prior to one or more samples to which the metadata is associated (eg, to which the metadata is to be applied).
这样,时间戳参数211(可能与关联标记相结合)使得转码器103能够在随后的元数据帧222或之前的元数据帧222中传输与时间戳211相关联的元数据,并且使转码器103能够调整时间戳213以使得该时间戳213指代同一PCM样本202(即使在转码之后,样本202不包括在与包括特定块140的元数据帧222相关联的内容帧221中)。因此,为转码器103提供了一些灵活性,以使第二比特流120的比特率平滑。In this way, the timestamp parameter 211 (possibly combined with an associated flag) enables the transcoder 103 to transmit metadata associated with the timestamp 211 in a subsequent metadata frame 222 or a previous metadata frame 222, and enables transcoding The processor 103 is able to adjust the timestamp 213 so that the timestamp 213 refers to the same PCM sample 202 (even after transcoding, the sample 202 is not included in the content frame 221 associated with the metadata frame 222 that includes the particular block 140). Therefore, the transcoder 103 is provided with some flexibility to smooth the bitrate of the second bitstream 120.
应当注意,以与转码器103类似的方式,编码器101可以被配置成:将样本的元数据包括在随后的元数据帧中。这样,编码器101可以被配置成:生成指向内容帧121中包括的样本202的时间戳213,该内容帧121不是与包括时间戳213的元数据帧相关联的内容帧。It should be noted that in a similar manner to the transcoder 103, the encoder 101 may be configured to include the metadata of the sample in subsequent metadata frames. As such, the encoder 101 may be configured to generate a timestamp 213 pointing to a sample 202 included in a content frame 121 that is not a content frame associated with a metadata frame that includes the timestamp 213 .
图3a和图3b示出了由元数据块140的描述符142表示的去复制标记的可能的用例。在所示的情形中,与第一比特流110的内容帧111相比,第二比特流120的内容帧121代表较高数量的样本(即,具有较高的帧大小)。如果帧大小不同,则可能发生如下情况:第二比特流120的单个内容帧121包括来自第一比特流110的多于一个的内容帧111的样本。在这样的情况下,元数据块140可以从与第一比特流110的多于一个的内容帧111相关联的多于一个的元数据帧112获得。转码器103必须确定元数据块140中的哪些元数据块要被包括在第二比特流120的单个内容帧121的单个元数据帧122中。特定块140的去复制标记可以向转码器103指示:如果来自第一比特流110的多个元数据帧112的元数据块140要被合并,则无需将特定块140插入至第二比特流120的元数据帧122中。这样,转码器103可以被配置成:丢弃或忽略设置有去复制标记的附加元数据帧112的元数据块140。Figures 3a and 3b illustrate possible use cases for the deduplication flag represented by the descriptor 142 of the metadata block 140. In the scenario shown, the content frames 121 of the second bitstream 120 represent a higher number of samples (ie, have a higher frame size) than the content frames 111 of the first bitstream 110 . If the frame sizes are different, it may happen that a single content frame 121 of the second bitstream 120 includes samples from more than one content frame 111 of the first bitstream 110 . In such a case, the metadata chunk 140 may be obtained from more than one metadata frame 112 associated with more than one content frame 111 of the first bitstream 110 . The transcoder 103 must determine which of the metadata blocks 140 are to be included in the single metadata frame 122 of the single content frame 121 of the second bitstream 120 . The deduplication flag for a particular block 140 may indicate to the transcoder 103 that if the metadata blocks 140 from multiple metadata frames 112 of the first bitstream 110 are to be merged, the particular block 140 does not need to be inserted into the second bitstream 110 120 in metadata frame 122. As such, the transcoder 103 may be configured to discard or ignore the metadata chunks 140 of the additional metadata frame 112 that have the deduplication flag set.
这在图3a中示出了,其中出站内容帧121(即,出站比特流120的内容帧121)包括入站内容帧111和311(即,入站比特流110的内容帧111、311)的样本。转码器103必须确定:入站元数据帧112、312(即,入站比特流110的元数据帧112、312)的块140中的哪些块要被包括在与出站内容帧121相关联的出站元数据帧122(即,出站比特流120的元数据帧122)中。在图3a所示的示例中,假定至少针对入站元数据帧312的一个或更多个块140来设置去复制标记。这样,转码器103可以被配置成丢弃入站元数据帧312的块140。This is illustrated in Figure 3a, where outbound content frame 121 (i.e., content frame 121 of outbound bitstream 120) includes inbound content frames 111 and 311 (i.e., content frames 111, 311 of inbound bitstream 110 ) sample. The transcoder 103 must determine which of the blocks 140 of the inbound metadata frames 112 , 312 (i.e., the metadata frames 112 , 312 of the inbound bitstream 110 ) are to be included in association with the outbound content frame 121 in the outbound metadata frame 122 (i.e., the metadata frame 122 of the outbound bitstream 120). In the example shown in Figure 3a, it is assumed that a deduplication flag is set for at least one or more blocks 140 of the inbound metadata frame 312. As such, transcoder 103 may be configured to discard block 140 of inbound metadata frame 312 .
应当注意,也可以设置入站元数据帧112的一个或更多个块140的去复制标记。转码器103可以被配置成:仅丢弃用于创建出站元数据帧122的第二(或更多)元数据帧312的块140。换言之,转码器103可以被配置成:只有当要考虑多于一个的入站元数据帧112用于生成出站元数据帧122时,才考虑去复制标记。这样,可以使用去复制标记来阻止特定类型的元数据块140的“复制”,同时仍然确保包括该特定类型的至少一个元数据块140。It should be noted that a deduplication flag may also be set for one or more blocks 140 of the inbound metadata frame 112 . The transcoder 103 may be configured to discard only the blocks 140 used to create the second (or more) metadata frames 312 of the outbound metadata frame 122 . In other words, the transcoder 103 may be configured to consider the deduplication mark only when more than one inbound metadata frame 112 is to be considered for generating the outbound metadata frame 122 . In this way, deduplication flags can be used to prevent "duplication" of metadata blocks 140 of a particular type while still ensuring that at least one metadata block 140 of that particular type is included.
图3b示出了未设置去复制标记的示例情况。在这种情况下,转码器103可以被配置成:考虑多个入站元数据帧112和312的块140用于创建出站元数据帧122。具体地,转码器103可以被配置成:如果未设置去复制标记(即使在根据多个入站元数据帧112、312生成出站元数据帧122的情况下),则将来自入站元数据帧312的块140插入至出站元数据帧122中。Figure 3b shows an example case where the deduplication flag is not set. In this case, transcoder 103 may be configured to consider block 140 of multiple inbound metadata frames 112 and 312 for creating outbound metadata frame 122 . Specifically, the transcoder 103 may be configured to: if the deduplication flag is not set (even where the outbound metadata frame 122 is generated from multiple inbound metadata frames 112, 312), then the Blocks 140 of data frame 312 are inserted into outbound metadata frame 122 .
可以例如使用去复制标记来识别被插入至多个连续的元数据帧112、312(例如,插入至比特流110的每个元数据帧112、312)中的元数据块140。这样,去复制标记使得转码器103能够容易地识别可以被丢弃的元数据块140(无需分析存储在元数据块140的数据字段143中的元数据)。因此,降低了对元数据转码的计算复杂度。另一方面,未设置的去复制标记表示:不应该丢弃相对应的元数据块140。这可以用于辅助数据,以确保即使多个入站元数据帧112、312被转码成单个出站元数据帧122,也不会丢弃辅助数据。A metadata block 140 inserted into a plurality of consecutive metadata frames 112, 312 (eg, inserted into each metadata frame 112, 312 of the bitstream 110) may be identified, for example, using a deduplication mark. In this way, the deduplication mark enables the transcoder 103 to easily identify metadata chunks 140 that can be discarded (without having to analyze the metadata stored in the data field 143 of the metadata chunk 140). Therefore, the computational complexity of transcoding metadata is reduced. On the other hand, an unset deduplication flag indicates that the corresponding metadata block 140 should not be discarded. This can be used for ancillary data to ensure that even if multiple inbound metadata frames 112, 312 are transcoded into a single outbound metadata frame 122, the ancillary data is not discarded.
图4a和图4b示出了在元数据块140的描述符142中表示的复制标记的示例使用。在所示的情形中,与出站内容帧121相比,入站内容帧111包括较高数量的样本(即,具有较大的帧大小)。如果帧大小不同,则可能发生下述情况:单个入站内容帧111的样本被包括在多于一个的出站内容帧121、321中。因此,转码器103接收单个入站元数据帧112,并且必须确定:在多个出站元数据帧122、322中的哪个出站元数据帧中放置元数据的特定块140。可以使用复制标记向转码器130指示是否要复制来自入站元数据帧112的特定块140。如图4a所示,通过设置复制标记,可以表示:块140中包括的元数据应该被包括在每个出站元数据帧122、322中。另一方面,未设置的复制标记表示:应当仅传输一次元数据块140。这样,转码器103将来自入站元数据帧112的块140仅插入至多个出站元数据帧122、322中的单个出站元数据帧中(如图4b所示)。Figures 4a and 4b illustrate an example use of a copy flag represented in descriptor 142 of metadata block 140. In the scenario shown, inbound content frame 111 includes a higher number of samples (ie, has a larger frame size) than outbound content frame 121 . If the frame sizes are different, it may happen that a sample of a single inbound content frame 111 is included in more than one outbound content frame 121, 321. Therefore, the transcoder 103 receives a single inbound metadata frame 112 and must determine in which of a plurality of outbound metadata frames 122, 322 to place a particular block of metadata 140. A copy flag may be used to indicate to the transcoder 130 whether a particular block 140 from the inbound metadata frame 112 is to be copied. As shown in Figure 4a, by setting the copy flag, it can be indicated that the metadata included in block 140 should be included in each outbound metadata frame 122, 322. On the other hand, an unset replication flag indicates that the metadata block 140 should be transferred only once. In this way, the transcoder 103 inserts the block 140 from the inbound metadata frame 112 into only a single outbound metadata frame of the plurality of outbound metadata frames 122, 322 (as shown in Figure 4b).
如上所述,元数据块140的描述符142可以表示关联标记(在表2中称为“now_or_never”标记)。关联标记可以表示:可以延迟块140中包括的元数据,而不影响相关联的内容帧中包括的内容。这样,如果使元数据延迟任意数量的时间是元数据的一个属性,则描述符142的语法可以使得转码器103能够使元数据延迟任意数量的时间。这可以通过将标记now_or_never设置成0来表示。例如当隐含的音频编解码器能够“负担得起”元数据的传输时,例如当内容帧包括无声时,关联标记使得转码器103能够对块140中包括的元数据进行传输。可以被延迟的元数据的一个示例是无需连同特定的内容帧121一起被传输的辅助数据或二进制数据,如固件升级。As mentioned above, descriptor 142 of metadata block 140 may represent an association tag (referred to as the "now_or_never" tag in Table 2). The association flag may indicate that the metadata included in block 140 may be delayed without affecting the content included in the associated content frame. Thus, if delaying metadata by an arbitrary amount of time is a property of the metadata, the syntax of the descriptor 142 may enable the transcoder 103 to delay the metadata by an arbitrary amount of time. This can be indicated by setting the flag now_or_never to 0. The association flag enables the transcoder 103 to transmit the metadata included in block 140, for example when the underlying audio codec can "afford" the transmission of the metadata, for example when the content frame includes silence. One example of metadata that may be delayed is ancillary data or binary data that does not need to be transmitted along with a specific content frame 121, such as a firmware upgrade.
如表2的上下文中描述的,元数据块140的描述符142可以表示或可以包括优先级属性或优先级参数。优先级参数可以表示特定块140的元数据的相对重要性(例如,相对于其他块140的重要性)。转码器103可以确定:仅对一定数量的元数据块140进行转码并且丢弃元数据帧112中的所有其他元数据块。当从较高比特率的入站比特流110转码成较低比特率的出站比特流120时,例如可以要求上述操作。优先级参数可以使得转码器103能够选择具有相对最高的优先级的入站元数据帧112的那些块140并且丢弃(或延迟)具有相对较低优先级的那些块140。As described in the context of Table 2, the descriptor 142 of the metadata block 140 may represent or may include a priority attribute or priority parameter. The priority parameter may represent the relative importance of metadata for a particular block 140 (eg, relative to the importance of other blocks 140). Transcoder 103 may determine to transcode only a certain number of metadata chunks 140 and discard all other metadata chunks in metadata frame 112 . This may be required, for example, when transcoding from a higher bitrate inbound bitstream 110 to a lower bitrate outbound bitstream 120. The priority parameter may enable the transcoder 103 to select those chunks 140 of the inbound metadata frame 112 that have a relatively highest priority and to discard (or delay) those chunks 140 that have a relatively lower priority.
应用和/或编码器101可以提供同一元数据帧112中的多组元数据,每组元数据具有不同的优先级。多组元数据可以与不同质量的元数据相关联。较高质量元数据的优先级可以低于较低质量元数据的优先级。这样,转码器103可以被配置成:通过考虑优先级参数来降低元数据的质量。作为示例,如果以如下方式设置优先级:该方式使得可伸缩性是可能的,即,如果对同一应用的较高优先级的所有组元数据进行传输则可以应用每组元数据,则转码器可以适度地降低元数据的质量而不必知道元数据的意义。具体地,多组元数据可以包括增量元数据,即,每组元数据可以向具有下一最高优先级的一组元数据增加一些质量。然后,可以通过对所有组元数据(从最高优先级降至最低优先级)进行组合来提供最高质量元数据。这样,入站元数据帧112可以包括多个增量元数据块140,其中,具有最高优先级的元数据块140包括具有最小可接受质量的版本的元数据,并且其中,具有连续较低优先级的块140包括使得能够增量式提高元数据的质量的增量版本的元数据。这样,转码器103可以通过考虑增量元数据的多个块140的优先级参数来确定第二比特流120中包括的元数据的质量。The application and/or encoder 101 may provide multiple sets of metadata in the same metadata frame 112, each set of metadata having a different priority. Multiple sets of metadata can be associated with different qualities of metadata. Higher quality metadata can have a lower priority than lower quality metadata. In this way, the transcoder 103 may be configured to reduce the quality of metadata by taking into account priority parameters. As an example, if the priorities are set in such a way that scalability is possible, i.e. each set of metadata can be applied if all sets of metadata of higher priority for the same application are transmitted, then transcoding The processor can moderately degrade the quality of metadata without knowing the meaning of the metadata. Specifically, multiple sets of metadata may include incremental metadata, ie, each set of metadata may add some quality to the set of metadata with the next highest priority. The highest quality metadata can then be provided by combining all group metadata from highest priority down to lowest priority. As such, the inbound metadata frame 112 may include a plurality of incremental metadata chunks 140 , where the metadata chunk 140 with the highest priority includes a version of the metadata with the smallest acceptable quality, and where the metadata chunk 140 with successively lower priority Blocks 140 of levels include incremental versions of metadata that enable incremental improvements in the quality of the metadata. In this way, the transcoder 103 may determine the quality of the metadata included in the second bitstream 120 by considering the priority parameters of the plurality of blocks 140 of incremental metadata.
如表1中所示的元数据帧112的示例语法中所示的,元数据帧130可以包括保护字段。保护字段可以用于使得解码器104能够验证元数据帧130的内容和/或相关联的内容帧的内容是否已经被修改并且可能因此是无效的。换言之,保护字段可以使得解码器104能够验证元数据帧130和/或相关联的内容帧中包括的元数据是否是可信赖的。表4示出了元数据帧130的保护字段的示例语法。保护字段可以包括在元数据帧130的头部131中。As shown in the example syntax of metadata frame 112 shown in Table 1, metadata frame 130 may include a protection field. The protection field may be used to enable the decoder 104 to verify whether the contents of the metadata frame 130 and/or the contents of the associated content frame have been modified and may therefore be invalid. In other words, the protection field may enable the decoder 104 to verify whether the metadata included in the metadata frame 130 and/or the associated content frame is trustworthy. Table 4 shows an example syntax for the protection field of metadata frame 130. The protection field may be included in the header 131 of the metadata frame 130.
表4Table 4
保护字段的语义可以如下:The semantics of protected fields can be as follows:
●protection_bits_frame可以包括当前帧(包括内容帧和/或相关联的元数据帧)的截短保护有效载荷。• protection_bits_frame may include the truncation protection payload of the current frame (including the content frame and/or associated metadata frame).
●protection_bits_history可以包括当前帧的截短保护有效载荷和当前帧之前的帧(包括内容帧和/或相关联的元数据帧)的截短保护有效载荷。在WO2011/015369中描述了用于使帧序列安全的示例方案,其内容通过引用合并到本文中。• protection_bits_history may include a truncation protection payload for the current frame and a truncation protection payload for frames preceding the current frame (including content frames and/or associated metadata frames). An example scheme for making frame sequences secure is described in WO2011/015369, the content of which is incorporated herein by reference.
这样,保护字段可以包括一个或更多个加密值。可以基于当前元数据帧(包括保护字段)中包括的元数据和/或基于与当前元数据帧相关联的内容帧来生成一个加密值。这样,可以保证不修改隔离的元数据帧和/或相关联的内容帧。可以基于当前元数据帧和一个或更多个之前的元数据帧(以及基于相应的相关联的内容帧)中包括的元数据来生成另一个加密值。这样,可以确保内容帧序列和/或元数据帧序列不被修改。As such, the protection field may include one or more encrypted values. An encrypted value may be generated based on metadata included in the current metadata frame (including protection fields) and/or based on the content frame associated with the current metadata frame. This way, it is guaranteed that isolated metadata frames and/or associated content frames are not modified. Another encrypted value may be generated based on metadata included in the current metadata frame and one or more previous metadata frames (and based on corresponding associated content frames). In this way, it is ensured that the sequence of content frames and/or the sequence of metadata frames are not modified.
通过将单向函数应用于一组一个或更多个元数据帧112、312和/或相关联的内容帧111、311,可以在编码器101处确定加密值。具体地,可以使用秘钥值和加密散列函数(所谓的单向函数)来生成加密值。具体地,可以通过计算一个或更多个元数据帧112、312中包括的数据的和一个或更多个相关联的内容帧111、311中包括的数据的HMAC-MD5(散列消息认证码)值生成加密值。此外,加密值的生成可以包括对HMAC-MD5值的截短,例如,将HMAC-MD5值截短成16、24、32、48、64或128位。截短可以有益于降低包括元数据帧112、312的编码比特流110中加密值所需的开销。应当注意,可以使用其他散列函数如SHA-1或SHA-256来替代MD5。此外,应当注意,例如在不要求保护元数据的情况下,编码器101可以被配置成发送0位的加密值,即,不发送加密值。The encryption value may be determined at the encoder 101 by applying a one-way function to a set of one or more metadata frames 112, 312 and/or associated content frames 111, 311. In particular, a secret key value and a cryptographic hash function (so-called one-way function) can be used to generate the encrypted value. Specifically, the HMAC-MD5 (Hash Message Authentication Code) of the data included in one or more metadata frames 112, 312 and the data included in one or more associated content frames 111, 311 can be calculated. ) value generates an encrypted value. Additionally, generation of the encrypted value may include truncation of the HMAC-MD5 value, for example, to 16, 24, 32, 48, 64, or 128 bits. Truncation may be beneficial in reducing the overhead required to encrypt values in the encoded bitstream 110 including the metadata frames 112, 312. It should be noted that other hash functions such as SHA-1 or SHA-256 can be used instead of MD5. Furthermore, it should be noted that the encoder 101 may be configured to send an encrypted value of 0 bits, ie, no encrypted value is sent, for example where metadata protection is not required.
更详细地,可以通过使用加密散列函数H(.)和“秘密的”秘钥K(也称为安全秘钥)确定一个或更多个内容帧111、311的加密值和一个或更多个元数据帧112、312的加密值。该“秘密的”秘钥K通常在右边用额外的0填充至散列函数H(.)的块大小,以确定一个或更多个内容帧111、311的和一个或更多个元数据帧112、312的散列消息认证码(HMAC)。令||符号表示连接(concatenation),令⊕符号表示异或,并且令外部填充opad=0x5c5c5c…5c5c以及令内部填充ipad=0x363636…3636是散列函数H(.)的块大小的长度的常量,则一个或更多个内容帧111、311的和一个或更多个元数据帧112、312的HMAC值可以写作In more detail, the encrypted value of one or more content frames 111, 311 and one or more The encrypted value of each metadata frame 112, 312. The "secret" key K is typically padded on the right with additional zeros to the block size of the hash function H(.) to determine one or more content frames 111, 311 and one or more metadata frames Hash Message Authentication Code (HMAC) of 112, 312. Let the || symbol represent concatenation, let the ⊕ symbol represent exclusive OR, and let the outer padding opad=0x5c5c5c...5c5c and the inner padding ipad=0x363636...3636 be constants for the length of the block size of the hash function H(.) , then the HMAC values of one or more content frames 111, 311 and one or more metadata frames 112, 312 can be written as
其中,m是一个或更多个内容帧111、311的和一个或更多个元数据帧112、312的组合比特序列。MD5或SHA-1或SHA-256散列函数使用的块大小通常是512位。HMAC运算的输出的大小与隐含的散列函数的大小相同,即,在MD5的情况下为128位或在SHA-1的情况下为160位。Where, m is a combined bit sequence of one or more content frames 111, 311 and one or more metadata frames 112, 312. The block size used by MD5 or SHA-1 or SHA-256 hash functions is usually 512 bits. The size of the output of the HMAC operation is the same as the size of the underlying hash function, i.e. 128 bits in the case of MD5 or 160 bits in the case of SHA-1.
这样,保护字段可以包括至少两个加密值In this way, the protected field can include at least two encrypted values
●帧加密值(在表4中称为“protection_bits_frame”),该帧加密值表示各个内容帧111及其相关联的元数据帧112的可靠性。帧加密值可以用来识别各个内容帧111的数据及其相关联的元数据帧112是否已经被改变。可以使用包括各个内容帧111的比特次序及其相关联的元数据帧112(或者各个内容帧111中包括的有效载荷及其相关联的元数据帧112)的比特序列的消息m来确定帧加密值。● Frame encryption value (referred to as "protection_bits_frame" in Table 4) that represents the reliability of each content frame 111 and its associated metadata frame 112. The frame encryption value can be used to identify whether the data of each content frame 111 and its associated metadata frame 112 have been changed. Frame encryption may be determined using a message m that includes the bit order of each content frame 111 and its associated metadata frame 112 (or the bit sequence of the payload included in each content frame 111 and its associated metadata frame 112 ). value.
●历史加密值(在表4中称为“protection_bits_history”),该历史加密值表示至少两个内容帧111、311及其相关联的至少两个元数据帧112、312的序列的可靠性。历史加密值可以用来识别至少两个内容帧111、311及其相关联的元数据帧112、312的序列是否已经被改变。可以使用包括至少两个内容帧111、311及其相关联的至少两个元数据帧112、312(或者至少两个内容帧111、311及其相关联的至少两个元数据帧112、312中包括的有效载荷)的比特序列的消息m来确定历史加密值。• A historical encryption value (called "protection_bits_history" in Table 4) that represents the reliability of the sequence of at least two content frames 111, 311 and their associated at least two metadata frames 112, 312. The historical encryption value may be used to identify whether the sequence of at least two content frames 111, 311 and their associated metadata frames 112, 312 has been changed. A frame including at least two content frames 111, 311 and their associated at least two metadata frames 112, 312 (or at least two content frames 111, 311 and their associated at least two metadata frames 112, 312) may be used. m including the bit sequence of the payload) to determine the historical encryption value.
如上所述,使用安全秘钥K来确定加密值,通常仅编码器101和解码器104知道安全秘钥K。在本文献中,提出了:通过允许使用提供不同的信任等级的不同的安全秘钥K来实现多级信任。作为示例,可以提供至少两级可信秘钥As mentioned above, the encryption value is determined using a security key K, which is generally known only to the encoder 101 and the decoder 104. In this document, it is proposed to achieve multi-level trust by allowing the use of different security keys K providing different levels of trust. As an example, at least two levels of trusted keys can be provided
●高度安全秘钥K1,可以不对沿分发链100提供组件101、103、104的实体的以外的任何一方公开高度安全秘钥K1。这样的实体可以是沿分发链100使用的编解码器系统的提供者(例如,杜比实验室)。具体地,这样的实体可以是沿分发链100使用的编码器和解码器的提供者。通过保持高度安全秘钥不被公开,可以确保渲染接收的比特流120中包括的音频信号的解码器104可以确信:接收的比特流120的元数据帧122、322中包括的元数据是可信的并且尚未沿分发链100以未经授权的方式被修改。● The high security key K 1 may not be disclosed to any party other than the entity providing components 101, 103, 104 along the distribution chain 100. Such an entity may be the provider of the codec system used along the distribution chain 100 (eg, Dolby Laboratories). In particular, such entities may be providers of encoders and decoders used along the distribution chain 100 . By keeping the high security key from being disclosed, it is ensured that the decoder 104 rendering the audio signal included in the received bitstream 120 can be confident that the metadata included in the metadata frames 122, 322 of the received bitstream 120 is authentic. and has not been modified in an unauthorized manner along the distribution chain 100.
●中等安全秘钥K2,可以向其他方例如沿分发链100的组件101、103、104的一些组件进行操作的多方(例如,编解码器系统的提供者的被许可人)公开该中等安全秘钥K2。如果解码器104接收已经使用中等安全秘钥K2对其进行保护的比特流120,则解码器104知道比特流120包括下述元数据(在元数据帧122、322中),该元数据是已经根据分发链100的操作者的可以与编解码器系统的提供者(持有高度安全秘钥K1)的策略不同的一些策略而被处理的元数据。• A medium security key K 2 that may be disclosed to other parties such as parties operating along some of the components 101 , 103 , 104 of the distribution chain 100 (eg licensees of the provider of the codec system) Secret key K 2 . If the decoder 104 receives a bitstream 120 that has been protected using a medium security key K2 , the decoder 104 knows that the bitstream 120 includes metadata (in metadata frames 122, 322) that is Metadata that has been processed according to some policy of the operator of the distribution chain 100 which may be different from the policy of the provider of the codec system (who holds the high security key K 1 ).
可以在元数据帧130(例如,在元数据帧130的头部131)中提供对由编码器101使用的安全秘钥K的表示。示出了key_id参数的表1中示出了上述情况。如表4所示,key_id参数可以包括预定数量的安全秘钥的索引,从而使得解码器104能够确定安全秘钥K,该安全秘钥K用于确定一个或更多个加密值,其中,一个或更多个加密值可以包括在元数据帧130的protection()字段中。然后,解码器104可以使用所识别的安全秘钥以与相对应的编码器101执行的方式相同的方式来确定一个或更多个加密值。由解码器104确定的加密值可以称为验证加密值。然后,将验证加密值与存储在元数据帧103中的加密值进行比较。在匹配的情况下,确认各个帧和/或帧序列尚未被修改。另一方面,在不匹配的情况下,确认各个帧和/或该帧序列已经被修改。A representation of the security key K used by the encoder 101 may be provided in the metadata frame 130 (eg, in the header 131 of the metadata frame 130). The above situation is shown in Table 1 showing the key_id parameter. As shown in Table 4, the key_id parameter may include an index of a predetermined number of security keys, thereby enabling the decoder 104 to determine the security key K, which is used to determine one or more encrypted values, where one One or more encrypted values may be included in the protection() field of metadata frame 130. The decoder 104 may then use the identified security key to determine one or more encrypted values in the same manner as the corresponding encoder 101 performs. The encrypted value determined by decoder 104 may be referred to as the verified encrypted value. The verification encrypted value is then compared to the encrypted value stored in the metadata frame 103. In the case of a match, it is confirmed that the individual frames and/or sequence of frames have not been modified. On the other hand, in the case of a mismatch, it is confirmed that the individual frames and/or the sequence of frames have been modified.
替代地或者除了在元数据帧130中提供对安全秘钥的表示以外,解码器104可以被配置成:使用解码器104已知的多个预定的安全秘钥来确定多组验证加密值。如果所述多组验证加密值中的一组验证加密值与元数据帧130中包括的加密值匹配,则解码器104知道:已经使用了哪个安全秘钥并且各个帧和/或该帧序列尚未被修改。另一方面,所有组的验证加密值都不匹配表示各个帧和/或该帧序列已被修改。Alternatively or in addition to providing a representation of the security key in the metadata frame 130, the decoder 104 may be configured to determine a plurality of sets of verification encryption values using a plurality of predetermined security keys known to the decoder 104. If one of the plurality of sets of verified encrypted values matches an encrypted value included in the metadata frame 130, the decoder 104 knows which security key has been used and that the respective frame and/or sequence of frames has not modified. On the other hand, non-matching of all sets of verified encryption values indicates that the individual frames and/or the sequence of frames have been modified.
能够检测到使用了哪个秘钥来使解码器104和转码器103中的比特流110、120安全使应用能够对要采用不同可信度的数据做什么进行更细粒度的决定。决定可能取决于检测到的安全秘钥而不同。具体地,可能检测到高度安全秘钥,可能检测到中等安全秘钥,或者可能检测不到有效秘钥从而安全检查不通过。Being able to detect which key was used to secure the bitstreams 110, 120 in the decoder 104 and transcoder 103 enables applications to make more fine-grained decisions about what to do with data of varying degrees of confidence. The decision may vary depending on the security key detected. Specifically, a high security key may be detected, a medium security key may be detected, or a valid key may not be detected and the security check fails.
这样,与仅使用单个安全秘钥的仅能对数据是否可以被信任作出二元决策的解决方案相比,当使用多个不同的安全秘钥(附属于不同的信任等级)时,可以提供各个等级的可信度。In this way, when using multiple different security keys (attached to different trust levels), it is possible to provide individual level of credibility.
如图1的上下文中所述,音频内容的分发链100可以包括转码器103,其被配置成将入站比特流110转换成出站比特流120。由转码器103执行的转码可以与从第一音频编解码器系统转码至可能不同的第二音频编解码器系统有关。替代地或附加地,该转码可以与出站比特流120的比特率相对于入站比特流110的比特率的变化有关。转码器103可以包括用于将入站比特流110解码成PCM(脉冲编码调制)音频信号的解码器。此外,转码器103可以包括用于将PCM音频信号编码成出站比特流120的编码器。这样的转码器103可以称为“PCM连接”转码器,原因是一个或更多个解码器(用于对一个或更多个入站比特流110进行解码)经由线性PCM连接至一个或更多个编码器(用于对一个或更多个出站比特流120进行编码)。As described in the context of FIG. 1 , a distribution chain 100 for audio content may include a transcoder 103 configured to convert an inbound bitstream 110 into an outbound bitstream 120 . The transcoding performed by the transcoder 103 may relate to transcoding from a first audio codec system to a second, possibly different, audio codec system. Alternatively or additionally, the transcoding may be related to changes in the bitrate of the outbound bitstream 120 relative to the bitrate of the inbound bitstream 110 . The transcoder 103 may include a decoder for decoding the inbound bit stream 110 into a PCM (Pulse Code Modulation) audio signal. Additionally, the transcoder 103 may include an encoder for encoding the PCM audio signal into an outbound bitstream 120 . Such a transcoder 103 may be referred to as a "PCM connected" transcoder since one or more decoders (used to decode one or more inbound bitstreams 110) are connected via linear PCM to one or more More encoders (for encoding one or more outbound bitstreams 120).
转码器103可以为作为由专业的内容提供者如广播公司使用的装置的所谓的专业转码器。如上所述,转码器103可以被配置成:按照第一格式(例如,杜比E)接受入站比特流110,并且将入站比特流110转码成不同的格式(例如,杜比数字+)。这样的转码器103通常包括一个或更多个解码器(用于对入站比特流110进行解码)和一个或更多个编码器(用于对出站比特流120进行编码)。The transcoder 103 may be a so-called professional transcoder, which is a device used by professional content providers such as broadcasters. As described above, the transcoder 103 may be configured to accept the inbound bitstream 110 in a first format (e.g., Dolby E) and transcode the inbound bitstream 110 to a different format (e.g., Dolby Digital +). Such a transcoder 103 typically includes one or more decoders (for decoding the inbound bitstream 110) and one or more encoders (for encoding the outbound bitstream 120).
PCM连接转码器可以具有在解码器和编码器之间的一个或更多个PCM处理级。响度调平是这样的PCM处理的一个示例。PCM处理的其他示例包括采样率转换、通道下混合和/或通道上混合。A PCM connected transcoder may have one or more PCM processing stages between the decoder and encoder. Loudness leveling is an example of such PCM processing. Other examples of PCM processing include sample rate conversion, channel downmixing, and/or channel upmixing.
这样的PCM连接转码器103引起关于上述可靠性、保护和信任问题的挑战。如上所述,入站比特流110可以包括使用一个或更多个加密值(如表1和表4所示,包括在例如元数据帧112、312的保护字段中)保护的元数据帧112、312。PCM连接转码器103使得用户能够对从内容帧111、311获得的PCM数据进行修改,从而可能使相关联的元数据帧112、312中包括的元数据无效,并且从而可能危及元数据的可信度。Such a PCM connected transcoder 103 raises challenges regarding the reliability, protection and trust issues mentioned above. As described above, the inbound bitstream 110 may include metadata frames 112, 112, 112, 110, 110, 110, 110, 110, 100, 100, 100, 100, 100, 100, 100, etc., protected using one or more encryption values (as shown in Tables 1 and 4, included in, for example, the protection fields of the metadata frames 112, 312). 312. The PCM connection transcoder 103 enables a user to make modifications to the PCM data obtained from the content frames 111, 311, thereby potentially invalidating the metadata included in the associated metadata frames 112, 312, and thus possibly compromising the availability of the metadata. reliability.
在本文献中,描述了用于确保转码器103中的元数据的可信度的方法和系统。具体地,描述的方法和系统使得即使当使用PCM连接转码器103时,也能够保持元数据帧112、312中包括的元数据的可信度。In this document, methods and systems for ensuring the trustworthiness of metadata in the transcoder 103 are described. In particular, the described method and system enable the authenticity of the metadata included in the metadata frames 112, 312 to be maintained even when the transcoder 103 is connected using PCM.
图5a至图5d分别示出了示例PCM连接转码器503、513、523、533。转码器包括解码器504,解码器504被配置成:将入站比特流110(其包括内容帧111序列和相关联的元数据帧112序列)分别转换成PCM数据和元数据。解码器504可以被配置成:使用上述保护方案来验证入站比特流110的正确性。出于该目的,解码器504可以知道到一些或所有预定的安全秘钥。Figures 5a to 5d illustrate example PCM connection transcoders 503, 513, 523, 533 respectively. The transcoder includes a decoder 504 configured to convert the inbound bitstream 110 (which includes a sequence of content frames 111 and an associated sequence of metadata frames 112) into PCM data and metadata respectively. Decoder 504 may be configured to verify the correctness of inbound bitstream 110 using the protection scheme described above. For this purpose, the decoder 504 may be aware of some or all predetermined security keys.
通常,解码器504提供一组未受保护的PCM数据和元数据(例如,以逐帧为基础)。换言之,解码器504通常对每个内容帧111和相关联的元数据帧112进行解码,并且不加保护地提供相应的一组PCM数据和元数据。这样,解码器504提供来自相对应的内容帧111和元数据帧112的序列的多组PCM数据和元数据的序列。多组PCM数据和元数据的序列可以被转码器修改并且然后可以被传送至如下编码器501:该编码器501被配置成将(可能经修改的)多组PCM数据和元数据的序列转换成出站比特流120。在该背景下,编码器501通常不能验证(可能经修改的)多组PCM数据和元数据的序列是否已经以可觉察的方式被修改。换言之,编码器501可能无法验证(可能经修改的)多组PCM数据和元数据的序列的可信度。Typically, the decoder 504 provides an unprotected set of PCM data and metadata (eg, on a frame-by-frame basis). In other words, the decoder 504 typically decodes each content frame 111 and associated metadata frame 112 and provides a corresponding set of PCM data and metadata without protection. In this manner, the decoder 504 provides sequences of sets of PCM data and metadata from corresponding sequences of content frames 111 and metadata frames 112 . The sequence of sets of PCM data and metadata may be modified by the transcoder and may then be passed to an encoder 501 configured to convert the (possibly modified) sequence of sets of PCM data and metadata into outbound bitstream 120. In this context, the encoder 501 generally cannot verify whether the sequence of (possibly modified) sets of PCM data and metadata has been modified in a perceptible manner. In other words, the encoder 501 may not be able to verify the plausibility of the (possibly modified) sequences of sets of PCM data and metadata.
在本文献中,提出了使解码器504能够基于一组或更多组PCM数据和元数据来提供一个或更多个签名值,从而使得能够保护解码器504与编码器501之间的PCM连接。如上所述,可以以与加密值类似的方式来确定签名值。然而,签名值也可以使用包括一组或更多组PCM数据和元数据(与一个或更多个内容帧和相关联的元数据帧形成对比)的消息m。具体地,解码器504可以被配置成In this document, it is proposed to enable the decoder 504 to provide one or more signature values based on one or more sets of PCM data and metadata, thereby enabling the protection of the PCM connection between the decoder 504 and the encoder 501 . As mentioned above, the signature value can be determined in a similar manner to the encrypted value. However, the signature value may also use a message m that includes one or more sets of PCM data and metadata (in contrast to one or more content frames and associated metadata frames). Specifically, decoder 504 may be configured to
●基于各组PCM数据和相关联的元数据来确定帧签名值;以及● Determine frame signature values based on each set of PCM data and associated metadata; and
●基于两组或更多组连续的PCM数据和相关联的元数据来确定历史签名值。● Determine historical signature values based on two or more consecutive sets of PCM data and associated metadata.
换言之,在PCM连接转码器503的PCM域内(即,解码器504和编码器501之间),可以使用一个或更多个签名(也称为签名值)来“保护”内容的可信度。解码器504可以被配置成产生一个或更多个签名值作为输出。当由解码器504产生一个或更多个签名值时,可以根据PCM数据和常规元数据(从内容帧获得)与附加元数据(从相关联的元数据帧获得)的联合来计算一个或更多个签名值。这样,对于入站比特流110的每个帧,可以基于经解码的各组PCM数据和元数据来确定一个或更多个签名值。相对应的编码器501可以使用这些一个或更多个签名值来验证所接收的一组PCM数据和元数据是否已经被修改和/或是否可信赖。In other words, within the PCM domain of the PCM connection transcoder 503 (i.e., between the decoder 504 and the encoder 501), one or more signatures (also called signature values) can be used to "protect" the credibility of the content . Decoder 504 may be configured to produce one or more signature values as output. When one or more signature values are generated by the decoder 504, one or more signature values may be calculated based on the union of the PCM data and regular metadata (obtained from the content frame) with additional metadata (obtained from the associated metadata frame). Multiple signature values. In this manner, for each frame of the inbound bitstream 110, one or more signature values may be determined based on the decoded sets of PCM data and metadata. The corresponding encoder 501 may use these one or more signature values to verify whether the received set of PCM data and metadata has been modified and/or is trustworthy.
编码器501接受一个或更多个签名值连同PCM数据、常规元数据和附加元数据作为输入。然后,编码器501可以针对其他输入(即,针对所接收的一组或多组PCM数据和元数据)来检查签名值。如果其他输入已经被修改/篡改,则签名检查将失败并且编码器将采取适当的行动。可以通过以下操作在编码器501处执行对一个或更多个签名值的验证:基于接收的一组或更多组PCM数据和元数据(以与针对加密值所描述的类似的方式)来确定验证签名值。Encoder 501 accepts as input one or more signature values along with PCM data, regular metadata and additional metadata. Encoder 501 may then check the signature value against other inputs (ie, against the received set or sets of PCM data and metadata). If other inputs have been modified/tampered with, the signature check will fail and the encoder will take appropriate action. Verification of one or more signature values may be performed at encoder 501 by determining based on one or more sets of PCM data and metadata received (in a manner similar to that described for encrypted values) Verify signature value.
这样,通过以下操作可以在PCM连接转码器503中保持经解码的PCM数据(和相关联的元数据)的可信度:使得解码器504能够基于经解码的PCM数据和相关联的元数据确定一个或更多个签名值,并且使得相对应的编码器501能够基于所述一个或更多个签名值验证要被编码的PCM数据(和相关联的元数据)的可靠性。如上所述,可以基于单个或基于多个调平的安全秘钥K1和K2来执行对一个或更多个签名值的确定及其验证,其中,一个或更多个安全秘钥可以仅为解码器504和编码器501所知,并且通常不为对解码器504和编码器501之间的连接执行PCM处理的实体所知。In this way, the credibility of the decoded PCM data (and associated metadata) can be maintained in the PCM connection transcoder 503 by enabling the decoder 504 to base the decoded PCM data and associated metadata on One or more signature values are determined, and the corresponding encoder 501 is enabled to verify the authenticity of the PCM data (and associated metadata) to be encoded based on the one or more signature values. As mentioned above, the determination of one or more signature values and their verification can be performed based on a single or based on multiple leveled security keys K 1 and K 2 , wherein the one or more security keys can only Known to the decoder 504 and the encoder 501 and generally unknown to the entity performing PCM processing on the connection between the decoder 504 and the encoder 501 .
如图5a至图5d所示,对一个或更多个签名值的使用使得能够实现各种使用情形。图5a示出了转码器503,其中,在解码器504和编码器503之间不执行PCM处理。因此,受保护的数据510(包括一组或更多组PCM数据和相关联的元数据以及一个或更多个相关联的签名)未被修改并且在转码器503中保持信任链。因此,图5a的转码器503被配置成:接收包括受保护的且信任的入站内容帧111和相关联的入站元数据帧112(也称为演化帧)的序列的入站比特流110,并且提供包括受保护的且信任的出站内容帧121和相关联的出站元数据帧122的序列的出站比特流120。通过使用一个或更多个签名来保护经解码的PCM数据、常规元数据和附加元数据(也称为演化元数据)来确保上述情况。编码器501验证一个或更多个签名并且将附加元数据作为出站元数据帧122传送至出站比特流120。图5a中所示的使用情形可以例如适用于对比特流从第一比特率至第二比特率的转码。As shown in Figures 5a-5d, the use of one or more signature values enables various usage scenarios. Figure 5a shows a transcoder 503 where no PCM processing is performed between the decoder 504 and the encoder 503. Therefore, the protected data 510 (including one or more sets of PCM data and associated metadata and one or more associated signatures) is not modified and the chain of trust is maintained in the transcoder 503 . Accordingly, the transcoder 503 of Figure 5a is configured to receive an inbound bitstream comprising a sequence of protected and trusted inbound content frames 111 and associated inbound metadata frames 112 (also referred to as evolved frames) 110, and provides an outbound bitstream 120 that includes a sequence of protected and trusted outbound content frames 121 and associated outbound metadata frames 122. This is ensured by using one or more signatures to protect decoded PCM data, regular metadata and additional metadata (also called evolved metadata). Encoder 501 verifies one or more signatures and transmits additional metadata to outbound bitstream 120 as outbound metadata frames 122 . The use case shown in Figure 5a may for example be suitable for transcoding a bitstream from a first bitrate to a second bitrate.
图5b示出了信任链被非置信的PCM处理级505破坏的PCM连接转码器513。PCM处理级505接收受保护的数据510并且修改数据510。PCM处理级505是“非置信的”,原因是:PCM处理级505不知道解码器504使用的安全秘钥K。因此,修改的数据511包括一组或更多组修改的PCM数据和相关联的元数据以及一个或更多个无效签名。编码器501被配置成确定签名的无效性并且可以被配置成采取适当的行动。具体地,编码器501可以被配置成从入站元数据帧112中丢弃附加元数据,从而提供仅包括内容帧121序列但不包括相关联的元数据帧122的出站比特流120。通过这样做,确保转码器513不转发非置信的附加元数据。此外,由于比特流120不包括元数据帧122的事实,比特流120不包括上述加密值(来自元数据帧122的保护字段)。这样,解码器104可以将比特流120识别为非置信的。Figure 5b shows a PCM connection transcoder 513 where the chain of trust is broken by an untrusted PCM processing stage 505. The PCM processing stage 505 receives the protected data 510 and modifies the data 510 . The PCM processing stage 505 is "untrusted" because the PCM processing stage 505 does not know the security key K used by the decoder 504. Thus, modified data 511 includes one or more sets of modified PCM data and associated metadata and one or more invalid signatures. Encoder 501 is configured to determine the invalidity of a signature and may be configured to take appropriate action. Specifically, encoder 501 may be configured to discard additional metadata from inbound metadata frames 112 , thereby providing an outbound bitstream 120 that includes only the sequence of content frames 121 but not the associated metadata frames 122 . By doing this, it is ensured that the transcoder 513 does not forward untrusted additional metadata. Furthermore, due to the fact that the bitstream 120 does not include the metadata frame 122, the bitstream 120 does not include the aforementioned encrypted value (from the protection field of the metadata frame 122). In this manner, decoder 104 may identify bitstream 120 as untrusted.
如上所述,编码器501可以被配置成:如果一个或更多个签名值无效,则从入站元数据帧112中丢弃附加元数据。如表2的上下文中所述,入站元数据帧112的元数据块140可以表示描述相对应的元数据块140的一个或更多个属性的相应的描述符142。这些属性中之一可以是PCM处理参数(在表2中称为tight_coupling参数)。编码器501可以被配置成:使用元数据块142的PCM处理参数,以确定是否要将元数据块142中包括的元数据包括在出站比特流120中。具体地,PCM处理参数可以向编码器501指示:即使相关联的内容帧111的PCM样本已经被修改,仍然将来自入站元数据帧112的块140的元数据包括在出站比特流120中。As described above, the encoder 501 may be configured to discard additional metadata from the inbound metadata frame 112 if one or more signature values are invalid. As described in the context of Table 2, the metadata chunks 140 of the inbound metadata frame 112 may represent corresponding descriptors 142 that describe one or more attributes of the corresponding metadata chunk 140. One of these attributes may be a PCM processing parameter (called tight_coupling parameter in Table 2). Encoder 501 may be configured to use the PCM processing parameters of metadata block 142 to determine whether the metadata included in metadata block 142 is to be included in outbound bitstream 120 . Specifically, the PCM processing parameters may indicate to the encoder 501 that the metadata from the block 140 of the inbound metadata frame 112 is still included in the outbound bitstream 120 even if the PCM samples of the associated content frame 111 have been modified. .
表5示出了PCM处理参数(即,表2的tight_coupling参数)的示例语义。在所示的示例中,PCM处理参数的值“0”表示:只有当无PCM处理发生时,例如只有当一个或更多个签名值已经被编码器501验证时,才应该将元数据块140的有效载荷143(即元数据)包括在出站比特流120中。另一方面,PCM处理参数的值“3”可以表示:即使PCM样本已经被修改,例如即使一个或更多个签名值尚未被验证,也应该总是将块140的有效载荷143包括在出站比特流120中。此外,PCM处理参数可以具有表示中间情况的值,即,PCM处理参数可以具有表示将有效载荷143包括在出站比特流120中需要满足的PCM处理条件的值或者表示在不将有效载荷143包括在出站比特流120中的情况下的PCM处理条件的值。Table 5 shows example semantics of the PCM processing parameters (ie, the tight_coupling parameter of Table 2). In the example shown, a value of "0" for the PCM processing parameter indicates that the metadata block 140 should only be processed when no PCM processing occurs, for example only when one or more signature values have been verified by the encoder 501 The payload 143 (i.e. metadata) is included in the outbound bitstream 120. On the other hand, a value of "3" for the PCM processing parameter may mean that the payload 143 of block 140 should always be included on the outbound even if the PCM sample has been modified, e.g. even if one or more signature values have not been verified. Bitstream 120. Furthermore, the PCM processing parameters may have values representing intermediate cases, i.e., the PCM processing parameters may have values representing PCM processing conditions that need to be satisfied for the payload 143 to be included in the outbound bitstream 120 or to indicate that the payload 143 is not included in the outbound bitstream 120 . The value of the PCM processing condition in the case of outbound bitstream 120.
PCM处理级505可以被配置成:向编码器501通知关于在PCM处理级505中已经对PCM样本执行的处理。换言之,PCM处理级505可以被配置成:向编码器501通知PCM处理条件(例如,PCM样本的样本率的转换、系统声音包括在PCM样本中、元数据的修改、通道配置的修改(例如,将单声道信号修改成立体声信号,或者将5.1多声道信号下混合成立体声信号)、响度的调平等)。这样,编码器501可以被配置成:接收来自PCM处理级505的对PCM处理条件的表示。此外,编码器501可以被配置成:基于接收的PCM处理条件并且基于PCM处理参数的值(例如,根据表5的语义),来处理元数据块140的元数据。The PCM processing stage 505 may be configured to inform the encoder 501 about the processing that has been performed on the PCM samples in the PCM processing stage 505 . In other words, the PCM processing stage 505 may be configured to: notify the encoder 501 of PCM processing conditions (e.g., conversion of sample rates of PCM samples, inclusion of system sounds in PCM samples, modification of metadata, modification of channel configurations (e.g., Modify the mono signal into a stereo signal, or mix the 5.1 multi-channel signal into a stereo signal), adjust the loudness, etc.). In this manner, encoder 501 may be configured to receive a representation of PCM processing conditions from PCM processing stage 505 . Furthermore, the encoder 501 may be configured to process the metadata of the metadata block 140 based on the received PCM processing conditions and based on the values of the PCM processing parameters (eg, according to the semantics of Table 5).
表5table 5
图5c示出了被配置成执行置信PCM处理的PCM连接转码器523的情况。可以通过将PCM处理级506与附加的重新签名级507进行组合来实现上述情况。出于该目的,可以为置信方提供一个或更多个安全秘钥,从而使得置信方能够对修改的数据511重新签名。作为示例,可以为置信方提供中等安全秘钥K2。因此,可以对修改的数据511重新签名(即,可以使用中等安全秘钥K2,基于修改的数据511来确定一个或更多个签名值),从而提供受保护的修改的数据512(包括多组修改的PCM数据和相关联的元数据的序列以及一个或更多个新签名)。编码器501可以被配置成:验证新签名并且生成包括内容帧121的序列和相关联的元数据帧122的序列的置信出站比特流120。此外,编码器501可以被配置成:确定信任链已经被破坏并且新链已经被创建,原因是:重新签名级507可能已经使用了与解码器504(其可能已经使用了高度安全秘钥K1)不同的安全秘钥(例如,中等安全秘钥K2)。Figure 5c shows the case of a PCM connection transcoder 523 configured to perform trusted PCM processing. This can be achieved by combining the PCM processing stage 506 with an additional re-signing stage 507. For this purpose, the trusted party may be provided with one or more security keys, thereby enabling the trusted party to re-sign the modified data 511. As an example, the trusted party may be provided with a medium security key K2 . Accordingly, the modified data 511 may be re-signed (i.e., one or more signature values may be determined based on the modified data 511 using the medium security key K2 ), thereby providing protected modified data 512 (including multiple A sequence of modified PCM data and associated metadata and one or more new signatures). Encoder 501 may be configured to verify the new signature and generate a trusted outbound bitstream 120 that includes a sequence of content frames 121 and a sequence of associated metadata frames 122 . Additionally, the encoder 501 may be configured to determine that the chain of trust has been broken and a new chain has been created because the re-signing stage 507 may have been used with the decoder 504 (which may have used the high security key K 1 ) different security keys (eg, medium security key K 2 ).
图5d示出了在编码器501中包括PCM处理级509的PCM连接转码器533的框图。具体地,转码器533被配置成:通过确保由知道解码器504用于确定一个或更多个签名值的安全秘钥的实体(例如,编码器501)执行PCM处理来保持信任链。编码器501被配置成验证保护数据510的一个或更多个签名。然后,内部PCM处理级508可以修改所接收的多组PCM数据和相关联的元数据。此外,编码器501可以包括被配置成对进行了在PCM处理级508中执行的修改的元数据帧进行更新的元数据更新单元509。具体地,元数据更新单元509可以被配置成:基于经转码的内容帧121和元数据帧122来确定更新的加密值。然后,更新的加密值可以被包括在元数据帧122中以传送至解码器104。Figure 5d shows a block diagram of a PCM connected transcoder 533 including a PCM processing stage 509 in the encoder 501. Specifically, transcoder 533 is configured to maintain the chain of trust by ensuring that the PCM process is performed by an entity (eg, encoder 501 ) that knows the security key used by decoder 504 to determine one or more signature values. Encoder 501 is configured to verify one or more signatures protecting data 510 . The internal PCM processing stage 508 may then modify the received sets of PCM data and associated metadata. Furthermore, the encoder 501 may include a metadata update unit 509 configured to update metadata frames subject to modifications performed in the PCM processing stage 508 . Specifically, the metadata update unit 509 may be configured to determine an updated encryption value based on the transcoded content frame 121 and the metadata frame 122 . The updated encrypted value may then be included in the metadata frame 122 for communication to the decoder 104 .
图6分别提供了转码器503、513、523和533的另一种表示。Figure 6 provides another representation of transcoders 503, 513, 523 and 533 respectively.
在本文献中,已经描述了用于对元数据进行转码的方法和系统。这些方法和系统使得能够以降低的计算复杂度对元数据进行转码。具体地,提出了为元数据块提供描述符,从而使得转码器能够仅基于描述符对元数据进行转码,而无需分析元数据块中包括的实际的元数据。通过这样做,可以显著降低转码器的复杂度。此外,本文献提供了用于保护元数据帧以及用于保护PCM连接转码器中的PCM数据的方法和系统。因此,可以确保为转码器元数据的接收器提供对接收的元数据的可信度的表示。In this document, methods and systems for transcoding metadata have been described. These methods and systems enable metadata to be transcoded with reduced computational complexity. Specifically, it is proposed to provide a descriptor for a metadata block, thereby enabling a transcoder to transcode metadata based solely on the descriptor, without the need to analyze the actual metadata included in the metadata block. By doing this, the complexity of the transcoder can be significantly reduced. Furthermore, this document provides methods and systems for protecting metadata frames and for protecting PCM data in PCM connected transcoders. Thus, it is ensured that the receiver of the transcoder metadata is provided with an indication of the trustworthiness of the received metadata.
本文献中描述的方法和系统可以实现为软件、固件和/或硬件。某些部件可以例如实现为在数字信号处理器或微处理器上运行的软件。其他部件可以例如实现为硬件和或实现为专用集成电路。所描述的方法和系统中遇到的信号可以存储在介质如随机存取存储器或光存储介质上。可以经由网络如射频网络、卫星网络、无线网络或有线网络如因特网来传输所描述的方法和系统中遇到的信号。使用本文献中描述的方法和系统的典型装置包括用于存储和/或渲染音频信号的便携式电子装置或其他消费类设备。The methods and systems described in this document may be implemented as software, firmware and/or hardware. Certain components may, for example, be implemented as software running on a digital signal processor or microprocessor. Other components may, for example, be implemented as hardware and/or as application specific integrated circuits. Signals encountered in the described methods and systems may be stored on a medium such as random access memory or optical storage media. Signals encountered in the described methods and systems may be transmitted via a network such as a radio frequency network, a satellite network, a wireless network, or a wired network such as the Internet. Typical devices using the methods and systems described in this document include portable electronic devices or other consumer devices for storing and/or rendering audio signals.
本技术还可以如下配置。This technology can also be configured as follows.
(1)一种转码器(103),被配置成:将包括入站内容帧(111)和相关联的入站元数据帧(112)的入站比特流(110)转码成包括出站内容帧(121)和相关联的出站元数据帧(122)的出站比特流(120);其中,所述入站内容帧(111)表示根据第一编解码器系统编码的信号,并且其中,所述出站内容帧(121)表示根据第二编解码器系统编码的信号;其中,所述转码器(103)被配置成:(1) A transcoder (103) configured to: transcode an inbound bitstream (110) including an inbound content frame (111) and an associated inbound metadata frame (112) to include an outbound an outbound bitstream (120) of an inbound content frame (121) and an associated outbound metadata frame (122); wherein said inbound content frame (111) represents a signal encoded in accordance with a first codec system, and wherein said outbound content frame (121) represents a signal encoded according to a second codec system; wherein said transcoder (103) is configured to:
-从所述入站元数据帧(112)中识别元数据入站块(140);其中,所述元数据入站块(140)与表示所述元数据入站块(140)中包括的元数据的一个或更多个属性的入站描述符(142)相关联;以及- identifying a metadata inbound block (140) from said inbound metadata frame (112); wherein said metadata inbound block (140) is identical to a associated with the inbound descriptor (142) of one or more attributes of the metadata; and
-基于所述入站描述符(142)根据所述入站元数据帧生成所述出站元数据帧(122)。- Generate the outbound metadata frame (122) from the inbound metadata frame based on the inbound descriptor (142).
(2)根据(1)所述的转码器(103),其中,所述转码器(103)被配置成通过以下生成所述出站元数据帧(122):(2) The transcoder (103) according to (1), wherein the transcoder (103) is configured to generate the outbound metadata frame (122) by:
-将来自所述入站块(140)的元数据拷贝至出站块(140);-Copy metadata from said inbound block (140) to the outbound block (140);
以及as well as
-将所述出站块(140)插入至所述出站元数据帧(122)中;其中,所述拷贝和所述插入受制于由所述入站描述符(142)表示的所述一个或更多个属性。- inserting said outbound block (140) into said outbound metadata frame (122); wherein said copy and said insertion are subject to said one represented by said inbound descriptor (142) or more attributes.
(3)根据(2)所述的转码器(103),其中,所述转码器(103)被配置成:通过基于所述入站块(140)的所述入站描述符(142)生成所述出站块(140)的出站描述符(142)来生成所述出站元数据帧(122)。(3) The transcoder (103) according to (2), wherein the transcoder (103) is configured to pass the inbound descriptor (142) based on the inbound block (140) ) generates an outbound descriptor (142) of the outbound block (140) to generate the outbound metadata frame (122).
(4)根据(3)所述的转码器(103),其中,所述转码器(103)被配置成:对由所述入站描述符(142)表示的一个或更多个属性进行修改以生成所述出站描述符(142);其中,所述出站描述符(142)表示所修改的一个或更多个属性。(4) The transcoder (103) according to (3), wherein the transcoder (103) is configured to: Modifications are made to generate the outbound descriptor (142); wherein the outbound descriptor (142) represents the modified one or more attributes.
(5)根据前述任一项所述的转码器(103),其中,所述转码器(103)被配置成:仅基于由所述入站描述符(142)表示的所述一个或更多个属性,根据所述入站元数据帧(112)生成所述出站元数据帧(122)。(5) The transcoder (103) according to any of the preceding, wherein the transcoder (103) is configured to: based solely on the one or and generating the outbound metadata frame (122) based on the inbound metadata frame (112).
(6)根据前述任一项所述的转码器(103),其中,所述转码器(103)被配置成:在不对所述入站块(140)中包括的元数据进行分析的情况下,根据所述入站元数据帧(112)生成所述出站元数据帧(122)。(6) The transcoder (103) according to any one of the preceding, wherein the transcoder (103) is configured to: perform analysis without analyzing metadata included in the inbound block (140). In this case, the outbound metadata frame (122) is generated based on the inbound metadata frame (112).
(7)根据前述任一项所述的转码器(103),其中,所述一个或更多个属性包括下述中的一个或更多个:(7) The transcoder (103) according to any of the preceding, wherein the one or more attributes include one or more of the following:
-表示所述信号的样本的时间戳参数;其中,所述入站块(140)的元数据与所述信号的样本相关联;- a timestamp parameter representing a sample of said signal; wherein the metadata of said inbound block (140) is associated with the sample of said signal;
-表示所述信号的数个样本的持续时间参数;其中,所述入站块(140)的元数据与所述信号的所述数个样本相关联;- a duration parameter representing a number of samples of said signal; wherein the metadata of said inbound block (140) is associated with said number of samples of said signal;
-转码参数,表示所述入站块(140)是否要被转码成所述出站比特流(120);- a transcoding parameter indicating whether said inbound block (140) is to be transcoded into said outbound bitstream (120);
-复制参数,表示所述入站块(140)的元数据是否要被包括在根据所述入站元数据帧(112)生成的每个出站元数据帧(122,322)中;- a replication parameter indicating whether the metadata of said inbound block (140) is to be included in each outbound metadata frame (122, 322) generated from said inbound metadata frame (112);
-去复制参数,表示如果根据多个入站元数据帧(112,312)来生成所述出站元数据帧(122)则所述入站块(140)的元数据是否要被所述转码器(103)丢弃;- Deduplication parameter indicating whether the metadata of the inbound block (140) is to be transferred if the outbound metadata frame (122) is generated based on multiple inbound metadata frames (112, 312). Coder(103) discard;
-优先级参数,表示所述入站块(140)的元数据相对于一个或更多个其他元数据入站块(140)的重要性;- a priority parameter indicating the importance of the metadata of said inbound block (140) relative to one or more other inbound blocks of metadata (140);
-关联参数,表示所述入站块(140)的元数据是否可以被插入至所述出站元数据帧(112)之后的延迟的出站元数据帧(322)中;和/或- an association parameter indicating whether the metadata of the inbound block (140) can be inserted into a delayed outbound metadata frame (322) following the outbound metadata frame (112); and/or
-PCM处理参数,表示进行了对所述入站内容帧(111)中包括的数据的修改的情况下所述入站块(140)的元数据是否要被所述转码器(103)丢弃。- a PCM processing parameter indicating whether the metadata of the inbound block (140) is to be discarded by the transcoder (103) in case of modifications to the data included in the inbound content frame (111) .
(8)根据前述任一项所述的转码器(103),其中(8) The transcoder (103) according to any of the preceding items, wherein
-所述一个或更多个属性包括表示所述信号的样本(202)的时间戳参数(201),所述入站块(140)的元数据与所述信号的所述样本(202)相关联;其中,所述入站描述符(142)的所述时间戳参数(201)表示与所述入站内容帧(111)有关的所述信号的样本(202);- said one or more attributes include a timestamp parameter (201) representing a sample (202) of said signal to which metadata of said inbound block (140) is associated wherein the timestamp parameter (201) of the inbound descriptor (142) represents a sample (202) of the signal associated with the inbound content frame (111);
-所述转码器(103)被配置成根据所述入站块(140)生成出站块(140);以及- the transcoder (103) is configured to generate an outbound block (140) based on the inbound block (140); and
-所述转码器(103)被配置成:通过修改所述入站描述符(142)的所述时间戳参数(201)来生成所述出站块(140)的出站描述符(142),以使得所述出站描述符(142)的时间戳参数(203)表示与所述出站内容帧(121)有关的所述信号的样本(202)。- said transcoder (103) is configured to generate an outbound descriptor (142) of said outbound block (140) by modifying said timestamp parameter (201) of said inbound descriptor (142) ), such that the timestamp parameter (203) of the outbound descriptor (142) represents a sample (202) of the signal associated with the outbound content frame (121).
(9)根据前述任一项所述的转码器(103),其中(9) The transcoder (103) according to any of the preceding items, wherein
-所述一个或更多个属性包括表示所述信号的样本(202)的时间戳参数(201),所述入站块(140)的元数据与所述信号的样本(202)相关联;其中,所述入站描述符(142)的所述时间戳参数(201)表示与所述入站内容帧(111)有关的所述信号的样本(202);- the one or more attributes include a timestamp parameter (201) representing a sample (202) of the signal with which the metadata of the inbound block (140) is associated; wherein said timestamp parameter (201) of said inbound descriptor (142) represents a sample (202) of said signal associated with said inbound content frame (111);
-所述转码器(103)被配置成根据所述入站块(140)生成出站块(140);- the transcoder (103) is configured to generate an outbound block (140) based on the inbound block (140);
-所述转码器(203)被配置成:将所述出站块(140)插入至延迟的出站元数据帧(222)中;- the transcoder (203) is configured to insert the outbound block (140) into a delayed outbound metadata frame (222);
-所述延迟的出站元数据帧(222)与不包括所述信号的所述样本(202)的延迟的出站内容帧(221)相关联;以及- said delayed outbound metadata frame (222) is associated with a delayed outbound content frame (221) that does not include said sample (202) of said signal; and
-所述转码器(103)被配置成:通过修改所述入站块(140)的时间戳参数(201)来生成所述出站块(140)的出站描述符(142),以使得所述出站描述符(142)的时间戳参数(213)表示与所述延迟的出站内容帧(221)有关的所述信号的样本(202);- the transcoder (103) is configured to generate an outbound descriptor (142) of the outbound block (140) by modifying the timestamp parameter (201) of the inbound block (140) to causing the timestamp parameter (213) of the outbound descriptor (142) to represent a sample (202) of the signal associated with the delayed outbound content frame (221);
(10)根据前述任一项所述的转码器(103),其中(10) The transcoder (103) according to any of the preceding items, wherein
-所述一个或更多个属性包括复制参数,所述复制参数表示所述入站块(140)的元数据是否要被包括在根据所述入站元数据帧(112)生成的每个出站元数据帧(122,322)中;以及- said one or more attributes include a replication parameter indicating whether the metadata of said inbound block (140) is to be included in each outbound block generated from said inbound metadata frame (112) in the station metadata frame (122, 322); and
-所述转码器(103)被配置成:通过考虑所述复制参数,根据所述入站元数据帧(112)生成多个出站元数据帧(122,322)。- The transcoder (103) is configured to generate a plurality of outbound metadata frames (122, 322) from the inbound metadata frame (112) by taking into account the replication parameters.
(11)根据(10)所述的转码器(103),其中,所述转码器(103)被配置成(11) The transcoder (103) according to (10), wherein the transcoder (103) is configured to
-确定所述复制参数表示所述入站块(140)的元数据要被包括在根据所述入站元数据帧(112)生成的每个出站元数据帧(122,322)中;- determining that said replication parameters represent metadata of said inbound block (140) to be included in each outbound metadata frame (122, 322) generated from said inbound metadata frame (112);
-将所述入站块(140)的元数据插入至所述多个出站元数据帧(122,322)中的每个出站元数据帧中。- Inserting the metadata of the inbound block (140) into each of the plurality of outbound metadata frames (122, 322).
(12)根据(10)至(11)中任一项所述的转码器(103),其中,所述复制参数包括标记,所述标记能够被设置成表示所述入站块(140)的元数据要被包括在根据所述入站元数据帧(112)生成的每个出站元数据帧(122,322)中,或者所述标记能够被设置成表示所述入站块(140)的元数据不要被包括在根据所述入站元数据帧(112)生成的每个出站元数据帧(122,322)中。(12) The transcoder (103) according to any of (10) to (11), wherein the replication parameters include a flag that can be set to represent the inbound block (140) of metadata to be included in each outbound metadata frame (122, 322) generated from the inbound metadata frame (112), or the flag can be set to represent the inbound block (140 ) metadata shall not be included in each outbound metadata frame (122, 322) generated from said inbound metadata frame (112).
(13)根据(10)至(12)中任一项所述的转码器(103),其中(13) The transcoder (103) according to any one of (10) to (12), wherein
-所述转码器(103)被配置成根据所述入站内容帧(111)生成多个出站内容帧(121,321);以及- the transcoder (103) is configured to generate a plurality of outbound content frames (121, 321) based on the inbound content frame (111); and
-所述多个出站内容帧(121,321)与所述多个出站元数据帧(122,322)中的对应的出站元数据帧相关联。- the plurality of outbound content frames (121, 321) are associated with corresponding outbound metadata frames of the plurality of outbound metadata frames (122, 322).
(14)根据前述任一项所述的转码器(103),其中,所述一个或更多个属性包括(14) The transcoder (103) according to any of the preceding, wherein the one or more attributes include
-去复制参数,表示如果根据多个入站元数据帧(112,312)来生成所述出站元数据帧(122)则所述入站块(140)的元数据是否可以被所述转码器(103)丢弃;- Deduplication parameter indicating whether the metadata of the inbound block (140) can be transferred if the outbound metadata frame (122) is generated based on multiple inbound metadata frames (112, 312). Coder(103) discard;
-所述转码器(103)被配置成:通过考虑所述去复制参数,根据所述入站比特流(110)的多个入站元数据帧(112)生成所述出站元数据帧(122)。- the transcoder (103) is configured to generate said outbound metadata frame from a plurality of inbound metadata frames (112) of the inbound bitstream (110) by taking into account said deduplication parameters (122).
(15)根据(14)所述的转码器(103),其中(15) The transcoder (103) according to (14), wherein
-所述多个入站元数据帧(112)包括多个元数据入站块(140),每个所述入站块(140)与表示所述入站块(140)的元数据可以被所述转码器(103)丢弃的相应的去复制参数相关联;以及- said plurality of inbound metadata frames (112) comprise a plurality of inbound metadata blocks (140), each said inbound block (140) together with metadata representing said inbound block (140) may be Corresponding deduplication parameters discarded by the transcoder (103) are associated; and
-所述转码器(103)被配置成:对于除了所述多个入站元数据帧(112,312)中的一个入站元数据帧以外的所有入站元数据帧,丢弃所述多个入站块(140)的元数据,以生成所述出站元数据帧(122)。- the transcoder (103) is configured to, for all but one inbound metadata frame of the plurality of inbound metadata frames (112, 312), discard said plurality of inbound metadata frames (112, 312). of inbound chunks (140) of metadata to generate the outbound metadata frame (122).
(16)根据(14)至(15)中任一项所述的转码器(103),其中(16) The transcoder (103) according to any one of (14) to (15), wherein
-所述转码器(103)被配置成根据多个入站内容帧(111,311)生成所述出站内容帧(121);以及- the transcoder (103) is configured to generate the outbound content frame (121) based on a plurality of inbound content frames (111, 311); and
-所述多个入站内容帧(111,311)与所述多个入站元数据帧(112,312)中的相应的入站元数据帧相关联。- the plurality of inbound content frames (111, 311) are associated with corresponding inbound metadata frames of the plurality of inbound metadata frames (112, 312).
(17)根据(14)至(16)中任一项所述的转码器(103),其中,所述去复制参数包括标记,所述标记可以被设置成表示:如果根据多个入站元数据帧(112,312)来生成所述出站元数据帧(122),则所述入站块(140)的元数据可以被所述转码器(103)丢弃;或者所述标记可以被设置成表示:如果根据多个入站元数据帧(112,312)来生成所述出站元数据帧(122),则所述入站块(140)的元数据不可以被所述转码器(103)丢弃。(17) The transcoder (103) according to any one of (14) to (16), wherein the de-duplication parameters include a flag that may be set to indicate that if metadata frame (112, 312) to generate the outbound metadata frame (122), then the metadata of the inbound block (140) can be discarded by the transcoder (103); or the mark can is set to indicate that if the outbound metadata frame (122) is generated from a plurality of inbound metadata frames (112, 312), the metadata of the inbound block (140) may not be transferred. Encoder (103) discarded.
(18)根据前述任一项所述的转码器(103),其中(18) The transcoder (103) according to any of the preceding items, wherein
-所述一个或更多个属性包括表示所述入站块(140)的元数据相对于一个或更多个其他元数据入站块(140)的相对重要性的优先级参数;- the one or more attributes include a priority parameter representing the relative importance of the metadata of the inbound block (140) with respect to one or more other inbound blocks of metadata (140);
-所述入站元数据帧(112)包括具有表示所述优先级参数的不同值的描述符(142)的多个入站块(140);以及- the inbound metadata frame (112) includes a plurality of inbound blocks (140) with descriptors (142) representing different values of the priority parameter; and
-所述转码器(103)被配置成:按照所述多个入站块(140)的所述优先级参数,根据所述多个入站块(140)来生成所述出站元数据帧(122)。- said transcoder (103) is configured to generate said outbound metadata based on said plurality of inbound blocks (140) according to said priority parameters of said plurality of inbound blocks (140) frame(122).
(19)根据(18)所述的转码器(103),其中(19) The transcoder (103) according to (18), wherein
-所述多个入站块(140)与表示增量优先级的增量优先级参数相关联;- said plurality of inbound blocks (140) are associated with an incremental priority parameter representing an incremental priority;
-所述多个入站块(140)包括增量元数据,以使得多个入站块(140)的组合元数据提供高质量元数据并且使得所述多个入站块(140)中的具有最高相对优先级的入站块(140)的元数据提供下降质量的元数据;以及- the plurality of inbound blocks (140) include incremental metadata such that the combined metadata of the plurality of inbound blocks (140) provides high quality metadata and such that the combined metadata of the plurality of inbound blocks (140) Metadata for the inbound block (140) with the highest relative priority provides metadata of reduced quality; and
所述转码器(103)被配置成:基于所述多个入站块(140)中的一个或更多个入站块生成所述出站元数据帧(122),从而使所述出站元数据帧(122)中包括的所述元数据的质量能够可伸缩地劣化。The transcoder (103) is configured to generate the outbound metadata frame (122) based on one or more of the plurality of inbound blocks (140) such that the outbound The quality of the metadata included in the station metadata frame (122) can be scalably degraded.
(20)根据前述任一项所述的转码器(103),其中(20) The transcoder (103) according to any of the preceding items, wherein
-所述一个或更多个属性包括关联参数,所述关联参数表示所述入站块(140)的元数据是否可以被插入至在所述出站元数据帧(112)之后的延迟的出站元数据帧(222)中;- the one or more attributes include an association parameter indicating whether the metadata of the inbound block (140) can be inserted into a delayed outbound after the outbound metadata frame (112) In station metadata frame (222);
-所述转码器(103)被配置成:基于所述关联参数并且基于对所述出站比特流(120)的比特率限制,将来自所述入站块(140)的元数据插入至所述出站元数据帧(122)中。- The transcoder (103) is configured to insert metadata from the inbound chunk (140) into the in the outbound metadata frame (122).
(21)根据(20)所述的转码器(103),其中(21) The transcoder (103) according to (20), wherein
-所述转码器被配置成:如果所述关联参数表示所述入站块(140)的元数据可以被延迟,则将来自所述入站块(140)的元数据插入至在所述出站元数据帧(122)之后的延迟的出站元数据帧(222)中。- the transcoder is configured to insert metadata from the inbound block (140) into the In the delayed outbound metadata frame (222) following the outbound metadata frame (122).
(22)根据前述任一项所述的转码器(103),其中(22) The transcoder (103) according to any of the preceding items, wherein
-所述入站元数据帧(112)和所述出站元数据帧(122)遵循共同的语法;- the inbound metadata frame (112) and the outbound metadata frame (122) follow a common syntax;
-所述共同的语法使得元数据帧(130)能够包括零个元数据块(140)、一个或更多个元数据块(140);- the common syntax enables a metadata frame (130) to include zero metadata blocks (140), one or more metadata blocks (140);
-每个元数据块(140)表示相对应的描述符(142),所述相对应的描述符(142)表示对应元数据块(140)中包括的元数据的一个或更多个属性。- Each metadata block (140) represents a corresponding descriptor (142) representing one or more attributes of the metadata included in the corresponding metadata block (140).
(23)根据前述任一项所述的转码器(103),还被配置成(23) The transcoder (103) according to any of the preceding items, further configured to
-使用所述第一编解码器系统的解码器和所述第二编解码器系统的编码器,根据所述入站内容帧(111)生成所述出站内容帧(121)。- generating said outbound content frame (121) from said inbound content frame (111) using a decoder of said first codec system and an encoder of said second codec system.
(24)根据前述任一项所述的转码器(103),其中(24) The transcoder (103) according to any of the preceding items, wherein
-所述信号包括音频信号;和/或- said signal includes an audio signal; and/or
-所述入站内容帧(111)表示所述信号的帧的一些或所有样本;以及- said inbound content frames (111) represent some or all samples of frames of said signal; and
-所述出站内容帧(121)表示所述信号的帧的一些或所有样本。- The outbound content frames (121) represent some or all samples of frames of the signal.
(25)根据前述任一项所述的转码器(103),其中(25) The transcoder (103) according to any of the preceding items, wherein
-所述第一编解码器系统与所述第二编解码器系统不同;和/或- the first codec system is different from the second codec system; and/or
-所述第一编解码器系统和所述第二编解码器系统包括以下项中的一个或更多个:杜比E、杜比数字+、杜比数字、杜比TrueHD、杜比睿波、AAC和/或HE-AAC。- said first codec system and said second codec system comprise one or more of the following: Dolby E, Dolby Digital Plus, Dolby Digital, Dolby TrueHD, Dolby Pulse, AAC and/or HE-AAC.
(26)根据前述任一项所述的转码器(103),其中,所述出站比特流(120)的比特率不同于所述入站比特流(110)的比特率。(26) The transcoder (103) according to any of the preceding, wherein the bitrate of the outbound bitstream (120) is different from the bitrate of the inbound bitstream (110).
(27)一种用于将包括入站内容帧(111)和相关联的入站元数据帧(112)的入站比特流(110)转码成包括出站内容帧(121)和相关联的出站元数据帧(122)的出站比特流(120)的方法;其中,所述入站内容帧(111)表示根据第一编解码器系统编码的信号,并且其中,所述出站内容帧(121)表示根据第二编解码器系统编码的信号;其中,所述方法包括:(27) A method for transcoding an inbound bitstream (110) including an inbound content frame (111) and associated inbound metadata frame (112) to include an outbound content frame (121) and associated A method of outbound bitstream (120) of an outbound metadata frame (122); wherein said inbound content frame (111) represents a signal encoded according to a first codec system, and wherein said outbound The content frame (121) represents a signal encoded according to the second codec system; wherein the method includes:
-从所述入站元数据帧(112)中识别元数据入站块(140);其中,所述元数据入站块(140)与表示所述元数据入站块(140)中包括的元数据的一个或更多个属性的入站描述符(142)相关联;以及- identifying a metadata inbound block (140) from said inbound metadata frame (112); wherein said metadata inbound block (140) is identical to a associated with the inbound descriptor (142) of one or more attributes of the metadata; and
-基于所述入站描述符(142),根据所述入站元数据帧(112)生成所述出站元数据帧(122)。- Generate said outbound metadata frame (122) from said inbound metadata frame (112) based on said inbound descriptor (142).
(28)一种包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110);其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述元数据帧(112)包括元数据块(140);其中,所述元数据块(140)与表示所述元数据块(140)中包括的元数据的一个或更多个属性的描述符(142)相关联。(28) An encoded bitstream (110) comprising a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a signal encoded according to a first codec system; wherein the metadata frame (112) includes a metadata block (140); wherein the metadata block (140) is associated with one or more attributes representing metadata included in the metadata block (140) associated with the descriptor (142).
(29)一种编码器(101),被配置成生成包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110);其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述编码器(101)被配置成(29) An encoder (101) configured to generate an encoded bitstream (110) comprising a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a A signal encoded by a first codec system; wherein the encoder (101) is configured to
-生成元数据块(140);-Generate metadata block(140);
-确定与所述元数据块(140)相关联的描述符(142);其中,所述描述符(142)表示所述元数据块(140)中包括的元数据的一个或更多个属性;以及- Determining a descriptor (142) associated with said metadata block (140); wherein said descriptor (142) represents one or more attributes of metadata included in said metadata block (140) ;as well as
-将所述元数据块(140)插入至所述元数据帧(112)中。- Inserting the metadata block (140) into the metadata frame (112).
(30)根据(29)所述的编码器(101),其中(30) The encoder (101) according to (29), wherein
-所述一个或更多个属性包括表示所述信号的样本(202)的时间戳参数(213),所述块(140)的元数据与所述信号的所述样本(202)相关联;- said one or more attributes include a timestamp parameter (213) representing a sample (202) of said signal with which metadata of said block (140) is associated;
-所述编码器(101)被配置成将所述块(140)插入至延迟的元数据帧(222)中;- the encoder (101) is configured to insert the block (140) into a delayed metadata frame (222);
-所述延迟的元数据帧(222)与不包括所述信号的所述样本(202)的延迟的内容帧(221)相关联;以及- said delayed metadata frame (222) is associated with a delayed content frame (221) that does not include said sample (202) of said signal; and
-所述编码器(101)被配置成生成所述块(140)的描述符(142),以使得所述描述符(142)的时间戳参数(213)表示与所述延迟的内容帧(221)有关的所述信号的样本(202)。- the encoder (101) is configured to generate the descriptor (142) of the block (140) such that the timestamp parameter (213) of the descriptor (142) represents the same as the delayed content frame ( 221) Samples (202) of said signal in question.
(31)一种用于生成包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110)的方法;其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述方法包括(31) A method for generating an encoded bitstream (110) comprising a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a System coded signal; wherein the method includes
-生成元数据块(140);-Generate metadata block(140);
-确定与所述元数据块(140)相关联的描述符(142);其中,所述描述符(142)表示所述元数据块(140)中包括的所述元数据的一个或更多个属性;以及- Determining a descriptor (142) associated with said metadata block (140); wherein said descriptor (142) represents one or more of said metadata included in said metadata block (140) attributes; and
-将所述元数据块(140)插入至所述元数据帧(112)中。- Inserting the metadata block (140) into the metadata frame (112).
(32)一种编码器(101),被配置成生成包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110);其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述编码器(101)被配置成(32) An encoder (101) configured to generate an encoded bitstream (110) including a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a A signal encoded by a first codec system; wherein the encoder (101) is configured to
-生成元数据块(140);-Generate metadata block(140);
-将所述元数据块(140)插入至所述元数据帧(112)中;- inserting the metadata block (140) into the metadata frame (112);
-从多个预定的安全秘钥中选择安全秘钥;其中,所述多个预定的安全秘钥提供不同的信任等级;- selecting a security key from a plurality of predetermined security keys; wherein said plurality of predetermined security keys provide different levels of trust;
-至少基于所述内容帧(111)、所述相关联的元数据帧(112)和所选择的安全秘钥来生成加密值;以及- generating an encrypted value based at least on the content frame (111), the associated metadata frame (112) and the selected security key; and
-将所生成的加密值插入至所述元数据帧(112)中。- Insert the generated encrypted value into the metadata frame (112).
(33)根据(32)所述的编码器(101),其中,所述多个预定的安全秘钥包括(33) The encoder (101) according to (32), wherein the plurality of predetermined security keys include
-仅为所述编码器(101)的开发人员所知的高度安全秘钥;以及- a highly secure key known only to the developer of said encoder (101); and
-为所述编码器(101)的操作人员所知的中等安全秘钥。- A medium security key known to the operator of said encoder (101).
(34)根据(32)至(33)中任一项所述的编码器(101),其中,所述编码器(101)被配置成(34) The encoder (101) according to any one of (32) to (33), wherein the encoder (101) is configured to
-针对所述编码比特流(110)生成多个连续的内容帧(111,311)和相关联的元数据帧(112,312);- generate a plurality of consecutive content frames (111, 311) and associated metadata frames (112, 312) for said encoded bitstream (110);
-基于单个内容帧(111)及其相关联的元数据帧(112)并且基于所选择的安全秘钥生成帧加密值;以及- Generating a frame encryption value based on a single content frame (111) and its associated metadata frame (112) and based on the selected security key; and
-基于所述多个连续的内容帧(111,311)中的至少一些连续的内容帧及其相关联的元数据帧(112,312)并且基于所选择的安全值生成历史加密值。- Generate historical encryption values based on at least some of the plurality of consecutive content frames (111, 311) and their associated metadata frames (112, 312) and based on the selected security value.
(35)根据(32)至(34)中任一项所述的编码器(101),其中,所述编码器(101)被配置成:计算用于生成加密值的HMAC-MD5值或HMAC-SHA256值。(35) The encoder (101) according to any one of (32) to (34), wherein the encoder (101) is configured to: calculate an HMAC-MD5 value or HMAC for generating an encrypted value -SHA256 value.
(36)根据(35)所述的编码器(101),其中,所述编码器(101)被配置成:将HMAC-MD5值或HMAC-SHA256值截短以产生所述加密值。(36) The encoder (101) according to (35), wherein the encoder (101) is configured to truncate an HMAC-MD5 value or an HMAC-SHA256 value to produce the encrypted value.
(37)根据(32)至(36)中任一项所述的编码器(101),其中,所述编码器(101)被配置成将所选择的安全秘钥的表示插入至所述元数据帧(112)中。(37) The encoder (101) according to any of (32) to (36), wherein the encoder (101) is configured to insert a representation of a selected security key into the element. in data frame (112).
(38)一种用于生成包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110)的方法;其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述方法包括(38) A method for generating an encoded bitstream (110) comprising a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a System coded signal; wherein the method includes
-生成元数据块(140);-Generate metadata block(140);
-将所述元数据块(140)插入至所述元数据帧(112)中;- inserting the metadata block (140) into the metadata frame (112);
-从多个预定的安全秘钥中选择安全秘钥;其中,所述多个预定的安全秘钥提供不同的信任等级;- selecting a security key from a plurality of predetermined security keys; wherein said plurality of predetermined security keys provide different levels of trust;
-至少基于所述内容帧(111)、所述相关联的元数据帧(112)和所选择的安全秘钥生成加密值;以及- generating an encrypted value based at least on the content frame (111), the associated metadata frame (112) and the selected security key; and
-将所生成的加密值插入至所述元数据帧(112)中。- Insert the generated encrypted value into the metadata frame (112).
(39)一种转码器(503),被配置成将包括入站内容帧(111)和相关联的入站元数据帧(112)的入站比特流(110)转码成出站比特流(120);其中,所述入站比特流(110)表示信号的一组样本;其中,所述转码器(103)包括(39) A transcoder (503) configured to transcode an inbound bitstream (110) including an inbound content frame (111) and an associated inbound metadata frame (112) into outbound bits stream (120); wherein the inbound bitstream (110) represents a set of samples of a signal; wherein the transcoder (103) includes
-解码器(504),被配置成-Decoder (504), configured to
-将所述入站内容帧(111)转换成所述信号的一组经解码的PCM样本;- converting said inbound content frame (111) into a set of decoded PCM samples of said signal;
-从所述入站元数据帧(112)中提取元数据;以及- extract metadata from said inbound metadata frame (112); and
-针对所述一组经解码的PCM样本和所提取的元数据,使用解码器安全秘钥生成签名值;以及- generating a signature value using the decoder security key for the set of decoded PCM samples and the extracted metadata; and
-编码器(501),被配置成-Encoder(501), configured to
-接收一组PCM样本和相关联的元数据;-Receive a set of PCM samples and associated metadata;
-接收签名值;-Receive signature value;
-使用编码器安全秘钥,验证所接收的签名值对于所接收的一组PCM样本和相关联的元数据是否有效;以及- Verify that the received signed value is valid for the received set of PCM samples and associated metadata using the encoder security key; and
-如果所接收的签名有效,则根据所接收的一组PCM样本生成所述出站比特流(120)的出站内容帧(121)并且根据所接收的元数据生成所述出站比特流(120)的相关联的出站元数据帧(122)。- If the received signature is valid, generate an outbound content frame (121) of the outbound bitstream (120) based on the received set of PCM samples and generate the outbound bitstream based on the received metadata ( The associated outbound metadata frame (122) of 120).
(40)根据(39)所述的转码器(503),其中,所述编码器(501)被配置成使用所述解码器安全秘钥作为所述编码器安全秘钥。(40) The transcoder (503) according to (39), wherein the encoder (501) is configured to use the decoder security key as the encoder security key.
(41)根据(39)至(40)中任一项所述的转码器(503),其中,所述编码器(501)被配置成:如果所接收的签名无效,则阻止所述将所接收的元数据插入至所述出站比特流(120)中。(41) The transcoder (503) according to any of (39) to (40), wherein the encoder (501) is configured to prevent the received signature from being The received metadata is inserted into the outbound bitstream (120).
(42)根据(39)至(41)中任一项所述的转码器(503),还包括PCM处理级(505,506),其被配置成(42) The transcoder (503) according to any one of (39) to (41), further comprising a PCM processing stage (505, 506) configured to
-对所述一组经解码的PCM样本和/或所提取的元数据进行修改,从而产生一组第二PCM样本和第二元数据;以及- modifying said set of decoded PCM samples and/or extracted metadata to produce a second set of PCM samples and second metadata; and
-将所述一组第二PCM样本和所述第二元数据传送至所述编码器(501)。-Transmitting said second set of PCM samples and said second metadata to said encoder (501).
(43)根据(42)所述的转码器(503),还包括重新签名单元507,其被配置成(43) The transcoder (503) according to (42), further comprising a re-signing unit 507 configured to
-针对所述一组第二PCM样本和所述第二元数据,使用重新签名安全秘钥确定更新的签名值;以及- determining an updated signature value using a re-sign security key for said set of second PCM samples and said second metadata; and
-将所述更新的签名值传送至所述编码器(501)。-Transmit the updated signature value to the encoder (501).
(44)根据(43)所述的转码器(503),其中(44) The transcoder (503) according to (43), wherein
-所述重新签名安全秘钥与所述解码器安全秘钥不同;以及- the re-signing security key is different from the decoder security key; and
-所述编码器(501)被配置成使用所述重新签名安全秘钥作为所述编码器安全秘钥。- The encoder (501) is configured to use the re-sign security key as the encoder security key.
(45)根据(39)至(44)中任一项所述的转码器(503),其中(45) The transcoder (503) according to any one of (39) to (44), wherein
-所述编码器(501)包括PCM处理级(508),所述PCM处理级(508)被配置成对所接收的一组PCM样本和/或所接收的元数据进行修改;以及- the encoder (501) includes a PCM processing stage (508) configured to modify a received set of PCM samples and/or received metadata; and
-所述编码器(501)被配置成:基于所修改的所接收的一组PCM样本和/或所修改的所接收的元数据,生成所述出站内容帧(121)和/或所述出站元数据帧(122)。- The encoder (501) is configured to generate the outbound content frame (121) and/or the outbound content frame (121) based on the modified received set of PCM samples and/or the modified received metadata. Outbound metadata frame (122).
(46)根据(39)至(45)中任一项所述的转码器(503),其中(46) The transcoder (503) according to any one of (39) to (45), wherein
-所述解码器(504)被配置成从所述入站元数据帧(112)中识别元数据入站块(140);其中,所述元数据入站块(140)与表示所述元数据入站块(140)中包括的元数据的一个或更多个属性的入站描述符(142)相关联;以及- said decoder (504) is configured to identify a metadata inbound block (140) from said inbound metadata frame (112); wherein said metadata inbound block (140) is identical to said metadata inbound block (140) representing said metadata an inbound descriptor (142) associated with one or more attributes of the metadata included in the data inbound block (140); and
-所述编码器(501)被配置成:基于所述入站描述符(142),根据所述入站元数据帧(112)生成所述出站元数据帧(122)。- The encoder (501) is configured to generate the outbound metadata frame (122) from the inbound metadata frame (112) based on the inbound descriptor (142).
(47)根据(46)所述的转码器(503),其中,所述一个或更多个属性包括PCM处理参数,所述PCM处理参数表示在进行了对所述一组PCM样本的修改和/或对提取的元数据的修改的情况下所述入站块(140)的元数据是否要被所述编码器(501)丢弃。(47) The transcoder (503) of (46), wherein the one or more attributes include PCM processing parameters that indicate when modifications to the set of PCM samples have been made. and/or whether the metadata of the inbound block (140) is to be discarded by the encoder (501) in the case of modifications to the extracted metadata.
(48)根据(47)所述的转码器(503),其中,所述编码器(501)被配置成:即使所述一组PCM样本和/或所提取的元数据已经被修改,如果所述PCM处理参数表示所述入站块(140)的元数据不应该被丢弃,则将所述入站块(140)的元数据包括在所述出站元数据帧(122)中。(48) The transcoder (503) according to (47), wherein the encoder (501) is configured to: even if the set of PCM samples and/or the extracted metadata have been modified, if If the PCM processing parameter indicates that the metadata of the inbound block (140) should not be discarded, the metadata of the inbound block (140) is included in the outbound metadata frame (122).
(49)根据向前从属于(42)至(44)中任一项的(47)所述的转码器(503),其中(49) The transcoder (503) according to (47) forwardly dependent on any of (42) to (44), wherein
-所述PCM处理级(505)被配置成:向所述编码器(501)提供对一个或更多个PCM处理条件的表示;- the PCM processing stage (505) is configured to: provide a representation of one or more PCM processing conditions to the encoder (501);
-所述一个或更多个PCM处理条件表示所述PCM处理级(505)如何处理了所述一组PCM样本和/或所提取的元数据;以及- the one or more PCM processing conditions represent how the set of PCM samples and/or extracted metadata has been processed by the PCM processing stage (505); and
-所述编码器被配置成还基于所述一个或更多个PCM处理条件,根据所述入站元数据帧(112)生成所述出站元数据帧(122)。- the encoder is configured to generate the outbound metadata frame (122) from the inbound metadata frame (112) further based on the one or more PCM processing conditions.
(50)根据(49)所述的转码器(503),其中,所述一个或更多个PCM处理条件包括以下中的一个或更多个:所述一组PCM样本的采样率的转换、所述PCM样本与系统声音的混合、所提取的元数据的修改、所述一组PCM样本的通道配置的修改、所述一组PCM样本的响度的调平。(50) The transcoder (503) of (49), wherein the one or more PCM processing conditions include one or more of: conversion of the sampling rate of the set of PCM samples , the mixing of the PCM samples with the system sound, the modification of the extracted metadata, the modification of the channel configuration of the set of PCM samples, the leveling of the loudness of the set of PCM samples.
(51)一种用于将包括入站内容帧(111)和相关联的入站元数据帧(112)的入站比特流(110)转码成出站比特流(120)的方法;其中,所述入站比特流(110)表示信号的一组样本;所述方法包括,(51) A method for transcoding an inbound bitstream (110) including an inbound content frame (111) and associated inbound metadata frame (112) to an outbound bitstream (120); wherein , the inbound bitstream (110) represents a set of samples of the signal; the method includes,
在解码器(504)处,At the decoder (504),
-将所述入站内容帧(111)转换成所述信号的一组经解码的PCM样本;- converting said inbound content frame (111) into a set of decoded PCM samples of said signal;
-从所述入站元数据帧(112)中提取元数据;以及- extract metadata from said inbound metadata frame (112); and
-针对所述一组经解码的PCM样本和所提取的元数据,使用解码器安全秘钥生成签名值;- generating a signature value using the decoder security key for the set of decoded PCM samples and the extracted metadata;
-将所述一组经解码的PCM样本、所提取的元数据和所生成的签名值传送至相对应的编码器(501);以及- transmitting the set of decoded PCM samples, extracted metadata and generated signature values to the corresponding encoder (501); and
在所述编码器(501)处At the encoder (501)
-接收一组PCM样本和相关联的元数据;-Receive a set of PCM samples and associated metadata;
-接收签名值;-Receive signature value;
-使用编码器安全秘钥来确定所接收的签名值对于所接收的一组PCM样本和相关联的元数据是否有效;以及- Use the encoder security key to determine whether the received signature value is valid for the received set of PCM samples and associated metadata; and
-如果所接收的签名有效,则根据所接收的一组PCM样本生成所述出站比特流(120)的出站内容帧(121)并且根据所接收的元数据生成所述出站比特流(120)的相关联的出站元数据帧(122)。- If the received signature is valid, generate an outbound content frame (121) of the outbound bitstream (120) based on the received set of PCM samples and generate the outbound bitstream based on the received metadata ( The associated outbound metadata frame (122) of 120).
(52)一种解码器(104),被配置成接收包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110);其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述解码器(104)被配置成(52) A decoder (104) configured to receive an encoded bitstream (110) including a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a A signal encoded by a first codec system; wherein the decoder (104) is configured to
-从所述元数据帧(112)提取加密值;- Extract encrypted values from said metadata frame (112);
-根据多个预定的安全秘钥来确定安全秘钥;其中,所述多个预定的安全秘钥提供不同的信任等级;- Determining a security key based on a plurality of predetermined security keys; wherein the plurality of predetermined security keys provide different levels of trust;
-至少基于所述内容帧(111)、所述相关联的元数据帧(112)和所确定的安全秘钥来生成验证加密值;以及- generating a verification encrypted value based at least on said content frame (111), said associated metadata frame (112) and the determined security key; and
-将所提取的加密值与所述验证加密值进行比较,以确定是否可以信任所接收的编码比特流(110)。- Compare the extracted encrypted value with the verified encrypted value to determine whether the received encoded bitstream can be trusted (110).
(53)根据(52)所述的解码器(104),其中,所述解码器(104)被配置成:通过从所述元数据帧(112)中提取所述安全秘钥来确定所述安全秘钥。(53) The decoder (104) of (52), wherein the decoder (104) is configured to determine the security key by extracting the security key from the metadata frame (112). Security key.
(54)根据(52)至(53)中任一项所述的解码器(104),其中,所述解码器(104)还被配置成:确定所述多个预定的安全秘钥中的哪个安全秘钥已被用于生成所提取的加密值,以确定所接收的编码比特流(110)的信任等级。(54) The decoder (104) according to any one of (52) to (53), wherein the decoder (104) is further configured to determine which of the plurality of predetermined security keys Which security key has been used to generate the extracted encryption value to determine the trust level of the received encoded bitstream (110).
(55)根据(52)至(54)中任一项所述的解码器(104),其中,所述解码器(104)被配置成(55) The decoder (104) according to any one of (52) to (54), wherein the decoder (104) is configured to
-针对所述多个预定的安全秘钥分别生成多个验证加密值;- Generate multiple verification encryption values for the plurality of predetermined security keys respectively;
-将所述多个验证加密值中的每个验证加密值与所提取的加密值进行比较;以及- comparing each of the plurality of verified encrypted values with the extracted encrypted value; and
-如果所述多个验证加密值中的一个验证加密值与所提取的加密值匹配,则确定所述多个预定的安全秘钥中的一个安全秘钥已被用于生成所提取的加密值。- if one of the plurality of verified encrypted values matches the extracted encrypted value, determining that one of the plurality of predetermined security keys has been used to generate the extracted encrypted value .
(56)一种用于确定接收的包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110)的信任等级的方法;其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述方法包括(56) A method for determining a trust level of a received encoded bitstream (110) comprising a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a a signal encoded by a first codec system; wherein the method includes
-从所述元数据帧(112)中提取加密值;- Extract encrypted values from said metadata frame (112);
-根据多个预定的安全秘钥来确定安全秘钥;其中,所述多个预定的安全秘钥提供不同的信任等级;- Determining a security key based on a plurality of predetermined security keys; wherein the plurality of predetermined security keys provide different levels of trust;
-至少基于所述内容帧(111)、所述相关联的元数据帧(112)和所确定的安全秘钥来生成验证加密值;以及- generating a verification encrypted value based at least on said content frame (111), said associated metadata frame (112) and the determined security key; and
-将所提取的加密值与所述验证加密值进行比较,以根据所确定的安全秘钥的信任等级来确定是否可以信任所述接收的编码比特流(110)。- Comparing the extracted encrypted value with the verified encrypted value to determine whether the received encoded bitstream can be trusted based on the determined trust level of the security key (110).
(57)一种解码器(504),被配置成:对包括入站内容帧(111)和相关联的入站元数据帧(112)的入站比特流(110)进行解码;其中,所述入站比特流(110)表示信号的一组样本;其中,所述解码器(504)被配置成(57) A decoder (504) configured to decode an inbound bitstream (110) including an inbound content frame (111) and an associated inbound metadata frame (112); wherein The inbound bitstream (110) represents a set of samples of a signal; wherein the decoder (504) is configured to
-将所述入站内容帧(111)转换成所述信号的一组经解码的PCM样本;- converting said inbound content frame (111) into a set of decoded PCM samples of said signal;
-从所述入站元数据帧(112)中提取元数据;- extract metadata from said inbound metadata frame (112);
-针对所述一组经解码的PCM样本和所提取的元数据,使用解码器安全秘钥生成签名值;以及- generating a signature value using the decoder security key for the set of decoded PCM samples and the extracted metadata; and
-将所述一组经解码的PCM样本、所提取的元数据和所生成的签名值发送至编码器(501)以用于重新编码。-Sending the set of decoded PCM samples, extracted metadata and generated signature values to the encoder (501) for re-encoding.
(58)一种编码器(501),被配置成:对包括出站内容帧(121)和相关联的出站元数据帧(122)的出站比特流(120)进行编码,其中,所述编码器(501)被配置成(58) An encoder (501) configured to encode an outbound bitstream (120) including an outbound content frame (121) and an associated outbound metadata frame (122), wherein The encoder (501) is configured to
-接收一组PCM样本和相关联的元数据;-Receive a set of PCM samples and associated metadata;
-接收针对所述一组PCM样本和相关联的元数据的签名值;- receiving signature values for said set of PCM samples and associated metadata;
-使用编码器安全秘钥验证所接收的签名值对于所接收的一组PCM样本和相关联的元数据是否有效;以及- Verify that the received signed value is valid for the received set of PCM samples and associated metadata using the encoder security key; and
-如果所接收的签名有效,则根据所接收的一组PCM样本生成所述出站比特流(120)的出站内容帧(121)并且根据所接收的元数据生成所述出站比特流(120)的相关联的出站元数据帧(122)。- If the received signature is valid, generate an outbound content frame (121) of the outbound bitstream (120) based on the received set of PCM samples and generate the outbound bitstream based on the received metadata ( The associated outbound metadata frame (122) of 120).
(59)一组用于对包括入站内容帧(111)和相关联的入站元数据帧(112)的入站比特流(110)进行解码的方法;其中,所述入站比特流(110)表示信号的一组样本;其中,所述方法包括(59) A set of methods for decoding an inbound bitstream (110) including an inbound content frame (111) and an associated inbound metadata frame (112); wherein said inbound bitstream (112) 110) A set of samples representing a signal; wherein the method includes
-将所述入站内容帧(111)转换成所述信号的一组经解码的PCM样本;- converting said inbound content frame (111) into a set of decoded PCM samples of said signal;
-从所述入站元数据帧(112)中提取元数据;- extract metadata from said inbound metadata frame (112);
-使用解码器安全秘钥来生成针对所述一组经解码的PCM样本和所提取的元数据的签名值;以及- using the decoder security key to generate a signature value for the set of decoded PCM samples and extracted metadata; and
-将所述一组经解码的PCM样本、所提取的元数据和所生成的签名值提供给编码器(501)以用于重新编码。- Providing the set of decoded PCM samples, extracted metadata and generated signature values to the encoder (501) for re-encoding.
(60)一种用于对包括出站内容帧(121)和相关联的出站元数据帧(122)的出站比特流(120)进行编码的方法;其中,所述方法包括(60) A method for encoding an outbound bitstream (120) including an outbound content frame (121) and an associated outbound metadata frame (122); wherein the method includes
-接收一组PCM样本和相关联的元数据;-Receive a set of PCM samples and associated metadata;
-接收针对所述一组PCM样本和相关联的元数据的签名值;- receiving signature values for said set of PCM samples and associated metadata;
-使用编码器安全秘钥,验证所接收的签名值对于所接收的一组PCM样本和相关联的元数据是否有效;以及- Verify that the received signed value is valid for the received set of PCM samples and associated metadata using the encoder security key; and
-如果所接收的签名有效,则根据所接收的一组PCM样本生成所述出站比特流(120)的出站内容帧(121)并且根据所接收的元数据生成所述出站比特流(120)的相关联的出站元数据帧(122)。- If the received signature is valid, generate an outbound content frame (121) of the outbound bitstream (120) based on the received set of PCM samples and generate the outbound bitstream based on the received metadata ( The associated outbound metadata frame (122) of 120).
(61)一种解码器(104),被配置成:对包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110)进行解码;其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述元数据帧(112)包括元数据块(140);其中,所述元数据块(140)与表示所述元数据块(140)中包括的元数据的一个或更多个属性的描述符(142)相关联;其中,所述解码器(104)被配置成(61) A decoder (104) configured to decode an encoded bitstream (110) including a content frame (111) and an associated metadata frame (112); wherein the content frame (111 ) represents a signal encoded according to a first codec system; wherein said metadata frame (112) includes a metadata block (140); wherein said metadata block (140) and represents said metadata block (140 ); wherein the decoder (104) is configured to
-对所述内容帧(111)中包括的编码信号进行解码;- decoding the encoded signal included in said content frame (111);
-从所述元数据帧(112)中识别所述元数据块(140);- identifying said metadata block (140) from said metadata frame (112);
-从所述元数据块(140)中提取所述描述符(142);以及-Extract the descriptor (142) from the metadata block (140); and
-根据由所述描述符(142)表示的所述一个或更多个属性来处理所述元数据块(140)中包括的元数据。- Processing the metadata included in the metadata block (140) according to the one or more attributes represented by the descriptor (142).
(62)一种用于对包括内容帧(111)和相关联的元数据帧(112)的编码比特流(110)进行解码的方法;其中,所述内容帧(111)表示根据第一编解码器系统编码的信号;其中,所述元数据帧(112)包括元数据块(140);其中,所述元数据块(140)与表示所述元数据块(140)中包括的元数据的一个或更多个属性的描述符(142)相关联;其中,所述方法包括(62) A method for decoding an encoded bitstream (110) comprising a content frame (111) and an associated metadata frame (112); wherein the content frame (111) represents a A signal encoded by a decoder system; wherein the metadata frame (112) includes a metadata block (140); wherein the metadata block (140) and represents metadata included in the metadata block (140) associated with a descriptor (142) of one or more attributes; wherein the method includes
-对所述内容帧(111)中包括的编码信号进行解码;- decoding the encoded signal included in said content frame (111);
-从所述元数据帧(112)中识别所述元数据块(140);- identifying said metadata block (140) from said metadata frame (112);
-从所述元数据块(140)中提取所述描述符(142);以及基于由所述描述符(142)表示的所述一个或更多个属性对所述元数据块(140)中包括的元数据进行处理。- extracting said descriptor (142) from said metadata block (140); and extracting said descriptor (142) from said metadata block (140) based on said one or more attributes represented by said descriptor (142) Included metadata is processed.
Claims (14)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810949078.6A CN109102815B (en) | 2013-01-21 | 2014-01-15 | Encoding device and method, transcoding method and transcoder, and non-transitory medium |
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361754893P | 2013-01-21 | 2013-01-21 | |
| US61/754,893 | 2013-01-21 | ||
| CN201480005287.5A CN105027478B (en) | 2013-01-21 | 2014-01-15 | Metadata transcoding |
| PCT/US2014/011695 WO2014113478A1 (en) | 2013-01-21 | 2014-01-15 | Metadata transcoding |
| CN201810949078.6A CN109102815B (en) | 2013-01-21 | 2014-01-15 | Encoding device and method, transcoding method and transcoder, and non-transitory medium |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201480005287.5A Division CN105027478B (en) | 2013-01-21 | 2014-01-15 | Metadata transcoding |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109102815A CN109102815A (en) | 2018-12-28 |
| CN109102815B true CN109102815B (en) | 2023-09-19 |
Family
ID=50031614
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201810949078.6A Active CN109102815B (en) | 2013-01-21 | 2014-01-15 | Encoding device and method, transcoding method and transcoder, and non-transitory medium |
| CN202311104445.XA Active CN117219100B (en) | 2013-01-21 | 2014-01-15 | System and method for processing an encoded audio bitstream, computer readable medium |
| CN201480005287.5A Active CN105027478B (en) | 2013-01-21 | 2014-01-15 | Metadata transcoding |
Family Applications After (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202311104445.XA Active CN117219100B (en) | 2013-01-21 | 2014-01-15 | System and method for processing an encoded audio bitstream, computer readable medium |
| CN201480005287.5A Active CN105027478B (en) | 2013-01-21 | 2014-01-15 | Metadata transcoding |
Country Status (10)
| Country | Link |
|---|---|
| US (3) | US9755835B2 (en) |
| EP (1) | EP2946495B1 (en) |
| JP (2) | JP6242916B2 (en) |
| KR (1) | KR101717928B1 (en) |
| CN (3) | CN109102815B (en) |
| BR (2) | BR122020018263B1 (en) |
| ES (1) | ES2629195T3 (en) |
| IN (1) | IN2015MN01633A (en) |
| RU (1) | RU2602332C1 (en) |
| WO (1) | WO2014113478A1 (en) |
Families Citing this family (33)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9621963B2 (en) | 2014-01-28 | 2017-04-11 | Dolby Laboratories Licensing Corporation | Enabling delivery and synchronization of auxiliary content associated with multimedia data using essence-and-version identifier |
| CN105532009B (en) | 2014-07-18 | 2021-03-12 | 索尼公司 | Sending device, sending method, receiving device and receiving method |
| US10228751B2 (en) | 2014-08-06 | 2019-03-12 | Apple Inc. | Low power mode |
| US9647489B2 (en) | 2014-08-26 | 2017-05-09 | Apple Inc. | Brownout avoidance |
| US9774974B2 (en) * | 2014-09-24 | 2017-09-26 | Electronics And Telecommunications Research Institute | Audio metadata providing apparatus and method, and multichannel audio data playback apparatus and method to support dynamic format conversion |
| US10231033B1 (en) | 2014-09-30 | 2019-03-12 | Apple Inc. | Synchronizing out-of-band content with a media stream |
| US10708391B1 (en) | 2014-09-30 | 2020-07-07 | Apple Inc. | Delivery of apps in a media stream |
| RU2708942C2 (en) * | 2014-10-24 | 2019-12-12 | Долби Интернешнл Аб | Audio signals encoding and decoding |
| US9769234B2 (en) * | 2015-02-20 | 2017-09-19 | Disney Enterprises, Inc. | Algorithmic transcoding |
| TWI607655B (en) * | 2015-06-19 | 2017-12-01 | Sony Corp | Coding apparatus and method, decoding apparatus and method, and program |
| US20170055253A1 (en) * | 2015-08-17 | 2017-02-23 | Harman International Industries, Incorporated | Metadata distribution in a network |
| WO2017193374A1 (en) * | 2016-05-13 | 2017-11-16 | SZ DJI Technology Co., Ltd. | System and method for presenting a video via transcode |
| US10535355B2 (en) | 2016-11-18 | 2020-01-14 | Microsoft Technology Licensing, Llc | Frame coding for spatial audio data |
| US10349097B2 (en) * | 2017-10-27 | 2019-07-09 | Mti Film, Llc | Metadata editor for multimedia delivery |
| CN107682379A (en) * | 2017-11-22 | 2018-02-09 | 南京汽车集团有限公司 | Safe information transmission device, transmission method and storage method based on homomorphic cryptography |
| US11363133B1 (en) | 2017-12-20 | 2022-06-14 | Apple Inc. | Battery health-based power management |
| US10817307B1 (en) | 2017-12-20 | 2020-10-27 | Apple Inc. | API behavior modification based on power source health |
| CN110198474B (en) * | 2018-02-27 | 2022-03-15 | 中兴通讯股份有限公司 | Code stream processing method and device |
| US11108552B1 (en) * | 2018-05-02 | 2021-08-31 | Amazon Technologies, Inc. | Data encryption method and system |
| US11770260B1 (en) * | 2019-03-28 | 2023-09-26 | Amazon Technologies, Inc. | Determining authenticity of digital content |
| US11449584B1 (en) | 2019-03-28 | 2022-09-20 | Amazon Technologies, Inc. | Generating authenticable digital content |
| US12177644B2 (en) * | 2019-11-04 | 2024-12-24 | Qualcomm Incorporated | Signalling of audio effect metadata in a bitstream |
| US11843792B2 (en) * | 2020-11-12 | 2023-12-12 | Istreamplanet Co., Llc | Dynamic decoder configuration for live transcoding |
| CN112584156B (en) * | 2020-11-18 | 2022-08-02 | 卓旺(安徽)航空科技产业股份有限公司 | Transcoding 4K video algorithm based on 2K video data deep learning |
| US11930349B2 (en) | 2020-11-24 | 2024-03-12 | Naver Corporation | Computer system for producing audio content for realizing customized being-there and method thereof |
| KR102773326B1 (en) * | 2020-11-24 | 2025-02-27 | 가우디오랩 주식회사 | Method for performing normalization of audio signal and apparatus therefor |
| KR102505249B1 (en) * | 2020-11-24 | 2023-03-03 | 네이버 주식회사 | Computer system for transmitting audio content to realize customized being-there and method thereof |
| US11930348B2 (en) | 2020-11-24 | 2024-03-12 | Naver Corporation | Computer system for realizing customized being-there in association with audio and method thereof |
| EP4020998A1 (en) | 2020-12-23 | 2022-06-29 | Axis AB | Encoding of modified video |
| US12212819B2 (en) * | 2021-04-22 | 2025-01-28 | Cisco Technology, Inc. | In-band metadata for authenticity and role-based access in enterprise video streaming services |
| JP7663418B2 (en) * | 2021-06-09 | 2025-04-16 | 日本放送協会 | Audio metadata processing device and program |
| US20250087225A1 (en) * | 2023-09-13 | 2025-03-13 | Apple Inc. | Transcoding Audio Frames and Converting Metadata Frames based on a Target Encoder |
| US20250087222A1 (en) * | 2023-09-13 | 2025-03-13 | Apple Inc. | Decoding Audio Frames and Converted Metadata Frames from a Target Encoder |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1589468A (en) * | 2001-11-17 | 2005-03-02 | 汤姆森许可贸易公司 | Method and apparatus for determining whether a data frame contains encoded additional data |
| CN1717944A (en) * | 2002-10-28 | 2006-01-04 | 高通股份有限公司 | Apparatus and method for cascade-free vocoder operation between incompatible communication systems by extracting TFO information within or between systems |
| CN101189662A (en) * | 2005-05-31 | 2008-05-28 | 微软公司 | Sub-band speech codec with multilevel codebook and redundant coding |
| CN102576559A (en) * | 2009-08-07 | 2012-07-11 | 杜比国际公司 | Authentication of data streams |
Family Cites Families (53)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5577209A (en) * | 1991-07-11 | 1996-11-19 | Itt Corporation | Apparatus and method for providing multi-level security for communication among computers and terminals on a network |
| US5646997A (en) * | 1994-12-14 | 1997-07-08 | Barton; James M. | Method and apparatus for embedding authentication information within digital data |
| JPH08297638A (en) * | 1995-04-26 | 1996-11-12 | Nippon Telegr & Teleph Corp <Ntt> | User authentication method |
| JPH103256A (en) * | 1995-10-16 | 1998-01-06 | Sony Corp | Encryption method, encryption device, recording method, decryption method, decryption device, and recording medium |
| US7962413B2 (en) * | 1998-08-13 | 2011-06-14 | International Business Machines Corporation | End-user system of preventing unauthorized rerecording of multimedia content |
| US6446037B1 (en) * | 1999-08-09 | 2002-09-03 | Dolby Laboratories Licensing Corporation | Scalable coding method for high quality audio |
| US6856331B2 (en) * | 1999-11-12 | 2005-02-15 | International Business Machines Corporation | System and method of enriching non-linkable media representations in a network by enabling an overlying hotlink canvas |
| JP2002184114A (en) | 2000-12-11 | 2002-06-28 | Toshiba Corp | Music data recording / reproducing system and music data storage medium |
| US6944770B2 (en) * | 2001-05-17 | 2005-09-13 | Intelli-Mark Technologies, Inc. | Methods and systems for generating and validating value-bearing documents |
| AUPR960601A0 (en) | 2001-12-18 | 2002-01-24 | Canon Kabushiki Kaisha | Image protection |
| AU2002318848B2 (en) * | 2001-12-18 | 2004-12-09 | Canon Kabushiki Kaisha | Image Protection |
| AUPR970301A0 (en) | 2001-12-21 | 2002-01-24 | Canon Kabushiki Kaisha | Content authentication for digital media based recording devices |
| EP1349080A1 (en) * | 2002-03-26 | 2003-10-01 | Deutsche Thomson-Brandt Gmbh | Methods and apparatus for using metadata from different sources |
| GB2387287B (en) | 2002-04-05 | 2006-03-15 | Snell & Wilcox Limited | Video compression transcoding |
| US7167560B2 (en) * | 2002-08-08 | 2007-01-23 | Matsushita Electric Industrial Co., Ltd. | Partial encryption of stream-formatted media |
| US7827312B2 (en) * | 2002-12-27 | 2010-11-02 | The Nielsen Company (Us), Llc | Methods and apparatus for transcoding metadata |
| WO2004073178A2 (en) * | 2003-02-06 | 2004-08-26 | Dolby Laboratories Licensing Corporation | Continuous backup audio |
| KR100869630B1 (en) * | 2003-04-25 | 2008-11-21 | 애플 인크. | Methods and system for secure network-based distribution of content |
| US20040267388A1 (en) * | 2003-06-26 | 2004-12-30 | Predictive Media Corporation | Method and system for recording and processing of broadcast signals |
| KR100526189B1 (en) | 2004-02-14 | 2005-11-03 | 삼성전자주식회사 | Transcoding system and method for keeping timing parameters constant after transcoding |
| US8131134B2 (en) * | 2004-04-14 | 2012-03-06 | Microsoft Corporation | Digital media universal elementary stream |
| US20060005031A1 (en) * | 2004-06-15 | 2006-01-05 | Apostolopoulos John G | Methods and systems for utilizing a single cryptographic integrity check to generate multiple cryptographic integrity check values for components of transcodable content |
| US7672743B2 (en) * | 2005-04-25 | 2010-03-02 | Microsoft Corporation | Digital audio processing |
| US20060259781A1 (en) * | 2005-04-29 | 2006-11-16 | Sony Corporation/Sony Electronics Inc. | Method and apparatus for detecting the falsification of metadata |
| JP4582411B2 (en) * | 2005-08-04 | 2010-11-17 | ソニー株式会社 | Information processing apparatus and method, and program |
| KR20050092688A (en) * | 2005-08-31 | 2005-09-22 | 한국정보통신대학교 산학협력단 | Integrated multimedia file format structure, its based multimedia service offer system and method |
| JP4883342B2 (en) * | 2005-09-06 | 2012-02-22 | ソニー株式会社 | Information processing apparatus and method, and program |
| US7913277B1 (en) | 2006-03-30 | 2011-03-22 | Nortel Networks Limited | Metadata extraction and re-insertion and improved transcoding in digital media systems |
| KR100781528B1 (en) * | 2006-06-29 | 2007-12-03 | 삼성전자주식회사 | Apparatus and method for providing a video stream guaranteeing integrity |
| US8107786B2 (en) * | 2006-08-31 | 2012-01-31 | Open Tv, Inc. | Systems and methods to modify playout or playback |
| BRPI0622046B1 (en) * | 2006-09-30 | 2020-01-21 | Interdigital Vc Holdings Inc | method and device for encoding and decoding color enhancement layer for video |
| BRPI0718629A2 (en) * | 2006-11-14 | 2013-11-26 | Qualcomm Inc | CHANNEL SWITCHING SYSTEM AND METHODS. |
| EP2175445A3 (en) * | 2007-04-17 | 2010-05-19 | Panasonic Corporation | Communication system |
| KR101432994B1 (en) * | 2007-08-13 | 2014-08-22 | 삼성전자주식회사 | METHOD FOR CREATING MEDIA OBJECT-BASED METADATA, REPRODUCTION METHOD, |
| US8457958B2 (en) | 2007-11-09 | 2013-06-04 | Microsoft Corporation | Audio transcoder using encoder-generated side information to transcode to target bit-rate |
| CN101547345A (en) * | 2008-03-25 | 2009-09-30 | 华为技术有限公司 | Method and system for processing streaming media transcoding |
| JP5049207B2 (en) | 2008-06-23 | 2012-10-17 | ヤフー株式会社 | Place name display mode adjustment method, place name display mode adjustment device, and place name display mode adjustment system |
| US8798776B2 (en) | 2008-09-30 | 2014-08-05 | Dolby International Ab | Transcoding of audio metadata |
| US7738454B1 (en) * | 2008-09-30 | 2010-06-15 | Juniper Networks, Inc. | Methods and apparatus related to packet classification based on range values |
| US8904184B2 (en) * | 2008-12-10 | 2014-12-02 | At&T Intellectual Property I, Lp | Certification of authenticity of media signals |
| US8676822B2 (en) * | 2009-02-06 | 2014-03-18 | Disney Enterprises, Inc. | System and method for quality assured media file storage |
| US8687648B2 (en) | 2009-07-17 | 2014-04-01 | Qualcomm Incorporated | Wireless transmission of data using an available channel of a spectrum |
| WO2011035136A1 (en) * | 2009-09-18 | 2011-03-24 | Telesocial, Inc. | Telecommunication service employing an electronic information repository storing social network user, developer, and mobile network operator information |
| TWI443646B (en) * | 2010-02-18 | 2014-07-01 | Dolby Lab Licensing Corp | Audio decoder and decoding method using efficient downmixing |
| US9613142B2 (en) | 2010-04-26 | 2017-04-04 | Flash Networks Ltd | Method and system for providing the download of transcoded files |
| WO2011145353A1 (en) * | 2010-05-19 | 2011-11-24 | 三洋電機株式会社 | Base station |
| US8407808B2 (en) * | 2010-05-27 | 2013-03-26 | Media Rights Technologies, Inc. | Security thread for protecting media content |
| US9549197B2 (en) | 2010-08-16 | 2017-01-17 | Dolby Laboratories Licensing Corporation | Visual dynamic range timestamp to enhance data coherency and potential of metadata using delay information |
| US8521541B2 (en) | 2010-11-02 | 2013-08-27 | Google Inc. | Adaptive audio transcoding |
| TWI665659B (en) * | 2010-12-03 | 2019-07-11 | 美商杜比實驗室特許公司 | Audio decoding device, audio decoding method, and audio encoding method |
| US20120185693A1 (en) * | 2011-01-05 | 2012-07-19 | General Instrument Corporation | Secure progressive download for media content playback |
| US9761229B2 (en) * | 2012-07-20 | 2017-09-12 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for audio object clustering |
| US9560392B2 (en) * | 2012-09-07 | 2017-01-31 | Google Inc. | Dynamic bit rate encoding |
-
2014
- 2014-01-15 CN CN201810949078.6A patent/CN109102815B/en active Active
- 2014-01-15 CN CN202311104445.XA patent/CN117219100B/en active Active
- 2014-01-15 EP EP14702396.4A patent/EP2946495B1/en active Active
- 2014-01-15 KR KR1020157017689A patent/KR101717928B1/en active Active
- 2014-01-15 US US14/761,892 patent/US9755835B2/en active Active
- 2014-01-15 JP JP2015553802A patent/JP6242916B2/en active Active
- 2014-01-15 ES ES14702396.4T patent/ES2629195T3/en active Active
- 2014-01-15 WO PCT/US2014/011695 patent/WO2014113478A1/en not_active Ceased
- 2014-01-15 IN IN1633MUN2015 patent/IN2015MN01633A/en unknown
- 2014-01-15 BR BR122020018263-6A patent/BR122020018263B1/en active IP Right Grant
- 2014-01-15 CN CN201480005287.5A patent/CN105027478B/en active Active
- 2014-01-15 BR BR112015017048-0A patent/BR112015017048B1/en active IP Right Grant
- 2014-01-15 RU RU2015129751/08A patent/RU2602332C1/en active
-
2017
- 2017-08-24 JP JP2017161132A patent/JP6505791B2/en active Active
- 2017-09-02 US US15/694,787 patent/US10554415B2/en active Active
-
2019
- 2019-12-17 US US16/717,441 patent/US11075762B2/en active Active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1589468A (en) * | 2001-11-17 | 2005-03-02 | 汤姆森许可贸易公司 | Method and apparatus for determining whether a data frame contains encoded additional data |
| CN1717944A (en) * | 2002-10-28 | 2006-01-04 | 高通股份有限公司 | Apparatus and method for cascade-free vocoder operation between incompatible communication systems by extracting TFO information within or between systems |
| CN101189662A (en) * | 2005-05-31 | 2008-05-28 | 微软公司 | Sub-band speech codec with multilevel codebook and redundant coding |
| CN102576559A (en) * | 2009-08-07 | 2012-07-11 | 杜比国际公司 | Authentication of data streams |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109102815A (en) | 2018-12-28 |
| JP2016510544A (en) | 2016-04-07 |
| CN105027478B (en) | 2018-09-21 |
| CN105027478A (en) | 2015-11-04 |
| CN117219100A (en) | 2023-12-12 |
| BR122020018263B1 (en) | 2022-09-27 |
| JP2017229087A (en) | 2017-12-28 |
| EP2946495B1 (en) | 2017-05-17 |
| US20200127846A1 (en) | 2020-04-23 |
| WO2014113478A1 (en) | 2014-07-24 |
| JP6242916B2 (en) | 2017-12-06 |
| US20150372820A1 (en) | 2015-12-24 |
| CN117219100B (en) | 2025-02-25 |
| BR112015017048B1 (en) | 2022-12-13 |
| US11075762B2 (en) | 2021-07-27 |
| KR101717928B1 (en) | 2017-04-04 |
| US20170373857A1 (en) | 2017-12-28 |
| ES2629195T3 (en) | 2017-08-07 |
| US10554415B2 (en) | 2020-02-04 |
| IN2015MN01633A (en) | 2015-08-28 |
| BR112015017048A2 (en) | 2017-07-11 |
| RU2602332C1 (en) | 2016-11-20 |
| EP2946495A1 (en) | 2015-11-25 |
| JP6505791B2 (en) | 2019-04-24 |
| US9755835B2 (en) | 2017-09-05 |
| KR20150105955A (en) | 2015-09-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN109102815B (en) | Encoding device and method, transcoding method and transcoder, and non-transitory medium | |
| RU2509424C2 (en) | Data stream authentication | |
| TW202032537A (en) | Audio decoding device, audio decoding method, and audio encoding method | |
| HK40099492A (en) | System and method for processing encoded audio bitstream, computer readable medium | |
| HK40002417B (en) | Encoding apparatus, encoding method, transcoding method, transcoder, and non-transitory media | |
| HK40002417A (en) | Encoding apparatus, encoding method, transcoding method, transcoder, and non-transitory media | |
| US20260006268A1 (en) | Multisource methods and systems for coded media | |
| HK1168461B (en) | Authentication of data streams |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40002417 Country of ref document: HK |
|
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| TG01 | Patent term adjustment | ||
| TG01 | Patent term adjustment |