US20200213625A1 - Recovery From Packet Loss During Transmission Of Compressed Video Streams - Google Patents
Recovery From Packet Loss During Transmission Of Compressed Video Streams Download PDFInfo
- Publication number
- US20200213625A1 US20200213625A1 US16/812,185 US202016812185A US2020213625A1 US 20200213625 A1 US20200213625 A1 US 20200213625A1 US 202016812185 A US202016812185 A US 202016812185A US 2020213625 A1 US2020213625 A1 US 2020213625A1
- Authority
- US
- United States
- Prior art keywords
- frame
- server
- slices
- slice
- client device
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000005540 biological transmission Effects 0.000 title description 15
- 238000011084 recovery Methods 0.000 title description 6
- 238000000034 method Methods 0.000 claims abstract description 30
- 238000010586 diagram Methods 0.000 description 15
- 230000008569 process Effects 0.000 description 8
- 239000013598 vector Substances 0.000 description 5
- 102100037812 Medium-wave-sensitive opsin 1 Human genes 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000009877 rendering Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000036461 convulsion Effects 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000000116 mitigating effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/89—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
- H04N19/895—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder in combination with error concealment
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/164—Feedback from the receiver or from the transmission channel
- H04N19/166—Feedback from the receiver or from the transmission channel concerning the amount of transmission errors, e.g. bit error rate [BER]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/174—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/65—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
Definitions
- the present disclosure relates generally to transmission of compressed video over computer networks; more specifically, to methods and apparatus for mitigating the effects of packet loss which occur when one or more packets of digital data travelling across a computer network fail to reach their destination intact.
- Remote hosting of online, fast-action, interactive video games and other high-end video applications typically requires very low latencies. For example, for twitch video games and applications, low round-trip latency, as measured from the time a user's control input is sent to the hosting service center to the time that the newly generated video content appears on the screen of the user's client device, is typically required. At higher latencies, performance suffers noticeably. Achieving such low latencies over the Internet or other similar networks requires the video compressor at the hosting service to generate a packet stream with particular characteristics such that the packet sequence flowing through the entire path from the hosting service to the client device is not subject to delays or excessive packet loss. In addition, the video compressor must create a packet stream which is sufficiently robust so that it can tolerate the inevitable packet loss and packet reordering that occurs in normal Internet and network transmissions.
- lost or dropped packets can result in highly noticeable performance issues, potentially causing the screen to completely freeze for a period of time or show other screen-wide visual artifacts (e.g., jitter).
- a lost/delayed packet causes the loss of a key frame (i.e., I-frame)
- the decompressor on the client device will lack a reference for all of the P-frames that follow until a new I-frame is received.
- a P frame is lost, that will impact the P-frames that follow.
- this can have a significant visual impact.
- I-frames are the only type of frame that is not coded with reference to any other frame.
- I-frames are coded predicatively from a previous I-frame or P-frame; B-frames are coded predicatively from I-frames and P-frames.
- a B-frame associated with a group of pictures (“GOPs”) may need to reference the I-frame of a next GOP.
- I-frame is intended to broadly refer to an Inter-frame and its equivalents, e.g., an IDR frame in the case of H.264.
- FIG. 1 is an example network diagram illustrating one embodiment for effectively dealing with packet loss.
- FIG. 2 is a flow diagram illustrating an example method for dealing with packet loss.
- FIG. 3 is another example network diagram illustrating an embodiment for handling packet loss.
- FIG. 4 is yet another example network diagram illustrating an embodiment for handling packet loss.
- FIG. 5 is a flow diagram illustrating another example process flow for handling packet loss.
- FIG. 6 is still another example network diagram illustrating an embodiment for handling packet loss.
- FIG. 7 is a flow diagram illustrating an example process flow for dealing with packet loss.
- a video “encoder” broadly refers to a device, circuit or algorithm (embodied in hardware or software) that compresses (i.e., encodes) video data using fewer bits/bytes to reduce the size of the original video data.
- Data compression is also frequently referred to as source coding, i.e., coding of data performed at the source before it is either transmitted or stored.
- a video “decoder” or decompressor is a device, circuit or algorithm which performs the reverse operation of an encoder, undoing the encoding to retrieve the original (decompressed) video data.
- server broadly refers to any combination of hardware or software embodied in a computer (i.e., a processor) designed to provide services to client devices or processes.
- a server therefore can refer to one or more computer processors that run a server operating system from computer-executable code stored in a memory, and which is provided to the user as virtualized or non-virtualized server; it can also refer to any software or dedicated hardware capable of providing computing services.
- a “client device” refers a computer device such as a PC, desktop computer, tablet, mobile, handheld, set-top box, or any other general purpose computer (e.g., Microsoft Windows- or Linux-based PCs or Apple, Inc. Macintosh computers) having a wired or wireless connection to a public network such as the Internet, and which further includes the ability to decompress/decode compressed packet data received over a network connection.
- the client device may include either an internal or external display device for displaying one of the many digital images (compressed or uncompressed) which comprise a movie or video (i.e., a live or moving picture).
- a video “frame” refers one of the many digital images (compressed or uncompressed) which comprise a movie or video (i.e., a live or moving picture).
- a movie or video i.e., a live or moving picture.
- each frame of the moving picture is flashed on a screen for a short time (nowadays, usually 1/24, 1/25, 1/30 or 1/60 of a second) and then immediately replaced by the next one.
- the human attribute of persistence of vision blends the frames together, such that a view perceives a live, or real-time moving picture.
- a frame can be divided up into regions of an image, which are commonly referred to as “tiles” or “slices.” For instance, in the H.264/AVC standard a frame can be composed of a single slice or multiple slices.
- packet loss refers broadly to the occurrence of when one or more packets travelling across a computer network fail to reach their destination, or when one or more packets transmitted over a network arrive at their destination with errors.
- FIG. 1 is an example network diagram illustrating one embodiment for effectively dealing with packet loss.
- a plurality of video frames 11 is compressed (coded) by an encoder 12 to produce a primary stream of compressed video data as well as a subset stream containing key minimal data. Key data would differ among embodiments and would depend on specific application.
- the additional stream may be transmitted at considerably lower bitrates than the normal stream.
- the subset stream, or sub-stream is shown by arrow 13 .
- the primary and subset streams are packetized by packetize devices 14 a and 14 b , respectively, before being transmitted, substantially simultaneously, to the client over network 16 .
- the hosting service not only sends the normal video stream, but also a subset of that stream in separate network packets.
- subset stream contains only motion vectors.
- subset stream contains motion vectors and residuals.
- Motion vectors represent the spatial displacement or “delta” between two successive image areas (e.g., frame-to-frame).
- information that leads to one possible encoding i.e., motion vectors/residuals
- the decoder may utilize the motion vectors to construct the frame as best as possible.
- the additional sub-stream may be transmitted at considerably lower bit rates than the normal or primary stream. This is represented in FIG. 1 by the smaller-sized sub-stream packets 15 b shown being transmitted over network 16 , as compared with the primary packet stream 15 a.
- switching device 17 On the client side, both the primary stream and the sub-stream are received at switching device 17 , which may comprise a router, switch, or other network switching device that may be used to select between the primary and sub-stream.
- Switching device 17 normally selects the primary stream, which is then fed into packet reconstruction device 18 .
- the reconstructed packets are then decoded by the decoder 19 to produce the reconstructed video frames 20 for display on the client device.
- switching device 17 switches from the primary stream to the sub-stream.
- the sub-stream information is used by decoder 19 to make a prediction of, or otherwise reconstruct, the desired frame. Afterwards, switching device 17 switches back to the normal or primary packet stream.
- FIG. 2 is a flow diagram illustrating an example method for dealing with packet loss in correspondence with the network diagram of FIG. 1 .
- the process may begin at block 24 with the arrival of a network packet, followed by the client-side device detecting whether the primary packet stream is corrupt. If it is not corrupt, the primary network packet stream continues to be selected (block 24 ) for subsequent decoding and rendering of reconstructed video frames on the client display device. On the other hand, if the primary stream is corrupted by a dropped or lost packet then the subset network packet stream is selected (block 22 ). The subset data are decoded and used to reconstruct the desired frame. This process may continue as long as the primary stream remains corrupt.
- decision block 23 once the primary stream is recovered or is no longer corrupted (block 23 ) the primary network packet stream is once again selected and the normal packets are decoded, as described above.
- FIG. 3 is another example network diagram illustrating an embodiment for handling packet loss that is similar to that shown in FIG. 1 .
- video frames 31 are encoded as desired by encoder 32 a to produce a primary or normal video stream 33 a .
- a separate encoding generates a less ideal stream 33 b with each frame based on the previous frame of the primary stream but with the quality scaled such that a slice or frame neatly fits into a predetermined set of network packets, which could be a single packet.
- the normal video stream is packetized by packetizer 34 a and transmitted over network 35 .
- the secondary stream is also packetized for transmission over network 35 . It is appreciated that the bandwidth of the secondary stream may be much lower as compared to the normal video stream.
- the encoded primary stream frames are reconstructed by a device 36 to replicate video stream 33 a .
- a switching device 37 is utilized to select between the normal video stream and the secondary video stream. Whichever stream is selected, the received packets are then decoded by a decoder 39 to generate reproduced video frames 40 on the display of the client device.
- the secondary stream is selected by switching device 37 .
- the video stream switching device 37 switches back the primary or normal video stream.
- the client device may notify the server-side when it switches over to the secondary video stream.
- a processor associated with the client device utilizes the secondary video stream to construct a lower-quality, yet accurate, representation of the video frames.
- FIG. 4 is yet another example network diagram illustrating an embodiment for handling packet loss.
- the server-side encoder 41 stores a copy of the encoded bits per slice/frame in an associated memory 42 (e.g., RAM, disk, etc.). These encoded frames may then be retrieved and individually decoded by decoder 43 for reconstruction of the client-side state. In this manner, encoder 41 can continue to feed encoded slices/frames to the client-side decoder 45 , which is coupled to its own associated memory 46 , even after packet loss.
- decoder 45 or a processor coupled with decoder 45
- encoder 41 can utilize decoder 43 and the stored encoded bits/frames to determine exactly what happened at decoder 45 on the client side. In this way, the server can keep itself aware of client state even when the client has received erroneous transmissions.
- encoder 41 has encoded frames F 1 -F 7 and sent them over a transmission network to the client device.
- the first frame, F 1 may be an I-frame, followed by a sequence of P-frames that are calculated predictively based on a difference or delta (A) from the previous I-frame or P-frame.
- the second frame, F 2 is decoded as a delta from F 1
- the third frame, F 3 is decoded as a delta from F 2 , and so on.
- the third frame, F 3 is lost or corrupted.
- a notification is sent to encoder 41 via feedback loop 47 , notifying encoder 41 that the last good frame received was F 2 .
- server-side encoder 41 generates the eighth frame, F 8 , predictively as a delta from the last good frame, F 2 .
- encoder 41 constructs F 8 from F 2 , taking into account all of the client errors resulting on the client-side due to the loss of F 3 , and the subsequent frames. Utilizing decoder 43 and the stored encoded bits in memory 42 , encoder determines exactly what each of the packets subsequent to the lost packet (e.g., F 4 -F 8 ) would look like, taking into consideration the client-side errors that have occurred.
- FIG. 5 is a flow diagram illustrating an example process flow wherein the server-side encoder keeps encoding frames for reconstruction of the client state following packet loss.
- the next frame e.g., P-frame
- the server-side encoder sends the encoded bits to the client device and also stores a copy of these same bits in an associated memory.
- the server-side encoder queries whether a notification of packet loss has been received from the client-side decoder. If not, the encoder continues encoding frames based on the previous client state. (Block 51 )
- the server-side calculates the client state from the last known good state (before packet loss) and the decode data that the client received correctly, plus the decode data that followed the correctly received data. (Block 54 ) This later decode data comprises the client errors resulting from the packet loss. The process then continues at block 51 , with the encoder coding the next frame based on the previous client state, with the previous client state now being that calculated from block 54 .
- FIG. 6 is still another example network diagram illustrating a slice-based recovery technique for overcoming packet loss.
- This embodiment may be used for video frames 61 that are divided into two or more slices.
- a frame slicer 62 is shown dividing a video frame into four slices, which are then encoded by server-side encoder 63 and transmitted to the client-side decoder 64 .
- Decoder 64 generates reconstructed video frames 69 for display on the client device.
- FIG. 6 six frames, each having four slices, are shown transmitted by encoder 63 .
- the first frame comprises four I-slices
- the second frame comprises four P-slices
- the third frame comprises four P-slices, and so on.
- decoder 64 is shown receiving frames 1 - 3 without incident.
- frame 4 is shown being received with the data for the third slice, denoted by reference numeral 65 , having been lost during network transmission.
- a notification is sent back to encoder 63 via feedback channel or loop 67 .
- the notification is received by encoder 63 immediately following transmission of frame 5 , which comprises four P-slices. Responsive to the notification of lost data in the third slice, encoder encodes the next frame (frame 6 ) with slice 3 as an I-slice, as denoted by reference numeral 68 .
- the embodiment of FIG. 6 thus performs frame repair by repairing individual slices for which data has been lost.
- Practitioners in the art will appreciate that this embodiment has the advantage of avoiding the standard practice of insuring that a video stream contains a certain density of I-frames, which are very large and costly to transmit. Instead of sending an I-frame say, every two seconds (as in the case of DVD or on-demand video transmissions that lack feedback) the embodiment of FIG. 6 relies upon P-slice transmission for frames subsequent to the initial frame, and then implements slice-based recovery by transmitting an I-slice at the slice position where lost slice data was detected at the client device.
- a single network packet should not contain data for more than a single slice.
- FIG. 7 is a flow diagram illustrating an example process flow for slice-based recovery from lost data packets.
- video frames are divided into N, where N is an integer>1, slices.
- the hosting service queries whether it has received a notification from the client device indicative of a lost data slice. If no such notification has been received, it proceeds to encode each of the slices in the frame as P-slices (block 74 ), which are then transmitted to the client device (block 75 ). If, on the other hand, the hosting service has received a notification that the data of a particular slice has been lost during network transmission, the corresponding slice in the current (i.e., next) frame is encoded as an I-slice. (Block 73 ) That I-slice is then transmitted over the network to the client device. (Block 74 ).
- incoming packets/slices may be checked for data integrity.
- Block 76 If data is lost for a particular slice, the server at the hosting service center is immediately notified.
- Block 77 If no data is detected lost, then each slice received is decoded (block 78 ) in order to reconstruct the video frames for rendering on the client-side display device.
- elements of the disclosed subject matter may also be provided as a computer program product which may include a machine-readable medium having stored thereon instructions or code which may be used to program a computer (e.g., a processor or other electronic device) to perform a sequence of operations. Alternatively, the operations may be performed by a combination of hardware and software.
- the machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, magnet or optical cards, or other type of machine-readable medium suitable for storing electronic instructions.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
- This application is a Divisional of U.S. application Ser. No. 15/620,808, filed on Jun. 12, 2017, entitled “Recovery From Packet Loss During Transmission Of Compressed Video Streams”, which is a further Divisional of U.S. application Ser. No. 13/837,541 filed Mar. 15, 2013 (U.S. Pat. No. 9,681,155, issued on Jun. 13, 2017), entitled “Recovery From Packet Loss During Transmission Of Compressed Video Streams”, which are herein incorporated by reference.
- The present disclosure relates generally to transmission of compressed video over computer networks; more specifically, to methods and apparatus for mitigating the effects of packet loss which occur when one or more packets of digital data travelling across a computer network fail to reach their destination intact.
- Remote hosting of online, fast-action, interactive video games and other high-end video applications typically requires very low latencies. For example, for twitch video games and applications, low round-trip latency, as measured from the time a user's control input is sent to the hosting service center to the time that the newly generated video content appears on the screen of the user's client device, is typically required. At higher latencies, performance suffers noticeably. Achieving such low latencies over the Internet or other similar networks requires the video compressor at the hosting service to generate a packet stream with particular characteristics such that the packet sequence flowing through the entire path from the hosting service to the client device is not subject to delays or excessive packet loss. In addition, the video compressor must create a packet stream which is sufficiently robust so that it can tolerate the inevitable packet loss and packet reordering that occurs in normal Internet and network transmissions.
- In streaming video technologies, lost or dropped packets can result in highly noticeable performance issues, potentially causing the screen to completely freeze for a period of time or show other screen-wide visual artifacts (e.g., jitter). If a lost/delayed packet causes the loss of a key frame (i.e., I-frame), then the decompressor on the client device will lack a reference for all of the P-frames that follow until a new I-frame is received. Similarly, if a P frame is lost, that will impact the P-frames that follow. Depending on how long it will be before an I-frame appears, this can have a significant visual impact. (As is well-known, I-frames are the only type of frame that is not coded with reference to any other frame. P-frames are coded predicatively from a previous I-frame or P-frame; B-frames are coded predicatively from I-frames and P-frames. In order to be properly decoded, a B-frame associated with a group of pictures (“GOPs”) may need to reference the I-frame of a next GOP. In the context of the present disclosure, the term “I-frame” is intended to broadly refer to an Inter-frame and its equivalents, e.g., an IDR frame in the case of H.264.)
- A variety of mechanisms have been developed for handling packet loss. For instance, when packet loss occurs in network transport protocols such as Transmission Control Protocol (TCP), any segments that have not been acknowledged are simply resent. But the problem with such approaches is that they are often unfeasible or impractical in streaming video technologies where it is essential to maintain high data rates and throughput.
- Non-limiting and non-exhaustive embodiments of the present invention are described with reference to the following figures, wherein like reference numerals refer to like parts throughout the various views unless otherwise specified.
-
FIG. 1 is an example network diagram illustrating one embodiment for effectively dealing with packet loss. -
FIG. 2 is a flow diagram illustrating an example method for dealing with packet loss. -
FIG. 3 is another example network diagram illustrating an embodiment for handling packet loss. -
FIG. 4 is yet another example network diagram illustrating an embodiment for handling packet loss. -
FIG. 5 is a flow diagram illustrating another example process flow for handling packet loss. -
FIG. 6 is still another example network diagram illustrating an embodiment for handling packet loss. -
FIG. 7 is a flow diagram illustrating an example process flow for dealing with packet loss. - In the following description, numerous specific details are set forth in order to provide a thorough understanding of the embodiments described. It will be apparent, however, to one having ordinary skill in the art that the specific details may not be needed to practice the embodiments described. In other instances, well-known apparatus or methods have not been described in detail in order to avoid obscuring the embodiments disclosed.
- Reference throughout this specification to “one embodiment”, “an embodiment”, “one example” or “an example” means that a particular feature, structure or characteristic described in connection with the embodiment or example is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment”, “in an embodiment”, “one example” or “an example” in various places throughout this specification are not necessarily all referring to the same embodiment or example. Furthermore, the particular features, structures or characteristics may be combined in any suitable combinations and/or sub-combinations in one or more embodiments or examples. In addition, it is appreciated that the figures provided herewith are for explanation purposes to persons ordinarily skilled in the art.
- In the context of the present disclosure, a video “encoder” broadly refers to a device, circuit or algorithm (embodied in hardware or software) that compresses (i.e., encodes) video data using fewer bits/bytes to reduce the size of the original video data. Data compression is also frequently referred to as source coding, i.e., coding of data performed at the source before it is either transmitted or stored. Conversely, a video “decoder” or decompressor is a device, circuit or algorithm which performs the reverse operation of an encoder, undoing the encoding to retrieve the original (decompressed) video data.
- The term “server” broadly refers to any combination of hardware or software embodied in a computer (i.e., a processor) designed to provide services to client devices or processes. A server therefore can refer to one or more computer processors that run a server operating system from computer-executable code stored in a memory, and which is provided to the user as virtualized or non-virtualized server; it can also refer to any software or dedicated hardware capable of providing computing services.
- A “client device” refers a computer device such as a PC, desktop computer, tablet, mobile, handheld, set-top box, or any other general purpose computer (e.g., Microsoft Windows- or Linux-based PCs or Apple, Inc. Macintosh computers) having a wired or wireless connection to a public network such as the Internet, and which further includes the ability to decompress/decode compressed packet data received over a network connection. The client device may include either an internal or external display device for displaying one of the many digital images (compressed or uncompressed) which comprise a movie or video (i.e., a live or moving picture).
- A video “frame” refers one of the many digital images (compressed or uncompressed) which comprise a movie or video (i.e., a live or moving picture). When video is displayed, each frame of the moving picture is flashed on a screen for a short time (nowadays, usually 1/24, 1/25, 1/30 or 1/60 of a second) and then immediately replaced by the next one. The human attribute of persistence of vision blends the frames together, such that a view perceives a live, or real-time moving picture. A frame can be divided up into regions of an image, which are commonly referred to as “tiles” or “slices.” For instance, in the H.264/AVC standard a frame can be composed of a single slice or multiple slices.
- In the context of the present disclosure, the term “packet loss” refers broadly to the occurrence of when one or more packets travelling across a computer network fail to reach their destination, or when one or more packets transmitted over a network arrive at their destination with errors.
-
FIG. 1 is an example network diagram illustrating one embodiment for effectively dealing with packet loss. As shown, a plurality ofvideo frames 11 is compressed (coded) by anencoder 12 to produce a primary stream of compressed video data as well as a subset stream containing key minimal data. Key data would differ among embodiments and would depend on specific application. The additional stream may be transmitted at considerably lower bitrates than the normal stream. The subset stream, or sub-stream, is shown byarrow 13. After encoding, the primary and subset streams are packetized by 14 a and 14 b, respectively, before being transmitted, substantially simultaneously, to the client overpacketize devices network 16. Thus, for every slice, the hosting service not only sends the normal video stream, but also a subset of that stream in separate network packets. - In one embodiment, the subset stream contains only motion vectors. In another embodiment, subset stream contains motion vectors and residuals. Motion vectors represent the spatial displacement or “delta” between two successive image areas (e.g., frame-to-frame).
- In the embodiment shown, information that leads to one possible encoding (i.e., motion vectors/residuals) is separated from the actual encoding, and then sent downstream so that the decoder may make best use of it as can in the event that packet loss occurs. In other words, the decoder may utilize the motion vectors to construct the frame as best as possible. Practitioners in the art will appreciate that the additional sub-stream may be transmitted at considerably lower bit rates than the normal or primary stream. This is represented in
FIG. 1 by the smaller-sizedsub-stream packets 15 b shown being transmitted overnetwork 16, as compared with theprimary packet stream 15 a. - On the client side, both the primary stream and the sub-stream are received at switching
device 17, which may comprise a router, switch, or other network switching device that may be used to select between the primary and sub-stream.Switching device 17 normally selects the primary stream, which is then fed intopacket reconstruction device 18. The reconstructed packets are then decoded by thedecoder 19 to produce the reconstructed video frames 20 for display on the client device. In the event that packet loss is detected, switchingdevice 17 switches from the primary stream to the sub-stream. The sub-stream information is used bydecoder 19 to make a prediction of, or otherwise reconstruct, the desired frame. Afterwards, switchingdevice 17 switches back to the normal or primary packet stream. -
FIG. 2 is a flow diagram illustrating an example method for dealing with packet loss in correspondence with the network diagram ofFIG. 1 . The process may begin atblock 24 with the arrival of a network packet, followed by the client-side device detecting whether the primary packet stream is corrupt. If it is not corrupt, the primary network packet stream continues to be selected (block 24) for subsequent decoding and rendering of reconstructed video frames on the client display device. On the other hand, if the primary stream is corrupted by a dropped or lost packet then the subset network packet stream is selected (block 22). The subset data are decoded and used to reconstruct the desired frame. This process may continue as long as the primary stream remains corrupt. Atdecision block 23, once the primary stream is recovered or is no longer corrupted (block 23) the primary network packet stream is once again selected and the normal packets are decoded, as described above. -
FIG. 3 is another example network diagram illustrating an embodiment for handling packet loss that is similar to that shown inFIG. 1 . In this example, video frames 31 are encoded as desired byencoder 32 a to produce a primary or normal video stream 33 a. In addition, a separate encoding generates a less ideal stream 33 b with each frame based on the previous frame of the primary stream but with the quality scaled such that a slice or frame neatly fits into a predetermined set of network packets, which could be a single packet. The normal video stream is packetized bypacketizer 34 a and transmitted overnetwork 35. The secondary stream is also packetized for transmission overnetwork 35. It is appreciated that the bandwidth of the secondary stream may be much lower as compared to the normal video stream. - At the client-side device, the encoded primary stream frames are reconstructed by a
device 36 to replicate video stream 33 a. A switchingdevice 37 is utilized to select between the normal video stream and the secondary video stream. Whichever stream is selected, the received packets are then decoded by adecoder 39 to generate reproduced video frames 40 on the display of the client device. As in the previous embodiment, if a packet loss is detected on the client-side device, the secondary stream is selected by switchingdevice 37. When the primary stream transmission recovers, the videostream switching device 37 switches back the primary or normal video stream. - It is appreciated by practitioners in the art that in order to maintain synchronicity between the client-side decoder and the server-side decoder (not shown), the client device may notify the server-side when it switches over to the secondary video stream. When lost or corrupted data packets have been detected and switching
device 37 has selected the secondary stream, a processor associated with the client device utilizes the secondary video stream to construct a lower-quality, yet accurate, representation of the video frames. -
FIG. 4 is yet another example network diagram illustrating an embodiment for handling packet loss. In the embodiment shown, the server-side encoder 41 stores a copy of the encoded bits per slice/frame in an associated memory 42 (e.g., RAM, disk, etc.). These encoded frames may then be retrieved and individually decoded bydecoder 43 for reconstruction of the client-side state. In this manner,encoder 41 can continue to feed encoded slices/frames to the client-side decoder 45, which is coupled to its own associatedmemory 46, even after packet loss. When packet loss does occur, decoder 45 (or a processor coupled with decoder 45) sends a notification to encoder 41 (or a processor controlling encoder 41) via feedback channel orloop 47. In response to the notification of packet loss,encoder 41 can utilizedecoder 43 and the stored encoded bits/frames to determine exactly what happened atdecoder 45 on the client side. In this way, the server can keep itself aware of client state even when the client has received erroneous transmissions. - In the example shown in
FIG. 4 ,encoder 41 has encoded frames F1-F7 and sent them over a transmission network to the client device. By way of example, the first frame, F1, may be an I-frame, followed by a sequence of P-frames that are calculated predictively based on a difference or delta (A) from the previous I-frame or P-frame. Thus, the second frame, F2, is decoded as a delta from F1, the third frame, F3, is decoded as a delta from F2, and so on. As shown, the third frame, F3, is lost or corrupted. When this is detected atdecoder 45, a notification is sent to encoder 41 viafeedback loop 47, notifyingencoder 41 that the last good frame received was F2. In response to the notification, server-side encoder 41 generates the eighth frame, F8, predictively as a delta from the last good frame, F2. To do this and maintain state synchronicity between the server and client sides,encoder 41 constructs F8 from F2, taking into account all of the client errors resulting on the client-side due to the loss of F3, and the subsequent frames. Utilizingdecoder 43 and the stored encoded bits inmemory 42, encoder determines exactly what each of the packets subsequent to the lost packet (e.g., F4-F8) would look like, taking into consideration the client-side errors that have occurred. -
FIG. 5 is a flow diagram illustrating an example process flow wherein the server-side encoder keeps encoding frames for reconstruction of the client state following packet loss. With the server-side encoder having transmitted an I-frame, the next frame (e.g., P-frame) is encoded based on the previous client state. (Block 51) Atblock 52, the server-side encoder sends the encoded bits to the client device and also stores a copy of these same bits in an associated memory. Atdecision block 53, the server-side encoder queries whether a notification of packet loss has been received from the client-side decoder. If not, the encoder continues encoding frames based on the previous client state. (Block 51) - On the other hand, if packet loss was detected and a notification received by the encoder, the server-side calculates the client state from the last known good state (before packet loss) and the decode data that the client received correctly, plus the decode data that followed the correctly received data. (Block 54) This later decode data comprises the client errors resulting from the packet loss. The process then continues at
block 51, with the encoder coding the next frame based on the previous client state, with the previous client state now being that calculated fromblock 54. -
FIG. 6 is still another example network diagram illustrating a slice-based recovery technique for overcoming packet loss. This embodiment may be used for video frames 61 that are divided into two or more slices. For instance, in this example aframe slicer 62 is shown dividing a video frame into four slices, which are then encoded by server-side encoder 63 and transmitted to the client-side decoder 64.Decoder 64 generates reconstructed video frames 69 for display on the client device. - In
FIG. 6 , six frames, each having four slices, are shown transmitted byencoder 63. The first frame comprises four I-slices, the second frame comprises four P-slices, the third frame comprises four P-slices, and so on. Over on the client-side,decoder 64 is shown receiving frames 1-3 without incident. However,frame 4 is shown being received with the data for the third slice, denoted byreference numeral 65, having been lost during network transmission. When lost data is detected bydecoder 64, or by a packet loss detection device 66, a notification is sent back toencoder 63 via feedback channel orloop 67. In this example, the notification is received byencoder 63 immediately following transmission offrame 5, which comprises four P-slices. Responsive to the notification of lost data in the third slice, encoder encodes the next frame (frame 6) withslice 3 as an I-slice, as denoted by reference numeral 68. - The embodiment of
FIG. 6 thus performs frame repair by repairing individual slices for which data has been lost. Practitioners in the art will appreciate that this embodiment has the advantage of avoiding the standard practice of insuring that a video stream contains a certain density of I-frames, which are very large and costly to transmit. Instead of sending an I-frame say, every two seconds (as in the case of DVD or on-demand video transmissions that lack feedback) the embodiment ofFIG. 6 relies upon P-slice transmission for frames subsequent to the initial frame, and then implements slice-based recovery by transmitting an I-slice at the slice position where lost slice data was detected at the client device. - Practitioners will further appreciate that for optimal results, a single network packet should not contain data for more than a single slice.
-
FIG. 7 is a flow diagram illustrating an example process flow for slice-based recovery from lost data packets. On the server-side, video frames are divided into N, where N is an integer>1, slices. (Block 71) Atdecision block 72, the hosting service queries whether it has received a notification from the client device indicative of a lost data slice. If no such notification has been received, it proceeds to encode each of the slices in the frame as P-slices (block 74), which are then transmitted to the client device (block 75). If, on the other hand, the hosting service has received a notification that the data of a particular slice has been lost during network transmission, the corresponding slice in the current (i.e., next) frame is encoded as an I-slice. (Block 73) That I-slice is then transmitted over the network to the client device. (Block 74). - On the client side, incoming packets/slices may be checked for data integrity. (Block 76) If data is lost for a particular slice, the server at the hosting service center is immediately notified. (Block 77) If no data is detected lost, then each slice received is decoded (block 78) in order to reconstruct the video frames for rendering on the client-side display device.
- It should be understood that elements of the disclosed subject matter may also be provided as a computer program product which may include a machine-readable medium having stored thereon instructions or code which may be used to program a computer (e.g., a processor or other electronic device) to perform a sequence of operations. Alternatively, the operations may be performed by a combination of hardware and software. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, magnet or optical cards, or other type of machine-readable medium suitable for storing electronic instructions.
- The above description of illustrated example embodiments, including what is described in the Abstract, are not intended to be exhaustive or to be limitation to the precise forms disclosed. While specific embodiments and examples of the subject matter described herein are for illustrative purposes, various equivalent modifications are possible without departing from the broader spirit and scope of the present invention.
Claims (17)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/812,185 US20200213625A1 (en) | 2013-03-15 | 2020-03-06 | Recovery From Packet Loss During Transmission Of Compressed Video Streams |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/837,541 US9681155B2 (en) | 2013-03-15 | 2013-03-15 | Recovery from packet loss during transmission of compressed video streams |
| US15/620,808 US11039174B2 (en) | 2013-03-15 | 2017-06-12 | Recovery from packet loss during transmission of compressed video streams |
| US16/812,185 US20200213625A1 (en) | 2013-03-15 | 2020-03-06 | Recovery From Packet Loss During Transmission Of Compressed Video Streams |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/620,808 Division US11039174B2 (en) | 2013-03-15 | 2017-06-12 | Recovery from packet loss during transmission of compressed video streams |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20200213625A1 true US20200213625A1 (en) | 2020-07-02 |
Family
ID=51526949
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/837,541 Active 2035-06-07 US9681155B2 (en) | 2013-03-15 | 2013-03-15 | Recovery from packet loss during transmission of compressed video streams |
| US15/620,808 Active US11039174B2 (en) | 2013-03-15 | 2017-06-12 | Recovery from packet loss during transmission of compressed video streams |
| US16/812,185 Abandoned US20200213625A1 (en) | 2013-03-15 | 2020-03-06 | Recovery From Packet Loss During Transmission Of Compressed Video Streams |
Family Applications Before (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/837,541 Active 2035-06-07 US9681155B2 (en) | 2013-03-15 | 2013-03-15 | Recovery from packet loss during transmission of compressed video streams |
| US15/620,808 Active US11039174B2 (en) | 2013-03-15 | 2017-06-12 | Recovery from packet loss during transmission of compressed video streams |
Country Status (1)
| Country | Link |
|---|---|
| US (3) | US9681155B2 (en) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9749627B2 (en) | 2013-04-08 | 2017-08-29 | Microsoft Technology Licensing, Llc | Control data for motion-constrained tile set |
| US9407923B2 (en) * | 2013-05-20 | 2016-08-02 | Gamefly Israel Ltd. | Overconing lost IP packets in streaming video in IP networks |
| US10158889B2 (en) * | 2015-01-31 | 2018-12-18 | Intel Corporation | Replaying old packets for concealing video decoding errors and video decoding latency adjustment based on wireless link conditions |
| US10091104B1 (en) * | 2015-06-01 | 2018-10-02 | Akamai Technologies, Inc. | Object reordering for fast Layer 4 switching of multiplexed connections |
| US10003811B2 (en) | 2015-09-01 | 2018-06-19 | Microsoft Technology Licensing, Llc | Parallel processing of a video frame |
| CN105872613A (en) * | 2016-03-30 | 2016-08-17 | 乐视控股(北京)有限公司 | Method and system for performing HLS slice loss compensation |
| US20170288816A1 (en) * | 2016-03-30 | 2017-10-05 | Le Holdings (Beijing) Co., Ltd. | Method and system for compensating hls slice loss |
| US10200727B2 (en) * | 2017-03-29 | 2019-02-05 | International Business Machines Corporation | Video encoding and transcoding for multiple simultaneous qualities of service |
| US10812857B2 (en) | 2018-09-28 | 2020-10-20 | Apple Inc. | Systems and methods for reducing latency of a video transmission system |
| US11039149B2 (en) * | 2019-08-01 | 2021-06-15 | Qualcomm Incorporated | Dynamic video insertion based on feedback information |
| EP4015314A1 (en) * | 2020-12-18 | 2022-06-22 | Constellium Singen GmbH | Crash extension for crash management system |
| CN114205555B (en) * | 2021-11-10 | 2022-08-19 | 广东广信通信服务有限公司 | Intelligent video customer service information processing method, system, equipment and medium |
| EP4210332A1 (en) | 2022-01-11 | 2023-07-12 | Tata Consultancy Services Limited | Method and system for live video streaming with integrated encoding and transmission semantics |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090213938A1 (en) * | 2008-02-26 | 2009-08-27 | Qualcomm Incorporated | Video decoder error handling |
| US20130016781A1 (en) * | 2011-07-14 | 2013-01-17 | Comcast Cable Communications, Llc | Preserving Image Quality in Temporally Compressed Video Streams |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6201834B1 (en) * | 1996-12-20 | 2001-03-13 | Intel Corporation | Method and apparatus for packet loss recovery with standard-based packet video |
| JP4373973B2 (en) * | 2005-11-15 | 2009-11-25 | 株式会社東芝 | Program sending system and program sending method |
| US8693538B2 (en) * | 2006-03-03 | 2014-04-08 | Vidyo, Inc. | System and method for providing error resilience, random access and rate control in scalable video communications |
| WO2008088305A2 (en) * | 2006-12-20 | 2008-07-24 | Thomson Research Funding Corporation | Video data loss recovery using low bit rate stream in an iptv system |
| PT2123052E (en) * | 2007-01-18 | 2011-03-02 | Fraunhofer Ges Forschung | Quality scalable video data stream |
| US20110194602A1 (en) * | 2010-02-05 | 2011-08-11 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for sub-pixel interpolation |
| US8861342B2 (en) * | 2011-10-28 | 2014-10-14 | Cisco Technology, Inc. | Multicast-only fast re-route processing for point-to-multipoint pseudowire |
-
2013
- 2013-03-15 US US13/837,541 patent/US9681155B2/en active Active
-
2017
- 2017-06-12 US US15/620,808 patent/US11039174B2/en active Active
-
2020
- 2020-03-06 US US16/812,185 patent/US20200213625A1/en not_active Abandoned
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090213938A1 (en) * | 2008-02-26 | 2009-08-27 | Qualcomm Incorporated | Video decoder error handling |
| US20130016781A1 (en) * | 2011-07-14 | 2013-01-17 | Comcast Cable Communications, Llc | Preserving Image Quality in Temporally Compressed Video Streams |
Also Published As
| Publication number | Publication date |
|---|---|
| US20170280167A1 (en) | 2017-09-28 |
| US9681155B2 (en) | 2017-06-13 |
| US20140269917A1 (en) | 2014-09-18 |
| US11039174B2 (en) | 2021-06-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20200213625A1 (en) | Recovery From Packet Loss During Transmission Of Compressed Video Streams | |
| US9661351B2 (en) | Client side frame prediction for video streams with skipped frames | |
| CN1242623C (en) | Video encoding method, decoding method, and related encoder and decoder | |
| KR101249569B1 (en) | Seamless handover of multicast sessions in internet protocol based wireless networks using staggercasting | |
| US8929443B2 (en) | Recovering from dropped frames in real-time transmission of video over IP networks | |
| JP5455648B2 (en) | System and method for improving error tolerance in video communication system | |
| US20150373075A1 (en) | Multiple network transport sessions to provide context adaptive video streaming | |
| US20040218669A1 (en) | Picture coding method | |
| JP2006518127A (en) | Picture decoding method | |
| CN108141581B (en) | Video coding | |
| EP3345392A1 (en) | Video coding | |
| US9264737B2 (en) | Error resilient transmission of random access frames and global coding parameters | |
| US12401706B2 (en) | Loss-resilient real-time video streaming | |
| CN103918258A (en) | Reducing amount of data in video encoding | |
| EP2908516A1 (en) | Process for transmitting an ongoing video stream from a publisher to a receiver through a MCU unit during a live session | |
| Vilei et al. | A novel unbalanced multiple description scheme for video transmission over wlan | |
| Kropfberger et al. | Evaluation of RTP immediate feedback and retransmission extensions [video streaming applications] | |
| WO2022205064A1 (en) | Video encoder, video decoder and corresponding method | |
| HK1088162B (en) | Picture decoding method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: ADVISORY ACTION MAILED |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |