[go: up one dir, main page]

WO1998034181A2 - A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen - Google Patents

A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen Download PDF

Info

Publication number
WO1998034181A2
WO1998034181A2 PCT/IB1998/000089 IB9800089W WO9834181A2 WO 1998034181 A2 WO1998034181 A2 WO 1998034181A2 IB 9800089 W IB9800089 W IB 9800089W WO 9834181 A2 WO9834181 A2 WO 9834181A2
Authority
WO
WIPO (PCT)
Prior art keywords
video
keyframes
keyframe
presentation
format
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB1998/000089
Other languages
French (fr)
Other versions
WO1998034181A3 (en
Inventor
Magdalena Dingena Janse
Duco Annardo David Das
Hok Kong Tang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Philips AB
Philips Norden AB
Original Assignee
Koninklijke Philips Electronics NV
Philips AB
Philips Norden AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV, Philips AB, Philips Norden AB filed Critical Koninklijke Philips Electronics NV
Priority to EP98900327A priority Critical patent/EP0928455A2/en
Priority to JP10529206A priority patent/JP2000511736A/en
Priority to KR1019980707892A priority patent/KR100633711B1/en
Priority to PCT/IB1998/000089 priority patent/WO1998034181A2/en
Publication of WO1998034181A2 publication Critical patent/WO1998034181A2/en
Publication of WO1998034181A3 publication Critical patent/WO1998034181A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/738Presentation of query results
    • G06F16/739Presentation of query results in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74Browsing; Visualisation therefor
    • G06F16/745Browsing; Visualisation therefor the internal structure of a single video sequence
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/107Programmed access in sequence to addressed parts of tracks of operating record carriers of operating tapes
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/11Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/40Combinations of multiple record carriers
    • G11B2220/45Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup
    • G11B2220/455Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup said record carriers being in one device and being used as primary and secondary/backup media, e.g. HDD-DVD combo device, or as source and target media, e.g. PC and portable player
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/90Tape-like record carriers

Definitions

  • a method and device for keyframe-based video displaying using a video cursor frame in a multikey frame screen is a method and device for keyframe-based video displaying using a video cursor frame in a multikey frame screen.
  • the invention relates to a method for keyframe-based displaying of a video presentation whilst enabling a user to select among said keyframes, and through such selecting, displaying a substantially continuous video stream based on said presentation, and which method displays various such keyframes in parallel in a reduced and static video format.
  • a continuous video stream means that video remains "on", which may include animation, a series of stills, or an interactive sequence of images.
  • the character may be various, such as film, news, or for example a shopping list. State of the art is represented by the article 'Content-Based Video Indexing and Retrieval' by S.W. Smoliar and H.J. Zhang, IEEE Multimedia, Summer 1994, pages 62-72.
  • keyframes may be derived from the video material upon its reception at the user's premises through some derivation algorithm, or keyframes may be labelled as such by the video provider, for example, in that each new video shot will start with a keyframe.
  • the present invention recognizes that such keyframes should be utilized so as to give users a dynamic overview over the presentation, combined with useful facilities for enabling them to edit the material in a sensible manner.
  • the invention is characterized in that said displaying is controlled as starting from a particular active keyframe which subsequently to the selecting acts as a dynamic video cursor frame within said format.
  • the cursor frame moves in real time, thereby enabling users to discern between keyframes, that may have similar scene configurations but different dynamisms.
  • the method allows for rendering an audio-accompanied video presentation, whilst said rendering is controlled as starting from audio associated to a particular active keyframe that subsequently to the selecting acts as a discrete video cursor frame within said format.
  • the audio is more discriminative than the video, and so the audio could represent the ongoing stream, while the video keyframe would operate as a kind of "header" pertaining to the interval between the two successive keyframes in question.
  • the video keyframe may remain static, while the active keyframe that works as discrete or discretely stepping cursor frame may be highlighted in an appropriate manner.
  • said dynamic video cursor frame freezes upon coinciding with the next displayed keyframe's content.
  • the freezed display content may pertain either to the start, or to the end of the interval that has just been displayed. Subsequently, the next keyframe may become the a dynamic video cursor frame.
  • the taking over may be controllable and/or programmable: this allows a user to view what editing the presentation will produce. Also, upon going to a next interval, the display mode may be altered, such as from normal to fast or slow display.
  • the invention may be used for fully digital video, or rather for video in which the various video frames have digital labels for easy addressing.
  • the keyframes are set in a hierarchical structure and/or various keyframes are allowed to be deleted from display for so joining intervals of said presentation before and behind the deleted keyframe.
  • the hierarchical structure may be implemented through the provider, or rather through some clustering algorithm. For example, if keyframes will be found through some cross-correlation, a rather small discrepancy therein may signal a low level keyframe, whereas a high discrepancy may signal a high level keyframe. The first could occur between successive shots in essentially the same movie scene, whereas the second would pertain to a changeover between scenes, that also may change overall conditions such as lighting level dramatically. Another situation is when various different types of video matter follow each other, such as news, film, or a service shop list. All this may facilitate the operations of the end user.
  • the invention also relates to a device for keyframe-based displaying of a video presentation and having selection means for enabling a user to select among said keyframes, and based on such selecting rendering a substantially continuous sensory stream relating to said presentation, said display device being arranged for displaying various such keyframes in parallel in a reduced and static video format, characterized in that the device has control means for controlling said rendering as starting from a particular active keyframe which subsequently acts as a video cursor frame within said format.
  • Figure 1 a block diagram of a TV-Recorder combination
  • Figure 2 an exemplary structure of a video recording
  • Figure 3 a design of a scrolling mosaic user interface
  • Figure 4 a design of a scrolling list user interface
  • Figure 5 a realisation of Figure 3
  • Figure 6 a realisation of Figure 4;
  • Figure 7 a state diagram of the system operation.
  • the keyframes must be presented in such a manner that they are generally distinguishable from each other by a user person located at a typical TV viewing distance, as related to the video size in question. For a family set, this distance amounts to a few meters, for a desktop set to about one half of a meter, and so on. • The number of keyframes presented simultaneously on one screen should be sufficient for providing a user person with an overview of a significant part of the contents of the digital video material.
  • the keyframes should be displayed in a generally undistorted fashion for maximizing discernability.
  • the aspect ratio of a particular key frame should preferably be retained.
  • the remote control device of the TV set itself operates as the user control input device.
  • Figure 1 is a diagram showing a TV-Recorder combination.
  • Item 20 represents the TV-set display and associated immediate control and powering.
  • Item 22 represents the antenna, or the connection with another type of signal distribution entity, such as cable distribution. This item includes, if appropriate, also the derivation of the digital video information or the digital signal part from the received signal.
  • Item 34 represents the routing of the video streams and associated information between the various subsystems of Figure 1. This routing is governed by control box 34 through control signals on line 35. The latter has been drawn as a single bidirectional interconnection but may in fact be built from any number of unidirectional or bidirectional lines.
  • the control box receives detection signals from the display, and from further subsystems 38, 40, whilst also controlling the latter two.
  • block 38 is a linear tape recorder with a very high storage capacity in the multi-gigabyte region.
  • Block 40 is a magnetic disc recorder with a high storage capacity, but which is nevertheless only a fraction of that of recorder 38; on the other hand, access in recorder 40 is much faster through cross- track jumping.
  • blocks 28, 40 form a two-level storage organization that is somewhat akin to a computer memory cache system, and stores all items of a video presentation at least once.
  • Figure 2 shows an exemplary structure of a video presentation.
  • bar 60 contains the video itself, either in the form of frames, or as a string of compressed video matter, such as MPEG-coded.
  • the information is stored along the bar as video time progresses, although actual storage requirements need not be uniform over time.
  • Interspersed keyframes have been indicated by dark vertical stripes such as 68.
  • a keyframe is used as representing, or as being typical of the overall video in the interval up to the next key frame.
  • the video provider singles out the first frame of each new shot as the keyframe thereof through adding some label or inclusion in a "table of contents" (TOC).
  • TOC "table of contents”
  • the receiver through some algorithm, detects that the video content changes abruptly from one frame to the next.
  • the present invention takes the associated algorithms for granted. As shown, their distribution may be non-uniform.
  • An alternative mechanism is that successive keyframes succeed each other at prescribed intervals, such as every ten seconds.
  • the keyframes are represented.
  • the keyframes are organized in some hierarchy, in that indication 64 has only a limited set of highly relevant keyframes. This hierarchization may be repeated, in that indication 66 is associated with only a single keyframe for all of the video presentation 60.
  • the various levels of keyframes may be determined in different ones of the organizations recited supra, and may even exist side by side.
  • the storage mapping on Figure 1 may be effected in that the main body of the video presentation is stored in tape recorder 38, whereas the keyframes are at least reproduced in disc recorder 40, possibly in combination with small video intervals immediately following the associated keyframe.
  • the length of such interval may correspond to the time latency of linear tape recorder 38, so that thereby real-time access may be attained.
  • the video presentation may be essentially linear, such as a film.
  • Alternative usage is that certain storage intervals may contain animation, stills, or other images to be used by a consumer present.
  • a possible influencing of a keyframe is to suppress it. This effectively joins the time interval before the keyframe in question with the time interval behind it.
  • a reset feature may again disjoin the interval.
  • keyframes may be suppressed, such as the class that is separated from each other by a fixed time interval.
  • Various different classes of keyframes could be available for one presentation, such as those introduced by the provider versus those that are generated by a local algorithm at reception.
  • FIG. 3 is an exemplary design of a scrolling mosaic user interface. Every screen presents twenty keyframes starting from the top left corner from left to right down to the bottom right corner: each keyframe has its number in the overall ranking of keyframes shown. Actually, keyframe 144 is highlighted by a rectangular control cursor. A user person may through activating a remote control, move the control cursor freely over the keyframes displayed, as well as over the buttons displayed at the top and bottom bars, through the navigational controls on the cursor device. If the user moves the control cursor to the left in the top left corner, the display jumps back by 20 keyframes. Moving to the right in the lower right hand corner will cause a forward jump over 20 frames. Further, accessing the top bar of the overall screen will effect shortcut controls for accessing other parts of the presentation, in that the latter is divided into five equally long parts: a black horizontal bar indicates the total time covered by the twenty keyframes displayed here, of the overall presentation.
  • View program may control a start at a cursor- accessed keyframe.
  • View segment may do the same, but may be restricted to playing only a single segment, that will end at the next keyframe.
  • View from x to y may control a start at the earliest in time of two cursor-accessed keyframes, and stop at the last in time of the two cursor-accessed keyframes.
  • Various other control modes or functions are feasible in combination with the keyframe-selecting functionalities described supra. Examples are fast- forward or slow-forward, that allow a user person to check a particular interval for certain occurrences, or reverse to attain certain video effects.
  • the display upon passing the instant in time pertaining to a particular keyframe, the latter becomes active and effectively displays the video stream, until arriving at the instant associated with the next keyframe. Thereupon, the latter becomes the active frame.
  • the above feature allows a user to straightforwardly program a video recorder for an interval display sequence such as by leaving out certain segments, such as advertising, or rather, to draw attention to certain details by means of slow-forward.
  • audio may be active or suppressed through a control button not shown. Alternatively, control may let audio go on, but the video cursor is discrete, in that it steps only from interval to interval though appropriate highlighting.
  • Figure 4 shows an exemplary design of a scrolling list user interface.
  • keyframe 145 is highlighted by a rectangular control cursor that runs along the edges of the keyframe.
  • Keyframe 145 is also displayed at a larger magnification in the background.
  • the control interface is the same as with respect to Figure 3, although the control button positions are different.
  • the enlarged keyframe is suppressed in the multi-keyframe bar.
  • Figure 5 shows a realisation of Figure 3, with actual images representing the keyframes, and with a few modifications.
  • the control interface is identical with the one discussed earlier, although a few items have been left out that were considered aesthetically unattractive.
  • the time stretch covered by the 20 keyframes amounted to about 2 minutes, which appears to be close to a useful lower bound.
  • the various keyframes can indeed quite well be distinguished from each other. Further modifications may include leaving out various ones of the control buttons, when it is felt that user persons will not feel the need for extensive editing.
  • Figure 7 is a state diagram of the system operation.
  • the system awaits input from the user, while displaying the multiple keyframes. Such input may imply jumping among the displayed keyframes, jumping to another set of keyframes, selecting a keyframe for displaying the associated interval. Any such input effects arrow 104 and starts a new time interval. Absent any such input during n seconds (such as 20 seconds) effects arrow 108, so that state 102 is reached.
  • the system runs the dynamic video cursor frame. As long as no user input is received, arrow 110 is effected, and the system continues as long as displayable video material is available. If user input is received however, arrow 106 is effected, and the system freezes, either at the actual content of the dynamic video cursor frame, or at the beginning of the actual interval.
  • Figure 6 is a realisation of Figure 4. It displays a screen capture of the actual implementation. Here the lines diverging from the top bar of the screen have not been implemented for reasons of a aesthetics. Moreover, the cursor position remains fixed in the centre of the screen, thereby operating as a focal point on a big real of film snap shots. These small changes aside, the user interface functions as described hereabove.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Studio Circuits (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

Keyframe-based displaying of a video presentation enables a user to select among keyframes, and based on the selecting displays a substantially continuous video stream relating to the presentation. In particular, various keyframes are displayed in parallel in a reduced and static video format, and the displaying is controlled as starting from a particular active keyframe which subsequently acts as a dynamic cursor frame within the video format. The cursor may be dynamic video plus dynamic audio, dynamic video alone, or static video per interval plus dynamic audio.

Description

A method and device for keyframe-based video displaying using a video cursor frame in a multikey frame screen.
BACKGROUND OF THE INVENTION
The invention relates to a method for keyframe-based displaying of a video presentation whilst enabling a user to select among said keyframes, and through such selecting, displaying a substantially continuous video stream based on said presentation, and which method displays various such keyframes in parallel in a reduced and static video format. The use of keyframes as representative parts of a video presentation that is recorded for subsequent selective playback has been proposed elsewhere. A continuous video stream means that video remains "on", which may include animation, a series of stills, or an interactive sequence of images. The character may be various, such as film, news, or for example a shopping list. State of the art is represented by the article 'Content-Based Video Indexing and Retrieval' by S.W. Smoliar and H.J. Zhang, IEEE Multimedia, Summer 1994, pages 62-72.
By themselves, keyframes may be derived from the video material upon its reception at the user's premises through some derivation algorithm, or keyframes may be labelled as such by the video provider, for example, in that each new video shot will start with a keyframe. The present invention recognizes that such keyframes should be utilized so as to give users a dynamic overview over the presentation, combined with useful facilities for enabling them to edit the material in a sensible manner.
SUMMARY TO THE INVENTION
Therefore, amongst other things it is an object of the present invention to provide a user with an immediate feel of the dynamic aspects, as well as of the static content of the video presentation, as combined in a single video screen. Now, according to one of its aspects, the invention is characterized in that said displaying is controlled as starting from a particular active keyframe which subsequently to the selecting acts as a dynamic video cursor frame within said format. In this embodiment, the cursor frame moves in real time, thereby enabling users to discern between keyframes, that may have similar scene configurations but different dynamisms.
Advantageously, the method allows for rendering an audio-accompanied video presentation, whilst said rendering is controlled as starting from audio associated to a particular active keyframe that subsequently to the selecting acts as a discrete video cursor frame within said format. In certain presentations, for proper editing the audio is more discriminative than the video, and so the audio could represent the ongoing stream, while the video keyframe would operate as a kind of "header" pertaining to the interval between the two successive keyframes in question. The video keyframe may remain static, while the active keyframe that works as discrete or discretely stepping cursor frame may be highlighted in an appropriate manner.
Advantageously, said dynamic video cursor frame freezes upon coinciding with the next displayed keyframe's content. The freezed display content may pertain either to the start, or to the end of the interval that has just been displayed. Subsequently, the next keyframe may become the a dynamic video cursor frame. The taking over may be controllable and/or programmable: this allows a user to view what editing the presentation will produce. Also, upon going to a next interval, the display mode may be altered, such as from normal to fast or slow display. The invention may be used for fully digital video, or rather for video in which the various video frames have digital labels for easy addressing.
Advantageously, the keyframes are set in a hierarchical structure and/or various keyframes are allowed to be deleted from display for so joining intervals of said presentation before and behind the deleted keyframe. In this way, fine-grained as well as coarse-grained editing can be applied in quick alternation. Again, the hierarchical structure may be implemented through the provider, or rather through some clustering algorithm. For example, if keyframes will be found through some cross-correlation, a rather small discrepancy therein may signal a low level keyframe, whereas a high discrepancy may signal a high level keyframe. The first could occur between successive shots in essentially the same movie scene, whereas the second would pertain to a changeover between scenes, that also may change overall conditions such as lighting level dramatically. Another situation is when various different types of video matter follow each other, such as news, film, or a service shop list. All this may facilitate the operations of the end user.
The invention also relates to a device for keyframe-based displaying of a video presentation and having selection means for enabling a user to select among said keyframes, and based on such selecting rendering a substantially continuous sensory stream relating to said presentation, said display device being arranged for displaying various such keyframes in parallel in a reduced and static video format, characterized in that the device has control means for controlling said rendering as starting from a particular active keyframe which subsequently acts as a video cursor frame within said format. The providing of the recited facilities in the environment of a user terminal, such as an enhanced video recorder plus TV set combination is believed to greatly extend user-directed performance thereof. Further advantageous aspects of the invention are recited in dependent Claims.
BRIEF DESCRIPTION OF THE DRAWING
These and other aspects and advantages of the invention will be discussed more in detail with reference to the disclosure of preferred embodiments hereinafter, and in particular with reference to the appended Figures that show: Figure 1, a block diagram of a TV-Recorder combination;
Figure 2, an exemplary structure of a video recording;
Figure 3, a design of a scrolling mosaic user interface;
Figure 4, a design of a scrolling list user interface;
Figure 5, a realisation of Figure 3; Figure 6, a realisation of Figure 4;
Figure 7, a state diagram of the system operation.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
Various advantageous aspects, in particular, but not exclusively pertaining to a standard ordinary customer and private home use, imply the following:
• The keyframes must be presented in such a manner that they are generally distinguishable from each other by a user person located at a typical TV viewing distance, as related to the video size in question. For a family set, this distance amounts to a few meters, for a desktop set to about one half of a meter, and so on. • The number of keyframes presented simultaneously on one screen should be sufficient for providing a user person with an overview of a significant part of the contents of the digital video material.
• The keyframes should be displayed in a generally undistorted fashion for maximizing discernability. For example, the aspect ratio of a particular key frame should preferably be retained.
• Preferably, the remote control device of the TV set itself operates as the user control input device.
• Feedback must be such that the information associated to the feedback is perceivable from a typical viewing distance of an ordinary television set, as discussed supra. • Computer concepts such as "drag and drop" are generally not necessary.
• Occasional usage of the facilities must be provided for.
• It must be provided for, that the facilities may be used only occasionally, rather than continually. • Concepts in the user interface should reflect the linear model of the video presentation that is familiar to users.
DISCLOSURE OF A PARTICULAR EMBODIMENT
Regarding to an implementation of the invention, Figure 1 is a diagram showing a TV-Recorder combination. Item 20 represents the TV-set display and associated immediate control and powering. Item 22 represents the antenna, or the connection with another type of signal distribution entity, such as cable distribution. This item includes, if appropriate, also the derivation of the digital video information or the digital signal part from the received signal. Item 34 represents the routing of the video streams and associated information between the various subsystems of Figure 1. This routing is governed by control box 34 through control signals on line 35. The latter has been drawn as a single bidirectional interconnection but may in fact be built from any number of unidirectional or bidirectional lines. The control box receives detection signals from the display, and from further subsystems 38, 40, whilst also controlling the latter two. Now, block 38 is a linear tape recorder with a very high storage capacity in the multi-gigabyte region. Block 40 is a magnetic disc recorder with a high storage capacity, but which is nevertheless only a fraction of that of recorder 38; on the other hand, access in recorder 40 is much faster through cross- track jumping. Together, blocks 28, 40 form a two-level storage organization that is somewhat akin to a computer memory cache system, and stores all items of a video presentation at least once.
Figure 2 shows an exemplary structure of a video presentation. For effecting the video matter functionality, bar 60 contains the video itself, either in the form of frames, or as a string of compressed video matter, such as MPEG-coded. The information is stored along the bar as video time progresses, although actual storage requirements need not be uniform over time. Interspersed keyframes have been indicated by dark vertical stripes such as 68. A keyframe is used as representing, or as being typical of the overall video in the interval up to the next key frame. One realization is that the video provider singles out the first frame of each new shot as the keyframe thereof through adding some label or inclusion in a "table of contents" (TOC). Another realization is that the receiver, through some algorithm, detects that the video content changes abruptly from one frame to the next. The present invention takes the associated algorithms for granted. As shown, their distribution may be non-uniform. An alternative mechanism is that successive keyframes succeed each other at prescribed intervals, such as every ten seconds. According to the embodiment, at indication 62 only the keyframes are represented. Furthermore, the keyframes are organized in some hierarchy, in that indication 64 has only a limited set of highly relevant keyframes. This hierarchization may be repeated, in that indication 66 is associated with only a single keyframe for all of the video presentation 60. The various levels of keyframes may be determined in different ones of the organizations recited supra, and may even exist side by side. The storage mapping on Figure 1 may be effected in that the main body of the video presentation is stored in tape recorder 38, whereas the keyframes are at least reproduced in disc recorder 40, possibly in combination with small video intervals immediately following the associated keyframe. The length of such interval may correspond to the time latency of linear tape recorder 38, so that thereby real-time access may be attained. By itself, the video presentation may be essentially linear, such as a film. Alternative usage is that certain storage intervals may contain animation, stills, or other images to be used by a consumer present. A possible influencing of a keyframe is to suppress it. This effectively joins the time interval before the keyframe in question with the time interval behind it. A reset feature may again disjoin the interval. Also, various classes of keyframes may be suppressed, such as the class that is separated from each other by a fixed time interval. Various different classes of keyframes could be available for one presentation, such as those introduced by the provider versus those that are generated by a local algorithm at reception.
Figure 3 is an exemplary design of a scrolling mosaic user interface. Every screen presents twenty keyframes starting from the top left corner from left to right down to the bottom right corner: each keyframe has its number in the overall ranking of keyframes shown. Actually, keyframe 144 is highlighted by a rectangular control cursor. A user person may through activating a remote control, move the control cursor freely over the keyframes displayed, as well as over the buttons displayed at the top and bottom bars, through the navigational controls on the cursor device. If the user moves the control cursor to the left in the top left corner, the display jumps back by 20 keyframes. Moving to the right in the lower right hand corner will cause a forward jump over 20 frames. Further, accessing the top bar of the overall screen will effect shortcut controls for accessing other parts of the presentation, in that the latter is divided into five equally long parts: a black horizontal bar indicates the total time covered by the twenty keyframes displayed here, of the overall presentation.
Other functions may be initiated by first selecting a particular keyframe and subsequently one of the bottom buttons. "View program" may control a start at a cursor- accessed keyframe. "View segment" may do the same, but may be restricted to playing only a single segment, that will end at the next keyframe. "View from x to y" may control a start at the earliest in time of two cursor-accessed keyframes, and stop at the last in time of the two cursor-accessed keyframes. Various other control modes or functions are feasible in combination with the keyframe-selecting functionalities described supra. Examples are fast- forward or slow-forward, that allow a user person to check a particular interval for certain occurrences, or reverse to attain certain video effects. During the display, upon passing the instant in time pertaining to a particular keyframe, the latter becomes active and effectively displays the video stream, until arriving at the instant associated with the next keyframe. Thereupon, the latter becomes the active frame. The above feature allows a user to straightforwardly program a video recorder for an interval display sequence such as by leaving out certain segments, such as advertising, or rather, to draw attention to certain details by means of slow-forward. During the display, audio may be active or suppressed through a control button not shown. Alternatively, control may let audio go on, but the video cursor is discrete, in that it steps only from interval to interval though appropriate highlighting.
Figure 4 shows an exemplary design of a scrolling list user interface. Here, at the bottom of the overall screen five keyframes are displayed, keyframe 145 being highlighted by a rectangular control cursor that runs along the edges of the keyframe. Keyframe 145 is also displayed at a larger magnification in the background. The control interface is the same as with respect to Figure 3, although the control button positions are different. Alternatively, the enlarged keyframe is suppressed in the multi-keyframe bar.
Figure 5 shows a realisation of Figure 3, with actual images representing the keyframes, and with a few modifications. The control interface is identical with the one discussed earlier, although a few items have been left out that were considered aesthetically unattractive. In the embodiment, the time stretch covered by the 20 keyframes amounted to about 2 minutes, which appears to be close to a useful lower bound. The various keyframes can indeed quite well be distinguished from each other. Further modifications may include leaving out various ones of the control buttons, when it is felt that user persons will not feel the need for extensive editing. The rationale of the arrangements for having a dynamic representation of the video cursor that runs in time in the actually active keyframe field, is that the static representation of the keyframes alone does insufficiently convey the semantics of the video representation as a whole, when such dynamics are indispensible for 'understanding' the evolvement of the events. Therefore, the semantics are enhanced as follows. After the system has been idle for a certain amount of time, the keyframe that the cursor 'encloses' will 'come alive' as it will start playing the digital video material in miniature, including any associated audio and further effects. If during the playback, the next keyframe is reached, the cursor will automatically 'jump' to the next keyframe presented in the user interface, until the user will (re)start interacting with the system.
In this respect, Figure 7 is a state diagram of the system operation. In state 100, the system awaits input from the user, while displaying the multiple keyframes. Such input may imply jumping among the displayed keyframes, jumping to another set of keyframes, selecting a keyframe for displaying the associated interval. Any such input effects arrow 104 and starts a new time interval. Absent any such input during n seconds (such as 20 seconds) effects arrow 108, so that state 102 is reached. Therein, the system runs the dynamic video cursor frame. As long as no user input is received, arrow 110 is effected, and the system continues as long as displayable video material is available. If user input is received however, arrow 106 is effected, and the system freezes, either at the actual content of the dynamic video cursor frame, or at the beginning of the actual interval.
Figure 6 is a realisation of Figure 4. It displays a screen capture of the actual implementation. Here the lines diverging from the top bar of the screen have not been implemented for reasons of a aesthetics. Moreover, the cursor position remains fixed in the centre of the screen, thereby operating as a focal point on a big real of film snap shots. These small changes aside, the user interface functions as described hereabove.

Claims

oCLAIMS:
1. A method for keyframe-based displaying of a video presentation whilst enabling a user to select among said keyframes, and through such selecting, displaying a substantially continuous video stream based on said presentation, and which method displays various such keyframes in parallel in a reduced and static video format, characterized in that said displaying is controlled as starting from a particular active keyframe which subsequently to the selecting acts as a dynamic video cursor frame within said format.
2. A method as claimed in Claim 1, wherein said video is audio- accompanied, and the method allows suppressing of the audio associated to the actual dynamic video cursor frame.
3. A method for keyframe-based rendering of an audio-accompanied video presentation whilst enabling a user to select among said keyframes, and through such selecting, outputting a substantially continuous sensory stream based on said presentation, and which method displays various such keyframes in parallel in a reduced and static video format, characterized in that said rendering is controlled as starting from audio associated to a particular active keyframe that subsequently to the selecting acts as a discrete video cursor frame within said format.
4. A method as claimed in Claims 1 or 2, wherein said dynamic video cursor frame freezes upon coinciding with the next displayed keyframe's content.
5. A method as claimed in Claim 4, wherein upon said coinciding, a next keyframe takes over as dynamic video cursor.
6. A method as claimed in Claim 5, wherein said taking over is controllable and/or programmable.
7. A method as claimed in Claim 5, wherein said taking over may involve a programmable mode altering.
8. A method as claimed in Claim 1, wherein said displayed keyframes substantially take up all of an actual video format.
9. A method as claimed in Claim 1, wherein said displayed keyframes take up a fraction of an actual video format, and said dynamic video cursor frame is displayed in an enlarged format with respect to other keyframes.
10. A method as claimed in Claim 1, which furthermore displays a predetermined set of soft control keys.
11. A method as claimed in Claim 1, wherein said video presentation is at least digitally labelled.
12. A method as claimed in Claim 1, wherein keyframes are set in a hierarchical structure and/or various keyframes are allowed to be deleted from display for so joining intervals of said presentation before and behind the deleted keyframe.
13. A device for keyframe-based displaying of a video presentation and having selection means for enabling a user to select among said keyframes, and based on such selecting rendering a substantially continuous sensory stream relating to said presentation, said display device being arranged for displaying various such keyframes in parallel in a reduced and static video format, characterized in that the device has control means for controlling said rendering as starting from a particular active keyframe which subsequently acts as a video cursor frame within said format.
14. A device as claimed in Claim 13, wherein said rendering includes continuous video and said video cursor frame is dynamic.
15. A device as claimed in Claim 14, wherein said rendering includes audio, and said video cursor frame is a discrete video cursor frame.
16. A device as claimed in Claim 13 and arranged for displaying said keyframes at mutually uniform sizes.
17. A device as claimed in Claim 13 and arranged for displaying a video cursor frame in an enlarged size with respect to a plurality of further keyframes in a uniform said reduced format.
PCT/IB1998/000089 1997-02-03 1998-01-22 A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen Ceased WO1998034181A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP98900327A EP0928455A2 (en) 1997-02-03 1998-01-22 A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
JP10529206A JP2000511736A (en) 1997-02-03 1998-01-22 Method and apparatus for displaying video based on keyframes using video cursor frames on a multi-keyframe screen
KR1019980707892A KR100633711B1 (en) 1997-02-03 1998-01-22 Video display method and device based on key frame using video cursor frame on multi key frame screen
PCT/IB1998/000089 WO1998034181A2 (en) 1997-02-03 1998-01-22 A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP97200286.9 1997-02-03
PCT/IB1998/000089 WO1998034181A2 (en) 1997-02-03 1998-01-22 A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen

Publications (2)

Publication Number Publication Date
WO1998034181A2 true WO1998034181A2 (en) 1998-08-06
WO1998034181A3 WO1998034181A3 (en) 1998-12-10

Family

ID=11004655

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB1998/000089 Ceased WO1998034181A2 (en) 1997-02-03 1998-01-22 A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen

Country Status (1)

Country Link
WO (1) WO1998034181A2 (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0859365A3 (en) * 1997-02-12 2001-07-18 Sony Corporation Recording/reproducing apparatus and method
EP0915471A3 (en) * 1997-11-10 2001-08-01 Hitachi, Ltd. Video searching, video information production, and storage medium for storing processing program therefor
EP1132834A1 (en) * 2000-03-08 2001-09-12 Telematica Instituut Method and device for retrieving video information
EP1116091A4 (en) * 1998-10-02 2001-12-19 Worldspace Man Corp System for providing a user with active and passive access to cached content
KR20020074328A (en) * 2001-03-20 2002-09-30 서치캐스트 주식회사 Method for playing motion pictures using keyframe and apparatus thereof
EP1251515A1 (en) * 2001-04-19 2002-10-23 Koninklijke Philips Electronics N.V. Method and system for selecting a position in an image sequence
WO2002084667A1 (en) * 2001-04-17 2002-10-24 Koninklijke Philips Electronics N.V. Method and system for selecting a position in an image sequence
KR100374040B1 (en) * 2001-03-09 2003-03-03 엘지전자 주식회사 Method for detecting caption synthetic key frame in video stream
WO2003071545A1 (en) * 2002-02-21 2003-08-28 Koninklijke Philips Electronics N.V. Simultaneous viewing of time divided segments of a tv program
EP1107613A3 (en) * 1999-12-01 2004-03-10 Sony Corporation Picture recording apparatus and methods
AU2003204917B2 (en) * 2002-06-24 2006-01-12 Canon Kabushiki Kaisha Method and Apparatus for Synchronising a Keyframe with Sound
EP1630813A1 (en) * 2004-08-24 2006-03-01 Sony Corporation Information processing apparatus and associated method of prioritizing content for playback
EP2256647A1 (en) * 2009-05-29 2010-12-01 Kabushiki Kaisha Toshiba Thumbnail generating device and thumbnail generating method
US9043701B2 (en) 2005-01-06 2015-05-26 Thomson Licensing Method and apparatus for indicating the impaired sequences of an audiovisual document
CN114041165A (en) * 2019-07-18 2022-02-11 华为技术有限公司 Video similarity detection method, device and equipment

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
IEEE MULTIMEDIA, Summer 1994, STEPHEN W. SMOLIAR et al., "Content-Based Video Indexing and Retrieval", pages 62-72. *
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, Volume 41, No. 3, August 1995, H.-Y. CHEN et al., "A Multi-Layer Video Browsing System". *
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, Volume 1, 1990, Y. TONOMURA et al., "Content Oriented Visual Interface Using Video Icons for Visual Database Systems", pages 183-198. *
PROCEEDINGS OF THE SPIE, Volume 2417, 1995, H.J. ZHANG et al., "Content-Based Video Browsing Tools", pages 389-398. *
See also references of EP0928455A2 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0859365A3 (en) * 1997-02-12 2001-07-18 Sony Corporation Recording/reproducing apparatus and method
US6377745B2 (en) 1997-02-12 2002-04-23 Sony Corporation Recording/reproducing apparatus and method
US6449608B1 (en) 1997-11-10 2002-09-10 Hitachi, Ltd. Video searching method and apparatus, video information producing method, and storage medium for storing processing program thereof
EP0915471A3 (en) * 1997-11-10 2001-08-01 Hitachi, Ltd. Video searching, video information production, and storage medium for storing processing program therefor
US6584463B2 (en) 1997-11-10 2003-06-24 Hitachi, Ltd. Video searching method, apparatus, and program product, producing a group image file from images extracted at predetermined intervals
EP1116091A4 (en) * 1998-10-02 2001-12-19 Worldspace Man Corp System for providing a user with active and passive access to cached content
US6928234B2 (en) 1999-12-01 2005-08-09 Sony Corporation Picture recording apparatus and method thereof
US7483625B2 (en) 1999-12-01 2009-01-27 Sony Corporation Picture recording apparatus and method thereof
EP1107613A3 (en) * 1999-12-01 2004-03-10 Sony Corporation Picture recording apparatus and methods
NL1014584C2 (en) * 2000-03-08 2001-09-13 Telematica Inst Method and device for retrieving video information.
EP1132834A1 (en) * 2000-03-08 2001-09-12 Telematica Instituut Method and device for retrieving video information
KR100374040B1 (en) * 2001-03-09 2003-03-03 엘지전자 주식회사 Method for detecting caption synthetic key frame in video stream
KR20020074328A (en) * 2001-03-20 2002-09-30 서치캐스트 주식회사 Method for playing motion pictures using keyframe and apparatus thereof
WO2002084667A1 (en) * 2001-04-17 2002-10-24 Koninklijke Philips Electronics N.V. Method and system for selecting a position in an image sequence
WO2002086897A1 (en) * 2001-04-19 2002-10-31 Koninklijke Philips Electronics N.V. Keyframe-based playback position selection method and system
EP1251515A1 (en) * 2001-04-19 2002-10-23 Koninklijke Philips Electronics N.V. Method and system for selecting a position in an image sequence
WO2003071545A1 (en) * 2002-02-21 2003-08-28 Koninklijke Philips Electronics N.V. Simultaneous viewing of time divided segments of a tv program
AU2003204917B2 (en) * 2002-06-24 2006-01-12 Canon Kabushiki Kaisha Method and Apparatus for Synchronising a Keyframe with Sound
EP1630813A1 (en) * 2004-08-24 2006-03-01 Sony Corporation Information processing apparatus and associated method of prioritizing content for playback
US7804505B2 (en) 2004-08-24 2010-09-28 Sony Corporation Information processing apparatus and associated method of prioritizing content for playback
US9043701B2 (en) 2005-01-06 2015-05-26 Thomson Licensing Method and apparatus for indicating the impaired sequences of an audiovisual document
EP2256647A1 (en) * 2009-05-29 2010-12-01 Kabushiki Kaisha Toshiba Thumbnail generating device and thumbnail generating method
CN114041165A (en) * 2019-07-18 2022-02-11 华为技术有限公司 Video similarity detection method, device and equipment

Also Published As

Publication number Publication date
WO1998034181A3 (en) 1998-12-10

Similar Documents

Publication Publication Date Title
US6340971B1 (en) Method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
US6222532B1 (en) Method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
JP4364190B2 (en) Method and apparatus for nonlinear playback control of multimedia data stream
US10587925B2 (en) Television viewer interface system
JP6356097B2 (en) Electronic program guide with digital storage
US8091111B2 (en) Methods and apparatus for recording and replaying sports broadcasts
US7333712B2 (en) Visual summary for scanning forwards and backwards in video content
KR100411340B1 (en) Video browsing system based on article of news video content
US20120087637A1 (en) Methods and apparatus for recording and replaying video broadcasts
US20030184679A1 (en) Method, apparatus, and program for providing slow motion advertisements in video information
WO1998034181A2 (en) A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
CN101124818A (en) Hierarchical Program Guide
WO1998034182A2 (en) A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
KR100552248B1 (en) Method and apparatus for navigating through video material by multiple key-frames parallel display
US6718117B1 (en) Method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
JPH0779404A (en) Method and apparatus for creating a list of representative images of video
EP0928455A2 (en) A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): JP KR

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1998900327

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1019980707892

Country of ref document: KR

AK Designated states

Kind code of ref document: A3

Designated state(s): JP KR

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1998900327

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1019980707892

Country of ref document: KR

WWG Wipo information: grant in national office

Ref document number: 1019980707892

Country of ref document: KR