[go: up one dir, main page]

CN1383543A - Karaoka system - Google Patents

Karaoka system Download PDF

Info

Publication number
CN1383543A
CN1383543A CN01801723A CN01801723A CN1383543A CN 1383543 A CN1383543 A CN 1383543A CN 01801723 A CN01801723 A CN 01801723A CN 01801723 A CN01801723 A CN 01801723A CN 1383543 A CN1383543 A CN 1383543A
Authority
CN
China
Prior art keywords
video
user
image
karaoke
concept
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN01801723A
Other languages
Chinese (zh)
Inventor
I·科尔塞特
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of CN1383543A publication Critical patent/CN1383543A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/272Means for inserting a foreground image in a background image, i.e. inlay, outlay
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B31/00Arrangements for the associated working of recording or reproducing apparatus with related apparatus
    • G11B31/02Arrangements for the associated working of recording or reproducing apparatus with related apparatus with automatic musical instruments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/36Accompaniment arrangements
    • G10H1/361Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems
    • G10H1/368Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems displaying animated or moving pictures synchronized with the music or audio part
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/005Non-interactive screen display of musical or status data
    • G10H2220/011Lyrics displays, e.g. for karaoke applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/155User input interfaces for electrophonic musical instruments
    • G10H2220/441Image sensing, i.e. capturing images or optical patterns for musical purposes or musical control purposes
    • G10H2220/455Camera input, e.g. analyzing pictures from a video camera and using the analysis results as control data

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)
  • Studio Circuits (AREA)
  • Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

The concept of singing karaoke exclusively relies on audio based techniques. The invention consists of developing the concept of video immersion: the user will see his (her) image inserted into the video clip or movie at the place of his (her) favorite dancer, singer or player, and will therefore be able to play clip/song on video tape, and replace any famous star. More precisely, the invention relates to a karaoke system in which are provided successive means for picking up an image of the user and his/her voice, analyzing and processing the obtained signals, mixing the audio/video signals thus analyzed and processed and a pre-recorded material and displaying the combination signals thus obtained.

Description

一种卡拉OK系统A karaoke system

本发明涉及一种卡拉OK系统,其用于一个如视频剪辑(videoclip)或影片的序列中的演唱。The present invention relates to a karaoke system for singing in a sequence such as a video clip or film.

如例如欧洲专利申请EP0782338中所描述的卡拉OK系统中,音乐、歌词或任何种类的音频数据由传输站传输到分配站。系统的主模块的音乐控制装置把音乐通过监视器的内设扬声器播出,且把声音从一个未示出的麦克风通过所述的扬声器播出。图象控制装置将背景图象(如视频图像或从背景图象存储装置中提取的静态图象)显示在监视器上,而歌词控制装置通过把歌词叠加于背景图象上来显示歌词。图象拾取设备,如一个CCD摄像机,拾取演唱者的图像,并将其通过视频图象控制装置叠加于监视器屏幕上作为叠加的图像。这样的系统,可定义为卡拉OK概念中所谓的“视频混合”。In a karaoke system as described eg in European patent application EP0782338, music, lyrics or any kind of audio data is transmitted from a transmission station to a distribution station. The music control unit of the main module of the system broadcasts music through the built-in speaker of the monitor, and broadcasts sound from a microphone not shown through said speaker. The image control means displays a background image (such as a video image or a still image extracted from the background image storage means) on the monitor, and the lyrics control means displays the lyrics by superimposing the lyrics on the background image. Image pick-up equipment, such as a CCD camera, picks up the image of the singer, and it is superimposed on the monitor screen as the superimposed image by the video image control device. Such a system can be defined as the so-called "video mixing" in the karaoke concept.

本发明的目的是提出另一类型的具备额外功能的卡拉OK系统。The purpose of the present invention is to propose another type of karaoke system with additional functions.

为了这一目的,该系统涉及一种用于在如视频剪辑或影片的序列过程中演唱的卡拉OK系统,并包括一系列用于拾取使用者的图象和声音的拾取设备,一个用于把使用者的至少一部分与背景分离的分析与处理设备,一个用于把所述分析与处理设备的输出信号与预先录制的材料组合起来的混合与再现(rendering)设备,和一个用于显示该组合信号的显示设备。For this purpose, the system relates to a karaoke system for singing during a sequence such as a video clip or film, and includes a series of pickup devices for picking up the user's image and sound, a an analysis and processing device with at least a portion of the user separated from the background, a mixing and rendering device for combining the output signal of said analysis and processing device with pre-recorded material, and a device for displaying the combined Signal display device.

时至今日,唱卡拉OK的概念只依赖于基于音频的技术,其只提供有限的功能且不可能把使用者真正插入到视频虚拟世界中。所建议的引入了视频混合(video mixing)的概念的解决方案,允许将这一卡拉OK概念扩展到视频中,并一般地,允许发展完全音频-视频插入概念:根据所述概念,在歌曲的视频剪辑中的声音和面孔可由偶然(fortuitous)演唱者的声音和面孔代替(此后也称使用者,因为他或她事实上可以是演唱者、表演者、舞蹈者等…)。同样建议的技术可在其它环境中发现相似的应用,例如电子商务领域或用于预先录制内容的视频编辑。As of today, the concept of singing karaoke relies only on audio-based technology, which provides limited functionality and makes it impossible to actually insert the user into the video virtual world. The proposed solution, which introduces the concept of video mixing, allows the extension of this karaoke concept to video and, in general, the development of a full audio-video insertion concept: according to said concept, in the song The voices and faces in the video clip can be replaced by those of a fortuitous singer (hereafter also referred to as a user because he or she can in fact be a singer, performer, dancer, etc. . . . ). The same proposed technology may find similar applications in other contexts, such as the field of e-commerce or video editing for pre-recorded content.

下面将参考附图,通过实例来描述本发明,其中:The present invention will be described by way of example below with reference to the accompanying drawings, in which:

图1:根据本发明的卡拉OK系统的方框图。Figure 1: Block diagram of a karaoke system according to the present invention.

图2:根据本发明的卡拉OK系统的另一实施方案。Figure 2: Another embodiment of the karaoke system according to the invention.

如图1所示,实现根据本发明的卡拉OK系统所必需的不同的子系统主要是一个分析与处理设备11和一个混合与再现设备12。As shown in FIG. 1 , the different subsystems necessary to realize the karaoke system according to the invention are mainly an analysis and processing device 11 and a mixing and reproduction device 12 .

用于接收由拾取设备10拾取到的使用者(黑色示出的人)的图象和声音的分析与处理设备11包括一个分割电路,其用于把例如使用者的面孔与背景分离,从而限定一个阿尔法(alpha)平面(如果使用者被置于舞台上,这样的电路可基于例如蓝色屏幕前的色度键技术)。混合与再现设备12是一个利用设备11中分析的形状信息把使用者与预先录制的由媒体13传送的视频或音频-视频背景合成起来的电路(所述的预先录制的材料在媒体左侧示出)。这一合成完成了用于把所述使用者的声音与来自歌曲的预先录制的音乐背景混合的音频合成。然后,利用由设备11限定的阿尔法平面,根据下述类型的关系式,容易地把两个来源组合起来:[(视频1×阿尔法)+(视频2×(255-阿尔法))]/255=最终视频。最后,一个如监视器的显示设备14,用于最后显示最终结果(即预先录制的材料和特别属于使用者的之组合)。The analysis and processing device 11 for receiving images and sounds of the user (person shown in black) picked up by the pickup device 10 includes a segmentation circuit for separating, for example, the user's face from the background, thereby defining An alpha plane (if the user is placed on stage, such a circuit could be based eg on a chroma key technique in front of a blue screen). Mixing and rendering device 12 is a circuit that uses the shape information analyzed in device 11 to synthesize the user with a pre-recorded video or audio-video background delivered by medium 13 (the said pre-recorded material is shown on the left side of the medium). out). This synthesis completes the audio synthesis for mixing the user's voice with the pre-recorded musical background from the song. Then, using the alpha plane defined by the device 11, the two sources are easily combined according to a relationship of the following type: [(Video 1 x Alpha)+(Video 2 x (255-Alpha))]/255= final video. Finally, a display device 14, such as a monitor, is used to finally display the final result (ie the combination of pre-recorded material and specific user-specific).

显然,为提高质量,在设备11中完成的分析可产生8比特阿尔法平面,其能在被镶饰的对象边缘(fronteer)有较好的混合。另外,还指出的是系统可以只替换使用者的头部或他或她的整个身体。Obviously, for improved quality, the analysis done in the device 11 can produce 8-bit alpha planes that allow for better blending at the fronteer of veneered objects. Additionally, it is noted that the system may replace only the user's head or his or her entire body.

相对于音频-视频来源的类型,可考虑不同的情况:Different situations can be considered with respect to the type of audio-visual source:

(a)两个音频/视频来源没有被压缩:这一选择可用于例如卡拉OK餐馆,演唱者的全部身体被镶饰在剪辑/影片中时(预先录制的数据可存储在磁带上,且偶然演唱者视频可被分析并直接传输到视频混合器中);(a) Both audio/video sources are not compressed: this option can be used e.g. in karaoke restaurants, where the singer's full body is framed in the clip/film (pre-recorded data can be stored on tape and occasionally Singer videos can be analyzed and transferred directly to the video mixer);

(b)一个或两个来源被压缩:对这一情况的一个适配的架构(framework)是新近发展起来MPEG-4标准,其能对对象的形状和阿尔法平面—这里是偶然使用者的面孔进行编码-(MPEG-4标准已定义了一个能使音频和视频对象合成的整个系统架构)。(b) One or both sources are compressed: an adapted framework for this situation is the newly developed MPEG-4 standard, which enables object shapes and alpha planes—here the face of the casual user Encoding - (The MPEG-4 standard has defined an overall system architecture that enables the synthesis of audio and video objects).

也可考虑本发明应用的不同情况:Different cases of application of the invention can also be considered:

(a)使用者可能想记录混合操作的结果,这在图2中示出,图2示出了与图1实施方案相似的系统,只是包括了一个额外的录制设备25;(a) the user may wish to record the results of the mixing operation, which is illustrated in Figure 2, which shows a system similar to the embodiment of Figure 1, but including an additional recording device 25;

(b)在一些情况下,卡拉OK系统可在线工作:则预先录制的剪辑可存储于数据库(例如互联网)上,并且使用者在家中录制他或她的表演并打算产生卡拉OK剪辑的组合并将其放到他或她的个人主页上(在这一情况下,压缩技术的使用尤其有用,而且更普遍地,在所有应用中它都运行于带宽所限的环境中);(b) In some cases, karaoke systems may work online: then pre-recorded clips may be stored on a database (e.g., the Internet), and the user records his or her performance at home and intends to generate a combination of karaoke clips and Put it on his or her personal home page (the use of compression techniques is especially useful in this case, and more generally in all applications that operate in bandwidth-constrained environments);

(c)另外,在一些情况下,使用者可能打算只把他或她的头部放在原唱的头部的位置,其包括在混合与再现设备12中的进一步处理,因为使用者头部的位置需要与原唱身体的取向和姿势匹配。(c) In addition, in some cases, the user may intend to only place his or her head in the position of the original singer's head, which includes further processing in the mixing and reproduction device 12, because the user's head The position of the song needs to match the orientation and posture of the original singer's body.

Claims (1)

1.一种卡拉OK系统,其用于在一个如视频剪辑或影片的序列过程中演唱,并包括一系列用于拾取使用者的图象和声音的拾取设备,一个用于把使用者的至少一部分与背景分离的分析与处理设备,一个用于把所述分析与处理设备的输出信号与预先录制的材料组合起来的混合与再现设备,以及一个用于显示该组合信号的显示设备。1. A kind of karaoke system, it is used for singing in a sequence process as video clip or film, and comprises a series of pick-up equipment that is used to pick up user's image and sound, one is used for at least An analyzing and processing device separated from the background, a mixing and reproducing device for combining the output signal of said analyzing and processing device with pre-recorded material, and a display device for displaying the combined signal.
CN01801723A 2000-06-20 2001-06-15 Karaoka system Pending CN1383543A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP00401758 2000-06-20
EP00401758.8 2000-06-20

Publications (1)

Publication Number Publication Date
CN1383543A true CN1383543A (en) 2002-12-04

Family

ID=8173734

Family Applications (1)

Application Number Title Priority Date Filing Date
CN01801723A Pending CN1383543A (en) 2000-06-20 2001-06-15 Karaoka system

Country Status (6)

Country Link
US (1) US20020007718A1 (en)
EP (1) EP1297692A2 (en)
JP (1) JP2004501576A (en)
KR (1) KR20020026374A (en)
CN (1) CN1383543A (en)
WO (1) WO2001099413A2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1312912C (en) * 2004-10-21 2007-04-25 上海交通大学 Entertainment system for video frequency real time synthesizing and recording
CN101945216A (en) * 2009-07-03 2011-01-12 奥林巴斯映像株式会社 Camera device and moving image reproduction method
WO2014169653A1 (en) * 2013-08-28 2014-10-23 中兴通讯股份有限公司 Method and device for optimizing image synthesis
WO2016177296A1 (en) * 2015-05-04 2016-11-10 腾讯科技(深圳)有限公司 Video generation method and apparatus
CN110164242A (en) * 2019-06-04 2019-08-23 平顶山学院 A kind of vocals simulative training platform

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2389221A (en) * 2002-05-15 2003-12-03 Stuart Arnold Recording to provide a rock star experience
US7053915B1 (en) * 2002-07-30 2006-05-30 Advanced Interfaces, Inc Method and system for enhancing virtual stage experience
US7734070B1 (en) * 2002-12-31 2010-06-08 Rajeev Sharma Method and system for immersing face images into a video sequence
US7528890B2 (en) * 2003-05-02 2009-05-05 Yoostar Entertainment Group, Inc. Interactive system and method for video compositing
AU2004281154A1 (en) * 2003-10-16 2005-04-28 Novartis Vaccines And Diagnostics, Inc. 2,6-disubstituted quinazolines, quinoxalines, quinolines and isoquinolines as inhibitors of Raf kinase for treatment of cancer
US7517219B2 (en) * 2004-02-20 2009-04-14 Mcdonald Michael Method of providing specialized dance videos
US20050206751A1 (en) * 2004-03-19 2005-09-22 East Kodak Company Digital video system for assembling video sequences
WO2006052666A2 (en) * 2004-11-04 2006-05-18 Allan Robert Staker Apparatus and methods for encoding data for video compositing
KR20060127459A (en) * 2005-06-07 2006-12-13 엘지전자 주식회사 Digital broadcasting terminal and method thereof having digital broadcasting content conversion function
US8172638B2 (en) * 2005-08-06 2012-05-08 Parental Media LLC Method and apparatus for education and entertainment
US20070122786A1 (en) * 2005-11-29 2007-05-31 Broadcom Corporation Video karaoke system
GB0525789D0 (en) * 2005-12-19 2006-01-25 Landesburg Andrew Live performance entertainment apparatus and method
JP2007228343A (en) * 2006-02-24 2007-09-06 Orion Denki Kk Digital broadcast receiver
US8572642B2 (en) 2007-01-10 2013-10-29 Steven Schraga Customized program insertion system
US20080276792A1 (en) * 2007-05-07 2008-11-13 Bennetts Christopher L Lyrics superimposed on video feed
EP2141689A1 (en) 2008-07-04 2010-01-06 Koninklijke KPN N.V. Generating a stream comprising interactive content
US8824861B2 (en) 2008-07-01 2014-09-02 Yoostar Entertainment Group, Inc. Interactive systems and methods for video compositing
CN102742261A (en) * 2010-05-24 2012-10-17 联发科技(新加坡)私人有限公司 Method for generating multimedia data to be displayed on display device and related multimedia playback device
CN102231272A (en) * 2011-01-21 2011-11-02 辜进荣 Method and device for synthesizing network videos and audios
CN102496359A (en) * 2011-11-28 2012-06-13 华为终端有限公司 Method and device for realizing multi-party remote karaoke
EP2805483A4 (en) * 2012-01-20 2016-03-02 Karaoke Reality Video Inc Interactive audio/video system and method
WO2014182508A1 (en) 2013-05-06 2014-11-13 Yoostar Entertainment Group, Inc. Audio-video compositing and effects
US20190018572A1 (en) * 2015-01-13 2019-01-17 Google Inc. Content item players with voice-over on top of existing media functionality

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10282975A (en) * 1997-04-04 1998-10-23 Amtex Kk Karaoke system
US6514083B1 (en) * 1998-01-07 2003-02-04 Electric Planet, Inc. Method and apparatus for providing interactive karaoke entertainment
JP2000209500A (en) * 1999-01-14 2000-07-28 Daiichikosho Co Ltd A method of synthesizing and displaying a person image separately shot on a recorded background image and displaying and outputting the same, and a karaoke apparatus employing the method

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1312912C (en) * 2004-10-21 2007-04-25 上海交通大学 Entertainment system for video frequency real time synthesizing and recording
CN101945216A (en) * 2009-07-03 2011-01-12 奥林巴斯映像株式会社 Camera device and moving image reproduction method
CN101945216B (en) * 2009-07-03 2016-04-13 奥林巴斯株式会社 Camera device and moving image reproduction method
WO2014169653A1 (en) * 2013-08-28 2014-10-23 中兴通讯股份有限公司 Method and device for optimizing image synthesis
WO2016177296A1 (en) * 2015-05-04 2016-11-10 腾讯科技(深圳)有限公司 Video generation method and apparatus
CN110164242A (en) * 2019-06-04 2019-08-23 平顶山学院 A kind of vocals simulative training platform
CN110164242B (en) * 2019-06-04 2020-12-08 平顶山学院 A vocal singing simulation training platform

Also Published As

Publication number Publication date
KR20020026374A (en) 2002-04-09
WO2001099413A3 (en) 2002-03-07
US20020007718A1 (en) 2002-01-24
EP1297692A2 (en) 2003-04-02
WO2001099413A2 (en) 2001-12-27
JP2004501576A (en) 2004-01-15

Similar Documents

Publication Publication Date Title
CN1383543A (en) Karaoka system
US6514083B1 (en) Method and apparatus for providing interactive karaoke entertainment
JP3615195B2 (en) Content recording / playback apparatus and content editing method
MXPA05010595A (en) AUTOMATIC EXTRACTION OF FACES FOR USE ON RECORDED CONFERENCE TIME LINES.
JP2003111031A (en) Portable electronic image display device and audio player, and method for displaying and capturing a plurality of digital images
US20100209073A1 (en) Interactive Entertainment System for Recording Performance
KR20020065912A (en) Method, device and arrangement for inserting extra information
WO2001016935A1 (en) Information retrieving/processing method, retrieving/processing device, storing method and storing device
US6971882B1 (en) Method and apparatus for providing interactive karaoke entertainment
US20020188772A1 (en) Media production methods and systems
JP4030440B2 (en) Message reproducing apparatus, message recording and reproducing method, and program
CN1981525A (en) Information processing device, information processing method, recording medium and program
CN1068492C (en) Video accompaniment apparatus
KR100417369B1 (en) Apparatus and Method of multi-media with multi-channel
JP2007158527A (en) Signal processing apparatus, signal processing method, reproduction apparatus, and recording apparatus
WO2007071954A1 (en) Live performance entertainment apparatus and method
JP2000029483A (en) Karaoke equipment
JPH086577A (en) Karaoke equipment
JP4529632B2 (en) Content processing method and content processing apparatus
CN1719872A (en) Movie show entertainment system based on whole body fusion
JP3743321B2 (en) Data editing method, information processing apparatus, server, data editing program, and recording medium
JP2017092832A (en) Reproduction method and reproducer
JP2002290901A (en) Viewer video recording and playback device
JP4708898B2 (en) Digital watermark information embedded music information distribution system
JPH10149193A (en) Information processing apparatus and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication