CN101816007A - System for checking animation data, method for creating animation database, and system and program for registering animation data in animation database - Google Patents
- Publication number
- CN101816007A CN200880106643A
- Authority
- CN
- China
- Prior art keywords
- image
- animation
- login
- data
- animation data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/16—Analogue secrecy systems; Analogue subscription systems
- H04N7/173—Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
- H04N7/17309—Transmission or handling of upstream communications
- H04N7/17318—Direct or substantially direct transmission and handling of requests
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7847—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
- G06F16/785—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/48—Matching video sequences
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/25—Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
- H04N21/254—Management at additional data server, e.g. shopping server, rights management server
- H04N21/2541—Rights Management
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/27—Server based end-user applications
- H04N21/274—Storing end-user multimedia data in response to end-user request, e.g. network recorder
- H04N21/2743—Video hosting of uploaded data from client
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47202—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/8146—Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics
- H04N21/8153—Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics comprising still images, e.g. texture, background image
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/835—Generation of protective data, e.g. certificates
- H04N21/8352—Generation of protective data, e.g. certificates involving content or source identification data, e.g. Unique Material Identifier [UMID]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/84—Generation or processing of descriptive data, e.g. content descriptors
- H04N21/8405—Generation or processing of descriptive data, e.g. content descriptors represented by keywords
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Computer Security & Cryptography (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Computer Graphics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Processing Or Creating Images (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Storage Device Security (AREA)
Abstract
Description
Technical Field
The present invention relates to a collation system that checks browsing video data made available for viewing at a site on a network; a method for creating a video database in which a plurality of video data items used to check browsing video data are registered as information; and a registration system and program for registering video data in the video database.
Background Art
In recent years, as information such as novels, music, films and photographs has been digitized and the Internet has spread, copyright infringement on the Internet has come to be regarded as a problem. The cause is that digitized information can be copied exactly without loss of quality, that secondary or tertiary uses such as editing and adaptation are easy, and that the result can be transmitted to the public over the Internet, so that other people's works can be used with little effort. Because individuals as well as organizations such as companies operate in this environment, preventing and managing copyright infringement has become difficult.
Recently, video sharing services have appeared on the Internet, and copyright infringement has become a major problem for them. A video sharing service shares video data uploaded by one user so that other users can download it, which allows videos to be published and viewed. At present, tens of thousands of video data items are posted to such a service each day, and many of them are illegal items that infringe copyright: for example, films, television programs, live footage and promotional videos uploaded without the permission of the copyright holder.
At present, copyright management in video sharing services consists only of a clause in the terms of use that prohibits posting infringing video data. It relies on each user's ethics, and no dedicated checking system is provided. Even when illegal video data has been uploaded, it is deleted only after a third party points it out. Copyright holders and others download and play uploaded video data and find and report illegal items by watching them, but checking all of the video data, which grows by tens of thousands of items every day, is not realistic. Moreover, even after illegal video data has been reported and deleted, users often post it again, so existing countermeasures are not effective.
Most existing video retrieval techniques, meanwhile, assign keywords to a video; the searcher enters a keyword and videos matching it are retrieved. With keyword-based retrieval, an appropriate video cannot be found if the keywords assigned to it are inappropriate. Keyword assignment also varies between individuals, and keywords do not necessarily coincide because of differences in subjective judgment or wording. Even when keywords do coincide, videos other than the intended one may be retrieved.
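In contrast to such keyword-based retrieval, Patent Document 1 discloses the following video retrieval method. Frame images at which the scene changes are extracted as representative frame images. Feature quantities such as the area of a specific color in the image, the length of time between representative frames, the integral of the sound intensity, and the length of time for which the sound is at or above a given intensity are calculated from all or part of each representative frame image, and codes corresponding to the feature quantities are assigned. Each representative frame is given a name consisting of a code or code sequence, a name for the video is built from the sequence of frame names corresponding to the sequence of representative frames, and this video name is used for collation.
Patent Document 1: Japanese Unexamined Patent Application Publication No. H7-114567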
Summary of the Invention
At present, however, filtering of illegal video data is not adequately achieved in video sharing services, and an effective retrieval method is being sought. The retrieval method of Patent Document 1 aims to find matching scenes within a video; it is not intended for finding illegal video data in a video sharing service.
To find illegal video data in a video sharing service, it is necessary to investigate whether the uploaded videos are illegal, and each of the many uploaded videos must be checked. An uploaded video is sometimes the entire original and sometimes only an arbitrarily cut portion of it. The same video may also be uploaded many times, so continuous monitoring is required.
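Patent Document 1 does not disclose a method for finding illegal video data in such a video sharing service. Moreover, because the method of Patent Document 1 collates videos by the sequence of frame names corresponding to the sequence of representative frames (the video name), each uploaded video must be downloaded and its video name computed before collation. Since every uploaded video must be downloaded each time, retrieval takes time, and download time becomes one of the bottlenecks to faster processing. In addition, video data cut into small pieces may not be collated correctly, and correct collation may be impossible depending on how the video was cut. Video names must also be computed for video data that is updated continually, so the processing load becomes enormous. Finally, because retrieval depends on the feature quantities from which the video name is computed, different images are retrieved whenever their extracted feature quantities are identical or similar, so identification accuracy is low.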
Accordingly, an object of the present invention is to provide a collation system that can efficiently check browsing video data made available for viewing at a site on a network. Another object is to provide a method for creating a video database, in which video data is registered as information, that increases the number of video data items registered per unit of storage capacity and also makes retrieval easy. A further object is to provide a registration system and program that register video data in the video database by a simple method while improving security against information leakage. A still further object is to provide copyright management or new business models using the collation system, the video database creation method, the registration system and the program. These objects are stated as representative of objects that are closely related; they are interrelated or overlap one another.
The collation system of the present invention checks browsing video data made available for viewing at a site on a network, and is characterized by comprising: a video database in which a plurality of registered video data items are registered as information; and a collation unit that collates the browsing video data against the registered video data registered in the video database. The browsing video data has a still image displayed as a sample. The video database has a recording medium on which is recorded, for each registered video data item, a registration image group consisting of a plurality of registration images generated from a plurality of frame images of the playback video obtained by playing back that registered video data. The collation unit collates a search image generated from the sample still image of the browsing video data against the registration images recorded on the recording medium of the video database.
The collation system preferably has a function of extracting sample still images from the browsing video data arbitrarily or at random. For the registration image group of each registered video data item, the result of a correlation operation between any one registration image in the group and another is preferably smaller than a preset threshold. At least binarization is preferably performed when the registration images are generated from the frame images. Likewise, at least binarization is preferably performed when the search image is generated from the sample still image of the browsing video data, and the collation unit collates the search image and the registration images by performing a correlation operation on them.
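The image-to-image collation described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the text only says "binarization" and "correlation operation", so the fixed brightness threshold, the use of the fraction of agreeing pixels as the correlation score, the 0.9 match threshold, and the function names `binarize`, `correlation` and `collate` are all assumptions made for the example.

```python
from typing import List, Sequence

Image = List[List[int]]  # rows of grayscale pixels, 0..255 (assumed input format)


def binarize(image: Image, threshold: int = 128) -> List[int]:
    """Flatten a grayscale image into 0/1 pixels using a fixed threshold (assumption)."""
    return [1 if px >= threshold else 0 for row in image for px in row]


def correlation(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of pixel positions at which two binary images agree (0.0 to 1.0)."""
    if len(a) != len(b) or not a:
        raise ValueError("images must be non-empty and of equal size")
    return sum(1 for x, y in zip(a, b) if x == y) / len(a)


def collate(search: Sequence[int],
            registered: Sequence[Sequence[int]],
            match_threshold: float = 0.9) -> int:
    """Return the index of the best-matching registration image, or -1 if none matches."""
    best_index, best_score = -1, match_threshold
    for i, reg in enumerate(registered):
        score = correlation(search, reg)
        if score >= best_score:
            best_index, best_score = i, score
    return best_index
```

In use, the search image would be produced by `binarize` from the sample still image and compared against every image in a registration image group; a return value of -1 means that no registration image reached the assumed threshold.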
In the collation system, image feature information for the frame image corresponding to each registration image may be recorded in the video database, and the collation unit may calculate image feature information for the sample still image of the browsing video data and collate it against the image feature information recorded in the video database. Furthermore, at least one keyword may be set for the registered video data and at least one item of keyword data attached to the browsing video data, and the collation unit may collate the keyword obtained from the keyword data attached to the browsing video data against the keyword set for the registered video data.
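As one hedged illustration of collating by image feature information, the sketch below uses a coarse luminance histogram as the feature and the sum of absolute differences as the comparison. The text does not say which features are used, so the histogram, the bin count and the function names are assumptions chosen only to make the idea concrete.

```python
from typing import List, Sequence

Image = List[List[int]]  # rows of grayscale pixels, 0..255 (assumed input format)


def luminance_histogram(image: Image, bins: int = 4) -> List[float]:
    """Share of pixels falling into each of `bins` equal-width brightness ranges."""
    counts = [0] * bins
    total = 0
    for row in image:
        for px in row:
            counts[min(px * bins // 256, bins - 1)] += 1
            total += 1
    if total == 0:
        raise ValueError("image must contain at least one pixel")
    return [c / total for c in counts]


def histogram_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute differences; 0.0 means identical histograms, 2.0 is the maximum."""
    if len(a) != len(b):
        raise ValueError("histograms must have the same number of bins")
    return sum(abs(x - y) for x, y in zip(a, b))
```

A feature comparison of this kind could serve as an inexpensive first pass that narrows the candidates before images are compared directly.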
In the collation system, when the browsing video data is registered in the video database, identification data is preferably attached to the browsing video data, and more preferably the viewing conditions of the browsing video data are changed according to the identification data.
The video database creation method of the present invention creates a video database in which a plurality of registered video data items are registered as information, and is characterized by comprising: a playback step of playing back one registered video data item to obtain a playback video; a generation step of generating a plurality of registration images by applying image processing that includes binarization to a plurality of frame images of the playback video; and a recording step of recording on a recording medium a registration image group consisting of the plurality of registration images. Between the playback step and the recording step, the method further comprises a selection step of selecting images so that the result of a correlation operation between any one registration image in the registration image group and another is smaller than a preset threshold.
In this method, the selection step preferably includes performing a correlation operation between the registration images generated from the frame images of the playback video and, when the result exceeds the preset threshold, deleting one of the two registration images.
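The selection step can be sketched as a greedy filter over the binarized frame images: a candidate is kept only if its correlation with every image already kept is below the threshold. This is one possible reading under stated assumptions: the fraction of agreeing pixels stands in for the unspecified correlation operation, the later of two similar images is the one discarded, and `select_registration_images` is a hypothetical name.

```python
from typing import List, Sequence


def agreement(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of positions at which two equally sized binary images agree."""
    if len(a) != len(b) or not a:
        raise ValueError("images must be non-empty and of equal size")
    return sum(1 for x, y in zip(a, b) if x == y) / len(a)


def select_registration_images(candidates: Sequence[Sequence[int]],
                               threshold: float = 0.9) -> List[Sequence[int]]:
    """Keep a candidate only if it correlates below `threshold` with all kept images."""
    kept: List[Sequence[int]] = []
    for image in candidates:
        if all(agreement(image, other) < threshold for other in kept):
            kept.append(image)
    return kept
```

Because near-duplicate frames within a scene are dropped, the registration image group stays small while every pair of remaining images correlates below the threshold, which is consistent with the stated aim of registering more video data per unit of capacity.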
The method may include a registration data recording step of calculating image feature information for the frame image corresponding to a registration image and recording that image feature information in association with the registration image.
In this method, at least the playback step and the generation step are preferably executed on the information processing device of the registrant who holds the registered video data item.
The registration system of the present invention registers registered video data in a video database in which a plurality of registered video data items are registered as information, and is characterized by comprising a network, a video database server connected to the network, and an information processing device connected to the network. The information processing device plays back one registered video data item to obtain a playback video, applies at least binarization to a plurality of frame images of the playback video to generate a plurality of transmission images, and sends the transmission images to the video database server. The video database server generates a plurality of registration images from the transmission images and records on a recording medium a registration image group consisting of those registration images.
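As a hedged illustration of what a transmission image might look like on the wire, the sketch below packs a binarized image at one bit per pixel. The text does not specify any transfer format, so the bit order and the function names `pack_binary_image` and `unpack_binary_image` are assumptions for the example only.

```python
from typing import List, Sequence


def pack_binary_image(pixels: Sequence[int]) -> bytes:
    """Pack 0/1 pixels into bytes, most significant bit first (assumed layout)."""
    out = bytearray((len(pixels) + 7) // 8)
    for i, px in enumerate(pixels):
        if px:
            out[i // 8] |= 0x80 >> (i % 8)
    return bytes(out)


def unpack_binary_image(data: bytes, pixel_count: int) -> List[int]:
    """Recover the first `pixel_count` 0/1 pixels from packed bytes."""
    if pixel_count > len(data) * 8:
        raise ValueError("pixel_count exceeds the packed data length")
    return [(data[i // 8] >> (7 - i % 8)) & 1 for i in range(pixel_count)]
```

At one bit per pixel, a binarized image of 128 by 96 pixels occupies 1,536 bytes. Sending only such images, rather than the video itself, is in line with the stated aim of limiting what leaves the registrant's device.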
In the registration system, the transmission images are preferably the registration images themselves, and the video database server records the transmission images sent from the information processing device on the recording medium as the registration images. The information processing device preferably performs a correlation operation between images based on the frame images of the playback video and, when the result exceeds a preset threshold, deletes one of the two images.
The program of the present invention, in order to register registered video data in a video database in which a plurality of registered video data items are registered as information, causes an information processing device connected to the video database server via a network to function as: a playback unit that plays back the registered video data to obtain a playback video; an image processing unit that applies at least binarization to a plurality of frame images of the playback video to generate transmission images; and a transmission unit that sends the plurality of transmission images to the video database server via the network.
The program preferably also causes the information processing device to function as a selection unit that performs a correlation operation between images based on the frame images of the playback video and, when the result exceeds a preset threshold, deletes one of the two images. The information processing device may also be made to function as a threshold input unit, with the selection unit using the threshold entered through the input unit.
With the collation system of the present invention, browsing video data made available for viewing at a site on a network can be collated against the registered video data registered in the video database. Management of registered video data and the protection or licensing of copyright can therefore be carried out collectively. In addition, checks can be made within the scope of what is registered in the video database, infringement resulting from uploads of illegal video data can be avoided, and the assertion of rights by registrants can be avoided. Other effects are described in the embodiments below.
Brief Description of the Drawings
FIG. 1 is a schematic configuration diagram of the collation system of the present invention.
FIG. 2 is a block diagram outlining the flow of data.
FIG. 3(A) schematically shows the data structure of the video database, and FIGS. 3(B) to 3(E) each schematically show the data structure of registered video data.
FIG. 4 is a flowchart of the video database creation processing.
FIG. 5 is a flowchart of the collation processing.
FIG. 6 is a configuration diagram of the collation server.
FIGS. 7(A) to 7(C) show error rates in the video database.
FIG. 8 shows the error rate in a collation system that uses sample still images as search images.
Detailed Description of Embodiments
Embodiments of the present invention are described below with reference to the drawings, but the present invention is not limited to the following examples. FIG. 1 is a schematic configuration diagram showing the collation system of the present invention and related systems, and FIG. 2 is a block diagram schematically showing the flow of data. In FIG. 1, a network 11 connecting a client group 10 is formed; a site 12 that provides video data to the client group 10 for viewing is set up on the network 11; and a collation server 1 is provided that checks browsing video data 13 made available for viewing at the site 12. The collation system as a whole and its business model are outlined first, after which the specific configuration of the collation system and related elements is described in detail.
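Within the collation server 1 are provided a video database 2, in which a plurality of registered video data items are registered as information, and a collation unit 3, which collates video data against the registered video data registered in the video database 2. Registered video data is supplied by registrants 5 and 6 and registered in the video database 2.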
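Registrants 5 and 6 are, for example, the operator of the site 12, or the copyright holders or producers of the registered video data. As shown in FIG. 2, registration data such as the title, viewing conditions, keywords and image feature information can be registered together with the registered video data. Registration data may also be registered in the collation server or the site by parties other than registrants 5 and 6.
The client group 10 consists of a plurality of clients that use the site 12. The clients are connected to the network 11 and can send requests, such as requests to view video data, to the server of the site 12. Examples of the client group 10 include users of a video sharing site and members of a paid content service; examples of specific terminal devices include personal computers, portable information terminals and mobile phones.
The network 11 is a telecommunication network connecting the clients' terminal devices; examples include the Internet, a WAN (wide area network) and a LAN (local area network).
The site 12 holds content through which browsing video data is provided to the client group 10; in FIG. 1, content A, B and C with different viewing conditions is prepared. Examples of content A, B and C include completely free content, content of which only part can be viewed, and paid content. The site 12 may also have keywords set for posted browsing video data at the time of posting, and may be configured to display at least one still image (generally called a thumbnail) as a sample of posted browsing video data.
Browsing video data 13 is preferably made available for viewing at the site 12 after being collated by the collation server 1, but as shown in FIG. 2 it may also be uploaded directly to the site 12 without passing through the collation server 1. In that case, the collation server 1 obtains the browsing video data from the site 12 and then collates it against the registered video data. In FIG. 1, browsing video data 13 is uploaded from the client group 10 to the server of the site 12 via the network 11. Uploaded browsing video data 15 is downloaded via the network 11 to the terminal device of another client in response to that client's viewing request.
Browsing video data 13 may also be made available for viewing by means other than a client, or sent to the site 12 by means other than the network. Alternatively, only a link may be posted on the site 12 while the browsing video data 13 itself is kept on the terminal device of its provider. In that case, when another client issues a viewing request through the link posted on the site 12, the data is downloaded directly from the provider's terminal device to that client's terminal device.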
In the collation system of the present invention, for faster collation it is preferable that the browsing video data have a still image displayed as a sample. A still image displayed as a sample is a frame image of the playback video of the browsing video data, or a reduced version of a frame image with a smaller data size, and is an image displayed at least on the collation server. Typically it is the first frame image of the playback video, but it is not limited to this. There may be one sample still image or several. The term covers not only the image displayed on the site 12 as the thumbnail of posted browsing video data, but also a still image that is not displayed on the site 12 yet is downloaded and displayed as a sample of the browsing video data when a client requests viewing, and a still image based on an arbitrarily or randomly extracted frame image that, as described later, is not displayed on the client or the network. As shown in FIG. 1, when browsing video data 13 is uploaded from the client group 10 to the site 12 via the collation server 1, the browsing video data 13 is regarded as having a sample still image provided that the interface between the information processing device of the client group 10 and that of the collation server 1 allows a frame image of the playback video of the browsing video data 13, or a reduced version of it, to be downloaded.
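It is also preferable to attach to the browsing video data 13 at least one item of keyword data to facilitate collation and retrieval (see FIG. 2). The keyword data may be chosen freely by the poster of the browsing video data 13, or keywords identifying the field of the video data may be defined in advance. The site may also be configured so that nothing can be posted to the site 12 without entering a keyword. The keyword data may be used at the site 12 when clients search for videos, and may also be used in the video database 2 of the collation server 1 to make collation easier. For example, keywords may be recorded as registration data when registered video data is registered in the video database 2, and collation performed against the registered video data corresponding to the keyword data. Alternatively, registered video data may be registered in the video database 2 by field or keyword category, and collation performed within the field corresponding to the keyword data.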
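Narrowing the collation targets by keyword, as described above, could look like the following sketch. The dictionary layout of the registration data, the case-insensitive matching and the name `candidates_by_keyword` are assumptions for illustration; the text does not describe how keywords are stored or compared.

```python
from typing import Dict, Iterable, List, Set


def candidates_by_keyword(registered_keywords: Dict[str, Set[str]],
                          upload_keywords: Iterable[str]) -> List[str]:
    """Return IDs of registered videos sharing at least one keyword with the upload.

    If the upload carries no usable keyword, every registered video is a candidate.
    """
    wanted = {k.strip().lower() for k in upload_keywords if k.strip()}
    if not wanted:
        return sorted(registered_keywords)
    return sorted(
        video_id
        for video_id, keywords in registered_keywords.items()
        if wanted & {k.lower() for k in keywords}
    )
```

Because posters choose keywords freely, an empty result does not show that the upload is unregistered; a caller might reasonably fall back to collating against the whole database in that case.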
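Identification data is preferably attached to checked browsing video data 14 that has been collated by the collation server 1. The identification data may simply record the fact that the data has been collated by the collation server 1; it may indicate whether the browsing video data 13 is registered in the video database 2; and, when the browsing video data 13 is registered in the video database 2, it may also include the viewing conditions and part of the registration data.
For example, when browsing video data 14 carrying identification data indicating that it has been collated by the collation server 1 is downloaded to another client and then posted again from that client, the collation server 1 recognizes the identification data and can decide whether the browsing video data 14 needs to be collated. If the identification data includes whether the data is registered in the video database 2, viewing conditions can be changed according to registration status, and video data on the network can be managed. If the identification data further includes information such as the title and viewing conditions, viewing conditions can be set and managed in finer detail.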
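One way to picture the identification data and the decisions it enables is the sketch below. The field names, the condition labels "free", "paid" and "delete", and the rule of deleting registered data that carries no explicit condition are all hypothetical; the text describes only what kinds of information the identification data may carry.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class IdentificationData:
    """Hypothetical identification data attached to checked browsing video data."""
    collated: bool                            # already checked by the collation server
    registered: Optional[bool] = None         # found in the video database, if known
    viewing_condition: Optional[str] = None   # e.g. "paid" (labels are assumptions)


def needs_collation(ident: Optional[IdentificationData]) -> bool:
    """A re-posted item must be collated unless it carries a 'collated' record."""
    return ident is None or not ident.collated


def viewing_condition_for(ident: IdentificationData, default: str = "free") -> str:
    """Pick the viewing condition to apply once the item has been collated."""
    if not ident.registered:
        return default
    return ident.viewing_condition or "delete"
```

A real system would also need a way to confirm that the identification data has not been forged or stripped, which the text does not address.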
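Examples of viewing conditions include deleting the posted browsing video data 13 without making it viewable, treating it as paid content, charging according to the number of views, allowing only part of it to be viewed, and treating it as invalid content.
In FIG. 1, browsing video data 13 is collated against the video database 2 of the collation server 1, given identification data A to C according to the collation result, and uploaded to the corresponding content A, B or C prepared at the site 12.
Identification data may also be attached only to browsing video data that is registered in the video database 2. When the collation server 1 itself processes browsing video data according to the collation result, identification data need not be attached. For example, if none of the registered video data registered in the video database 2 may be uploaded for viewing, the collation server 1 can simply delete any browsing video data that matches registered video data, and no identification data need be attached to the browsing video data uploaded to the site 12. In other words, the fact that data has been uploaded to the site 12 itself means that it is not registered in the video database. Likewise, if the aim is only a simple survey or tracking of how many times video data is posted, the collation server 1 need only supply this information to the registrant, and no identification data need be attached to browsing video data 13.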
并且,核对服务器1或者站点12,在阅览动画数据13已登录在动画数据库2中时,可以对向动画数据库2登录了该动画数据的登录者提供与动画数据有关的信息(参照图2)。所谓与动画数据有关的信息,例如可以举出已登录的动画数据的投稿次数、阅览(下载)次数、投稿动画数据的客户机信息等。Furthermore, when the
另外,核对服务器1,对于作为阅览动画数据13投稿了已登录在动画数据库2中的动画数据的客户机,可以发送通告投稿该动画数据侵犯著作权的警告消息(参照图2)。通过发送这样的警告消息,能够提高客户机组10对于著作权的意识。In addition, the
当利用以上那样的核对系统时,登录者5、6通过把各自想要管理的动画数据作为登录动画数据登录在动画数据库2中,能够汇总地执行网络11上的登录动画数据的管理或者著作权的保护或者许诺。另外,站点12对于阅览动画数据,能够在已登录在动画数据库2中的范围内进行检查,能够避免上传违法的动画数据导致的侵权行为,并且避免登录者的行使权利。When using the collation system as above, the
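That is, collation server 1 can offer registrants 5 and 6 a service of registering the video data they want to manage in video database 2, and can also offer a service of managing part of the registered video data on their behalf. Collation server 1 can likewise offer site 12 a service of checking browsing video data to the extent registered in video database 2, and a service of managing part of the browsing video data on its behalf. Registrants 5 and 6, or site 12 itself, may also operate collation server 1.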
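Collation server 1 is described next. Collation server 1 includes a recording medium and an information processing device. The recording medium stores video database 2, in which multiple items of registered video data are registered as information. The information processing device has at least part of the function of collation unit 3, which collates browsing video data against the registered video data registered in video database 2.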
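One feature of the collation system based on collation server 1 of the present invention is that the final collation between browsing video data and registered video data is performed by comparing images with each other (including comparing digitized image data). That is, although the collation system of the present invention does not exclude conventional collation by keywords or feature quantities, it always performs a direct collation between a registration image generated from the playback video of the registered video data and a search image generated from the playback video of the browsing video data. Because images are collated directly against each other, collation accuracy can be improved. In addition, because a registration image group consisting of registration images corresponding to the main frame images of the playback video is recorded as the registered video data, the search can be performed reliably whichever scene of the registered video data the search image comes from.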
Therefore, even when collation uses only a search image generated from the sample still image of the browsing video data, collating it against the registration image group of the registered video data makes it possible to determine whether the browsing video data and the registered video data match or are similar. The sample still image itself is far smaller than the downloaded browsing video data, so download time is shortened and collation is fast. It therefore serves as effective collation data even for the many items of browsing video data uploaded to a site. For example, even if the browsing video data is several MB (megabytes) or more, the thumbnail itself is image data of only about 128 pixels (horizontal) × 96 pixels (vertical), so download time and the amount of computation are decisively reduced.
A collation system that relies on thumbnails, however, leaves open the possibility that a malicious client applies some image processing to the frame image used as the thumbnail in order to obstruct collation. The collation system therefore preferably also has a function of extracting sample still images from the browsing video data arbitrarily or at random. To realize this function, it suffices to use a program that causes the information processing device of the uploading client, of the collation server, of the site's server, or the like to act as a playback unit that plays back the browsing video data to obtain a playback video, and as an image generation unit that extracts arbitrary or random frame images from the playback video and generates sample still images from them. Here, it is preferable to generate multiple sample still images from multiple frame images that are separated in time. To improve the reliability of the collation system, the program preferably does not allow the client to control where frame images are extracted. A configuration in which it cannot be seen on the client or the network which frame images were extracted is also effective.
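The extraction of temporally separated, randomly chosen frames can be sketched as follows. This is only an illustration under stated assumptions: the function name `pick_sample_frames`, the idea of splitting the video into equal time segments, and the use of an operating-system random source are not taken from the text, which specifies only that the frames should be separated in time and that the client should not control the positions.

```python
import random


def pick_sample_frames(num_frames, num_samples, rng=None):
    """Pick one random frame index from each of num_samples equal time segments.

    Hypothetical helper: splitting into segments keeps the chosen frames
    separated in time; the caller cannot predict the positions when the
    default OS-backed random source is used.
    """
    if num_frames <= 0 or num_samples <= 0:
        raise ValueError("num_frames and num_samples must be positive")
    rng = rng or random.SystemRandom()
    num_samples = min(num_samples, num_frames)
    indices = []
    for k in range(num_samples):
        start = k * num_frames // num_samples        # first frame of segment k
        end = (k + 1) * num_frames // num_samples    # one past the last frame
        indices.append(rng.randrange(start, end))
    return indices


if __name__ == "__main__":
    # A 90-minute film at 24 fps has 129,600 frames; draw five sample positions.
    print(pick_sample_frames(129_600, 5))
```

Each returned index falls in its own segment, so the samples are in increasing time order and never coincide.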
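In video database 2, each item of registered video data is recorded as a registration image group consisting of multiple registration images generated from multiple frame images of the playback video obtained by playing back that registered video data. That is, as shown in FIG. 3(A), video database 2 holds as many registration image groups 16 as there are items of registered video data, and each registration image group 16 contains multiple registration images 17.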
If all frame images of the playback video are registered as they are, the registered video data holds the maximum amount of information, but as a result computation time becomes long and an enormous storage capacity is required. Video currently used in Japan runs at 29.97 fps (frames per second) for TV broadcasts under the NTSC standard and at 24 fps for films, so one second consists of about 30 and 24 still-image frames, respectively. Moreover, each frame image has pattern information, made up of an array of as many pixels as the resolution specifies, and color information for every pixel, so the total data size becomes extremely large when all frame images are registered as the registration image group. For example, a full-color (24-bit) image with a resolution of 600×480 amounts, by simple calculation, to a data size of 27 MB, and a 90-minute film (24 fps) has 129,600 frame images, so performing the computation is quite unrealistic even for a single item of video data.
It is therefore preferable to generate the registration images by applying at least binarization to the frame images of the playback video. Binarization converts an image so that each pixel is represented by one of two colors, white and black (0 and 1). Each pixel of ordinary video data has at least grayscale level information (8 bits), and in most cases color information (24 bits), so binarization markedly reduces the data size of the registration images. Moreover, when a binarized registration image and a likewise binarized search image are collated by correlation, the computation is faster than correlating full-color images or grayscale images with each other, and collation accuracy is sometimes improved as well, so binarization is extremely useful. Because binarized images require little computation, a dedicated computing device can also be built in hardware using integrated circuits such as LSIs.
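The size reduction from 24 bits to 1 bit per pixel can be checked with a few lines of arithmetic. The sketch below assumes uncompressed storage (3 bytes per full-color pixel, 8 binary pixels packed per byte) and reuses the 600×480, 24 fps, 90-minute example; the packing scheme is an assumption, not something the text specifies.

```python
# Raw storage needed to register every frame, full color versus binary.
WIDTH, HEIGHT = 600, 480   # resolution used in the example above
FPS = 24                   # film frame rate
MINUTES = 90               # running time

frames = MINUTES * 60 * FPS                 # number of frame images
full_color_bytes = WIDTH * HEIGHT * 3       # 24 bits = 3 bytes per pixel
binary_bytes = WIDTH * HEIGHT // 8          # 1 bit per pixel, 8 pixels per byte

print("frames:", frames)
print("bytes per frame (24-bit):", full_color_bytes)
print("bytes per frame (1-bit): ", binary_bytes)
print("total bytes (24-bit):", frames * full_color_bytes)
print("total bytes (1-bit): ", frames * binary_bytes)
print("reduction factor:", full_color_bytes // binary_bytes)
```

Under these assumptions binarization alone shrinks the data by a factor of 24, before any selection of frames is applied.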
Binarization methods fall broadly into two types: the fixed threshold method and the region specification method. The former sets the binarization threshold to a certain grayscale value; the latter first decides the proportion of the two values over the whole image and then determines the binarization threshold from the histogram of density values. With T as the binarization threshold, f(i, j) as the element of each pixel of the original image, and g(i, j) as the element of each pixel of the binary image, binarization can be defined as in formula (1) below.
[Formula 1]
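The formula image itself is not reproduced in this text. With the symbols defined above, thresholding is conventionally written as below; whether the comparison is ≥ or >, and which of the two values stands for white, are assumptions rather than the formula as published.

```latex
g(i,j) =
\begin{cases}
  1, & f(i,j) \ge T \\
  0, & f(i,j) < T
\end{cases}
```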
When the frame image is a color image, the color information is first removed to obtain a grayscale image containing only level information, which is then binarized. Other image processing may also be applied to the frame image or the binary image. For example, resolution conversion or edge enhancement may be performed before or after binarization.
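The two binarization approaches and the grayscale conversion can be sketched in plain Python. Several details here are assumptions: the luma weights (the common BT.601 values), the convention that pixels at or above the threshold become 1, and the reading of the region specification method as choosing the threshold so that a given fraction of pixels becomes 1. The function names are hypothetical.

```python
def to_gray(rgb_image):
    """Convert rows of (r, g, b) tuples (0-255) to 8-bit gray levels.

    Uses integer BT.601 luma weights (assumed; the text names no weights).
    """
    return [[(299 * r + 587 * g + 114 * b) // 1000 for (r, g, b) in row]
            for row in rgb_image]


def binarize_fixed(gray, threshold=128):
    """Fixed threshold method: 1 where the gray level is >= threshold."""
    return [[1 if v >= threshold else 0 for v in row] for row in gray]


def ptile_threshold(gray, white_ratio):
    """Pick T from the histogram so that about white_ratio of pixels become 1."""
    hist = [0] * 256
    total = 0
    for row in gray:
        for v in row:
            hist[v] += 1
            total += 1
    target = white_ratio * total
    count = 0
    for level in range(255, -1, -1):   # walk down from the brightest level
        count += hist[level]
        if count >= target:
            return level
    return 0


if __name__ == "__main__":
    frame = [[(255, 255, 255), (0, 0, 0)],
             [(200, 200, 200), (50, 50, 50)]]
    gray = to_gray(frame)
    print(binarize_fixed(gray, 128))
    print(binarize_fixed(gray, ptile_threshold(gray, 0.5)))
```

Because the second method fixes the proportion of white pixels instead of a gray level, it adapts to frames that are dark or bright overall, at the cost of building a histogram for each frame.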
Unlike data format conversion, compression and encryption, binarization is essentially an irreversible conversion that only discards information. Format conversion, compression and encryption keep the information needed for the inverse conversion, in the data itself or in a separate file, so that the original frame image can be restored. Binarization, by contrast, discards the color information and level information of the original frame image, and the original frame image cannot be restored from the binary image alone.
Being unable to restore the original frame image may look like a drawback at first glance, but it effectively prevents leakage of the original video data. When a registrant supplies original video data to the collation server as registered video data for registration in the video database, there is a risk that the original video data leaks through an accident in distribution, an accident during network transfer, or a deliberate leak. The damage caused by the leak of, for example, a film in release or unreleased live footage is enormous. However, if the registrant's information processing device applies at least binarization to the frame images of the registered video data and then supplies the binary images to the collation server, then even if the binary images leak, the original video data cannot be restored from them, so leakage of the original video data is avoided. A registration system and program that use this effect to register video data in the video database are described later.
To reduce the number of registration images, it is preferable to drop similar images instead of registering them, so that only a selection is registered. To this end, a correlation is computed between the frame images of the registered video data, or between images obtained by processing the frame images (including binary images); when the result is greater than a preset threshold, one of the two images is removed, and when it is smaller than the threshold, at least one of the images is registered. The threshold is the similarity criterion for the selection. A high threshold generally improves collation accuracy but increases the number of registration images; a low threshold lowers collation accuracy but reduces the number of registration images, so the optimum threshold is preferably determined in advance by experiment or the like. If the registrant is allowed to change the threshold, it can be raised for important video data and set lower for less important video data. A registration image need not be correlated with all other registration images; it is sufficient that its correlation with at least one other registration image is below the threshold.
As the correlation operation, the similarity can be calculated, for example, as the inner product, the Euclidean distance, or the correlation coefficient between registration images. When the pixel elements of N×N-pixel registration images F and G are expressed as one-dimensional column vectors f and g, the inner product can be calculated with formula (2), the Euclidean distance with formula (3), and the correlation coefficient with formula (4). The calculation of similarity is not limited to these methods.
[Formula 2]
[Formula 3]
[Formula 4]
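The three formula images are not reproduced in this text. For column vectors f and g with N² elements, the standard textbook definitions of the three measures are given below as a reference; the published formulas may differ in normalization, and the symbols f_k, g_k and the means are introduced here for illustration.

```latex
% Inner product (cf. formula (2))
\langle f, g \rangle = f^{\mathsf{T}} g = \sum_{k=1}^{N^2} f_k \, g_k

% Euclidean distance (cf. formula (3))
d(f, g) = \lVert f - g \rVert = \sqrt{\sum_{k=1}^{N^2} (f_k - g_k)^2}

% Correlation coefficient (cf. formula (4)), with means \bar{f} and \bar{g}
r(f, g) = \frac{\sum_{k=1}^{N^2} (f_k - \bar{f})(g_k - \bar{g})}
               {\sqrt{\sum_{k=1}^{N^2} (f_k - \bar{f})^2}\,
                \sqrt{\sum_{k=1}^{N^2} (g_k - \bar{g})^2}}
```

Note that a larger inner product or correlation coefficient means greater similarity, whereas a larger Euclidean distance means less similarity, so the direction of the threshold comparison depends on the measure chosen.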
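FIGS. 3(B) to 3(E) schematically show the data structures of the registered video data described above. In FIG. 3(B), all frame images of the playback video are used as they are as registration image group 16; each registration image 17 is a frame image and carries level or color information. In FIG. 3(C), binarization has been applied to FIG. 3(B); each registration image 17 is a binary image, so level and color information is removed and the data size of registration image group 16 is reduced. In FIG. 3(D), selection has been applied to FIG. 3(B); each registration image 17 is a selected frame image, and the data size of registration image group 16 is reduced by the size of the removed frame images. In FIG. 3(E), both binarization and selection have been applied to FIG. 3(B). Binarization may be performed first and the binary images then selected (FIG. 3(B)→(C)→(E)), or the frame images may be selected first and then binarized (FIG. 3(B)→(D)→(E)). In FIG. 3(E), each registration image 17 is a selected binary image, and the data size is greatly reduced compared with FIG. 3(B).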
A method for creating such a video database need only include the following steps: a playback step of playing back one item of registered video data to obtain a playback video; a generation step of generating multiple registration images by applying at least binarization to multiple frame images of the playback video; a recording step of recording a registration image group consisting of the multiple registration images on a recording medium; and a selection step, performed between the playback step and the recording step, of selecting images so that the result of the correlation between one registration image in the registration image group and another registration image is smaller than a preset threshold. As noted above, either the generation step including binarization or the selection step may be performed first, and other image processing or steps may be performed between, before or after them. However, to speed up the correlation in the selection step and improve its accuracy, the correlation is preferably performed in the selection step after binarization.
FIG. 4 is an example of a flowchart of the video database creation process. First, registered video data is obtained and played back (S41: playback step). All frame images M_i (i = 1, 2, 3, ..., N, where N is the number of frames) are generated from the playback video and binarized into binary images B_i (i = 1, 2, 3, ..., N) (S42 to S43: generation step). Next, B_i (initial value i = 1) is assigned to the reference image B_p used as the basis for comparison, and the correlation between reference image B_p and the binary image B_i to be compared (i = i + 1) is computed to obtain a correlation value V (S44 to S46: selection step). When V is at or above the threshold (No in S47: selection step), B_p and B_i are judged similar, B_i is discarded without being registered in the video database, and the correlation between B_p and the next binary image B_i (i = i + 1) is computed (S45 to S46). When V is below the threshold (Yes in S47: selection step), B_p and B_i are judged dissimilar, reference image B_p (initially binary image B_1) is registered in the video database (S48: recording step), and the dissimilar binary image B_i becomes the next reference image B_p and is correlated with the next binary image B_i (i = i + 1) (No in S49, S44 to S46). When in S49 the binary image B_i is the binary image B_N of the last frame, frame image B_N is registered and the process ends (S50). In the end, the reference images registered as mutually dissimilar are recorded as registration image group B_m (m = 1, 2, 3, ..., M, where M is the number of registration images).
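The selection loop of FIG. 4 can be sketched as follows. The sketch assumes each binary image is a flat list of 0/1 pixels and uses the Pearson correlation coefficient as the correlation value V; the function names are hypothetical. The flowchart does not spell out what happens when the last frame is similar to the current reference image, so the sketch registers whichever reference image is current when the frames run out, which is an assumption.

```python
import math


def correlation(a, b):
    """Pearson correlation coefficient of two equal-length flat pixel lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A constant image has no defined coefficient; treat identical
        # images as fully similar and anything else as dissimilar.
        return 1.0 if a == b else 0.0
    return cov / math.sqrt(var_a * var_b)


def select_registration_images(binary_frames, threshold):
    """Keep a reference image each time the next frame stops resembling it."""
    if not binary_frames:
        return []
    registered = []
    reference = binary_frames[0]              # B_p starts as B_1
    for frame in binary_frames[1:]:
        v = correlation(reference, frame)     # correlation value V
        if v < threshold:                     # dissimilar: register B_p
            registered.append(reference)
            reference = frame                 # B_i becomes the next B_p
        # otherwise the frame is similar and is simply dropped
    registered.append(reference)              # register the final reference
    return registered


if __name__ == "__main__":
    a, c = [1, 1, 0, 0], [0, 0, 1, 1]
    print(select_registration_images([a, a, c, c], threshold=0.9))
```

In this toy run the two repeated frames collapse to one registration image each, which mirrors how a static scene contributes a single image to the group.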
In the algorithm of the flowchart in FIG. 4, each binary image is correlated only with the current reference image, first the initial reference image and then each subsequent one, and not with the other reference images, so similar images may still end up registered. To address this, the initial reference image can be correlated in order with all the other binary images, and those whose correlation value exceeds the threshold are discarded without being registered in the video database; one of the remaining binary images is then taken as the next reference image and correlated in order with the other remaining binary images. This further reduces the number of registration images.
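A minimal sketch of this exhaustive variant is shown below. It assumes binary images stored as flat 0/1 lists and, for brevity, uses the fraction of matching pixels as a stand-in similarity measure; both the measure and the function names are assumptions, and any of the correlation measures mentioned earlier could be passed in instead.

```python
def match_ratio(a, b):
    """Fraction of pixel positions where two binary images agree (0.0 to 1.0)."""
    return sum(1 for x, y in zip(a, b) if x == y) / len(a)


def select_exhaustive(binary_frames, threshold, similarity=match_ratio):
    """Compare each reference image against all remaining images.

    Every image whose similarity to the reference exceeds the threshold is
    discarded; the first survivor becomes the next reference image.
    """
    remaining = list(binary_frames)
    registered = []
    while remaining:
        reference = remaining.pop(0)
        registered.append(reference)
        remaining = [f for f in remaining
                     if similarity(reference, f) <= threshold]
    return registered


if __name__ == "__main__":
    a, c = [1, 1, 0, 0], [0, 0, 1, 1]
    # The scene alternates a, c, a, c; comparing only neighbouring frames
    # would keep all four, whereas the exhaustive pass keeps two.
    print(select_exhaustive([a, c, a, c], threshold=0.9))
```

The price is more comparisons: the sequential pass needs one correlation per frame, while this variant can need on the order of the number of frames times the number of registered images.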
A registration data recording step may also be included, in which registration data such as the title, viewing conditions and keywords corresponding to the registration image group, and image feature information on the frame images corresponding to the registration images, is recorded. As for image feature information: when the registration images are binary or grayscale images and the original frame images have distinctive features, for example in the distribution or area of particular colors, these features are recorded as information. Image feature information is then also calculated for the frame image corresponding to the search image, and can be used as a secondary check.
The video database may be created by the collation server alone, or some of the steps may be performed on the registrant's information processing device. To prevent the information leakage described above, the playback step that obtains the playback video and the generation step that binarizes the frame images are preferably performed on the registrant's information processing device. The registration images or transmission images generated on the registrant's information processing device may be sent to the collation server over the network, or delivered to it offline.
An example of a registration system that uses a network is as follows. The registration system has a network, an information processing device connected to the network, and a video database server. The information processing device plays back one item of registered video data to obtain a playback video and applies at least binarization to multiple frame images of the playback video to generate multiple transmission images. A transmission image need only be a frame image that has at least been binarized; other image processing may also have been applied. It may be in a state in which the video database server records it as a registration image as it is, or in a state in which the video database server records it after further processing such as image processing. The information processing device then sends the transmission images to the video database server over the network. Because binarization has turned the transmission images into binary images, they cannot be converted back into the playback video even if they leak on the network, which prevents leakage of the playback video. The video database server generates multiple registration images from the received transmission images and records a registration image group consisting of those registration images on the recording medium. When the received transmission images can be used as registration images as they are, the video database server records them on the recording medium unchanged as the registration images.
The information processing device may also compute correlations between images based on the frame images of the playback video and, when the result exceeds a preset threshold, delete one of the two images from the transmission images. In this case the selection step is performed not by the video database server but by the information processing device connected via the network, which reduces the total data size of the transmission images. An image based on a frame image includes the frame image itself, an image obtained by applying image processing other than binarization to the frame image, and a binarized image.
To use such a registration system, it suffices to install on the information processing device a program that gives it the functions of a playback unit that plays back registered video data to obtain a playback video, an image processing unit that applies at least binarization to multiple frame images of the playback video to generate transmission images, and a transmission unit that sends the transmission images to the video database server over the network. The information processing device may also be given the function of a selection unit that computes correlations between images based on the frame images of the playback video and deletes one of the two images when the result exceeds a preset threshold. To let the registrant choose the threshold, the information processing device may also be given the function of a threshold input unit.
A large-capacity hard disk, a holographic recording medium or the like can be used as the recording medium of the video database. When a large-capacity hard disk is used for the video database and images are collated by arithmetic processing, an information processing device can serve as the collation unit. When a holographic recording medium is used, an optical recording unit and an optical collation unit are required. The collation system, video database, registration system and so on described above can be used with either kind of collation unit.
The operation of the collation unit during collation is described next. As noted above, the collation system of the present invention preferably collates using a search image generated from the sample still image of the browsing video data. FIG. 5 is a flowchart of the collation process in which an information processing device collates a search image generated from the sample still image of browsing video data against a video database that stores binarized registration image groups.
First, the information processing device of the collation server obtains the sample still image of the browsing video data (S51). It then binarizes the still image into a binary search image S (S52). Next, search image S is correlated in turn with each registration image of a registration image group B_m (m = 1, 2, 3, ..., M, where M is the number of registration images) recorded in the video database, and the maximum correlation value V_max is obtained (S53). V_max is compared with the collation threshold for video data; when V_max exceeds the collation threshold (Yes in S54: collation step), the browsing video data is identified as the registered video data of that registration image group (S55). When V_max is at or below the collation threshold (No in S54: collation step), the browsing video data is identified as not being the registered video data of that registration image group, and it is then determined whether there is a next registration image group (S56). When there is a next registration image group (No in S56), search image S is again correlated in turn with each registration image of that other registration image group, and V_max is obtained and compared with the collation threshold (S53 to S54). When there is no next registration image group (Yes in S56), the browsing video data is identified as not registered in the video database (S57).
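The collation loop of FIG. 5 can be sketched as follows, assuming the database is held in memory as a mapping from a title to its registration image group and that images are flat 0/1 lists. The data layout, the function names and the use of the matching-pixel fraction as the correlation value are all assumptions made for the illustration.

```python
def match_ratio(a, b):
    """Fraction of pixel positions where two binary images agree (0.0 to 1.0)."""
    return sum(1 for x, y in zip(a, b) if x == y) / len(a)


def collate(search_image, database, collation_threshold,
            similarity=match_ratio):
    """Return the title of the first group matching the search image, or None.

    database maps a title to its registration image group (a list of
    binary images). For each group the maximum correlation value V_max is
    computed and compared with the collation threshold.
    """
    for title, group in database.items():
        v_max = max((similarity(search_image, image) for image in group),
                    default=0.0)
        if v_max > collation_threshold:       # S54 Yes: identified (S55)
            return title
    return None                               # no group matched (S57)


if __name__ == "__main__":
    db = {
        "film_a": [[1, 1, 0, 0], [1, 0, 1, 0]],
        "film_b": [[0, 0, 1, 1]],
    }
    print(collate([0, 0, 1, 1], db, collation_threshold=0.9))
    print(collate([0, 1, 1, 0], db, collation_threshold=0.9))
```

Returning on the first group that exceeds the threshold follows the flowchart; a variant that scans every group and reports the best match would trade speed for robustness when several registered videos share similar scenes.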
In the step that binarizes the still image into binary search image S (S52), other image processing can also be performed. For example, resolution conversion or edge enhancement may be performed before or after binarization. In addition, when selecting the registration image group to be collated with search image S, the keyword data of the browsing video data or image feature information of the sample still image may be obtained and the registration data consulted, so that registration image groups in which the keyword data is registered, or registration image groups containing registration images with matching image feature information, are selected preferentially.
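Prioritizing registration image groups by keyword can be as simple as a stable sort. The sketch below assumes each group's registration data is a dictionary with hypothetical `title` and `keywords` fields; the text does not define a record layout.

```python
def order_groups_by_keyword(groups, query_keywords):
    """Put groups that share a keyword with the submission first.

    groups: list of dicts with 'title' and 'keywords' (hypothetical layout).
    sorted() is stable, so the original order is kept within each class.
    """
    query = set(query_keywords)
    return sorted(groups,
                  key=lambda g: 0 if query & set(g["keywords"]) else 1)


if __name__ == "__main__":
    groups = [
        {"title": "a", "keywords": ["sports"]},
        {"title": "b", "keywords": ["music"]},
        {"title": "c", "keywords": ["music", "live"]},
    ]
    ordered = order_groups_by_keyword(groups, ["music"])
    print([g["title"] for g in ordered])
```

Groups without a matching keyword stay in the list, so a submission with misleading keywords is still collated against everything, only later.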
The information processing device of the collation server may also attach the collation result as identification data to browsing video data for which collation has finished, and then upload the data to the site.
Collation server 1 using a holographic recording medium is briefly described next.
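FIG. 6 shows a specific configuration of collation server 1 using a holographic recording medium, which has holographic recording medium 20, information processing device 30 and optical system 40. Collation server 1 is provided with video database 2 and collation unit 3; holographic recording medium 20 forms part of video database 2, and information processing device 30 and optical system 40 form collation unit 3.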
In FIG. 6, holographic recording medium 20 is a reflective recording medium in which a thick hologram recording layer 21 made of photosensitive material is sandwiched between surface protection layer 22 and reflective layer 23. In hologram recording layer 21, image data is recorded as interference fringes 24 formed by information light generated from the registration image data and reference light generated from a reference mark. When a disk-shaped medium is used as holographic recording medium 20, collation can be performed while the medium rotates, which increases collation speed. Holographic recording medium 20 preferably has an address layer for locating interference fringes 24. For example, pits may be formed as address information by an uneven profile on the surface of reflective layer 23, so that reflective layer 23 serves as the address layer. Using a glass substrate as surface protection layer 22 suppresses shrinkage and the like caused by temperature changes. A metal such as aluminum can be used for reflective layer 23.
Information processing device 30 is connected to the network and performs the various kinds of information processing executed by the collation system. Examples include playing back browsing image data and creating search images, recording and reading registration data, attaching identification data to browsing video data, deciding the collation order from keyword data, sending information about video data, and sending warning messages.
Optical system 40 can record interference fringes on holographic recording medium 20 and collate the registration image data recorded on holographic recording medium 20 against browsing image data. Optical system 40 has hologram laser 41, mirror 42, spatial light modulator 43, polarizing beam splitter 44, first relay lens 45, mirror 46, second relay lens 47, beam splitter 48, quarter-wave plate 49, objective lens 50, aperture 51 and reference light detector 52, as well as address laser 60, beam splitter 61, mirror 62 and address light detector 63.
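The holographic laser 41 serves as the light source for the information light and reference light when interference fringes are recorded, and as the light source for the search light during collation; a short-wavelength, high-output laser such as a blue or green laser is preferable. The spatial light modulator 43 has a plurality of pixels and can spatially modulate light by changing a property of the light at each pixel; for example, a liquid crystal display device or a DMD (Digital Micromirror Device) can be used. In FIG. 6, a DMD is used as the spatial light modulator 43. The polarizing beam splitter 44 transmits one of two orthogonal polarization directions and reflects the other: it transmits the information light, reference light, and search light traveling toward the recording medium 20, and reflects the reference light reproduced by the recording medium toward the reference light detector 52. The first and second relay lenses 45 and 47 image the pattern displayed on the spatial light modulator 43 onto the focal plane of the objective lens. The beam splitter 48 directs the light from the address laser 60 toward the recording medium 20. The quarter-wave plate 49 converts linearly polarized light into circularly polarized light, and two passes through it rotate linearly polarized light by 90 degrees. Because of the quarter-wave plate 49, the reference light is transmitted through the polarizing beam splitter 44 on irradiation and reflected by the polarizing beam splitter 44 on reproduction. The objective lens 50 Fourier-transforms the image displayed on the spatial light modulator 43 and projects it onto the hologram recording layer 21 of the recording medium 20. The aperture 51 blocks the search light reflected by the recording medium 20 and has an opening that passes only the reproduced reference light to the reference light detector 52. The reference light detector 52 detects the reproduced reference light, and it is particularly preferable that it detect the intensity of that light. For example, a PIN photodiode, a CMOS sensor, or a CCD sensor can be used.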
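The polarization behavior attributed to the quarter-wave plate 49 (one pass gives circular polarization, two passes rotate linear polarization by 90 degrees) can be checked with Jones calculus. The following is a minimal sketch, not part of the patent: it assumes an ideal plate with its fast axis at 45 degrees to the incoming linear polarization, which the text does not state, and it ignores the coordinate flip that a real reflection introduces. The function names are illustrative.

```python
import cmath
import math


def matmul(a, b):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def apply(m, v):
    """Apply a 2x2 Jones matrix to a 2-component Jones vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1]]


def quarter_wave_plate(theta):
    """Jones matrix of an ideal quarter-wave plate with fast axis at angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    rotate = [[c, -s], [s, c]]
    rotate_back = [[c, s], [-s, c]]
    retard = [[1, 0], [0, 1j]]  # 90-degree phase delay on the slow axis
    return matmul(matmul(rotate, retard), rotate_back)


# Assumption: plate axis at 45 degrees to the polarization passed by the splitter.
qwp = quarter_wave_plate(math.pi / 4)
horizontal = [1, 0]

single_pass = apply(qwp, horizontal)   # state arriving at the recording medium
double_pass = apply(qwp, single_pass)  # state after returning through the plate

# Equal amplitudes with a 90-degree phase gap indicate circular polarization.
phase_gap = cmath.phase(single_pass[1] / single_pass[0])
```

After the second pass all of the amplitude sits in the orthogonal (vertical) component, which is why a polarizing beam splitter that transmitted the outgoing beam reflects the returning one.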
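The light emitted from the holographic laser 41 is reflected by the mirror 42 and spatially modulated by the spatial light modulator 43, which displays a registration image or a search image, to generate information light or search light. The information light or search light passes through the polarizing beam splitter 44, is relayed by the first and second relay lenses 45 and 47 while being reflected by the mirror 46 along the way, passes through the beam splitter 48 and the quarter-wave plate 49, is Fourier-transformed by the objective lens 50, and irradiates the hologram recording layer 21 of the recording medium 20. In the hologram recording layer 21, the information light interferes with the reference light and the interference fringes are recorded. When the search light has a high correlation value with the information light that recorded the interference fringes in the hologram recording layer 21, it is diffracted by the interference fringes and reproduces the reference light. The reproduced reference light is reflected by the reflective layer 23, exits the recording medium 20, travels in the direction opposite to irradiation through the objective lens 50, the quarter-wave plate 49, the beam splitter 48, the first and second relay lenses 45 and 47, and the mirror 46, is reflected by the polarizing beam splitter 44, and enters the reference light detector 52 through the aperture 51.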
When an address layer is provided on the recording medium 20, the address laser 60, the beam splitter 61, the mirror 62, and the address light detector 63 are used to determine the irradiation position from the address layer. The light emitted from the address laser 60 passes through the beam splitter 61, is reflected by the mirror 62 and then by the beam splitter 48, passes through the quarter-wave plate 49, and is directed by the objective lens 50 onto the address layer of the recording medium 20. The light reflected from the recording medium 20 travels back through the optical system in the reverse direction, is reflected by the beam splitter 61, and is detected by the address light detector 63. A laser with a relatively long wavelength, such as a red laser, is preferably used as the address laser 60.
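In the collation server 1 using such a holographic recording medium, when a registration image is recorded in the animation database, the registration image and a reference light image are displayed on the spatial light modulator 43, and spatially modulated information light and reference light are generated from them. The information light and reference light are Fourier-transformed by the objective lens 50 and then irradiate the hologram recording layer 21 of the recording medium 20, where the Fourier-transformed information light and reference light interfere and interference fringes corresponding to the registration image are recorded. When browsing animation data is collated, a search image is displayed on the spatial light modulator 43 and spatially modulated search light is generated from it. After being Fourier-transformed by the objective lens 50, the search light irradiates the interference fringes in the hologram recording layer 21; when the correlation value between the search image and the registration image whose information light formed the interference fringes is high, the search light is diffracted by the interference fringes and the reference light is reproduced. The correlation value can therefore be obtained by detecting the intensity of the reproduced reference light. When the registration image or search image is a binary image obtained by binarization, it is easy to map it onto the pixels of the spatial light modulator 43, which is desirable.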
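For readers who want a software analogue of the optical correlation described above, the following minimal sketch computes a correlation value between two binary images. It is an assumption-laden stand-in, not the patented optical method: it takes only the zero-shift overlap of the two images and scales it by the geometric mean of their autocorrelations so that identical images score 100, mirroring the normalization used in the examples. The function name `correlation_value` and the normalization choice are illustrative.

```python
import math


def correlation_value(registered, search):
    """Zero-shift correlation of two same-sized binary images, scaled to 0..100."""
    if len(registered) != len(search) or any(
            len(r) != len(s) for r, s in zip(registered, search)):
        raise ValueError("images must have the same shape")
    # Pair up corresponding pixels of the two images.
    pairs = [(p, q) for r, s in zip(registered, search) for p, q in zip(r, s)]
    overlap = sum(p * q for p, q in pairs)
    auto_registered = sum(p * p for p, _ in pairs)
    auto_search = sum(q * q for _, q in pairs)
    if auto_registered == 0 or auto_search == 0:
        return 0.0  # an all-dark image correlates with nothing
    # Assumed normalization: identical images give exactly 100.
    return 100.0 * overlap / math.sqrt(auto_registered * auto_search)


registered_image = [[1, 1, 0],
                    [0, 1, 0]]
shifted_image = [[0, 0, 1],
                 [1, 0, 1]]

same = correlation_value(registered_image, registered_image)
different = correlation_value(registered_image, shifted_image)
```

In the optical system the analogous quantity is read out as the intensity of the reproduced reference light, so the whole comparison happens in one exposure rather than pixel by pixel.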
The above description uses a collinear apparatus, in which the information light and the reference light interfere on the same axis, so collation can be performed at high speed. Although a reflective holographic recording medium has been described, a transmissive type, in which the reference mark reproduced by the optical correlation operation is transmitted and displayed, can also be used. Instead of the collinear method, an apparatus using two-beam interference can also be used, in which the optical path of the information light and that of the reference mark light are separated and the two beams cross at a fixed angle on the recording medium. For example, the light from the light source 41 can be split in two by a beam splitter, one beam modulated by a spatial light modulator to generate the information light, and the other beam shaped to generate the reference mark light, with the two beams irradiated so that they cross on the recording medium.
(Example 1)
As registered animation data, 30 animation clips of 10 seconds each at 30 fps were prepared, and all frame images of each clip (300 frames) were subjected to image processing such as binarization to generate a binary image group Bi. The correlation value of the autocorrelation (the correlation operation of an image with itself) was normalized to 100 (a.u., arbitrary units), and the selection step was carried out with two thresholds, 70 (a.u.) and 60 (a.u.), to create animation databases. With a threshold of 70 (a.u.), the number of registration images was reduced from 9000 binary images (300 frames × 30 clips) to 2504; with a threshold of 60 (a.u.), it was reduced from 9000 to 1580. Table 1 below shows the reduction for each animation data item.
[Table 1]
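The selection step of Example 1 reduced 9000 binary images to 2504 (about 27.8%) at a threshold of 70 and to 1580 (about 17.6%) at a threshold of 60. The sketch below illustrates one way such a threshold-based selection could work. The rule it uses is an assumption, since this passage does not spell it out: a frame is registered only when its correlation with the most recently registered frame falls below the threshold. Frames are flattened binary lists, and both function names are hypothetical.

```python
import math


def correlation_value(a, b):
    """Correlation of two flattened binary frames; identical frames give 100."""
    overlap = sum(p * q for p, q in zip(a, b))
    norm = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return 100.0 * overlap / norm if norm else 0.0


def select_registration_frames(frames, threshold):
    """Return indices of frames to register (assumed rule, see lead-in)."""
    selected = []
    for index, frame in enumerate(frames):
        # Always keep the first frame; afterwards keep a frame only if it
        # differs enough from the last frame that was registered.
        if not selected or correlation_value(frames[selected[-1]], frame) < threshold:
            selected.append(index)
    return selected


frames = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
]

kept_at_80 = select_registration_frames(frames, 80)
kept_at_70 = select_registration_frames(frames, 70)
```

With this rule a lower threshold treats more frames as redundant, so fewer images are registered, which matches the direction of the reduction reported for thresholds 70 and 60.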
The error rate was then obtained for each of three animation databases: the one in which all binary images were registered, the one subjected to the selection step with a threshold of 70, and the one subjected to the selection step with a threshold of 60. FIGS. 7(A) to 7(C) show the error rates for the respective databases; the vertical axis is the error rate and the horizontal axis is the threshold applied to the normalized correlation value. The error rate consists of two error curves: the false rejection rate (FRR), at which a registered image is mistaken for an unregistered one, and the false acceptance rate (FAR), at which a different image is mistaken for a registered one. The FRR and the FAR are each derived from the proportion of correct and incorrect collation results as the threshold is varied. For FIGS. 7(A) to 7(C), one frame image was extracted from each of the 30 registered animation data items to create 30 search images, each was collated against the registration images of each animation database, and a search image was judged to match a registration image when the obtained correlation value exceeded the threshold; the figures plot the resulting FRR and FAR.
The point at which the FRR and FAR curves cross is the threshold at which both rates are small together; the error rate at that point is called the EER (Equal Error Rate), and when the EER extends over a range of thresholds, that range is called the threshold region. The smaller the EER the better: if the EER is 0%, both the FRR and the FAR are 0% and, in theory, no errors occur. The width of the threshold region indicates the reliability of collation; the wider it is, the more reliable the collation. To allow similar images to be matched as well, the threshold may deliberately be set below the threshold region, which raises the FAR.
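The relationship between the FRR, the FAR, and the threshold region can be illustrated with a short sketch. It follows the decision rule given for FIG. 7 (a match is declared when the correlation value exceeds the threshold); the score lists are made-up illustrative values, not data from the experiments, and the function names are hypothetical.

```python
def error_rates(genuine, impostor, threshold):
    """Return (FRR, FAR) when a match means score > threshold."""
    # Genuine scores at or below the threshold are wrongly rejected.
    frr = sum(score <= threshold for score in genuine) / len(genuine)
    # Impostor scores above the threshold are wrongly accepted.
    far = sum(score > threshold for score in impostor) / len(impostor)
    return frr, far


def zero_error_thresholds(genuine, impostor, candidates=range(0, 101)):
    """Thresholds at which both FRR and FAR are zero (the threshold region)."""
    return [t for t in candidates
            if error_rates(genuine, impostor, t) == (0.0, 0.0)]


# Illustrative scores only: registered images queried with their own frames
# (genuine) and with frames from other clips (impostor).
genuine_scores = [100, 95, 90]
impostor_scores = [20, 35, 50]

region = zero_error_thresholds(genuine_scores, impostor_scores)
```

Here the region spans thresholds 50 through 89. Choosing a threshold below 50 would start accepting impostor scores, which is the deliberate trade-off mentioned above for matching similar images.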
FIG. 7(A) shows the animation database in which all binary images were registered, FIG. 7(B) the database subjected to the selection step with a threshold of 70, and FIG. 7(C) the database subjected to the selection step with a threshold of 60. The EER is 0% in FIGS. 7(A) and 7(B), and even in FIG. 7(C) the EER is an extremely low 3.3%.
[Example 2]
Next, an experiment was conducted to evaluate the collation accuracy of a collation system that uses search images generated from sample still images of browsing animation data. The animation database was prepared by taking 30 animation clips of 10 seconds each at 30 fps as registered animation data, applying image processing such as binarization to all frame images (300 frames per clip) to generate 9000 binary images Bi, and registering all of them. The sample still images used for the search were 128 × 96 pixels; one sample still image per registered animation data item, 30 in total, was subjected to image processing such as binarization and used as a search image. FIG. 8 shows the error rates in this experiment; the vertical axis is the error rate and the horizontal axis is the threshold applied to the normalized correlation value. The error rates are defined as in FIG. 7. The result confirms that highly accurate collation with an EER of 0% is possible.
Claims (19)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2007-267204 | 2007-09-12 | ||
| JP2007267204A JP4281097B2 (en) | 2007-09-12 | 2007-09-12 | Video data verification system |
| PCT/JP2008/066413 WO2009035032A1 (en) | 2007-09-12 | 2008-09-11 | Moving image data checking system, moving image database creating method, and registering system and program for registering moving image data in moving image database |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN101816007A (en) | 2010-08-25 |
Family
ID=40452039
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN200880106643A Pending CN101816007A (en) | 2007-09-12 | 2008-09-11 | System for checking animation data, method for creating animation database, and system and program for registering animation data in animation database |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20100299353A1 (en) |
| EP (1) | EP2202663A4 (en) |
| JP (1) | JP4281097B2 (en) |
| KR (1) | KR101579626B1 (en) |
| CN (1) | CN101816007A (en) |
| WO (1) | WO2009035032A1 (en) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102542591A (en) * | 2010-12-10 | 2012-07-04 | 北京电影学院 | Animation creation method based on data base |
| JP5590427B2 (en) * | 2011-03-25 | 2014-09-17 | 日本電気株式会社 | Video processing system, video content monitoring method, video processing apparatus, control method thereof, and control program |
| EP2720457A4 (en) | 2011-06-13 | 2015-03-25 | Nec Corp | Video processing system, video processing method, method of creating video processing database, video processing database, video processing apparatus, and control method and control program therefor |
| JP5911298B2 (en) * | 2011-12-27 | 2016-04-27 | キヤノン株式会社 | Imaging apparatus and control method thereof |
| US20130312112A1 (en) * | 2012-05-18 | 2013-11-21 | Rumblefish, Inc. | Licensing protected works within electronic information networks |
| US20180173858A1 (en) * | 2015-06-05 | 2018-06-21 | Nec Corporation | Image processing system, server apparatus, controlling method thereof, and program |
| US12405573B2 (en) | 2021-12-27 | 2025-09-02 | Kwangwoon University Industry-Academic Collaboration Foundation | Webserver-based video content production system and method for printing holographic stereogram photo |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3340532B2 (en) | 1993-10-20 | 2002-11-05 | 株式会社日立製作所 | Video search method and apparatus |
| JP3765145B2 (en) * | 1996-02-02 | 2006-04-12 | 株式会社日立製作所 | Electronic information distribution method |
| JP3198980B2 (en) * | 1996-10-22 | 2001-08-13 | 松下電器産業株式会社 | Image display device and moving image search system |
| JP3496559B2 (en) * | 1999-01-06 | 2004-02-16 | 日本電気株式会社 | Image feature generation apparatus and image feature generation method |
| US20030228565A1 (en) * | 2000-04-26 | 2003-12-11 | Cytokinetics, Inc. | Method and apparatus for predictive cellular bioinformatics |
| JP2001136363A (en) * | 1999-11-02 | 2001-05-18 | Nippon Telegraph & Telephone West Corp | Content license management method and device |
| WO2002082271A1 (en) * | 2001-04-05 | 2002-10-17 | Audible Magic Corporation | Copyright detection and protection system and method |
| EP1260934A3 (en) * | 2001-05-22 | 2004-04-14 | Matsushita Electric Industrial Co., Ltd. | Surveillance recording device and method |
| JP2002366531A (en) * | 2001-06-06 | 2002-12-20 | Japan Science & Technology Corp | Copyright management system |
| JP4372389B2 (en) * | 2002-02-28 | 2009-11-25 | ソフトバンクモバイル株式会社 | CONTENT PROVIDING METHOD, CONTENT PROVIDING SERVER USED FOR THE METHOD, AND COMMUNICATION SYSTEM |
| JP2004240750A (en) * | 2003-02-06 | 2004-08-26 | Canon Inc | Image retrieval device |
| JP2005123667A (en) * | 2003-10-14 | 2005-05-12 | Seiko Epson Corp | Generation of still image data from multiple image data |
| JP2005135146A (en) * | 2003-10-30 | 2005-05-26 | Nippon Telegr & Teleph Corp <Ntt> | Content information disclosure server, content use support method, content use support program, and recording medium recording the program |
| JP4741301B2 (en) * | 2005-07-06 | 2011-08-03 | 株式会社オプティム | Information search system, information search device, information search method, recording medium, and program |
| JP4740706B2 (en) * | 2005-09-28 | 2011-08-03 | ヤフー株式会社 | Fraud image detection apparatus, method, and program |
| KR20060101421A (en) * | 2006-09-02 | 2006-09-25 | 장중혁 | How to search video using summary video data |
| JP4228320B2 (en) * | 2006-09-11 | 2009-02-25 | ソニー株式会社 | Image processing apparatus and method, and program |
- 2007-09-12: JP JP2007267204A (JP4281097B2), Active
- 2008-09-11: WO PCT/JP2008/066413 (WO2009035032A1), Ceased
- 2008-09-11: US US12/676,645 (US20100299353A1), Abandoned
- 2008-09-11: KR KR1020107005374A (KR101579626B1), Active
- 2008-09-11: EP EP08831092A (EP2202663A4), Withdrawn
- 2008-09-11: CN CN200880106643A (CN101816007A), Pending
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107992575A (en) * | 2017-12-01 | 2018-05-04 | 广州昊梦天信息科技有限公司 | A kind of animation comprehensive management platform of internet |
| CN113327306A (en) * | 2021-05-26 | 2021-08-31 | 厦门大学 | Exclusive animation generation method and generation system based on hand shadow |
| CN113327306B (en) * | 2021-05-26 | 2022-06-21 | 厦门大学 | An exclusive animation generation method and generation system based on hand shadow implementation |
Also Published As
| Publication number | Publication date |
|---|---|
| US20100299353A1 (en) | 2010-11-25 |
| WO2009035032A1 (en) | 2009-03-19 |
| KR20100066505A (en) | 2010-06-17 |
| EP2202663A1 (en) | 2010-06-30 |
| JP2009070349A (en) | 2009-04-02 |
| KR101579626B1 (en) | 2015-12-23 |
| JP4281097B2 (en) | 2009-06-17 |
| EP2202663A4 (en) | 2011-01-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101816007A (en) | System for checking animation data, method for creating animation database, and system and program for registering animation data in animation database | |
| Ho et al. | Handbook of digital forensics of multimedia data and devices | |
| US8259177B2 (en) | Video fingerprint systems and methods | |
| US8347408B2 (en) | Matching of unknown video content to protected video content | |
| CN114846465B (en) | Authenticity assessment of modified content | |
| Poisel et al. | Forensics investigations of multimedia data: A review of the state-of-the-art | |
| Battiato et al. | Multimedia forensics: discovering the history of multimedia contents | |
| Khan et al. | Visual user-generated content verification in journalism: An overview | |
| US20090327334A1 (en) | Generating Measures of Video Sequences to Detect Unauthorized Use | |
| JP5325583B2 (en) | Movie database creation method, registration system and program for registering movie data in movie database | |
| US8478033B2 (en) | Image inspection apparatus and method | |
| Celebi et al. | A survey of deep fake detection for trial courts | |
| Patil et al. | AlexNet based pirate detection system | |
| JP5380643B2 (en) | Video data verification system | |
| Jayasinghe et al. | VANGUARD: a blockchain-based solution to digital piracy | |
| Meshram et al. | Video forensic for video tamper detection | |
| Aulakh et al. | A systematic literature review of video forgery detection techniques | |
| US11966500B2 (en) | Systems and methods for isolating private information in streamed data | |
| JP5008464B2 (en) | Image data verification system | |
| Kara et al. | A forensic method for investigating manipulated video recordings | |
| Akhtar et al. | A three-level benchmark dataset for spatial and temporal forensic analysis of videos | |
| Hwang et al. | Identification of mobile phone and analysis of original version of videos through a delay time analysis of sound signals from mobile phone videos | |
| US20250258892A1 (en) | Generative artificial intelligence content management | |
| Kim et al. | An in-depth forensic examination of video files edited by Apple Photos | |
| Temmermans et al. | An Introduction to the JPEG Fake Media Initiative |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20100825 |