JP2016114445A

JP2016114445A - Three-dimensional position calculation device, program for the same, and cg composition apparatus

Info

Publication number: JP2016114445A
Application number: JP2014252738A
Authority: JP
Inventors: 寛史盛岡; Hirofumi Morioka; 英彦大久保; Hidehiko Okubo; 秀樹三ツ峰; Hideki Mitsumine
Original assignee: Nippon Hoso Kyokai NHK
Current assignee: Japan Broadcasting Corp
Priority date: 2014-12-15
Filing date: 2014-12-15
Publication date: 2016-06-23

Abstract

【課題】魚眼カメラを用いて特徴点の３次元位置を算出する３次元位置算出装置を提供する。【解決手段】３次元位置算出装置１０は、等距離射影方式の２台の魚眼カメラで撮影された画像から、それぞれ特徴点を抽出する特徴点抽出手段１２と、抽出された特徴点における画像特徴量が両画像間で最も類似する特徴点対において、両画像上を極座標で表したときのそれぞれの特徴点の動径の差である動径距離が所定の閾値よりも小さく、かつ、偏角の差である角度距離が所定の閾値よりも小さい特徴点対を同じ特徴点を指し示す特徴点対としてフィルタリングする特徴点フィルタリング手段１３と、フィルタリングされた特徴点対の両画像上の位置と、魚眼カメラの距離とにより、当該特徴点の３次元位置を計算する特徴点３次元位置計算手段１５と、を備える。【選択図】図２A three-dimensional position calculation apparatus for calculating a three-dimensional position of a feature point using a fisheye camera is provided. A three-dimensional position calculation apparatus 10 includes feature point extraction means 12 for extracting feature points from images photographed by two fisheye cameras of an equidistant projection method, and images at the extracted feature points. In the feature point pair whose feature quantity is most similar between both images, the radial distance, which is the difference between the radial diameters of the feature points when both images are represented in polar coordinates, is smaller than a predetermined threshold value, and A feature point filtering means 13 for filtering a feature point pair whose angle distance, which is a difference in angles, is smaller than a predetermined threshold value as a feature point pair indicating the same feature point, a position on both images of the filtered feature point pair, Feature point three-dimensional position calculation means 15 for calculating the three-dimensional position of the feature point according to the distance of the fisheye camera. [Selection] Figure 2

Description

本発明は、魚眼カメラを用いた３次元空間上のオブジェクトの位置を算出する３次元位置算出装置およびそのプログラム、ならびに、ＣＧ合成装置に関する。 The present invention relates to a three-dimensional position calculation device that calculates the position of an object in a three-dimensional space using a fisheye camera, a program for the same, and a CG composition device.

近年、ＣＧ（Computer Graphics）技術の発達に伴い、映画、テレビ番組等で、ＣＧオブジェクトと実写映像とを合成したＣＧ合成映像が用いられている。
このように、ＣＧオブジェクトと実写映像とを合成する場合、ＣＧオブジェクトを実写映像の照明条件に合わせる必要がある。 In recent years, with the development of CG (Computer Graphics) technology, CG composite video in which a CG object and a live-action video are combined is used in movies, television programs, and the like.
Thus, when combining a CG object and a live-action video, it is necessary to match the CG object with the illumination conditions of the live-action video.

従来、実世界において、できるだけ広範囲からの光を取り込むため、２台の魚眼カメラで撮影された画像から、実光源環境をモデル化し、その環境でＣＧ合成を行う手法（以下、実光源計測手法）が開示されている（特許文献１、非特許文献１）。
この実光源計測手法は、まず、２台の魚眼カメラを光源方向に向け、光源の分布を撮影する。そして、この手法は、撮影された２つの画像から、マッチングする特徴点の３次元位置を求め、その特徴点を頂点とする三角メッシュを生成する。そして、この手法は、それぞれの三角メッシュごとに、撮影された画像から輝度を計算することで、光源分布の情報を、三角メッシュをつなぎ合わせたモデルで近似的に求めている。
なお、実光源計測手法は、それぞれの画像中心から３次元空間上の特徴点までの２直線の交差誤差が最小となる特徴点を、画像間でマッチングする特徴点として選択し、交差誤差が最小となる特徴点が複数存在する場合は、画像上のピクセル座標が最も近い特徴点を選択している。 Conventionally, in order to capture light from a wide range as much as possible in the real world, a method of modeling an actual light source environment from images taken by two fisheye cameras and performing CG synthesis in that environment (hereinafter referred to as an actual light source measurement method) ) Is disclosed (Patent Document 1, Non-Patent Document 1).
In this actual light source measurement method, first, two fisheye cameras are pointed in the light source direction, and the distribution of the light sources is photographed. In this method, the three-dimensional position of the matching feature point is obtained from the two captured images, and a triangular mesh having the vertex as the feature point is generated. In this method, the luminance distribution is calculated from the captured image for each triangular mesh, so that information on the light source distribution is approximately obtained by a model in which the triangular meshes are connected.
In the actual light source measurement method, the feature point that minimizes the intersection error of two straight lines from the center of each image to the feature point in the three-dimensional space is selected as the feature point that matches between the images, and the intersection error is minimized. When there are a plurality of feature points, the feature point having the closest pixel coordinate on the image is selected.

このように複数の魚眼カメラで撮影された画像間で特徴点のマッチングをとる手法は、特許文献１、非特許文献１以外にも、種々提案されている（例えば、特許文献２，３、非特許文献２，３等）。
なお、特許文献２の手法は、特徴点の対応付けに、ＲＡＮＳＡＣ（RANdom SAmpleConsensus）を用いている。また、特許文献３の手法は、魚眼カメラで撮影された魚眼画像（全天画像）を平面展開し、ＳＳＤ（Sum of Squared Difference）の演算により特徴点のマッチングを行っている。また、非特許文献２の手法は、全特徴点を対象として、ＺＮＣＣ（正規化相互相関：Zero-mean Normalized Cross-Correlation）の演算により特徴点のマッチングを行っている。また、非特許文献３の手法は、特徴点の対応の誤差を最小化するためバンドル調整のアルゴリズムを用いている。 In addition to Patent Document 1 and Non-Patent Document 1, various methods for matching feature points between images photographed by a plurality of fisheye cameras have been proposed (for example, Patent Documents 2 and 3; Non-patent documents 2 and 3).
Note that the method of Patent Document 2 uses RANSAC (RANdom SAmpleConsensus) for associating feature points. In the method of Patent Document 3, a fish-eye image (all-sky image) photographed by a fish-eye camera is developed on a plane, and feature points are matched by calculating an SSD (Sum of Squared Difference). In the method of Non-Patent Document 2, feature points are matched by calculating ZNCC (Zero-mean Normalized Cross-Correlation) for all feature points. Further, the method of Non-Patent Document 3 uses a bundle adjustment algorithm in order to minimize the error corresponding to the feature points.

特開平１１−１７５７６２号公報JP-A-11-175762 国際公開第２０１２／０６３４６７号International Publication No. 2012/063467 国際公開第２００６／０７５５２８号International Publication No. 2006/075528

佐藤いまり, 佐藤洋一, 池内克史, “全方位ステレオによる実光源環境の計測とそれにもとづく仮想物体の実画像への重ね込み”, 電子情報通信学会論文誌D-II, J81-DII, No.5, pp.861-871, 1998年5月Imari Sato, Yoichi Sato, Katsushi Ikeuchi, “Measurement of real light source environment by omnidirectional stereo and superimposition of virtual object based on it”, IEICE Transactions D-II, J81-DII, No. 5, pp.861-871, May 1998 M. Lhuillier, “Automatic Structure and Motion using a Catadioptric Camera”, Proc. IEEE Workshop OMNIVIS’05, 2005M. Lhuillier, “Automatic Structure and Motion using a Catadioptric Camera”, Proc. IEEE Workshop OMNIVIS’05, 2005 E. Mouragnon, M. Lhuillier, M. Dhome, F. Dekeyser, P. Sayd,“Generic and Real-Time Structure from Motion”, Proc. BMVC, 2007E. Mouragnon, M. Lhuillier, M. Dhome, F. Dekeyser, P. Sayd, “Generic and Real-Time Structure from Motion”, Proc. BMVC, 2007

しかし、前記した実光源計測手法のように、魚眼カメラで撮影された画像上では、実世界で計測される距離が必ずしもそのまま反映されない。すなわち、魚眼カメラで撮影された画像（全天画像）は、画像中心と画像周縁とで画像上の距離が同じであっても、実世界の距離が異なっている。
そのため、従来のように、全天画像上の距離を指標として、特徴点のマッチングを行うと、誤マッチングを生じさせる可能性が高いという問題がある。 However, the distance measured in the real world is not necessarily reflected as it is on the image photographed by the fisheye camera as in the above-described real light source measurement method. In other words, the images taken by the fisheye camera (all-sky image) have different real-world distances even if the image center and the image periphery have the same distance on the image.
For this reason, there is a problem in that, when a feature point is matched using a distance on the whole sky image as an index as in the past, there is a high possibility of causing a false matching.

また、従来の手法は、特徴点マッチングを行う際に、全天画像を平面展開する必要があったり、多くの特徴点を対象としてＲＡＳＮＳＡＣ、ＳＳＤ、ＺＮＣＣ等の演算を行う必要があったり等、前処理や計算対象の多さによって、計算コストがかかってしまうという問題がある。 In addition, when performing the feature point matching in the conventional method, it is necessary to flatten the whole sky image, or it is necessary to perform operations such as RASSNSAC, SSD, and ZNCC for many feature points. There is a problem that the calculation cost is increased due to the large number of preprocessing and calculation objects.

本発明は、このような問題に鑑みてなされたものであり、魚眼カメラを用いて特徴点の３次元位置を算出する際に、従来よりも、正確、かつ、計算コストの低い３次元位置算出装置およびそのプログラム、ならびに、魚眼カメラを用いたＣＧ合成装置を提供することを課題とする。 The present invention has been made in view of such a problem. When a three-dimensional position of a feature point is calculated using a fisheye camera, the three-dimensional position is more accurate and less expensive than the conventional one. It is an object of the present invention to provide a calculation device, a program thereof, and a CG synthesis device using a fisheye camera.

前記課題を解決するため、本発明に係る３次元位置算出装置は、等距離射影方式の魚眼レンズを備えた２台の魚眼カメラで撮影された画像から、当該画像中の特徴点の３次元位置を算出する３次元位置算出装置であって、特徴点抽出手段と、特徴点フィルタリング手段と、特徴点３次元位置計算手段と、を備える構成とした。 In order to solve the above-described problem, a three-dimensional position calculation apparatus according to the present invention is configured such that a three-dimensional position of a feature point in an image is obtained from images captured by two fisheye cameras having equidistant projection-type fisheye lenses. Is a configuration including a feature point extracting unit, a feature point filtering unit, and a feature point three-dimensional position calculating unit.

かかる構成において、３次元位置算出装置は、特徴点抽出手段によって、２台の魚眼カメラで撮影された一方の画像と他方の画像とから、それぞれ画像特徴の変化する特徴点を抽出する。この特徴点は、例えば、ＳＩＦＴ（Scale Invariant Feature Transform）等によって抽出することができる。 In this configuration, the three-dimensional position calculation device extracts feature points whose image features change from one image and the other image captured by the two fisheye cameras by the feature point extraction unit. This feature point can be extracted by, for example, SIFT (Scale Invariant Feature Transform) or the like.

そして、３次元位置算出装置は、特徴点フィルタリング手段によって、特徴点抽出手段で抽出された特徴点における画像特徴量が両画像間で最も類似する特徴点対において、両画像を極座標で表したときのそれぞれの特徴点の動径距離が所定の閾値よりも小さく、かつ、角度距離が所定の閾値よりも小さい特徴点対を同じ特徴点を指し示す特徴点対としてフィルタリングする。
なお、動径距離は、特徴点対の画像上の極座標における動径の差である。また、角度距離は、特徴点対の画像上の極座標における偏角の差である。
これによって、特徴点フィルタリング手段は、画像の中心から離れるほど、両画像間の視差が大きくなるという、魚眼カメラの特性に合わせた特徴点のマッチング判定を行うことができる。 Then, the three-dimensional position calculation device uses the feature point filtering unit to represent both images in polar coordinates in the feature point pair having the most similar image feature amount between the two images. The feature point pairs in which the radial distance of each of the feature points is smaller than a predetermined threshold and the angular distance is smaller than the predetermined threshold are filtered as feature point pairs indicating the same feature point.
The radial distance is a radial difference in polar coordinates on the feature point pair image. The angular distance is a difference in declination in polar coordinates on the feature point pair image.
Thereby, the feature point filtering means can perform the matching determination of the feature points according to the characteristics of the fisheye camera such that the farther from the center of the image, the larger the parallax between the two images.

そして、３次元位置算出装置は、特徴点３次元位置計算手段によって、特徴点フィルタリング手段を用いた後で、正しい特徴点のマッチングであると判定された特徴点対の両画像上の位置と、魚眼カメラ中心の距離とにより、三角測量の原理により、当該特徴点の３次元位置を計算する。
なお、３次元位置算出装置は、照明環境を撮影し、算出された特徴点の３次元位置により、照明環境をモデル化し、ＣＧオブジェクトを生成するＣＧ合成装置に組み込むことができる。 Then, the three-dimensional position calculation device uses the feature point three-dimensional position calculation means, after using the feature point filtering means, the positions of the feature point pairs determined to be correct feature point matching on both images, Based on the triangulation principle, the three-dimensional position of the feature point is calculated based on the distance from the center of the fisheye camera.
The three-dimensional position calculation device can be incorporated into a CG composition device that captures an image of the illumination environment, models the illumination environment based on the calculated three-dimensional position of the feature point, and generates a CG object.

本発明は、以下に示す優れた効果を奏するものである。
本発明によれば、２つの魚眼カメラで撮影された画像において、魚眼カメラの特性に合わせて、特徴点のマッチングを判定することができるため、誤マッチングを減らすことができ、特徴点の精度の高い３次元位置を求めることができる。
また、本発明によれば、特徴点のマッチング判定において、閾値判定を行うという簡易な方法であるため、計算コストを低く抑えることができる。 The present invention has the following excellent effects.
According to the present invention, since the matching of feature points can be determined in accordance with the characteristics of a fish-eye camera in images taken by two fish-eye cameras, false matching can be reduced, and A highly accurate three-dimensional position can be obtained.
Further, according to the present invention, since it is a simple method of performing threshold determination in feature point matching determination, the calculation cost can be kept low.

本発明の実施形態に係る映像合成システムの全体構成を示す構成図である。1 is a configuration diagram showing an overall configuration of a video composition system according to an embodiment of the present invention. 本発明の実施形態に係る３次元位置算出装置（３次元位置算出手段）を含むＣＧ合成装置の構成を示すブロック構成図である。It is a block block diagram which shows the structure of CG synthetic | combination apparatus containing the three-dimensional position calculation apparatus (three-dimensional position calculation means) which concerns on embodiment of this invention. 魚眼カメラで撮影された画像（全天画像）の座標系を説明するための説明図である。It is explanatory drawing for demonstrating the coordinate system of the image (all-sky image) image | photographed with the fisheye camera. 閾値による特徴点のマッチング判定を説明するための説明図であって、（ａ）は本発明における動径および偏角の差を閾値とする例を示す図、（ｂ）は従来の単純距離を閾値とする例を示す図である。It is explanatory drawing for demonstrating the matching judgment of the feature point by a threshold value, Comprising: (a) is a figure which shows the example which makes the threshold value the difference of the radial and declination in this invention, (b) is the conventional simple distance. It is a figure which shows the example used as a threshold value. ２つの魚眼カメラで検出された特徴点の３次元位置を計算する三角測量の原理を説明するための説明図である。It is explanatory drawing for demonstrating the principle of the triangulation which calculates the three-dimensional position of the feature point detected with two fisheye cameras. 三角測量の原理で使用する特徴点方向ベクトルを説明するための説明図であって、（ａ）は特徴点方向ベクトルの天頂角を説明するための図、（ｂ）は特徴点方向ベクトルの偏角を説明するための図である。It is explanatory drawing for demonstrating the feature point direction vector used by the principle of a triangulation, (a) is a figure for demonstrating the zenith angle of a feature point direction vector, (b) is a bias of a feature point direction vector. It is a figure for demonstrating a corner | angular. 光源分布マップの役割を説明するための模式図である。It is a schematic diagram for demonstrating the role of a light source distribution map. 本発明の実施形態に係る３次元位置算出装置（３次元位置算出手段）を含むＣＧ合成装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the CG synthetic | combination apparatus containing the three-dimensional position calculation apparatus (three-dimensional position calculation means) concerning embodiment of this invention.

以下、本発明の実施形態について図面を参照して説明する。
≪映像合成システムの全体構成≫
最初に、図１を参照して、本発明の実施形態に係る映像合成システムＳの全体構成について説明する。 Embodiments of the present invention will be described below with reference to the drawings.
≪Overall configuration of video composition system≫
First, the overall configuration of the video composition system S according to the embodiment of the present invention will be described with reference to FIG.

映像合成システムＳは、スタジオカメラＣｓで撮影されるスタジオＳＴ内の実写映像と、ＣＧオブジェクトとを実写映像における照明条件に合わせて合成し、合成映像Ｄを生成するものである。ここでは、映像合成システムＳは、ＣＧ合成装置１と、実写ＣＧ合成装置３と、を備える。 The video composition system S synthesizes the live-action video in the studio ST photographed by the studio camera Cs and the CG object in accordance with the illumination conditions in the live-action video, and generates a composite video D. Here, the video composition system S includes a CG composition device 1 and a live-action CG composition device 3.

ＣＧ合成装置１は、スタジオＳＴ内に予め設置した魚眼レンズを使用したセンサカメラ（以下、魚眼カメラＣ_Ｌ，Ｃ_Ｒ）で撮影された画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）から、光源Ｌ等の照明環境をモデル化するとともに、合成対象となる素材ＣＧデータＣＧ_Ｄをその照明環境においてレンダリングすることで、ＣＧオブジェクト（ＣＧ画像）を生成するものである。なお、本明細書中、それぞれの魚眼カメラやその撮影画像を表すために、図面に合わせて便宜上“左”、“右”と表現しているが、これは単に“一方”、“他方”の意味を表しているにすぎない。 The CG synthesizer 1 is based on images (left camera image G _L , right camera image G _R ) captured by a sensor camera (hereinafter referred to as fish eye cameras C _L , C _R ) using a fish-eye lens installed in the studio ST in advance. The CG object (CG image) is generated by modeling the illumination environment such as the light source L and rendering the material CG data CG _D to be synthesized in the illumination environment. In this specification, in order to represent each fisheye camera and its captured image, it is expressed as “left” and “right” for convenience in accordance with the drawings, but this is simply “one”, “other”. It just represents the meaning of

ここで、魚眼カメラＣ_Ｌ，Ｃ_Ｒは、照明環境を撮影するため、天頂を撮影対象として、撮影画像が平行になるように所定の間隔で配置される。また、魚眼カメラＣ_Ｌ，Ｃ_Ｒは、等距離射影方式のレンズを用い、同じ解像度であることとする。ここで、等距離射影方式とは、撮影した画像の中心からの距離と天頂角とが比例する方式である。
このＣＧ合成装置１は、生成したＣＧオブジェクトを実写ＣＧ合成装置３に出力する。このＣＧ合成装置１の構成および動作については、後で詳細に説明する。 Here, the fisheye cameras C _L and C _R are arranged at a predetermined interval so that the captured images are parallel with the zenith as the imaging target in order to capture the illumination environment. Further, fisheye camera C _L, C _R is a lens equidistant projection method, and it is the same resolution. Here, the equidistant projection method is a method in which the distance from the center of the captured image is proportional to the zenith angle.
The CG composition device 1 outputs the generated CG object to the live-action CG composition device 3. The configuration and operation of the CG synthesizer 1 will be described in detail later.

実写ＣＧ合成装置３は、スタジオカメラＣ_Ｓで撮影された映像に、ＣＧ合成装置１で生成されたＣＧオブジェクトを合成するものである。なお、この実写ＣＧ合成装置３は、映像のフレームごとに、ＣＧオブジェクトを合成する一般的な合成装置である。
また、実写ＣＧ合成装置３に入力される映像を撮影するスタジオカメラＣ_Ｓは、スタジオＳＴ内で被写体を映像として撮影する、放送局等で使用される一般的なカメラである。
このように、映像合成システムＳは、ＣＧ合成装置１によって、実際の照明環境をＣＧオブジェクトに適用することで、視覚的に違和感のないＣＧを実写映像に合成することができる。
なお、ここでは、映像合成システムＳは、スタジオＳＴ内のシステムとして構成した例を示しているが、一般的な建物内部、あるいは、室外において構成することとしても構わない。
以下、ＣＧ合成装置１の構成および動作について詳細に説明する。 Stock CG synthesizing unit 3, the image photographed by the studio camera C _S, is to synthesize a CG object generated by the CG synthesizing apparatus 1. The live-action CG composition device 3 is a general composition device that composes a CG object for each frame of a video.
Moreover, the studio camera C _S for capturing the images input to the live-action CG synthesizing unit 3 captures the subject as an image in a studio ST, a general camera for use in broadcasting stations or the like.
In this way, the video composition system S can synthesize a CG that is visually uncomfortable with a live-action video by applying the actual lighting environment to the CG object by the CG composition device 1.
Here, the video composition system S is shown as an example configured as a system in the studio ST, but may be configured inside a general building or outdoors.
Hereinafter, the configuration and operation of the CG synthesizer 1 will be described in detail.

≪ＣＧ合成装置の構成≫
まず、図２を参照（適宜図１参照）して、本発明の実施形態に係るＣＧ合成装置１の構成について説明する。
図２に示すように、ＣＧ合成装置１は、３次元位置算出手段１０と、レンダリング手段２０と、を備える。 ≪Configuration of CG synthesizer≫
First, referring to FIG. 2 (refer to FIG. 1 as appropriate), the configuration of the CG synthesizer 1 according to the embodiment of the present invention will be described.
As shown in FIG. 2, the CG synthesizing apparatus 1 includes a three-dimensional position calculation unit 10 and a rendering unit 20.

３次元位置算出手段１０は、魚眼カメラＣ_Ｌ，Ｃ_Ｒで撮影されたそれぞれの画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を入力し、画像内の特徴点を対応付けて、その特徴点の３次元位置を算出するものである。
ここでは、３次元位置算出装置１０は、画像入力手段１１と、特徴点抽出手段１２と、特徴点フィルタリング手段１３と、校正手段１４と、特徴点３次元位置計算手段１５と、を備える。 3D position calculating unit 10 inputs fisheye camera C _L, C _R in captured each image (left camera image G _L, the right camera image G _R) and, in association with the feature points in the image, The three-dimensional position of the feature point is calculated.
Here, the three-dimensional position calculation apparatus 10 includes an image input unit 11, a feature point extraction unit 12, a feature point filtering unit 13, a calibration unit 14, and a feature point three-dimensional position calculation unit 15.

画像入力手段１１は、魚眼カメラＣ_Ｌ，Ｃ_Ｒが撮影した画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を入力するものである。ここでは、画像入力手段１１は、入力する魚眼カメラＣ_Ｌ，Ｃ_Ｒに対応した左画像入力手段１１Ｌと、右画像入力手段１１Ｒと、を備える。すなわち、左画像入力手段１１Ｌは、魚眼カメラＣ_Ｌが撮影した左カメラ画像Ｇ_Ｌを入力し、右画像入力手段１１Ｒは、魚眼カメラＣ_Ｒが撮影した右カメラ画像Ｇ_Ｒを入力する。
この左画像入力手段１１Ｌおよび右画像入力手段１１Ｒは、入力した画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を特徴点抽出手段１２に出力する。 Image input means 11 is used for inputting fisheye camera _C L, _{C R} are photographed image (left camera image _{G L,} the right camera image _{G R)} to. Here, the image input unit 11 is provided with fish-eye camera _C L to enter, the left image input unit 11L corresponding to _{C R,} and the right image input unit 11R, a. That is, the left image input unit 11L receives the left camera image G _L that fisheye camera C _L is taken, the right image input unit 11R inputs right camera image G _R which fisheye camera C _R is taken.
The left image input unit 11L and the right image input unit 11R output the input images (left camera image G _L and right camera image G _R ) to the feature point extraction unit 12.

特徴点抽出手段１２は、画像入力手段１１から入力された画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）から、それぞれの特徴点を抽出するものである。ここでは、特徴点抽出手段１２は、入力する画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）に対応した左画像特徴点抽出手段１２Ｌと、右画像特徴点抽出手段１２Ｒと、を備える。すなわち、左画像特徴点抽出手段１２Ｌは、左カメラ画像Ｇ_Ｌから特徴点（左画像特徴点）を抽出し、右画像特徴点抽出手段１２Ｒは、右カメラ画像Ｇ_Ｒから特徴点（右画像特徴点）を抽出する。 The feature point extraction unit 12 extracts each feature point from the image (left camera image G _L , right camera image G _R ) input from the image input unit 11. Here, the feature point extracting unit 12 includes a left image feature point extracting unit 12L corresponding to an input image (left camera image G _L , right camera image G _R ), and a right image feature point extracting unit 12R. That is, the left image feature point extracting means 12L is a feature point (left image feature point) is extracted from the left camera image G _L, the right image feature point extracting unit 12R are feature points from the right camera image G _R (right image feature Point).

ここで、特徴点は、撮影された画像内の画像特徴の変化する点であって、例えば、隣接画素に対する画素値あるいは輝度値が変化する点である。なお、この特徴点は、一般的な手法を用いて抽出することができる。例えば、左画像特徴点抽出手段１２Ｌおよび右画像特徴点抽出手段１２Ｒは、ＳＩＦＴ、ＳＵＲＦ（Speeded Up Robust Features）等によって特徴点を抽出する。
この左画像特徴点抽出手段１２Ｌおよび右画像特徴点抽出手段１２Ｒは、抽出した特徴点（画像上の位置および特徴量）を特徴点フィルタリング手段１３に出力する。なお、特徴量は、例えば、ＳＩＦＴで特徴点を検出した場合、１２８次元のＳＩＦＴ特徴量である。 Here, the feature point is a point at which the image feature in the photographed image changes, for example, a point at which a pixel value or a luminance value with respect to an adjacent pixel changes. This feature point can be extracted using a general method. For example, the left image feature point extracting unit 12L and the right image feature point extracting unit 12R extract feature points by SIFT, SURF (Speeded Up Robust Features), or the like.
The left image feature point extracting unit 12L and the right image feature point extracting unit 12R output the extracted feature points (position and feature amount on the image) to the feature point filtering unit 13. The feature amount is, for example, a 128-dimensional SIFT feature amount when a feature point is detected by SIFT.

特徴点フィルタリング手段１３は、特徴点抽出手段１２の左画像特徴点抽出手段１２Ｌおよび右画像特徴点抽出手段１２Ｒで抽出された左画像特徴点と右画像特徴点との間の動径距離および角度距離に基づいて、対応する特徴点をフィルタリングするものである。
この特徴点フィルタリング手段１３は、左画像特徴点抽出手段１２Ｌおよび右画像特徴点抽出手段１２Ｒのそれぞれで抽出された特徴量（画像特徴量）が最も類似する左画像特徴点と右画像特徴点とに対して、動径距離および角度距離がそれぞれ所定の閾値を基準に、当該閾値を上回っているか、あるいは、下回っているかにより、特徴点をフィルタリングする。ここでは、特徴点フィルタリング手段１３は、動径距離が所定の閾値よりも小さく、かつ、角度距離が所定の閾値よりも小さい対応を、正しいマッチングとして採用する。 The feature point filtering unit 13 is a radial distance and an angle between the left image feature point and the right image feature point extracted by the left image feature point extracting unit 12L and the right image feature point extracting unit 12R of the feature point extracting unit 12. Based on the distance, corresponding feature points are filtered.
The feature point filtering unit 13 includes a left image feature point and a right image feature point that have the most similar feature amounts (image feature amounts) extracted by the left image feature point extraction unit 12L and the right image feature point extraction unit 12R, respectively. On the other hand, the feature points are filtered based on whether the radial distance and the angular distance are above or below the predetermined threshold values, respectively. Here, the feature point filtering unit 13 adopts a correspondence in which the radial distance is smaller than a predetermined threshold and the angular distance is smaller than the predetermined threshold as correct matching.

なお、特徴量が最も類似するとは、例えば、左画像特徴点と右画像特徴点との間の特徴量（例えば、ＳＩＦＴ特徴量）のユークリッド距離が最小となることをいう。
また、動径距離とは、画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を極座標系で表したときの動径の距離（差）である。また、角度距離とは、画像を極座標系で表したときの偏角の距離（差）である。
これによって、特徴点フィルタリング手段１３は、予めＳＩＦＴ等によって特徴量が最も類似するとして対応している特徴点群から、誤マッチングを除去する。
そして、特徴点フィルタリング手段１３は、フィルタリングした、対応する特徴点（画像上の位置および特徴量）を、校正手段１４に出力する。 Note that the feature amount is most similar means, for example, that the Euclidean distance of the feature amount (for example, SIFT feature amount) between the left image feature point and the right image feature point is minimized.
The radial distance is a radial distance (difference) when images (left camera image G _L , right camera image G _R ) are represented in a polar coordinate system. The angular distance is a declination distance (difference) when an image is expressed in a polar coordinate system.
As a result, the feature point filtering unit 13 removes erroneous matching from the feature point group corresponding to the feature amount that is most similar by SIFT or the like in advance.
Then, the feature point filtering unit 13 outputs the filtered corresponding feature points (position and feature amount on the image) to the calibration unit 14.

ここで、図３を参照して、特徴点フィルタリング手段１３が、マッチングの判定を行う基準となる動径距離および角度距離について説明する。図３は、左カメラ画像Ｇ_Ｌ（全天画像）の座標系を示す図である。なお、右カメラ画像Ｇ_Ｒについても同様である。
図３に示すように、左カメラ画像Ｇ_Ｌの特徴点（左画像特徴点）の座標は、直交座標系で表した場合、ｐ＝（ｐ_ｘ，ｐ_ｙ）、極座標系で表した場合、動径ｒ、偏角θとして、ｐ＝（ｒｃｏｓθ，ｒｓｉｎθ）となる。 Here, with reference to FIG. 3, the radial distance and the angular distance, which serve as a reference for the feature point filtering unit 13 to perform the matching determination, will be described. FIG. 3 is a diagram illustrating a coordinate system of the left camera image G _L (full sky image). The same applies to the right camera image G _R.
As shown in FIG. 3, the coordinates of the feature point (left image feature point) of the left camera image _GL are p = (p _x , p _y ) when expressed in an orthogonal coordinate system, and when expressed in a polar coordinate system, As the moving radius r and the deflection angle θ, p = (r cos θ, r sin θ).

すなわち、特徴点フィルタリング手段１３は、左カメラ画像Ｇ_Ｌおよび右カメラ画像Ｇ_Ｒにおいて、極座標系で用いられる動径ｒおよび偏角θの距離により、特徴点のフィルタリングを行う。
ここで、距離計算を行う対象となる、左カメラ画像Ｇ_Ｌの特徴点の座標をｐ＝（ｐ_ｘ，ｐ_ｙ）、図示を省略しているが、右カメラ画像Ｇ_Ｒの特徴点の座標をｑ＝（ｑ_ｘ，ｑ_ｙ）とする。
このとき、特徴点フィルタリング手段１３は、２点間の動径の距離（動径距離ｒ_ｄ）を、以下の式（１）により求める。 That is, the feature point filtering means 13, in the left camera image G _L and the right camera image G _R, the distance of the radius vector r and declination θ used in a polar coordinate system, the filtering feature points.
Here, the target of the distance calculation, coordinates p = characteristic points of the left camera image _{_{G L (p x, p y}} ), although not shown, the feature point of the right camera image G _R coordinates Is q = (q _x , q _y ).
At this time, the feature point filtering means 13 obtains a radial distance (radial distance r _d ) between the two points by the following equation (1).

また、特徴点フィルタリング手段１３は、２点間の偏角の距離（角度距離θ_ｄ）を、以下の式（２）により求める。 Further, the feature point filtering means 13 obtains the declination distance (angular distance θ _d ) between the two points by the following equation (2).

そして、特徴点フィルタリング手段１３は、以下の式（３）に示すように、動径距離ｒ_ｄが閾値ｒ_{ｔｈｒｅｓｓｈｏｌｄ}よりも小さく、かつ、角度距離θ_ｄが閾値θ_{ｔｈｒｅｓｓｈｏｌｄ}よりも小さい２点ｐ，ｑをマッチングする点とする。 Then, the feature point filtering means 13, as shown in the following equation (3), radial distance _{r d} is less than the threshold value _{r Thresshold,} and angular distance theta _d threshold theta 2 points p less than _Thresshold, Let q be a matching point.

なお、閾値ｒ_{ｔｈｒｅｓｓｈｏｌｄ}および閾値θ_{ｔｈｒｅｓｓｈｏｌｄ}は、以下の式（４）および式（５）で計算される値とする。 The threshold _{r Thresshold} and threshold theta _Thresshold is a value that is calculated by the following equation (4) and (5).

ここで、ｐ_{ｗｉｄｔｈ}は、画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）の横幅（画素数）、ｐ_{ｈｅｉｇｈｔ}は、同画像の縦幅（画素数）を示す（図３参照）。
また、ｒ_{ｐａｒｔｉｔｉｏｎ}は、画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を極座標で表したときの動径ｒの分割数（動径分割数）を示し、θ_{ｐａｒｔｉｔｉｏｎ}は、同画像を極座標で表したときの偏角θの分割数（偏角分割数）を示す。
この動径分割数ｒ_{ｐａｒｔｉｔｉｏｎ}および偏角分割数θ_{ｐａｒｔｉｔｉｏｎ}は、画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を動径および偏角で区分する細かさを示す指標であって、画像の解像度等に基づいて予め定められる。もちろん、これらの分割数は、外部から設定されることとしてもよい。
また、動径分割数ｒ_{ｐａｒｔｉｔｉｏｎ}は、閾値ｒ_{ｔｈｒｅｓｓｈｏｌｄ}が少なくとも魚眼カメラＣ_Ｌ，Ｃ_Ｒの離間距離に相当する左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒの視差以上となる値とする。
なお、特徴点フィルタリング手段１３は、さらに、対応する特徴点から、最小メジアン法等によって外れ値を除外しておくことが望ましい。 Here, p _width represents the horizontal width (number of pixels) of the image (left camera image G _L , right camera image G _R ), and p _height represents the vertical width (number of pixels) of the image (see FIG. 3).
R _partition indicates the number of radial radius divisions (radial radius division number) when the images (left camera image G _L , right camera image G _R ) are expressed in polar coordinates, and θ _partition indicates the image in polar coordinates. The number of divisions of declination θ (the number of declination divisions) is shown.
The radial division number r _partition and the declination division number θ _partition are indices indicating the fineness of dividing an image (left camera image G _L , right camera image G _R ) by radial and declination, It is determined in advance based on the resolution or the like. Of course, the number of divisions may be set from the outside.
Also, radial division number _{r partition} the threshold _{r Thresshold} is at least a fish left camera image _{G L,} the value equal to or larger than the disparity of the right camera image _{G R} which corresponds to the distance between the eye cameras _C L, _{C R.}
Note that it is desirable that the feature point filtering means 13 further excludes outliers from the corresponding feature points by the minimum median method or the like.

ここで、図４を参照して、特徴点フィルタリング手段１３が行う閾値によるマッチング判定について視覚的に説明する。なお、図４は、左カメラ画像Ｇ_Ｌおよび右カメラ画像Ｇ_Ｒを同一座標上で表している。
図４（ａ）に示すように、特徴点フィルタリング手段１３は、特徴点ｑ_ａが特徴点ｐ_ａにマッチングするか否かは、動径および偏角の差が、領域Ｂａの範囲であるか否かにより行う。
また、特徴点フィルタリング手段１３は、特徴点ｐ_ａよりも動径が小さい位置に存在する特徴点ｐ_ｂについては、領域Ｂａよりも範囲を狭くて、動径および偏角の差が、領域Ｂｂの範囲であるか否かにより行う。
これによって、特徴点フィルタリング手段１３は、動径が小さいほど、同じ特徴点であると判定する面積範囲を狭くし、動径が大きいほど、当該面積範囲を広げて、特徴点のマッチング判定を行う。 Here, with reference to FIG. 4, the matching determination by the threshold value which the feature point filtering means 13 performs is demonstrated visually. Incidentally, FIG. 4 shows the left camera image G _L and the right camera images G _R on the same coordinates.
As shown in FIG. 4 (a), or the feature point filtering means 13, whether the feature point q _a is matched to the feature point p _a, the difference between the radius vector and polarization angle, a range of the area Ba Depending on whether or not.
Further, the feature point filtering means 13, for the feature point p _b at the position movement diameter is smaller than the feature point p _a, narrow range than the region Ba, the difference between the radius vector and declination, region Bb It is performed depending on whether or not it is within the range.
As a result, the feature point filtering means 13 narrows the area range to be determined as the same feature point as the radius is smaller, and widens the area range as the radius is larger, and performs the feature point matching judgment. .

なお、参考までに、図４（ｂ）に、従来の画像上の単純距離によってマッチング判定を行う場合の閾値範囲について説明しておく。図４（ｂ）に示すように、従来は、たとえ動径距離が異なる特徴点ｐ_ｃ，ｐ_ｄであっても、同一の距離（同一半径の円領域Ｂｃ，Ｂｄ）の範囲で、特徴点のマッチングを行っていた。
このように、特徴点フィルタリング手段１３は、画像中心から離れる（動径が大きい）ほど、より多くの画素をマッチング判定の画素とすることで、魚眼カメラの特性に合わせて、対応する特徴点を精度よくフィルタリングすることができる。
図２に戻って、ＣＧ合成装置１の構成について説明を続ける。 For reference, FIG. 4B illustrates a threshold range in the case of performing matching determination based on a simple distance on a conventional image. As shown in FIG. 4B, conventionally, even if the feature points p _c and p _d have different radial distances, the feature points are within the same distance (the circular regions Bc and Bd having the same radius). Was matching.
As described above, the feature point filtering unit 13 sets more pixels as matching determination pixels as the distance from the image center increases (the radius is larger), so that the corresponding feature points are matched to the characteristics of the fisheye camera. Can be accurately filtered.
Returning to FIG. 2, the description of the configuration of the CG synthesizer 1 will be continued.

校正手段１４は、特徴点フィルタリング手段１３でフィルタリングされた特徴点に基づいて、カメラ校正を行うものである。すなわち、校正手段１４は、左カメラ画像Ｇ_Ｌと右カメラ画像Ｇ_Ｒとが、魚眼カメラＣ_Ｌ，Ｃ_Ｒの光軸が平行し、魚眼カメラ内の撮像素子の水平軸が一致するように、一方の画像（例えば、左カメラ画像Ｇ_Ｌ）の特徴点を基準として、他方の画像（例えば、右カメラ画像Ｇ_Ｒ）の特徴点の画像上の位置補正を行う。 The calibration unit 14 performs camera calibration based on the feature points filtered by the feature point filtering unit 13. That is, calibration means 14, so that the left camera image G _L and the right camera image G _R, parallel optical axis of the fisheye camera C _L, C _R, the horizontal axis of the imaging device in the fisheye camera matches The position correction of the feature point of the other image (for example, the right camera image G _R ) is performed on the basis of the feature point of one image (for example, the left camera image G _L ).

この校正手段１４は、特徴点フィルタリング手段１３でフィルタリングされた対応（マッチング）する特徴点群から、一方の画像と他方の画像とが平行になるような基本行列を求め、両方の画像の特徴点の位置を基本行列により変換（回転）する。
この基本行列は、例えば、対応する少なくとも８点の特徴点により求めることができる。この基本行列を求める８点アルゴリズムについては、一般的なものであるため、ここでは説明を省略する。
この校正手段１４は、校正した後の対応する特徴点の位置（画像上の位置）を、特徴点３次元位置計算手段１５に出力する。 The calibrating unit 14 obtains a basic matrix such that one image and the other image are parallel from the corresponding (matching) feature point group filtered by the feature point filtering unit 13, and feature points of both images Is converted (rotated) by a basic matrix.
This basic matrix can be obtained from, for example, at least eight corresponding feature points. Since the 8-point algorithm for obtaining the basic matrix is a general algorithm, description thereof is omitted here.
The calibrating unit 14 outputs the position of the corresponding feature point after calibration (position on the image) to the feature point three-dimensional position calculating unit 15.

特徴点３次元位置計算手段１５は、校正後の画像上の対応する２点の特徴点の位置と、魚眼カメラＣ_Ｌ，Ｃ_Ｒ間の距離とから、特徴点の３次元空間内での位置（特徴点３次元位置）を計算するものである。 Feature point three-dimensional position calculating means 15, on the image after calibration the position of the feature point of the corresponding two points, fisheye camera C _L, and a distance between the C _R, the feature points in the three-dimensional space The position (feature point three-dimensional position) is calculated.

具体的には、特徴点３次元位置計算手段１５は、図５に示すように、魚眼カメラＣ_Ｌ，Ｃ_Ｒの主点位置から３次元空間上の特徴点（特徴点３次元位置Ｐ）までのベクトル（単位方向ベクトル）ｕ，ｖを求める。そして、特徴点３次元位置計算手段１５は、予め３次元空間上の位置が既知である魚眼カメラＣ_Ｌ，Ｃ_Ｒ間の距離ｗと、ベクトルｕ，ｖとにより三角測量の原理で、特徴点の３次元空間内での位置（特徴点３次元位置Ｐ）を計算する。
なお、それぞれのベクトルは、魚眼カメラＣ_Ｌ，Ｃ_Ｒで撮影された左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ（校正されている場合は、校正後の画像）の特徴点の座標から求めることができる。 Specifically, the feature point three-dimensional position calculating means 15, as shown in FIG. 5, fisheye camera C _L, C principal point feature point in a three-dimensional space from the position of _R (3-dimensional feature point position P) Vectors (unit direction vectors) u and v are obtained. Then, the feature point three-dimensional position calculating means 15, a fish-eye camera C _L position on advance three-dimensional space is _known, the distance w between C _R, the principle of triangulation by a vector u, v, wherein The position of the point in the three-dimensional space (feature point three-dimensional position P) is calculated.
Note that each of the vectors, fisheye camera C _L, C _R in the captured left camera image G _L, the right camera image G _{R (if} it is calibrated, the image after calibration) determined from the coordinates of the feature points of be able to.

ここで、図６を参照して、特徴点３次元位置計算手段１５が算出するベクトルについて説明する。なお、魚眼カメラＣ_Ｌ，Ｃ_Ｒのベクトル算出は同じ手法であるため、ここでは、魚眼カメラＣ_Ｌを始点とするベクトルｕについて説明する。
魚眼カメラＣ_Ｌは等距離射影方式のカメラであることから、魚眼カメラＣ_Ｌの焦点距離をｆとしたとき、図６（ａ）に示すように、魚眼カメラＣ_Ｌの天頂方向のベクトルＴとベクトルｕとの角（天頂角Ｌ_θ１）と、画像（左カメラ画像Ｇ_Ｌ）上の特徴点の座標ｐの動径ｒとには、以下の式（６）の関係が成り立つ。 Here, with reference to FIG. 6, the vector which the feature point three-dimensional position calculation means 15 calculates is demonstrated. Since fisheye camera C _L, vector calculation of C _R is the same technique will be described here vector u whose starting point is the fisheye camera C _L.
Since fisheye camera C _L is a camera equidistant projection method, when the focal length of the fisheye camera C _L is f, as shown in FIG. 6 (a), the zenith of the fisheye camera C _L The following equation (6) is established between the angle between the vector T and the vector u (the zenith angle L _θ1 ) and the radius r of the coordinate p of the feature point on the image (left camera image G _L ).

また、図６（ｂ）に示すように、ベクトルｕを画像（左カメラ画像Ｇ_Ｌ）上に投影したときの偏角Ｌ_θ２は、図３で説明したように、画像（左カメラ画像Ｇ_Ｌ）を極座標系で表したときの偏角と同じである。
よって、特徴点３次元位置計算手段１５は、魚眼カメラＣ_Ｌに対応するベクトルｕを、天頂角Ｌ_θ１および偏角Ｌ_θ２で特定する。同様に、特徴点３次元位置計算手段１５は、魚眼カメラＣ_Ｒに対応するベクトルｖを、天頂角Ｒ_θ１および偏角Ｒ_θ２で特定する。 Further, as shown in FIG. 6 (b), declination L _.theta.2 in projecting the vector u on the image (left camera image G _L), as described in FIG. 3, the image (left camera image G _L ) Is the same as the declination when expressed in the polar coordinate system.
Therefore, the feature point three-dimensional position calculating unit 15, the vector u corresponding to fisheye camera C _L, identified in zenith angle L _.theta.1 and declination L _.theta.2. Similarly, the feature point three-dimensional position calculating unit 15, the vector v corresponding to the fisheye camera C _R, identified in zenith angle R _.theta.1 and declination R _.theta.2.

そして、特徴点３次元位置計算手段１５は、図５に示すように、魚眼カメラＣ_Ｌ，Ｃ_Ｒ間の距離ｗと、ベクトルｕ（Ｌ_θ１，Ｌ_θ２），ｖ（Ｒ_θ１，Ｒ_θ２）とにより三角測量の原理で、特徴点の３次元空間内での位置（特徴点３次元位置Ｐ）を計算する。
なお、３次元空間内では、ベクトルｕ，ｖはねじれの関係になり、交点をもたない場合がある。そこで、特徴点３次元位置計算手段１５は、ベクトルｕ，ｖの最接近点を特徴点３次元位置Ｐとして計算する。
図２に戻って、ＣＧ合成装置１の構成について説明を続ける。 Then, as shown in FIG. 5, the feature point three-dimensional position calculation means 15 calculates the distance w between the fisheye cameras C _L and C _R and the vectors u (L _θ1 , L _θ2 ) and v (R _θ1 , R _θ2). The position of the feature point in the three-dimensional space (feature point three-dimensional position P) is calculated by the principle of triangulation.
In the three-dimensional space, the vectors u and v have a twisted relationship and may not have an intersection. Therefore, the feature point three-dimensional position calculation means 15 calculates the closest point of the vectors u and v as the feature point three-dimensional position P.
Returning to FIG. 2, the description of the configuration of the CG synthesis apparatus 1 will be continued.

特徴点３次元位置計算手段１５は、計算で求めた特徴点３次元位置を、レンダリング手段２０に出力する。また、ここでは、特徴点３次元位置計算手段１５は、特徴点３次元位置に対応する魚眼カメラの画像（例えば、左カメラ画像Ｇ_Ｌ）の特徴点の位置を、レンダリング手段２０に出力する。 The feature point three-dimensional position calculation unit 15 outputs the feature point three-dimensional position obtained by the calculation to the rendering unit 20. Here, the feature point three-dimensional position calculation means 15 outputs the position of the feature point of the fisheye camera image (for example, the left camera image G _L ) corresponding to the feature point three-dimensional position to the rendering means 20. .

これによって、３次元位置算出手段１０は、特徴点マッチングを行う際に、全天画像を平面画像に展開するような前処理を必要とせず、動径距離、角度距離の単純な閾値判定により、対応する特徴点のフィルタリングを行うため、計算コストを抑えることができる。また、３次元位置算出手段１０は、魚眼カメラの特性を考慮した距離（動径距離、角度距離）を用いるため、画像内の実際の距離により特徴点をフィルタリングする従来の手法に比べて、特徴点のマッチングを精度よく行うことができる。 As a result, the three-dimensional position calculation means 10 does not require preprocessing for developing the whole sky image into a flat image when performing feature point matching, and by simple threshold determination of the radial distance and the angular distance, Since the corresponding feature points are filtered, the calculation cost can be reduced. In addition, since the three-dimensional position calculation means 10 uses a distance (radial distance, angular distance) in consideration of the characteristics of the fisheye camera, compared to the conventional method of filtering feature points by the actual distance in the image, Feature point matching can be performed with high accuracy.

レンダリング手段２０は、３次元位置算出手段１０で算出された特徴点３次元位置によって特定される照明環境をモデル化し、素材ＣＧデータからその照明環境に応じたＣＧ画像（ＣＧオブジェクト）を生成するものである。
ここでは、レンダリング手段２０は、光源分布マップ生成手段２１と、陰影処理手段２２と、を備える。 The rendering unit 20 models an illumination environment specified by the feature point 3D position calculated by the 3D position calculation unit 10 and generates a CG image (CG object) corresponding to the illumination environment from the material CG data. It is.
Here, the rendering unit 20 includes a light source distribution map generation unit 21 and a shadow processing unit 22.

光源分布マップ生成手段２１は、３次元位置算出手段１０で算出された特徴点３次元位置から、照明環境の光源分布の情報を、特徴点３次元位置を頂点とする三角メッシュに対応付けて、光源分布マップを生成するものである。
すなわち、光源分布マップ生成手段２１は、特徴点３次元位置を頂点とする三角メッシュごとに、当該三角メッシュ内の領域を、光源分布画像（ここでは、左カメラ画像Ｇ_Ｌのものとする）中の当該三角メッシュの各頂点である特徴点で囲まれる領域に対応付ける。 The light source distribution map generating means 21 associates the information on the light source distribution of the illumination environment from the three-dimensional position of the feature point calculated by the three-dimensional position calculation means 10 with a triangular mesh having the feature point three-dimensional position as a vertex, A light source distribution map is generated.
That is, the light source distribution map generation unit 21 sets the region in the triangular mesh for each triangular mesh having the three-dimensional position of the feature point as a vertex in the light source distribution image (here, the left camera image _GL ). Are associated with the region surrounded by the feature points that are the vertices of the triangular mesh.

これによって、光源分布マップ生成手段２１は、三角メッシュごとに画像領域を対応付けて、照明環境となる光源分布マップを生成することができる。
この光源分布マップ生成手段２１は、モデル化した照明環境のデータである光源分布マップを陰影処理手段２２に出力する。 As a result, the light source distribution map generating means 21 can generate a light source distribution map as an illumination environment by associating image regions with each triangular mesh.
The light source distribution map generation unit 21 outputs a light source distribution map, which is modeled illumination environment data, to the shadow processing unit 22.

陰影処理手段２２は、光源分布マップ生成手段２１でモデル化された光源分布マップに基づいて、素材ＣＧデータＣＧ_Ｄを陰影処理して、実写映像に合成するためのＣＧオブジェクトを生成するものである。
この陰影処理手段２２は、ＣＧオブジェクトを配置する実空間（３次元空間）上の位置で、カメラ（スタジオカメラＣ_Ｓ）を視点位置とするＣＧオブジェクトを生成する。このとき、陰影処理手段２２は、素材ＣＧデータＣＧ_Ｄに、光源分布マップのそれぞれの三角メッシュ領域で特定される光源輝度に基づいて、陰影処理を施す。 The shadow processing unit 22 performs a shading process on the material CG data CG _D on the basis of the light source distribution map modeled by the light source distribution map generation unit 21 and generates a CG object to be combined with a live-action image. .
The shadow processing means 22 generates a CG object having a camera (studio camera C _S ) as a viewpoint position at a position on the real space (three-dimensional space) where the CG object is placed. At this time, the shading processing means 22 performs shading processing on the material CG data CG _D based on the light source luminance specified in each triangular mesh region of the light source distribution map.

この陰影処理手段２２において、陰影処理を行う際の光源分布マップの役割について簡単に説明する。
図７に示すように、光源分布マップＭは、照明環境を三角メッシュでモデル化したもので、個々の三角メッシュには、例えば、光源Ｌそのものの輝度情報や、光源Ｌの間接光の輝度情報が対応対けられている。 The role of the light source distribution map when performing shadow processing in the shadow processing means 22 will be briefly described.
As shown in FIG. 7, the light source distribution map M is obtained by modeling the illumination environment with a triangular mesh. Each triangular mesh includes, for example, luminance information of the light source L itself and luminance information of indirect light of the light source L. Has been dealt with.

すなわち、陰影処理手段２２は、ＣＧ形状およびその形状に照射する光の方向に対応する三角メッシュの光源情報に基づいて、ＣＧオブジェクトの画素値を決定する。
この陰影処理手段２２における陰影処理は、一般的な手法を用いればよく、例えば、ＩＢＬ（Image Based Lighting）等の手法を用いればよい。
この陰影処理手段２２は、生成したＣＧオブジェクトを、実写ＣＧ合成装置３に出力する。 That is, the shadow processing means 22 determines the pixel value of the CG object based on the light source information of the triangular mesh corresponding to the CG shape and the direction of the light that irradiates the shape.
The shading processing in the shading processing means 22 may use a general method, for example, a method such as IBL (Image Based Lighting).
The shadow processing means 22 outputs the generated CG object to the live-action CG composition device 3.

これによって、レンダリング手段２０は、３次元位置算出手段１０で算出された特徴点の３次元位置から、光源分布マップを生成し、実際の照明環境に合ったＣＧオブジェクトを生成することができる。 Thus, the rendering unit 20 can generate a light source distribution map from the three-dimensional position of the feature point calculated by the three-dimensional position calculation unit 10 and generate a CG object suitable for the actual lighting environment.

以上説明したように、ＣＧ合成装置１を構成することで、ＣＧ合成装置１は、魚眼カメラの特性に応じた特徴点のフィルタリングにより、特徴点のマッチングの精度を高め、より正確な照明環境をモデル化することができる。これによって、ＣＧ合成装置１は、実写映像に合成しても、違和感のないＣＧオブジェクトを生成することができる。 As described above, by configuring the CG synthesizer 1, the CG synthesizer 1 increases the accuracy of feature point matching by filtering feature points according to the characteristics of the fisheye camera, thereby providing a more accurate lighting environment. Can be modeled. Thereby, the CG synthesizing apparatus 1 can generate a CG object that does not feel uncomfortable even if it is synthesized with a live-action video.

≪ＣＧ合成装置の動作≫
次に、図８を参照（適宜図１，図２参照）して、本発明の実施形態に係るＣＧ合成装置１の動作について説明する。
まず、ＣＧ合成装置１は、３次元位置算出手段１０の画像入力手段１１（左画像入力手段１１Ｌ，右画像入力手段１１Ｒ）によって、照明環境を撮影した魚眼カメラＣ_Ｌ，Ｃ_Ｒから、撮影画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）を入力する（ステップＳ１）。 ≪Operation of CG synthesizer≫
Next, referring to FIG. 8 (refer to FIGS. 1 and 2 as appropriate), the operation of the CG synthesis apparatus 1 according to the embodiment of the present invention will be described.
First, the CG synthesizing apparatus 1 is photographed from the fisheye cameras C _L and C _R that photographed the illumination environment by the image input unit 11 (the left image input unit 11L and the right image input unit 11R) of the three-dimensional position calculation unit 10. Images (left camera image G _L , right camera image G _R ) are input (step S1).

そして、ＣＧ合成装置１は、特徴点抽出手段１２（左画像特徴点抽出手段１２Ｌ，右画像特徴点抽出手段１２Ｒ）によって、ステップＳ１で入力された画像からＳＩＦＴ等により特徴点を抽出する（ステップＳ２）。
すなわち、左画像特徴点抽出手段１２Ｌが、左カメラ画像Ｇ_Ｌから特徴点（左画像特徴点）を抽出し、右画像特徴点抽出手段１２Ｒが、右カメラ画像Ｇ_Ｒから特徴点（右画像特徴点）を抽出する。
そして、ＣＧ合成装置１は、特徴点フィルタリング手段１３によって、ステップＳ２で抽出された、特徴量が最も類似する左画像特徴点と右画像特徴点とに対して、動径距離が所定の閾値よりも小さく、かつ、角度距離が所定の閾値よりも小さい対応を、正しいマッチングとして採用（フィルタリング）する（ステップＳ３）。
これによって、特徴点フィルタリング手段１３は、魚眼カメラの特性に合わせて、対応する特徴点を精度よくフィルタリングすることができる。 Then, the CG synthesizing apparatus 1 extracts feature points from the image input in step S1 by SIFT or the like by the feature point extraction unit 12 (left image feature point extraction unit 12L, right image feature point extraction unit 12R) (step S1). S2).
That is, the left image feature point extracting unit 12L extracts feature points (left image feature point) from the left camera image G _L, the right image feature point extracting unit 12R is, feature points from the right camera image G _R (right image feature Point).
Then, the CG synthesis apparatus 1 uses the feature point filtering unit 13 to extract the radial distance from the predetermined threshold with respect to the left image feature point and the right image feature point with the most similar feature amount extracted in step S2. That is smaller and the angular distance is smaller than a predetermined threshold is adopted (filtered) as correct matching (step S3).
Thereby, the feature point filtering means 13 can accurately filter the corresponding feature points according to the characteristics of the fisheye camera.

そして、ＣＧ合成装置１は、校正手段１４によって、カメラ校正による特徴点の位置補正を行う（ステップＳ４）。
すなわち、校正手段１４は、ステップＳ３でフィルタリングされたマッチングする特徴点群から、位置補正を行うための基本行列を求め、両画像（左カメラ画像Ｇ_Ｌおよび右カメラ画像Ｇ_Ｒ）の特徴点の画像上の位置補正を行う。 Then, the CG synthesizing apparatus 1 uses the calibration unit 14 to correct the position of the feature point by camera calibration (step S4).
That is, calibration means 14, the feature point group matching is filtered in step S3, determined the basic matrix for performing the position correction, the feature points of two images (left camera image G _L and the right camera image G _R) Perform position correction on the image.

そして、ＣＧ合成装置１は、特徴点３次元位置計算手段１５によって、ステップＳ４の校正後の画像上の対応する２点の特徴点の位置と、魚眼カメラＣ_Ｌ，Ｃ_Ｒ間の距離とから、特徴点の３次元空間内での位置（特徴点３次元位置）を計算する（ステップＳ５）。
このステップＳ５までの動作によって、魚眼カメラＣ_Ｌ，Ｃ_Ｒが撮影した画像（左カメラ画像Ｇ_Ｌ，右カメラ画像Ｇ_Ｒ）から、特徴点の３次元位置を算出することができる。 Then, CG synthesizing apparatus 1, the feature point three-dimensional position calculating unit 15, the position of the corresponding two points characteristic points on the calibration image after the step S4, fisheye camera C _L, and the distance between C _R Then, the position of the feature point in the three-dimensional space (the feature point three-dimensional position) is calculated (step S5).
The operation up to this step S5, fisheye camera C _L, C _R are photographed image (left camera image G _L, the right camera image G _R) from, it is possible to calculate the three-dimensional position of the feature point.

その後、ＣＧ合成装置１は、レンダリング手段２０の光源分布マップ生成手段２１によって、ステップＳ５までの動作で算出された特徴点の３次元位置から、照明環境の光源分布の情報を、特徴点３次元位置を頂点とする三角メッシュに対応付けて、光源分布マップを生成する（ステップＳ６）。
これによって、実空間における照明環境がモデル化されることになる。 Thereafter, the CG synthesis device 1 uses the light source distribution map generation unit 21 of the rendering unit 20 to obtain the light source distribution information of the illumination environment from the three-dimensional position of the feature points calculated by the operations up to step S5. A light source distribution map is generated in association with the triangular mesh whose position is the vertex (step S6).
As a result, the lighting environment in the real space is modeled.

そして、ＣＧ合成装置１は、陰影処理手段２２によって、ステップＳ６で生成された光源分布マップに基づいて、素材ＣＧデータに陰影処理を施して、実写映像に合成するためのＣＧオブジェクトを生成する（ステップＳ７）。
これによって、ＣＧ合成装置１は、実空間の照明環境に合った、実写映像に合成されても違和感のないＣＧオブジェクトを生成することができる。 Then, the CG composition device 1 performs a shadow process on the material CG data based on the light source distribution map generated in step S6 by the shadow processing means 22, and generates a CG object to be combined with the live-action video ( Step S7).
As a result, the CG composition device 1 can generate a CG object that is suitable for the lighting environment in the real space and does not feel uncomfortable even if it is synthesized with the live-action video.

以上、本発明の実施形態に係るＣＧ合成装置１の構成および動作について説明したが、本発明はこの実施形態に限定されるものではない。
例えば、ここでは、ＣＧ合成装置１の構成として、３次元位置算出手段１０と、レンダリング手段２０と、を備える構成としたが、これらは独立して構成してもよい。
すなわち、３次元位置算出手段１０を３次元位置算出装置、レンダリング手段２０をレンダリング装置として分離して構成してもよい。 As mentioned above, although the structure and operation | movement of CG synthesizing | combining apparatus 1 which concern on embodiment of this invention were demonstrated, this invention is not limited to this embodiment.
For example, here, the configuration of the CG synthesizing apparatus 1 includes the three-dimensional position calculation unit 10 and the rendering unit 20, but these may be configured independently.
That is, the three-dimensional position calculating unit 10 may be configured as a three-dimensional position calculating device and the rendering unit 20 as a rendering device.

また、ここでは、ＣＧ合成装置１の３次元位置算出手段１０は、校正手段１４を備える構成としたが、予め魚眼カメラＣ_Ｌ，Ｃ_Ｒが校正された状態で配置されたものであれば、校正手段１４を構成から省略しても構わない。
また、逆に、魚眼カメラＣ_Ｌ，Ｃ_Ｒが無校正な状態で配置されたものであれば、画像入力手段１１と特徴点抽出手段１２との間に、カメラ校正を行う第２の校正手段を備えることとしてもよい。
この場合、第２の校正手段は、例えば、一方の画像（例えば、左カメラ画像Ｇ_Ｌ）を基準として、他方の画像（例えば、右カメラ画像Ｇ_Ｒ）が最も類似するように、当該他方の画像を回転させることとする。 Here, the three-dimensional position calculation means 10 of the CG synthesizer 1 is configured to include the calibration means 14, but as long as the fisheye cameras C _L and _CR are arranged in advance in a calibrated state. The calibration unit 14 may be omitted from the configuration.
Conversely, as long as the fisheye camera C _L, C _R are arranged in a non-calibration state, between the image input unit 11 and feature point extracting unit 12, a second calibration performing camera calibration Means may be provided.
In this case, for example, the second calibration unit uses the one image (for example, the left camera image G _L ) as a reference so that the other image (for example, the right camera image G _R ) is most similar. The image is rotated.

以上説明したＣＧ合成装置１およびその変形例は、コンピュータを前記した各手段として機能させるプログラム（ＣＧ合成プログラム）で動作させることができる。
また、ＣＧ合成装置１を、３次元位置算出装置およびレンダリング装置として分離して構成する場合、３次元位置算出装置は、コンピュータを、先に説明した３次元位置算出手段１０の各手段として機能させるプログラム（３次元位置算出プログラム）で動作させることができる。また、レンダリング装置は、コンピュータを、先に説明したレンダリング手段２０の各手段として機能させるプログラム（レンダリングプログラム）で動作させる。また、これらのプログラムは、記録媒体に記録して配布したり、通信回線を介して配布したりすることも可能である。 The CG synthesizer 1 described above and its modification can be operated by a program (CG synthesis program) that causes a computer to function as each of the above-described means.
When the CG composition device 1 is configured separately as a three-dimensional position calculation device and a rendering device, the three-dimensional position calculation device causes the computer to function as each unit of the three-dimensional position calculation unit 10 described above. It can be operated by a program (three-dimensional position calculation program). In addition, the rendering apparatus operates the computer with a program (rendering program) that functions as each unit of the rendering unit 20 described above. Also, these programs can be recorded on a recording medium and distributed, or distributed via a communication line.

Ｓ映像合成システム
１ＣＧ合成装置
１０３次元位置算出手段（３次元位置算出装置）
１１画像入力手段
１１Ｌ左画像入力手段
１１Ｒ右画像入力手段
１２特徴点抽出手段
１２Ｌ左画像特徴点抽出手段
１２Ｒ右画像特徴点抽出手段
１３特徴点フィルタリング手段
１４校正手段
１５特徴点３次元位置計算手段
２０レンダリング手段（レンダリング装置）
２１光源分布マップ生成手段
２２陰影処理手段
Ｃ_Ｌ魚眼カメラ
Ｃ_Ｒ魚眼カメラ S image composition system 1 CG composition device 10 3D position calculation means (3D position calculation device)
DESCRIPTION OF SYMBOLS 11 Image input means 11L Left image input means 11R Right image input means 12 Feature point extraction means 12L Left image feature point extraction means 12R Right image feature point extraction means 13 Feature point filtering means 14 Calibration means 15 Feature point three-dimensional position calculation means 20 Rendering means (rendering device)
21 light source distribution map generating means 22 shading means C _L fisheye camera C _R fisheye camera

Claims

A three-dimensional position calculation device for calculating a three-dimensional position of a feature point in an image from images captured by two fisheye cameras having equidistant projection type fisheye lenses,
Feature point extracting means for extracting feature points with varying image characteristics from one image and the other image captured by the two fisheye cameras;
In the feature point pair in which the image feature quantity at the feature point extracted by the feature point extracting unit is the most similar between the two images, a motion that is a difference in radius of each feature point when the two images are expressed in polar coordinates. A feature point filtering means for filtering feature point pairs in which the radial distance is smaller than a predetermined threshold and the angular distance that is a difference in declination is smaller than the predetermined threshold as a feature point pair indicating the same feature point;
A feature point three-dimensional position calculation means for calculating a three-dimensional position of the feature point based on the position of both of the feature point pairs filtered by the feature point filtering means and the distance of the fisheye camera;
A three-dimensional position calculation apparatus comprising:

A basic matrix for parallelizing the other image with the one image is obtained from positions on both images of the plurality of feature point pairs filtered by the feature point filtering means, and the feature points of the other image are obtained. Calibration means for converting according to the basic matrix further comprises
The three-dimensional position calculation apparatus according to claim 1, wherein the feature point three-dimensional position calculation unit calculates the three-dimensional position based on the converted feature point pair.

The feature point three-dimensional position calculation means is represented by a direction vector composed of a zenith angle and a declination angle specified by a position on each image of the feature point pair through the principal point position of the fisheye camera. The three-dimensional position calculation apparatus according to claim 1, wherein a three-dimensional position where two straight lines are closest is calculated as a three-dimensional position of the feature point.

The three-dimensional position calculation program for functioning a computer as a three-dimensional position calculation apparatus as described in any one of Claims 1-3.

CG synthesis for generating a CG object to be synthesized with a live-action video photographed in the illumination environment from images photographed by two fisheye cameras equipped with equidistant projection fisheye lenses for photographing the illumination environment A device,
The three-dimensional position calculation device according to any one of claims 1 to 3, wherein a three-dimensional position of a feature point in the image is calculated from images captured by the two fisheye cameras;
For each triangular mesh whose vertex is the three-dimensional position calculated by the three-dimensional position calculation device, each vertex of the triangular mesh in the image photographed by any of the fisheye cameras A light source distribution map generating means for generating a light source distribution map that models the illumination environment in association with a region surrounded by feature points that are:
Based on the light source distribution map generated by the light source distribution map generating means, the shadow processing means for generating the CG object by shading the material CG data,
A CG synthesizer comprising: