
WO2020050084A1 - Storage medium having guidance control program stored thereon - Google Patents

Storage medium having guidance control program stored thereon

Info

Publication number
WO2020050084A1
WO2020050084A1 (PCT/JP2019/033282; JP2019033282W)
Authority
WO
WIPO (PCT)
Prior art keywords
state quantity
kalman filter
moving object
covariance matrix
derived
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2019/033282
Other languages
French (fr)
Japanese (ja)
Inventor
真一郎 坂井
久旺 新井
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Japan Aerospace Exploration Agency JAXA
Original Assignee
Japan Aerospace Exploration Agency JAXA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Japan Aerospace Exploration Agency JAXA filed Critical Japan Aerospace Exploration Agency JAXA

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B64AIRCRAFT; AVIATION; COSMONAUTICS
    • B64GCOSMONAUTICS; VEHICLES OR EQUIPMENT THEREFOR
    • B64G1/00Cosmonautic vehicles
    • B64G1/22Parts of, or equipment specially adapted for fitting in or to, cosmonautic vehicles
    • B64G1/24Guiding or controlling apparatus, e.g. for attitude control
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01CMEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/24Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for cosmonautical navigation

Definitions

  • the present invention relates to a storage medium storing a guidance control program.
  • Priority is claimed on Japanese Patent Application No. 2018-168193, filed on September 7, 2018, the content of which is incorporated herein by reference.
  • One embodiment of the present invention provides a storage medium storing a guidance control program capable of optimizing a guidance trajectory so as to reduce a navigation error.
  • One embodiment of the present invention provides a non-transitory computer-readable storage medium storing a guidance control program that causes a computer mounted on a moving object including a plurality of sensors to: predict a future state quantity of the moving object using a Kalman filter, based on detection values of a first sensor and a second sensor included in the plurality of sensors; derive, based on parameters of the Kalman filter, an index value indicating the degree of the error of the state quantity of the moving object; and derive, by solving an optimization problem, the state quantity of the moving object at which an evaluation function including the index value as an element is optimal.
  • the guidance trajectory can be optimized so that the navigation error is reduced.
  • FIG. 4 is a diagram illustrating an example of a relationship between a covariance matrix and a processing cycle. FIG. 5 is a conceptual diagram schematically showing the process of solving a convex optimization problem using the gradient information of an evaluation function.
  • the guidance control program according to the present embodiment is executed by, for example, a computer mounted on a moving object.
  • the computer that executes the guidance control program performs various processes.
  • FIG. 1 is a diagram illustrating an example of the moving object M according to the embodiment.
  • the moving object M may be, for example, a spacecraft (space probe) that lands on a celestial body having gravity (hereinafter simply referred to as a celestial body PL), such as a moon or a planet, and explores the celestial body PL.
  • a navigation device (computer) 100 that executes a guidance control program is mounted on a moving object M that is a spacecraft.
  • the moving object M which is a spacecraft, is equipped with a camera 10, an inertial measurement unit (IMU) 20, and a propulsion output device 30 in addition to the navigation device 100.
  • the camera 10 is provided, for example, below the moving body M and captures an image of the surface of the celestial body PL when the moving body M lands on the celestial body PL.
  • the “lower side” is, for example, the side of the housing provided with legs that contact the surface of the celestial body PL.
  • the camera 10 is oriented in the direction in which the gravity of the celestial body PL acts on the moving body M (vertically downward).
  • the camera 10 outputs the captured image to the navigation device 100.
  • the camera 10 is an example of a “first sensor”.
  • the camera 10 is an example of a “terrain detection sensor”.
  • the inertia measurement device 20 includes, for example, a three-axis acceleration sensor formed of MEMS (Micro Electro Mechanical Systems) or an optical fiber, and a three-axis gyro sensor.
  • the inertial measurement device 20 outputs detection values detected by these sensors to the navigation device 100.
  • the values detected by the inertial measurement device 20 include, for example, accelerations in the horizontal, vertical, and depth directions, and angular velocities (rates) about the pitch, roll, and yaw axes.
  • the inertial measurement device 20 is an example of a “second sensor”.
  • the propulsion output device 30 includes, for example, a gimbal actuator and an ion engine.
  • the propulsion output device 30 changes the output direction of the thrust generated by the ion engine to an arbitrary direction by driving the gimbal actuator.
  • By executing the guidance control program, the navigation device 100 causes the moving object M, which is a spacecraft, to navigate using, for example, the image captured by the camera 10 and the detection result of the inertial measurement device 20.
  • Hereinafter, navigation using the image captured by the camera 10 will be referred to as “image navigation”, and navigation using the detection result of the inertial measurement device 20 will be referred to as “inertial navigation”.
  • the mobile object M is not limited to a spacecraft; it may be another moving object such as a UAV (Unmanned Aerial Vehicle) that flies in an environment where GNSS (Global Navigation Satellite System) cannot be used, or an AUV (Autonomous Underwater Vehicle) that propels itself underwater to search for resources on the sea floor.
  • the moving object M is a spacecraft.
  • FIG. 2 is a diagram illustrating an example of the configuration of the navigation device 100 according to the embodiment.
  • the navigation device 100 includes, for example, a communication unit 102, a control unit 110, and a storage unit 130.
  • the communication unit 102 wirelessly communicates with a monitoring device on the earth using radio waves in a frequency band used for a telemeter line, for example.
  • the monitoring device on the earth, for example, remotely controls (remotely guides) the navigation device 100 via the communication unit 102 so as to control the position of the moving object M relative to the straight line connecting the celestial body PL to be landed on and the earth (the straight-line direction of the radio waves). That is, the monitoring device on the earth controls the position of the moving body M in the horizontal direction perpendicular to the vertical direction based on the gravity of the celestial body PL.
  • the control unit 110 includes, for example, an acquisition unit 112, an image navigation calculation processing unit 114, an inertial navigation calculation processing unit 116, a parameter determination unit 118, a prediction unit 120, and an airframe control unit 122.
  • the components of the control unit 110 are realized, for example, by a processor such as a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit) executing a guidance control program stored in the storage unit 130. Some or all of the components of the control unit 110 may be realized by hardware such as LSI (Large Scale Integration), ASIC (Application Specific Integrated Circuit), or FPGA (Field-Programmable Gate Array), or software. And hardware may cooperate.
  • the guidance control program referred to by the processor may be stored in the storage unit 130 in advance, or may be stored in a removable storage medium such as a DVD or a CD-ROM and installed in the storage unit 130 when the storage medium is mounted on a drive device.
  • the storage unit 130 is realized by a storage device such as a hard disk drive (HDD), a flash memory, an electrically erasable programmable read only memory (EEPROM), a read only memory (ROM), and a random access memory (RAM).
  • the storage unit 130 stores terrain data 132, reference trajectory information 134, filter parameter information 136, and the like, in addition to various programs such as firmware and application programs (including a guidance control program).
  • the terrain data 132 is, for example, data in which a three-dimensional shape of the ground surface of a celestial body PL to be landed on the moving object M is modeled.
  • the model showing the three-dimensional shape of the ground surface of the celestial body PL includes visually prominent features (feature points) such as craters and rocks that form irregularities on the ground surface of the celestial body PL.
  • the reference trajectory information 134 is information defining a reference trajectory (nominal trajectory).
  • the reference trajectory is information specifying, at certain time intervals or distance intervals, state quantities such as the position, speed, acceleration, and attitude to be taken by the moving object M when the moving object M is remotely guided to the celestial body PL from the monitoring device on the ground.
  • the state quantity indicated by the reference trajectory information 134 is an example of “reference state quantity”.
  • the filter parameter information 136 is information in which each parameter of a Kalman filter equation described later is defined.
  • the acquisition unit 112 acquires an image from the camera 10 and acquires detection values such as acceleration and speed from the inertial measurement device 20.
  • the acquisition unit 112 periodically acquires the reference trajectory information 134 from the monitoring device on the ground via the communication unit 102, and stores the reference trajectory information 134 in the storage unit 130, so that the reference trajectory information 134 stored in the storage unit 130 is obtained. To update.
  • the image navigation calculation processing unit 114 collates (compares) the image acquired from the camera 10 by the acquisition unit 112 with the terrain data 132 stored in the storage unit 130 to derive the state quantity of the moving object M. For example, the image navigation calculation processing unit 114 performs image processing on the image of the camera 10 to extract feature points such as rocks and craters, and derives, as a state quantity, the position of the moving body M in the horizontal direction perpendicular to the vertical direction based on the gravity of the celestial body PL by pattern-matching the extracted feature points with the feature points included in the three-dimensional model indicated by the terrain data 132.
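The pattern-matching step can be illustrated with a deliberately simplified Python sketch (not the patent's actual implementation): assuming the correspondence between observed feature points and map feature points is already known, the horizontal offset is the least-squares translation that aligns the two point sets. All coordinates below are hypothetical.

```python
import numpy as np

def estimate_horizontal_offset(observed_xy, map_xy):
    """Estimate the craft's horizontal displacement by aligning observed
    feature points (e.g. craters, rocks) with terrain-map features.
    Correspondences are assumed known; the least-squares translation
    between matched point sets is simply the mean displacement."""
    return np.mean(map_xy - observed_xy, axis=0)

# Terrain-map feature positions (hypothetical), and the same features
# as seen from a craft displaced by (3.0, -1.5) in the horizontal plane.
map_xy = np.array([[10.0, 5.0], [12.0, 7.5], [8.0, 9.0]])
observed_xy = map_xy - np.array([3.0, -1.5])

offset = estimate_horizontal_offset(observed_xy, map_xy)  # -> [3.0, -1.5]
```

In the actual device, feature extraction and robust matching (with unknown correspondences and outliers) would replace the assumed correspondence.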
  • the inertial navigation calculation processing unit 116 derives a state quantity that quantitatively indicates the state of the moving object M by inertial navigation using the detection result of the inertial measurement device 20.
  • the “state quantity” includes, for example, a position, a speed, a quaternion from the body coordinate system to the inertial coordinate system, and the like.
  • the body coordinate system is a coordinate system whose origin is the center of gravity of the moving body M and one of whose axes is the body axis direction of the moving body M.
  • the inertial coordinate system is a coordinate system in which the vertical direction based on the gravity of the celestial body PL is a certain axis.
  • the “state quantity” may include a sensor bias value for the acceleration or the angular velocity detected by the inertial measurement device 20.
  • the distance between the earth and the celestial body PL may be on the order of 100 million kilometers, so the round-trip propagation time of radio waves may exceed tens of minutes. For this reason, when lowering the height of the moving body M with respect to the ground surface of the celestial body PL and landing the moving body M on the celestial body PL, remote control based on commands from the ground may not be sufficient. Therefore, the inertial navigation calculation processing unit 116 derives the state quantity of the moving object M by inertial navigation using the detection value of the inertial measurement device 20 to perform autonomous navigation.
  • the parameter determination unit 118 refers to the reference trajectory information 134 and determines the values of parameters included in the various equations of the Kalman filter.
  • the parameters include, for example, a state transition matrix of a Kalman filter described later, a covariance matrix of process noise of the Kalman filter, a covariance matrix of observation noise of the Kalman filter, and the like.
  • the parameter determination unit 118 stores the parameter value in the storage unit 130 as the filter parameter information 136.
  • When the state quantity of the moving object M is derived by the inertial navigation calculation processing unit 116, the prediction unit 120 performs first processing (prediction processing) and second processing (update processing) using the Kalman filter whose parameters are defined by the filter parameter information 136, so that the error (navigation error) of the state quantity of the moving object M is reduced.
  • the first processing includes a process of predicting, in the current cycle k, the state quantity of the moving object M in the next processing cycle k+1, and a process of estimating, in the current cycle k, the error of the state quantity of the moving object M in the next processing cycle k+1.
  • Specifically, the first processing includes a process of estimating the state quantity (estimated value) of the moving object M in the next cycle k+1 based on the state quantity (a state quantity vector described later) of the moving object M estimated in the current cycle k and the control input (a control vector described later) in the current cycle k, and a process of deriving, in the current cycle k, the covariance (prior estimated covariance) indicating the degree of the error included in the estimated value of the next cycle k+1, based on the covariance indicating the degree of the error included in the estimated value (that is, the accuracy of the estimated value) of the current cycle k and the covariance indicating the degree of the process noise of the Kalman filter in the current cycle k.
  • the second processing includes a process of deriving a Kalman gain based on the covariance of the state quantity of the moving object M estimated as the prior estimated covariance in the first processing, a process of updating the state quantity of the moving object M estimated in the first processing based on the derived Kalman gain and a certain observation value, and a process of updating the covariance of the state quantity of the moving object M based on the derived Kalman gain and the prior estimated covariance derived in the first processing.
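The first processing (prediction) and second processing (update) described above follow the standard Kalman-filter recursion. A minimal Python sketch, with hypothetical model matrices for a two-state (position, velocity) example:

```python
import numpy as np

def kf_predict(x, P, F, Q):
    # First processing: predict the next state quantity and the
    # prior estimated covariance.
    x_pred = F @ x
    P_pred = F @ P @ F.T + Q
    return x_pred, P_pred

def kf_update(x_pred, P_pred, z, H, R):
    # Second processing: derive the Kalman gain from the prior
    # covariance, then update the state quantity and the covariance.
    S = H @ P_pred @ H.T + R
    K = P_pred @ H.T @ np.linalg.inv(S)
    x_new = x_pred + K @ (z - H @ x_pred)
    P_new = (np.eye(len(x_pred)) - K @ H) @ P_pred
    return x_new, P_new, K

# Hypothetical two-state model: position and velocity, position observed.
F = np.array([[1.0, 0.1], [0.0, 1.0]])   # state transition
Q = 0.01 * np.eye(2)                      # process noise covariance
H = np.array([[1.0, 0.0]])                # observation matrix
R = np.array([[0.5]])                     # observation noise covariance

x, P = np.array([0.0, 1.0]), np.eye(2)
x_pred, P_pred = kf_predict(x, P, F, Q)
# Observation, e.g. the inertial-vs-image position difference:
z = np.array([0.2])
x_new, P_new, K = kf_update(x_pred, P_pred, z, H, R)
```

The update shrinks the predicted uncertainty of the observed component, which is the quantity the guidance optimization later seeks to minimize.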
  • the observation value may be, for example, a difference between the state quantity of the moving body M derived by the inertial navigation calculation processing unit 116 and the state quantity of the moving body M derived by the image navigation calculation processing unit 114.
  • the observation value may be a difference between the position of the moving object M obtained by the inertial navigation and the position of the moving object M obtained by the image navigation.
  • By using the position difference as the observation value, not only the position but also the velocity, acceleration, and the like, which are derivative values of the position, can be estimated as state quantities.
  • the prediction unit 120 derives an optimal solution of the future state quantity of the moving object M by solving the optimization problem (optimal control problem) exemplified as Expressions (1) to (6).
  • a letter with ( ⁇ ) represents a vector or a matrix.
  • x ( ⁇ ) represents a state quantity vector indicating the state quantity of the moving body M
  • u ( ⁇ ) represents a control vector of the moving body M
  • f represents a nonlinear dynamics function
  • g represents a state constraint equation.
  • H represents a control constraint equation
  • F ( ⁇ ) represents a state transition matrix of a Kalman filter
  • H ( ⁇ ) represents an observation matrix of a Kalman filter
  • K ( ⁇ ) represents the Kalman gain derived in the second processing described above
  • P ( ⁇ ) represents the covariance matrix of the Kalman filter
  • Q ( ⁇ ) represents the covariance matrix of the process noise of the Kalman filter
  • R ( ⁇ ) Represents the covariance matrix of the observation noise of the Kalman filter
  • k represents each processing cycle (step) of the Kalman filter.
  • Formula (1) represents a formula that formulates a minimization problem of a certain evaluation function J, and formulas (2) to (6) represent formulas to be satisfied in solving formula (1).
  • The evaluation function J shown in Equation (1) represents the trace value of the composite mapping of a certain matrix G ( ⁇ ) and the covariance matrix P ( ⁇ ) kf at a certain processing cycle k f .
  • The processing cycle k f is the final processing time (final processing cycle) of the Kalman filter, calculated based on the trajectory instructed from the monitoring device on the ground.
  • For example, the processing cycle k f is the final processing cycle at which the moving object M lands on the celestial body PL.
  • The matrix G ( ⁇ ) has zero off-diagonal elements and is used to calculate a trace value in which specific diagonal elements of the covariance matrix P ( ⁇ ) kf at the final processing time k f are weighted (a weighted sum of diagonal components). Therefore, the evaluation function J is a trace value in which specific diagonal components of the covariance matrix P ( ⁇ ) kf are weighted.
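The evaluation function J, a trace value in which specific diagonal components of the final covariance matrix are weighted, can be sketched numerically (weights and covariance values below are hypothetical):

```python
import numpy as np

def evaluation_J(P_final, weights):
    # J = tr(G @ P_final) with diagonal G: a weighted sum of the
    # diagonal (variance) components of the final covariance matrix.
    G = np.diag(weights)
    return np.trace(G @ P_final)

# Hypothetical final covariance (variances of three state components).
P_final = np.diag([4.0, 1.0, 0.25])
# Weight only the first and third components (e.g. position terms):
J = evaluation_J(P_final, [1.0, 0.0, 2.0])  # -> 4.0*1.0 + 0.25*2.0 = 4.5
```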
  • Equation (2) is a state equation represented by a nonlinear dynamics function having a state quantity vector x ( ⁇ ) and a control vector u ( ⁇ ) as variables, and equation (3) is a constraint equation of the equation.
  • Expression (4) represents an inequality constraint condition expression.
  • Equation (5) represents the process of deriving the prior estimated covariance among the two processes included in the above-described first processing. That is, Equation (5) estimates the covariance matrix of the current cycle k, one step ahead, from the covariance matrix of the previous cycle k-1: P ( ⁇ ) k|k-1 = F ( ⁇ ) k P ( ⁇ ) k-1|k-1 F ( ⁇ ) k T + Q ( ⁇ ) k .
  • Equation (6) represents the process of updating the covariance of the state quantity of the moving object M among the two processes included in the above-described second processing. That is, Equation (6) updates the covariance of the current cycle k from the prior estimated covariance P ( ⁇ ) k|k-1 appearing on its right-hand side: P ( ⁇ ) k|k = (I - K ( ⁇ ) k H ( ⁇ ) k ) P ( ⁇ ) k|k-1 , where the above-mentioned Kalman gain is obtained as K ( ⁇ ) k = P ( ⁇ ) k|k-1 H ( ⁇ ) k T (H ( ⁇ ) k P ( ⁇ ) k|k-1 H ( ⁇ ) k T + R ( ⁇ ) k ) -1 .
  • The process of updating the covariance is performed during a predetermined processing period beginning at cycle K ( ⁇ ) (the period [K−, K+] described later).
  • the prediction unit 120 solves the optimization problem by finding the time history of the control vector u ( ⁇ ) that minimizes the evaluation function J shown in Expression (1).
  • the prediction unit 120 solves the optimization problem by searching, from the set of k f control vectors u ( ⁇ ) obtained by repeating the processing of the Kalman filter from k 1 to k f , for the control vector u ( ⁇ ) that minimizes the evaluation function J.
  • the prediction unit 120 applies the approximation formula shown in Expression (7) to the first processing and the second processing, and performs discretization and linearization of the covariance matrix P ( ⁇ ) k , which is one element of the evaluation function J.
  • P ( ⁇ ) in Expression (7) represents a covariance matrix derived by a Kalman filter based on the amount of movement to be taken by the moving object M determined as the reference trajectory.
  • the covariance matrix P (-) representing the navigation accuracy when the moving object M moves on the reference trajectory is derived by virtually operating the Kalman filter that would be used when the moving object M actually moves on the reference trajectory. “Virtually operate” means that the calculation result of the Kalman filter is not used for control of the mobile unit M. (-) indicates an overline (bar).
  • Expression (7) represents the assumption that, in the vicinity of the reference trajectory (x ( ⁇ ), u ( ⁇ )), differentiating the covariance matrix P ( ⁇ ) k of each processing cycle gives the same result as, or a result very close to, differentiating its trace value tr (P ( ⁇ ) k ); it holds only when the conditional expression shown in Expression (8) is satisfied. In other words, the approximation formula shown in Expression (7) relates the covariance matrix P ( ⁇ ) k to the trace value tr (P ( ⁇ ) k (-)) of the reference covariance matrix P ( ⁇ ) k (-).
  • the covariance matrix P ( ⁇ ) k shown in Expression (7) is an example of a “first covariance matrix”, and the covariance matrix P ( ⁇ ) k (-) shown in Expression (7) is an example of a “second covariance matrix”.
  • Equation (9) represents the linear form of the covariance matrix P ( ⁇ ) k .
  • the prediction unit 120 derives the time history of the control vector u ( ⁇ ) that minimizes the evaluation function J according to Expression (9) and Expressions (2) to (4) that have been linearized discretely.
  • [K−, K+] in Expression (9) indicates the processing period of Expression (6), that is, the period of the second processing (the process of updating the covariance matrix); each tr (P ( ⁇ ) ) is expressed in a linear form that holds in the vicinity of the reference trajectory (x ( ⁇ ), u ( ⁇ )) used as the basis for the linearization.
  • When the prediction unit 120 derives the control vector u ( ⁇ ) that minimizes the evaluation function J according to Expression (9), it substitutes the derived control vector u ( ⁇ ) into Expression (2) to derive the state quantity vector x ( ⁇ ), and repeats the discretization and linearization using Expression (7) again based on the derived state quantity vector x ( ⁇ ) and control vector u ( ⁇ ).
  • In this way, the optimization problem of searching for the time history of the control vector u ( ⁇ ) that minimizes the evaluation function J is discretized and linearized around the reference trajectory (x ( ⁇ ), u ( ⁇ )).
  • the linearized optimization problem can be solved as a convex optimization problem.
  • the convex optimization problem is a minimization problem of a convex function on a convex set; it can be solved more easily than a general optimization problem and has the property that a local minimum coincides with the global minimum.
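The property that a local minimum of a convex problem coincides with the global minimum can be illustrated with plain gradient descent on a convex function: it reaches the same minimizer from any starting point. The function and step size below are hypothetical:

```python
def minimize_convex(grad, x0, lr=0.1, iters=500):
    # Plain gradient descent; for a convex function, any stationary
    # point it converges to is the global minimum.
    x = x0
    for _ in range(iters):
        x = x - lr * grad(x)
    return x

# f(x) = (x - 2)^2 + 1 is convex with its global minimum at x = 2.
grad_f = lambda x: 2.0 * (x - 2.0)
x_star_a = minimize_convex(grad_f, x0=-10.0)  # -> approx. 2.0
x_star_b = minimize_convex(grad_f, x0=+10.0)  # -> approx. 2.0
```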
  • the body control unit 122 controls the propulsion output device 30 based on the optimal solution (x ( ⁇ ), u ( ⁇ )) derived by the prediction unit 120 using the Kalman filter, so that the position, speed, acceleration, attitude, and the like of the moving body M become the desired states indicated by the optimal solution (x ( ⁇ ), u ( ⁇ )).
  • FIG. 3 is a flowchart illustrating an example of a series of processes performed by the control unit 110 according to the embodiment.
  • the processing of this flowchart is performed, for example, in a descent phase in which the distance (altitude) between the ground surface of the celestial body PL and the moving body M is equal to or less than a predetermined distance (for example, about 40 [m]).
  • the parameter determination unit 118 refers to the reference trajectory information 134 and obtains (x ( ⁇ ), u ( ⁇ )) of the reference trajectory as initial values (step S100). Next, the parameter determination unit 118 determines the parameters of the Kalman filter (F ( ⁇ ), Q ( ⁇ ), R ( ⁇ )) based on (x ( ⁇ ), u ( ⁇ )) of the reference trajectory, which are the initial values (step S102).
  • the prediction unit 120 repeatedly derives the covariance matrix P ( ⁇ ) k using the Kalman filter including the parameters determined by the parameter determination unit 118, until the processing cycle k of the Kalman filter reaches the predetermined processing cycle K ( ⁇ ) at which the update processing of the covariance matrix starts (step S104).
  • FIG. 4 is a diagram illustrating an example of the relationship between the covariance matrix P ( ⁇ ) k and the processing cycle k. As shown, the covariance matrix P ( ⁇ ) k converges to a constant value as the process is repeated, given the initial value P ( ⁇ ) 0 . At this time, the covariance matrix P ( ⁇ ) k serving as a convergence value is uniquely determined by the parameters of the Kalman filter.
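The convergence behavior shown in FIG. 4 can be reproduced numerically: iterating the covariance prediction and update (Expressions (5) and (6)) from a large initial value P ( ⁇ ) 0 drives the trace toward a constant determined only by the filter parameters. All matrices below are hypothetical:

```python
import numpy as np

F = np.array([[1.0, 0.1], [0.0, 1.0]])   # state transition matrix
Q = 0.01 * np.eye(2)                      # process noise covariance
H = np.array([[1.0, 0.0]])                # observation matrix
R = np.array([[0.5]])                     # observation noise covariance

P = 10.0 * np.eye(2)                      # large initial uncertainty P_0
traces = []
for k in range(500):
    P = F @ P @ F.T + Q                              # prediction step
    K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)     # Kalman gain
    P = (np.eye(2) - K @ H) @ P                      # update step
    traces.append(np.trace(P))
# traces[-1] has converged to a constant value independent of P_0.
```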
  • the prediction unit 120 acquires (derives) gradient information around (x ( ⁇ ), u ( ⁇ )) given as the reference trajectory (step S106).
  • the gradient information is information on the gradient of the evaluation function J.
  • the gradient of the evaluation function J is, for example, the partial derivative, with respect to x ( ⁇ ) or u ( ⁇ ), of the trace value of the covariance matrix P ( ⁇ ) k , which is one element of the evaluation function J and is represented on the left-hand side of Expression (7).
  • the prediction unit 120 partially differentiates the covariance matrix P ( ⁇ ) k with x ( ⁇ ) or u ( ⁇ ) in order to obtain gradient information.
  • the prediction unit 120 can discretize and linearize (i.e., convexify) the trace value tr (P ( ⁇ ) k ) of the covariance matrix P ( ⁇ ) k using the partial differentiation result.
  • Discretization and linearization mean that, when the evaluation function J is written in a linear form of x ( ⁇ ) or u ( ⁇ ) at each approximately discretized time k, the partial differential values with respect to x ( ⁇ ) or u ( ⁇ ) are used as the coefficients of that linear form.
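The idea of using partial differential values as coefficients of the linear form can be sketched with a central finite difference: the sensitivity of the final covariance trace to a scalar parameter is the coefficient that would multiply that parameter in the linearized evaluation function. Here the parameter is a hypothetical observation-noise variance, standing in for components of x ( ⁇ ) or u ( ⁇ ); all matrices are hypothetical:

```python
import numpy as np

def final_trace(r, steps=50):
    """tr(P) after `steps` Kalman cycles, as a function of the
    (scalar) observation-noise variance r."""
    F = np.array([[1.0, 0.1], [0.0, 1.0]])
    Q = 0.01 * np.eye(2)
    H = np.array([[1.0, 0.0]])
    P = np.eye(2)
    for _ in range(steps):
        P = F @ P @ F.T + Q                       # prediction
        K = P @ H.T / (H @ P @ H.T + r)           # gain (scalar innovation)
        P = (np.eye(2) - K @ H) @ P               # update
    return float(np.trace(P))

def central_difference(f, x, eps=1e-6):
    # Numerical partial derivative: the gradient coefficient used
    # when the evaluation function is written in linear form.
    return (f(x + eps) - f(x - eps)) / (2.0 * eps)

g = central_difference(final_trace, 0.5)  # sensitivity of tr(P) to r
# g > 0: noisier observations leave more final uncertainty.
```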
  • the prediction unit 120 derives the optimal solution (x ( ⁇ ), u ( ⁇ )) at which the trace value tr (P ( ⁇ ) k ) takes the minimum value by solving a convex optimization problem, which is one type of optimization problem (step S108).
  • FIG. 5 is a conceptual diagram schematically showing a process for solving a convex optimization problem using gradient information of the evaluation function J.
  • the illustrated example shows that the optimization problem is transformed into a convex optimization problem using the gradient information of the evaluation function J.
  • the prediction unit 120 convexifies the trace value tr (P ( ⁇ ) k ) of the covariance matrix P ( ⁇ ) k .
  • As a result, the optimal solution that minimizes the trace value tr (P ( ⁇ ) k ) is not derived as a mere local solution of the original problem; by convex optimization, a solution can be obtained quickly.
  • the prediction unit 120 determines whether or not the norm (Euclidean norm) of the difference between (x ( ⁇ ), u ( ⁇ )) given as the reference trajectory and the derived optimal solution (x ( ⁇ ), u ( ⁇ )) is equal to or smaller than a threshold value ε (step S110).
  • When determining that the norm is equal to or smaller than the threshold value ε, the airframe control unit 122 controls the propulsion output device 30 based on the optimal solution (x ( ⁇ ), u ( ⁇ )) derived by the prediction unit 120 using the Kalman filter (step S112).
  • When determining that the norm exceeds the threshold value ε, the prediction unit 120 replaces the reference trajectory (x ( ⁇ ), u ( ⁇ )) given as the initial value with the derived optimal trajectory (x ( ⁇ ), u ( ⁇ )) (step S114), and the process returns to S102. In the process of S114, the prediction unit 120 may change the upper and lower limits of the state quantity x ( ⁇ ) that can be taken as the optimal solution so that the state quantity x ( ⁇ ) obtained as the optimal solution does not deviate greatly from the state quantity indicated by the reference trajectory.
  • Then, the parameter determination unit 118 determines again the values of the Kalman filter parameters (F ( ⁇ ), Q ( ⁇ ), R ( ⁇ )) based on the updated trajectory (x ( ⁇ ), u ( ⁇ )).
  • As described above, the navigation device 100, which is a computer mounted on a moving body M including a plurality of sensors such as the camera 10 and the inertial measurement device 20, predicts the future state quantity of the moving body M using a Kalman filter based on the detection values of the camera 10 and the inertial measurement device 20.
  • Based on the Kalman filter parameters (F ( ⁇ ), Q ( ⁇ ), R ( ⁇ )), the navigation device 100 derives a covariance matrix P ( ⁇ ) k , which is an index value indicating the degree of error of the state quantity, and derives, by solving an optimization problem (for example, a convex optimization problem), the state quantity x ( ⁇ ) of the moving object at which the evaluation function J including the trace value tr (P ( ⁇ ) k ) as an element is optimal. This makes it possible to optimize the guidance trajectory for landing the moving body M on the celestial body PL so that the navigation error is reduced.
  • the moving body M is a spacecraft, and the sensors mounted on the moving body M are the camera 10 and the inertial measurement device 20.
  • the present invention is not limited to this.
  • the sensors mounted on the moving object M may include a radar or a LIDAR (Laser/Light Imaging Detection and Ranging) instead of, or in addition to, the camera 10.
  • the sensor mounted on the moving body M may include a sonar instead of, or in addition to, the camera 10. Radar, LIDAR, and sonar are other examples of the “terrain detection sensor”.
  • In the above description, the navigation device 100 derives the state quantity x ( ⁇ ) of the moving object at which the evaluation function J is optimal by solving a convex optimization problem, which is one type of optimization problem, but the invention is not limited to this.
  • For example, the prediction unit 120 of the navigation apparatus 100 may derive the state quantity x ( ⁇ ) of the moving object at which the evaluation function J is optimal by Newton's method, based on the partial derivative with respect to x ( ⁇ ) or u ( ⁇ ) of the trace value of the covariance matrix P ( ⁇ ) k , which is the gradient information of the evaluation function J.
  • Alternatively, the prediction unit 120 may derive the state quantity x ( ⁇ ) of the moving object at which the evaluation function J is optimal by multiplying the covariance matrix P ( ⁇ ) k (-) by the ratio of the trace value tr (P ( ⁇ ) k ) to the trace value tr (P ( ⁇ ) k (-)), under the assumption that the result approximates the covariance matrix P ( ⁇ ) k .
  • DESCRIPTION OF SYMBOLS 10 ... Camera, 20 ... Inertial measurement device, 100 ... Navigation device, 102 ... Communication part, 110 ... Control part, 112 ... Acquisition part, 114 ... Image navigation calculation processing part, 116 ... Inertial navigation calculation processing part, 118 ... Parameter determination Unit, 120: prediction unit, 122: airframe control unit, 130: storage unit, M: mobile unit

Abstract

A non-transitory storage medium stores a guidance control program. The guidance control program causes a computer mounted on a mobile body provided with a plurality of sensors, when predicting a future state quantity of the mobile body with a Kalman filter on the basis of detected values from each of a first sensor and a second sensor included in the plurality of sensors, to derive, on the basis of the parameters of the Kalman filter, an index value indicating a degree of error in the state quantity of the mobile body, and to solve an optimization problem in order to derive the state quantity of the mobile body that optimizes an evaluation function including the index value as an element.

Description

Storage medium storing guidance control program

The present invention relates to a storage medium storing a guidance control program.
Priority is claimed on Japanese Patent Application No. 2018-168193, filed on September 7, 2018, the content of which is incorporated herein by reference.

Conventionally, spacecraft are known that perform autonomous precision landing (APL) on gravitational celestial bodies such as the moon and planets by image navigation. In the powered descent landing (PDL) phase of landing a spacecraft on a gravitational body, image navigation is performed by collating images of the body's ground surface with terrain data obtained in advance, and a guidance trajectory must be generated in real time based on the current position. Separately, a technique is known in which a flying vehicle equipped with an inertial sensor and a satellite-based positioning sensor uses the detection values of each sensor as observation values of a Kalman filter, thereby performing integrated navigation that combines the detection results of the sensors (see Patent Document 1).

Patent Document 1: Japanese Unexamined Patent Application Publication No. 2018-109530

However, with the conventional technology, the guidance trajectory could not be sufficiently optimized so as to reduce the navigation error.

One aspect of the present invention provides a storage medium storing a guidance control program capable of optimizing a guidance trajectory so as to reduce navigation error.

One aspect of the present invention is a non-transitory computer-readable storage medium storing a guidance control program that causes a computer mounted on a moving object including a plurality of sensors to: derive, when predicting a future state quantity of the moving object using a Kalman filter based on the respective detection values of a first sensor and a second sensor included in the plurality of sensors, an index value indicating the degree of error of the state quantity of the moving object, based on the parameters of the Kalman filter; and derive the state quantity of the moving object that optimizes an evaluation function including the index value as an element, by solving an optimization problem.

According to one aspect of the present invention, the guidance trajectory can be optimized so that the navigation error is reduced.

FIG. 1 is a diagram showing an example of a moving object according to an embodiment.
FIG. 2 is a diagram showing an example of the configuration of a navigation device according to the embodiment.
FIG. 3 is a flowchart showing an example of a series of processes performed by a control unit according to the embodiment.
FIG. 4 is a diagram showing an example of the relationship between a covariance matrix and a processing cycle.
FIG. 5 is a conceptual diagram schematically showing a process of solving a convex optimization problem using gradient information of an evaluation function.

Hereinafter, an embodiment of the guidance control program of the present invention will be described with reference to the drawings. The guidance control program according to the present embodiment is executed by, for example, a computer mounted on a moving object. The computer that executes the guidance control program performs various processes.

FIG. 1 is a diagram showing an example of the moving object M according to the embodiment. The moving object M may be, for example, a spacecraft (space probe) that lands on a gravitational celestial body such as the moon or a planet (hereinafter simply referred to as a celestial body PL) and explores the celestial body PL. The moving object M, which is a spacecraft, is equipped with a navigation device (computer) 100 that executes the guidance control program. In addition to the navigation device 100, the moving object M is equipped with a camera 10, an inertial measurement unit (IMU) 20, and a propulsion output device 30.

The camera 10 is provided, for example, on the lower side of the moving object M, and captures images of the ground surface of the celestial body PL when the moving object M lands on the celestial body PL. The "lower side" is, for example, the side of the housing on which the legs that contact the ground surface of the celestial body PL are provided. In other words, the camera 10 is provided on the side toward which the gravity of the celestial body PL acts on the moving object M (vertically downward). The camera 10 outputs the captured images to the navigation device 100. The camera 10 is an example of a "first sensor." The camera 10 is also an example of a "terrain detection sensor."

The inertial measurement device 20 includes, for example, a three-axis acceleration sensor implemented with MEMS (Micro Electro Mechanical Systems) or optical fiber technology, and a three-axis gyro sensor. The inertial measurement device 20 outputs the values detected by these sensors to the navigation device 100. The values detected by the inertial measurement device 20 include, for example, the accelerations and/or angular velocities in the horizontal, vertical, and depth directions, and the rates about the pitch, roll, and yaw axes. The inertial measurement device 20 is an example of a "second sensor."

The propulsion output device 30 includes, for example, a gimbal actuator and an ion engine. For example, the propulsion output device 30 changes the output direction of the thrust generated by the ion engine to an arbitrary direction by driving the gimbal actuator.

By executing the guidance control program, the navigation device 100 causes the moving object M, which is a spacecraft, to navigate using, for example, the images captured by the camera 10 and the detection results of the inertial measurement device 20. Hereinafter, navigation using the images captured by the camera 10 is referred to as "image navigation," and navigation using the detection results of the inertial measurement device 20 is referred to as "inertial navigation." The moving object M is not limited to a spacecraft, and may be another moving object such as a UAV (Unmanned Aerial Vehicle) that flies in an environment where GNSS (Global Navigation Satellite System) cannot be used, or an AUV (Autonomous Underwater Vehicle) that propels itself underwater to perform seabed resource exploration or the like. Hereinafter, as an example, the moving object M is described as a spacecraft.

FIG. 2 is a diagram showing an example of the configuration of the navigation device 100 according to the embodiment. The navigation device 100 includes, for example, a communication unit 102, a control unit 110, and a storage unit 130.

The communication unit 102 wirelessly communicates with a monitoring device on the earth using radio waves in a frequency band such as that used for telemetry links. The monitoring device on the earth remotely controls (remotely guides) the navigation device 100 via the communication unit 102, for example, thereby controlling the position of the moving object M in directions perpendicular to the straight line connecting the earth and the celestial body PL on which the moving object is to land (the propagation direction of the radio waves). That is, the monitoring device on the earth controls the position of the moving object M in the horizontal direction, perpendicular to the vertical direction defined by the gravity of the celestial body PL.

The control unit 110 includes, for example, an acquisition unit 112, an image navigation calculation processing unit 114, an inertial navigation calculation processing unit 116, a parameter determination unit 118, a prediction unit 120, and an airframe control unit 122.

The components of the control unit 110 are realized, for example, by a processor such as a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit) executing the guidance control program stored in the storage unit 130. Some or all of the components of the control unit 110 may be realized by hardware such as an LSI (Large Scale Integration) circuit, an ASIC (Application Specific Integrated Circuit), or an FPGA (Field-Programmable Gate Array), or may be realized by cooperation of software and hardware. The guidance control program referred to by the processor may be stored in the storage unit 130 in advance, or may be stored in a removable storage medium such as a DVD or CD-ROM and installed from the storage medium into the storage unit 130 by mounting the storage medium in a drive device of the navigation device 100.

The storage unit 130 is realized by a storage device such as an HDD (Hard Disc Drive), a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), a ROM (Read Only Memory), or a RAM (Random Access Memory). In addition to various programs such as firmware and application programs (including the guidance control program), the storage unit 130 stores terrain data 132, reference trajectory information 134, filter parameter information 136, and the like.

The terrain data 132 is, for example, data modeling the three-dimensional shape of the ground surface of the celestial body PL on which the moving object M is to be landed. The model representing the three-dimensional shape of the ground surface of the celestial body PL includes visually prominent features (feature points), such as craters and rocks, that form irregularities on the ground surface of the celestial body PL.

The reference trajectory information 134 is information defining a reference trajectory (nominal trajectory). The reference trajectory is information specifying, at certain time intervals or distance intervals, the state quantities, such as position, velocity, acceleration, and attitude, that the moving object M should take when the moving object M is remotely guided from the monitoring device on the ground to the celestial body PL. The state quantities indicated by the reference trajectory information 134 are an example of "reference state quantities."

The filter parameter information 136 is information in which each parameter of the Kalman filter equations described later is defined.

The acquisition unit 112 acquires images from the camera 10 and acquires detection values such as acceleration and velocity from the inertial measurement device 20. The acquisition unit 112 periodically acquires the reference trajectory information 134 from the monitoring device on the ground via the communication unit 102 and stores it in the storage unit 130, thereby updating the reference trajectory information 134 stored in the storage unit 130.

The image navigation calculation processing unit 114 collates (compares) the image acquired from the camera 10 by the acquisition unit 112 with the terrain data 132 stored in the storage unit 130 to derive a state quantity of the moving object M. For example, the image navigation calculation processing unit 114 performs image processing on the image from the camera 10 to extract feature points such as rocks and craters, and derives, as a state quantity, the position of the moving object M in the horizontal direction, perpendicular to the vertical direction defined by the gravity of the celestial body PL, by pattern matching between the extracted feature points and the feature points included in the three-dimensional model indicated by the terrain data 132.
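
The pattern-matching stage itself is not specified in detail here; the following is a minimal sketch of only the final position-estimation step, assuming correspondences between camera-image feature points and terrain-model feature points have already been established. The function name and the least-squares formulation are illustrative, not from the patent.

```python
import numpy as np

def estimate_horizontal_position(detected, map_features):
    """Estimate the vehicle's horizontal offset from matched features.

    detected     : (N, 2) horizontal feature coordinates seen by the camera
    map_features : (N, 2) the same features in the terrain model, listed in
                   matching order (correspondences assumed already solved)
    """
    detected = np.asarray(detected, dtype=float)
    map_features = np.asarray(map_features, dtype=float)
    # With correspondences fixed, the translation that best aligns the two
    # point sets in the least-squares sense is the mean displacement.
    return (map_features - detected).mean(axis=0)
```

In practice the correspondence step (e.g. crater matching) dominates the difficulty; this sketch only shows how a horizontal position falls out once matching is done.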

For example, when the altitude of the moving object M relative to the ground surface of the celestial body PL (for example, taking the ground surface as zero altitude) becomes equal to or lower than a predetermined altitude, the inertial navigation calculation processing unit 116 derives a state quantity that quantitatively indicates the state of the moving object M by inertial navigation using the detection results of the inertial measurement device 20. The "state quantity" includes, for example, position, velocity, and the quaternion from the body coordinate system to the inertial coordinate system. The body coordinate system is a coordinate system that takes the center of gravity of the moving object M as its origin and the body axis of the moving object M as one of its axes. The inertial coordinate system is a coordinate system that takes the vertical direction defined by the gravity of the celestial body PL as one of its axes. The "state quantity" may also include sensor bias values for the acceleration and angular velocity detected by the inertial measurement device 20.
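
As a hedged illustration of how position and velocity state quantities can be propagated from inertial measurements, the sketch below integrates a bias-corrected accelerometer reading over one step. Attitude (quaternion) propagation, which the text also lists as a state quantity, is omitted for brevity; all names are illustrative.

```python
import numpy as np

def dead_reckon(pos, vel, accel_meas, bias, gravity, dt):
    """One simplified strapdown propagation step.

    pos, vel   : current position and velocity (3-vectors)
    accel_meas : raw accelerometer reading
    bias       : estimated accelerometer bias (a state quantity candidate)
    gravity    : local gravity vector of the celestial body
    dt         : integration step
    """
    accel = accel_meas - bias + gravity          # bias-corrected specific force
    new_vel = vel + accel * dt                   # integrate acceleration
    new_pos = pos + vel * dt + 0.5 * accel * dt**2
    return new_pos, new_vel
```

Because pure integration accumulates sensor errors, this is exactly the drift that the Kalman filter update using image navigation (described below) is meant to bound.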

In general, when remote control is performed by wireless communication with a monitoring device on the earth, the distance between the earth and the celestial body PL can be on the order of hundreds of millions of kilometers, which can correspond to a round-trip radio propagation time of several tens of minutes or more. Therefore, when lowering the altitude of the moving object M relative to the ground surface of the celestial body PL and landing the moving object M on the celestial body PL, remote control based on commands from the ground may not be able to keep up. Accordingly, in order to perform autonomous navigation, the inertial navigation calculation processing unit 116 derives the state quantity of the moving object M by inertial navigation using the detection values of the inertial measurement device 20.

The parameter determination unit 118 refers to the reference trajectory information 134 and determines the values of the parameters included in the various equations of the Kalman filter. The parameters include, for example, the state transition matrix of the Kalman filter, the covariance matrix of the process noise of the Kalman filter, and the covariance matrix of the observation noise of the Kalman filter, all described later. When the parameter determination unit 118 determines the parameter values, it stores them in the storage unit 130 as the filter parameter information 136.

When the state quantity of the moving object M is derived by the inertial navigation calculation processing unit 116, the prediction unit 120 predicts the variation of the error (navigation error) in the state quantity of the moving object M in a first process (prediction process) using the Kalman filter whose parameters are defined by the filter parameter information 136, and reduces the error (navigation error) in the state quantity of the moving object M in a second process (update process) using the Kalman filter.

For example, when the current processing cycle of the Kalman filter is k, the first process includes a process of predicting, in the current cycle k, the state quantity of the moving object M at the next processing cycle k+1, and a process of predicting, in the current cycle k, the error in the state quantity of the moving object M at the next processing cycle k+1. More specifically, the first process includes: a process of predicting the state quantity (estimated value) of the moving object M expected to be obtained at the next cycle k+1, based on the state quantity of the moving object M (a state quantity vector, described later) estimated to be obtained at the current cycle k and the control input (a control vector, described later) at the current cycle k; and a process of deriving, in the current cycle k, the covariance indicating the degree of error, or the accuracy, of the estimated value for the next cycle k+1 (the a posteriori estimated covariance), based on the covariance indicating the degree of error, or the accuracy, of the estimated value at the current cycle k (the a priori estimated covariance) and the covariance indicating the degree of process noise of the Kalman filter in the current cycle k.
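
The first process described above follows the standard Kalman filter time update. A minimal sketch under that assumption (the function and variable names are illustrative, not from the patent):

```python
import numpy as np

def kf_predict(x, P, F, Q, B=None, u=None):
    """Kalman filter time update (prediction).

    x, P : state estimate and its error covariance at cycle k
    F    : state transition matrix, Q : process noise covariance
    B, u : optional control-input model and control vector
    Returns the predicted state and covariance for cycle k+1,
    following the pattern P_pred = F P F^T + Q.
    """
    x_pred = F @ x
    if B is not None and u is not None:
        x_pred = x_pred + B @ u       # effect of the control input
    P_pred = F @ P @ F.T + Q          # error covariance grows by process noise
    return x_pred, P_pred
```

Note that the covariance recursion depends only on F and Q, not on the measured data, which is what lets the patent propagate covariance along a candidate trajectory in advance.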

The second process includes: a process of deriving a Kalman gain based on the covariance of the state quantity of the moving object M estimated as the a posteriori estimated covariance in the first process, and updating the state quantity of the moving object M estimated in the first process based on the derived Kalman gain and an observation value; and a process of updating the covariance of the state quantity of the moving object M based on the derived Kalman gain and the covariance of the state quantity of the moving object M estimated as the a posteriori estimated covariance in the first process. The observation value may be, for example, the difference between the state quantity of the moving object M derived by the inertial navigation calculation processing unit 116 and the state quantity of the moving object M derived by the image navigation calculation processing unit 114. That is, the observation value may be the difference between the position of the moving object M obtained by inertial navigation and the position of the moving object M obtained by image navigation. In the second process, by using the position difference as the observation value, quantities such as velocity and acceleration, which are derivatives of the position, are estimated as state quantities together with the position.
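
The second process corresponds to the standard Kalman filter measurement update. A minimal sketch (names illustrative; here z would play the role of the difference between the inertial-navigation and image-navigation solutions):

```python
import numpy as np

def kf_update(x_pred, P_pred, z, H, R):
    """Kalman filter measurement update.

    x_pred, P_pred : predicted state and covariance from the first process
    z              : observation (e.g. inertial-nav minus image-nav position)
    H, R           : observation matrix and observation noise covariance
    """
    S = H @ P_pred @ H.T + R                 # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)      # Kalman gain
    x_new = x_pred + K @ (z - H @ x_pred)    # correct the predicted state
    P_new = P_pred - K @ H @ P_pred          # covariance shrinks after update
    return x_new, P_new, K
```

The line computing P_new is the pattern described for formula (6): each update reduces the covariance, which is why inserting image-navigation updates along the trajectory reduces the navigation error.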

Here, when performing the first process and the second process using the Kalman filter, the prediction unit 120 derives the optimal solution for the future state quantity of the moving object M by solving the optimization problem (optimal control problem) exemplified by formulas (1) to (6). In the following, a letter with (→) represents a vector or a matrix.

  minimize  J = tr( G(→) P(→)kf )   ... (1)

  x(→)k+1 = f( x(→)k, u(→)k )   ... (2)

  g( x(→)k, u(→)k ) = 0   ... (3)

  h( x(→)k, u(→)k ) ≤ 0   ... (4)

  P(→)k|k-1 = F(→)k P(→)k-1|k-1 F(→)k^T + Q(→)k   ... (5)

  P(→)k|k = P(→)k|k-1 - P(→)k|k-1 H(→)^T ( H(→) P(→)k|k-1 H(→)^T + R(→)k )^(-1) H(→) P(→)k|k-1,  for update cycles K(→)   ... (6)

Here, x(→) is the state quantity vector indicating the state quantity of the moving object M; u(→) is the control vector of the moving object M; f is a nonlinear dynamics function; g is a state constraint expression; h is a control constraint expression; F(→) is the state transition matrix of the Kalman filter; H(→) is the observation matrix of the Kalman filter; K(→) is the number of processing cycles (steps) at which the second process (update process) described above is performed; P(→) is the covariance matrix of the Kalman filter; Q(→) is the covariance matrix of the process noise of the Kalman filter; R(→) is the covariance matrix of the observation noise of the Kalman filter; and k denotes each processing cycle (step) of the Kalman filter.

Formula (1) formulates the problem of minimizing an evaluation function J, and formulas (2) to (6) express the conditions to be satisfied in solving formula (1). The evaluation function J shown in formula (1) is the trace value of the composite mapping of a matrix G(→) and the covariance matrix P(→)kf at a processing cycle kf. The processing cycle kf is the final processing time (final processing cycle) of the Kalman filter, calculated based on the guidance trajectory instructed by the monitoring device on the ground. In other words, the processing cycle kf is the final processing cycle at which the moving object M lands on the celestial body PL. The matrix G(→) is a matrix whose off-diagonal elements are zero, used to compute a trace value (the sum of the diagonal components) in which specific diagonal components of the covariance matrix P(→)kf at the final processing time kf are weighted. Therefore, the evaluation function J is a trace value in which specific diagonal components of the covariance matrix P(→)kf are weighted.
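
The weighted-trace evaluation function can be sketched directly. Assuming a diagonal G(→) as described (the weight values below are illustrative only):

```python
import numpy as np

def evaluation_j(P_final, weights):
    """Weighted trace of the final-cycle covariance: J = tr(G P_kf).

    Because G is diagonal (off-diagonal elements zero), J reduces to a
    weighted sum of the selected diagonal elements of P_kf, i.e. the
    variances of the state components one chooses to penalize.
    """
    G = np.diag(weights)
    return float(np.trace(G @ P_final))
```

For example, weighting only the position variances (and zeroing the rest) makes J measure terminal position uncertainty alone.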

Formula (2) is a state equation expressed by a nonlinear dynamics function with the state quantity vector x(→) and the control vector u(→) as variables; formula (3) is an equality constraint expression; and formula (4) is an inequality constraint expression.

Formula (5) expresses, of the two processes included in the first process described above, the process of deriving the a posteriori estimated covariance. That is, formula (5) expresses deriving the covariance matrix of the error included in the estimated value of the current cycle k (the a posteriori estimated covariance) P(→)k|k-1, based on the covariance matrix of the error included in the estimated value of the previous cycle k-1 (the a priori estimated covariance) P(→)k-1|k-1 and the covariance matrix Q(→)k of the process noise of the Kalman filter in the current cycle k. The notation k|k-1 means that the covariance matrix of the current cycle k, one step in the future, is estimated in the immediately preceding cycle k-1.

Formula (6) expresses, of the two processes included in the second process described above, the process of updating the covariance of the state quantity of the moving object M. That is, as indicated by formula (5), formula (6) expresses deriving the covariance matrix P(→)k|k of the current cycle k again, based on the covariance matrix P(→)k|k-1 of the current cycle k estimated in the previous cycle k-1, thereby updating the covariance matrix P(→)k|k-1 estimated in the previous cycle k-1 to the more reliable covariance matrix P(→)k|k. The Kalman gain described above corresponds to the right-hand-side term P(→)k|k-1 H(→)^T ( H(→) P(→)k|k-1 H(→)^T + R(→)k )^(-1) in formula (6). The process of updating the covariance is performed at the predetermined processing cycles K(→), where k > Kf.

The prediction unit 120 solves the optimization problem by finding the time history of the control vector u(→) that minimizes the evaluation function J shown in formula (1). In other words, the prediction unit 120 solves the optimization problem by searching, among the set of kf control vectors u(→) obtained while repeating the Kalman filter processes from cycle k1 to cycle kf, for the control vector u(→) history that minimizes the evaluation function J.
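
The patent solves this search by convex optimization; purely to illustrate the idea of ranking control histories by the final covariance trace, the toy sketch below brute-forces small control sequences. The mapping q_of_u from a control value to an induced process noise is a hypothetical stand-in, not from the patent, and F is fixed to the identity for brevity.

```python
import itertools
import numpy as np

def best_control_sequence(P0, steps, candidates, q_of_u):
    """Pick the control sequence whose propagated covariance has the
    smallest final trace (a brute-force stand-in for the optimization).

    P0         : initial covariance matrix
    steps      : length of the control history
    candidates : admissible control values at each step
    q_of_u     : hypothetical map from control value to process noise Q(u)
    """
    F = np.eye(P0.shape[0])
    best_seq, best_J = None, np.inf
    for seq in itertools.product(candidates, repeat=steps):
        P = P0.copy()
        for u in seq:
            P = F @ P @ F.T + q_of_u(u)   # prediction-step recursion only
        J = np.trace(P)                   # evaluation function on final P
        if J < best_J:
            best_seq, best_J = seq, J
    return best_seq, best_J
```

Brute force scales exponentially in the horizon length, which is precisely why the patent linearizes the covariance recursion and poses the search as a convex program instead.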

In order to solve the optimization problem, a calculation method is required that discretely linearizes the evaluation function J with respect to the control vector u(→) under certain conditions. Therefore, in the present embodiment, the prediction unit 120 applies the approximation shown in formula (7) to the first process and the second process to discretely linearize the covariance matrix P(→)k, which is one element of the evaluation function J. P(-) in formula (7) represents the covariance matrix derived by the Kalman filter based on the movement that the moving object M should take, determined as the reference trajectory. The covariance matrix P(-), which represents the navigation accuracy when the moving object M moves along the reference trajectory, is derived by virtually operating the Kalman filter that would be used when the moving object M actually moves along the reference trajectory. "Virtually operating" means that the calculation results of this Kalman filter are not used for controlling the airframe of the moving object M. The symbol (-) represents an overline (bar).

[Expression (7): equation image not reproduced]

[Expression (8): equation image not reproduced]

 The approximation shown in Expression (7) expresses the assumption that, in the vicinity of the reference trajectory (x(→), u(→)), the covariance matrix P(→)k of each processing cycle coincides with, or comes extremely close to, the result of differentiating P(→)k by tr(P(→)k), and it holds only when the conditional expression shown in Expression (8) is satisfied. In other words, the approximation in Expression (7) assumes that the covariance matrix P(→)k approximates the value obtained by multiplying the covariance matrix P(→)k(-) by the ratio of the trace value tr(P(→)k) to the trace value tr(P(→)k(-)). The covariance matrix P(→)k shown in Expression (7) is an example of the "first covariance matrix", and the covariance matrix P(→)k(-) shown in Expression (7) is an example of the "second covariance matrix".
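The trace-ratio scaling assumed by Expression (7) can be sketched numerically as follows; the reference covariance below is an illustrative placeholder for the matrix P(-) that would be obtained by virtually running the Kalman filter along the reference trajectory:

```python
import numpy as np

def approx_covariance(P_ref, trace_target):
    """Expression (7)-style approximation: scale the reference covariance P_ref
    so that its trace matches trace_target while keeping its structure."""
    return (trace_target / np.trace(P_ref)) * P_ref

# Illustrative reference covariance P(-) (values are placeholders, not the patent's).
P_ref = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
P_approx = approx_covariance(P_ref, trace_target=1.5)
# The scaled matrix reproduces the requested trace exactly and keeps the
# correlation structure (off-diagonal ratios) of the reference matrix.
print(np.trace(P_approx))  # 1.5
```

The point of the approximation is that only the scalar tr(P) has to be carried through the optimization; the full matrix is recovered by rescaling the reference.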

 Expression (9) expresses the linear form of the covariance matrix P(→)k. The prediction unit 120 derives the time history of the control vector u(→) that minimizes the evaluation function J according to Expression (9) and the discretized and linearized Expressions (2) to (4). [K-, K+] in Expression (9) indicates the processing period of Expression (6), that is, the processing period of the second processing (the covariance matrix update processing), and each tr(P(→)) is expressed in a linear form that holds in the vicinity of the reference trajectory (x(→), u(→)) used as the basis for the linearization.

[Expression (9): equation image not reproduced]

 Specifically, when the prediction unit 120 derives the control vector u(→) that minimizes the evaluation function J according to Expression (9), it derives the state quantity vector x(→) by substituting the derived control vector u(→) into Expression (2), and, based on the derived state quantity vector x(→) and control vector u(→), repeats the discretization and linearization using Expression (7).

 In this way, by discretizing and linearizing the optimization problem of searching for the time history of the control vector u(→) that minimizes the evaluation function J around the reference trajectory (x(→), u(→)), the linearized optimization problem can be solved as a convex optimization problem. A convex optimization problem is the minimization of a convex function over a convex set; it can be solved more easily than a general optimization problem, and it has the property that a local minimum coincides with the global minimum. By linearizing the optimization problem and treating it as a convex optimization problem in this way, the calculation can be made to converge and the optimal solution (x(→), u(→)) can be obtained even when a relatively simple calculation based on the trace value (a scalar value) of the covariance matrix is performed, or even when a physically infeasible initial solution is given.
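The property that a local minimum of a convex problem is also global, and that even a poor initial solution still converges, can be illustrated with a small sketch; the convex quadratic cost here is an assumed stand-in for the linearized problem, not the patent's Expression (9):

```python
import numpy as np

def minimize_convex(grad, x0, lr=0.1, iters=500):
    """Plain gradient descent. For a convex cost any local minimum is global,
    so the starting point only affects speed, not the answer reached."""
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        x = x - lr * grad(x)
    return x

# Convex quadratic J(x) = (x - c)^T A (x - c) with A positive definite.
A = np.array([[2.0, 0.3], [0.3, 1.0]])
c = np.array([1.0, -2.0])
grad = lambda x: 2.0 * A @ (x - c)

x_from_zero = minimize_convex(grad, [0.0, 0.0])
x_from_bad = minimize_convex(grad, [50.0, -80.0])  # physically meaningless start
```

Both runs end at the same minimizer, which is the practical benefit the paragraph above attributes to convexification.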

 The body control unit 122 controls the propulsion output device 30 based on the optimal solution (x(→), u(→)) derived by the prediction unit 120 using the Kalman filter, so that states of the moving object M such as position, velocity, acceleration, and attitude become the desired states indicated by the optimal solution (x(→), u(→)).

 FIG. 3 is a flowchart illustrating an example of a series of processes performed by the control unit 110 of the embodiment. The processing of this flowchart is performed, for example, in a descent phase in which the distance (altitude) between the ground surface of the celestial body PL and the moving object M is equal to or less than a predetermined distance (for example, about 40 [m]).

 First, the parameter determination unit 118 refers to the reference trajectory information 134 and acquires (x(→), u(→)) of the reference trajectory serving as initial values (step S100). Next, the parameter determination unit 118 determines the values of the Kalman filter parameters (F(→), Q(→), R(→)) based on (x(→), u(→)) of the reference trajectory serving as the initial values (step S102).

 Next, the prediction unit 120 repeatedly derives the covariance matrix P(→)k using the Kalman filter including the parameters determined by the parameter determination unit 118, until the processing cycle k of the Kalman filter reaches the predetermined processing cycle K(→) at which the covariance matrix update processing starts (step S104).

 FIG. 4 is a diagram illustrating an example of the relationship between the covariance matrix P(→)k and the processing cycle k. As illustrated, given the initial value P(→)0, the covariance matrix P(→)k converges to a constant value as the processing is repeated. At this time, the covariance matrix P(→)k serving as the convergence value is uniquely determined by the parameters of the Kalman filter.
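The convergence behavior of FIG. 4 can be reproduced with a generic linear Kalman filter covariance recursion; the matrices F, Q, H, R below are illustrative placeholders, not the parameters of the embodiment:

```python
import numpy as np

def covariance_step(P, F, Q, H, R):
    """One predict/update cycle of the Kalman filter covariance (Riccati recursion)."""
    P_pred = F @ P @ F.T + Q                      # first processing: prediction
    S = H @ P_pred @ H.T + R                      # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)           # Kalman gain
    return (np.eye(P.shape[0]) - K @ H) @ P_pred  # second processing: update

F = np.array([[1.0, 0.1], [0.0, 1.0]])   # constant-velocity model (placeholder)
Q = 0.01 * np.eye(2)
H = np.array([[1.0, 0.0]])               # position measurement only
R = np.array([[0.5]])

P = 10.0 * np.eye(2)                     # large initial uncertainty P(→)0
traces = []
for _ in range(200):
    P = covariance_step(P, F, Q, H, R)
    traces.append(np.trace(P))
# tr(P_k) settles to a constant fixed by (F, Q, H, R), independent of P(→)0.
```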

 Returning to the description of FIG. 3, the prediction unit 120 next acquires (derives) gradient information around the reference trajectory (x(→), u(→)) given as the reference trajectory (step S106). The gradient information is information on the gradient of the evaluation function J. The gradient of the evaluation function J is, for example, the partial derivative with respect to x(→) or u(→) of the trace value of the covariance matrix P(→)k, which is one element of the evaluation function J and is expressed by the left-hand side of Expression (7). For example, to obtain the gradient information, the prediction unit 120 partially differentiates the covariance matrix P(→)k with respect to x(→) or u(→). Using the result of this partial differentiation, the prediction unit 120 can discretize and linearize (that is, convexify) the trace value tr(P(→)k) of the covariance matrix P(→)k. Discretization and linearization here mean that, when the evaluation function J is approximately written as a linear form in x(→) and u(→) at each discretized time k, these partial derivative values are used as the coefficients on x(→) and u(→).
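One simple way to obtain such gradient information is numerical differentiation of tr(P) around the reference; in the sketch below the measurement noise is assumed, purely for illustration, to depend on a scalar control input u:

```python
import numpy as np

def trace_P(u, steps=200):
    """tr(P) after running an illustrative 1-D Kalman covariance recursion in
    which the measurement noise R grows with the control input u (assumption)."""
    F, Q = 1.0, 0.01
    R = 0.5 + u ** 2      # assumed dependence of sensing quality on control
    P = 10.0
    for _ in range(steps):
        P_pred = F * P * F + Q
        K = P_pred / (P_pred + R)
        P = (1.0 - K) * P_pred
    return P

def gradient(f, u, h=1e-4):
    """Central finite difference: the coefficient used when writing tr(P) as a
    linear form in u around the reference trajectory."""
    return (f(u + h) - f(u - h)) / (2.0 * h)

u_ref = 1.0
g = gradient(trace_P, u_ref)
# First-order (linearized) model of tr(P) near the reference control u_ref:
lin = lambda u: trace_P(u_ref) + g * (u - u_ref)
```

Close to the reference, the linear model tracks the true trace well, which is exactly the regime in which the linearization of step S106 is valid.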

 Next, the prediction unit 120 derives the optimal solution (x(→), u(→)) at which the trace value tr(P(→)k) takes its minimum, by solving a convex optimization problem, which is one type of optimization problem (step S108).

 FIG. 5 is a conceptual diagram schematically showing the process of solving the convex optimization problem using the gradient information of the evaluation function J. The illustrated example shows the optimization problem being transformed into a convex optimization problem using the gradient information of the evaluation function J. When solving the convex optimization problem using the gradient information of the evaluation function J as in the illustrated example, the prediction unit 120 convexifies the trace value tr(P(→)k) of the covariance matrix P(→)k. As a result, even when the search for the optimal solution is performed locally, the solution at which the trace value tr(P(→)k) takes its minimum is not derived as a mere local solution, and one of the local solutions of the original problem can be found quickly by convex optimization.

 Next, the prediction unit 120 determines whether the norm (Euclidean norm) of the difference between (x(→), u(→)) given as the reference trajectory (x(-) in the figure) and (x(→), u(→)) derived as the optimal solution (x in the figure) is equal to or less than a threshold Δ (step S110).

 When the prediction unit 120 determines that the norm is equal to or less than the threshold Δ, the body control unit 122 controls the propulsion output device 30 based on the optimal solution (x(→), u(→)) derived by the prediction unit 120 using the Kalman filter (step S112). The processing of this flowchart then ends.

 When determining that the norm exceeds the threshold Δ, the prediction unit 120 updates the reference trajectory (x(→), u(→)) given as the initial value to the trajectory (x(→), u(→)) derived as the optimal solution in the processing of step S108 (step S114), and the processing returns to step S102. In the processing of step S114, the prediction unit 120 may change the upper and lower limits of the state quantity x(→) that can be taken as the optimal solution, so that the state quantity x(→) obtained as the optimal solution does not deviate greatly from the state quantity x(→)(-) indicated by the reference trajectory.

 In response, the parameter determination unit 118 determines the values of the Kalman filter parameters (F(→), Q(→), R(→)) again, based on the updated trajectory (x(→), u(→)). By thus repeatedly solving the convex optimization problem until the deviation from the reference trajectory, or from the trajectory obtained as the optimal solution in the previous processing, becomes equal to or less than the threshold Δ, future state quantities of the moving object M that reduce the navigation error of the autonomous navigation performed when landing the moving object M on the celestial body PL can be derived sequentially. As a result, the guidance trajectory represented by the time-series of future state quantities of the moving object M can be optimized so that the navigation error becomes small.
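The outer loop of steps S102 to S114 follows a successive-convexification pattern, sketched generically below; the inner solver is a stand-in surrogate step with a trust region, not the actual convex subproblem of the embodiment:

```python
import numpy as np

def solve_surrogate(x_ref, cost_grad, trust=1.0):
    """Stand-in for step S108: minimize a local surrogate of the cost around
    the reference x_ref, restricted to a trust region (the upper and lower
    limits placed on the state quantity)."""
    step = -0.5 * cost_grad(x_ref)
    step = np.clip(step, -trust, trust)   # keep the solution near the reference
    return x_ref + step

def successive_convexification(x0, cost_grad, delta=1e-6, max_iter=100):
    x_ref = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        x_opt = solve_surrogate(x_ref, cost_grad)          # S108: solve subproblem
        if np.linalg.norm(x_opt - x_ref) <= delta:         # S110: norm check
            return x_opt                                   # S112: accept solution
        x_ref = x_opt                                      # S114: update reference
    return x_ref

# Illustrative smooth cost with minimizer at [2, -1] (assumed, not the patent's J).
grad = lambda x: 2.0 * (x - np.array([2.0, -1.0]))
x_star = successive_convexification([10.0, 10.0], grad)
```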

 According to the embodiment described above, when the navigation device 100, which is a computer mounted on the moving object M including a plurality of sensors such as the camera 10 and the inertial measurement device 20, predicts the future state quantity of the moving object M using the Kalman filter based on the respective detection values of the camera 10 and the inertial measurement device 20, it derives the covariance matrix P(→)k, an index value indicating the degree of error in the future state quantity of the moving object M, based on the Kalman filter parameters (F(→), Q(→), R(→)), and derives, by solving an optimization problem (for example, a convex optimization problem), the state quantity x(→) of the moving object at which the evaluation function J containing the trace value tr(P(→)k) of the covariance matrix P(→)k as an element becomes optimal. This makes it possible to optimize the guidance trajectory for landing the moving object M on the celestial body PL so that the navigation error becomes small.

 [Modification]
 Hereinafter, modifications of the above-described embodiment will be described. In the above-described embodiment, the moving object M is a spacecraft and the sensors mounted on it are the camera 10 and the inertial measurement device 20; however, the invention is not limited to this. For example, when the moving object M is the above-described UAV, the sensors mounted on it may include a radar or a LIDAR (Laser (Light) Imaging Detection and Ranging) instead of, or in addition to, the camera 10. When the moving object M is the above-described AUV, the sensors mounted on it may include a sonar instead of, or in addition to, the camera 10. Radar, LIDAR, and sonar are other examples of the "terrain detection sensor".

 In the embodiment described above, the navigation device 100 derives the state quantity x(→) of the moving object at which the evaluation function J becomes optimal by solving a convex optimization problem, which is one type of optimization problem; however, the invention is not limited to this. For example, the prediction unit 120 of the navigation device 100 may derive the state quantity x(→) of the moving object at which the evaluation function J becomes optimal by solving another optimization problem using the Newton method or the steepest descent method, based on the partial derivatives with respect to x(→) and u(→) of the trace value of the covariance matrix P(→)k, which is the gradient information of the evaluation function J. In other words, under the assumption that the covariance matrix P(→)k approximates the value obtained by multiplying the covariance matrix P(→)k(-) by the ratio of the trace value tr(P(→)k) to the trace value tr(P(→)k(-)), the prediction unit 120 may derive the state quantity x(→) of the moving object at which the evaluation function J becomes optimal by solving another optimization problem using the Newton method or the steepest descent method.
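A Newton-method variant of such a gradient-based search can be sketched on a scalar surrogate cost; the cost function, derivatives, and iteration count below are illustrative assumptions, not part of the disclosure:

```python
def newton_minimize(f_prime, f_second, u0, iters=20):
    """Newton iteration on J'(u) = 0 using first and second derivatives,
    an alternative to solving the convex subproblem directly."""
    u = float(u0)
    for _ in range(iters):
        u = u - f_prime(u) / f_second(u)
    return u

# Illustrative convex scalar surrogate J(u) = (u - 2)^4 + u^2.
fp = lambda u: 4.0 * (u - 2.0) ** 3 + 2.0 * u        # J'(u)
fpp = lambda u: 12.0 * (u - 2.0) ** 2 + 2.0          # J''(u), always positive
u_opt = newton_minimize(fp, fpp, 0.0)
```

Because J'' is positive everywhere here, the Newton step is always well defined and the iteration settles on the stationary point, which for this convex surrogate is the minimizer.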

 While embodiments for carrying out the present invention have been described above, the present invention is in no way limited to these embodiments, and various modifications and substitutions can be made without departing from the gist of the present invention.

DESCRIPTION OF SYMBOLS: 10: camera, 20: inertial measurement device, 100: navigation device, 102: communication unit, 110: control unit, 112: acquisition unit, 114: image navigation calculation processing unit, 116: inertial navigation calculation processing unit, 118: parameter determination unit, 120: prediction unit, 122: body control unit, 130: storage unit, M: moving object

Claims (5)

1. A computer-readable non-transitory storage medium storing a guidance control program that causes a computer mounted on a moving object including a plurality of sensors to:
derive, when predicting a future state quantity of the moving object using a Kalman filter based on respective detection values of a first sensor and a second sensor included in the plurality of sensors, an index value indicating a degree of error in the state quantity of the moving object, based on parameters of the Kalman filter; and
derive, by solving an optimization problem, the state quantity of the moving object at which an evaluation function containing the index value as an element becomes optimal.
2. The storage medium storing the guidance control program according to claim 1, wherein the program causes the computer to:
change the parameters based on the state quantity of the moving object when a norm of a difference between the state quantity of the moving object derived by solving the optimization problem and a preset reference state quantity exceeds a threshold; and
derive the index value based on the Kalman filter including the changed parameters.
3. The storage medium storing the guidance control program according to claim 2, wherein the program causes the computer to:
repeat changing the parameters based on the state quantity of the moving object and deriving the index value based on the Kalman filter including the changed parameters, until the norm becomes equal to or less than the threshold.
4. The storage medium storing the guidance control program according to claim 1, wherein the program causes the computer to:
determine the parameters based on the derived state quantity of the moving object, and derive a first covariance matrix as the index value based on the Kalman filter including the determined parameters;
determine the parameters of the Kalman filter based on a preset reference state quantity, and derive a second covariance matrix as the index value based on the Kalman filter including the determined parameters; and
solve the optimization problem under an assumption that the first covariance matrix approximates a value obtained by multiplying the second covariance matrix by a value obtained by dividing a trace value of the first covariance matrix by a trace value of the second covariance matrix.
5. The storage medium storing the guidance control program according to claim 1, wherein the first sensor is a terrain detection sensor that detects terrain and the second sensor is an inertial measurement device, and wherein the program causes the computer to:
derive the state quantity of the moving object based on a detection result of the terrain detection sensor; and
predict the future state quantity of the moving object by using, as an observation value of the Kalman filter, a difference between the state quantity of the moving object measured by the inertial measurement device and the state quantity of the moving object derived based on the image.
PCT/JP2019/033282 2018-09-07 2019-08-26 Storage medium having guidance control program stored thereon Ceased WO2020050084A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2018-168193 2018-09-07
JP2018168193A JP2020041858A (en) 2018-09-07 2018-09-07 Guidance control program

Publications (1)

Publication Number Publication Date
WO2020050084A1 (en)

Family

ID=69722582

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/033282 Ceased WO2020050084A1 (en) 2018-09-07 2019-08-26 Storage medium having guidance control program stored thereon

Country Status (2)

Country Link
JP (1) JP2020041858A (en)
WO (1) WO2020050084A1 (en)


Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7389984B2 (en) * 2019-07-10 2023-12-01 国立研究開発法人宇宙航空研究開発機構 Guidance program, guidance method, and guidance device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0631657A (en) * 1992-07-16 1994-02-08 Fujitsu Ltd Mobile robot control system
JP2004239643A (en) * 2003-02-03 2004-08-26 Furuno Electric Co Ltd Hybrid navigator
JP2005017032A (en) * 2003-06-24 2005-01-20 Hitachi Ltd Target motion analysis method and apparatus
JP2006038650A (en) * 2004-07-27 2006-02-09 Sumitomo Precision Prod Co Ltd Attitude measurement method, attitude control device, direction meter, and computer program
JP2010261842A (en) * 2009-05-08 2010-11-18 Ihi Corp Inertial navigation device, flying object, and navigation data calculation method
JP2013185898A (en) * 2012-03-07 2013-09-19 Yamaha Corp State estimation device


Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114880874A (en) * 2022-06-07 2022-08-09 东南大学 Adaptive robust estimation method and system for parameters of unmanned surface vehicle
WO2023236247A1 (en) * 2022-06-07 2023-12-14 东南大学 Adaptive robust estimation method and system for unmanned surface vessel parameters
CN114880874B (en) * 2022-06-07 2024-03-12 东南大学 Self-adaptive robust estimation method and system for parameters of unmanned ship on water surface

Also Published As

Publication number Publication date
JP2020041858A (en) 2020-03-19

Similar Documents

Publication Publication Date Title
US10151588B1 (en) Determining position and orientation for aerial vehicle in GNSS-denied situations
Bao et al. Integrated navigation for autonomous underwater vehicles in aquaculture: A review
Caballero et al. Vision-based odometry and SLAM for medium and high altitude flying UAVs
EP2856273B1 (en) Pose estimation
EP1901153A1 (en) Control system for unmanned 4-rotor-helicopter
Santamaria-Navarro et al. Autonomous navigation of micro aerial vehicles using high-rate and low-cost sensors
KR20170074539A (en) Unmanned aerial vehicle flight control system and method using deep learning
CN107656545A (en) A kind of automatic obstacle avoiding searched and rescued towards unmanned plane field and air navigation aid
da Silva et al. Fuzzy adaptive extended Kalman filter for UAV INS/GPS data fusion
CN118963388A (en) Autonomous flight control system for drones without external navigation
CN118244763A (en) A global path planning and local obstacle avoidance method for unmanned boats based on multi-sensor
Mansouri et al. Subterranean MAV navigation based on nonlinear MPC with collision avoidance constraints
Chowdhary et al. Self-contained autonomous indoor flight with ranging sensor navigation
WO2022091305A1 (en) Behavior estimation device, behavior estimation method, route generation device, route generation method, and computer-readable recording medium
WO2020050084A1 (en) Storage medium having guidance control program stored thereon
KR20220086479A (en) Aircraft sensor system synchronization
JP7389984B2 (en) Guidance program, guidance method, and guidance device
Wilson et al. UAV rendezvous: From concept to flight test
Ramirez et al. Stability analysis of a vision-based UAV controller: An application to autonomous road following missions
Demim et al. Simultaneous localization and mapping algorithm based on 3D laser for unmanned aerial vehicle
Cordero et al. Survey on attitude and heading reference systems for remotely piloted aircraft systems
CN117408084B (en) Enhanced Kalman filtering method and system for unmanned aerial vehicle track prediction
Chowdhary et al. Integrated guidance navigation and control for a fully autonomous indoor uas
CN119984238B (en) MEMS-based integrated navigation method and device
Bonin-Font et al. Multisensor aided inertial navigation in 6DOF AUVs using a multiplicative error state Kalman filter

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19858251

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19858251

Country of ref document: EP

Kind code of ref document: A1