JP2023008679A

JP2023008679A - Information providing method, information providing program, and marking support system

Info

Publication number: JP2023008679A
Application number: JP2021112417A
Authority: JP
Inventors: 能久浅山; Yoshihisa Asayama
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2021-07-06
Filing date: 2021-07-06
Publication date: 2023-01-19
Anticipated expiration: 2041-07-06
Also published as: JP7658200B2

Abstract

【課題】技の認識結果に関する情報提供の品質向上を実現すること。【解決手段】情報提供方法では、技認識用の機械学習モデルに入力された骨格情報から前記技認識用の機械学習モデルが出力する技に対応する特徴量を算出し、算出された特徴量と、前記技認識用の機械学習モデルの訓練に用いられた骨格情報の訓練データのうち前記技と同一の技のラベルが付与された訓練データから求まる特徴量との差分を算出し、算出された特徴量の差分を出力する、処理をコンピュータが実行する。【選択図】図６[Problem] To improve the quality of information provision regarding the results of technique recognition. [Solution] In the information provision method, a computer executes a process of calculating a feature amount corresponding to a technique output by a machine learning model for technique recognition from skeletal information input to the machine learning model for technique recognition, calculating a difference between the calculated feature amount and a feature amount obtained from training data of skeletal information used to train the machine learning model for technique recognition that is labeled with the same technique as the technique, and outputting the calculated feature amount difference. [Selected Figure] Figure 6

Description

本発明は、情報提供技術に関する。 The present invention relates to information provision technology.

人物の姿勢認識や行動認識が各種の分野で利用されている。例えば、体操競技を例に挙げれば、現状の採点方法は複数の審判による目視で行われているが、器具の進化やトレーニング方法の改善により、動きの複雑化を伴う技の高度化が進み、審判による技の認識が困難となる場面が増加している。この結果、審判ごとに選手の採点結果が異なるなど採点の公平性や正確性の維持に懸念が生じている。 Human posture recognition and action recognition are used in various fields. Taking gymnastics as an example, the current scoring method is based on visual observation by multiple referees. The number of situations where it is difficult for referees to recognize techniques is increasing. As a result, there are concerns about maintaining the fairness and accuracy of scoring, such as the scoring results of athletes differing for each referee.

このような背景から、３次元のレーザセンサにより選手の３次元点群データを取得し、選手の骨格情報の検出、選手により実施された技の認識を実行して技の認識結果に関する情報を提供することで、審判の採点を支援する採点支援システムが提案されている。 Against this background, we acquire 3D point cloud data of athletes using a 3D laser sensor, detect the athlete's skeletal information, recognize the techniques performed by the athlete, and provide information on the recognition results of the techniques. A scoring support system has been proposed that assists referees in scoring.

例えば、採点支援システムでは、技の認識結果に関する情報提供の一環として、技の認識に用いられる機械学習モデルが出力する尤度が提供されている。これにより、１つの側面として、機械学習モデルが不正解の技のラベルを出力している可能性が高い場面、例えば過検知や誤検知などの場面で自動採点のミスに注意を喚起する。 For example, in a scoring support system, the likelihood output by a machine learning model used for skill recognition is provided as part of providing information on skill recognition results. As an aspect, this draws attention to mistakes in automatic scoring in situations where there is a high possibility that the machine learning model is outputting a label for an incorrect technique, such as an over-detection or a false detection.

特開２０１７－２２８１００号公報Japanese Patent Application Laid-Open No. 2017-228100 特表２０１４－５０９０１１号公報Special Table 2014-509011 特開２０１９－１６６３１１号公報JP 2019-166311 A 米国特許出願公開第２０１２／０２１４５９４号明細書U.S. Patent Application Publication No. 2012/0214594

しかしながら、上記の尤度は、以下に説明する通り、技の認識結果に関する情報提供の品質として十分なものであるとは限らない一面がある。 However, as will be described below, the above likelihood is not always sufficient as the quality of providing information on the recognition result of the trick.

すなわち、上記の尤度は、機械学習モデルの訓練時に訓練データとして用いられる骨格情報と、採点などの場面で入力される骨格情報との差に対応するものに過ぎず、真の確からしさとは一致しない。特に、機械学習モデルが分類するクラスの種類が多いほど、さらには、機械学習モデルによる分類精度に個人差が与える影響が大きいほど、上記の尤度と真の確からしさは一致しづらくなる。例えば、機械学習モデルが出力する尤度が低い場合でもラベルが正解である場合もあれば、機械学習モデルが出力する尤度が高い場合でもラベルが不正解である場合もある。加えて、機械学習モデルによる判断プロセスは、人間と異なるものであってブラックボックスである場合があるので、機械学習モデルが出力する尤度を提供したとしても人への説明に適さない一面もある。 In other words, the above likelihood simply corresponds to the difference between the skeletal information used as training data when training the machine learning model and the skeletal information input in situations such as scoring. It does not match. In particular, the greater the number of types of classes classified by the machine learning model, and the greater the impact of individual differences on the classification accuracy of the machine learning model, the more difficult it is for the likelihood and true likelihood to match. For example, the label may be correct even if the likelihood output by the machine learning model is low, and the label may be incorrect even if the likelihood output by the machine learning model is high. In addition, the decision process by machine learning models is different from that of humans and may be a black box, so even if the likelihood output by machine learning models is provided, there is also an aspect that is not suitable for explanation to humans. .

１つの側面では、技の認識結果に関する情報提供の品質向上を実現できる情報提供方法、情報提供プログラム及び採点支援システムを提供することを目的とする。 An object of one aspect is to provide an information providing method, an information providing program, and a scoring support system capable of improving the quality of providing information on skill recognition results.

一態様にかかる情報提供方法では、技認識用の機械学習モデルに入力された骨格情報から前記技認識用の機械学習モデルが出力する技に対応する特徴量を算出し、算出された特徴量と、前記技認識用の機械学習モデルの訓練に用いられた骨格情報の訓練データのうち前記技と同一の技のラベルが付与された訓練データから求まる特徴量との差分を算出し、算出された特徴量の差分を出力する、処理をコンピュータが実行する。 In an information providing method according to one aspect, a feature amount corresponding to a technique output by the machine learning model for technique recognition is calculated from skeleton information input to the machine learning model for technique recognition, and the calculated feature amount and , out of the training data of the skeleton information used for training the machine learning model for the technique recognition, the difference between the feature amount obtained from the training data labeled with the same technique as the technique is calculated. A computer executes a process of outputting the difference of the feature amounts.

一実施形態によれば、技の認識結果に関する情報提供の品質向上を実現できる。 According to one embodiment, it is possible to improve the quality of providing information on the recognition results of tricks.

図１は、採点支援システムの構成例を示す図である。FIG. 1 is a diagram showing a configuration example of a scoring support system. 図２は、骨格認識機能を示す模式図である。FIG. 2 is a schematic diagram showing the skeleton recognition function. 図３は、技認識機能を示す模式図である。FIG. 3 is a schematic diagram showing the technique recognition function. 図４は、マルチアングルビューの一例を示す図である。FIG. 4 is a diagram showing an example of a multi-angle view. 図５は、自動採点ビューの一例を示す図である。FIG. 5 is a diagram showing an example of an automatic scoring view. 図６は、実施例１に係るサーバ装置の機能構成例を示すブロック図である。FIG. 6 is a block diagram of a functional configuration example of a server device according to the first embodiment; 図７は、採点支援ビューの一例を示す図である。FIG. 7 is a diagram showing an example of the scoring support view. 図８は、トレーニング支援ビューの一例を示す図である。FIG. 8 is a diagram showing an example of a training assistance view. 図９は、実施例１に係る情報提供処理の手順を示すフローチャートである。FIG. 9 is a flowchart illustrating the procedure of information provision processing according to the first embodiment. 図１０は、実施例２に係るサーバ装置の機能構成例を示すブロック図である。FIG. 10 is a block diagram illustrating a functional configuration example of a server device according to the second embodiment; 図１１は、実施例２に係る情報提供処理の手順を示すフローチャートである。FIG. 11 is a flowchart illustrating the procedure of information provision processing according to the second embodiment. 図１２は、ハードウェア構成例を示す図である。FIG. 12 is a diagram illustrating a hardware configuration example.

以下、添付図面を参照して本願に係る情報提供方法、情報提供プログラム及び採点支援システムの実施例について説明する。各実施例には、あくまで１つの例や側面を示すに過ぎず、このような例示により数値や機能の範囲、利用シーンなどは限定されない。そして、各実施例は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Hereinafter, embodiments of an information providing method, an information providing program, and a scoring support system according to the present application will be described with reference to the accompanying drawings. Each embodiment merely shows one example or one aspect, and such examples do not limit the numerical values, the range of functions, the usage scene, and the like. Further, each embodiment can be appropriately combined within a range that does not contradict the processing contents.

＜システム構成＞
図１は、採点支援システムの構成例を示す図である。図１に示す採点支援システム１は、被写体である演技者３の３次元データを取得し、骨格認識や技認識などを通じて演技者３により実施される技の採点を支援するものである。 <System configuration>
FIG. 1 is a diagram showing a configuration example of a scoring support system. A scoring support system 1 shown in FIG. 1 acquires three-dimensional data of an actor 3, who is a subject, and supports scoring of techniques performed by the actor 3 through skeleton recognition, technique recognition, and the like.

このような技の例として、体操競技における演技を挙げるが、フィギュアスケートにおける演技の他、各種のスポーツにおける動作、例えば野球のスイングやサッカーのシュートなども技の範疇に含まれてよい。 As an example of such a technique, a performance in gymnastics will be given, but in addition to performance in figure skating, movements in various sports, such as a baseball swing and a soccer shoot, may also be included in the category of techniques.

図１に示すように、採点支援システム１には、３Ｄレーザセンサ５と、サーバ装置１０と、クライアント端末３０とが含まれ得る。 As shown in FIG. 1 , the scoring support system 1 may include a 3D laser sensor 5 , a server device 10 and a client terminal 30 .

３Ｄ（Three－Dimensional）レーザセンサ５は、赤外線レーザ等を用いて対象物までの距離、いわゆる深度を走査点に対応する画素ごとに測定するセンサ装置の一例である。例えば、３Ｄレーザセンサ５の例として、深度画像カメラやＬＩＤＡＲ（Light Detection and Ranging）技術を用いたレーザセンサ、例えばＭＥＭＳ（Micro-Electro-Mechanical Systems）ミラー型のレーザセンサであってよい。 The 3D (Three-Dimensional) laser sensor 5 is an example of a sensor device that uses an infrared laser or the like to measure the distance to an object, the so-called depth, for each pixel corresponding to a scanning point. For example, the 3D laser sensor 5 may be a laser sensor using a depth image camera or LIDAR (Light Detection and Ranging) technology, such as a MEMS (Micro-Electro-Mechanical Systems) mirror-type laser sensor.

サーバ装置１０は、各種のサービスを提供するコンピュータの一例である。あくまで一例として、サーバ装置１０は、図１に示すように、骨格認識機能７、技認識機能８および採点支援機能９がパッケージ化された採点支援サービスを提供する。例えば、骨格認識機能７は、３Ｄレーザセンサ５により測定される深度画像を用いて、演技者３の骨格部位、例えば関節の位置などの骨格情報を検出する機能を提供する。技認識機能８は、骨格認識機能７による骨格認識で得られる骨格情報の時系列データを用いて、演技者３により実施される技を認識する機能を提供する。採点支援機能９は、技認識機能８による技認識結果に基づいて採点を支援する機能を提供する。 The server device 10 is an example of a computer that provides various services. As an example only, the server device 10 provides a scoring support service in which a skeleton recognition function 7, a technique recognition function 8, and a scoring support function 9 are packaged as shown in FIG. For example, the skeleton recognition function 7 uses the depth image measured by the 3D laser sensor 5 to provide a function of detecting skeleton information such as the positions of the skeleton parts of the performer 3, such as joint positions. The technique recognition function 8 provides a function of recognizing techniques performed by the performer 3 using time-series data of skeleton information obtained by skeleton recognition by the skeleton recognition function 7 . The scoring support function 9 provides a function for supporting scoring based on the skill recognition result by the skill recognition function 8 .

これら骨格認識機能７、技認識機能８および採点支援機能９を実現するパッケージソフトウェアを任意のコンピュータに実行させることにより、上記の採点支援サービスが提供され得る。例えば、サーバ装置１０は、ＳａａＳ（Software as a Service）型のアプリケーションとして実装することで、骨格認識機能７、技認識機能８および採点支援機能９をクラウドサービスとして提供することができる。この他、サーバ装置１０は、骨格認識機能７、技認識機能８および採点支援機能９をオンプレミスに提供するサーバとして実装することもできる。 By causing an arbitrary computer to execute package software that realizes the skeleton recognition function 7, the technique recognition function 8, and the scoring support function 9, the above-described scoring support service can be provided. For example, the server device 10 can provide the skeleton recognition function 7, the technique recognition function 8, and the scoring support function 9 as a cloud service by being implemented as a SaaS (Software as a Service) type application. In addition, the server device 10 can be implemented as a server that provides the skeleton recognition function 7, the technique recognition function 8, and the scoring support function 9 on-premises.

クライアント端末３０は、上記のサービスの提供を受けるコンピュータの一例である。このようなクライアント端末３０は、上記のサービスのユーザのあくまで一例として、体操競技の関係者、例えば審判員や演技者３のトレーナなどにより使用される。例えば、クライアント端末３０は、パーソナルコンピュータなどのデスクトップ型のコンピュータなどであってよい。この他、クライアント端末３０は、ラップトップ型のコンピュータや携帯端末装置、ウェアラブル端末などの汎用の情報処理装置に限らず、ＨＭＩ（Human Machine Interface）などが採点支援用にカスタマイズされたものであってもよい。 The client terminal 30 is an example of a computer that receives the above services. Such a client terminal 30 is used by people involved in gymnastics, such as referees and trainers of the performers 3, as just one example of users of the above services. For example, the client terminal 30 may be a desktop computer such as a personal computer. In addition, the client terminal 30 is not limited to a general-purpose information processing device such as a laptop computer, a mobile terminal device, or a wearable terminal. good too.

なお、図１には、骨格認識機能７、技認識機能８および採点支援機能９がパッケージ化されて提供される例を挙げたが、各機能がモジュール化されて提供されてもよい。この場合、骨格認識機能７、技認識機能８および採点支援機能９の各々は、異なるコンピュータにより実現されてよく、また、同一のベンダまたは異なるベンダにより提供されてよい。 Note that FIG. 1 shows an example in which the skeleton recognition function 7, the technique recognition function 8, and the scoring support function 9 are provided as a package, but each function may be provided as a module. In this case, the skeleton recognition function 7, the technique recognition function 8, and the scoring support function 9 may each be realized by different computers, and may be provided by the same vendor or different vendors.

＜骨格認識機能＞
図２は、骨格認識機能７を示す模式図である。図２に示すように、骨格認識機能７は、あくまで一例として、機械学習モデルを用いる骨格認識とフィッティングとを組みわせたハイブリッド方式により実現できる。 <Skeleton recognition function>
FIG. 2 is a schematic diagram showing the skeleton recognition function 7. As shown in FIG. As shown in FIG. 2, the skeleton recognition function 7 can be realized by a hybrid system that combines skeleton recognition using a machine learning model and fitting, just as an example.

例えば、骨格認識には、深度画像を入力として３Ｄ骨格座標の推定値を出力する機械学習モデル７ｍ、例えばＣＮＮ（Convolutional Neural Network）系のニューラルネットワークを用いることができる。ここで言う「３Ｄ骨格座標」とは、頭や肩、背骨、肘、手首、腰、膝、足首などといった関節が３次元空間上で位置する座標を指し、「骨格情報」の一例に対応し得る。 For example, for skeletal recognition, a machine learning model 7m, such as a CNN (Convolutional Neural Network) neural network, which inputs depth images and outputs estimated values of 3D skeletal coordinates, can be used. The "3D skeletal coordinates" referred to here refer to the coordinates at which joints such as the head, shoulders, spine, elbows, wrists, hips, knees, and ankles are positioned in a three-dimensional space, and correspond to an example of "skeletal information." obtain.

このような機械学習モデル７ｍの訓練には、深度画像および正解ラベルの３Ｄ骨格座標が対応付けられた訓練データを含むデータセット７ＴＲを用いることができる。例えば、訓練データは、体操競技の３Ｄ骨格座標からコンピュータグラフィックス等により深度画像を生成することにより用意できる。このようなデータセットの下、学習フェイズでは、深度画像を機械学習モデル７ｍの説明変数とし、ラベルを機械学習モデル７ｍの目的変数とし、任意の機械学習のアルゴリズム、例えばディープラーニングなどにしたがって機械学習モデル７ｍを訓練できる。例えば、深度画像が入力された機械学習モデル７ｍが出力する３Ｄ骨格座標の推定値と、正解ラベルの３Ｄ骨格座標との損失に基づいて機械学習モデル７ｍのパラメータが更新される。これにより、訓練済みの機械学習モデル７Ｍが得られる。推論フェイズでは、体操器具や演技者３自身によるオクルージョンを克服するように設置された多視点の３Ｄレーザセンサ５Ａ～５Ｎから出力される多視点の深度画像を機械学習モデル７Ｍへ入力する。このように多視点の深度画像が入力された機械学習モデル７Ｍは、演技者３の３Ｄ骨格座標を出力する。 For training such a machine learning model 7m, a data set 7TR including training data in which depth images and 3D skeletal coordinates of correct labels are associated can be used. For example, the training data can be prepared by generating depth images using computer graphics or the like from 3D skeletal coordinates for gymnastics. Under such a data set, in the learning phase, the depth image is used as an explanatory variable for the machine learning model 7m, the label is used as the objective variable for the machine learning model 7m, and any machine learning algorithm, such as deep learning, is used for machine learning. Model 7m can be trained. For example, the parameters of the machine learning model 7m are updated based on the loss between the estimated value of the 3D skeleton coordinates output by the machine learning model 7m to which the depth image is input and the 3D skeleton coordinates of the correct labels. This yields a trained machine learning model 7M. In the inference phase, multi-viewpoint depth images output from multi-viewpoint 3D laser sensors 5A to 5N installed so as to overcome occlusion by the gymnastics equipment and the performer 3 himself are input to the machine learning model 7M. The machine learning model 7M to which the multi-viewpoint depth images are input in this way outputs the 3D skeleton coordinates of the performer 3 .

フィッティングでは、機械学習モデル７Ｍの３Ｄ骨格座標の出力や前フレームにおけるフィッティング結果などを初期値とし、多視点の深度画像が統合された３Ｄ点群に人体モデルを当てはめる。例えば、３Ｄ点群の座標と人体モデルの表面座標との一致度を示す評価関数（尤度）を定義し、最も尤度が高くなる関節角度を最適化により求めることで、３Ｄ骨格座標を決定する。 In the fitting, the output of the 3D skeleton coordinates of the machine learning model 7M, the fitting result in the previous frame, etc. are used as initial values, and the human body model is applied to the 3D point cloud in which the multi-viewpoint depth images are integrated. For example, by defining an evaluation function (likelihood) that indicates the degree of matching between the coordinates of the 3D point cloud and the surface coordinates of the human body model, and optimizing the joint angle with the highest likelihood, the 3D skeleton coordinates are determined. do.

３Ｄレーザセンサ５および骨格認識機能７により、マーカレスで演技者３の動きの３次元計測を実行する３Ｄセンシングが実現される。 The 3D laser sensor 5 and the skeleton recognition function 7 realize markerless 3D sensing that performs three-dimensional measurement of the movement of the performer 3 .

＜技認識機能＞
図３は、技認識機能８を示す模式図である。図３に示すように、技認識機能８は、３Ｄ骨格座標の時系列データを入力として体操競技における技名を出力する機械学習モデル８ｍ、例えばＣＮＮ系のニューラルネットワークにより実現できる。 <Technique recognition function>
FIG. 3 is a schematic diagram showing the technique recognition function 8. As shown in FIG. As shown in FIG. 3, the technique recognition function 8 can be realized by a machine learning model 8m, for example, a CNN-based neural network, which receives time-series data of 3D skeletal coordinates and outputs gymnastics technique names.

このような機械学習モデル８ｍの訓練には、３Ｄ骨格座標の時系列データおよび正解ラベルの技名が対応付けられた訓練データを含むデータセット８ＴＲを用いることができる。例えば、訓練データには、体操競技で演技が実演された３Ｄ骨格座標の時系列データを用いることができる。このようなデータセットの下、学習フェイズでは、３Ｄ骨格座標を機械学習モデル８ｍの説明変数とし、ラベルを機械学習モデル８ｍの目的変数とし、任意の機械学習のアルゴリズム、例えばディープラーニングなどにしたがって機械学習モデル８ｍを訓練できる。例えば、３Ｄ骨格座標の時系列データが入力された機械学習モデル８ｍが出力する技名の推定値と、正解ラベルの技名との損失に基づいて機械学習モデル８ｍのパラメータが更新される。これにより、訓練済みの機械学習モデル８Ｍが得られる。推論フェイズでは、骨格認識機能７による骨格認識で得られた３Ｄ骨格座標の時系列データを機械学習モデル８Ｍへ入力する。このように３Ｄ骨格座標の時系列データが入力された機械学習モデル８Ｍは、演技者３により実施された技名を技の認識結果として出力する。この際、技の認識結果に関する情報として、技名とルールや採点規則とを照合することにより識別できる情報、例えば技のグループ名や技の難易度、難易度評価点などを含めて出力することもできる。 For training such a machine learning model 8m, a data set 8TR including training data associated with time-series data of 3D skeleton coordinates and technique names of correct labels can be used. For example, the training data can be time-series data of 3D skeletal coordinates in which performances have been demonstrated in gymnastics. Under such a data set, in the learning phase, the 3D skeleton coordinates are used as explanatory variables of the machine learning model 8m, the labels are used as the objective variables of the machine learning model 8m, and any machine learning algorithm, such as deep learning, is used to machine A learning model 8m can be trained. For example, the parameters of the machine learning model 8m are updated based on the loss between the estimated skill name output by the machine learning model 8m to which the time series data of the 3D skeletal coordinates is input and the skill name of the correct label. This yields a trained machine learning model 8M. In the inference phase, time-series data of 3D skeleton coordinates obtained by skeleton recognition by the skeleton recognition function 7 is input to the machine learning model 8M. The machine learning model 8M to which the time-series data of the 3D skeletal coordinates is input in this way outputs the name of the technique performed by the performer 3 as the recognition result of the technique. At this time, information that can be identified by matching the technique name with rules and scoring rules, such as the technique group name, the difficulty level of the technique, the difficulty evaluation score, etc., should be output as information related to the recognition result of the technique. can also

＜採点支援機能＞
採点支援機能９は、骨格認識機能７による３Ｄ骨格座標、技認識機能８による技認識結果またはこれらの組合せに基づいて体操競技の関係者に対する情報提供を実行できる。例えば、採点支援機能９は、体操競技の関係者の一例に対応する審判員向けのユーザインタフェイスの例として、図４に示すマルチアングルビューや図５に示す自動採点ビューなどを提供できる。このような採点支援機能９がＷｅｂアプリケーションにより実現されることで、クライアント端末３０上で動作するブラウザを介して図４に示すマルチアングルビューや図５に示す自動採点ビューなどを提供できる。 <Scoring support function>
The scoring support function 9 can provide information to gymnastics participants based on 3D skeleton coordinates by the skeleton recognition function 7, technique recognition results by the technique recognition function 8, or a combination thereof. For example, the scoring support function 9 can provide a multi-angle view shown in FIG. 4, an automatic scoring view shown in FIG. 5, and the like as examples of user interfaces for referees corresponding to an example of a person involved in gymnastics. By realizing such a scoring support function 9 as a Web application, it is possible to provide the multi-angle view shown in FIG. 4 and the automatic scoring view shown in FIG.

図４は、マルチアングルビューの一例を示す図である。図４に示すように、マルチアングルビュー４００には、３Ｄレーザセンサ５が深度画像を取得するフレームごとに当該フレームにおける演技者３の３Ｄ骨格座標がモデル化された骨格モデルが正面や側面、平面、自由視点などの複数の視点で表示される。このように複数の視点で骨格モデルを表示するか否かは、タブ４１０およびタブ４２０に対する操作で切り替えることができる。例えば、“Ｍｕｌｔｉ”のタブ４１０が選択中である場合、図４に示す通り、演技者３の骨格モデルが複数の視点ごとに表示される。その一方で、“Ｓｉｎｇｌｅ”のタブ４２０が選択中である場合、指定の視点に対応する演技者３の骨格モデルが表示される。さらに、マルチアングルビュー４００では、シークバー４３０上でスライダ４３０Ａをスライドさせることにより、演技者３の骨格モデルを表示させるフレームを指定させることもできる。このようなマルチアングルビュー４００によれば、採点基準の１つとなり得る演技者３の関節角度を詳細に確認させることができる。 FIG. 4 is a diagram showing an example of a multi-angle view. As shown in FIG. 4, in the multi-angle view 400, the 3D skeletal coordinates of the performer 3 are modeled for each frame in which the 3D laser sensor 5 acquires the depth image. , Free viewpoint, etc. are displayed. Whether or not to display the skeletal model from a plurality of viewpoints can be switched by operating the tabs 410 and 420 . For example, when the "Multi" tab 410 is selected, the skeletal model of actor 3 is displayed for each of a plurality of viewpoints, as shown in FIG. On the other hand, when the "Single" tab 420 is selected, the skeletal model of actor 3 corresponding to the specified viewpoint is displayed. Furthermore, in the multi-angle view 400, by sliding the slider 430A on the seek bar 430, it is possible to specify the frame in which the skeleton model of the actor 3 is to be displayed. According to such a multi-angle view 400, it is possible to confirm in detail the joint angles of the performer 3, which can be one of the scoring criteria.

図５は、自動採点ビューの一例を示す図である。図５に示すように、自動採点ビュー５００には、骨格モデルビュー５１０および自動採点テーブル５２０が含まれる。例えば、骨格モデルビュー５１０には、ユーザ指定のアングルにおける演技者３の３Ｄ骨格座標がモデル化された骨格モデルが表示される。この骨格モデルビュー５１０に演技者３の骨格モデルを表示させるフレームは、シークバー５３０上でスライダ５３０Ａをスライドさせることにより指定させることができる。また、自動採点テーブル５２０には、技の認識結果が時系列に表示される。同図の例で言えば、自動採点ビュー５００のウィンドウの縦方向、すなわち上方向から下方向へ向けて、技名、技のグループ番号、技の難易度および難易度価値点が時系列に表示される。このように各技の情報が個別に表示される他、全演技のＤスコアの採点結果がさらに表示される。ここで、図５には、Ｄスコアのタブ５４０が選択中である例が示されているが、Ｅスコアのタブ５５０が選択された場合、全演技のＥスコアの採点結果の表示に切り替えることができる。このような自動採点ビュー５００によれば、審判業務の負担を軽減できる。 FIG. 5 is a diagram showing an example of an automatic scoring view. As shown in FIG. 5, autoscoring view 500 includes skeletal model view 510 and autoscoring table 520 . For example, the skeletal model view 510 displays a skeletal model in which the 3D skeletal coordinates of actor 3 at a user-specified angle are modeled. The frame in which the skeleton model of actor 3 is displayed in skeleton model view 510 can be specified by sliding slider 530 A on seek bar 530 . The automatic scoring table 520 displays the recognition results of tricks in chronological order. In the example shown in the figure, the name of the technique, the group number of the technique, the difficulty level of the technique, and the difficulty level value are displayed in chronological order in the vertical direction of the window of the automatic scoring view 500, that is, from the top to the bottom. be done. In addition to the individual display of information on each trick, the D-score scoring results for all performances are also displayed. Here, FIG. 5 shows an example in which the D-score tab 540 is being selected, but when the E-score tab 550 is selected, the display is switched to the display of the E-score scoring results for all performances. can be done. Such an automatic scoring view 500 can reduce the burden of referee work.

これらマルチアングルビューや自動採点ビューに加えて、技の認識結果に関する情報提供の一環として、技認識に用いられる機械学習モデル８Ｍが出力する技の尤度が提供され得る。これにより、機械学習モデル８Ｍが不正解の技のラベルを出力している可能性が高い場面、例えば過検知や誤検知などの場面で自動採点のミスに注意を喚起する側面がある。 In addition to the multi-angle view and the automatic scoring view, the likelihood of the technique output by the machine learning model 8M used for technique recognition can be provided as part of providing information on the technique recognition result. This has the aspect of calling attention to mistakes in automatic scoring in situations where there is a high possibility that the machine learning model 8M is outputting a label for an incorrect technique, for example, in situations such as over-detection or mis-detection.

なお、図４や図５では、技の認識結果に関する情報提供の例として、審判向けのコンテンツを例に挙げたが、情報提供の対象は審判員に限定されない。例えば、トレーナ向け、あるいはエンターテイメント向けのコンテンツを提供することとしてもかまわない。 In FIGS. 4 and 5, content for referees is used as an example of providing information on the recognition results of tricks, but the target of information provision is not limited to referees. For example, content for trainers or entertainment may be provided.

＜課題の一側面＞
上記の機械学習モデル８Ｍが出力する尤度は、以下に説明する通り、技の認識結果に関する情報提供の品質として十分なものであるとは限らない一面がある。 <One aspect of the challenge>
As will be described below, the likelihood output by the machine learning model 8M is not always of sufficient quality as the quality of providing information on the recognition result of the technique.

すなわち、上記の尤度は、機械学習モデル８Ｍの訓練時に訓練データとして用いられる骨格情報と、採点などの場面で入力される骨格情報との差に対応するものに過ぎず、真の確からしさとは一致しない。特に、機械学習モデル８Ｍが分類するクラスの種類が多いほど、さらには、機械学習モデル８Ｍによる分類精度に個人差が与える影響が大きいほど、上記の尤度と真の確からしさは一致しづらくなる。例えば、機械学習モデル８Ｍが出力する尤度が低い場合でもラベルが正解である場合もあれば、機械学習モデル８Ｍが出力する尤度が高い場合でもラベルが不正解である場合もある。加えて、機械学習モデル８Ｍによる判断プロセスは、人間と異なるものであってブラックボックスである場合があるので、機械学習モデル８Ｍが出力する尤度を提供したとしても人への説明に適さない一面もある。 In other words, the likelihood described above merely corresponds to the difference between the skeletal information used as training data when training the machine learning model 8M and the skeletal information input in situations such as scoring. do not match. In particular, the greater the number of types of classes classified by the machine learning model 8M, and the greater the impact of individual differences on the classification accuracy of the machine learning model 8M, the more difficult it is for the likelihood and true likelihood to match. . For example, the label may be correct even if the likelihood output by the machine learning model 8M is low, and the label may be incorrect even if the likelihood output by the machine learning model 8M is high. In addition, the judgment process by the machine learning model 8M is different from humans and may be a black box, so even if the likelihood output by the machine learning model 8M is provided, it is not suitable for explanation to humans. There is also

＜サーバ装置１０の構成＞
そこで、本実施例では、技の認識結果に関する情報提供の品質向上を実現できる情報提供機能を有するサーバ装置１０の機能的構成について説明する。図６は、実施例１に係るサーバ装置１０の機能構成例を示すブロック図である。図６には、サーバ装置１０が有する機能に対応するブロックが模式化されている。 <Configuration of server device 10>
Therefore, in this embodiment, a functional configuration of the server device 10 having an information providing function capable of improving the quality of providing information on the recognition results of tricks will be described. FIG. 6 is a block diagram illustrating a functional configuration example of the server device 10 according to the first embodiment. FIG. 6 schematically shows blocks corresponding to the functions of the server device 10 .

図６に示すように、サーバ装置１０は、通信インタフェイス部１１と、記憶部１３と、制御部１５とを有する。なお、図６には、上記の情報提供機能を含む採点支援サービスに関連する機能部が図示されているが、一部の機能に対応するモジュールが異なるコンピュータ、あるいは仮想マシンで動作することを妨げない。また、図６に示す機能部に限らず、既存のコンピュータがデフォルトまたはオプションで装備する機能がサーバ装置１０に備わることとしてもよい。 As shown in FIG. 6 , the server device 10 has a communication interface section 11 , a storage section 13 and a control section 15 . FIG. 6 shows functional units related to the scoring support service including the above information providing function. Absent. In addition, the server device 10 may be provided with functions that existing computers are provided with by default or as an option, without being limited to the functional units shown in FIG. 6 .

通信インタフェイス部１１は、他の装置、例えば３Ｄレーザセンサ５やクライアント端末３０などとの間で通信制御を行う通信制御部の一例に対応する。あくまで一例として、通信インタフェイス部１１は、ＬＡＮ（Local Area Network）カードなどのネットワークインターフェイスカードにより実現され得る。１つの側面として、通信インタフェイス部１１は、３Ｄレーザセンサ５から深度画像をフレーム単位で受け付けたり、また、技認識結果に関する情報をクライアント端末３０へ出力したりする。 The communication interface unit 11 corresponds to an example of a communication control unit that controls communication with other devices such as the 3D laser sensor 5 and the client terminal 30 . As an example only, the communication interface unit 11 can be realized by a network interface card such as a LAN (Local Area Network) card. As one aspect, the communication interface unit 11 receives depth images from the 3D laser sensor 5 on a frame-by-frame basis, and outputs information on skill recognition results to the client terminal 30 .

記憶部１３は、各種のデータを記憶する機能部である。あくまで一例として、記憶部１３は、ストレージ、例えば内部、外部または補助のストレージにより実現される。例えば、記憶部１３は、機械学習モデル８Ｍの生成に用いられた訓練データのデータセット８ＴＲを記憶する。このデータセット８ＴＲ以外にも、記憶部１３は、機械学習モデル７Ｍの生成に用いられた訓練データのデータセット７ＴＲ、３Ｄ骨格座標、技認識結果、あるいは自動採点結果などの各種のデータを記憶することができる。 The storage unit 13 is a functional unit that stores various data. By way of example only, the storage unit 13 is implemented by storage, such as internal, external or auxiliary storage. For example, the storage unit 13 stores a data set 8TR of training data used to generate the machine learning model 8M. In addition to this data set 8TR, the storage unit 13 stores various data such as a data set 7TR of training data used to generate the machine learning model 7M, 3D skeleton coordinates, technique recognition results, and automatic scoring results. be able to.

制御部１５は、サーバ装置１０の全体制御を行う処理部である。例えば、制御部１５は、ハードウェアプロセッサにより実現される。図６に示すように、制御部１５は、取得部１５Ａと、骨格認識部１５Ｂと、技認識部１５Ｃと、特徴量算出部１５Ｄと、特徴量差分算出部１５Ｅと、採点支援部１５Ｆとを有する。 The control unit 15 is a processing unit that performs overall control of the server device 10 . For example, the control unit 15 is realized by a hardware processor. As shown in FIG. 6, the control unit 15 includes an acquisition unit 15A, a skeleton recognition unit 15B, a technique recognition unit 15C, a feature amount calculation unit 15D, a feature amount difference calculation unit 15E, and a scoring support unit 15F. have.

取得部１５Ａは、深度画像を取得する処理部である。あくまで一例として、取得部１５Ａは、多視点の３Ｄレーザセンサ５Ａ～５Ｎから出力される多視点の深度画像をフレーム単位で取得することができる。ここで、取得部１５Ａが深度画像を取得する情報ソースは、任意の情報ソースであってよく、ネットワークＮＷを介する通信に限定されない。例えば、取得部１５Ａは、サーバ装置１０が有するストレージ、あるいはサーバ装置１０に着脱可能なリムーバブルメディア、例えばメモリカードやＵＳＢ（Universal Serial Bus）メモリなどから深度画像を取得することとしてもよい。 The acquisition unit 15A is a processing unit that acquires a depth image. As an example only, the acquiring unit 15A can acquire multi-viewpoint depth images output from the multi-viewpoint 3D laser sensors 5A to 5N on a frame-by-frame basis. Here, the information source from which the acquisition unit 15A acquires the depth image may be any information source, and is not limited to communication via the network NW. For example, the acquisition unit 15A may acquire the depth image from a storage included in the server device 10 or removable media that can be attached to and detached from the server device 10, such as a memory card or USB (Universal Serial Bus) memory.

骨格認識部１５Ｂは、骨格認識を実行する処理部である。骨格認識部１５Ｂは、図１に示す骨格認識機能７に対応し得る。一実施形態として、骨格認識部１５Ｂは、図２に示された訓練済みの機械学習モデル７Ｍを演技者３の骨格認識に用いることができる。例えば、骨格認識部１５Ｂは、取得部１５Ａにより深度画像の新規のフレームが取得された場合、当該新規のフレームに対応する多視点の深度画像を機械学習モデル７Ｍへ入力する。そして、骨格認識部１５Ｂは、多視点の深度画像が入力された機械学習モデル７Ｍから出力される演技者３の３Ｄ骨格座標の推定値を技認識部１５Ｃへ出力する。 The skeleton recognition unit 15B is a processing unit that executes skeleton recognition. The skeleton recognition unit 15B can correspond to the skeleton recognition function 7 shown in FIG. As one embodiment, the skeleton recognition unit 15B can use the trained machine learning model 7M shown in FIG. For example, when the acquisition unit 15A acquires a new frame of the depth image, the skeleton recognition unit 15B inputs the multi-viewpoint depth image corresponding to the new frame to the machine learning model 7M. Then, the skeleton recognition unit 15B outputs the estimated values of the 3D skeleton coordinates of the performer 3 output from the machine learning model 7M to which the multi-viewpoint depth image is input, to the technique recognition unit 15C.

技認識部１５Ｃは、技認識を実行する処理部である。技認識部１５Ｃは、図１に示す技認識機能８に対応し得る。一実施形態として、技認識部１５Ｃは、図３に示された訓練済みの機械学習モデル８Ｍを演技者３の技認識に用いることができる。例えば、技認識部１５Ｃは、骨格認識部１５Ｂにより新規のフレームに対応する３Ｄ骨格座標が出力された場合、当該新規のフレームから遡って特定の数のフレーム分の３Ｄ骨格座標の時系列データを機械学習モデル８Ｍへ入力する。そして、技認識部１５Ｃは、３Ｄ骨格座標の時系列データが入力された機械学習モデル８Ｍから出力される演技者３の技認識結果を特徴量算出部１５Ｄへ出力する。 The technique recognition unit 15C is a processing unit that executes technique recognition. The technique recognition unit 15C can correspond to the technique recognition function 8 shown in FIG. As one embodiment, the technique recognition unit 15C can use the trained machine learning model 8M shown in FIG. For example, when the skeleton recognition unit 15B outputs the 3D skeleton coordinates corresponding to a new frame, the technique recognition unit 15C acquires the time-series data of the 3D skeleton coordinates for a specific number of frames from the new frame. Input to machine learning model 8M. Then, the technique recognition unit 15C outputs the technique recognition result of the performer 3, which is output from the machine learning model 8M to which the time-series data of the 3D skeleton coordinates is input, to the feature amount calculation unit 15D.

特徴量算出部１５Ｄは、技に関する特徴量を算出する処理部である。一実施形態として、特徴量算出部１５Ｄは、技認識部１５Ｃにより技が認識された場合、例えば機械学習モデル８Ｍが出力するラベルが未検出、すなわち技なしでない場合、処理を起動できる。例えば、特徴量算出部１５Ｄは、技認識部１５Ｃにより認識された技の採点項目ごとに当該採点項目に対応する特徴量を算出する。すなわち、技の種類によって採点規則で定められた採点項目の種類や数が異なり得るので、技ごとに異なる種類および異なる数の特徴量が算出される。例えば、「前後開脚とび」を例に挙げれば、前後開脚、前膝角度およびジャンプ高さなどの特徴量が算出される。また、「前後開脚輪とび」を例に挙げれば、前後開脚、前膝角度およびジャンプ高さの特徴量に加えて、後脚の高さや上半身の反り角度などの特徴量がさらに算出される。例えば、特徴量算出部１５Ｄは、技の採点項目に対応する特徴量ごとに、機械学習モデル８Ｍに入力された３Ｄ骨格座標の時系列データのうち、当該特徴量の算出に用いるフレームを特定して当該フレームにおける３Ｄ骨格座標から特徴量を算出する。 The feature amount calculation unit 15D is a processing unit that calculates feature amounts related to techniques. As an embodiment, the feature amount calculation unit 15D can start processing when a technique is recognized by the technique recognition unit 15C, for example, when the label output by the machine learning model 8M is not detected, that is, when there is no technique. For example, the feature amount calculation unit 15D calculates a feature amount corresponding to each scoring item of the technique recognized by the technique recognizing unit 15C. That is, since the type and number of scoring items defined by the scoring rule may differ depending on the type of technique, different types and different numbers of feature values are calculated for each technique. For example, in the case of “forward-backward split leg jump”, feature amounts such as back-and-forth leg spread, front knee angle, and jump height are calculated. In addition, taking the example of "front and back split wheel jump", in addition to the front and rear split legs, the front knee angle and the jump height, the feature values such as the height of the back legs and the warp angle of the upper body are further calculated. be. For example, the feature amount calculation unit 15D specifies a frame to be used for calculating the feature amount from the time-series data of the 3D skeleton coordinates input to the machine learning model 8M for each feature amount corresponding to the scoring item of the technique. Then, the feature amount is calculated from the 3D skeleton coordinates in the frame.

特徴量差分算出部１５Ｅは、技認識に用いられた入力データおよび技の認識結果に対応する訓練データの間で特徴量の差分を算出する処理部である。例えば、入力データの特徴量として、特徴量算出部１５Ｄにより算出された特徴量が用いられる。一方、訓練データの特徴量として、機械学習モデル８Ｍの生成に用いられたデータセット８ＴＲのうち、技認識部１５Ｃにより認識された技と同一技のラベルが付与された訓練データに対応する３Ｄ骨格座標の時系列データから算出された特徴量が用いられる。なお、訓練データの特徴量については、オンデマンドで特徴量算出部１５Ｄに算出させることもできるが、事前に算出しておいた特徴量をプリセットとして用いることができる。 The feature amount difference calculation unit 15E is a processing unit that calculates a feature amount difference between the input data used for technique recognition and the training data corresponding to the technique recognition result. For example, the feature amount calculated by the feature amount calculation unit 15D is used as the feature amount of the input data. On the other hand, as a feature quantity of the training data, a 3D skeleton corresponding to the training data labeled with the same technique as the technique recognized by the technique recognition unit 15C in the data set 8TR used to generate the machine learning model 8M. A feature amount calculated from time-series data of coordinates is used. Note that the feature amount of the training data can be calculated by the feature amount calculation unit 15D on demand, but the feature amount calculated in advance can be used as a preset.

より詳細には、特徴量差分算出部１５Ｅは、技認識部１５Ｃにより認識された技と同一技のラベルが付与されたＬ個の訓練データごと、および、技認識部１５Ｃにより認識された技の採点項目の数に対応するＫ種類の特徴量ごとに、特徴量の差分を算出する。このように算出される特徴量の差分は、その値がゼロに近いほど入力データおよび訓練データの間で採点項目に対応する特徴量の距離が小さい、あるいは特徴量の類似度が高いことを意味し得る。つまり、特徴量の差分の値が小さいほど、訓練データに付与されたラベルの技に対応する採点項目にしたがって当該技が実施されている可能性が高まる。それ故、体操競技における採点規則にしたがって技が実施されている尤もらしさを数値化できる。この結果、技が実施されている真の確からしさを表現すると共に人の判断プロセスに適合する指標を提供できる。 More specifically, the feature amount difference calculation unit 15E calculates the number of L pieces of training data labeled with the same technique as the technique recognized by the technique recognition unit 15C and the technique recognized by the technique recognition unit 15C. A feature amount difference is calculated for each of K kinds of feature amounts corresponding to the number of scoring items. The difference between the feature amounts calculated in this way means that the closer the value is to zero, the smaller the distance of the feature amount corresponding to the scoring item between the input data and the training data, or the higher the similarity of the feature amounts. can. That is, the smaller the value of the difference between the feature amounts, the higher the possibility that the technique is performed according to the scoring item corresponding to the technique labeled with the training data. Therefore, it is possible to quantify the likelihood that the technique is performed according to the scoring rules in gymnastics. As a result, it is possible to provide an index that expresses the true likelihood that the technique is being performed and that is suitable for the human judgment process.

さらに、特徴量差分算出部１５Ｅは、上記の特徴量の差分から特徴量の尤度を算出する。以下、特徴量の尤度のことを指して「特徴量尤度」と記載する場合がある。あくまで一例として、特徴量尤度は、特徴量の差分を特定の数値範囲、例えば０～１の範囲へ正規化することにより算出することができる。この際、１に近づくほど特徴量が尤もらしい一方で０に近づくほど特徴量が尤もらしくない特徴量尤度に正規化する側面から、特徴量の差分の逆数をとることもできる。 Further, the feature quantity difference calculation unit 15E calculates the likelihood of the feature quantity from the difference in the feature quantity. Hereinafter, the likelihood of a feature amount may be referred to as a "feature amount likelihood". As just one example, the feature quantity likelihood can be calculated by normalizing the feature quantity difference to a specific numerical range, for example, a range of 0-1. At this time, the reciprocal of the difference of the feature amounts can be obtained from the aspect of normalization to the feature amount likelihood that the closer to 1 the more likely the feature amount is and the closer to 0 the less likely the feature amount is.

このようにＫ種類の特徴量尤度が算出された後、特徴量差分算出部１５Ｅは、Ｋ種類の特徴量尤度を代表する１つの代表値、例えば統計値を技の尤度として算出することができる。以下、技の尤度のことを指して「技尤度」と記載する場合がある。あくまで一例として、Ｋ種類の特徴量尤度の最小値を技尤度として導出することができる。この他、Ｋ種類の特徴量尤度の相加平均や加重平均などの平均値、あるいはＫ種類の特徴量尤度の中央値などを技尤度として導出することとしてもよい。このような技尤度をＬ個の訓練データごとに算出することにより、特徴量差分算出部１５Ｅは、Ｌ個の技尤度を算出する。 After the K types of feature quantity likelihoods are calculated in this way, the feature quantity difference calculating unit 15E calculates one representative value representing the K types of feature quantity likelihoods, for example, a statistical value, as the skill likelihood. be able to. Hereinafter, the likelihood of a technique may be referred to as "technique likelihood". As an example, the minimum value of K types of feature quantity likelihoods can be derived as the technique likelihood. In addition, an average value such as an arithmetic average or a weighted average of K types of feature quantity likelihoods, or a median value of K types of feature quantity likelihoods may be derived as skill likelihoods. By calculating such a skill likelihood for each of L pieces of training data, the feature quantity difference calculating unit 15E calculates L pieces of skill likelihood.

採点支援部１５Ｆは、採点を支援する処理部である。採点支援部１５Ｆは、図１に示す採点支援機能９に対応し得る。１つの側面として、採点支援部１５Ｆは、図４に示されたマルチアングルビュー４００や図５に示された自動採点ビュー５００をクライアント端末３０に表示させることができる。他の側面として、採点支援部１５Ｆは、技認識結果に関する情報提供の一環として、特徴量差分算出部１５Ｅにより算出された特徴量尤度や技尤度をクライアント端末３０に提供できる。例えば、採点支援部１５Ｆは、特徴量差分算出部１５Ｅにより算出されたＬ個の技尤度のうち最小の技尤度を抽出すると共に、最小の技尤度に対応する訓練データから算出されたＫ種類の特徴量尤度を抽出する。その上で、採点支援部１５Ｆは、最小の技尤度、あるいは最小の技尤度に対応する訓練データから算出されたＫ種類の特徴量尤度をクライアント端末３０に表示させる。 The scoring support unit 15F is a processing unit that supports scoring. The scoring support unit 15F can correspond to the scoring support function 9 shown in FIG. As one aspect, the scoring support unit 15F can cause the client terminal 30 to display the multi-angle view 400 shown in FIG. 4 and the automatic scoring view 500 shown in FIG. As another aspect, the scoring support unit 15F can provide the client terminal 30 with the feature amount likelihood and the technique likelihood calculated by the feature amount difference calculation unit 15E as part of providing information on the technique recognition result. For example, the scoring support unit 15F extracts the minimum skill likelihood from among the L number of skill likelihoods calculated by the feature amount difference calculation unit 15E, and K types of feature quantity likelihoods are extracted. After that, the scoring support unit 15F causes the client terminal 30 to display the minimum skill likelihood or the K types of feature amount likelihood calculated from the training data corresponding to the minimum skill likelihood.

図７は、採点支援ビューの一例を示す図である。図７に示すように、採点支援ビュー７００には、演技者３の３Ｄ骨格座標がモデル化された骨格モデルを表示する骨格モデルビュー７１０が含まれる。この骨格モデルビュー７１０に演技者３の骨格モデルを表示させるフレームは、シークバー７２０上のスライダ７２０Ａの位置に応じて連動させることができる。図７の例で言えば、シークバー７２０上のスライダ７２０Ａの位置に対応するフレームで実施中である技「前脚を曲げ伸ばした前後開脚とび」の骨格モデルが骨格モデルビュー７１０に表示されている。 FIG. 7 is a diagram showing an example of the scoring support view. As shown in FIG. 7, the scoring support view 700 includes a skeletal model view 710 that displays a skeletal model in which the 3D skeletal coordinates of actor 3 are modeled. The frame for displaying the skeleton model of the actor 3 in the skeleton model view 710 can be interlocked according to the position of the slider 720A on the seek bar 720. FIG. In the example of FIG. 7, the skeletal model view 710 displays the skeletal model of the technique "back and forth leg jump with front leg bent and stretched" being performed in the frame corresponding to the position of the slider 720A on the seek bar 720. .

さらに、シークバー７２０上のスライダ７２０Ａの位置に応じて、演技者３により実施される演技を時系列に表示するシンボルバー７３０の表示が連動する。例えば、シークバー７２０上のスライダ７２０Ａの位置に対応するフレームで実施中である技と、その前後で実施される技とが時系列に並べて表示される。図７の例で言えば、「ロンダート」、「後方伸身宙返り、両足着台」、「片足立ち１回ターン」、「前脚を曲げ伸ばした前後開脚とび」および「シソンヌ」の各技のシンボルが横方向、例えば左方向から右方向へ並べて表示される。 Furthermore, according to the position of the slider 720A on the seek bar 720, the display of the symbol bar 730 that displays the performance performed by the performer 3 in chronological order is interlocked. For example, the trick being performed in the frame corresponding to the position of the slider 720A on the seek bar 720 and the tricks performed before and after that are displayed side by side in chronological order. In the example of FIG. 7, each technique of "Ron dart", "Backward stretch somersault, two foot landing", "Single turn with one leg standing", "Back and forth leg jump with front leg bent and straightened", and "Sisonne". Symbols are displayed side by side from left to right, for example.

さらに、シンボルバー７３０に含まれるシンボルには、当該シンボルに対応する技尤度が対応付けて表示される。図７の例で言えば、シンボル「ロンダート」の下部には、技尤度「０．９１」が表示され、シンボル「後方伸身宙返り、両足着台」の下部には、技尤度「０．９９」が表示され、シンボル「片足立ち１回ターン」の下部には、技尤度「０．８７」が表示されている。さらに、シンボル「前脚を曲げ伸ばした前後開脚とび」の下部には、技尤度「０．２５」が表示され、シンボル「シソンヌ」の下部には、技尤度「０．８５」が表示されている。 Furthermore, symbols included in the symbol bar 730 are displayed in association with skill likelihoods corresponding to the symbols. In the example of FIG. 7, a skill likelihood of "0.91" is displayed below the symbol "Rondart", and a skill likelihood of "0. .99” is displayed, and the skill likelihood of “0.87” is displayed below the symbol “1 turn on one leg”. Furthermore, a technique likelihood of "0.25" is displayed below the symbol "forward and backward split jump with front leg bent and extended", and a technique likelihood of "0.85" is displayed below the symbol "sissonne". It is

ここで、シンボルバー７３０に表示されるシンボルや技尤度のうち、特定の条件を満たすシンボルまたは技尤度の表示形態と、他のシンボルまたは他の技尤度の表示形態とが区別される。例えば、技尤度が閾値以下である場合、あるいはシンボルバー７３０内で技尤度が最低である場合のいずれかの条件に該当するシンボルや技尤度を強調表示させることができる。図７の例で言えば、シンボル「前脚を曲げ伸ばした前後開脚とび」がハッチングで表示されると共に、その技尤度のフォントが太字で表示されている。 Here, among the symbols and technique likelihoods displayed on the symbol bar 730, the display form of symbols or technique likelihoods satisfying a specific condition and the display form of other symbols or other technique likelihoods are distinguished. . For example, if the skill likelihood is equal to or less than a threshold value, or if the skill likelihood is the lowest in the symbol bar 730, the symbol or skill likelihood corresponding to either condition can be highlighted. In the example of FIG. 7, the symbol "forward-backward split-leg jump with front leg bent and stretched" is displayed with hatching, and the skill likelihood font is displayed in bold.

このようなシンボルバー７３０によれば、技認識結果のうち過検知や誤検知のおそれが他の技よりも高い「前脚を曲げ伸ばした前後開脚とび」の注目度を高めることができる。この結果、「前脚を曲げ伸ばした前後開脚とび」の自動採点ミスに注意を喚起できる。これにより、例えば、「前脚を曲げ伸ばした前後開脚とび」と動きが類似する「前後開脚とび」と誤検知されていないかなどの推定やその確認などの対処も可能となる。 According to the symbol bar 730 as described above, it is possible to increase the degree of attention of the "back and forth leg jump with the front leg bent and extended" which is more likely to be over-detected or erroneously detected among the technique recognition results than other techniques. As a result, it is possible to draw attention to the automatic scoring error of "forward and backward open leg jump with front leg bent and extended". As a result, for example, it is possible to estimate whether or not a "front-rear split leg jump" similar in motion to a "front-rear split leg jump" is being erroneously detected, and to confirm the same.

図８は、トレーニング支援ビューの一例を示す図である。図８に示すように、トレーニング支援ビュー８００には、図７に示された骨格モデルビュー７１０、シークバー７２０、スライダ７２０Ａおよびシンボルバー７３０と同様のＧＵＩコンポーネントに加えて、特徴量尤度リスト８１０がさらに含まれる。 FIG. 8 is a diagram showing an example of a training assistance view. As shown in FIG. 8, the training support view 800 includes GUI components similar to the skeletal model view 710, seek bar 720, slider 720A and symbol bar 730 shown in FIG. Further included.

このような特徴量尤度リスト８１０には、シークバー７２０上のスライダ７２０Ａの位置に対応するフレームで実施中である技の採点項目に対応する種類の特徴量に関する特徴量尤度がリスト化されて表示される。 The feature quantity likelihood list 810 lists the feature quantity likelihoods of the types of feature quantities corresponding to the scoring items of the technique being executed in the frame corresponding to the position of the slider 720A on the seek bar 720. Is displayed.

図８の例で言えば、実施中の技が「前脚を曲げ伸ばした前後開脚とび」である側面から、特徴量尤度リスト８１０には、前後開脚「０．８８」、前膝角度「０．２５」およびジャンプ高さ「０．９８」の３つの特徴量尤度１～３が表示される。 In the example of FIG. 8, from the side where the technique being performed is "front-back split leg jump with front leg bent and straightened", the feature value likelihood list 810 includes front-back leg spread "0.88", front knee angle Three feature quantity likelihoods 1 to 3 of "0.25" and jump height "0.98" are displayed.

さらに、３つの特徴量尤度１～３のうち、前膝角度「０．２５」が他の特徴量尤度よりも相対的に低い、あるいは０～１の数値範囲において定め得る閾値、例えば０．４よりも低いという条件を満たすことから強調表示される。 Furthermore, among the three feature quantity likelihoods 1 to 3, the front knee angle "0.25" is relatively lower than the other feature quantity likelihoods, or a threshold value that can be set in a numerical range of 0 to 1, for example, 0 It is highlighted because it satisfies the condition that it is lower than .4.

このような特徴量尤度リスト８１０によれば、技認識結果「前脚を曲げ伸ばした前後開脚とび」と合わせて、前膝角度が「０．２５」という相対的および絶対的に低い値であることを演技者３やそのトレーナなどに提示できる。これにより、「前脚を曲げ伸ばした前後開脚とび」の採点規則で定められた通りの前脚の曲げ伸ばしが不十分であるという改善点のサジェスチョンが可能となる。 According to the feature value likelihood list 810, the front knee angle is a relatively and absolutely low value of "0.25" in combination with the technique recognition result "back-and-forth leg jump with the front leg bent and extended". Something can be presented to actor 3, his trainer, and so on. As a result, it is possible to make suggestions for improvements such as insufficient bending and stretching of the front leg as stipulated in the scoring rules for "front-back leg jump with the front leg bent and extended".

＜処理の流れ＞
図９は、実施例１に係る情報提供処理の手順を示すフローチャートである。図９に示す処理は、あくまで一例として、深度画像の新規のフレームが取得された場合に開始できる。図９に示すように、骨格認識部１５Ｂは、取得部１５Ａにより取得された深度画像を機械学習モデル７Ｍへ入力することにより、演技者３の３Ｄ骨格座標の推定値を機械学習モデル７Ｍに出力させることで、骨格認識を実行する（ステップＳ１０１）。 <Process flow>
FIG. 9 is a flowchart illustrating the procedure of information provision processing according to the first embodiment. The process illustrated in FIG. 9 may begin, by way of example only, when a new frame of depth images is acquired. As shown in FIG. 9, the skeleton recognition unit 15B inputs the depth image acquired by the acquisition unit 15A to the machine learning model 7M, thereby outputting the estimated values of the 3D skeleton coordinates of the performer 3 to the machine learning model 7M. Skeleton recognition is executed by causing the robot to perform skeleton recognition (step S101).

続いて、技認識部１５Ｃは、新規のフレームから遡って特定数のフレーム分の３Ｄ骨格座標の時系列データを機械学習モデル８Ｍへ入力することにより、演技者３の技認識結果を機械学習モデル８Ｍに出力させることで、技認識を実行する（ステップＳ１０２）。 Subsequently, the technique recognition unit 15C inputs the time-series data of the 3D skeleton coordinates for a specific number of frames retroactively from the new frame to the machine learning model 8M, thereby recognizing the technique recognition result of the performer 3 as the machine learning model. By causing 8M to output, technique recognition is executed (step S102).

このとき、ステップＳ１０２の技認識で技が認識された場合、例えば機械学習モデル８Ｍが出力するラベルが未検出、すなわち技なしでない場合（ステップＳ１０３Ｙｅｓ）、特徴量算出部１５Ｄは、次のような処理を実行する。すなわち、特徴量算出部１５Ｄは、ステップＳ１０２の技認識で認識された技の採点項目に対応するＫ種類の特徴量を算出する（ステップＳ１０４）。 At this time, if the technique is recognized in the technique recognition of step S102, for example, if the label output by the machine learning model 8M is not detected, that is, if there is no technique (Yes in step S103), the feature amount calculation unit 15D performs the following Execute the process. That is, the feature amount calculation unit 15D calculates K kinds of feature amounts corresponding to the scoring items of the technique recognized in the technique recognition of step S102 (step S104).

そして、特徴量差分算出部１５Ｅは、ステップＳ１０２の技認識で認識された技と同一技のラベルが付与された訓練データの個数Ｌに対応する回数の分、ステップＳ１０５からステップＳ１０７までの処理を繰り返すループ処理１を開始する。なお、ここでは、ループ処理が行われる例を挙げるが、訓練データの個数ＬごとにステップＳ１０５からステップＳ１０７の処理が並列して行われてもよい。 Then, the feature amount difference calculation unit 15E repeats the processing from step S105 to step S107 the number of times corresponding to the number L of training data to which the label of the same technique as the technique recognized in the technique recognition in step S102 is assigned. Repeat loop processing 1 is started. Although an example of loop processing is given here, the processing from step S105 to step S107 may be performed in parallel for each number L of training data.

さらに、特徴量差分算出部１５Ｅは、ステップＳ１０２の技認識で認識された技の採点項目の数に対応する特徴量の種類数Ｋに対応する回数の分、ステップＳ１０５およびステップＳ１０６の処理を繰り返すループ処理２を開始する。なお、ここでも、特徴量の種類数ＫごとにステップＳ１０５およびステップＳ１０６の処理が並列して行われてもよい。 Further, the feature amount difference calculation unit 15E repeats the processing of steps S105 and S106 for the number of times corresponding to the number of feature amount types K corresponding to the number of scoring items of the technique recognized in the technique recognition of step S102. Loop processing 2 is started. Also here, the processing of steps S105 and S106 may be performed in parallel for each type number K of feature amounts.

すなわち、特徴量差分算出部１５Ｅは、技認識に用いられた入力データおよび技の認識結果に対応する訓練データの間で特徴量の差分を算出する（ステップＳ１０５）。例えば、入力データの特徴量として、ステップＳ１０４で算出された特徴量が用いられる。一方、訓練データの特徴量として、機械学習モデル８Ｍの生成に用いられたデータセット８ＴＲのうち、ステップＳ１０２の技認識で認識された技と同一技のラベルが付与された訓練データに対応する３Ｄ骨格座標の時系列データから算出された特徴量が用いられる。 That is, the feature amount difference calculating unit 15E calculates the feature amount difference between the input data used for the technique recognition and the training data corresponding to the technique recognition result (step S105). For example, the feature amount calculated in step S104 is used as the feature amount of the input data. On the other hand, as a feature value of the training data, a 3D model corresponding to the training data labeled with the same technique as the technique recognized in the technique recognition in step S102 in the data set 8TR used to generate the machine learning model 8M. A feature amount calculated from the time-series data of the skeletal coordinates is used.

続いて、特徴量差分算出部１５Ｅは、ステップＳ１０５で算出された特徴量の差分から特徴量尤度を算出する（ステップＳ１０６）。 Subsequently, the feature quantity difference calculator 15E calculates a feature quantity likelihood from the feature quantity difference calculated in step S105 (step S106).

このようなループ処理２が繰り返されることにより、特徴量の種類数Ｋごとに特徴量尤度を得ることができる。ループ処理２が終了すると、特徴量差分算出部１５Ｅは、Ｋ種類の特徴量尤度を代表する１つの代表値、例えば最小値を技尤度として算出する（ステップＳ１０７）。さらに、ループ処理１が繰り返されることにより、訓練データの個数Ｌごとに技尤度を得ることができる。 By repeating such loop processing 2, the feature quantity likelihood can be obtained for each of the number K of types of feature quantities. When the loop process 2 ends, the feature amount difference calculation unit 15E calculates one representative value representing the K types of feature amount likelihoods, for example, the minimum value, as the skill likelihood (step S107). Furthermore, by repeating the loop process 1, the skill likelihood can be obtained for each number L of training data.

ループ処理１が終了すると、採点支援部１５Ｆは、ループ処理１で得られたＬ個の技尤度のうち最小の技尤度と、最小の技尤度に対応する訓練データから算出されたＫ種類の特徴量尤度をクライアント端末３０に表示させ（ステップＳ１０８）、処理を終了する。 When the loop process 1 ends, the scoring support unit 15F calculates the minimum skill likelihood among the L skill likelihoods obtained in the loop process 1 and the training data corresponding to the minimum skill likelihood. The client terminal 30 displays the feature amount likelihood of the type (step S108), and the process ends.

＜効果の一側面＞
上述してきたように、本実施例に係るサーバ装置１０は、技認識用の機械学習モデルに入力された骨格情報から求まる特徴量と、技認識用の機械学習モデルが出力する技と同一の技に関する訓練データの骨格情報から求まる特徴量との差分を提供する。このため、技が実施されている真の確からしさを表現すると共に人の判断プロセスに適合する指標を情報提供できる。したがって、本実施例に係るサーバ装置１０によれば、技の認識結果に関する情報提供の品質向上を実現できる。 <One aspect of the effect>
As described above, the server device 10 according to the present embodiment is configured to match the feature amount obtained from the skeleton information input to the machine learning model for technique recognition with the same technique as the technique output by the machine learning model for technique recognition. provides the difference from the feature value obtained from the skeleton information of the training data. For this reason, it is possible to express the true likelihood that the technique is being performed and to provide information on an index suitable for the human judgment process. Therefore, according to the server device 10 of the present embodiment, it is possible to improve the quality of providing information on the recognition results of tricks.

さて、上記の実施例１では、技認識で認識された技と同一技のラベルが付与された訓練データの個数Ｌごとに特徴量の差分、ひいては特徴量尤度や技尤度を算出する例を挙げたが、必ずしもＬ個の訓練データごとに特徴量の差分などを算出せずともよい。 Now, in the above-described first embodiment, there is an example in which the feature value difference, and thus the feature value likelihood and the skill likelihood are calculated for each number L of training data to which the label of the same skill as the skill recognized in the skill recognition is given. However, it is not always necessary to calculate the difference of the feature amount for each of the L pieces of training data.

すなわち、特徴量の算出時には、技認識で認識される技によって採点項目が異なるので、算出対象とする特徴量の種類を識別したり、特徴量の種類ごとに３Ｄ骨格座標の時系列データの中から特徴量の算出に用いるフレームを特定したりといった処理が行われる。 In other words, when calculating the feature amount, the scoring items differ depending on the technique recognized by the technique recognition. A process such as specifying a frame to be used for calculating a feature amount is performed from the frame.

このような処理負荷を軽減する側面から、本実施例では、技認識に用いられた入力データおよび技の認識結果に対応する訓練データの間で３Ｄ骨格座標の差分を算出し、差分が最小である訓練データを特徴量の差分の算出対象として絞り込む応用例を説明する。 From the aspect of reducing such processing load, in this embodiment, the difference in 3D skeleton coordinates is calculated between the input data used for technique recognition and the training data corresponding to the technique recognition result. An application example will be described in which certain training data is narrowed down as an object for calculating the difference of the feature quantity.

図１０は、実施例２に係るサーバ装置２０の機能構成例を示す図である。図１０では、図６に示されたサーバ装置１０と同様の機能を有する機能部には同一の符号を付し、その説明を省略することとする。図１０に示すように、サーバ装置２０は、図６に示すサーバ装置１０と比べて、制御部２５が骨格差分算出部２５Ａをさらに有する点で異なる。 FIG. 10 is a diagram illustrating a functional configuration example of the server device 20 according to the second embodiment. In FIG. 10, functional units having the same functions as those of the server device 10 shown in FIG. As shown in FIG. 10, server device 20 differs from server device 10 shown in FIG. 6 in that control unit 25 further includes skeleton difference calculation unit 25A.

骨格差分算出部２５Ａは、３Ｄ骨格座標の差分を算出する処理部である。以下、３Ｄ骨格座標の差分のことを指して「骨格差分」と記載する場合がある。 The skeleton difference calculation unit 25A is a processing unit that calculates a difference between 3D skeleton coordinates. Hereinafter, the difference in 3D skeleton coordinates may be referred to as "skeletal difference".

あくまで一例として、骨格差分算出部２５Ａは、技認識部１５Ｃにより技が認識された場合、例えば機械学習モデル８Ｍが出力するラベルが未検出、すなわち技なしでない場合、処理を起動できる。 By way of example only, the skeleton difference calculation unit 25A can start processing when a technique is recognized by the technique recognition unit 15C, for example, when the label output by the machine learning model 8M is not detected, ie, no technique.

例えば、骨格差分算出部２５Ａは、技認識に用いられた入力データおよび技の認識結果に対応する訓練データの間で３Ｄ骨格座標の差分を算出する。例えば、入力データとして、骨格認識部１５Ｂによる骨格認識で得られた３Ｄ骨格座標の時系列データ、別観点から言えば技認識部１５Ｃによる技認識に用いられた３Ｄ骨格座標の時系列データが用いられる。一方、訓練データとして、機械学習モデル８Ｍの生成に用いられたデータセット８ＴＲのうち、技認識部１５Ｃにより認識された技と同一技のラベルが付与された訓練データに対応する３Ｄ骨格座標の時系列データが用いられる。 For example, the skeleton difference calculation unit 25A calculates the difference in 3D skeleton coordinates between the input data used for technique recognition and the training data corresponding to the technique recognition result. For example, as the input data, time-series data of 3D skeleton coordinates obtained by skeleton recognition by the skeleton recognition unit 15B, and from another point of view, time-series data of 3D skeleton coordinates used for technique recognition by the technique recognition unit 15C are used. be done. On the other hand, the 3D skeleton coordinates corresponding to the training data labeled with the same technique as the technique recognized by the technique recognition unit 15C in the data set 8TR used to generate the machine learning model 8M as the training data. Series data are used.

より詳細には、骨格差分算出部２５Ａは、技認識部１５Ｃにより認識された技と同一技のラベルが付与されたＬ個の訓練データごとに、次のような処理を実行する。すなわち、骨格差分算出部２５Ａは、入力データおよび訓練データの間で３Ｄ骨格座標の時系列データのフレーム数が異なる場合、両者のフレーム数を統一する。このように入力データおよび訓練データの間でフレーム数を統一する場合、多い方のフレーム数を少ない方のフレーム数に合わせることもできれば、少ない方のフレーム数を多い方のフレーム数に合わせることもできる。例えば、少ない方のフレーム数を多い方のフレーム数に合わせる場合、不足分のフレームを補間するリサンプリングが必要となる。その一方で、多い方のフレーム数を少ない方のフレーム数に合わせる場合、超過分のフレームを間引くだけでよい。このような側面から、骨格差分算出部２５Ａは、入力データおよび訓練データのうち、フレーム数が多い方の３Ｄ骨格座標の時系列データから超過分のフレーム数に対応する３Ｄ骨格座標を間引くことにより、両者のフレーム数を統一する。 More specifically, the skeleton difference calculation unit 25A performs the following processing for each of L pieces of training data labeled with the same technique as the technique recognized by the technique recognition unit 15C. That is, when the input data and the training data have different numbers of frames of time-series data of 3D skeleton coordinates, the skeleton difference calculation unit 25A unifies the numbers of frames of both. When unifying the number of frames between the input data and the training data in this way, the larger number of frames can be matched with the smaller number of frames, or the smaller number of frames can be matched with the larger number of frames. can. For example, when matching a smaller number of frames to a larger number of frames, resampling is required to interpolate the missing frames. On the other hand, if you want to match the larger number of frames to the smaller number of frames, you only need to thin out the excess frames. From this aspect, the skeleton difference calculation unit 25A thins out the 3D skeleton coordinates corresponding to the excess number of frames from the time-series data of the 3D skeleton coordinates that have the larger number of frames in the input data and the training data. , to unify the number of frames of both.

フレーム数の統一後、骨格差分算出部２５Ａは、統一されたＭ個のフレームごと、および、骨格モデルに定義される関節の数Ｎごとに、入力データおよび訓練データの間で３Ｄ骨格座標の差分、すなわち骨格差分を算出する。ここで求める「差分」は、絶対値であってもよいし、二乗値であってもよい。Ｎ個の関節ごとに骨格差分が算出されると、骨格差分算出部２５Ａは、Ｎ個の骨格差分の合計値を算出する。その後、Ｍ個のフレームごとにＮ個の骨格差分の合計値が算出されると、骨格差分算出部２５Ａは、Ｍ個のフレーム分のＮ個の骨格差分の合計値から平均値を算出する。このようにＮ個分の骨格差分の平均値を代表値として算出することにより、Ｌ個の訓練データごとに訓練データおよび入力データの間で統一されるフレーム数Ｍが異なる場合でも、Ｌ個の訓練データの間でフレーム数の調整を不要化できる。そして、Ｌ個の訓練データごとにＮ個分の骨格差分の平均値が算出されると、骨格差分算出部２５Ａは、Ｌ個の訓練データのうちＮ個分の骨格差分の平均値が最小である訓練データを選択する。 After unifying the number of frames, the skeletal difference calculation unit 25A calculates the difference in 3D skeletal coordinates between the input data and the training data for each of the unified M frames and for each N of joints defined in the skeletal model. , that is, the skeleton difference is calculated. The "difference" obtained here may be an absolute value or a square value. After calculating the skeleton difference for each of the N joints, the skeleton difference calculator 25A calculates the total value of the N skeleton differences. After that, when the total value of N skeleton differences is calculated for each of M frames, the skeleton difference calculator 25A calculates an average value from the total values of N skeleton differences for M frames. By calculating the average value of N skeletal differences as a representative value in this way, even if the number of frames M to be unified between training data and input data is different for each L training data, L It eliminates the need to adjust the number of frames between training data. Then, when the average value of the N skeleton differences is calculated for each of the L pieces of training data, the skeleton difference calculation unit 25A determines that the average value of the N skeleton differences among the L pieces of training data is the smallest. Select some training data.

これにより、Ｌ個の訓練データのうち、入力データからの距離が最小であるか、あるいは入力データに対する類似度が最高である訓練データを絞り込むことができる。 As a result, among the L pieces of training data, the training data having the shortest distance from the input data or having the highest degree of similarity to the input data can be narrowed down.

図１１は、実施例２に係る情報提供処理の手順を示すフローチャートである。図１１に示す処理も、あくまで一例として、深度画像の新規のフレームが取得された場合に開始できる。図１１に示すように、骨格認識部１５Ｂは、取得部１５Ａにより取得された深度画像を機械学習モデル７Ｍへ入力することにより、演技者３の３Ｄ骨格座標の推定値を機械学習モデル７Ｍに出力させることで、骨格認識を実行する（ステップＳ１０１）。 FIG. 11 is a flowchart illustrating the procedure of information provision processing according to the second embodiment. The process illustrated in FIG. 11 can also be started, by way of example only, when a new frame of the depth image is acquired. As shown in FIG. 11, the skeleton recognition unit 15B inputs the depth image acquired by the acquisition unit 15A to the machine learning model 7M, thereby outputting the estimated values of the 3D skeleton coordinates of the performer 3 to the machine learning model 7M. Skeleton recognition is executed by causing the robot to perform skeleton recognition (step S101).

このとき、ステップＳ１０２の技認識で技が認識された場合、例えば機械学習モデル８Ｍが出力するラベルが未検出、すなわち技なしでない場合（ステップＳ１０３Ｙｅｓ）、骨格差分算出部２５Ａは、次のような処理を実行する。 At this time, if the technique is recognized in the technique recognition of step S102, for example, if the label output by the machine learning model 8M is not detected, that is, if the technique is not absent (Yes in step S103), the skeleton difference calculation unit 25A performs the following Execute the process.

すなわち、骨格差分算出部２５Ａは、ステップＳ１０２の技認識で認識された技と同一技のラベルが付与された訓練データの個数Ｌに対応する回数の分、ステップＳ２０１からステップＳ２０４までの処理を繰り返すループ処理１１を開始する。なお、ここでは、ループ処理が行われる例を挙げるが、訓練データの個数ＬごとにステップＳ２０１からステップＳ２０４までの処理が並列して行われてもよい。 That is, the skeleton difference calculation unit 25A repeats the processing from step S201 to step S204 the number of times corresponding to the number L of training data to which the label of the same technique as the technique recognized in the technique recognition in step S102 is assigned. A loop process 11 is started. Although an example in which loop processing is performed is given here, the processing from step S201 to step S204 may be performed in parallel for each number L of training data.

骨格差分算出部２５Ａは、入力データおよび訓練データのうち、フレーム数が多い方の３Ｄ骨格座標の時系列データから超過分のフレーム数に対応する３Ｄ骨格座標を間引くことにより、両者のフレーム数を統一する（ステップＳ２０１）。なお、入力データおよび訓練データの間でフレーム数が同一である場合、ステップＳ２０１の処理はスキップできる。 The skeleton difference calculation unit 25A thins out 3D skeleton coordinates corresponding to the excess number of frames from the time-series data of the 3D skeleton coordinates of whichever of the input data and the training data has the larger number of frames, thereby calculating the numbers of both frames. Unify (step S201). Note that if the number of frames is the same between the input data and the training data, the process of step S201 can be skipped.

その後、骨格差分算出部２５Ａは、ステップＳ２０１で統一されたフレームの数Ｍに対応する回数の分、ステップＳ２０２およびステップＳ２０３の処理を繰り返すループ処理１２を開始する。なお、ここでは、ループ処理が行われる例を挙げるが、フレームの数ＭごとにステップＳ２０２およびステップＳ２０３の処理が並列して行われてもよい。 Thereafter, the skeleton difference calculation unit 25A starts loop processing 12 that repeats the processing of steps S202 and S203 the number of times corresponding to the number M of frames unified in step S201. Although an example in which loop processing is performed is given here, the processing of steps S202 and S203 may be performed in parallel for each number M of frames.

さらに、骨格差分算出部２５Ａは、骨格モデルに定義される関節の数Ｎに対応する回数の分、ステップＳ２０２の処理を繰り返すループ処理１３を開始する。なお、ここでは、ループ処理が行われる例を挙げるが、関節の数ＮごとにステップＳ２０２の処理が並列して行われてもよい。 Further, the skeleton difference calculation unit 25A starts loop processing 13 for repeating the processing of step S202 a number of times corresponding to the number N of joints defined in the skeleton model. Although an example in which loop processing is performed is given here, the processing of step S202 may be performed in parallel for each number N of joints.

すなわち、骨格差分算出部２５Ａは、入力データおよび訓練データの間で３Ｄ骨格座標の差分、すなわち骨格差分を算出する（ステップＳ２０２）。このようなループ処理１３が繰り返されることにより、Ｎ個の関節ごとに骨格差分を得ることができる。 That is, the skeleton difference calculation unit 25A calculates the difference in 3D skeleton coordinates, that is, the skeleton difference, between the input data and the training data (step S202). By repeating such loop processing 13, a skeleton difference can be obtained for each of the N joints.

ループ処理１３が終了すると、骨格差分算出部２５Ａは、Ｎ個の骨格差分の合計値を算出する（ステップＳ２０３）。このようなループ処理１２が繰り返されることにより、Ｍ個のフレームごとにＮ個の骨格差分の合計値を得ることができる。 When the loop process 13 ends, the skeleton difference calculation unit 25A calculates the total value of N skeleton differences (step S203). By repeating such loop processing 12, the total value of N skeleton differences can be obtained for every M frames.

ループ処理１２が終了すると、骨格差分算出部２５Ａは、Ｍ個のフレーム分のＮ個の骨格差分の合計値から平均値を算出する（ステップＳ２０４）。このようなループ処理１１が繰り返されることにより、Ｌ個の訓練データごとにＮ個分の骨格差分の平均値を得ることができる。 When loop processing 12 ends, the skeleton difference calculation unit 25A calculates an average value from the total values of N skeleton differences for M frames (step S204). By repeating such a loop process 11, it is possible to obtain an average value of N skeletal differences for every L training data.

そして、ループ処理１１が終了すると、骨格差分算出部２５Ａは、Ｌ個の訓練データのうちＮ個分の骨格差分の平均値が最小である訓練データを選択する（ステップＳ２０５）。 Then, when the loop process 11 ends, the skeleton difference calculation unit 25A selects the training data having the smallest average value of N skeleton differences among the L pieces of training data (step S205).

その後、ステップＳ１０２の技認識で認識された技の採点項目に対応する特徴量の種類数Ｋに対応する回数の分、ステップＳ２０６からステップＳ２０８までの処理を繰り返すループ処理１４を開始する。なお、ここでは、ループ処理が行われる例を挙げるが、特徴量の種類数ＫごとにステップＳ２０６からステップＳ２０８までの処理が並列して行われてもよい。 After that, a loop process 14 is started in which the processes from step S206 to step S208 are repeated a number of times corresponding to the number K of feature amount types corresponding to the scoring items of the technique recognized in the technique recognition in step S102. Note that although an example of loop processing is described here, the processing from step S206 to step S208 may be performed in parallel for each type number K of feature amounts.

すなわち、特徴量算出部１５Ｄは、ステップＳ１０２の技認識で認識された技の採点項目に対応する特徴量を算出する（ステップＳ２０６）。 That is, the feature amount calculation unit 15D calculates the feature amount corresponding to the scoring item of the technique recognized in the technique recognition of step S102 (step S206).

そして、特徴量差分算出部１５Ｅは、技認識に用いられた入力データおよびステップＳ２０５で選択された訓練データの間で特徴量の差分を算出する（ステップＳ２０７）。例えば、入力データの特徴量として、ステップＳ２０６で算出された特徴量が用いられる。一方、訓練データの特徴量として、機械学習モデル８Ｍの生成に用いられたデータセット８ＴＲのうち、ステップＳ２０５で選択された訓練データに対応する３Ｄ骨格座標の時系列データから算出された特徴量が用いられる。 Then, the feature amount difference calculating unit 15E calculates the feature amount difference between the input data used for technique recognition and the training data selected in step S205 (step S207). For example, the feature amount calculated in step S206 is used as the feature amount of the input data. On the other hand, as the feature amount of the training data, the feature amount calculated from the time-series data of the 3D skeletal coordinates corresponding to the training data selected in step S205 in the data set 8TR used to generate the machine learning model 8M is Used.

続いて、特徴量差分算出部１５Ｅは、ステップＳ２０７で算出された特徴量の差分から特徴量尤度を算出する（ステップＳ２０８）。 Subsequently, the feature quantity difference calculation unit 15E calculates a feature quantity likelihood from the feature quantity difference calculated in step S207 (step S208).

このようなループ処理１４が繰り返されることにより、特徴量の種類数Ｋごとに特徴量尤度を得ることができる。ループ処理１４が終了すると、特徴量差分算出部１５Ｅは、Ｋ種類の特徴量尤度を代表する１つの代表値、例えば最小値を技尤度として算出する（ステップＳ２０９）。 By repeating such a loop process 14, it is possible to obtain the feature amount likelihood for each of the number K of types of feature amounts. When the loop process 14 ends, the feature amount difference calculation unit 15E calculates one representative value representing the K types of feature amount likelihoods, for example, the minimum value, as the technique likelihood (step S209).

最後に、採点支援部１５Ｆは、ステップＳ２０９で算出された技尤度と、ループ処理１４で得られたＫ種類の特徴量尤度をクライアント端末３０に表示させ（ステップＳ２１０）、処理を終了する。 Finally, the scoring support unit 15F causes the client terminal 30 to display the skill likelihood calculated in step S209 and the K types of feature quantity likelihood obtained in the loop processing 14 (step S210), and ends the process. .

上述してきたように、本実施例に係るサーバ装置２０は、上記の実施例１に係るサーバ装置１０と同様、技認識に用いられた入力データおよび技の認識結果に対応する訓練データの間で当該技の採点項目に対応する特徴量の差分を算出して出力する。このため、技が実施されている真の確からしさを表現すると共に人の判断プロセスに適合する指標を提供できる。したがって、本実施例に係るサーバ装置２０においても、技の認識結果に関する情報提供の品質向上を実現できる。 As described above, in the server device 20 according to the present embodiment, similar to the server device 10 according to the first embodiment, input data used for technique recognition and training data corresponding to the recognition result of the technique A difference in feature amount corresponding to the scoring item of the technique is calculated and output. For this reason, it is possible to provide an index that expresses the true likelihood that the technique is being performed and that is suitable for the human judgment process. Therefore, also in the server device 20 according to the present embodiment, it is possible to improve the quality of providing information on the recognition results of tricks.

さらに、本実施例に係るサーバ装置２０は、技認識に用いられた入力データおよび技の認識結果に対応する訓練データの間で３Ｄ骨格座標の差分を算出し、差分が最小である訓練データを特徴量の差分の算出対象として絞り込む。このため、本実施例に係るサーバ装置２０によれば、算出対象とする特徴量の種類を識別したり、特徴量の種類ごとに３Ｄ骨格座標の時系列データの中から特徴量の算出に用いるフレームを特定したりといった処理の負荷を削減できる。 Further, the server device 20 according to the present embodiment calculates the difference in 3D skeleton coordinates between the input data used for technique recognition and the training data corresponding to the technique recognition result, and selects the training data with the smallest difference. Narrow down the calculation target of the difference of the feature amount. For this reason, according to the server device 20 according to the present embodiment, it is possible to identify the type of feature amount to be calculated, and use the time-series data of the 3D skeleton coordinates for each type of feature amount to calculate the feature amount. It is possible to reduce the load of processing such as frame identification.

さて、これまで開示の装置に関する実施例について説明したが、本発明は上述した実施例以外にも、種々の異なる形態にて実施されてよいものである。そこで、以下では、本発明に含まれる他の実施例を説明する。 Although embodiments of the disclosed apparatus have been described so far, the present invention may be embodied in various forms other than the embodiments described above. Therefore, other embodiments included in the present invention will be described below.

＜訓練データ＞
上記の実施例１では、図９に示すステップＳ１０２の技認識で認識された技と同一技のラベルが付与されたＬ個の訓練データごとに特徴量尤度および技尤度を算出する例を挙げた。また、上記の実施例２では、図１１に示すステップＳ１０２の技認識で認識された技と同一技のラベルが付与されたＬ個の訓練データごとに骨格差分を算出する例を挙げた。これらの算出対象は、必ずしもＬ個の訓練データとせずともよい。 <Training data>
In the above-described first embodiment, an example of calculating the feature quantity likelihood and the technique likelihood for each of L pieces of training data labeled with the same technique as the technique recognized in the technique recognition in step S102 shown in FIG. mentioned. Further, in the above-described second embodiment, an example of calculating a skeleton difference for each of L pieces of training data labeled with the same technique as the technique recognized in the technique recognition in step S102 shown in FIG. 11 was given. These calculation targets do not necessarily have to be the L pieces of training data.

あくまで一例として、Ｌ個の訓練データの間で３Ｄ骨格座標の統計値、例えば相加平均や加重平均などを算出することによりＬ個の訓練データを代表する１つの訓練データを生成し、生成された訓練データを算出対象に用いることができる。他の一例として、Ｌ個の訓練データから抽出され得る訓練データのペアごとに３Ｄ骨格座標の差分を算出し、全ペアのうち３Ｄ骨格座標の差分が最大である訓練データのペアを算出対象に用いることもできる。 As an example only, one training data representative of the L training data is generated by calculating a statistical value of the 3D skeleton coordinates, such as an arithmetic average or a weighted average, among the L training data. It is possible to use the training data obtained for calculation. As another example, the difference in 3D skeleton coordinates is calculated for each pair of training data that can be extracted from L training data, and the pair of training data having the largest difference in 3D skeleton coordinates among all pairs is used as the calculation target. can also be used.

＜骨格差分＞
上記の実施例２では、Ｌ個の訓練データから１つの訓練データに絞り込む側面から骨格差分を用いる例を挙げたが、図１１に示すステップＳ２０５で選択された骨格差分は、ステップＳ２１０で出力される技尤度や特徴量尤度と共に出力することができる。 <Skeleton difference>
In the above-described second embodiment, an example of using skeleton differences from the aspect of narrowing down L training data to one training data was given, but the skeleton differences selected in step S205 shown in FIG. can be output together with the technique likelihood and feature likelihood.

このような骨格差分の情報提供により、以下のように、技尤度や特徴量尤度のみでは不十分な状況にも対応できる。例えば、座った姿勢から前方へ回転し倒立してそのままさらに前方へ回転し座った姿勢になる「前転」という技を例に挙げる。この技の特徴量としては、「足が着いた状態で始まる」、「１８０°回転」、「手をついた倒立姿勢になる」、「１８０°回転」、「足が着いた状態で終わる」という特徴量になる。これに対して、何かしらの技を失敗して平均台から落下する際に頭から落ちることを避けるためとっさに平均台を掴み、くるりと１回転して落ちる、という状況も発生し得る。この場合、状況によっては、足が着いた状態から落下し、１８０°回転した付近で平均台を掴み、さらに１８０°回転して、足を着くというように、技尤度や特徴量尤度を比較すると前転と類似する動きになる場合がある。このような場合でも、上記の骨格差分には、技失敗時と前転時との間で差が現れ易いので、技失敗時と前転時との識別に用いることができる。 By providing such skeleton difference information, as described below, it is possible to cope with situations in which the skill likelihood and the feature quantity likelihood alone are insufficient. For example, take a technique called ``forward roll'', which rotates forward from a sitting position, stands upside down, and then rotates further forward to a sitting position. The feature values of this technique are: "starts with feet on", "rotates 180 degrees", "stands upside down with hands on", "rotates 180 degrees", and "ends with feet on". is a feature quantity. On the other hand, there may be a situation in which the player suddenly grabs the balance beam to avoid falling headfirst from the balance beam due to a failure in some trick, and then makes a full turn and falls. In this case, depending on the situation, the technique likelihood and the feature quantity likelihood may be calculated by falling from the state where the feet are on the ground, grabbing the balance beam in the vicinity of 180° rotation, rotating 180° again, and landing on the feet. When compared, it may become a movement similar to a forward roll. Even in such a case, since a difference is likely to appear in the skeletal difference between when the technique fails and when the forward roll is performed, it can be used to distinguish between when the technique fails and when the forward roll is performed.

＜分散および統合＞
また、図示した各装置の各構成要素は、必ずしも物理的に図示の如く構成されておらずともよい。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。例えば、取得部１５Ａ、骨格認識部１５Ｂ、技認識部１５Ｃ、特徴量算出部１５Ｄ、特徴量差分算出部１５Ｅまたは採点支援部１５Ｆをサーバ装置１０の外部装置としてネットワーク経由で接続するようにしてもよい。また、取得部１５Ａ、骨格認識部１５Ｂ、技認識部１５Ｃ、骨格差分算出部２５Ａ、特徴量算出部１５Ｄ、特徴量差分算出部１５Ｅまたは採点支援部１５Ｆをサーバ装置２０の外部装置としてネットワーク経由で接続するようにしてもよい。また、取得部１５Ａ、骨格認識部１５Ｂ、技認識部１５Ｃ、特徴量算出部１５Ｄ、特徴量差分算出部１５Ｅまたは採点支援部１５Ｆを別の装置がそれぞれ有し、ネットワーク接続されて協働することで、上記のサーバ装置１０の機能を実現するようにしてもよい。また、取得部１５Ａ、骨格認識部１５Ｂ、技認識部１５Ｃ、骨格差分算出部２５Ａ、特徴量算出部１５Ｄ、特徴量差分算出部１５Ｅ又は採点支援部１５Ｆを別の装置がそれぞれ有し、ネットワーク接続されて協働することで、サーバ装置２０の機能を実現してもよい。 <Decentralization and integration>
Also, each component of each illustrated device may not necessarily be physically configured as illustrated. In other words, the specific form of distribution and integration of each device is not limited to the one shown in the figure, and all or part of them can be functionally or physically distributed and integrated in arbitrary units according to various loads and usage conditions. Can be integrated and configured. For example, the acquisition unit 15A, the skeleton recognition unit 15B, the technique recognition unit 15C, the feature amount calculation unit 15D, the feature amount difference calculation unit 15E, or the scoring support unit 15F may be connected as external devices of the server device 10 via a network. good. In addition, the acquisition unit 15A, the skeleton recognition unit 15B, the technique recognition unit 15C, the skeleton difference calculation unit 25A, the feature amount calculation unit 15D, the feature amount difference calculation unit 15E, or the scoring support unit 15F are external devices of the server device 20 via the network. You may make it connect. In addition, the acquisition unit 15A, the skeleton recognition unit 15B, the technique recognition unit 15C, the feature amount calculation unit 15D, the feature amount difference calculation unit 15E, or the scoring support unit 15F may be provided by separate devices, connected to a network, and cooperated. , the functions of the server device 10 may be realized. Further, another device has the acquisition unit 15A, the skeleton recognition unit 15B, the technique recognition unit 15C, the skeleton difference calculation unit 25A, the feature amount calculation unit 15D, the feature amount difference calculation unit 15E, or the scoring support unit 15F. The functions of the server device 20 may be realized by being and cooperating.

＜ハードウェア構成＞
また、上記の実施例で説明した各種の処理は、予め用意されたプログラムをパーソナルコンピュータやワークステーションなどのコンピュータで実行することによって実現することができる。そこで、以下では、図１２を用いて、実施例１及び実施例２と同様の機能を有する情報提供プログラムを実行するコンピュータの一例について説明する。 <Hardware configuration>
Moreover, various processes described in the above embodiments can be realized by executing a prepared program on a computer such as a personal computer or a work station. Therefore, an example of a computer that executes an information providing program having functions similar to those of the first and second embodiments will be described below with reference to FIG.

図１２は、ハードウェア構成例を示す図である。図１２に示すように、コンピュータ１００は、操作部１１０ａと、スピーカ１１０ｂと、カメラ１１０ｃと、ディスプレイ１２０と、通信部１３０とを有する。さらに、このコンピュータ１００は、ＣＰＵ１５０と、ＲＯＭ１６０と、ＨＤＤ１７０と、ＲＡＭ１８０とを有する。これら１１０～１８０の各部はバス１４０を介して接続される。 FIG. 12 is a diagram illustrating a hardware configuration example. As shown in FIG. 12, the computer 100 has an operation section 110a, a speaker 110b, a camera 110c, a display 120, and a communication section . Furthermore, this computer 100 has a CPU 150 , a ROM 160 , an HDD 170 and a RAM 180 . Each part of these 110 to 180 is connected via a bus 140 .

ＨＤＤ１７０には、図１２に示すように、上記の実施例１で示された取得部１５Ａ、骨格認識部１５Ｂ、技認識部１５Ｃ、特徴量算出部１５Ｄ、特徴量差分算出部１５Ｅ及び採点支援部１５Ｆと同様の機能を発揮する情報提供プログラム１７０ａが記憶され得る。また、ＨＤＤ１７０には、上記の実施例２に示す取得部１５Ａ、骨格認識部１５Ｂ、技認識部１５Ｃ、骨格差分算出部２５Ａ、特徴量算出部１５Ｄ、特徴量差分算出部１５Ｅ及び採点支援部１５Ｆと同様の機能を発揮する情報提供プログラム１７０ａが記憶され得る。この情報提供プログラム１７０ａは、図６または図１０に示す各構成要素と同様、統合又は分離してもよい。すなわち、ＨＤＤ１７０には、必ずしも上記の実施例１～２で示した全てのデータが格納されずともよく、処理に用いるデータがＨＤＤ１７０に格納されればよい。 As shown in FIG. 12, the HDD 170 includes the acquisition unit 15A, the skeleton recognition unit 15B, the technique recognition unit 15C, the feature amount calculation unit 15D, the feature amount difference calculation unit 15E, and the scoring support unit shown in the first embodiment. An information providing program 170a having the same function as 15F can be stored. The HDD 170 also includes the acquisition unit 15A, the skeleton recognition unit 15B, the technique recognition unit 15C, the skeleton difference calculation unit 25A, the feature amount calculation unit 15D, the feature amount difference calculation unit 15E, and the scoring support unit 15F described in the second embodiment. An information providing program 170a that exhibits the same function as that can be stored. This information providing program 170a may be integrated or separated like each component shown in FIG. 6 or FIG. In other words, the HDD 170 does not necessarily store all the data shown in the first and second embodiments, and the HDD 170 only needs to store data used for processing.

このような環境の下、ＣＰＵ１５０は、ＨＤＤ１７０から情報提供プログラム１７０ａを読み出した上でＲＡＭ１８０へ展開する。この結果、情報提供プログラム１７０ａは、図１２に示すように、情報提供プロセス１８０ａとして機能する。この情報提供プロセス１８０ａは、ＲＡＭ１８０が有する記憶領域のうち情報提供プロセス１８０ａに割り当てられた領域にＨＤＤ１７０から読み出した各種データを展開し、展開された各種データを用いて各種の処理を実行する。例えば、情報提供プロセス１８０ａが実行する処理の一例として、図９や図１１に示す処理などが含まれ得る。なお、ＣＰＵ１５０では、必ずしも上記の実施例１で示した全ての処理部が動作せずともよく、実行対象とする処理に対応する処理部が仮想的に実現されればよい。 Under such an environment, the CPU 150 reads out the information providing program 170 a from the HDD 170 and develops it in the RAM 180 . As a result, the information providing program 170a functions as an information providing process 180a as shown in FIG. The information providing process 180a deploys various data read from the HDD 170 in an area assigned to the information providing process 180a among the storage areas of the RAM 180, and executes various processes using the deployed various data. For example, examples of processing executed by the information providing process 180a may include the processing shown in FIGS. 9 and 11, and the like. Note that the CPU 150 does not necessarily have to operate all the processing units described in the first embodiment, as long as the processing units corresponding to the processes to be executed are virtually realized.

なお、上記の情報提供プログラム１７０ａは、必ずしも最初からＨＤＤ１７０やＲＯＭ１６０に記憶されておらずともかまわない。例えば、コンピュータ１００に挿入されるフレキシブルディスク、いわゆるＦＤ、ＣＤ－ＲＯＭ、ＤＶＤディスク、光磁気ディスク、ＩＣカードなどの「可搬用の物理媒体」に情報提供プログラム１７０ａを記憶させる。そして、コンピュータ１００がこれらの可搬用の物理媒体から情報提供プログラム１７０ａを取得して実行するようにしてもよい。また、公衆回線、インターネット、ＬＡＮ、ＷＡＮなどを介してコンピュータ１００に接続される他のコンピュータまたはサーバ装置などに情報提供プログラム１７０ａを記憶させておく。このように記憶された情報提供プログラム１７０ａをコンピュータ１００にダウンロードさせた上で実行させるようにしてもよい。 The information providing program 170a does not necessarily have to be stored in the HDD 170 or the ROM 160 from the beginning. For example, the information providing program 170a is stored in a “portable physical medium” such as a flexible disk inserted into the computer 100, a so-called FD, CD-ROM, DVD disk, magneto-optical disk, IC card, or the like. Then, the computer 100 may acquire and execute the information providing program 170a from these portable physical media. Also, the information providing program 170a is stored in another computer or server device connected to the computer 100 via a public line, Internet, LAN, WAN, or the like. The information providing program 170a stored in this manner may be downloaded to the computer 100 and executed.

以上の実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following notes are further disclosed with respect to the embodiments including the above examples.

（付記１）技認識用の機械学習モデルに入力された骨格情報から前記技認識用の機械学習モデルが出力する技に対応する特徴量を算出し、
算出された特徴量と、前記技認識用の機械学習モデルの訓練に用いられた骨格情報の訓練データのうち前記技と同一の技のラベルが付与された訓練データから求まる特徴量との差分を算出し、
算出された特徴量の差分を出力する、
処理をコンピュータが実行することを特徴とする情報提供方法。 (Appendix 1) calculating a feature value corresponding to a technique output by the machine learning model for technique recognition from the skeleton information input to the machine learning model for technique recognition;
The difference between the calculated feature amount and the feature amount obtained from the training data labeled with the same technique as the technique among the training data of the skeleton information used for training the machine learning model for the technique recognition is calculated. calculate,
Output the difference of the calculated feature amount,
A method of providing information characterized in that processing is executed by a computer.

（付記２）前記技認識用の機械学習モデルに入力された骨格情報と、前記技認識用の機械学習モデルが出力する技と同一の技のラベルが付与された複数の訓練データとの間で骨格差分を算出する処理を前記コンピュータがさらに実行し、
前記特徴量の差分を算出する処理は、前記複数の訓練データのうち前記骨格差分が最小である一の訓練データから求まる特徴量と、前記特徴量を算出する処理で算出された特徴量との差分を算出する処理を含む、
ことを特徴とする付記１に記載の情報提供方法。 (Appendix 2) Between the skeleton information input to the machine learning model for technique recognition and a plurality of training data with the same technique label as the technique output by the machine learning model for technique recognition The computer further executes a process of calculating a skeleton difference,
The process of calculating the feature amount difference is performed by comparing the feature amount obtained from one of the plurality of training data having the smallest skeleton difference and the feature amount calculated in the feature amount calculation process. Including processing to calculate the difference,
The information providing method according to Supplementary Note 1, characterized by:

（付記３）前記骨格差分を算出する処理は、前記技認識用の機械学習モデルに入力された骨格情報と、前記複数の訓練データから算出される骨格情報の統計値との間で前記骨格差分を算出する処理を含む、
ことを特徴とする付記２に記載の情報提供方法。 (Appendix 3) The process of calculating the skeleton difference is performed by comparing the skeleton information input to the machine learning model for technique recognition and the statistical value of the skeleton information calculated from the plurality of training data. Including the process of calculating
The information providing method according to appendix 2, characterized by:

（付記４）前記骨格差分を算出する処理は、前記複数の訓練データから抽出される訓練データのペアごとに骨格差分を算出し、各ペアのうち前記骨格差分が最大であるペアの訓練データと、前記技認識用の機械学習モデルに入力された骨格情報との間で前記骨格差分を算出する処理を含む、
ことを特徴とする付記２に記載の情報提供方法。 (Additional remark 4) In the process of calculating the skeletal difference, the skeletal difference is calculated for each pair of training data extracted from the plurality of training data, and the pair of training data having the largest skeletal difference among the pairs is calculated. , a process of calculating the skeleton difference between the skeleton information input to the machine learning model for technique recognition,
The information providing method according to appendix 2, characterized by:

（付記５）前記出力する処理は、前記特徴量の差分が正規化された特徴量の尤度を出力する処理を含む、
ことを特徴とする付記１に記載の情報提供方法。 (Appendix 5) The outputting process includes a process of outputting the likelihood of the feature amount in which the difference of the feature amount is normalized.
The information providing method according to Supplementary Note 1, characterized by:

（付記６）前記特徴量を算出する処理は、前記技認識用の機械学習モデルが出力する技に対応する特徴量の種類ごとに前記特徴量を算出する処理を含み、
前記特徴量の差分を算出する処理は、前記特徴量の種類ごとに前記特徴量の差分を算出する処理を含み、
前記出力する処理は、前記特徴量の種類ごとに前記特徴量の差分が正規化された複数の特徴量の尤度を代表する統計値を出力する処理を含む、
ことを特徴とする付記１に記載の情報提供方法。 (Appendix 6) The process of calculating the feature amount includes a process of calculating the feature amount for each type of feature amount corresponding to the technique output by the machine learning model for technique recognition,
The process of calculating the difference in the feature amount includes a process of calculating the difference in the feature amount for each type of the feature amount,
The outputting process includes outputting a statistical value representing the likelihood of a plurality of feature amounts obtained by normalizing the feature amount difference for each type of the feature amount,
The information providing method according to Supplementary Note 1, characterized by:

（付記７）技認識用の機械学習モデルに入力された骨格情報から前記技認識用の機械学習モデルが出力する技に対応する特徴量を算出し、
算出された特徴量と、前記技認識用の機械学習モデルの訓練に用いられた骨格情報の訓練データのうち前記技と同一の技のラベルが付与された訓練データから求まる特徴量との差分を算出し、
算出された特徴量の差分を出力する、
処理をコンピュータに実行させることを特徴とする情報提供プログラム。 (Appendix 7) calculating a feature value corresponding to the technique output by the machine learning model for technique recognition from the skeleton information input to the machine learning model for technique recognition;
The difference between the calculated feature amount and the feature amount obtained from the training data labeled with the same technique as the technique among the training data of the skeleton information used for training the machine learning model for the technique recognition is calculated. calculate,
Output the difference of the calculated feature amount,
An information providing program characterized by causing a computer to execute processing.

（付記８）前記技認識用の機械学習モデルに入力された骨格情報と、前記技認識用の機械学習モデルが出力する技と同一の技のラベルが付与された複数の訓練データとの間で骨格差分を算出する処理を前記コンピュータにさらに実行させ、
前記特徴量の差分を算出する処理は、前記複数の訓練データのうち前記骨格差分が最小である一の訓練データから求まる特徴量と、前記特徴量を算出する処理で算出された特徴量との差分を算出する処理を含む、
ことを特徴とする付記７に記載の情報提供プログラム。 (Appendix 8) Between the skeleton information input to the machine learning model for technique recognition and a plurality of training data labeled with the same technique as the technique output by the machine learning model for technique recognition causing the computer to further execute a process of calculating a skeleton difference;
The process of calculating the feature amount difference is performed by comparing the feature amount obtained from one of the plurality of training data having the smallest skeleton difference and the feature amount calculated in the feature amount calculation process. Including processing to calculate the difference,
The information providing program according to Supplementary Note 7, characterized by:

（付記９）前記骨格差分を算出する処理は、前記技認識用の機械学習モデルに入力された骨格情報と、前記複数の訓練データから算出される骨格情報の統計値との間で前記骨格差分を算出する処理を含む、
ことを特徴とする付記８に記載の情報提供プログラム。 (Appendix 9) The process of calculating the skeleton difference is performed by comparing the skeleton information input to the machine learning model for technique recognition and the statistical value of the skeleton information calculated from the plurality of training data. Including the process of calculating
The information providing program according to appendix 8, characterized by:

（付記１０）前記骨格差分を算出する処理は、前記複数の訓練データから抽出される訓練データのペアごとに骨格差分を算出し、各ペアのうち前記骨格差分が最大であるペアの訓練データと、前記技認識用の機械学習モデルに入力された骨格情報との間で前記骨格差分を算出する処理を含む、
ことを特徴とする付記８に記載の情報提供プログラム。 (Supplementary Note 10) The process of calculating the skeletal difference calculates the skeletal difference for each pair of training data extracted from the plurality of training data. , a process of calculating the skeleton difference between the skeleton information input to the machine learning model for technique recognition,
The information providing program according to appendix 8, characterized by:

（付記１１）前記出力する処理は、前記特徴量の差分が正規化された特徴量の尤度を出力する処理を含む、
ことを特徴とする付記７に記載の情報提供プログラム。 (Appendix 11) The process of outputting includes a process of outputting the likelihood of the feature amount obtained by normalizing the difference of the feature amount.
The information providing program according to Supplementary Note 7, characterized by:

（付記１２）前記特徴量を算出する処理は、前記技認識用の機械学習モデルが出力する技に対応する特徴量の種類ごとに前記特徴量を算出する処理を含み、
前記特徴量の差分を算出する処理は、前記特徴量の種類ごとに前記特徴量の差分を算出する処理を含み、
前記出力する処理は、前記特徴量の種類ごとに前記特徴量の差分が正規化された複数の特徴量の尤度を代表する統計値を出力する処理を含む、
ことを特徴とする付記７に記載の情報提供プログラム。 (Appendix 12) The process of calculating the feature amount includes a process of calculating the feature amount for each type of feature amount corresponding to the technique output by the machine learning model for technique recognition,
The process of calculating the difference in the feature amount includes a process of calculating the difference in the feature amount for each type of the feature amount,
The outputting process includes outputting a statistical value representing the likelihood of a plurality of feature amounts obtained by normalizing the feature amount difference for each type of the feature amount,
The information providing program according to Supplementary Note 7, characterized by:

（付記１３）深度画像を取得するセンサ装置と、
前記深度画像に対する骨格認識を実行する骨格認識部と、前記骨格認識で得られる骨格情報を技認識用の機械学習モデルに入力することにより技を認識する技認識部と、前記技認識用の機械学習モデルに入力された骨格情報から前記技認識用の機械学習モデルが出力する技に対応する特徴量を算出する特徴量算出部と、算出された特徴量と、前記技認識用の機械学習モデルの訓練に用いられた骨格情報の訓練データのうち前記技と同一の技のラベルが付与された訓練データから求まる特徴量との差分を算出する特徴量差分算出部と、算出された特徴量の差分をクライアント端末へ出力する情報出力部と、を有するサーバ装置と、
を有することを特徴とする採点支援システム。 (Appendix 13) A sensor device that acquires a depth image;
a skeleton recognition unit that performs skeleton recognition on the depth image; a technique recognition unit that recognizes a technique by inputting the skeleton information obtained by the skeleton recognition into a machine learning model for technique recognition; and the technique recognition machine. A feature amount calculation unit that calculates a feature amount corresponding to a technique output by the machine learning model for technique recognition from skeleton information input to the learning model, and the calculated feature amount and the machine learning model for technique recognition. A feature amount difference calculation unit that calculates the difference between the feature amount obtained from the training data labeled with the same technique as the above technique among the training data of the skeleton information used in the training of , and the calculated feature amount of the a server device having an information output unit that outputs the difference to a client terminal;
A scoring support system characterized by having:

（付記１４）前記技認識用の機械学習モデルに入力された骨格情報と、前記技認識用の機械学習モデルが出力する技と同一の技のラベルが付与された複数の訓練データとの間で骨格差分を算出する骨格差分算出部を前記サーバ装置がさらに有し、
前記特徴量差分算出部は、前記複数の訓練データのうち前記骨格差分が最小である一の訓練データから求まる特徴量と、前記特徴量を算出する処理で算出された特徴量との差分を算出する、ことを特徴とする付記１３に記載の採点支援システム。 (Appendix 14) Between the skeleton information input to the machine learning model for technique recognition and a plurality of training data having the same technique label as the technique output by the machine learning model for technique recognition The server device further has a skeleton difference calculation unit that calculates a skeleton difference,
The feature amount difference calculation unit calculates a difference between a feature amount obtained from one of the plurality of training data having the smallest skeleton difference and the feature amount calculated in the process of calculating the feature amount. The scoring support system according to Supplementary Note 13, characterized by:

（付記１５）前記骨格差分算出部は、前記技認識用の機械学習モデルに入力された骨格情報と、前記複数の訓練データから算出される骨格情報の統計値との間で前記骨格差分を算出する、ことを特徴とする付記１４に記載の採点支援システム。 (Appendix 15) The skeleton difference calculation unit calculates the skeleton difference between skeleton information input to the machine learning model for technique recognition and statistical values of skeleton information calculated from the plurality of training data. 15. The scoring support system according to Supplementary Note 14, characterized by:

（付記１６）前記骨格差分算出部は、前記複数の訓練データから抽出される訓練データのペアごとに骨格差分を算出し、各ペアのうち前記骨格差分が最大であるペアの訓練データと、前記技認識用の機械学習モデルに入力された骨格情報との間で前記骨格差分を算出する、ことを特徴とする付記１４に記載の採点支援システム。 (Supplementary Note 16) The skeleton difference calculation unit calculates a skeleton difference for each pair of training data extracted from the plurality of training data, and the pair of training data having the largest skeleton difference among the pairs, and the 15. The scoring support system according to Supplementary note 14, wherein the skeleton difference is calculated between skeleton information input to a machine learning model for technique recognition.

（付記１７）前記情報出力部は、前記特徴量の差分が正規化された特徴量の尤度を出力する、ことを特徴とする付記１３に記載の採点支援システム。 (Supplementary note 17) The scoring support system according to Supplementary note 13, wherein the information output unit outputs the likelihood of the feature amount obtained by normalizing the difference of the feature amount.

（付記１８）前記特徴量算出部は、前記技認識用の機械学習モデルが出力する技に対応する特徴量の種類ごとに前記特徴量を算出し、
前記特徴量差分算出部は、前記特徴量の種類ごとに前記特徴量の差分を算出し、
前記情報出力部は、前記特徴量の種類ごとに前記特徴量の差分が正規化された複数の特徴量の尤度を代表する統計値を出力する、ことを特徴とする付記１３に記載の採点支援システム。 (Appendix 18) The feature amount calculation unit calculates the feature amount for each type of feature amount corresponding to the technique output by the machine learning model for technique recognition,
The feature amount difference calculation unit calculates the difference between the feature amounts for each type of the feature amount,
The scoring according to Supplementary Note 13, wherein the information output unit outputs a statistical value representing the likelihood of a plurality of feature amounts obtained by normalizing the difference between the feature amounts for each type of the feature amount. support system.

１採点支援システム
３演技者
５３Ｄレーザセンサ
１０サーバ装置
１１通信インタフェイス部
１３記憶部
１５制御部
１５Ａ取得部
１５Ｂ骨格認識部
１５Ｃ技認識部
１５Ｄ特徴量算出部
１５Ｅ特徴量差分算出部
１５Ｆ採点支援部 1 Scoring support system 3 Performer 5 3D laser sensor 10 Server device 11 Communication interface unit 13 Storage unit 15 Control unit 15A Acquisition unit 15B Skeleton recognition unit 15C Technique recognition unit 15D Feature amount calculation unit 15E Feature amount difference calculation unit 15F Scoring support Department

Claims

calculating a feature amount corresponding to a technique output by the machine learning model for technique recognition from skeleton information input to the machine learning model for technique recognition;
The difference between the calculated feature amount and the feature amount obtained from the training data labeled with the same technique as the technique among the training data of the skeleton information used for training the machine learning model for the technique recognition is calculated. calculate,
Output the difference of the calculated feature amount,
A method of providing information characterized in that processing is executed by a computer.

Calculate a skeleton difference between the skeleton information input to the machine learning model for technique recognition and a plurality of training data labeled with the same technique as the technique output by the machine learning model for technique recognition. The computer further executes a process to
The process of calculating the feature amount difference is performed by comparing the feature amount obtained from one of the plurality of training data having the smallest skeleton difference and the feature amount calculated in the feature amount calculation process. Including processing to calculate the difference,
2. The information providing method according to claim 1, characterized by:

The process of calculating the skeleton difference is a process of calculating the skeleton difference between the skeleton information input to the machine learning model for technique recognition and the statistical value of the skeleton information calculated from the plurality of training data. including,
3. The information providing method according to claim 2, characterized by:

The process of calculating the skeleton difference calculates the skeleton difference for each pair of training data extracted from the plurality of training data, and calculates the training data of the pair having the largest skeleton difference among the pairs, and the technique recognition data. including a process of calculating the skeleton difference between the skeleton information input to the machine learning model for
3. The information providing method according to claim 2, characterized by:

The process of outputting includes a process of outputting the likelihood of the feature amount obtained by normalizing the difference of the feature amount,
The information providing method according to any one of claims 1 to 4, characterized in that:

The process of calculating the feature amount includes a process of calculating the feature amount for each type of feature amount corresponding to the technique output by the machine learning model for technique recognition,
The process of calculating the difference in the feature amount includes a process of calculating the difference in the feature amount for each type of the feature amount,
The outputting process includes outputting a statistical value representing the likelihood of a plurality of feature amounts obtained by normalizing the feature amount difference for each type of the feature amount,
The information providing method according to any one of claims 1 to 4, characterized in that:

calculating a feature amount corresponding to a technique output by the machine learning model for technique recognition from skeleton information input to the machine learning model for technique recognition;
The difference between the calculated feature amount and the feature amount obtained from the training data labeled with the same technique as the technique among the training data of the skeleton information used for training the machine learning model for the technique recognition is calculated. calculate,
Output the difference of the calculated feature amount,
An information providing program characterized by causing a computer to execute processing.

a sensor device that acquires a depth image;
a skeleton recognition unit that performs skeleton recognition on the depth image; a technique recognition unit that recognizes a technique by inputting the skeleton information obtained by the skeleton recognition into a machine learning model for technique recognition; and the technique recognition machine. A feature amount calculation unit that calculates a feature amount corresponding to a technique output by the machine learning model for technique recognition from skeleton information input to the learning model, and the calculated feature amount and the machine learning model for technique recognition. A feature amount difference calculation unit that calculates the difference between the feature amount obtained from the training data labeled with the same technique as the above technique among the training data of the skeleton information used in the training of , and the calculated feature amount of the a server device having an information output unit that outputs the difference to a client terminal;
A scoring support system characterized by having: