JP2019197374A

JP2019197374A - Number-of-purchases determination assisting system for foods

Info

Publication number: JP2019197374A
Application number: JP2018090814A
Authority: JP
Inventors: 洋一平嶋; Yoichi Hirashima
Original assignee: Josho Gakuen Educational Foundation
Current assignee: Josho Gakuen Educational Foundation
Priority date: 2018-05-09
Filing date: 2018-05-09
Publication date: 2019-11-14
Anticipated expiration: 2038-05-09
Also published as: JP6886186B2

Abstract

PROBLEM TO BE SOLVED: To provide a food purchase number determination support system capable of combining the number of purchases of each item so as to maximize gross profit while minimizing discount, disposal, and opportunity losses. SOLUTION: The food purchase number determination support system includes a learning unit that creates a learning model of the optimal combination of food purchase numbers, and a calculation unit that calculates, based on the learning model, an evaluation value for each purchase number candidate of each item and an evaluation value for each allocation ratio candidate when the number of purchases of one item is allocated to a plurality of sales items. The learning model comprises at least, for each item, purchase number candidates, a state description parameter, a discount rate, and a first evaluation function, and, for the case where the number of purchases of one item is allocated to a plurality of sales items, allocation ratio candidates, a state description parameter, and a second evaluation function. SELECTED DRAWING: FIG. 1

Description

The present disclosure relates generally to a food purchase number determination support system, and more particularly to a food purchase number determination support system for maximizing gross profit while taking opportunity loss, disposal loss, and discount loss into account.

In food purchasing operations, the number of units to purchase is determined for each item from the stock quantity, best-before date, sales record, and the like. Items close to their best-before date are sold at a discount, and items that remain unsold past their best-before date are discarded. Discounting entails a discount loss, disposal entails a disposal loss, and running out of stock entails an opportunity loss.

At present, decision-making in food purchasing operations is done by humans and is treated as highly specialized work. The number of items (products) is, for example, about 20,000, and it is difficult for a single person with specialized knowledge to keep track of all of them. For this reason, qualified personnel are always in short supply.

In addition, since the number of items (products) is in the tens of thousands and each item has from several to several tens of options for the number of purchases, a combinatorial explosion occurs, and existing mathematical programming methods do not work effectively for determining food purchase numbers. Furthermore, because the purchase numbers of many items are correlated with one another, it is also difficult to apply existing AI methods, for example, to this task.

JP 2003-182854 A

Non-Patent Document 1: R. Sutton and A. Barto: Reinforcement Learning, MIT Press (1999).
Non-Patent Document 2: Y. Hirashima: “A reinforcement learning method for train marshaling based on movements of locomotive”, IAENG Int. J. Comput. Sci., Vol. 38, No. 3, 242/248 (2011).

An object of the present disclosure is to present a food purchase number determination support system, which is a decision support system that can combine the number of purchases of each item so as to maximize gross profit while minimizing discount, disposal, and opportunity losses, and that can further evaluate each combination and indicate the relative merits of the options.

The present disclosure is intended to achieve the above object. The food purchase number determination support system according to the present disclosure is a food purchase number determination support system based on a method of learning and deriving the optimal combination of food purchase numbers, and comprises:
a learning unit that creates a learning model of the optimal combination of food purchase numbers; and
a calculation unit that calculates, based on the learning model, an evaluation value for each purchase number candidate of each item and an evaluation value for each allocation ratio candidate when the number of purchases of one item is allocated to a plurality of sales items.
The learning model comprises at least, for each item, purchase number candidates, a state description parameter, a discount rate, and a first evaluation function, and, for the case where the number of purchases of one item is allocated to a plurality of sales items, allocation ratio candidates, a state description parameter, and a second evaluation function.
The learning unit
determines, for each item, a discount rate based on the loss amount and, for each purchase number candidate, determines the first evaluation function based on the state description parameter and the determined gross profit, and
further, when the number of purchases of one item is allocated to a plurality of sales items, determines, for each allocation ratio candidate, the second evaluation function based on the state description parameter and the determined gross profit.
The calculation unit
derives the evaluation value for each purchase number candidate of each item from the first evaluation function for that purchase number candidate in the learning model, and
derives the evaluation value for each allocation ratio candidate, when the number of purchases of one item is allocated to a plurality of sales items, from the second evaluation function for that allocation ratio candidate in the learning model.

Having the food purchase number determination support system according to the present disclosure carry out the highly specialized work currently performed by humans, and reviewing purchasing across all handled items, makes it possible to reduce the various losses. Furthermore, the food purchase number determination support system according to the present disclosure can provide an educational effect with respect to purchasing operations.

FIG. 1 is a diagram schematically illustrating the decision-making flow in the food purchase number determination support system according to the present disclosure.
FIG. 2 is a block diagram of the food purchase number determination support system according to the first embodiment.
FIG. 3 is a flowchart illustrating the operation of the food purchase number determination support device in the food purchase number determination support system according to the first embodiment.
FIG. 4 is a flowchart showing the detailed operation of the preliminary learning process in the food purchase number determination support device according to the first embodiment.
FIG. 5 is an example of a screen, shown by the display unit of the food purchase number determination support device according to the first embodiment, that displays the purchase number candidates for item A and their evaluations.

Hereinafter, embodiments will be described in detail with reference to the drawings as appropriate. However, unnecessarily detailed description may be omitted. For example, detailed descriptions of already well-known matters and redundant descriptions of substantially identical configurations may be omitted. This is to keep the following description from becoming unnecessarily redundant and to facilitate understanding by those skilled in the art.

The inventors provide the accompanying drawings and the following description so that those skilled in the art can fully understand the present disclosure, and do not intend them to limit the subject matter described in the claims.

[Background to this disclosure]
Food purchasing operations involve deciding and ordering, every day, the number of units to purchase for tens of thousands of items, and are important operations that directly affect the profits of stores such as supermarkets and grocery stores.

Each item has a use-by date and is discarded once that date has passed; the cost incurred by disposal is called disposal loss. To reduce disposal loss, products whose use-by date is approaching may be promoted by discounting, in which case a discount loss relative to the suggested retail price occurs. In addition, stock is generally kept on hand to avoid losing sales opportunities through stockouts; the loss of sales caused by a stockout is called opportunity loss.

Furthermore, among foods, many fresh foods have shorter use-by dates than processed products, and the period for which an item can be kept in stock differs from item to item. There are also cases in which the purchase of one item is correlated with a plurality of sales items, as when fish that can be eaten raw is diverted to grilled dishes, side dishes, and boxed lunches. In other words, in food purchasing operations, when the number of purchases is determined individually for a large number of items, it is necessary to consider the stock quantity, sales trends, use-by date, purchase and sales status of correlated products, status of disposal, discount, and opportunity losses, gross profit, and so on.

With the goal of maximizing gross profit while suppressing losses, the number of combinations that must be examined to determine purchase numbers, given the number of items and the correlations among the factors to be considered, is enormous. As a result, food purchasing operations currently have to be handled by a small number of highly specialized employees. These employees cannot be replaced, and considerable on-site experience is needed to train and secure personnel with equivalent skills. Moreover, no education or training method has been established for developing such skills. For these reasons, personnel who can properly carry out food purchasing operations are chronically in short supply.

On the other hand, it is possible to model the combination of purchase numbers as a combinatorial optimization problem with the loss amount and the like as the evaluation target. However, the number of combinations of purchase numbers grows exponentially with the number of correlated items, so when that number exceeds 100, finding a solution by conventional mathematical programming becomes difficult. At purchasing sites, it is also well known that the sales of a plurality of related items rise and fall with weather, season, events, current affairs, and the like. In practice, conventional methods have not been able to solve problems of practical scale.
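The exponential growth mentioned above can be made concrete with a short calculation. This is only an illustrative sketch: it assumes every item has the same number of purchase number options, and the function name `count_combinations` is hypothetical, not part of the disclosure.

```python
def count_combinations(options_per_item: int, num_items: int) -> int:
    """Number of joint purchase-number assignments when each of
    num_items items has options_per_item choices (m ** k)."""
    return options_per_item ** num_items


# Illustrative values only: 10 options per item, growing item counts.
for k in (10, 50, 100):
    n = count_combinations(10, k)
    # len(str(n)) - 1 is the decimal order of magnitude of n.
    print(f"{k} items, 10 options each: on the order of 10^{len(str(n)) - 1}")
```

Even with only 10 options per item, 100 correlated items give 10^100 joint assignments, which is why enumerating or storing an evaluation for every combination is impractical.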

To solve the above problems, the inventor of the present application has devised the food purchase number determination support system of the present disclosure, which is based on a method that uses reinforcement learning to learn and derive the optimal combination of purchase numbers with loss as the evaluation criterion. In the food purchase number determination support system according to the present disclosure, purchase numbers are determined sequentially, and the already-determined purchase numbers are successively added as state description parameters, thereby handling the correlation between items. Then, for a specific combination of individual purchase numbers, an evaluation is calculated from the determined gross profit and loss amount, and the evaluation value is propagated to each purchase number. Propagation is performed based on the optimality equation, which guarantees that the optimal number of purchases for obtaining the optimal gross profit and loss amount is available at the decision-making stage for each item.

[1. Basic features of the present disclosure]
The basic features of the food purchase number determination support system according to the present disclosure are described below.

[1.1. Modeling]
[1.1.1 Purchase number determination work]
Let k be the number of items and m_i the number of options for the number of purchases of the i-th item; the number of purchases o_i is determined in the same order as in the on-site purchasing decision work. Depending on how the site operates, external inputs (for example, from an external management server) are used at this point: the stock quantity of the item under decision is denoted s_i (i = 1, ..., k) and the sales forecast b_i, and initial values are given for them. In addition, let l_i be the use-by date, M_i the remaining budget, and p_i = [p_i1, ..., p_il] the numbers of purchases of other, correlated products. The state description parameter x_i is expressed by the following equation:

$$x_i = [\, s_i,\ b_i,\ l_i,\ M_i,\ p_i \,] \tag{1}$$

Here, p_ij (1 ≤ j ≤ l) is set to 0 when it has not yet been determined. After the number of purchases has been determined for all items, the number of units sold on the day is estimated by a predetermined probability model, and the loss amount and gross profit are determined. Within the given budget, the combination of state transitions [x_1, x_2, ..., x_k] that maximizes gross profit is the optimal solution.
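As a rough illustration of how the state description parameter might be assembled while purchase numbers are decided one item at a time, consider the following sketch. The class `Item`, the function `state_vector`, and all numeric values are hypothetical; only the ordering of the components (s_i, b_i, l_i, M_i, p_i) and the rule that undetermined p_ij are 0 come from the description.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Item:
    stock: int                # s_i: stock quantity
    forecast: int             # b_i: sales forecast
    shelf_life_days: int      # l_i: use-by date, here as days remaining
    related: Tuple[int, ...]  # indices of correlated items (defines p_i)


def state_vector(i: int, items: List[Item],
                 decided: Dict[int, int], budget_left: float) -> list:
    """Build x_i = [s_i, b_i, l_i, M_i, p_i1, ..., p_il].
    Correlated items whose purchase number is not yet decided count as 0."""
    item = items[i]
    p_i = [decided.get(j, 0) for j in item.related]
    return [item.stock, item.forecast, item.shelf_life_days, budget_left, *p_i]


items = [Item(5, 12, 2, (1,)), Item(0, 8, 1, (0,))]
decided: Dict[int, int] = {}

x0 = state_vector(0, items, decided, 10000.0)  # item 1 undecided, so p = [0]
decided[0] = 10                                # purchase number chosen for item 0
x1 = state_vector(1, items, decided, 9000.0)   # item 1 now sees item 0's order
print(x0, x1)
```

Because each decided purchase number enters the state of later items, the correlation between items is carried through the sequence of decisions rather than through a single joint table.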

With the above model, at a practical scale where a solution has tens of thousands of components, methods that need to hold the evaluations of all solutions become difficult to implement.

[1.1.2 Learning model]
(1) Evaluation value of the number of purchases and update rule
For the model described in [1.1.1 Purchase number determination work], the number of purchases for the i-th item is selected from the candidates

$$u_{ij} \quad (1 \le j \le m_i).$$

For the state parameter x_i, the evaluation value of its combination with a purchase number candidate is written

$$Q1_i(x_i, u_{ij}),$$

and the selection is made based on the value of Q1_i.

When the number of purchases has been determined, the selection result is denoted o_i, and the corresponding evaluation value Q1 is updated by the following formula:

$$Q1_i(x_i, o_i) \leftarrow (1-\alpha)\,Q1_i(x_i, o_i) + \alpha\left[R + \gamma \max_{u} Q1_{i+1}(x_{i+1}, u)\right] \tag{2}$$

Here, α is the learning rate (coefficient), γ is the discount rate, and the maximum is taken over the purchase number candidates u of the next item.
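The update with learning rate α and discount rate γ can be sketched as a tabular update. This is a minimal sketch that assumes the standard Q-learning form (blend the old value with the reward plus the discounted best evaluation of the next state); the exact formula of the disclosure is not reproduced in this text, and the function name `update_q`, the string state labels, and the numeric values are all hypothetical.

```python
from collections import defaultdict


def update_q(q, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    """One tabular update:
    Q(x, o) <- (1 - alpha) * Q(x, o) + alpha * (R + gamma * max_u Q(x', u)).
    With no next candidates (last item), the max term is taken as 0."""
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    key = (state, action)
    q[key] = (1 - alpha) * q[key] + alpha * (reward + gamma * best_next)
    return q[key]


q = defaultdict(float)  # all evaluation values start at 0

# Last item: gross profit is now determined, so a reward is received.
v_last = update_q(q, "x_k", 20, reward=1.0, next_state=None,
                  next_actions=[], alpha=0.5, gamma=0.9)

# Previous item: no reward yet, but the best next evaluation propagates back.
v_prev = update_q(q, "x_k-1", 10, reward=0.0, next_state="x_k",
                  next_actions=[10, 20], alpha=0.5, gamma=0.9)
print(v_last, v_prev)
```

Repeating such updates lets the reward received at the end of the day spread backward, with discounting, to the purchase number decisions made earlier in the sequence.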

R is the reward; after the day's sales, it is determined by the following formula, where G_o is the target gross profit and G is the gross profit:

$$R = \begin{cases} G / G_o & \text{(after the gross profit is determined)} \\ 0 & \text{(otherwise)} \end{cases} \tag{3}$$

That is, R is the target achievement rate, and its value is 0 until the gross profit is determined.
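The reward can be illustrated with a small function. The sketch assumes that "target achievement rate" means the ratio of gross profit to target gross profit; the function name `reward` and the use of `None` to mean "gross profit not yet determined" are illustrative choices, not part of the disclosure.

```python
from typing import Optional


def reward(gross_profit: Optional[float], target_gross_profit: float) -> float:
    """Target achievement rate G / G_o once gross profit is known, else 0."""
    if gross_profit is None:        # gross profit not yet determined
        return 0.0
    if target_gross_profit <= 0:
        raise ValueError("target gross profit must be positive")
    return gross_profit / target_gross_profit


print(reward(None, 100000.0))     # before sales are settled
print(reward(80000.0, 100000.0))  # after the day's sales
```

A reward that is zero until the end of the day means every intermediate purchase decision is judged only through the value propagated back from the final outcome.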

(2) When one item has a plurality of sales forms
When the purchase of one item is correlated with a plurality of sales items, as when fish that can be eaten raw is diverted to grilled dishes, side dishes, and boxed lunches, the allocation ratio among the corresponding sales items affects the loss amount. Therefore, local evaluation is performed with the allocation ratio as the option. That is, the allocation ratio is selected from a set of allocation ratio candidates. For the state parameter x_i, the evaluation value of its combination with an allocation ratio is denoted Q2_i, and the allocation ratio to the related items is selected based on the value of Q2_i.

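Then, Q2_i corresponding to the result w_i selected from the allocation ratio candidates is updated by the following formula:

$$Q2_i(x_i, w_i) \leftarrow (1-\alpha)\,Q2_i(x_i, w_i) + \alpha\left[R + \gamma \max_{w} Q2_{i+1}(x_{i+1}, w)\right] \tag{4}$$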

In order to select the allocation ratio w_i appropriately, some of the components of p_i are changed, as necessary, from the purchase numbers of other items to allocation ratios.

[1.1.3 Calculation of discount rate]
The number of purchases of each item affects the gross profit and loss amount after sales.
To reflect the gross profit and loss amount in the evaluation of each purchase number candidate, the discount rate γ_i (i = 1, ..., k) for o_i and w_i of the i-th item is calculated by the following equation:

[Equation (5)]

Here, D_i is the loss amount incurred for the i-th item, T_p is the total sales amount for the day obtained from the k items, and δ is a parameter that adjusts the range of the discount rate. Equation (5) sets a small discount rate for purchase number options with a small loss amount. The discount rate also becomes relatively small for combinations of purchase numbers that yield a large gross profit.

After a sufficient number of updates, equations (2) and (4) converge to

[limit expression]

Since R is given only after the gross profit is determined, the evaluation value increases monotonically as the work of determining purchase numbers progresses. Because equations (2) to (5) propagate using the evaluation value of the best option, the evaluation value in each state reflects the best combination of purchase numbers leading to the target state.

[1.1.4 Action selection]
The target arrangement, the movement target, the wagon destination, and the number of wagons moved are selected stochastically by the Soft-Max method (see Non-Patent Document 1). Under this update rule, the values of Q_i (i = 1, 2) become smaller as the number of times discounting is applied increases. In general, in a large-scale problem, the number of combinations of purchase numbers grows with the number of items, so the evaluation values, particularly around the initial arrangement, become excessively small. In this case, the search range in the early part of learning expands excessively, and a sufficient learning effect cannot be obtained.

As an improvement, the normalized value of Q_i (i = 1, 2),

$$\tilde{Q}_i(x_i, u),$$

is used, and the selection probability P_n(x_i, u) of an action u is determined by the following equation (see Non-Patent Document 2):

$$P_n(x_i, u) = \frac{\exp\left(\tilde{Q}_i(x_i, u)/\xi\right)}{\sum_{u'} \exp\left(\tilde{Q}_i(x_i, u')/\xi\right)}$$

Here, ξ is a temperature constant, which serves to adjust the search range by enlarging or reducing the differences in action selection probability among the values of \tilde{Q}_i (see Non-Patent Document 2).
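The role of normalization and of the temperature constant ξ can be sketched as follows. The min-max normalization used here is an assumption made for illustration (the normalization formula is not reproduced in this text), and the function names `normalize`, `selection_probabilities`, and `select` are hypothetical.

```python
import math
import random


def normalize(values):
    """Min-max scale to [0, 1] (assumed form); all-equal inputs map to 0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def selection_probabilities(q_values, xi):
    """Soft-Max over normalized evaluation values with temperature xi > 0."""
    scaled = [v / xi for v in normalize(q_values)]
    m = max(scaled)                        # subtract max for numerical stability
    weights = [math.exp(s - m) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


def select(candidates, q_values, xi, rng=random):
    """Draw one candidate at random according to the Soft-Max probabilities."""
    probs = selection_probabilities(q_values, xi)
    return rng.choices(candidates, weights=probs, k=1)[0]


# Tiny raw values, as can happen after many discounting steps.
q_raw = [0.001, 0.002, 0.004]
print(selection_probabilities(q_raw, xi=0.5))   # mild preference for the best
print(selection_probabilities(q_raw, xi=0.05))  # strong preference for the best
print(select([10, 20, 30], q_raw, xi=0.5))
```

Without normalization, very small raw evaluation values would give nearly uniform probabilities; scaling them first lets ξ alone control how widely the search explores.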

[1.2. Decision support]
After sufficient learning, each purchase number candidate of each item is associated with an evaluation value that reflects the gross profit and loss amount derived at least within the explored region. Therefore, by retrieving the evaluation values of all candidates and comparing their magnitudes, the purchase numbers can be ranked by merit.

Since the evaluation values correspond to the individual options of each item, they can be referenced one after another along the state transitions that accompany the determination of purchase numbers. The combination that selects the highest-ranked option for every item is the best solution, and the evaluation value of each candidate gives information on how far it deviates from the best solution.
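Ranking the candidates of one item and reporting how far each lies from the best one can be sketched as below. The function name `rank_candidates` and the sample evaluation values are hypothetical, and measuring deviation as a simple difference of evaluation values is an assumption for illustration.

```python
def rank_candidates(q_by_candidate):
    """Return (candidate, evaluation, gap_to_best) sorted from best to worst."""
    best = max(q_by_candidate.values())
    ranked = sorted(q_by_candidate.items(), key=lambda kv: kv[1], reverse=True)
    return [(cand, q, best - q) for cand, q in ranked]


# Learned evaluation values for the purchase number candidates of one item.
evaluations = {10: 0.6, 20: 0.9, 30: 0.3}
for cand, q, gap in rank_candidates(evaluations):
    print(f"purchase number {cand}: evaluation {q:.2f}, gap to best {gap:.2f}")
```

Presenting the gap alongside the rank lets a buyer see not only which purchase number is preferred but also how costly a deviation from it is expected to be.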

[1.3. System operation]
Preliminary learning is performed to enable decision support, and an evaluation function based on Q_i is acquired. Q_i starts from an initial value of 0, and the reward R received when the gross profit is determined propagates with discounting. As preliminary learning is repeated, the evaluation values gradually spread from the state in which all purchase numbers have been determined toward the initial state; in doing so, propagation is carried out from the state with the largest evaluation value, regardless of the purchase number determined. In other words, when the evaluation value reflects the loss rate relative to gross profit, the evaluation of the option with the smallest loss rate propagates, and the evaluation function based on Q_i takes as its value, from any state, the evaluation of the best combination of purchase numbers.

When the sales and customer-visit forecasts are built using a predetermined probability model, the evaluation of the number of purchases in each state learns the expected value of the loss rate under the configured probability distribution.

[1.4. Example of decision flow]
FIG. 1 is a diagram schematically illustrating the decision-making flow in the food purchase number determination support system according to the present disclosure. For example, it shows that item A has options for the number of purchases: "purchase number 1", "purchase number 2", ..., "purchase number n". It also shows, for example, that item A can be allocated to "for raw consumption", "for sushi", and "for boxed lunches". It further shows how the evaluation based on gross profit and loss rate propagates sequentially to each purchase number candidate.

[1.5. Originality and superiority]
The purchase number determination model described in [1.1.1 Purchase number determination work], the learning model of [1.1.2 Learning model], and the discount calculation method of [1.1.3 Calculation of discount rate] are new methods at the level of basic theory that are necessary for decision support, optimality, and consideration of gross profit and loss rate, respectively, and no alternative methods exist in the existing technology.

A large amount of training data has not been accumulated for the problem addressed here. For example, although the ordering records of specialist staff remain, no records have been kept of their judgment criteria or decision rules. In general, extracting decision-making rules from on-site data requires detailed interviews with the specialist staff, so it is difficult to complete in a short period the analysis needed to imitate the present disclosure. The pace at which research under the present disclosure extends and extracts candidates and extends and develops the customer-visit prediction model is therefore expected to exceed that of imitation.

Research in Japan on methods for extracting decision rules from data was actively pursued around 2005, mainly on extracting the operational decision rules held by reactor operators in chemical plants. In chemical plants, numerical data on reactor behavior had been accumulated, but the operators' intentions and judgment criteria regarding the amount and timing of their operations had not been recorded, which made it necessary to study how to extract the operational decision rules held in particular by operators with long careers. Although a great deal of research was carried out in that field, no effective solution has been shown.

[First Embodiment]
Next, embodiments will be described in detail with reference to the drawings as appropriate. However, unnecessarily detailed description may be omitted. For example, detailed descriptions of already well-known matters and redundant descriptions of substantially identical configurations may be omitted. This is to keep the following description from becoming unnecessarily redundant and to facilitate understanding by those skilled in the art.

The inventor provides the accompanying drawings and the following description so that those skilled in the art can fully understand the present disclosure, and does not intend them to limit the subject matter described in the claims.

[2.1. Configuration of the food purchase number determination support system]
FIG. 2 is a block diagram of the food purchase number determination support system 1 according to the first embodiment. The food purchase number determination support system 1 shown in FIG. 2 includes a food purchase number determination support device 2, which is constituted by a PDA or the like and is connected to an external network 16, and a management server 18 connected to the external network 16. The external network 16 is, for example, a LAN, a WAN, or the Internet.

The food purchase number determination support device 2 shown in FIG. 2 includes a control unit 4, an input unit 8, a display unit 6, a storage unit 10, and a communication interface unit 12. The control unit 4 performs arithmetic processing and control processing for the entire device. The input unit 8 is the part that generates or receives input data for the device 2 and is usually constituted by a keyboard, a mouse, a touch panel, or the like. The input unit 8 may be constituted by a touch sensor superimposed on the display surface of the display unit 6, in which case the input unit 8 accepts touch (contact) operations on the image displayed by the display unit 6. The input unit 8 and the display unit 6 may thus be formed integrally or as separate bodies.

The display unit 6 is the part that displays processing results and the like from the control unit 4 as images, and is constituted by a liquid crystal display, an organic EL display, a plasma display, or the like. As an output unit, the display unit 6 may additionally include an audio device such as a speaker. The storage unit 10 stores the programs that run on the control unit 4, the parameter data required for food purchase number determination support, and the like. The storage unit 10 is constituted by, for example, a flash memory. The communication interface unit 12 is the part that operates as an interface with the external network 16. The control unit 4, the input unit 8, the display unit 6, the storage unit 10, and the communication interface unit 12 are connected to one another by an appropriate bus 14.

The food purchase number determination support device 2 is an information processing device such as a PDA (portable digital assistant), a tablet terminal, a notebook computer, a desktop computer, or a workstation.

The control unit 4 is, for example, a CPU or an MPU, and controls the operation of the food purchase number determination support device 2 as a whole. The arithmetic processing in the control unit 4 and the control processing of the whole device are realized by cooperation between a processor, which is a hardware resource, and a program, which is software recorded in the storage unit 10 or loaded from outside via the external network 16. The control unit 4 may instead be realized by a hardware circuit (ASIC, FPGA, etc.), such as a purpose-designed electronic circuit or a reconfigurable electronic circuit. The characteristic operation of the food purchase number determination support device 2 according to the first embodiment is realized by the arithmetic processing and control processing in the control unit 4.

The management server 18 constitutes a computer system that supports and manages the overall operations of a supermarket or grocery store that purchases food. The management server 18 stores data such as stock counts, sales forecasts, expiration dates, remaining budget, sales, events, and visitor counts. The food purchase number determination support device 2 retrieves these data as needed via the external network 16 and the communication interface unit 12, and either records them in the storage unit 10 or uses them in the arithmetic processing of the control unit 4.

As explained earlier, data such as stock counts, sales forecasts, expiration dates, remaining budget, sales, events, and visitor counts are the state description parameter data needed for food purchase number determination support.
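As an illustration only, the state description parameters listed above could be grouped into a single record. The class name, field names, field types, and the bucketing used to form a lookup key are all assumptions made for this sketch; the text names the kinds of data but does not define a data layout.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class StateDescription:
    """Hypothetical record of the state description parameters for one item."""
    stock_count: int         # units currently in stock
    sales_forecast: int      # forecast units sold for the day
    days_to_expiry: int      # days until the expiration date
    budget_remaining: float  # remaining purchasing budget
    sales: float             # sales amount
    event: bool              # whether an event is scheduled
    visitor_count: int       # number of store visitors

    def as_key(self, bucket: int = 10) -> Tuple[int, int, int, bool]:
        """Discretise a few fields into a hashable key (assumed bucketing)."""
        return (
            self.stock_count // bucket,
            self.sales_forecast // bucket,
            self.days_to_expiry,
            self.event,
        )
```

A record like this could be filled from the data retrieved from the management server 18 and then used as the key into a table of evaluation values.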

All or part of the processing in the food purchase number determination support device 2 may instead be performed by the management server 18. In addition, the computer program that executes the processing of the food purchase number determination support device 2 described above may be stored in a storage unit (not shown) of the management server 18 and downloaded by the food purchase number determination support device 2 to its own storage unit 10 when needed.

[2.2. Operation of the Food Purchase Number Determination Support System]
FIGS. 3 and 4 are flowcharts showing the operation of the food purchase number determination support device 2 within the food purchase number determination support system 1 according to the first embodiment. The operation of the food purchase number determination support device 2 according to the first embodiment is described below with reference to FIGS. 3 and 4.

When work starts (step S02), the number of business days is determined first (step S04). The number of business days is entered, for example, from the input unit 8. Next, the purchase numbers of the items to be decided are initialized (step S06).

Next, preliminary learning is executed (step S08). Preliminary learning is executed for each business day; for example, data on stock counts and sales forecasts are input from the management server 18 and an evaluation is calculated for each purchase number. The details of the preliminary learning process are described later with reference to FIG. 4.

After preliminary learning, it is determined whether there is an item whose purchase number is undetermined (step S10). If there is such an item (step S10: YES), the purchase number candidates for the undetermined item and their evaluations are displayed on the display unit 6, and a purchase number is selected on that basis (step S12). The evaluations of the purchase number candidates are those calculated by preliminary learning. The selected purchase number is entered in the purchase table p_i (step S14).

FIG. 5 is an example of a screen on the display unit 6 of the food purchase number determination support device 2, showing the purchase number candidates for item A and their evaluations. The operator of the food purchase number determination support device 2 selects one of the purchase number candidates displayed on the screen.
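The candidate display described above can be pictured with a short sketch that orders purchase number candidates by their evaluation and formats them as text. The function names, the layout, and the sample values are hypothetical; the actual screen of FIG. 5 is not reproduced here.

```python
from typing import Dict, List, Tuple


def rank_candidates(evaluations: Dict[int, float]) -> List[Tuple[int, float]]:
    """Return (purchase number, evaluation) pairs, best evaluation first."""
    return sorted(evaluations.items(), key=lambda kv: kv[1], reverse=True)


def format_screen(item: str, evaluations: Dict[int, float]) -> str:
    """Build a plain-text listing of candidates for one item (assumed layout)."""
    lines = [f"Item {item}"]
    for quantity, score in rank_candidates(evaluations):
        lines.append(f"  {quantity:>4d} units  evaluation {score:6.2f}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Sample values are made up for illustration.
    print(format_screen("A", {10: 0.4, 20: 0.9, 30: 0.1}))
```

Sorting is a presentation choice in this sketch; the operator still makes the final selection.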

Next, it is determined whether there is an item whose allocation ratio is undetermined (step S16). If there is such an item (step S16: YES), the allocation ratio candidates for the undetermined item and their evaluations are displayed on the display unit 6, and an allocation ratio is selected on that basis (step S18). The evaluations of the allocation ratio candidates are those calculated by preliminary learning. The selected allocation ratio is entered in the allocation table p_i (step S20). If a further item with an undetermined allocation ratio exists (step S16: YES), steps S18 and S20 are executed for that item. If no item with an undetermined allocation ratio exists (step S16: NO), the process returns to step S10.
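To make the idea of an allocation ratio concrete, the sketch below splits a whole purchase number into integer shares according to a ratio given as integer weights (for example 7:3). The largest-remainder rounding and the function name are assumptions for illustration; the text does not state how fractional shares are handled or what the shares are allocated to.

```python
from typing import List, Sequence


def split_by_ratio(total: int, weights: Sequence[int]) -> List[int]:
    """Split `total` units by integer weights, keeping the sum equal to `total`.

    Uses largest-remainder rounding (an assumed rule, not taken from the text).
    """
    if total < 0 or not weights or any(w < 0 for w in weights) or sum(weights) == 0:
        raise ValueError("total must be >= 0 and weights must be non-negative, not all zero")
    denominator = sum(weights)
    shares = [total * w // denominator for w in weights]
    remainders = [total * w % denominator for w in weights]
    leftover = total - sum(shares)
    # Hand the leftover units to the largest remainders; ties go to the earlier entry.
    order = sorted(range(len(weights)), key=lambda i: (-remainders[i], i))
    for i in order[:leftover]:
        shares[i] += 1
    return shares
```

Integer weights are used instead of floating-point ratios so that the shares always add up exactly to the selected purchase number.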

If, at step S10, a further item with an undetermined purchase number exists (step S10: YES), steps S12 to S16 are repeated. If it is determined at step S10 that no item with an undetermined purchase number remains (step S10: NO), the reward R is calculated and received, and the stock counts are updated (step S22).
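The decision loop of steps S10 to S20 can be summarised by the following sketch. It is a simplification under stated assumptions: each item's allocation ratio is chosen immediately after its purchase number, and the two selection steps are passed in as callables standing in for the operator's choice on the display unit 6. All names are hypothetical.

```python
from typing import Callable, Dict, Iterable, Tuple

Ratio = Tuple[int, ...]


def decide_day(
    items: Iterable[str],
    choose_purchase: Callable[[str], int],
    choose_ratio: Callable[[str, int], Ratio],
) -> Tuple[Dict[str, int], Dict[str, Ratio]]:
    """Fill a purchase table and an allocation table for one business day."""
    purchase_table: Dict[str, int] = {}
    allocation_table: Dict[str, Ratio] = {}
    undecided = list(items)
    while undecided:                                   # step S10: any item left?
        item = undecided.pop(0)
        quantity = choose_purchase(item)               # step S12: select purchase number
        purchase_table[item] = quantity                # step S14: record in purchase table
        ratio = choose_ratio(item, quantity)           # steps S16-S18: select allocation ratio
        allocation_table[item] = ratio                 # step S20: record in allocation table
    return purchase_table, allocation_table
```

Once the loop ends, the flow of FIG. 3 proceeds to the reward calculation and stock update of step S22, which this sketch leaves out.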

It is then checked whether work should end (step S24). If work has been completed for all of the business days determined in step S04 (step S24: YES), the whole procedure ends (step S26). Otherwise (step S24: NO), the process returns to step S06 and switches to the work for the next business day. The purchase numbers of the items to be decided are initialized and the order quantity of each item returns to 0, whereas the total stock and the stock per best-before date are updated by carrying over the previous day's sales results. Step S08 and the subsequent steps are then executed.
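The carry-over of stock per best-before date into the next business day might look like the sketch below. It assumes that the oldest stock is sold first and that stock on its last day is disposed of at closing; neither rule is stated in the text, and the function name and data layout are hypothetical.

```python
from typing import List, Tuple

Lot = Tuple[int, int]  # (days left until best-before date, units)


def end_of_day(lots: List[Lot], sold: int) -> Tuple[List[Lot], int, int]:
    """Apply one day's sales, then age the stock by one day.

    Returns (lots carried to the next day, units disposed of, unmet demand).
    Assumes oldest-first sales and disposal of lots with one day or less left.
    """
    remaining: List[Lot] = []
    to_sell = sold
    for days_left, units in sorted(lots):      # oldest stock first
        taken = min(units, to_sell)
        to_sell -= taken
        if units > taken:
            remaining.append((days_left, units - taken))
    disposed = sum(units for days_left, units in remaining if days_left <= 1)
    carried = [(days_left - 1, units) for days_left, units in remaining if days_left > 1]
    return carried, disposed, to_sell
```

Under these assumptions the disposed units correspond to disposal loss and the unmet demand to opportunity loss, the two quantities the system is intended to keep small.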

FIG. 4 is a flowchart showing the detailed operation of the preliminary learning process in the food purchase number determination support device 2 according to the first embodiment. The preliminary learning process is described in detail below with reference to FIG. 4.

When learning starts (step S0802), the purchase numbers of the items to be decided are initialized (step S0804).

Next, it is determined whether there is an item whose purchase number is undetermined (step S0806). If there is such an item (step S0806: YES), a purchase number is selected for the undetermined item; based on this, the discount rate γ_i is calculated and Q1_i and Q2_i are updated (step S0808). In updating Q1_i and Q2_i, the update target depends on whether the most recent selection was a purchase number or an allocation (ratio). Subsequently, the purchase table p_i is constructed, and

[expression not reproduced in this text]

is stored (step S0810).

Next, it is determined whether there is an item whose allocation ratio is undetermined (step S0812). If there is such an item (step S0812: YES), an allocation ratio is selected for the undetermined item; based on this, the discount rate γ_i is calculated and Q2_i and Q1_i are updated (step S0814). In updating Q2_i and Q1_i, the update target depends on whether the most recent selection was a purchase number or an allocation (ratio). Subsequently, the purchase table p_i is constructed, and

[expression not reproduced in this text]

is stored (step S0816). If a further item with an undetermined allocation ratio exists (step S0812: YES), steps S0814 and S0816 are repeated for that item. If no item with an undetermined allocation ratio exists (step S0812: NO), the process returns to step S0806.

If, at step S0806, a further item with an undetermined purchase number exists (step S0806: YES), steps S0808 to S0812 are repeated. If it is determined at step S0806 that no item with an undetermined purchase number remains (step S0806: NO), the reward R is calculated and received, and the stock counts are updated (step S0818).

It is then checked whether work should end (step S0820). If so (step S0820: YES), the whole procedure ends (step S0822). If not (step S0820: NO), the process returns to step S0804 and preliminary learning resumes from the initialization of the purchase numbers of the items to be decided.
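The way preliminary learning keeps two evaluation functions, updating one or the other depending on the kind of the latest selection, can be sketched as follows. The tabular update rule shown is the standard Q-learning form and is an assumption; this section does not spell out the update formula. The learning rate `alpha`, the class and method names, and the way the discount rate is passed in as `gamma` are likewise hypothetical.

```python
from collections import defaultdict
from typing import Hashable, Iterable


class TwoTableLearner:
    """Keeps Q1 (purchase numbers) and Q2 (allocation ratios) as lookup tables."""

    def __init__(self, alpha: float = 0.1) -> None:
        self.alpha = alpha                 # learning rate (assumed parameter)
        self.q1 = defaultdict(float)       # (state, purchase number) -> evaluation
        self.q2 = defaultdict(float)       # (state, allocation ratio) -> evaluation

    def _table(self, kind: str):
        if kind == "purchase":
            return self.q1
        if kind == "allocation":
            return self.q2
        raise ValueError(f"unknown selection kind: {kind}")

    def update(self, kind: str, state: Hashable, action: Hashable,
               reward: float, gamma: float, next_best: float) -> None:
        """Update only the table matching the kind of the latest selection."""
        table = self._table(kind)
        old = table[(state, action)]
        # Standard Q-learning step (assumed): move toward reward + gamma * next_best.
        table[(state, action)] = old + self.alpha * (reward + gamma * next_best - old)

    def best(self, kind: str, state: Hashable, candidates: Iterable[Hashable]):
        """Return the candidate with the highest stored evaluation."""
        table = self._table(kind)
        return max(candidates, key=lambda a: table[(state, a)])
```

After enough passes through the loop of FIG. 4, the stored values would play the role of the evaluations shown next to each candidate during the actual decision of FIG. 3.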

[Other Embodiments]
The first embodiment has been described above as an example of the technology disclosed in the present application. The technology of the present disclosure is not limited to it, however, and is also applicable to embodiments in which changes, replacements, additions, omissions, and the like are made as appropriate.

The accompanying drawings and the detailed description have been provided to explain the embodiment. The components shown in the drawings and described in the detailed description may therefore include not only components essential to solving the problem but also components that are not essential to solving it and are included only to illustrate the technology. Accordingly, the mere fact that such non-essential components appear in the drawings or the detailed description should not be taken to mean that they are essential.

Furthermore, because the embodiment described above is intended to illustrate the technology of the present disclosure, various changes, replacements, additions, omissions, and the like may be made within the scope of the claims or their equivalents.

DESCRIPTION OF SYMBOLS: 1 ... food purchase number determination support system, 2 ... food purchase number determination support device, 4 ... control unit, 6 ... display unit, 8 ... input unit, 10 ... storage unit, 12 ... communication interface unit, 14 ... bus, 16 ... external network, 18 ... management server.

Claims

A food purchase number determination support system relating to a method of learning and deriving an optimal combination of food purchase numbers, the system comprising:
a learning unit that creates a learning model of an optimal combination of food purchase numbers; and
a calculation unit that calculates, based on the learning model, an evaluation value for each purchase number candidate of each item and an evaluation value for each allocation ratio candidate when the purchase number of one item is allocated to a plurality,
wherein the learning model comprises at least purchase number candidates for each item, state description parameters, a discount rate, and a first evaluation function, as well as allocation ratio candidates for the case where the purchase number of one item is allocated to a plurality, state description parameters, and a second evaluation function,
wherein the learning unit
determines, for each item, the discount rate based on a loss amount, and determines, for each purchase number candidate, the first evaluation function based on the state description parameters and the determined gross profit, and
further determines, for the case where the purchase number of one item is allocated to a plurality, the second evaluation function for each allocation ratio candidate based on the state description parameters and the determined gross profit, and
wherein the calculation unit
derives the evaluation value for each purchase number candidate of each item from the first evaluation function for each purchase number candidate of each item in the learning model, and
derives the evaluation value for each allocation ratio candidate, for the case where the purchase number of one item is allocated to a plurality, from the second evaluation function for each allocation ratio candidate in the learning model.