JP2025062680A

JP2025062680A - Image right protection system, protection method and protection program

Info

Publication number: JP2025062680A
Application number: JP2023171860A
Authority: JP
Inventors: 薫渡部; Kaoru Watabe
Original assignee: Individual
Current assignee: Individual
Priority date: 2023-10-03
Filing date: 2023-10-03
Publication date: 2025-04-15

Abstract

To provide a portrait voice right management system capable of proving that data of human voice is unique by associating and managing the human voice with personal information on the owner of the voice and of efficiently and securely managing and utilizing voice data by guaranteeing the voice data to a "portrait voice right" corresponding to a portrait right.SOLUTION: In a portrait voice right management system, a guarantee system includes a voice data generation part for acquiring a sound signal to generate voice data D1, a relevant data acquisition part for acquiring personal information D2 including personal information, a portrait voice right management part for associating voice data with the personal information to generate portrait voice right data, and a plurality of nodes 90a to 90f for encrypting and storing at least part of the portrait voice right data. The nodes 90a to 90f aggregate at least part of the portrait voice right data at prescribed timing to form a block chain and share the block chain by a plurality of nodes to store the block chain as a distributed ledger.SELECTED DRAWING: Figure 9

Description

本発明は、個々の人ごとに、その人固有の特徴を含む声データを保護するためのシステム、方法及びプログラムに関する。 The present invention relates to a system, method and program for protecting voice data, including characteristics unique to each individual.

人間の音声を録音した声データは、個々の人ごとにその人の固有の特徴である、いわゆる「声紋」を含んでおり、周波数、ピッチ、トーン、強度など、多くの要素によって構成され、指紋や網膜スキャンと同様に、個々の人々を識別するバイオメトリクス（生体認証）の一つとして利用されている（例えば、特許文献１参照。）。 Voice data recorded from a human voice contains the so-called "voiceprint," a unique characteristic of each individual, and is made up of many elements such as frequency, pitch, tone, and intensity. Like fingerprints and retinal scans, voice data is used as one of the biometrics for identifying individual people (see, for example, Patent Document 1).

この声データは、個々の人を高い確率で識別できることから非常に個人的なデータであり、不正に取得・使用されるとプライバシー侵害につながる可能性があることから、その保存や転送において高度なセキュリティ対策が必要であるとともに、適切な管理と利用が求められる。 This voice data is highly personal because it can identify individual people with a high degree of certainty, and since its unauthorized acquisition and use could lead to a violation of privacy, it requires advanced security measures for its storage and transfer, as well as appropriate management and use.

特開2007-052496号公報JP 2007-052496 A

しかしながら、上述したような声データの管理が適正に行われていないときには、声データが匿名で流通して、その所有権や出典が不明確となっている場合が多く、不正使用や権利侵害が起きやすい状況が生じやすくなっている。また、声データは、デジタルデータであるため、容易に複製や改ざんが可能であり、その結果、不正コピーや悪用される可能性もある。さらに、声データは、上述した声紋のように、個人を特定可能な情報を含んでおり、これが不正に流用されるとプライバシー侵害や、事件・事故につながる可能性がある。 However, when voice data is not properly managed as described above, it is often distributed anonymously and its ownership and origin are unclear, creating a situation where unauthorized use and infringement of rights are likely to occur. In addition, because voice data is digital data, it can be easily copied or tampered with, which can result in unauthorized copying and misuse. Furthermore, voice data, like the voiceprints mentioned above, contains information that can identify individuals, and if this data is misused, it could lead to privacy violations or incidents or accidents.

そこで、本発明は、上記のような問題を解決するものであり、人の声とその声の持ち主の個人情報をブロックチェーンで紐付けて管理することで、その声データが唯一無二であることを証明し、それを肖像権に相当する「肖声権」まで保障することで、声データを効率的かつ安全に管理・活用できる監視システム、監視方法及び監視プログラムを提供することを目的とする。 The present invention aims to solve the above problems by providing a monitoring system, monitoring method, and monitoring program that can efficiently and safely manage and utilize voice data by linking a person's voice with the personal information of the person who owns that voice and managing it using a blockchain, thereby proving that the voice data is unique and even guaranteeing a "right to likeness," which is equivalent to the right to one's image.

上記課題を解決するために、本発明は、声データを肖声権として管理するシステムであって、
音響信号を取得して声データを生成する声データ生成部と、
個人情報を含む関連データを取得する関連データ取得部と、
前記声データと前記関連データとを関連づけて肖声権データとして生成する肖声権データ生成部と、
前記肖声権データの少なくとも一部を暗号化して記憶するデータ保管手段と
を備えることを特徴とする。 In order to solve the above problems, the present invention provides a system for managing voice data as a voice record, comprising:
a voice data generating unit that acquires an acoustic signal and generates voice data;
a related data acquisition unit for acquiring related data including personal information;
a voice data generating unit for generating voice data by associating the voice data with the related data;
and a data storage means for encrypting and storing at least a part of the image rights data.

また、本発明は、声データを肖声権として管理する方法であって、
（１）声データ生成部が、音響信号を取得して声データを生成するとともに、関連データ取得部が、個人情報を含む関連データを取得し、肖声権データ生成が、前記声データと前記関連データとを関連づけて肖声権データとして生成する肖声権データ生成ステップと、
（２）データ保管手段が、前記肖声権データの少なくとも一部を暗号化して記憶するデータ保管ステップと
を含むことを特徴とする。 The present invention also provides a method for managing voice data as a voice record, comprising the steps of:
(1) A voice data generating step in which a voice data generating unit acquires an acoustic signal to generate voice data, a related data acquiring unit acquires related data including personal information, and a voice data generating step associates the voice data with the related data to generate voice data;
(2) The method is characterized in that the data storage means includes a data storage step of encrypting and storing at least a part of the image right data.

上記発明において、前記データ保管手段は、通信ネットワーク上に構築された保証システムと連携して、前記肖声権データの少なくとも一部を暗号化して記憶させる保証システム連携部を含み、
前記保証システムは、前記肖声権データの少なくとも一部を暗号化して記憶する複数のノードを備え、
前記ノードは、前記肖声権データの少なくとも一部を所定のタイミングで集約してブロック化し、このブロックを既存のブロックに連結してブロックチェーンを形成し、該ブロックチェーンを複数の前記ノードで共有して分散台帳として記憶する
ことが好ましい。 In the above invention, the data storage means includes a guarantee system linking unit that links with a guarantee system constructed on a communication network and encrypts and stores at least a part of the image right data;
The security system includes a plurality of nodes that encrypt and store at least a portion of the image recognition data;
It is preferable that the node aggregates at least a portion of the voting rights data at a predetermined timing, forms a block, links the block to an existing block to form a blockchain, and shares the blockchain among multiple nodes and stores it as a distributed ledger.

上記発明において、前記声データ生成部は、取得した前記音響信号を取得した日時を含む時間情報を前記関連データに追加する機能を有し、
前記関連データ取得部は、追加された前記音響信号を取得した日時と、前記個人情報に含まれる年齢に関する情報との相関を解析し、その解析結果を前記関連データに含める機能を有することが好ましい。 In the above invention, the voice data generation unit has a function of adding time information including a date and time when the acquired acoustic signal was acquired to the associated data,
It is preferable that the associated data acquisition unit has a function of analyzing the correlation between the date and time when the added acoustic signal was acquired and information relating to age contained in the personal information, and including the analysis result in the associated data.

上記発明では、
前記声データに含まれる周波数、持続時間、振幅、強度又は周波数成分のいずれかを特徴パラメータとして抽出する音声特徴抽出部と、
前記特徴パラメータに基づいて前記声データの声質又は言語を変換する変換部と、
前記変換部が声質又は言語を変換した後の前記声データの特徴パラメータを変換パラメータとして記録する変換パラメータ記録部と
をさらに備えることが好ましい。 In the above invention,
a voice feature extraction unit for extracting any one of frequency, duration, amplitude, intensity, and frequency components contained in the voice data as a feature parameter;
a conversion unit that converts the voice quality or language of the voice data based on the feature parameters;
It is preferable that the voice data transmission apparatus further comprises a conversion parameter recording section for recording, as conversion parameters, characteristic parameters of the voice data after the conversion section converts the voice quality or language.

また、上記発明では、
前記声データに含まれる周波数、持続時間、振幅、強度又は周波数成分のいずれかを特徴パラメータとして抽出する音声特徴抽出部と、
前記特徴パラメータに基づいて前記声データの声質又は言語を変換する変換部と、
前記変換部が声質又は言語を変換した後の前記声データの特徴パラメータを変換パラメータとして記録する変換パラメータ記録部と
をさらに備えることが好ましい。 In addition, in the above invention,
a voice feature extraction unit for extracting any one of frequency, duration, amplitude, intensity, and frequency components contained in the voice data as a feature parameter;
a conversion unit that converts the voice quality or language of the voice data based on the feature parameters;
It is preferable that the voice data transmission apparatus further comprises a conversion parameter recording section for recording, as conversion parameters, characteristic parameters of the voice data after the conversion section converts the voice quality or language.

なお、上述した本発明に係るシステムや方法は、所定の言語で記述された本発明のプログラムをコンピューター上で実行することにより実現することができる。すなわち、本発明のプログラムを、携帯端末装置やスマートフォン、ウェアラブル端末、モバイルＰＣその他の情報処理端末、パーソナルコンピューターやサーバーコンピューター等の汎用コンピューターのＩＣチップ、メモリ装置にインストールし、ＣＰＵ上で実行することにより、上述した各機能を有するシステムを構築して、本発明に係る方法を実施することができる。 The above-mentioned system and method according to the present invention can be realized by executing the program of the present invention written in a specific language on a computer. That is, by installing the program of the present invention in an IC chip or memory device of a portable terminal device, smartphone, wearable terminal, mobile PC or other information processing terminal, or a general-purpose computer such as a personal computer or server computer and executing it on the CPU, a system having the above-mentioned functions can be constructed and the method according to the present invention can be implemented.

また、本発明のプログラムは、例えば、通信回線を通じて配布することが可能であり、またコンピューターで読み取り可能な記録媒体に記録することにより、スタンドアローンの計算機上で動作するパッケージアプリケーションとして譲渡することができる。具体的にこの記録媒体としては、フレキシブルディスクやカセットテープ等の磁気記録媒体、若しくはＣＤ-ＲＯＭやＤＶＤ-ＲＯＭ等の光ディスクの他、ＲＡＭカードなど、種々の記録媒体に記録することができる。そして、このプログラムを記録したコンピューター読み取り可能な記録媒体によれば、汎用のコンピューターや専用コンピューターを用いて、上述したシステム及び方法を簡便に実施することが可能となるとともに、プログラムの保存、運搬及びインストールを容易に行うことができる。 The program of the present invention can be distributed, for example, via communication lines, and can be transferred as a package application that runs on a stand-alone computer by recording it on a computer-readable recording medium. Specifically, the recording medium can be a magnetic recording medium such as a flexible disk or cassette tape, or an optical disk such as a CD-ROM or DVD-ROM, or a RAM card, or other various recording media. Furthermore, a computer-readable recording medium on which the program is recorded can make it possible to easily implement the above-mentioned system and method using a general-purpose computer or a dedicated computer, and the program can be easily stored, transported, and installed.

以上述べたように、これらの発明によれば、人の声とその声の持ち主の個人情報を紐付けて管理し、権利として保障することで、声データを効率的かつ安全に管理・活用できる。詳述すると、本発明によれば、声データを、「肖声権」として保護することができ、この肖声権により、声のデータに明確な所有権を設定することができ、不正使用や権利侵害を防ぐことが可能となる。 As described above, these inventions allow for the efficient and safe management and use of voice data by linking a person's voice with the personal information of the voice's owner and guaranteeing the rights to that voice data. In more detail, the present invention allows for the protection of voice data as a "voice image right," and this voice image right allows for clear ownership of the voice data, making it possible to prevent unauthorized use and infringement of rights.

また、本発明では、ブロックチェーン技術を介して、声データとその持ち主の個人情報が紐付けられることとなり、プライバシーが強固に保護されるとともに、肖声権が確立されることで、声データの不正使用に対して法的に対処することが容易となる。 In addition, with this invention, voice data is linked to the personal information of its owner via blockchain technology, providing strong privacy protection and establishing the right to voice ownership, making it easier to take legal action against unauthorized use of voice data.

さらに、本発明により、声データを肖声権として適正に保護することができることから、安全に商用化する新しいビジネスモデルが可能となり、声データに明確な所有権を設定できるとともに、声データの持ち主がその利用に対して適切に報酬を受けられるようになるため、声データをより自由に、かつ安心してクリエイティブな用途に応用することができ、また、声データが認証され、その信頼性が確保されるため、さまざまな産業での声データの活用が促進されることが期待できる。 Furthermore, the present invention makes it possible to properly protect voice data as a voice right, enabling new business models for safe commercialization; it also allows clear ownership of voice data and enables the owner of the voice data to be appropriately compensated for its use, allowing voice data to be applied more freely and safely for creative purposes; and, because voice data is authenticated and its reliability is ensured, it is expected that the use of voice data in a variety of industries will be promoted.

実施形態に係る肖声権管理システムの全体構成を示す概念図である。1 is a conceptual diagram showing an overall configuration of a copyright management system according to an embodiment; 実施形態に係るボイスライツ管理サーバーの内部構成を示すブロック図である。A block diagram showing the internal configuration of a voice rights management server according to an embodiment. （ａ）は実施形態に係るスマートフォンの内部構成を示すブロック図であり、（ｂ）は声データ生成部及び関連データ取得部の構成を示すブロック図である。FIG. 2A is a block diagram showing the internal configuration of a smartphone according to an embodiment, and FIG. 2B is a block diagram showing the configurations of a voice data generating unit and a related data acquiring unit. 実施形態に係る保証システムの内部構成を示すブロック図である。1 is a block diagram showing an internal configuration of a guarantee system according to an embodiment. 実施形態に係る肖声権管理サービスの動作を示すシーケンス図である。10 is a sequence diagram showing an operation of the voice right management service according to the embodiment; FIG. 実施形態に係る声データ取得から関連データ取得までにおける声データ再生成時の動作を示すシーケンス図である。A sequence diagram showing the operation when regenerating voice data from obtaining voice data to obtaining related data in the embodiment. 実施形態に係る肖声権管理システムのＮＦＴ移転時における手順を示すフロー図である。FIG. 1 is a flow diagram showing a procedure for NFT transfer in the image rights management system according to the embodiment. 実施形態に係る肖声権管理システムにおける公開鍵と秘密鍵との関係を例示する説明図である。2 is an explanatory diagram illustrating a relationship between a public key and a private key in the image rights management system according to the embodiment; 実施形態においてＮＦＴの発行・移転・消却に関するトランザクション（取引）の定義を例示する説明図である。An explanatory diagram illustrating the definition of transactions related to the issuance, transfer, and cancellation of NFTs in an embodiment. 実施形態に係る肖声権管理システムのブロックチェーンに関する説明図である。FIG. 1 is an explanatory diagram of a block chain of a copyright management system according to an embodiment. 実施形態に係る肖声権管理システムのブロックチェーンに関する説明図である。FIG. 1 is an explanatory diagram of a block chain of a copyright management system according to an embodiment. 実施形態に係る肖声権管理システムのブロックチェーンに関する説明図である。FIG. 1 is an explanatory diagram of a block chain of a copyright management system according to an embodiment.

以下に添付図面を参照して、本発明に係る肖声権管理方法、管理システム、管理プログラムの実施形態を詳細に説明する。本実施形態では、肖声権管理システムにより提供される電力取引サービスにおいて、本発明に係る環境価値デジタルアートの生成方法等を利用する場合を例示する。なお、以下に示す実施の形態は、この発明の技術的思想を具体化するための装置等を例示するものであって、この発明の技術的思想は、各構成部品の材質、形状、構造、配置等を下記のものに特定するものでない。この発明の技術的思想は、特許請求の範囲において、種々の変更を加えることができる。 Below, with reference to the attached drawings, an embodiment of the voice rights management method, management system, and management program according to the present invention will be described in detail. In this embodiment, an example is shown of a case where the environmental value digital art generation method according to the present invention is used in an energy trading service provided by a voice rights management system. Note that the embodiment shown below is an example of an apparatus for embodying the technical idea of the present invention, and the technical idea of the present invention does not specify the material, shape, structure, arrangement, etc. of each component part as described below. The technical idea of the present invention may be modified in various ways within the scope of the claims.

（肖声権管理肖声権管理システムの全体構成）
本実施形態では、肖声権管理システムにより肖声権管理サービスが提供される。図１は、本実施形態に係る肖声権管理サービスにおける取引形態を示す概念図である。同図に示すように、本実施形態では、通信ネットワーク２上に構築された肖声権管理システム１を通じて、肖声権（ボイスライツ）の管理及び利用を行う肖声権管理サービスが提供される。具体的に、本サービスでは、ユーザーの声データを、ユーザーが使用するスマートフォン５を通じて、ユーザーＵ１の声を声データＤ１として記録するとともに、ボイスライツ管理サーバー３により管理・運用を行う。 (Overall configuration of the photography rights management system)
In this embodiment, a voice right management service is provided by a voice right management system. Fig. 1 is a conceptual diagram showing a transaction form in the voice right management service according to this embodiment. As shown in the figure, in this embodiment, a voice right management service for managing and using voice rights is provided through a voice right management system 1 constructed on a communication network 2. Specifically, in this service, the voice data of a user U1 is recorded as voice data D1 through a smartphone 5 used by the user, and is managed and operated by a voice rights management server 3.

ここで肖声権（ボイスライツ）とは、ある人の声とその声の持ち主の個人情報Ｄ２をブロックチェーンで紐付けて管理し、権利として保障するものであり、この肖声権により、声のデータに明確な所有権を設定することができ、不正使用や権利侵害を防ぐことが可能となる。この声データの記録を、ユーザーＵ１一人について各年代（例えば乳児期～高齢期）においてその時点での声に肖声権を設定することができ、年代ごとの声が固有に有する価値を権利として保護するとともに利用することができる。 Here, voice rights refer to the linking and management of a person's voice and the personal information D2 of the voice owner via blockchain, and guaranteeing them as rights. Voice rights allow clear ownership to be set for the voice data, making it possible to prevent unauthorized use and infringement of rights. Voice rights can be set for the voice recorded for each user U1 at each age (for example, from infancy to old age) and the unique value of the voice of each age group can be protected as a right and used.

詳述すると、本実施形態に係る肖声権管理システム１は、声データＤ１を肖声権として管理するボイスライツ管理サーバー３と、各ユーザーが使用する情報処理端末であるスマートフォン５とが、通信ネットワーク２を通じて相互に接続されて構成されている。なお、本実施形態では、スマートフォン５を情報処理端末の一例として説明する。また、通信ネットワーク２上には、肖声権管理の保証を行うブロックチェーンインターフェースサービスを提供する保証システム４が設置され、この保証システム４を通じてブロックチェーンのような分散台帳システムに接続可能となっている。 More specifically, the voice rights management system 1 according to this embodiment is configured by connecting a voice rights management server 3, which manages voice data D1 as a voice right, and a smartphone 5, which is an information processing terminal used by each user, through a communication network 2. In this embodiment, the smartphone 5 is described as an example of an information processing terminal. In addition, a guarantee system 4 is installed on the communication network 2, which provides a blockchain interface service that guarantees the voice rights management, and it is possible to connect to a distributed ledger system such as a blockchain through this guarantee system 4.

通信ネットワーク２は、通信プロトコルＴＣＰ／ＩＰを用いたインターネットなどのＩＰ網であって、種々の通信回線（電話回線やＩＳＤＮ回線、ＡＤＳＬ回線、光回線などの公衆回線、専用回線、ＬＴＥなどの第４世代（４Ｇ）の通信方式、及び第５世代（５Ｇ）以降の通信方式等の他、Ｗｉｆｉ（登録商標）、Ｂｌｕｅｔｏｏｔｈ（登録商標）などの無線通信ネットワーク）を相互に接続して構築される分散型の通信ネットワークである。このＩＰ網には、１０ＢＡＳＥ－Ｔや１００ＢＡＳＥ－ＴＸ等によるイントラネット（企業内ネットワーク）や家庭内ネットワークなどのＬＡＮなども含まれる。 The communication network 2 is an IP network such as the Internet that uses the communication protocol TCP/IP, and is a distributed communication network constructed by interconnecting various communication lines (public lines such as telephone lines, ISDN lines, ADSL lines, and optical lines, dedicated lines, fourth generation (4G) communication methods such as LTE, and fifth generation (5G) and later communication methods, as well as wireless communication networks such as Wi-Fi (registered trademark) and Bluetooth (registered trademark). This IP network also includes LANs such as intranets (corporate networks) and home networks using 10BASE-T, 100BASE-TX, etc.

ボイスライツ管理サーバー３は、肖声権（ボイスライツ）の管理及び利用を行うサーバー装置であり、本実施形態では、スマートフォン５を通じて、ユーザーＵ１の声データＤ１を、肖声権（ボイスライツ）データＤ３として生成し、保存し、管理する機能を備え、単一のサーバー装置、又は複数のサーバー装置群により実現することができる。また、ボイスライツ管理サーバー３は、複数の機能モジュールをＣＰＵ上に仮想的に構築し、それぞれの機能モジュールが協動して処理を実行するように構成されている。また、このボイスライツ管理サーバー３は、通信機能により通信ネットワーク２を通じて、データ送受信を行うことができるとともに、Ｗｅｂサーバー機能によりブラウザソフトを通じてＷｅｂサイトの提示及びサービスの提供を行うことができる。 The voice rights management server 3 is a server device that manages and utilizes voice rights, and in this embodiment has the function of generating, storing, and managing the voice data D1 of the user U1 as voice rights data D3 through the smartphone 5, and can be realized by a single server device or a group of multiple server devices. The voice rights management server 3 is configured to virtually construct multiple functional modules on a CPU, with each functional module working together to execute processing. The voice rights management server 3 can also send and receive data through the communication network 2 using the communication function, and can present websites and provide services through browser software using the web server function.

一方、スマートフォン５は、無線通信を利用した携帯可能な情報処理端末装置であり、無線基地局２２等の中継点とこのスマートフォン５とが無線で通信し、通話やデータ通信等の通信サービスを移動しつつ受けることができる。この通信方式としては、例えば、３Ｇ（3rd. Generation）方式、ＬＴＥ（Long Term Evolution）方式、４Ｇ方式、ＦＤＭＡ方式、ＴＤＭＡ方式、ＣＤＭＡ方式、Ｗ－ＣＤＭＡの他、ＰＨＳ（Personal Handyphone System）方式等が挙げられる。また、このスマートフォン５は、デジタルカメラ機能、アプリケーションソフトの実行機能、ＧＰＳ（Global Positioning System）などによる位置情報取得機能等、種々の機能が搭載され、タブレットＰＣ等のモバイルコンピューターも含まれる。 On the other hand, the smartphone 5 is a portable information processing terminal device that uses wireless communication, and the smartphone 5 communicates wirelessly with relay points such as the wireless base station 22, allowing users to receive communication services such as phone calls and data communications while moving. Examples of communication methods include 3G (3rd Generation), LTE (Long Term Evolution), 4G, FDMA, TDMA, CDMA, W-CDMA, and PHS (Personal Handyphone System). The smartphone 5 also includes various functions such as a digital camera function, an application software execution function, and a location information acquisition function using the Global Positioning System (GPS), and also includes mobile computers such as tablet PCs.

なお、スマートフォン５に関する位置情報取得機能は、自機の位置を示す位置情報を取得し、記録する機能であり、この位置情報取得機能としては、図１に示すように、例えば、ＧＰＳのように、衛星２１からの信号によって自機の位置を検出する方法や、携帯電話の無線基地局２２や、Ｗｉｆｉ通信のアクセスポイントからの電波強度などによって位置を検出する方法が含まれる。 The location information acquisition function of the smartphone 5 is a function for acquiring and recording location information indicating the location of the smartphone itself. Examples of this location information acquisition function include, as shown in FIG. 1, a method of detecting the location of the smartphone itself using signals from a satellite 21, such as GPS, and a method of detecting the location using the radio wave strength from a mobile phone wireless base station 22 or a Wi-Fi communication access point.

そして、このスマートフォン５は、情報を表示する表示部としての液晶ディスプレイ等の表示部５３ａを備えるとともに、ユーザーが入力操作を行うための操作ボタン等の操作デバイスを備え、この操作デバイスとしては、液晶ディスプレイに重畳されて配置され、液晶ディスプレイ上の座標位置を指定するタッチ操作などによる操作信号を取得する入力部としてのタッチパネル５２ａが含まれる。具体的にこのタッチパネル５２ａは、ユーザーの指先やペンなどを用いたタッチ操作による圧力や静電検知その他によって操作信号を入力する入力デバイスであり、グラフィックを表示する液晶ディスプレイと、この液晶ディスプレイに表示されたグラフィックの座標位置に対応した操作信号を受け付けるタッチセンサーとが重畳されて構成されている。 The smartphone 5 is provided with a display unit 53a such as a liquid crystal display as a display unit that displays information, and an operation device such as an operation button that allows the user to perform input operations. The operation device includes a touch panel 52a as an input unit that is arranged superimposed on the liquid crystal display and acquires an operation signal by a touch operation that specifies a coordinate position on the liquid crystal display. Specifically, the touch panel 52a is an input device that inputs an operation signal by pressure or electrostatic detection by a touch operation using the user's fingertip or pen, and is configured by superimposing a liquid crystal display that displays graphics and a touch sensor that receives an operation signal corresponding to the coordinate position of the graphic displayed on the liquid crystal display.

（各装置の内部構造）
次いで、上述した肖声権管理システム１を構成する各装置の内部構造について説明する。図２は、本実施形態に係るボイスライツ管理サーバー３の内部構成を示すブロック図であり、図３は、本実施形態に係るスマートフォン５の内部構成を示すブロック図である。なお、説明中で用いられる「モジュール」とは、装置や機器等のハードウェア、或いはその機能を持ったソフトウェア、又はこれらの組み合わせなどによって構成され、所定の動作を達成するための機能単位を示す。 (Internal structure of each device)
Next, the internal structure of each device constituting the voice rights management system 1 will be described. Fig. 2 is a block diagram showing the internal configuration of the voice rights management server 3 according to this embodiment, and Fig. 3 is a block diagram showing the internal configuration of the smartphone 5 according to this embodiment. Note that the term "module" used in the description refers to a functional unit that is configured by hardware such as a device or equipment, software having the function, or a combination of these, and that performs a predetermined operation.

（１）ボイスライツ管理サーバー３
先ず、ボイスライツ管理サーバー３の内部構成について説明する。ボイスライツ管理サーバー３は、通信ネットワーク２上に配置されたサーバー装置であり、通信ネットワーク２を通じて、各スマートフォン５とデータの送受信を行えるようになっている。 (1) Voice Rights Management Server 3
First, there will be described an internal configuration of the voice rights management server 3. The voice rights management server 3 is a server device disposed on the communication network 2, and is capable of transmitting and receiving data to and from each smartphone 5 via the communication network 2.

具体的にボイスライツ管理サーバー３は、通信ネットワーク２を通じてデータ通信を行う通信インターフェース３１と、利用者や利用者端末の権限を認証する認証部３３と、各利用者端末の位置情報を収集して管理する位置情報管理部３２と、肖声権データの生成、保管及び管理などの制御を実行する肖声権データ制御部３６と、各ユーザーに対して情報を配信する情報配信部３４と、種々のデータベース群３５ａ～３５ｂとを備えている。 Specifically, the voice rights management server 3 includes a communication interface 31 that communicates data via the communication network 2, an authentication unit 33 that authenticates the authority of users and user terminals, a location information management unit 32 that collects and manages the location information of each user terminal, a voice rights data control unit 36 that controls the generation, storage, and management of voice rights data, an information distribution unit 34 that distributes information to each user, and various databases 35a to 35b.

データベース群としては、生成された肖声権を記憶するデータ保管手段としての肖声権データベース３５ａと、利用者に関する情報を蓄積するユーザーデータベース３５ｂと、インターネット上から収集された時事情報や、天候情報、経済情報等の外部情報を蓄積する蓄積情報データベース３５ｃとが含まれる。これらの各データベースとしては、単一のデータベースとしてもよく、複数のデータベースに分割し、相互にリレーションを設定することで各データ同士を紐付けたリレーショナルデータベースとすることができる。 The database group includes a portrait rights database 35a as a data storage means for storing the generated portrait rights, a user database 35b for storing information about users, and an accumulated information database 35c for storing external information such as current events, weather, and economic information collected from the Internet. Each of these databases may be a single database, or may be divided into multiple databases and a relational database in which each data is linked to the other data by setting relationships between them.

肖声権データベース３５ａは、肖声権データの少なくとも一部を暗号化して記憶するデータ保管手段である。この肖声権データベース３５ａに蓄積されるデータとしては、各ユーザーからアップロードされた声データと、その声データを取得した時点における時間情報及び位置情報に、個人情報データが紐付けられて、これらの情報が肖声権として記録されている。なお、肖声権データベース３５ａは、高度なセキュリティ機能を備えており、ユーザーの声データとそれに関連する情報を安全に保管するための重要な役割を果たしている。このデータベースの中の各データ項目は、ユニークな識別子で管理され、不正アクセスやデータの改ざんを防ぐための多層的なセキュリティ対策が施されている。 The ivory right database 35a is a data storage means that encrypts and stores at least a portion of the ivory right data. The data stored in this ivory right database 35a includes voice data uploaded by each user, time information and location information at the time the voice data was acquired, and personal information data linked to it, and this information is recorded as ivory right. The ivory right database 35a is equipped with advanced security functions and plays an important role in safely storing users' voice data and related information. Each data item in this database is managed by a unique identifier, and multi-layered security measures are implemented to prevent unauthorized access and data tampering.

ユーザーデータベース３５ｂに蓄積される情報としては、利用者又は利用者が使用している情報処理端末を特定する識別子（ユーザーＩＤ、端末ＩＤ）と、パスワード等とを紐付けた認証情報を含み、ユーザーＩＤに紐付けられた利用者の個人情報や、端末装置の機種なども含まれる。また、ユーザーデータベース３５ｂには、利用者ごと或いは利用者端末ごとの認証履歴（アクセス履歴）や、肖声権データベース３５ａとのリレーションによりユーザーごとの肖声権に関する情報（関連情報を含む。）も蓄積される。 The information stored in user database 35b includes authentication information that links an identifier (user ID, terminal ID) that identifies the user or the information processing terminal used by the user with a password, etc., and also includes personal information of the user linked to the user ID and the model of the terminal device. In addition, user database 35b also stores authentication history (access history) for each user or each user terminal, and information (including related information) regarding the portrait right for each user through a relationship with portrait right database 35a.

また、ユーザーデータベース３５ｂは、ユーザーの詳細な認証や身元確認を目的として、免許証、パスポート、マイナンバーカードなどの公的な身分証明書の画像情報も蓄積することができる。そして、ユーザーデータベース３５ｂは、これら身分証明書から抽出された画像データやテキストデータも蓄積されており、免許証の顔写真や氏名、住所、発行日、有効期限などの情報、パスポートの顔写真、氏名、国籍、発行日、有効期限などの情報、マイナンバーカードの顔写真、氏名、住所、発行日、有効期限、マイナンバーなどの情報もユーザー情報として蓄積される。なお、これらの個人情報は、非常に機密性が高いため、強力な暗号技術で暗号化されて保存され、このような身分証明書の画像情報及びテキストデータ等は、ユーザーの身元確認、年齢確認、取引の際の本人確認など、サービス利用における安全確保や法的要件を満たすために使用され、例えば、肖声権の売買や移転を行う際、取引の両当事者の身元確認を行い、不正や詐欺を防ぐためにこれらの情報が参照される。 In addition, the user database 35b can also store image information of official identification documents such as driver's licenses, passports, and My Number cards for the purpose of detailed authentication and identity confirmation of users. The user database 35b also stores image data and text data extracted from these identification documents, and information such as the face photo, name, address, issue date, and expiration date of the driver's license, information such as the face photo, name, nationality, issue date, and expiration date of the passport, and information such as the face photo, name, address, issue date, expiration date, and My Number card are stored as user information. Since this personal information is highly confidential, it is encrypted and stored using powerful encryption technology. Such image information and text data of identification documents are used to ensure safety and meet legal requirements for the use of services, such as user identity confirmation, age confirmation, and identity confirmation during transactions. For example, when buying, selling, or transferring portrait rights, this information is referenced to confirm the identities of both parties in the transaction and prevent fraud and dishonesty.

蓄積情報データベース３５ｃに蓄積される情報としては、時事ニュース情報、天気情報、交通情報、株価・為替情報、不動産価格情報、スポーツ・エンターテイメント情報などの外部情報が含まれる。詳述すると、蓄積情報データベース３５ｃは、肖声権データが生成された際の社会的背景や環境を捉えるための情報を集約しており、これにより、声データだけでなく、その声データが取得された時期の社会環境や文化的背景を再現・理解することが可能となる。具体的には、時事ニュース情報はその時代の重要な出来事やトピックを反映しており、天気情報はその時点での気候や天候状況、交通情報は移動手段や交通の混雑状況、株価・為替情報は経済の動向、不動産価格情報は地域の経済状況や人口動態、スポーツ・エンターテイメント情報はその時代の人々の娯楽や関心事を示すものとして蓄積されている。 The information stored in the accumulated information database 35c includes external information such as current affairs news information, weather information, traffic information, stock price and exchange rate information, real estate price information, and sports and entertainment information. More specifically, the accumulated information database 35c aggregates information for capturing the social background and environment at the time the voice data was generated, which makes it possible to reproduce and understand not only the voice data but also the social environment and cultural background at the time the voice data was acquired. Specifically, current affairs news information reflects important events and topics of the time, weather information is the climate and weather conditions at the time, traffic information is the means of transportation and traffic congestion, stock price and exchange rate information is economic trends, real estate price information is the local economic situation and demographics, and sports and entertainment information is stored as information indicating people's entertainment and interests at that time.

認証部３３は、通信インターフェース３１を通じて、各スマートフォン５と通信セッションを確立させ、その確立された通信セッションごとに認証処理を行うモジュールである。この認証処理としては、アクセス者である利用者のスマートフォン５から認証情報を取得し、ユーザーデータベース３５ｂを参照して、利用者等を特定し、それらの権限を認証する。この認証部３３による認証結果（ユーザーＩＤ、認証時刻、セッションＩＤ等）は、肖声権データ制御部３６に送出されるとともに、ユーザーデータベース３５ｂに認証履歴として蓄積される。 The authentication unit 33 is a module that establishes a communication session with each smartphone 5 through the communication interface 31 and performs authentication processing for each established communication session. This authentication processing involves obtaining authentication information from the smartphone 5 of the accessing user, referencing the user database 35b, identifying the user, etc., and authenticating their authority. The authentication results by this authentication unit 33 (user ID, authentication time, session ID, etc.) are sent to the authentication right data control unit 36 and are also stored in the user database 35b as authentication history.

位置情報管理部３２は、利用者のスマートフォン５側で取得されてボイスライツ管理サーバー３に送信される位置情報を取得するモジュールであり、位置情報管理部３２は、認証部３３による認証処理によって特定されたユーザーやユーザー端末装置の識別子（ユーザーＩＤ、端末ＩＤ等）と、それらの位置情報とを紐付けてユーザーデータベース３５ｂに利用履歴として蓄積する。 The location information management unit 32 is a module that acquires location information acquired by the user's smartphone 5 and sent to the voice rights management server 3. The location information management unit 32 links the identifiers (user ID, terminal ID, etc.) of the user or user terminal device identified by the authentication process by the authentication unit 33 with the location information, and stores it as a usage history in the user database 35b.

肖声権データ制御部３６は、肖声権データの生成（発行）、更新、移転及び消却を実行し、管理するモジュールであり保証システム連携部３６ｃと連携して各肖声権データの生成、更新、移転又は消却を実行すべく、ブロックチェーンの更新を行う。具体的に肖声権データ制御部３６は、声データ制御部３６ａと、関連データ制御部３６ｂと、保証システム連携部３６ｃとを備えている。 The portrait right data control unit 36 is a module that executes and manages the generation (issuance), update, transfer, and cancellation of portrait right data, and updates the blockchain in order to execute the generation, update, transfer, or cancellation of each portrait right data in cooperation with the guarantee system cooperation unit 36c. Specifically, the portrait right data control unit 36 includes a portrait data control unit 36a, a related data control unit 36b, and a guarantee system cooperation unit 36c.

声データ制御部３６ａは、ユーザーの声の音響信号を取得して声データを生成する声データ生成部の機能を有するモジュールである。関連データ制御部３６ｂは、個人情報を含む関連データを取得する関連データ取得部の機能を有するモジュールである。そして、肖声権データ制御部３６は、声データ制御部３６ａが生成した声データと、関連データ制御部３６ｂが取得した関連データとを関連づけて肖声権データとして生成する。 The voice data control unit 36a is a module that functions as a voice data generation unit that acquires an acoustic signal of the user's voice and generates voice data. The related data control unit 36b is a module that functions as a related data acquisition unit that acquires related data including personal information. The voice data control unit 36 associates the voice data generated by the voice data control unit 36a with the related data acquired by the related data control unit 36b to generate voice data.

保証システム連携部３６ｃは、ネットワーク上の保証システム４に対して、肖声権に関する情報その他、セキュリティ管理、移転記録の保管など、肖声権管理に必要な処理を保証システム４に依頼し、協働して処理を進めるモジュールである。肖声権データ制御部３６は、この保証システム連携部３６ｃを通じて保証システム４と連携をとることによって、肖声権の生成、変遷、及び移転を管理する。例えば、肖声権の取得者に関する公開アドレスを追加して、ブロックチェーンで証明される各種権利の所有者を変更することにより、各所有権を移転する。 The guarantee system linking unit 36c is a module that requests the guarantee system 4 on the network to perform the processes necessary for ivory right management, such as information related to the ivory right and other security management and storage of transfer records, and cooperates with the guarantee system 4 to proceed with the processes. The ivory right data control unit 36 manages the generation, transition, and transfer of ivory rights by linking with the guarantee system 4 through the guarantee system linking unit 36c. For example, each ownership is transferred by adding a public address related to the acquirer of the ivory right and changing the owner of the various rights certified by the blockchain.

ブロックチェーンインターフェースサービスである保証システム４は、肖声権データの少なくとも一部を暗号化して記憶する複数のノードを備える。これらのノードは、肖声権データに含まれる声データ及び関連情報とを紐付けてそれらの情報の少なくとも一部を所定のタイミングで集約してブロック化し、このブロックを既存のブロックに連結してブロックチェーンを形成し、ブロックチェーンを複数のノードで共有して分散台帳として記憶する。 The guarantee system 4, which is a blockchain interface service, has multiple nodes that encrypt and store at least a portion of the signature rights data. These nodes link the signature data and related information contained in the signature rights data, aggregate at least a portion of the information at a predetermined timing, and form blocks. These blocks are linked to existing blocks to form a blockchain, which is shared by multiple nodes and stored as a distributed ledger.

情報配信部３４は、ユーザーからのアクセスに応じて、ボイスライツ管理サービスに関するＷｅｂサイトを提示し、各ユーザーに固有のマイページなどを提供し、各ユーザーが所有する肖声権に関する情報を、個別的に閲覧できるようにするモジュールである。また、肖声権（ボイスライツ）をＮＦＴとして登録したり、取引したりするためのＮＦＴの取引プラットフォームを提示する機能も備えており、生成された、ＮＦＴ取引プラットフォームのグラフィックやユーザーインターフェースを、通信インターフェース３１を通じて，各ユーザーに配信する。 The information distribution unit 34 is a module that, in response to access from users, presents a website related to the voice rights management service, provides each user with a unique My Page, and enables each user to individually view information regarding the voice rights they own. It also has a function to present an NFT trading platform for registering and trading voice rights as NFTs, and distributes the generated graphics and user interface of the NFT trading platform to each user via the communication interface 31.

（２）スマートフォン５
次いで、スマートフォン５の内部構成について説明する。図３（ａ）及び（ｂ）に示すように、スマートフォン５は、通信インターフェース５１と、入力インターフェース５２と、出力インターフェース５３と、アプリケーション実行部５４と、メモリ５５とから概略構成されている。 (2) Smartphone 5
Next, a description will be given of the internal configuration of the smartphone 5. As shown in Figures 3(a) and (b), the smartphone 5 is roughly composed of a communication interface 51, an input interface 52, an output interface 53, an application execution unit 54, and a memory 55.

通信インターフェース５１は、通話若しくはデータ通信を行うための通信デバイスであり、無線等による非接触通信や、ケーブル、アダプタ手段等により接触（有線）通信をする機能を備えている。入力インターフェース５２は、マウス、キーボード、操作ボタンやタッチパネル５２ａなどユーザー操作を入力するデバイスである。また、出力インターフェース５３は、ディスプレイやスピーカーなど、映像や音響を出力するデバイスである。特に、この出力インターフェース５３には、液晶ディスプレイなどの表示部５３ａが含まれ、この表示部は、入力インターフェースであるタッチパネル５２ａに重畳されている。 The communication interface 51 is a communication device for making calls or data communications, and has the function of non-contact communication by radio or the like, and contact (wired) communication by a cable, adapter means, or the like. The input interface 52 is a device for inputting user operations, such as a mouse, keyboard, operation buttons, or touch panel 52a. The output interface 53 is a device for outputting video and audio, such as a display or speaker. In particular, the output interface 53 includes a display unit 53a such as a liquid crystal display, which is superimposed on the touch panel 52a, which is the input interface.

メモリ５５は、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）やファームウェア、各種のアプリケーション用のプログラム、その他のデータ等などを記憶する記憶装置であり、このメモリ５５内には、ユーザーを識別するユーザーＩＤの他、ボイスライツ管理サーバー３からダウンロードしたアプリケーションデータを蓄積するとともに、アプリケーション実行部５４で処理された各種データ等が蓄積される。特に、本実施形態では、メモリ５５には、ボイスライツ管理サーバー３から取得された権利情報等が格納されている。 Memory 55 is a storage device that stores the OS (Operating System), firmware, programs for various applications, and other data. In addition to a user ID that identifies the user, this memory 55 also stores application data downloaded from the voice rights management server 3, as well as various data processed by the application execution unit 54. In particular, in this embodiment, memory 55 stores rights information obtained from the voice rights management server 3.

アプリケーション実行部５４は、一般のＯＳやゲームアプリケーション、ブラウザソフトなどのアプリケーションを実行するモジュールであり、通常はＣＰＵ等により実現される。このアプリケーション実行部５４では、本発明に係るボイスライツ管理アプリケーションのプログラムが実行されることで、肖声権管理部５４３、表示データ生成部５４６、表示制御部５４５，位置情報取得部５４４が仮想的に構築される。 The application execution unit 54 is a module that executes applications such as a general OS, game applications, and browser software, and is usually realized by a CPU, etc. In this application execution unit 54, a voice rights management unit 543, a display data generation unit 546, a display control unit 545, and a location information acquisition unit 544 are virtually constructed by executing the program of the voice rights management application according to the present invention.

肖声権管理部５４３は、スマートフォン５上での肖声権データの生成、取得、変更、削除などの操作を行う機能を持つモジュールである。特に肖声権管理部５４３は、声データＤ１に個人情報Ｄ２を関連づけて肖声権データＤ３として生成する肖声権データ生成部としての機能を有している。そして、肖声権管理部５４３は、生成された肖声権データは、ユーザーＩＤや時刻、位置情報などのメタデータと結びつけてメモリ５５に保存される。本実施形態では、肖声権管理部５４３は、声データ生成部５４１と、関連データ取得部５４２と、同期処理部５４０とを備えている。 The voice right management unit 543 is a module having a function of performing operations such as generating, acquiring, changing, and deleting voice right data on the smartphone 5. In particular, the voice right management unit 543 functions as a voice right data generation unit that generates voice data D3 by associating personal information D2 with voice data D1. The voice right management unit 543 then associates the generated voice right data with metadata such as a user ID, time, and location information and stores it in the memory 55. In this embodiment, the voice right management unit 543 includes a voice data generation unit 541, an associated data acquisition unit 542, and a synchronization processing unit 540.

声データ生成部５４１は、音響信号を取得して声データを生成するモジュールであり、ユーザーからの声を音響信号の入力として受け取り、それをデジタルデータとして録音・保存するモジュールである。ユーザーがアプリケーション上で「声を録音する」などの操作を行うと、この部分が活性化され、内蔵マイクロフォン又は外部のマイクロフォンを使用して声のデータを取得し、それを適切なフォーマットでエンコードして保存する。さらに、録音した声データにタイムスタンプを付与し、そのデータがいつ取得されたのかについての情報も保存する。 The voice data generation unit 541 is a module that acquires an acoustic signal and generates voice data; it receives the user's voice as an input audio signal, and records and saves it as digital data. When the user performs an operation such as "record voice" on the application, this part is activated and acquires voice data using the built-in microphone or an external microphone, and encodes and saves it in an appropriate format. In addition, it assigns a timestamp to the recorded voice data, and also saves information about when the data was acquired.

本実施形態では、声データ生成部５４１は、言語変換部５４１ａ及び声質変換部５４１ｂを備えている。
言語変換部５４１ａは、音声特徴抽出部５４２ａが抽出した特徴パラメータに基づいて声データの言語を変換するモジュールである。具体的には、言語変換部５４１ａは、音声認識処理によりユーザーが発話した音声データ中のフレーズや単語を認識し、それに対応するテキストについて略語、省略形、スラングなどの非標準的な形式を標準的な形式に変換したうえで、目的の言語に翻訳する。この翻訳処理では、辞書データを参照したり、ニューラルネットワークベースのモデルを使用して翻訳を行い、必要に応じて、例えば、口語的なスタイルや特定のアクセントを持つ方言などのような非標準的な形式や文体に変換する。このように言語が変換されたテキストデータは、声データ生成部５４１の音声合成エンジンにより、声データとして再生成される。 In this embodiment, the voice data generation unit 541 includes a language conversion unit 541a and a voice quality conversion unit 541b.
The language conversion unit 541a is a module that converts the language of the voice data based on the feature parameters extracted by the voice feature extraction unit 542a. Specifically, the language conversion unit 541a recognizes phrases and words in the voice data uttered by the user through a voice recognition process, converts non-standard formats such as abbreviations, shorthands, and slang for the corresponding text into a standard format, and translates the text into the target language. In this translation process, translation is performed by referring to dictionary data or using a neural network-based model, and, as necessary, converts the text into a non-standard format or style such as a colloquial style or a dialect with a specific accent. The text data whose language has been converted in this way is regenerated as voice data by the voice synthesis engine of the voice data generation unit 541.

一方、声質変換部５４１ｂは、音声特徴抽出部５４２ａが抽出した特徴パラメータに基づいて声データの声質を変換するモジュールである。声質変換部５４１ｂによる音声変換は、信号処理の技術を用いて入力音声の特性を変更し、オリジナルの音声とは異なる音声を再生成する。具体的には、声質変換部５４１ｂによる音声変換処理としては、音声のピッチ（周波数）を変更することで音声の高さを変えたり、音声の速度（テンポ）を変更して音声の継続時間を短縮・延長する。この際、ピッチシフト技術を使用して、音声の速度を変えずにピッチだけを変更したり、ピッチを変更せずにテンポだけを変更することもできる。 On the other hand, the voice quality conversion unit 541b is a module that converts the voice quality of the voice data based on the feature parameters extracted by the voice feature extraction unit 542a. Voice conversion by the voice quality conversion unit 541b uses signal processing technology to change the characteristics of the input voice and regenerate a voice that is different from the original voice. Specifically, the voice conversion process by the voice quality conversion unit 541b involves changing the pitch (frequency) of the voice to change the pitch, or changing the speed (tempo) of the voice to shorten or extend the duration of the voice. At this time, it is also possible to use pitch shift technology to change only the pitch without changing the speed of the voice, or to change only the tempo without changing the pitch.

また、声質変換部５４１ｂは、音声の特性を決定する要素であるフォルマントを変更することで、異なる年齢や性別の声に変換でき、またエコーやリバーブを追加することで、音声に残響や遅延を付加したり、モジュレーションエフェクトやフィルタリングにより音声信号の周波数や振幅を変調したり、所定の周波数帯域だけを削除することで、ロボットの声や宇宙人の声などの特殊な音声効果を生み出すこともできる。声質変換部５４１ｂでは、これらの処理を組み合わせることで、さまざまな音声効果を生成できるとともに、ユーザーの設定操作により詳細な調整やカスタマイズが可能となっており、リアルな変換や特殊な音声効果を作成することができる。 The voice conversion unit 541b can also convert the voice to that of a different age or gender by changing the formant, which is an element that determines the characteristics of the voice, and can also add echo or reverb to add reverberation or delay to the voice, modulate the frequency or amplitude of the audio signal with modulation effects or filtering, or remove specific frequency bands to create special voice effects such as the voice of a robot or an alien. By combining these processes, the voice conversion unit 541b can generate a variety of voice effects, and detailed adjustments and customization are possible through user settings operations, making it possible to create realistic conversions and special voice effects.

また、声データ生成部５４１における声データの生成に際し、ユーザーから取得した声データの特徴パラメータを引き継いで、楽曲を歌わせる機能も備えている。具体的に声データ生成部５４１は、多様なフレーズや単語、音高、発音をカバーするために、さまざまな文章や歌の音声を録音等により取得し、その音声について、音声データをノイズ除去や正規化などの前処理を行い、音声特徴抽出部５４２ａを通じてメル周波数ケプストラム係数 (MFCC)、ピッチ、発音速度、音の強度などの特徴パラメータを抽出する。そして、抽出された特徴パラメータを基に、音声合成モデルを構築し、歌詞等のテキストやメロディ情報を元に、対象となる人物の声の音声を合成する。 In addition, when generating voice data in the voice data generation unit 541, it also has a function of singing a song by inheriting the feature parameters of the voice data acquired from the user. Specifically, in order to cover a variety of phrases, words, pitches, and pronunciations, the voice data generation unit 541 acquires the voice of various sentences and songs by recording, etc., and performs preprocessing such as noise removal and normalization on the voice data for the voice, and extracts feature parameters such as Mel Frequency Cepstral Coefficients (MFCC), pitch, pronunciation speed, and sound intensity via the voice feature extraction unit 542a. Then, based on the extracted feature parameters, a voice synthesis model is constructed, and the voice of the target person is synthesized based on text such as lyrics and melody information.

他方、関連データ取得部５４２は、声データと一緒に保存すべき、個人情報を含む関連データを取得するモジュールである。これには、例えば、声データの録音時の位置情報、ユーザーのデバイス情報、録音環境（例：屋内、屋外、騒音の有無など）や、その他のメタデータなどが含まれる。これらの関連データは、声データの解析や利用時に、より詳細なコンテキストや背景情報を提供することができる。本実施形態において関連データ取得部５４２は、音声特徴抽出部５４２ａと、変換パラメータ記録部５４２ｂと、加工情報制御部５４２ｃとを有している。 On the other hand, the related data acquisition unit 542 is a module that acquires related data, including personal information, to be stored together with the voice data. This includes, for example, location information at the time of recording the voice data, information about the user's device, the recording environment (e.g., indoors, outdoors, presence or absence of noise, etc.), and other metadata. These related data can provide more detailed context and background information when analyzing and using the voice data. In this embodiment, the related data acquisition unit 542 has a voice feature extraction unit 542a, a conversion parameter recording unit 542b, and a processing information control unit 542c.

音声特徴抽出部５４２ａは、声データに含まれる周波数、持続時間、振幅、強度又は周波数成分のいずれかを特徴パラメータとして抽出するモジュールである。具体的に音声特徴抽出部５４２ａは、個人の声のピッチを示す基本周波数や、個人の母音の特性を示す共鳴周波数（フォルマント周波数）、個人の声の特定の特性や紋様を捉えるためのメル周波数ケプストラム係数 (MFCCs): 、声帯の振動パターン、声の周波数成分の特性であるスペクトルエンベロープ、個人の声の強度や力（エネルギー）、スペクトル形状などが抽出される。 The voice feature extraction unit 542a is a module that extracts any of the frequency, duration, amplitude, intensity, or frequency components contained in the voice data as feature parameters. Specifically, the voice feature extraction unit 542a extracts the fundamental frequency that indicates the pitch of an individual's voice, the resonant frequency (formant frequency) that indicates the characteristics of an individual's vowels, the Mel-Frequency Cepstral Coefficients (MFCCs) for capturing specific characteristics and patterns of an individual's voice, the vibration pattern of the vocal cords, the spectral envelope that is a characteristic of the frequency components of the voice, the intensity or power (energy) of an individual's voice, the spectral shape, etc.

関連データ取得部５４２は、声データ生成部５４１で声質又は言語を変換した際に用いられた各種パラメータを関連データに含めて記録するモジュールである。特にこの変換パラメータ記録部５４２ｂの変換パラメータ記録部５４２ｂは、声データ生成部５４１で生成、変換、合成など、加工された後の声データの特徴パラメータも、変換パラメータとして記録する機能を提供している。 The related data acquisition unit 542 is a module that records various parameters used when the voice quality or language is converted by the voice data generation unit 541, including them in the related data. In particular, the conversion parameter recording unit 542b of this conversion parameter recording unit 542b provides a function to record, as conversion parameters, the characteristic parameters of the voice data after it has been processed, such as generated, converted, or synthesized, by the voice data generation unit 541.

加工情報制御部５４２ｃは、声データ生成部５４１で行われる音声の生成、変換、合成、加工等の処理に必要な各種情報を加工情報として制御するモジュールである。この加工情報としては、音声合成或いは音声変換のために必要な変換パラメータ（ピッチ、テンポ、フォルマント、エフェクト等）の他、プリセット情報、テキストデータ、キャラクター設定、楽曲データなどが含まれる。ここで、テキストデータとしては、文章や歌詞などのテキストのほか、言語、発音、アクセント、強調、スピード、特定の言語や方言での発音、単語やフレーズの強調度合いの調整などに必要な情報をテキスト化して記述されたものが含まれる。キャラクター設定としては、合成音声の人物（人、動物、クリーチャー等）の性別、年齢、性格、感情、国籍など、キャラクターの属性や特性に合わせて、音声のトーンや抑揚を調整するために必要な情報が含まれる。さらに、楽曲データとしては、メロディ、リズム、テンポ、音量のほか、楽曲のメロディやリズムに合わせて、音声の高さやスピードを変更したり、特定の部分での強調や、音量の調整に必要な情報が含まれる。 The processing information control unit 542c is a module that controls various information required for the processing of voice generation, conversion, synthesis, processing, etc. performed by the voice data generation unit 541 as processing information. This processing information includes conversion parameters (pitch, tempo, formant, effects, etc.) required for voice synthesis or voice conversion, as well as preset information, text data, character settings, music data, etc. Here, the text data includes text such as sentences and lyrics, as well as information required for language, pronunciation, accent, emphasis, speed, pronunciation in a specific language or dialect, and adjustment of the degree of emphasis of words and phrases, which are written in text form. The character settings include information required to adjust the tone and intonation of the voice in accordance with the attributes and characteristics of the character, such as the gender, age, personality, emotions, and nationality of the person (person, animal, creature, etc.) of the synthetic voice. Furthermore, the music data includes melody, rhythm, tempo, volume, as well as information required to change the pitch and speed of the voice in accordance with the melody and rhythm of the music, and to adjust the volume and emphasize specific parts.

なお、これらの加工情報は、プリセット情報として、ユーザーが事前に設定したり、またはソフトウェアが提供したりする、特定の声のプロファイルとすることができ、これらの情報を適切に組み合わせることで、多彩な音声変換や音声合成が可能であり、ユーザーの要望や目的に合わせて、これらの変換パラメータを調整することで、より自然でリアルな音声を生成することができる。 This processing information can be preset information, that is, a specific voice profile that can be set in advance by the user or provided by the software. By appropriately combining this information, a wide variety of voice conversions and voice synthesis are possible, and by adjusting these conversion parameters according to the user's requests and purposes, a more natural and realistic voice can be generated.

そして、このような加工情報は、例えば、ＧＵＩ制御部５４５ａにより提供されるグラフィカルユーザーインターフェース（ＧＵＩ）の、スライダーやドロップダウンメニュー、チェックボックスなどの視覚的な要素を使用して、ユーザーが直感的にパラメータを調整でき、テキストボックスに直接情報やコマンドをテキストデータとして入力してもよい。また、例えば、「老人」や「怒った猫」といったように、上記各種設定をプリセットとして用意しておき、それをユーザー操作による選択するようにしてもよい。また、これらのプロファイルや、音声データや楽曲データを関連データ取得部５４２に直接アップロードするようにしてもよく、アプリケーションが外部のサービスやデータベースと連携して、情報を自動的に取得・入力できるようにしてもよい。 Such processing information may be provided by, for example, a graphical user interface (GUI) provided by the GUI control unit 545a, with visual elements such as sliders, drop-down menus, and check boxes, allowing the user to intuitively adjust parameters, and information or commands may be entered directly into a text box as text data. In addition, the above-mentioned various settings may be prepared as presets, such as "old man" or "angry cat," and the user may select from these presets. These profiles, audio data, and music data may be uploaded directly to the related data acquisition unit 542, or the application may link with external services and databases to automatically acquire and input information.

同期処理部５４０は、スマートフォン５側でのボイスライツ管理処理と、ボイスライツ管理サーバー３側でのボイスライツ管理処理とを同期させるモジュールである。具体的には、ボイスライツ管理サーバー３側で、他のユーザーのキャラクターの位置や、オブジェクトの位置などに基づいて、発生し得るイベント処理を予測し、その発生条件をボイスライツ管理サーバー３側で発生させ、その条件をスマートフォン５側に送信し、この発生条件を同期処理部５４０で受信し、実際のイベント処理の発生やそのためのグラフィック処理は、ボイスライツ管理サーバー３から受信した発生条件に基づいてスマートフォン５側のＧＵＩ制御部５４５ａで実行する。このスマートフォン５側のＧＵＩ制御部５４５ａで実行されたイベント処理の結果（バトルやミニゲームの勝敗や得点等）は、同期処理部５４０を通じてボイスライツ管理サーバー３側のＧＵＩ制御部５４５ａに通知され、その後のゲーム進行処理に反映される。 The synchronization processing unit 540 is a module that synchronizes the voice rights management processing on the smartphone 5 side with the voice rights management processing on the voice rights management server 3 side. Specifically, the voice rights management server 3 side predicts possible event processing based on the positions of other users' characters and objects, generates the conditions for the event processing on the voice rights management server 3 side, transmits the conditions to the smartphone 5 side, and receives the conditions for the event processing on the synchronization processing unit 540. The actual event processing and the graphic processing for the event processing are executed by the GUI control unit 545a on the smartphone 5 side based on the conditions for the event processing received from the voice rights management server 3. The results of the event processing executed by the GUI control unit 545a on the smartphone 5 side (such as the outcome of a battle or mini-game, or points) are notified to the GUI control unit 545a on the voice rights management server 3 side through the synchronization processing unit 540, and are reflected in the subsequent game progress processing.

前記表示データ生成部５４６は、表示部５３ａに表示させるための表示データを生成するモジュールである。表示データは、グラフィックデータの他、画像データや、文字データ、動画データ、音声その他のデータを組み合わせて生成されるデータであり、例えば、肖声権のリスト表示や、特定の肖声権データの詳細表示などの画面を生成する。この表示データ生成部５４６で生成された表示データの表示処理は、表示制御部５４５で制御される。 The display data generating unit 546 is a module that generates display data to be displayed on the display unit 53a. The display data is data that is generated by combining graphic data, image data, character data, video data, audio data, and other data, and generates a screen that displays, for example, a list of portrait rights or a detailed display of specific portrait rights data. The display process of the display data generated by this display data generating unit 546 is controlled by the display control unit 545.

表示制御部５４５は，表示データ生成部５４６が生成したデータをもとに、具体的な画面表示の制御を行うモジュールであり、本実施形態では、表示制御部５４５はＧＵＩ制御部５４５ａを有しており、タッチパネル５２ａや表示部５３ａを介してユーザーとのインタラクションを実現し、ユーザーの操作に応じて適切な画面遷移や情報の更新を行う。 The display control unit 545 is a module that controls the specific screen display based on the data generated by the display data generation unit 546. In this embodiment, the display control unit 545 has a GUI control unit 545a, and realizes interaction with the user via the touch panel 52a and the display unit 53a, and performs appropriate screen transitions and information updates in response to user operations.

ＧＵＩ制御部５４５ａは、スマートフォン５のディスプレイ上でのグラフィカルユーザーインターフェース（ＧＵＩ）の表示と制御を担当するモジュールである。このＧＵＩ制御部５４５ａは、ユーザーが直感的にアプリケーションを操作できるように、各種のボタン、メニュー、アイコン、ウィンドウなどのグラフィカルな要素をディスプレイ上に表示し、ユーザーからのタッチ操作などの入力に応じて適切な反応やアクションを行う。ＧＵＩ制御部５４５ａは、表示データ生成部５４６と連携して動作する。 The GUI control unit 545a is a module responsible for displaying and controlling the graphical user interface (GUI) on the display of the smartphone 5. This GUI control unit 545a displays graphical elements such as various buttons, menus, icons, and windows on the display so that the user can intuitively operate applications, and performs appropriate reactions and actions in response to inputs such as touch operations from the user. The GUI control unit 545a works in conjunction with the display data generation unit 546.

表示データ生成部５４６が生成したデータや情報を基に、ＧＵＩ制御部５４５ａはそれを視覚的かつ操作しやすい形でユーザーに提示する。例えば、肖声権のリストを表示する際、ＧＵＩ制御部５４５ａは、そのリストのデザインや配置、色使いなどのデザイン要素を制御する。さらに、ＧＵＩ制御部５４５ａは、ユーザーからの操作に応じて、適切なアプリケーション機能を呼び出す役割も持つ。例えば、ユーザーが「新しい声データを録音する」ボタンをタップした場合、その操作を受け取り、肖声権管理部５４３に命令を送って声データの録音プロセスを開始する。 Based on the data and information generated by the display data generation unit 546, the GUI control unit 545a presents it to the user in a visual and easy-to-operate format. For example, when displaying a list of voice rights, the GUI control unit 545a controls design elements such as the design, layout, and color usage of the list. Furthermore, the GUI control unit 545a also has the role of calling up appropriate application functions in response to user operations. For example, when the user taps the "Record new voice data" button, the GUI control unit 545a receives the operation and sends a command to the voice rights management unit 543 to start the voice data recording process.

位置情報取得部５４４は、スマートフォン５の内蔵ＧＰＳセンサーや無線基地局との通信情報を利用して、現在の位置情報を取得する機能を持つモジュールである。具体的に位置情報取得部５４４は、人工衛星を利用した全地球測位システム（ＧＰＳ：Global Positioning System）や、基地局からの電波強度及び基地局情報に基づく三角測位による基地局測位、ＷｉｆｉのＳＳＩＤ（Service SetID）及び電波状況と経度緯度を組み合わせたデータベースを利用したＷｉｆｉ測位等により取得する。 The location information acquisition unit 544 is a module that has a function of acquiring current location information using the built-in GPS sensor of the smartphone 5 and communication information with a wireless base station. Specifically, the location information acquisition unit 544 acquires information using the Global Positioning System (GPS) using artificial satellites, base station positioning using triangulation based on the signal strength from the base station and base station information, and Wi-Fi positioning using a database that combines the Wi-Fi SSID (Service Set ID) and signal conditions with longitude and latitude.

また、本実施形態において、位置情報取得部５４４は、時刻情報取得部５４４ａを有している。この位置情報取得部５４４で取得された位置情報及び時刻情報は、肖声権データの生成時に関連データであるメタデータとして結びつけられ、その声データがいつどこで取得されたのかの情報として保存される。この機能により、声のデータに時間的・地理的なコンテキストを付与することが可能となり、その声データの背後にある物語や背景を豊かにすることができる。 In addition, in this embodiment, the location information acquisition unit 544 has a time information acquisition unit 544a. The location information and time information acquired by this location information acquisition unit 544 are linked as metadata, which is related data, when the voice data is generated, and are saved as information on when and where the voice data was acquired. This function makes it possible to give the voice data a temporal and geographical context, thereby enriching the story and background behind the voice data.

（３）保証システム４
上述したように本実施形態では、ブロックチェーン技術を用いて肖声権の保証を行う保証システム４が設けられている。具体的に、保証システム４は、図４に示すように、通信インターフェース４３と、認証部４２と、権利管理実行部４４と、ＮＦＴ取引履歴データベース４１ａと、鍵情報データベース４１ｂと、アカウントデータベース４１ｃとを備えている。 (3) Guarantee System 4
As described above, in this embodiment, a guarantee system 4 that guarantees the identity of a person using blockchain technology is provided. Specifically, as shown in Fig. 4, the guarantee system 4 includes a communication interface 43, an authentication unit 42, a rights management execution unit 44, an NFT transaction history database 41a, a key information database 41b, and an account database 41c.

通信インターフェース４３は、通信ネットワーク２を通じて、他の通信機器とデータの送受信を行うモジュールであり、本実施形態では、各ボイスライツ管理サーバー３に接続されている。認証部４２は、アクセス者の正当性を検証するコンピューター或いはその機能を持ったソフトウェアであり、各ユーザーを特定するユーザーＩＤに基づいて認証処理を実行する。本実施形態では、通信ネットワーク２を通じてアクセス者のスマートフォン５から、ユーザー固有の公開アドレスや公開鍵、ユーザーＩＤ及びパスワード等を取得し、鍵情報データベース４１ｂを照合することによって、アクセス者にその権利があるか否かや、そのアクセス者が本人であるか否かなどを確認する。 The communication interface 43 is a module that transmits and receives data to and from other communication devices via the communication network 2, and in this embodiment is connected to each voice rights management server 3. The authentication unit 42 is a computer or software with that function that verifies the legitimacy of the accessor, and executes authentication processing based on a user ID that identifies each user. In this embodiment, the authentication unit 42 obtains the user's unique public address, public key, user ID, password, etc. from the accessor's smartphone 5 via the communication network 2, and verifies whether the accessor has the right and whether the accessor is the person in question by comparing them with the key information database 41b.

権利管理実行部４４は、肖声権をＮＦＴ（Non-Fungible Token）技術を通じて取り扱うモジュールである。本実施形態において肖声権も所謂デジタルアセットの一つであり、権利管理実行部４４を通じて、ブロックチェーン技術をベースとして、肖声権のユニークさや希少性が証明・保証されることとなる。権利管理実行部４４は、このＮＦＴ技術を採用し、肖声権データの独自性や希少性を確認・保証する。 The rights management execution unit 44 is a module that handles the votive rights through NFT (Non-Fungible Token) technology. In this embodiment, the votive rights are also one of the so-called digital assets, and the uniqueness and rarity of the votive rights are proven and guaranteed through the rights management execution unit 44 based on blockchain technology. The rights management execution unit 44 employs this NFT technology to confirm and guarantee the uniqueness and rarity of the votive rights data.

本実施形態では、ユーザーが生成した肖声権データをＮＦＴとして発行し、それに関連する情報や属性をブロックチェーン上に保存する。このプロセスを通じて、肖声権データのオリジナリティや所有権が確実に保護され、第三者による無許可の複製や配布を防ぐことができる。また、ＮＦＴとしての肖声権データは、市場で取引され、ユーザーは自らの声に関する権利をＮＦＴとして売買し、新たな経済的価値を創出することが可能となる。 In this embodiment, the voice rights data generated by the user is issued as an NFT, and related information and attributes are stored on the blockchain. Through this process, the originality and ownership of the voice rights data are reliably protected, and unauthorized copying and distribution by third parties can be prevented. Furthermore, the voice rights data as an NFT can be traded on the market, allowing users to buy and sell rights related to their own voices as NFTs, creating new economic value.

本実施形態において、具体的にこのＮＦＴにおける肖声権は、正当な所有者の公開アカウントに紐付けられ、公開アカウントの関係を提示することでその正当な権利者であることを証明することができ、また、正当な権利者でなければ、当該肖声権について譲渡等の移転手続きを実行することができないようになっている。 Specifically, in this embodiment, the portrait right in the NFT is linked to the public account of the legitimate owner, and one can prove that one is the legitimate owner by presenting the public account relationship. In addition, only the legitimate owner can carry out transfer procedures such as assignment of the portrait right.

権利管理実行部４４は、ボイスライツ管理サーバー３側の保証システム連携部３６ｃと連携する機能を有しており、この保証システム連携機能は、ＮＦＴを用いた肖声権管理に関するセキュリティ管理、取引記録、サービス履歴の保管など、権利管理に関する処理を、他のサービス機関の装置、例えば上記ボイスライツ管理サーバー３の保証システム連携部３６ｃから依頼され、これらの各装置とのやり取りによりＮＦＴを通じた、権利管理処理を進める。具体的に権利管理実行部４４は、権利情報提供部４４ａと、公開アドレス管理部４４ｂと、正当性検証部４４ｃと、データ更新部４４ｄとを備える。 The rights management execution unit 44 has a function of linking with the guarantee system linking unit 36c on the voice rights management server 3 side, and this guarantee system linking function is requested by devices of other service providers, for example the guarantee system linking unit 36c of the voice rights management server 3, to perform rights management-related processes such as security management related to voice rights management using NFT, transaction records, and storage of service history, and proceeds with rights management processes through NFT by interacting with each of these devices. Specifically, the rights management execution unit 44 includes a rights information providing unit 44a, a public address management unit 44b, a validity verification unit 44c, and a data update unit 44d.

権利情報提供部４４ａは、ユーザーが肖声権をＮＦＴとして発行した際の詳細な権利情報を、インターネット等を通じて提供するモジュールである。この提供される情報には、肖声権の価格、肖声権の作成日（ユーザーの年齢・年代等）、現在の所有者、及び何回取引されたかなどの移転利益などの情報が含まれる。 The rights information provider 44a is a module that provides, via the Internet or the like, detailed rights information when a user issues a cryptocurrency as an NFT. The provided information includes the price of the cryptocurrency, the date the cryptocurrency was created (the user's age, generation, etc.), the current owner, and transfer benefits such as how many times it has been traded.

公開アドレス管理部４４ｂは、ユーザーの公開アカウント（公開アドレス）を管理するモジュールであり、公開鍵暗号方式における公開鍵から生成されて特定のユーザーを識別するための公開アドレス、及び前記公開鍵とペアとなって公開鍵を特定可能な秘密鍵であって公開アドレスを介した肖声権管理の電子署名に利用される秘密鍵を発行する秘密鍵発行機能を有し、これらの発行された公開アドレス及びそれに関する鍵情報については、鍵情報データベース４１ｂに蓄積される。ユーザーがＮＦＴの取引を行う際、この公開アドレスが取引の正当性を証明する鍵となり、公開アドレスは、ブロックチェーン上でのユーザーの識別子として機能し、そのユーザーが所有するＮＦＴや取引履歴を一意に識別するために使用される。 The public address management unit 44b is a module that manages users' public accounts (public addresses) and has a private key issuance function that issues public addresses generated from a public key in a public key cryptosystem to identify a specific user, and private keys that can be paired with the public keys to identify the public key and are used for electronic signatures for voice rights management via the public addresses, and these issued public addresses and their associated key information are stored in the key information database 41b. When a user trades NFTs, this public address becomes the key that proves the legitimacy of the transaction, and the public address functions as the user's identifier on the blockchain and is used to uniquely identify the NFTs and transaction history owned by the user.

正当性検証部４４ｃは、肖声権の移転や取引が行われる際に、その操作が正当なものであるかを確認するモジュールであり、具体的には、公開アドレス管理部４４ｂで管理される公開アドレスと取引情報を照合し、取引が正当な所有者によって行われているかを検証する。これにより、不正な取引や第三者による不正アクセスを防止する。例えば、正当性検証部４４ｃは、ＮＦＴ取引に係る肖声権が、正当な取引を経て現在の所有者に属していることや、改ざんされていないことを検証し、公開アカウントに紐付けられた現所有者の公開鍵により、その肖声権の正当性を確認するとともに、その肖声権に係る全ての取引をＮＦＴ取引履歴データベース４１ａに蓄積しており、公開鍵に基づいてＮＦＴ取引履歴データベース４１ａを照合して、その正当性を確認できるようになっている。 The legitimacy verification unit 44c is a module that verifies whether an operation is legitimate when a transfer or transaction of a imitation right is performed. Specifically, it verifies the transaction information against the public address managed by the public address management unit 44b, and verifies whether the transaction is performed by the legitimate owner. This prevents fraudulent transactions and unauthorized access by third parties. For example, the legitimacy verification unit 44c verifies that the imitation right related to an NFT transaction belongs to the current owner through a legitimate transaction and has not been tampered with, and verifies the legitimacy of the imitation right using the public key of the current owner linked to the public account. It also accumulates all transactions related to the imitation right in the NFT transaction history database 41a, and can verify the legitimacy by checking the NFT transaction history database 41a based on the public key.

データ更新部４４ｄは、肖声権に関する情報の更新を行うモジュールである。ユーザーが肖声権の情報を変更したい場合や、肖声権を新たに取得した場合、この部分が活性化される。データ更新部は、新しい情報をブロックチェーンに追加し、以前の情報との整合性を保つための手続きを行う。例えば、データ更新部４４ｄは、ＮＦＴを通じた肖声権に係る情報を取得し、新しいＮＦＴ所有者に関する公開アドレスを追加して、分散台帳で証明されるトークン所有者を変更することにより、肖声権（ＮＦＴ）の所有権を移転する。このデータ更新部４４ｄによるデータの更新は、高度なセキュリティにより保護されており、二重譲渡や取引履歴の改ざんが強固に防御されるようになっている。 The data update unit 44d is a module that updates information related to the token. This part is activated when a user wishes to change the information on the token or when a new token is acquired. The data update unit adds new information to the blockchain and performs procedures to maintain consistency with previous information. For example, the data update unit 44d obtains information related to the token through the NFT, adds a public address related to the new NFT owner, and changes the token owner certified in the distributed ledger, thereby transferring the ownership of the token (NFT). The data updates by the data update unit 44d are protected by high-level security, and double transfers and tampering with transaction history are firmly prevented.

（肖声権の管理方法）
以上説明した肖声権管理システム１を動作させることによって、本発明の肖声権の管理方法を実施することができる。図５は、肖声権管理システムの動作を示すシーケンス図である。なお、以下で説明する処理手順は一例に過ぎず、各処理は可能な限り変更されてもよい。また、以下で説明する処理手順について、実施の形態に応じて、適宜、ステップの省略、置換及び追加が可能である。 (Method of managing portrait rights)
The method for managing a portrait right of the present invention can be implemented by operating the portrait right management system 1 described above. Fig. 5 is a sequence diagram showing the operation of the portrait right management system. Note that the processing procedure described below is merely an example, and each process may be modified as much as possible. Furthermore, steps may be omitted, replaced, or added to the processing procedure described below as appropriate depending on the embodiment.

先ず、ユーザー端末であるスマートフォン５から、ボイスライツ管理サーバー３へアクセスし、認証処理を開始する（Ｓ２０１）。具体的には、ユーザーＵ１がスマートフォン５上のボイスライツ管理アプリケーションを操作し、ボイスライツ管理サーバー３へのアクセスを試みる。ボイスライツ管理サーバー３との接続が成功すると、ユーザーに対してログイン情報としてユーザーＩＤとパスワードの入力が求められ、ユーザーがこれらの情報を正確に入力すると、サーバーはユーザーデータベース３５ｂに蓄積された認証情報と照合する。この認証情報が一致する場合、ユーザーの認証は成功し、ユーザーが特定される（Ｓ１０１）。 First, the voice rights management server 3 is accessed from the smartphone 5, which is the user terminal, and authentication processing begins (S201). Specifically, user U1 operates the voice rights management application on the smartphone 5 and attempts to access the voice rights management server 3. If the connection to the voice rights management server 3 is successful, the user is prompted to enter a user ID and password as login information, and when the user enters this information correctly, the server compares it with the authentication information stored in the user database 35b. If this authentication information matches, the user is successfully authenticated and the user is identified (S101).

ユーザー端末側では、認証処理が成功した場合に、声データの取得（Ｓ２０２）、関連データの取得（Ｓ２０３）が実行され、肖声権データが生成される（Ｓ２０４）。詳述すると、スマートフォン５側において、認証処理が成功した場合、声データ生成部５４１において、音響信号をデジタルデータとして録音・保存する。ここでは、ユーザーＵ１がアプリケーション上で「声を録音する」などの操作を実行すると、スマートフォン５の内蔵マイクロフォンや外部のマイクロフォンを介して声の音響信号が声データとして取得される。取得された声データは適切なフォーマットでエンコードされ、同時にタイムスタンプが付与されることで、データがいつ生成されたのかという情報も一緒に保存される。 On the user terminal side, if the authentication process is successful, voice data is acquired (S202), related data is acquired (S203), and voice record data is generated (S204). More specifically, if the authentication process is successful on the smartphone 5 side, the voice data generation unit 541 records and stores an acoustic signal as digital data. Here, when user U1 performs an operation such as "record voice" on the application, the voice acoustic signal is acquired as voice data via the smartphone 5's built-in microphone or an external microphone. The acquired voice data is encoded in an appropriate format, and a timestamp is added at the same time, so that information on when the data was generated is also stored.

次に、関連データ取得部５４２において、声データＤ１と合わせて保存すべき各種の個人情報Ｄ２が収集される。この関連データには、声を録音した場所の現在位置情報や時刻情報、ユーザーのデバイス情報、録音された環境の詳細（例えば、屋内での録音か屋外での録音か、背景に騒音があったかどうかなど）やその他のメタデータが含まれる。これらの情報は、将来、声データを解析や利用する際に、その声データがどのような状況下で録音されたのかという背景やコンテキストを提供する目的で蓄積される。 Next, the related data acquisition unit 542 collects various personal information D2 to be stored together with the voice data D1. This related data includes the current location information of the place where the voice was recorded, time information, user device information, details of the recording environment (e.g., whether the recording was made indoors or outdoors, whether there was noise in the background, etc.), and other metadata. This information is accumulated for the purpose of providing the background and context under which the voice data was recorded when the voice data is analyzed or used in the future.

なお、本実施形態では、必要に応じて声データの取得（ステップＳ２０２）及び関連データの取得（Ｓ２０３）に際し、必要に応じてオリジナルの声データの声質や言語、その他のパラメータを変換して声データを再生成して、この新たな声データに基づいて肖声権データを生成することもできる。図６に声データの再生成の手順を示す。 In this embodiment, when acquiring voice data (step S202) and related data (S203), the voice quality, language, and other parameters of the original voice data can be converted as necessary to regenerate the voice data, and the related voice data can be generated based on this new voice data. Figure 6 shows the procedure for regenerating voice data.

ステップＳ２０２において、音響信号が声データとして取得され、音声特徴抽出部５４２ａが、声データに含まれる周波数、持続時間、振幅、強度又は周波数成分のいずれかを特徴パラメータとして抽出する。この特徴パラメータは声質変換部５４１ｂ及び言語変換部５４１ａに渡される。 In step S202, an acoustic signal is acquired as voice data, and the voice feature extraction unit 542a extracts any one of the frequency, duration, amplitude, intensity, or frequency components contained in the voice data as feature parameters. These feature parameters are passed to the voice quality conversion unit 541b and the language conversion unit 541a.

ステップＳ２０２ａでは、声質変換部５４１ｂが、音声特徴抽出部５４２ａが抽出した特徴パラメータに基づいて声データの声質を変換する。このとき、声質変換部５４１ｂは、加工情報制御部５４２ｃを通じて、音声の生成、変換、合成、加工等の処理に必要な各種情報を加工情報として取得する。この加工情報としては、音声合成或いは音声変換のために必要な変換パラメータ（ピッチ、テンポ、フォルマント、エフェクト等）の他、プリセット情報、テキストデータ、キャラクター設定、楽曲データなどが含まれる。例えば、２０代の女性の声がオリジナルの声データであった場合に、周波数を上げたり下げたりすることで、声のピッチを高くしたり低くしたりすると同時に、一部のフォルマントを調整することによって、年齢や性別を変えることができ、高齢者の声にしたり、１０代のアイドルの声にすることができる。また、国籍や方言のように、特定のアクセントや発音の特性を再現するためにフィルターやエフェクトを適用したり、入力音声のピッチを大幅に上げたり下げたりすることで、鳥やネズミのような小動物の鳴き声や、ライオンや熊のような猛獣の声を再現したり、さらにエコーやリバーブ、フィルタリング、変調等のエフェクターを施すことにより宇宙人やクリーチャーの声を再現することができる。声質変換部５４１ｂでは、これらの処理を組み合わせることで、さまざまな声への変換を実現しており、またAIや機械学習を利用してよりリアルな声の変換を実現することもできる。 In step S202a, the voice quality conversion unit 541b converts the voice quality of the voice data based on the characteristic parameters extracted by the voice feature extraction unit 542a. At this time, the voice quality conversion unit 541b acquires various information necessary for processing such as voice generation, conversion, synthesis, and processing as processing information through the processing information control unit 542c. This processing information includes conversion parameters (pitch, tempo, formant, effect, etc.) necessary for voice synthesis or voice conversion, as well as preset information, text data, character settings, music data, etc. For example, if the original voice data is the voice of a woman in her twenties, the age and gender can be changed by raising or lowering the frequency to raise or lower the pitch of the voice and at the same time adjusting some formants, making it possible to make the voice of an elderly person or the voice of a teenage idol. In addition, filters and effects can be applied to reproduce specific accents and pronunciation characteristics, such as nationalities and dialects, and the pitch of the input voice can be significantly raised or lowered to reproduce the cries of small animals such as birds and mice, or the voices of wild animals such as lions and bears, and the voices of aliens and creatures can be reproduced by applying effects such as echo, reverb, filtering, and modulation. The voice quality conversion unit 541b can convert into various voices by combining these processes, and can also use AI and machine learning to achieve more realistic voice conversion.

ステップＳ２０２ｂにおいて、言語変換部５４１ａは、音声特徴抽出部５４２ａが抽出した特徴パラメータに基づいて声データの言語を変換する。具体的に言語変換部５４１ａは、音声認識処理によりユーザーが発話した音声データ中のフレーズや単語を認識し、それに対応するテキストについて略語、省略形、スラングなどの非標準的な形式を標準的な形式に変換したうえで、目的の言語に翻訳する。この翻訳処理では、辞書データを参照したり、ニューラルネットワークベースのモデルを使用して翻訳を行い、必要に応じて、例えば、口語的なスタイルや特定のアクセントを持つ方言などのような非標準的な形式や文体に変換する。 In step S202b, the language conversion unit 541a converts the language of the voice data based on the feature parameters extracted by the voice feature extraction unit 542a. Specifically, the language conversion unit 541a recognizes phrases and words in the voice data spoken by the user through voice recognition processing, converts non-standard formats such as abbreviations, shortened forms, and slang for the corresponding text into standard formats, and translates the text into the target language. In this translation processing, translation is performed by referring to dictionary data or using a neural network-based model, and, if necessary, converts the text into a non-standard format or style, such as a colloquial style or a dialect with a specific accent.

このように、声質及び言語を変換することにより、多様な声データを生成することができる。例えば、図６に示すように、２０代女性の声データを、話せないはずのイタリア語での会話音声としたり、英語で歌を歌う１０代のアイドルとしたり、各国の言語を話す猫や宇宙人といったキャラクターやクリーチャーの声としたり、年齢を高くして将来の声による会話音声を生成することができる。 In this way, by converting the voice quality and language, a variety of voice data can be generated. For example, as shown in FIG. 6, the voice data of a woman in her twenties can be converted into a conversation voice in Italian, which she is not supposed to be able to speak, into a teenage idol singing in English, into the voice of a character or creature such as a cat or an alien speaking the languages of various countries, or even into a conversation voice in the future by increasing the age.

そして、声質及び言語が変換されたテキストデータは、声データ生成部５４１の音声合成エンジンにより、声データとして再生成される（Ｓ２０２ｃ）。合わせて、変換パラメータ記録部５４２ｂは、関連データ取得部５４２を介して、声データ生成部５４１で声質又は言語を変換した際に用いられた各種変換パラメータ、加工データを関連データに含めて記録する（Ｓ２０３）。このとき、変換パラメータ記録部５４２ｂは、声データ生成部５４１で生成、変換、合成など、加工された後の声データの特徴パラメータも、変換パラメータとして記録する（Ｓ２０２ｄ）。 Then, the text data with converted voice quality and language is regenerated as voice data by the voice synthesis engine of the voice data generation unit 541 (S202c). At the same time, the conversion parameter recording unit 542b records, via the related data acquisition unit 542, the various conversion parameters and processing data used when the voice quality or language was converted by the voice data generation unit 541, including them in the related data (S203). At this time, the conversion parameter recording unit 542b also records, as conversion parameters, the characteristic parameters of the voice data after processing such as generation, conversion, and synthesis by the voice data generation unit 541 (S202d).

次いで、これらの情報が全て集まると、肖声権データが生成される（Ｓ２０４）。この生成されたデータは、声と関連データを組み合わせて一つのパッケージとしてまとめられ、ユーザーの唯一無二の肖声権として扱われる。そして、生成された肖声権データを、ボイスライツ管理サーバー３へアップロードし（Ｓ２０５）、ボイスライツ管理サーバー３で受領される（Ｓ１０２）。具体的には、スマートフォン５側での一連の処理が完了し、肖声権データＤ３が生成されると、この肖声権データＤ３は、スマートフォン５から通信ネットワーク２を経由して、ボイスライツ管理サーバー３へと肖声権データのアップロードが開始される（Ｓ２０５）。 Next, once all of this information has been collected, voice rights data is generated (S204). This generated data is packaged by combining the voice and related data, and is treated as the user's one and only voice rights. The generated voice rights data is then uploaded to the voice rights management server 3 (S205), and is received by the voice rights management server 3 (S102). Specifically, once a series of processes on the smartphone 5 side is completed and voice rights data D3 is generated, uploading of this voice rights data D3 from the smartphone 5 via the communication network 2 to the voice rights management server 3 begins (S205).

このアップロードを受けて、ボイスライツ管理サーバー３側では、送信されてきた肖声権データを受け取る処理が行われる（Ｓ１０２）。肖声権データがボイスライツ管理サーバー３側で正確に受け取られると、続いて、データの正当性の検証が実行される（Ｓ１０３）。正当性の検証では、受領した肖声権データが信頼性があるものであるか、改ざんや不正が疑われる要素がないか、といった点を中心として、ボイスライツ管理サーバー３上に保存されている既存の情報や、他のユーザーからアップロードされたデータとの照合を通じて、新しく受け取ったデータの信頼性を確認する。 Upon receiving this upload, the voice rights management server 3 carries out a process to receive the transmitted voice rights data (S102). Once the voice rights data has been accurately received by the voice rights management server 3, the validity of the data is then verified (S103). In validity verification, the reliability of the newly received data is confirmed by comparing it with existing information stored on the voice rights management server 3 and data uploaded by other users, focusing on whether the received voice rights data is reliable and whether there are any elements that may be suspected of tampering or fraud.

そして、肖声権データの正当性の検証が成功した後、ユーザーによって権利設定操作がなされ（Ｓ１０７）、ボイスライツ管理サーバー３側で権利設定処理が実行される（Ｓ１０８）。具体的には、肖声権データが肖声権データベース３５ａに登録されるとともに、保証システム４を通じてブロックチェーンに保存され（Ｓ１０５）、肖声権の設定が完了する（Ｓ１０６）。ここでの権利設定に際し、ユーザーの意向や設定内容をユーザー操作に基づいて設定され、データの管理や利用に関するルールや制約を確立する。すなわち設定された肖声権データは、すなわち肖声権データベース３５ａに登録される。併せて、本実施形態では、データベースへの登録だけではなく、データの信頼性や不変性を高めるために、保証システム４を介して、ブロックチェーン技術を利用したシステムにも保存される。ブロックチェーンにより、データの改ざんを極めて困難なものとすることにより、肖声権のデータの安全性や信頼性をが確保される。このようにして、ユーザーの声に関する権利、すなわち肖声権の設定が完了する（Ｓ１０６）。 After the validity of the voice rights data is successfully verified, the user performs a rights setting operation (S107), and the voice rights management server 3 executes a rights setting process (S108). Specifically, the voice rights data is registered in the voice rights database 35a and stored in the blockchain through the guarantee system 4 (S105), and the setting of the voice rights is completed (S106). When setting the rights here, the user's intentions and setting contents are set based on the user's operation, and rules and restrictions regarding the management and use of data are established. That is, the set voice rights data is registered in the voice rights database 35a. In addition, in this embodiment, in addition to registering in the database, in order to increase the reliability and immutability of the data, it is also stored in a system using blockchain technology via the guarantee system 4. The blockchain makes it extremely difficult to tamper with the data, thereby ensuring the safety and reliability of the voice rights data. In this way, the setting of the user's right regarding his/her voice, that is, the voice rights, is completed (S106).

この肖声権の設定が完了した後、権利が発生した旨を通知する権利発生通知がスマートフォン５に向けて送信され、この通知を受信したスマートフォン５側では（Ｓ２０８）、権利設定完了処理が実行され、ユーザー端末側のアプリケーションデータが更新される。 After the setting of the portrait right is completed, a right creation notification is sent to the smartphone 5 to inform the user that the right has been created. On the smartphone 5 side that receives this notification (S208), a right creation completion process is executed, and the application data on the user terminal side is updated.

（保証システムの動作）
ここで、上述した保証システムで採用している分散台帳システムの仕組みについて詳述する。本実施形態において保証システムは、ブロックチェーンインターフェースサービスを提供しており、このサービスでは、ボイスライツ管理サーバー３が生成したデータの少なくとも一部又は全部を記憶する複数のノードを備え、これらのノードは、記憶したデータを所定のタイミングで集約してブロック化し、該ブロックを用いてブロックチェーンを形成し、該ブロックチェーンを複数の前記ノードで共有して分散台帳として記憶する。 (Operation of the warranty system)
Here, the mechanism of the distributed ledger system adopted in the above-mentioned guarantee system will be described in detail. In this embodiment, the guarantee system provides a blockchain interface service, and this service includes a plurality of nodes that store at least a part or all of the data generated by the voice rights management server 3, and these nodes aggregate and block the stored data at a predetermined timing, form a blockchain using the blocks, and share the blockchain among the plurality of nodes and store it as a distributed ledger.

具体的には、図９に示すように、本実施形態に係る肖声権管理システム１は、肖声権に係るＮＦＴの発行・移転・消却にあたり、保証システム４を通じて、公開鍵暗号方式に基づく公開鍵ＰＫａと秘密鍵ＳＫａとの鍵ペアを発行するとともに、発行したＮＦＴに対応する公開鍵ＰＫａから公開アドレスＰＡａを生成する。この公開アドレスＰＡａは、肖声権取引契約における譲受人（購入者）及び譲渡人（権利者）を示すアドレスとして活用される一方、秘密鍵ＳＫａは、公開アドレスＰＡａを移転元とする取引の電子署名に利用される。 Specifically, as shown in FIG. 9, the national anthem management system 1 according to this embodiment issues a key pair of a public key PKa and a private key SKa based on a public key cryptosystem through a guarantee system 4 when issuing, transferring, or canceling an NFT related to a national anthem, and generates a public address PAa from the public key PKa corresponding to the issued NFT. This public address PAa is used as an address indicating the transferee (purchaser) and transferor (rights holder) in the national anthem trading contract, while the private key SKa is used for the electronic signature of the transaction in which the public address PAa is the transfer source.

本実施形態に係る肖声権管理はＰ２Ｐ（Peer-to-Peer）ネットワーク９０上の２つのノード間で行われ（ここでは、権利者及び購入者間）、その取引情報はＰ２Ｐネットワーク９０内の各ノード９０ａ～９０ｆにブロードキャストされて共有される。これにより、Ｐ２Ｐネットワーク９０上において、分散台帳システムによる取引履歴データベース（いわゆるブロックチェーン）が形成され、各種トークン及び肖声権管理の取引履歴が保存される。 The copyright management in this embodiment is carried out between two nodes on a P2P (Peer-to-Peer) network 90 (here, between the rights holder and the purchaser), and the transaction information is broadcast to and shared among the nodes 90a to 90f in the P2P network 90. As a result, a transaction history database (a so-called blockchain) based on a distributed ledger system is formed on the P2P network 90, and transaction history for various tokens and copyright management is stored.

本実施形態では、この分散台帳システムによる取引は、ボイスライツ管理サーバー３を通じて、ＮＦＴの発行や、その所有者の書き換えを行う際に、肖声権取引契約の実行、承認及び管理を実施する。肖声権取引の仲介人であるボイスライツ管理サーバー３は、権利者と購入者の間で取引の仲介をするために、ＮＦＴプラットフォームとして、ボイスライツ管理サーバー３に固有（個々の取引トランザクションに固有）の仲介用公開アドレスＰＡ３を生成して、取引対象となっているトークンが、ボイスライツ管理サーバー３に一時的に預けられているとして、トークンの移転の中継を行う。 In this embodiment, transactions using this distributed ledger system are performed, approved, and managed through the voice rights management server 3 when issuing NFTs or rewriting their owners. The voice rights management server 3, which acts as an intermediary in voice rights transactions, generates an intermediary public address PA3 that is unique to the voice rights management server 3 (unique to each individual transaction) as an NFT platform to mediate transactions between the rights holder and the purchaser, and relays the transfer of tokens, with the tokens being the subject of the transaction being temporarily deposited in the voice rights management server 3.

そして、取引の当事者（権利者及び購入者）は、肖声権管理システム１を利用して、現在の権利者から、ボイスライツ管理サーバー３に設定された今回の取引トランザクションに固有の仲介用公開アドレスＰＡ３へＮＦＴを移転させることで一旦受領し、さらに仲介用公開アドレスＰＡ３を介して、新たな購入者に対して移転させることで、権利者と購入者との間におけるＮＦＴプラットフォーム上における売買契約を成立させる。 The parties to the transaction (rights holder and purchaser) then use the voice rights management system 1 to transfer the NFT from the current rights holder to an intermediary public address PA3 that is unique to this transaction and set in the voice rights management server 3, thereby initially receiving the NFT, and then transferring it to the new purchaser via the intermediary public address PA3, thereby establishing a sales contract on the NFT platform between the rights holder and the purchaser.

ここで、ＮＦＴ移転処理について詳述する。図７は、本実施形態にかかる電力取引システムの移転時における処理手順を例示するフロー図であり、図８は、本実施形態に係る公開鍵と秘密鍵との関係を例示する。本実施形態では、ＮＦＴ移転処理及びＮＦＴ移転に関するアカウント処理を、本実施形態に係る分散台帳システムの仕組みを利用している。ここでは、ＮＦＴ取引プラットフォームを通じて、権利者が新規の権利者となる購入者に対し、ＮＦＴを販売する場合を例に説明する。このＮＦＴ取引には、図８に示すように、公開アドレス及び秘密鍵の発行ステップＳ３０１と、関連するサービス履歴情報の登録処理ステップＳ３０２と、権利移転処理の実行ステップＳ３０３とが含まれる。 Here, the NFT transfer process will be described in detail. FIG. 7 is a flow diagram illustrating the processing procedure during transfer of the energy trading system according to this embodiment, and FIG. 8 illustrates the relationship between the public key and the private key according to this embodiment. In this embodiment, the NFT transfer process and the account processing related to the NFT transfer utilize the mechanism of the distributed ledger system according to this embodiment. Here, an example will be described in which a rights holder sells an NFT to a purchaser who becomes a new rights holder through an NFT trading platform. As shown in FIG. 8, this NFT transaction includes step S301 of issuing a public address and a private key, step S302 of registering related service history information, and step S303 of executing the rights transfer process.

先ず、ステップＳ３０１において、保証システム４では、公開アドレス管理部４４ｂが、ＮＦＴ取引プラットフォームに固有の仲介用公開アドレスＰＡ３と、この仲介用公開アドレスＰＡ３に対応する秘密鍵ＳＫ３とのペアを発行している。具体的には、図８に例示されるように、保証システム４は、乱数発生器等を用いて、トークン取引プラットフォームに固有の公開アドレスに対応付けられた秘密鍵ＳＫ３を公開鍵暗号方式で生成する。乱数発生器は、例えば、プログラムとして公開アドレス管理部４４ｂに内蔵させていてもよい。 First, in step S301, in the guarantee system 4, the public address management unit 44b issues a pair of an intermediary public address PA3 unique to the NFT trading platform and a private key SK3 corresponding to this intermediary public address PA3. Specifically, as illustrated in FIG. 8, the guarantee system 4 uses a random number generator or the like to generate the private key SK3 associated with the public address unique to the token trading platform by public key cryptography. The random number generator may be built into the public address management unit 44b as a program, for example.

次に、保証システム４では、例えば、楕円曲線ＤＳＡ（Elliptic Curve Digital Signature Algorithm, ESDSA）等の電子署名のアルゴリズムに基づいて、秘密鍵ＳＫ３から公開鍵ＰＫ３を生成する。生成される公開鍵ＰＫ３と秘密鍵ＳＫ３とは公開鍵暗号方式における鍵ペアとなり、この公開鍵暗号方式の性質上、秘密鍵ＳＫ３から公開鍵ＰＫ３を生成することは可能であるものの、公開鍵ＰＫ３から秘密鍵ＳＫ３を生成することは計算量の観点から不可能に構成される。すなわち、公開鍵ＰＫ３から秘密鍵ＳＫ３を特定することはできないが、秘密鍵ＳＫ３から公開鍵ＰＫ３を特定することはできる。なお、利用する電子署名のアルゴリズムの種類は楕円曲線ＤＳＡに限定される訳ではなく、実施の形態に応じて適宜選択されてもよい。 Next, the guarantee system 4 generates a public key PK3 from the private key SK3 based on an electronic signature algorithm such as Elliptic Curve Digital Signature Algorithm (ESDSA). The generated public key PK3 and private key SK3 form a key pair in a public key cryptosystem, and due to the nature of this public key cryptosystem, it is possible to generate the public key PK3 from the private key SK3, but it is configured such that it is impossible to generate the private key SK3 from the public key PK3 in terms of the amount of calculation required. In other words, it is not possible to identify the private key SK3 from the public key PK3, but it is possible to identify the public key PK3 from the private key SK3. Note that the type of electronic signature algorithm used is not limited to Elliptic Curve DSA, and may be selected appropriately depending on the embodiment.

続いて、保証システム４は、ＳＨＡ－２５６、ＲＩＰＥＭＤ－１６０等の一方向ハッシュ関数を公開鍵ＰＫ３に適用することで、公開鍵ＰＫ３から仲介用公開アドレスＰＡ３を生成する。例えば、保証システム４は、ＳＨＡ－２５６を公開鍵ＰＫ３に２回適用することによって、仲介用公開アドレスＰＡ３を生成することができる。すなわち、この仲介用公開アドレスＰＡ３は、上述したトランザクションの署名に利用される公開鍵のハッシュ値であり、ＮＦＴの移転先及び移転元を識別するために利用される。 Then, the assurance system 4 generates an intermediary public address PA3 from the public key PK3 by applying a one-way hash function such as SHA-256 or RIPEMD-160 to the public key PK3. For example, the assurance system 4 can generate the intermediary public address PA3 by applying SHA-256 twice to the public key PK3. In other words, this intermediary public address PA3 is a hash value of the public key used to sign the transaction described above, and is used to identify the transfer destination and transfer source of the NFT.

次のステップＳ３０２では、移転元である権利者がＮＦＴ取引プラットフォームに提供されたサービスの履歴など、ＮＦＴに関連付けられる取引履歴データを、ステップＳ３０１で生成された仲介用公開アドレスＰＡ３に紐付けてノードへの記録を行う。具体的には、保証システム４で取得された肖声権データなどが仲介用公開アドレスＰＡ３に紐付けられて公開される。この肖声権データは、仲介用公開アドレスＰＡ３に関する公開鍵ＰＫ３を入手したものであれば、自由に閲覧ができるようになっている。この結果、その電力が由来する発電方法や発電箇所等の履歴や取引履歴に不正があったり、改ざんされたり等の不正行為に対する検証が誰でも行えるようになる。 In the next step S302, transaction history data associated with the NFT, such as the history of services provided to the NFT trading platform by the rights holder who is the source of the transfer, is linked to the intermediary public address PA3 generated in step S301 and recorded in the node. Specifically, the power of attorney data obtained by the guarantee system 4 is linked to the intermediary public address PA3 and made public. This power of attorney data can be freely viewed by anyone who has obtained the public key PK3 related to the intermediary public address PA3. As a result, anyone can verify whether there has been any fraud or tampering with the history of the power generation method and power generation location from which the electricity originates, or the transaction history.

その後、ステップＳ３０３では、保証システム４は、ステップＳ３０１で生成した仲介用公開アドレスＰＡ３に対する権利移転の取引を行う。そして、当該移転が完了すると、保証システム４は、本動作例に係る処理を終了する。ここで、本実施形態に係るＮＦＴのやり取りには、例えばスマートフォン５上で実行されるアプリケーションが用いられる。そのため、保証システム４の公開アドレス管理部４４ｂにも、ＮＦＴ取引の仕組みを実行するアプリケーションがインストールされており、このアプリケーションによって、プラットフォームが管理する仲介用公開アドレスが制御されている。 After that, in step S303, the guarantee system 4 performs a transaction of transfer of rights to the intermediary public address PA3 generated in step S301. Then, when the transfer is completed, the guarantee system 4 ends the processing related to this operation example. Here, an application executed on the smartphone 5, for example, is used for the exchange of NFT related to this embodiment. Therefore, an application that executes the mechanism of NFT transactions is also installed in the public address management unit 44b of the guarantee system 4, and this application controls the intermediary public address managed by the platform.

ＮＦＴとしての肖声権が、権利者に属している間は、ＮＦＴは、権利者のスマートフォン５において、権利者固有の公開アドレスＰＡａはペアとなる秘密鍵ＳＫａと対応付けられており、ＮＦＴ取引プラットフォームにおいて、移転手続きが取られる際に、購入者は、スマートフォン等の情報端末を用いて、移転元の公開アドレスＰＡａ（移転元）から、ステップＳ３０１で、ＮＦＴプラットフォーが生成した仲介用公開アドレスＰＡ３（移転先）にＮＦＴを一時的に移転させる。 While the portrait rights as an NFT belong to the right holder, the NFT's unique public address PAa is associated with a paired private key SKa on the right holder's smartphone 5, and when the transfer procedure is carried out on the NFT trading platform, the purchaser uses an information terminal such as a smartphone to temporarily transfer the NFT from the source public address PAa (transfer source) to the intermediary public address PA3 (transfer destination) generated by the NFT platform in step S301.

これに対して、新たにＮＦＴの購入を希望する購入者は、自身のスマートフォン５等の情報処理端末装置を用いて、ＮＦＴが紐付けられている公開鍵ＰＫ３を入手し、当該購入者は、ＮＦＴ取引プラットフォームの仲介用公開アドレスＰＡ３に紐付けられた肖声権に関する声データや取引経過情報、関連する外部情報を閲覧することができる。 In response to this, a buyer who wishes to purchase a new NFT can use their own information processing terminal device, such as a smartphone 5, to obtain the public key PK3 linked to the NFT, and the buyer can view voice data related to the cryptocurrency, transaction progress information, and related external information linked to the intermediary public address PA3 of the NFT trading platform.

具体的には、購入者のスマートフォン５にもＮＦＴ取引用のアプリケーションがインストールされており、このアプリケーションによって、購入者の保有する公開アドレスＰＡｂが管理されている。公開アドレスＰＡｂには購入者の秘密鍵ＳＫｂが対応付けられており、これによって、自己の公開アドレスＰＡｂからＮＦＴを、さらに他人に移転することができる。つまり、各秘密鍵ＳＫｂによって、購入者は、公開アドレスＰＡｂに格納されたＮＦＴやその取引履歴を自在に利用することができる。ここでは、購入者が、スマートフォン５においてアプリケーションを利用して、ＮＦＴプラットフォーム固有の仲介用公開アドレスＰＡ３から公開アドレスＰＡｂに移転されたＮＦＴを受け取る。 Specifically, an application for NFT transactions is also installed on the purchaser's smartphone 5, and this application manages the public address PAb held by the purchaser. The purchaser's private key SKb is associated with the public address PAb, which allows the purchaser to further transfer the NFT from his or her public address PAb to another person. In other words, each private key SKb allows the purchaser to freely use the NFT stored in the public address PAb and its transaction history. Here, the purchaser uses the application on the smartphone 5 to receive the NFT transferred from the intermediary public address PA3 specific to the NFT platform to the public address PAb.

これにより、譲受人である購入者は、ＮＦＴを自分の公開アドレスＰＡｂで受領することができ、この仲介用公開アドレスＰＡ３に紐付けられたサービス履歴の閲覧や、サービスの利用が可能になる。なお、この公開アドレスの発行は、ボイスライツ管理サーバー３が行ってもよく、各取引ユーザー端末上のソフトウェアや、独立したサービス管理機関や金融機関のサーバーで行うこともできる。ここで、ＮＦＴの取引に関する詳細な仕組みについて具体的に説明する。図１０～図１２に、トークン取引履歴（ブロックチェーン）の一部を例示する。 As a result, the transferee (purchaser) can receive the NFT at his/her public address PAb, and can view the service history and use the service linked to this intermediary public address PA3. This public address may be issued by the voice rights management server 3, or by software on each transaction user terminal, or by the server of an independent service management institution or financial institution. Here, we will explain in detail the detailed mechanism of NFT transactions. Figures 10 to 12 show examples of part of the token transaction history (blockchain).

ＮＦＴの発行・移転・消却に関する取引履歴が、一連の電子署名の連鎖として定義される。このＮＦＴの所有者は、次の所有者にその取引履歴を移転する場合に、直前の取引のハッシュ値と、次の所有者に係る公開鍵のハッシュ値とを自身の秘密鍵で電子署名したものをトークンの取引履歴に追加する。なお、これらのハッシュ値の計算には、例えば、ＳＨＡ－２５６、ＲＩＰＥＭＤ－１６０等の一方向ハッシュ関数が用いられる。 The transaction history regarding the issuance, transfer, and cancellation of an NFT is defined as a chain of a series of electronic signatures. When the owner of this NFT transfers its transaction history to the next owner, they add to the token's transaction history the hash value of the previous transaction and the hash value of the public key related to the next owner, which are digitally signed with their own private key. Note that a one-way hash function such as SHA-256 or RIPEMD-160 is used to calculate these hash values.

図１０では、取引の具体例として、ＮＦＴが、所有者Ｚから所有者Ａに移転され、所有者Ａから所有者Ｂに移転され、さらに所有者Ｂから所有者Ｃに移転される場面が例示されている。この場合、所有者Ａから所有者Ｂにトークンを移転するときには、所有者Ａは、所有者Ｚから所有者Ａへの移転取引のハッシュ値と次の所有者である所有者Ｂの公開鍵のハッシュ値とを所有者Ａの秘密鍵で電子署名したものをトークンに追加する。 Figure 10 illustrates, as a specific example of a transaction, a scene in which an NFT is transferred from owner Z to owner A, from owner A to owner B, and then from owner B to owner C. In this case, when transferring a token from owner A to owner B, owner A adds to the token a hash value of the transfer transaction from owner Z to owner A and a hash value of the public key of the next owner, owner B, digitally signed with owner A's private key.

所有者Ｂを含むこの取引以降のトークンの所有者は、所有者Ａの公開鍵でこの電子署名を復号した値を所有者Ｚから所有者Ａへの移転取引のハッシュ値及び所有者Ｂの公開鍵のハッシュ値と照合することで、この取引が改ざんされているか否かを判定することができる。同様に、所有者Ｂから所有者Ｃにトークンを移転するときには、所有者Ｂは、所有者Ａから所有者Ｂへの移転取引のハッシュ値と次の所有者である所有者Ｃの公開鍵のハッシュ値とを所有者Ｂの秘密鍵で電子署名したものをトークンに追加する。これにより、所有者Ｂから所有者Ｃへの移転取引が改ざんされているか否かを判定することが可能になる。 Any subsequent owner of the token, including owner B, can determine whether the transaction has been tampered with by comparing the value obtained by decrypting this electronic signature with owner A's public key with the hash value of the transfer transaction from owner Z to owner A and the hash value of owner B's public key. Similarly, when transferring a token from owner B to owner C, owner B adds to the token the hash value of the transfer transaction from owner A to owner B and the hash value of the public key of the next owner, owner C, electronically signed with owner B's private key. This makes it possible to determine whether the transfer transaction from owner B to owner C has been tampered with.

ＮＦＴは、このような一連の電子署名の連鎖として定義することができる。ここで、公開鍵のハッシュ値は公開アドレスである。すなわち、この公開アドレスに保管されるトークン等を移転できるのは、この公開アドレスを移転元とする肖声権の電子署名を行える者、すなわち、この公開アドレスに対応する秘密鍵を有する者に限られる。そのため、秘密鍵は、一般的には、所有者以外に漏えいしないように秘匿される。なお、トークンやそれに関する取引履歴等のデータは、現在の所有者に紐付けられた公開アドレスに保管される。また、この電子署名だけでは、このトークンの過去における所有者のうちの誰かが当該トークンを多重使用（多重譲渡）していることを検証することはできないことから、本実施形態に係るトークン取引の仕組みでは、図１１及び図１２で例示されるブロックチェーンという仕組みを用いて、この多重使用を防止している。 NFT can be defined as a chain of such a series of electronic signatures. Here, the hash value of the public key is the public address. In other words, the tokens stored in this public address can only be transferred by those who can perform an electronic signature of the token with this public address as the transfer source, that is, by those who have the private key corresponding to this public address. Therefore, the private key is generally kept secret so as not to be leaked to anyone other than the owner. Note that the token and data related to it, such as transaction history, are stored in a public address linked to the current owner. In addition, since this electronic signature alone cannot verify that any of the past owners of the token have used the token multiple times (transferred it multiple times), the token transaction mechanism according to this embodiment uses a mechanism called a blockchain, exemplified in FIG. 11 and FIG. 12, to prevent this multiple use.

図１１及び図１２に例示されるように、トークン等に記録される各ブロックは、複数のトランザクションとＮｏｎｃｅと直前のブロックのハッシュ値とを格納している。Ｎｏｎｃｅは、暗号通信で用いられる使い捨てのランダムな値であり、ノード（マイナー）９０ａ～９０ｆのうち、この値を最初に発見したノード（マイナー）が、承認者として、Ｎｏｎｃｅを発見したブロックをブロックチェーンの末尾に追加することでブロックチェーンの更新を行う。 As illustrated in Figures 11 and 12, each block recorded in a token or the like stores multiple transactions, a nonce, and the hash value of the previous block. A nonce is a disposable random value used in cryptographic communications, and the node (miner) 90a to 90f that first discovers this value acts as an approver and updates the blockchain by adding the block in which the nonce was discovered to the end of the blockchain.

これにより、ブロックチェーンには一貫した取引履歴が記録されることになり、このブロックチェーンをＰ２Ｐネットワーク９０に参加するノード９０ａ～９０ｆ全体で共有することで、一貫した取引履歴をＰ２Ｐネットワーク９０全体で共有することができる。すなわち、このブロックチェーンが、上述した保証システム４におけるＮＦＴ取引履歴データベース４１ａ及び鍵情報データベース４１ｂの一部又は全部を担うこととなる。本実施形態において、公開鍵暗号方式に基づく肖声権管理では、このような仕組みによってＮＦＴの取引が行われる。 As a result, a consistent transaction history is recorded in the blockchain, and by sharing this blockchain among all nodes 90a to 90f participating in the P2P network 90, a consistent transaction history can be shared throughout the entire P2P network 90. In other words, this blockchain serves as part or all of the NFT transaction history database 41a and key information database 41b in the guarantee system 4 described above. In this embodiment, NFT transactions are carried out using this mechanism in the identity management based on the public key cryptography method.

（作用・効果）
以上述べたように本実施形態によれば、人の声とその声の持ち主の個人情報をブロックチェーンで紐付けてＮＦＴとして管理し、その人の声を権利として保障することでき、各個人の声データを効率的かつ安全に管理・活用できる。詳述すると、本実施形態によれば、声データを、「肖声権」として保護することができ、この肖声権と言う形で個人の声のデータに明確な所有権を設定することができ、不正使用や権利侵害を防ぐことが可能となる。 (Action and Effects)
As described above, according to this embodiment, a person's voice and the personal information of the owner of that voice are linked by a blockchain and managed as an NFT, and the person's voice can be guaranteed as a right, and each individual's voice data can be efficiently and safely managed and utilized. In more detail, according to this embodiment, the voice data can be protected as a "voice right," and a clear ownership right can be set for the individual's voice data in the form of this voice right, making it possible to prevent unauthorized use and infringement of rights.

本実施形態では、必要に応じてオリジナルの声データを、声質や言語等を変換してオリジナルとは異なる新たな声データを再生成するため、この新たな声データに基づく肖声権を多様なサービスで活用し、その付加価値を高めることができる。例えば、再生した声データを用いることで、ユーザーが自身の声で外国語を話すことで外国での旅行体験をシミュレーションするバーチャルトラベル体験や、楽曲の外国語バージョンによるプロモーション、ユーザーが動物キャラクターや宇宙人としてバーチャル空間での対話を楽しむ異空間での対話シミュレーション、ユーザーの声を低年齢化或いは高齢化させて過去や未来の自分との対話をシミュレートし、人生の選択や目標について考えるきっかけを提供したり、言語学習アシスタンスなど、教育やエンターテインメント、ゲーム、福祉、旅行など多様な分野で肖声権を活用することができる。 In this embodiment, the original voice data is converted as necessary in terms of voice quality, language, etc. to regenerate new voice data that is different from the original, and the voice rights based on this new voice data can be utilized in a variety of services to increase the added value of the services. For example, by using the reproduced voice data, the voice rights can be utilized in a variety of fields, such as education, entertainment, games, welfare, and travel, for example, a virtual travel experience in which the user simulates a travel experience in a foreign country by speaking a foreign language in their own voice, promotions using foreign language versions of songs, a dialogue simulation in a different space in which the user enjoys dialogue in a virtual space as an animal character or an alien, simulating a dialogue with the past or future self by making the user's voice younger or older, providing an opportunity to think about life choices and goals, and language learning assistance.

また、本実施形態では、ブロックチェーン技術を介して、声データとその持ち主の個人情報が紐付けられることとなり、プライバシーが強固に保護されるとともに、肖声権が確立されることで、声データの不正使用に対して法的に対処することが容易となる。さらに、本実施形態により、声データを肖声権として適正に保護することができることから、安全に商用化する新しいビジネスモデルが可能となり、声データに明確な所有権を設定できるとともに、声データの持ち主がその利用に対して適切に報酬を受けられるようになるため、声データをより自由に、かつ安心してクリエイティブな用途に応用することができ、また、声データが認証され、その信頼性が確保されるため、さまざまな産業での声データが活用されることが期待できる。 In addition, in this embodiment, voice data is linked to the personal information of its owner via blockchain technology, so privacy is strongly protected, and the establishment of a voice right makes it easier to take legal action against unauthorized use of the voice data. Furthermore, this embodiment allows voice data to be properly protected as a voice right, enabling a new business model for safe commercialization, and clear ownership rights can be set for the voice data, while the owner of the voice data can be appropriately compensated for its use. This allows the voice data to be applied more freely and safely to creative uses, and since the voice data is authenticated and its reliability is ensured, it is expected that voice data will be used in a variety of industries.

なお、本発明は、上記した各実施の形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施の形態に開示されている複数の構成要素の適宜な組み合せにより、種々の発明を形成できる。例えば、実施の形態に示される全構成要素から幾つかの構成要素を削除してもよい。 The present invention is not limited to the above-described embodiments as they are, and in the implementation stage, the components can be modified to the extent that does not deviate from the gist of the invention. In addition, various inventions can be created by appropriately combining multiple components disclosed in the above embodiments. For example, some components may be deleted from all of the components shown in the embodiments.

Ｄ１…声データ
Ｄ２…個人情報
Ｄ３…肖声権データ
ＰＡａ…公開アドレス
ＰＡｂ…公開アドレス
ＰＫａ…公開鍵
ＳＫａ…秘密鍵
ＳＫｂ…秘密鍵
Ｕ１…ユーザー
１…肖声権管理システム
２…通信ネットワーク
３…ボイスライツ管理サーバー
４…保証システム
５…スマートフォン
２１…衛星
２２…無線基地局
３１…通信インターフェース
３２…位置情報管理部
３３…認証部
３４…情報配信部
３５ａ…肖声権データベース
３５ｂ…ユーザーデータベース
３５ｃ…蓄積情報データベース
３６…肖声権データ制御部
３６ａ…声データ制御部
３６ｂ…関連データ制御部
３６ｃ…保証システム連携部
４１ａ…ＮＦＴ取引履歴データベース
４１ｂ…鍵情報データベース
４１ｃ…アカウントデータベース
４２…認証部
４３…通信インターフェース
４４…権利管理実行部
４４ａ…権利情報提供部
４４ｂ…公開アドレス管理部
４４ｃ…正当性検証部
４４ｄ…データ更新部
５１…通信インターフェース
５２…入力インターフェース
５２ａ…タッチパネル
５３…出力インターフェース
５３ａ…表示部
５４…アプリケーション実行部
５５…メモリ
９０…Ｐ２Ｐネットワーク
９０ａ～９０ｆ…ノード
５４０…同期処理部
５４１…声データ生成部
５４１ａ…言語変換部
５４１ｂ…声質変換部
５４２…関連データ取得部
５４２ａ…音声特徴抽出部
５４２ｂ…変換パラメータ記録部
５４２ｃ…加工情報制御部
５４３…肖声権管理部
５４４…位置情報取得部
５４４ａ…時刻情報取得部
５４５…表示制御部
５４５ａ…ＧＵＩ制御部
５４６…表示データ生成部
D1...Voice data D2...Personal information D3...Voice rights data PAa...Public address PAb...Public address PKa...Public key SKa...Private key SKb...Private key U1...User 1...Voice rights management system 2...Communication network 3...Voice rights management server 4...Guarantee system 5...Smartphone 21...Satellite 22...Wireless base station 31...Communication interface 32...Location information management unit 33...Authentication unit 34...Information distribution unit 35a...Voice rights database 35b...User database 35c...Storage information database 36...Voice rights data control unit 36a...Voice data control unit 36b...Related data control unit 36c...Guarantee system linkage unit 41a...NFT transaction history database 41b...Key information database 41c...Account database 42...Authentication unit 43...Communication interface 44...Rights management execution unit 44a...Rights information provision unit 44b...Public address management unit 44c... Validity verification unit 44d... Data update unit 51... Communication interface 52... Input interface 52a... Touch panel 53... Output interface 53a... Display unit 54... Application execution unit 55... Memory 90... P2P network 90a to 90f... Nodes 540... Synchronization processing unit 541... Voice data generation unit 541a... Language conversion unit 541b... Voice quality conversion unit 542... Related data acquisition unit 542a... Voice feature extraction unit 542b... Conversion parameter recording unit 542c... Processing information control unit 543... Voice rights management unit 544... Position information acquisition unit 544a... Time information acquisition unit 545... Display control unit 545a... GUI control unit 546... Display data generation unit

Claims

A system for managing voice data as a voice right, comprising:
a voice data generating unit that acquires an acoustic signal and generates voice data;
a related data acquisition unit for acquiring related data including personal information;
a voice data generating unit for generating voice data by associating the voice data with the related data;
and a data storage means for encrypting and storing at least a part of the copyright data.

The data storage means includes a guarantee system linking unit that links with a guarantee system constructed on a communication network to encrypt and store at least a part of the image right data;
The security system includes a plurality of nodes that encrypt and store at least a portion of the image recognition data;
The copyright management system according to claim 1, characterized in that the node aggregates at least a portion of the copyright data at a predetermined timing, forms a block, links the block to an existing block to form a blockchain, and shares the blockchain among a plurality of the nodes and stores it as a distributed ledger.

the voice data generating unit has a function of adding time information including a date and time of acquisition of the acquired acoustic signal to the associated data;
The voice rights management system according to claim 1, characterized in that the related data acquisition unit has a function of analyzing a correlation between a date and time when the added acoustic signal was acquired and information regarding age included in the personal information, and including the analysis result in the related data.

a voice feature extraction unit for extracting any one of frequency, duration, amplitude, intensity, and frequency components contained in the voice data as a feature parameter;
a conversion unit that converts the voice quality or language of the voice data based on the feature parameters;
The system according to claim 1 , further comprising a conversion parameter recording unit for recording, as conversion parameters, characteristic parameters of the voice data after the conversion unit converts the voice quality or language.

A method for managing voice data as a voice right using the voice right management system according to claim 1, comprising:
a voice data generating step in which a voice data generating unit obtains an acoustic signal to generate voice data, a related data obtaining unit obtains related data including personal information, and a voice data generating step in which the voice data and the related data are related to each other to generate voice data;
and a data storage step in which data storage means encrypts and stores at least a part of the voice right data.

the data storage step includes a guarantee system linking step of encrypting and storing at least a part of the image right data in cooperation with a guarantee system constructed on a communication network;
The security system includes a plurality of nodes that encrypt and store at least a portion of the image recognition data;
The method for managing rights to the personal computer as described in claim 5, characterized in that, in the data storage step, the node aggregates at least a portion of the personal computer data at a predetermined timing to form a block, links the block to an existing block to form a blockchain, and shares the blockchain among a plurality of the nodes to store it as a distributed ledger.

In the voice data generating step, the voice data generating unit adds time information including a date and time of acquisition of the acquired acoustic signal to the associated data,
The method for managing voice rights according to claim 5, characterized in that the related data acquisition unit analyzes the correlation between the date and time when the added acoustic signal was acquired and information regarding age included in the personal information, and includes the analysis result in the related data.

In the image data generating step,
a voice feature extraction step in which a voice feature extraction unit extracts any one of frequency, duration, amplitude, intensity, or frequency components contained in the voice data as a feature parameter;
a conversion step of converting a voice quality or a language of the voice data based on the feature parameters by a conversion unit;
6. The method of claim 5, further comprising a conversion parameter recording unit for recording characteristic parameters of the voice data after the conversion unit converts the voice quality or language as conversion parameters.

A program for managing voice data as a voice record, comprising:
a voice data generating unit that acquires an acoustic signal and generates voice data;
a related data acquisition unit for acquiring related data including personal information;
a voice data generating unit for generating voice data by associating the voice data with the related data;
a computer program product for managing a copyright, the computer program product being operable as a data storage means for encrypting and storing at least a portion of the copyright data;

The data storage means includes a guarantee system linking unit that links with a guarantee system constructed on a communication network to encrypt and store at least a part of the image right data;
The security system includes a plurality of nodes that encrypt and store at least a portion of the image recognition data;
The computer-implemented program for managing the right to vote according to claim 9, characterized in that the node aggregates at least a portion of the right to vote data at a predetermined timing, forms a block, links the block to an existing block to form a blockchain, and shares the blockchain among a plurality of the nodes and stores it as a distributed ledger.

the voice data generating unit has a function of adding time information including a date and time of acquisition of the acquired acoustic signal to the associated data;
The voice right management program according to claim 9, characterized in that the related data acquisition unit has a function of analyzing a correlation between a date and time when the added acoustic signal was acquired and information regarding age included in the personal information, and including the analysis result in the related data.

The information processing device,
a voice feature extraction unit for extracting any one of frequency, duration, amplitude, intensity, and frequency components contained in the voice data as a feature parameter;
a conversion unit that converts the voice quality or language of the voice data based on the feature parameters;
10. The voice right management program according to claim 9, further functioning as a conversion parameter recording unit for recording characteristic parameters of the voice data after the conversion unit converts the voice quality or language as conversion parameters.