[go: up one dir, main page]

WO2024241837A1 - Information processing method and information processing system - Google Patents

Information processing method and information processing system Download PDF

Info

Publication number
WO2024241837A1
WO2024241837A1 PCT/JP2024/016514 JP2024016514W WO2024241837A1 WO 2024241837 A1 WO2024241837 A1 WO 2024241837A1 JP 2024016514 W JP2024016514 W JP 2024016514W WO 2024241837 A1 WO2024241837 A1 WO 2024241837A1
Authority
WO
WIPO (PCT)
Prior art keywords
answer
vector
information processing
evaluation
uniqueness
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/JP2024/016514
Other languages
French (fr)
Japanese (ja)
Inventor
淳司 宮田
俊之 松村
仁志 富永
博人 柳川
由紀子 内田
真孝 中山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Panasonic Holdings Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Panasonic Holdings Corp filed Critical Panasonic Holdings Corp
Publication of WO2024241837A1 publication Critical patent/WO2024241837A1/en
Anticipated expiration legal-status Critical
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique

Definitions

  • This disclosure relates to technology for evaluating people's ideas.
  • Non-Patent Document 1 discloses a technology that addresses the subjective limitations in evaluating people's creativity by utilizing recent research on automatically scoring linguistic creativity using semantic distance.
  • Non-Patent Document 1 is considered to be closely related to originality.
  • Non-Patent Document 1 does not take such issues into consideration, and therefore has the problem of being unable to accurately judge the uniqueness of an answer created from a certain topic.
  • the present disclosure provides a technique for more accurately assessing the originality of answers created from a given subject.
  • An information processing method is an information processing method in a computer, which obtains a first vector corresponding to a first answer text indicating an answer to a topic, obtains a second vector corresponding to a second answer text indicating an answer associated with the topic, and evaluates the uniqueness of the answer indicated by the first answer text based on the degree of similarity between the first vector and the second vector.
  • this comprehensive or specific aspect may be realized by a device, a system, an integrated circuit, a computer program, or a computer-readable recording medium, or may be realized by any combination of a method, a device, a system, an integrated circuit, a computer program, and a recording medium.
  • the computer-readable recording medium includes, for example, a non-volatile recording medium such as a CD-ROM (Compact Disc-Read Only Memory).
  • the present disclosure allows for a more accurate assessment of the uniqueness of answers created from a given topic.
  • FIG. 1 is a diagram illustrating an example of a network configuration of an information processing system according to an embodiment of the present disclosure.
  • FIG. 2 is a block diagram illustrating an example of a detailed configuration of the information processing system.
  • FIG. 13 is a diagram illustrating an example of a data configuration of a reference text table.
  • FIG. 13 is a diagram illustrating an example of a data configuration of an evaluation result table.
  • FIG. 2 is a sequence diagram showing a processing flow of the information processing system.
  • 13 is a flowchart illustrating an example of a process of the information processing system when acquiring a reference answer.
  • 13 is a flowchart illustrating an example of a process of the information processing system when acquiring an answer.
  • 8 is a flowchart showing details of the process of step S205 shown in FIG.
  • FIG. 9 is an explanatory diagram of the process of FIG. 8 .
  • 8 is a flowchart showing details of the process in step S407 in FIG. 7;
  • FIG. 11 is an explanatory diagram of the process of FIG. 10 .
  • 10 is a flowchart illustrating an example of a lighting control process.
  • FIG. 8 is a diagram showing an example of a display screen displayed on a user's terminal in step S409 in FIG. 7.
  • FIG. 1 is a diagram illustrating an example of a booth.
  • the inventors are conducting research into having a computer evaluate the degree of originality of a user's answer when a certain topic is given to the user.
  • a computer performs such an evaluation, the following method is assumed.
  • the computer represents each of the topic and the answer to that topic as a word vector.
  • the computer calculates the cosine distance between both word vectors. The larger the cosine distance, the more original the answer is determined to be.
  • the inventors therefore came up with the idea of the present disclosure, based on the knowledge that, rather than using a certain topic as a standard, answers with low originality on that topic are collected in advance, and answers created by users are evaluated based on the collected answers with low originality.
  • An information processing method is an information processing method in a computer, which obtains a first vector corresponding to a first answer text indicating an answer to a topic, obtains a second vector corresponding to a second answer text indicating an answer associated with the topic, and evaluates the uniqueness of the answer indicated by the first answer text based on the degree of similarity between the first vector and the second vector.
  • the originality of an answer is evaluated using the vector similarity between the first answer text and a second answer text indicating an answer that is associated with the topic and is considered to be less original, rather than the vector similarity between the topic and a first answer text indicating an answer to the topic.
  • the originality of an answer created from a certain topic can be evaluated more accurately.
  • the second answer text may include a plurality of texts
  • the second vector may be a vector corresponding to a center of gravity position determined from a plurality of vectors generated by converting each of the plurality of texts into a vector.
  • the second vector is composed of a single vector that represents the characteristics of the second answer text, making it easy to calculate the similarity between the first vector and the second vector.
  • the similarity may be a cosine similarity derived based on the first vector and the second vector.
  • the similarity between the first vector and the second vector can be accurately quantified, and the uniqueness of the first answer text can be more accurately evaluated.
  • the first answer text may be obtained by converting voice data based on the user's speech into text data.
  • the first answer text can be input by voice, reducing the input burden of the first answer text.
  • evaluating the uniqueness of the answer may include deriving a score according to the degree of uniqueness of the answer.
  • the first answer text is evaluated using a score according to the level of uniqueness, making it easier to evaluate the uniqueness of the first answer text.
  • a lighting device that illuminates the vicinity of the user may be controlled to lower the color temperature of the illumination light and increase the illuminance.
  • the lighting device is controlled to lower the color temperature of the illumination light and increase the illuminance, which can enhance the user's creativity and encourage the creation of a highly original first answer text.
  • This configuration allows for accurate determination of whether a score is low or not.
  • obtaining the first vector may include obtaining a plurality of first answer texts
  • evaluating the uniqueness of the answers may include deriving a plurality of scores for a plurality of answers corresponding to each of the plurality of first answer texts obtained during a certain period of time, and determining that the first condition is met when a predetermined number or more of the plurality of scores have a score equal to or less than a threshold during the certain period of time.
  • lighting devices are controlled based on the evaluation results of multiple answers created over a certain period of time, which helps to avoid frequent changes to lighting device control.
  • the first vector may be a vector that has the lowest similarity to the second vector among a plurality of vectors corresponding to a plurality of words included in the first answer text.
  • the characteristics of the first answer text can be expressed with a single index, so even if the first answer text is a long sentence, it is easy to calculate the similarity while capturing the characteristics of the long sentence.
  • a display screen may be displayed on the display to display the evaluation results of the uniqueness of the answer.
  • the first vector may include a plurality of word vectors calculated for each of a plurality of words included in the first answer text for each of a plurality of corpora
  • the second vector may include a plurality of reference points corresponding to the plurality of corpora
  • the evaluation of the uniqueness may include calculating a plurality of evaluation values related to the uniqueness corresponding to the plurality of corpora by comparing the plurality of word vectors with the plurality of reference points for each of the plurality of corpora, and calculating a final evaluation value from the plurality of evaluation values.
  • the final evaluation value is calculated from evaluation values corresponding to multiple corpora, so variation due to corpora is suppressed and the uniqueness of the answer can be calculated with high accuracy.
  • An information processing system is an information processing system having a processor, the processor executes a process of acquiring a first vector corresponding to a first answer text indicating an answer to a topic, acquiring a second vector corresponding to a second answer text indicating an answer associated with the topic, and evaluating the uniqueness of the answer indicated by the first answer text based on the degree of similarity between the first vector and the second vector.
  • This configuration provides an information processing system that can more accurately evaluate the uniqueness of answers created from a certain topic.
  • the present disclosure can also be realized as an information processing program that causes a computer to execute each of the characteristic components included in such an information processing system, or as an information processing device that operates by this information processing program.
  • a computer program can be distributed on a non-transitory computer-readable recording medium such as a CD-ROM or via a communication network such as the Internet.
  • FIG. 1 is a diagram showing an example of a network configuration of an information processing system 1 according to an embodiment of the present disclosure.
  • the information processing system 1 is a system that provides a topic to a user and evaluates the user's answer to the topic.
  • the information processing system 1 includes a terminal 100, a reference calculation server 200, a storage unit 300, a uniqueness evaluation server 400, a lighting control server 500, and a lighting device 600.
  • the terminals 100 to the lighting device 600 are connected to each other so as to be able to communicate with each other via a network.
  • the network is a wide area communication network including, for example, the Internet and a mobile phone communication network.
  • the terminal 100 and the lighting device 600 are installed in a space where a user is located.
  • An example of the space is a booth 1500, which will be described later in FIG. 14.
  • the terminal 100 is a computer used by a user.
  • the terminal 100 may be, for example, a portable computer such as a smartphone or a tablet computer, or may be a desktop computer.
  • the terminal 100 includes a microphone, a keyboard, a USB memory, and a monitor.
  • the microphone acquires voice data spoken by the user as an answer to the topic.
  • the keyboard acquires the text of the answer entered by the user as an answer to the topic.
  • this text is referred to as the target text.
  • the target text is an example of the first text.
  • the USB memory stores the target text.
  • the monitor displays a display screen showing the evaluation results for the user's answer.
  • the reference calculation server 200, the uniqueness evaluation server 400, and the lighting control server 500 are configured as computers such as a cloud server.
  • the storage unit 300 is configured as a computer equipped with a rewritable non-volatile storage device such as a hard disk drive (HDD) and a solid state drive.
  • the reference calculation server 200 calculates a reference point, which will be described later.
  • the uniqueness evaluation server 400 evaluates the uniqueness of the user's answer.
  • the lighting control server 500 controls the lighting device 600 based on the results of the uniqueness evaluation.
  • the storage unit 300 stores data required to evaluate the uniqueness of the answer.
  • the lighting device 600 illuminates the inside of the booth 1500 shown in FIG. 14 under the control of the lighting control server 500.
  • FIG. 2 is a block diagram showing an example of a detailed configuration of the information processing system 1.
  • the terminal 100 includes an input unit 101, a communication unit 102, a screen generation unit 103, a screen control unit 104, and a display unit 105.
  • the input unit 101 is composed of a microphone, keyboard, USB memory, etc., as shown in FIG. 1.
  • the input unit 101 acquires target text.
  • the input unit 101 acquires subject data indicating a subject input by the user.
  • the subject data may be text or may be audio data.
  • the communication unit 102 is composed of a communication circuit that connects the terminal 100 to a network.
  • the screen generation unit 103 generates a display screen indicating the evaluation result of the originality of the user's answer.
  • the screen control unit 104 displays the display screen generated by the screen generation unit 103 on the display unit 105.
  • the display unit 105 is composed of the monitor shown in FIG. 1, and displays the display screen under the control of the screen control unit 104.
  • the reference calculation server 200 includes a reference calculation unit 201, a communication unit 202, a data conversion unit 203, and a theme setting unit 204.
  • the reference calculation unit 201 calculates a reference point that is a representative value of a reference vector corresponding to the theme data set by the theme setting unit 204.
  • the reference point is a point in a vector space that corresponds to a reference answer text that indicates a reference answer that is an answer that is associated with the theme and has low uniqueness.
  • the reference answer text is an example of a second answer text.
  • the reference point is an example of a second vector.
  • the reference vector is calculated using a method for vectorizing words, such as Word2vec.
  • the communication unit 202 is composed of a communication circuit that connects the reference calculation server 200 to a network.
  • the communication unit 202 receives the subject data transmitted from the terminal 100.
  • the communication unit 202 receives the voice data of the reference answer or the reference text transmitted from the terminal 100.
  • the data conversion unit 203 converts the voice data into text data.
  • the data conversion unit 203 obtains the reference text by converting the voice data of the reference answer received by the communication unit 202 into text data.
  • the subject setting unit 204 sets the subject data received by the communication unit 202.
  • the criteria calculation unit 201, the data conversion unit 203, and the subject setting unit 204 may be realized by a processor such as a CPU executing an information processing program, or may be configured by a dedicated hardware circuit.
  • the originality evaluation server 400 includes an originality evaluation unit 401, a communication unit 402, a data conversion unit 403, and a theme setting unit 404.
  • the uniqueness evaluation unit 401 calculates an evaluation value for evaluating the uniqueness of the answer indicated by the target text, based on the magnitude of similarity between the target vector generated by the data conversion unit 403 and the reference vector calculated by the reference calculation server 200.
  • the similarity is defined as cosine similarity. If the target vector is a and the reference vector is b, the cosine similarity is expressed as a ⁇ b/
  • the communication unit 402 is composed of a communication circuit that connects the uniqueness evaluation server 400 to a network, and receives the reference points calculated by the reference calculation server 200 from the storage unit 300.
  • the communication unit 402 receives the subject data transmitted from the terminal 100.
  • the communication unit 402 receives the answer voice data or target text transmitted from the terminal 100.
  • the data conversion unit 403 When answering voice data is transmitted from the terminal 100, the data conversion unit 403 obtains the target text by converting the voice data into text data. The data conversion unit 403 calculates a target vector that represents the target text as a vector.
  • the target vector is vectorized using a vectorization method such as Word2vec.
  • the target vector is an example of a first vector.
  • the subject setting unit 404 sets the subject data received by the communication unit 402.
  • the uniqueness evaluation unit 401, the data conversion unit 403, and the theme setting unit 404 may be realized by a processor such as a CPU executing an information processing program, or may be realized by a dedicated hardware circuit.
  • the lighting control server 500 includes a lighting control unit 501 and a communication unit 502.
  • the lighting control unit 501 generates a control command for setting the illuminance and color temperature of the lighting device 600 based on the evaluation value calculated by the uniqueness evaluation server 400.
  • the communication unit 502 transmits the control command generated by the lighting control unit 501 to the lighting device 600.
  • the communication unit 502 receives the evaluation value calculated by the uniqueness evaluation server 400 from the storage unit 300.
  • the lighting control unit 501 may be realized by a processor such as a CPU executing an information processing program, or may be configured as a dedicated hardware circuit.
  • the communication unit 502 is configured as a communication circuit, and transmits a control command to the lighting device 600.
  • the storage unit 300 includes a storage section 301 and a communication section 302.
  • the storage section 301 is configured as a rewritable non-volatile storage device.
  • the storage section 301 stores a reference text table T1 shown in FIG. 3 and an evaluation result table T2 shown in FIG. 4.
  • the storage section 301 stores a learning model used when the reference calculation server 200 and the uniqueness evaluation server 400 calculate a reference vector and a target vector. For example, a learning model that is created in advance by machine learning technology to calculate the vector of an input word from the text of that word is adopted as the learning model. In this embodiment, a model that expresses words as vectors in Word2vec is adopted as the learning model.
  • FIG. 3 is a diagram showing an example of the data configuration of the reference text table T1.
  • the reference text table T1 stores reference texts that indicate answers with low originality for each of a number of topics.
  • the reference text table T1 stores themes, reference texts, and elements of reference vectors of 1 to 300 dimensions in association with each other. For example, for the theme “pot,” “cook vegetables” and “boil water” are stored as reference texts. In this way, the reference texts indicate answers with low originality that the user can easily associate with the theme.
  • the reference vector is composed of 300-dimensional elements, but this is just one example, and it may be composed of elements of any dimension other than 300.
  • FIG 4 is a diagram showing an example of the data configuration of the evaluation result table T2.
  • the evaluation result table T2 stores target texts indicating the user's answers for each of a number of topics, in association with the evaluation values for those answers.
  • the evaluation result table T2 includes items for topic, target text, evaluation words, evaluation values, and user.
  • the evaluation words are words extracted from the target text for evaluation.
  • the user item stores the user's identification information (e.g., name).
  • the record showing the evaluation result in the first row records that the user "Bob” answered “to make it the head of the robot" for the topic "pot,” and that "robot” and "head” were extracted as evaluation words and that "75” was calculated as the evaluation value.
  • the user who answered can be inferred from the identification information assigned to the terminal 100 of that user. For example, if the uniqueness evaluation unit 401 obtains the answer "Make it the head of a robot" from Bob's terminal 100, it can infer that the user who answered is Bob.
  • the storage unit 300 can store user information that associates the identification information of the terminal 100 with the identification information of the user. Then, the uniqueness evaluation unit 401 can identify the identification information of the user that corresponds to the identification information of the terminal 100 from the user information.
  • the user who answered may also be estimated by voiceprint matching.
  • the uniqueness evaluation unit 401 may estimate the user by matching the voice data of the answer sent from the terminal 100 with the voiceprint information. This mode is applied when a mode is adopted in which one terminal 100 acquires voice data of answers from multiple users, rather than a mode in which each user inputs an answer using his or her own terminal 100.
  • FIG. 5 is a sequence diagram showing the processing flow of the information processing system 1.
  • step S201 the terminal 100 is started.
  • the user decides what kind of theme to use, and inputs theme data indicating the decided theme into the terminal 100.
  • audio data of the theme is input as the theme data.
  • the input theme data is sent to the reference calculation server 200 (step S202).
  • the theme data may be text data.
  • the user inputs voice data of reference answers, which are answers with low originality associated with the determined topic, into the terminal 100.
  • voice data of the input reference answers is sent to the reference calculation server 200 (step S203).
  • the process of steps S202 to S208 will be described in detail below with reference to FIG. 6.
  • FIG. 6 is a flowchart showing an example of the processing of the information processing system 1 when acquiring a reference answer.
  • the subject setting unit 204 of the reference calculation server 200 acquires the subject data transmitted from the terminal 100 using the communication unit 202, and sets the subject indicated by the acquired subject data as the target subject.
  • step S203 the data conversion unit 203 of the reference calculation server 200 acquires the voice data of the reference answer transmitted from the terminal 100 using the communication unit 202.
  • the voice data of the reference answer is acquired from the terminal 100 of the user who will provide the answer that will be the subject of the originality evaluation.
  • the voice data of the reference answer may be acquired from the terminals 100 of an unspecified number of general users who are different from the user in question.
  • the reference answer may be an answer collected through crowdsourcing.
  • step S204 the data conversion unit 203 obtains the reference text by performing a process of converting the voice data of the reference answer into text data.
  • audio data of the reference answer is acquired, but this is just one example, and text data of the reference answer may be acquired.
  • a process of acquiring the reference text is executed.
  • step S205 the data conversion unit 203 converts the reference text into a reference vector by inputting the reference text into the learning model stored in the storage unit 300. Details of this process will be described later with reference to FIG. 6.
  • step S206 the reference calculation unit 201 determines whether or not all reference texts indicating all reference answers obtained in step S203 have been converted into reference vectors. If all reference texts have not been converted into reference vectors (NO in step S206), the process returns to step S205, and the next reference text is converted into a reference vector. If all reference texts have been converted into reference vectors (YES in step S206), the reference calculation unit 201 calculates a reference point using all the reference vectors.
  • the reference point is, for example, the center of gravity position in the vector space of all the reference vectors.
  • step S208 the reference calculation unit 201 stores the reference point in the storage unit 300.
  • the subject data acquired in step S202, the reference text acquired in step S203, and the reference vector converted in step S205 are stored in the reference text table T1.
  • the reference text is accumulated in the reference text table T1.
  • step S401 onwards shown in FIG. 5 shows the process of evaluating the answers.
  • steps S401 and S402 are the same as those in steps S201 and S202.
  • step S403 the user inputs voice data of the answer to the task indicated by the task data input in step S402 into the terminal 100.
  • target text may be input instead of the voice data of the answer. Details of the processing of steps S402 to S409 are explained below with reference to FIG. 7.
  • FIG. 7 is a flowchart showing an example of the processing of the information processing system 1 when acquiring an answer.
  • the data conversion unit 403 of the uniqueness evaluation server 400 acquires the voice data of the answer sent from the terminal 100 using the communication unit 202.
  • step S404 the data conversion unit 403 performs a process of converting the voice data of the response into text data and obtains the target text.
  • audio data of the response is acquired, but this is just one example, and the target text may be acquired.
  • processing to acquire the target text is executed.
  • step S405 the data conversion unit 403 converts the target text into a target vector by inputting the target text into the learning model stored in the storage unit 300. Details of this process will be described later with reference to FIG. 6.
  • step S406 the uniqueness evaluation unit 401 uses the communication unit 402 to obtain a reference point corresponding to the subject indicated by the subject data obtained from the storage unit 300 in step S402.
  • step S407 the uniqueness evaluation unit 401 calculates the cosine similarity between the reference point and the target vector, and calculates a final evaluation value for evaluating the uniqueness of the answer from the calculated cosine similarity. Details of this process will be described later with reference to FIG. 10.
  • step S408 the uniqueness evaluation unit 401 stores the calculated final evaluation value in the evaluation result table T2 in association with the target text, the user's identification information, the evaluation words, and the target vector.
  • the final evaluation value is stored in the evaluation value field of the evaluation result table T2.
  • step S409 the uniqueness evaluation unit 401 generates display data for displaying a display screen showing the evaluation results of the answer, and transmits the generated display data to the terminal 100 using the communication unit 402.
  • the terminal 100 that receives this display data displays the display screen on the display unit 105.
  • FIG. 8 is a flowchart showing the details of the processing of step S205 shown in FIG. 6 and step S405 shown in FIG. 7.
  • the reference text and the target text are collectively referred to as text.
  • the data conversion unit 203 and the data conversion unit 403 are collectively referred to as data conversion unit A.
  • step S2051 data conversion unit A acquires text.
  • step S2052 data conversion unit A separates the read text into words. For example, morphological analysis is used for separating words.
  • data conversion unit A refers to the excluded parts of speech list and deletes from the text words that are not to be evaluated. This extracts evaluation words from the text.
  • the evaluation words are nouns, adjectival verbs, verbs, and adjectives.
  • the excluded parts of speech list stores excluded parts of speech other than these parts of speech.
  • the excluded parts of speech list is stored, for example, in storage unit 300. Words with excluded parts of speech are deleted from the text data because words with excluded parts of speech do not contribute to the evaluation of uniqueness and may become noise.
  • step S2054 the data conversion unit A converts each of the evaluation words extracted from the text into a word vector.
  • the reference vector is composed of a data set of word vectors calculated for each evaluation word.
  • the reference calculation unit 201 may calculate the center of gravity of multiple word vectors calculated for each evaluation word for each of the multiple reference vectors as a reference point.
  • the target vector is composed of a data set of multiple word vectors calculated for each evaluation word.
  • FIG. 9 is an explanatory diagram of the process of FIG. 8.
  • “vacuum cleaner” is set as the subject.
  • the user inputs voice data of the answer using a microphone.
  • the answer “Play by straddling the handle” is input.
  • the data conversion unit 403 converts the voice data of the response into text data. As a result, the target text D1 is obtained. Next, the data conversion unit 403 separates the target text D1 into words. As a result, the target text D1 is separated into words such as "straddle/play/with/the/part/of/pattern.”
  • the data conversion unit 403 deletes unnecessary words, which are words with excluded parts of speech, from the segmented target text D1.
  • unnecessary words which are words with excluded parts of speech
  • the data conversion unit 403 converts each evaluation word into a word vector using a vectorization method such as Word2vec. This results in a word vector for each evaluation word.
  • FIG. 10 is a flowchart showing the details of the processing of step S407 in FIG. 7.
  • the uniqueness evaluation unit 401 acquires a reference score from the storage unit 300.
  • the uniqueness evaluation unit 401 calculates the cosine similarity between the reference score and the word vector for each evaluation word.
  • step S4073 the uniqueness evaluation unit 401 calculates an evaluation value for each evaluation word from the cosine similarity calculated for each evaluation word.
  • step S4074 the uniqueness evaluation unit 401 calculates the maximum evaluation value calculated for each evaluation word as the final evaluation value of the answer indicated by the target text.
  • FIG. 11 is an explanatory diagram of the process in FIG. 10. Unnecessary words are deleted from the segmented target text D1, and evaluation words are extracted from the target text D1. Here, “pattern,” “part,” “straddle,” and “play” are extracted as evaluation words.
  • the uniqueness evaluation unit 401 calculates a word vector for each of these four evaluation words.
  • Map 1101 shows the vector space into which the word vectors are mapped.
  • the evaluation word "straddle” has the greatest distance from the reference point and therefore the greatest evaluation value. Therefore, the uniqueness evaluation unit 401 calculates the evaluation value of the evaluation word "straddle” as the final evaluation value.
  • the word vector with the greatest evaluation value is an example of a vector that has the lowest similarity to the second vector among the multiple vectors corresponding to each of the multiple words included in the first answer text.
  • FIG. 12 is a flowchart showing an example of lighting control processing.
  • the lighting control unit 501 of the lighting control server 500 obtains the final evaluation value for the answer from the storage unit 300 using the communication unit 502.
  • the final evaluation value of each of multiple answers proposed by users for a certain topic over a certain period of time is obtained.
  • step S502 the lighting control unit 501 calculates the number of responses whose final evaluation value is equal to or less than a threshold value.
  • the final evaluation value is an example of multiple scores derived for multiple responses obtained during a certain period of time.
  • step S503 the lighting control unit 501 determines whether the number of answers is equal to or greater than a predetermined number. If the number of answers is equal to or greater than the predetermined number (YES in step S503), the lighting control unit 501 generates a control command to change the lighting light and transmits the generated control command to the lighting device 600 using the communication unit 502.
  • the lighting control unit 501 may generate a control command to lower the color temperature of the current lighting light by a predetermined level and increase the illuminance by a predetermined level.
  • the condition that the number of answers is equal to or greater than a predetermined number corresponds to an example of the first condition indicating a low score.
  • the process of FIG. 12 may be executed for the replies created during the next fixed period.
  • the lighting device 600 is controlled on a fixed period basis, but this is one example, and a determination may be made each time the user gives an answer as to whether or not to change the control of the lighting device. For example, if the final evaluation value for a certain answer is equal to or less than a threshold value, the lighting control unit 501 may generate a control command and send the control command to the lighting device 600.
  • the condition that the final evaluation value is equal to or less than a threshold value is another example of the first condition.
  • the user can be encouraged to create a more unique answer.
  • FIG. 13 is a diagram showing an example of a display screen G1 displayed on the terminal 100 of the user U1 in step S409 of FIG. 7.
  • the display screen G1 includes a user display section R1 that displays the user U1 who has answered and the other users who have answered, and an evaluation result display section R2 that displays the evaluation results for the answers of the user U1 and the other users.
  • the evaluation result display area R2 displays each user's answer and the final evaluation value for the answer. If each user has provided multiple answers, the evaluation result display area R2 displays the multiple answers as well as the average of the final evaluation values for each answer. Alternatively, the evaluation result display area R2 may display the final evaluation value for each answer.
  • FIG. 14 is a diagram showing an example of a booth 1500.
  • the booth 1500 is composed of a box that separates the space in which the user is located from the outside space.
  • a lighting device 600 is installed on the ceiling of the booth 1500.
  • a chair 1501 on which the user sits and a desk 1502 on which the user works are installed.
  • a terminal 100 and a microphone 1503 are installed on the desk 1502.
  • the user who answers uses the microphone 1503 or a keyboard to answer.
  • the answers are evaluated by the originality evaluation server 400, and the lighting device 600 is controlled according to the evaluation result. For example, if there are many answers with low evaluation results and the evaluation result is not good, the color temperature of the lighting device 600 is lowered and the illuminance is increased. This encourages the user to create highly original answers.
  • the originality of an answer is evaluated using the vector similarity between the target text and a reference point indicating the standard for answers with low originality associated with the topic, rather than the vector similarity between the topic and the target text indicating the answer to the topic.
  • the originality of an answer created from a certain topic can be evaluated more accurately.
  • the uniqueness evaluation unit 401 may evaluate answers using word vectors calculated for each of multiple corpora. For example, assume that there are two corpora C1 and C2. In this case, the learning models include a learning model M1 corresponding to corpus C1 and a learning model M2 corresponding to corpus C2.
  • the reference points include two reference points P1 and P2 corresponding to corpora C1 and C2, respectively.
  • the data conversion unit 203 inputs the evaluation words contained in the target text into the learning models M1 and M2, respectively, and calculates word vectors V1 and V2 for each evaluation word for each corpus C1 and C2.
  • the uniqueness evaluation unit 401 calculates an evaluation value between the word vector V1 and the reference point P1 for each evaluation word, and determines the maximum value K1 of the obtained evaluation values.
  • the uniqueness evaluation unit 401 calculates an evaluation value between the word vector V2 and the reference point P2 for each evaluation word, and determines the maximum value K2 of the obtained evaluation values.
  • the uniqueness evaluation unit 401 then calculates the average value of the maximum values K1 and K2 as the final evaluation value.
  • Modification 2 is the same as modification 1 up to the process of calculating word vectors V1 and V2 for each evaluation word using corpora C1 and C2.
  • the three word vectors V1 corresponding to corpus C1 are represented as V11, V12, and V13
  • the three word vectors V2 corresponding to corpus C2 are represented as V21, V22, and V23.
  • the uniqueness evaluation unit 401 calculates one word vector V1' by calculating the element product of the word vectors V11 to V13, and calculates one word vector V2' by calculating the element product of the word vectors V21 to V23.
  • the uniqueness evaluation unit 401 calculates the cosine similarity between the word vector V1' and the reference point P1, and calculates an evaluation value E1 of the answer corresponding to the corpus C1 from the calculated cosine similarity, and calculates the cosine similarity between the word vector V2' and the reference point P2, and calculates an evaluation value E2 of the answer corresponding to the corpus C2 from the calculated cosine similarity.
  • the uniqueness evaluation unit 401 calculates a final evaluation value by inputting evaluation value E1 and evaluation value E2 into a previously created covariance structure analysis model.
  • This covariance structure analysis model is a model that is trained to increase the weighting value of corpora that calculate evaluation values close to human evaluations and decrease the weighting value of corpora that calculate evaluation values far from human evaluations.
  • the weighting value of evaluation values corresponding to corpora that produce evaluations far from human evaluations is decreased while the weighting value of evaluation values corresponding to corpora that produce evaluations close to human evaluations is increased to calculate the final evaluation value. This makes it possible to evaluate answers with high accuracy while suppressing evaluation deviations due to corpora.
  • the reference calculation server 200, the storage unit 300, the uniqueness evaluation server 400, and the lighting control server 500 are configured as separate computers, but may be configured as one computer. Each block of the reference calculation server 200, the storage unit 300, the uniqueness evaluation server 400, and the lighting control server 500 may be provided in the terminal 100.
  • the uniqueness evaluation unit 401 acquires the target vector converted by the data conversion unit 403, but this is just one example, and the target vector calculated by a device provided outside the information processing system 1 may be acquired from the device.
  • the reference calculation unit 201 acquires the target vector converted by the data conversion unit 203, but this is just one example, and the target vector calculated by a device provided outside the information processing system 1 may be acquired from the device. That is, the acquisition of the first vector and the second vector in the present disclosure includes a mode in which the first vector and the second vector are acquired from an external device in addition to calculating the first vector and the second vector.
  • the cosine similarity is used as the similarity, but the present disclosure is not limited to this.
  • the Pearson correlation coefficient may be used as the similarity.
  • This disclosure is useful in the field of technology for improving user work efficiency.
  • Information processing system 100 Terminal 101: Input unit 102: Communication unit 103: Screen generation unit 104: Screen control unit 105: Display unit 200: Reference calculation server 201: Reference calculation unit 202: Communication unit 203: Data conversion unit 204: Subject setting unit 300: Storage unit 301: Storage unit 302: Communication unit 400: Uniqueness evaluation server 401: Uniqueness evaluation unit 402: Communication unit 403: Data conversion unit 404: Subject setting unit 500: Lighting control server 501: Lighting control unit 502: Communication unit 600: Lighting device

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This information processing system: acquires a first vector corresponding to a first response text item indicating a response to a subject matter; acquires a second vector corresponding to a second response text item indicating a response arrived at by association with the subject matter; and evaluates the uniqueness of the response indicated by the first answer text item, on the basis of the magnitude of the similarity between the first vector and the second vector.

Description

情報処理方法及び情報処理システムInformation processing method and information processing system

 本開示は、人のアイデアを評価する技術に関するものである。 This disclosure relates to technology for evaluating people's ideas.

 近年、人の言語的創造性を評価するために様々な研究が行われている。例えば、非特許文献1には、セマンティックディスタンスにより言語的創造性を自動的にスコアリングする最近の研究を利用することによって、人の創造性を評価する際の主観的な限界に対処する技術が開示されている。 In recent years, various studies have been conducted to evaluate people's linguistic creativity. For example, Non-Patent Document 1 discloses a technology that addresses the subjective limitations in evaluating people's creativity by utilizing recent research on automatically scoring linguistic creativity using semantic distance.

Automating creativity assessment with SemDis: An openplatform for computing semantic distance、Roger E. Beaty & Dan R. Johnson、Published online: 31 August 2020Automating creativity assessment with SemDis: An op enplatform for computing semantic distance, Roger E. Beaty & Dan R. Johnson, Published online: 31 August 2020

 ここで、創造性は、独自性、流暢性、及び柔軟性といった主に3つの観点で評価され得るが、セマンティックディスタンス等の意味空間における距離は主に独自性に関連している。よって、非特許文献1で称される創造性とは、厳密には独自性に深く関連していると考えられる。 Here, creativity can be evaluated from three main perspectives: originality, fluency, and flexibility. However, distance in the meaning space, such as semantic distance, is primarily related to originality. Therefore, strictly speaking, the creativity referred to in Non-Patent Document 1 is considered to be closely related to originality.

 ある主題を元に創作された回答がありきたりな内容であっても、その回答の主題からのベクトル距離が離れているという理由で、独自性が高いと判断されることは望ましくない。非特許文献1の技術では、このような課題は考慮されていないので、ある主題から創作された回答の独自性を正確に判断できないという課題がある。 Even if an answer created based on a certain topic has ordinary content, it is undesirable for the answer to be judged to be highly unique simply because its vector distance from the topic is large. The technology in Non-Patent Document 1 does not take such issues into consideration, and therefore has the problem of being unable to accurately judge the uniqueness of an answer created from a certain topic.

 本開示は、ある主題から創作された回答の独自性をより正確に評価する技術を提供するものである。 The present disclosure provides a technique for more accurately assessing the originality of answers created from a given subject.

 本開示の一態様における情報処理方法は、コンピュータにおける情報処理方法であって、主題に対する回答を示す第1回答テキストに対応する第1ベクトルを取得し、前記主題から連想される回答を示す第2回答テキストに対応する第2ベクトルを取得し、前記第1ベクトル及び前記第2ベクトルの類似度の大きさに基づき、前記第1回答テキストが示す前記回答の独自性を評価する。なお、この包括的又は具体的な態様は、装置、システム、集積回路、コンピュータプログラム又はコンピュータ読み取り可能な記録媒体で実現されてもよく、方法、装置、システム、集積回路、コンピュータプログラム及び記録媒体の任意な組み合わせで実現されてもよい。コンピュータ読み取り可能な記録媒体は、例えばCD-ROM(CompactDisc-Read Only Memory)などの不揮発性の記録媒体を含む。 An information processing method according to one aspect of the present disclosure is an information processing method in a computer, which obtains a first vector corresponding to a first answer text indicating an answer to a topic, obtains a second vector corresponding to a second answer text indicating an answer associated with the topic, and evaluates the uniqueness of the answer indicated by the first answer text based on the degree of similarity between the first vector and the second vector. Note that this comprehensive or specific aspect may be realized by a device, a system, an integrated circuit, a computer program, or a computer-readable recording medium, or may be realized by any combination of a method, a device, a system, an integrated circuit, a computer program, and a recording medium. The computer-readable recording medium includes, for example, a non-volatile recording medium such as a CD-ROM (Compact Disc-Read Only Memory).

 本開示によれば、ある主題から創作された回答の独自性をより正確に評価できる。 The present disclosure allows for a more accurate assessment of the uniqueness of answers created from a given topic.

本開示の実施の形態における情報処理システムのネットワーク構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of a network configuration of an information processing system according to an embodiment of the present disclosure. 情報処理システムの詳細構成の一例を示すブロック図である。FIG. 2 is a block diagram illustrating an example of a detailed configuration of the information processing system. 基準テキストテーブルのデータ構成の一例を示す図である。FIG. 13 is a diagram illustrating an example of a data configuration of a reference text table. 評価結果テーブルのデータ構成の一例を示す図である。FIG. 13 is a diagram illustrating an example of a data configuration of an evaluation result table. 情報処理システムの処理の流れを示すシーケンス図である。FIG. 2 is a sequence diagram showing a processing flow of the information processing system. 基準回答を取得する際の情報処理システムの処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process of the information processing system when acquiring a reference answer. 回答を取得する際の情報処理システムの処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process of the information processing system when acquiring an answer. 図6に示すステップS205及び図7に示すステップS405の処理の詳細を示すフローチャートである。8 is a flowchart showing details of the process of step S205 shown in FIG. 6 and step S405 shown in FIG. 7. 図8の処理の説明図である。FIG. 9 is an explanatory diagram of the process of FIG. 8 . 図7のステップS407の処理の詳細を示すフローチャートである。8 is a flowchart showing details of the process in step S407 in FIG. 7; 図10の処理の説明図である。FIG. 11 is an explanatory diagram of the process of FIG. 10 . 照明制御の処理の一例を示すフローチャートである。10 is a flowchart illustrating an example of a lighting control process. 図7のステップS409において、ユーザの端末に表示される表示画面の一例を示す図である。FIG. 8 is a diagram showing an example of a display screen displayed on a user's terminal in step S409 in FIG. 7. ブースの一例を示す図である。FIG. 1 is a diagram illustrating an example of a booth.

 (本開示の基礎となる知見)
 本発明者らは、ある主題をユーザに与えたときのユーザの回答がどの程度独自性を有しているかをコンピュータに評価させる研究を進めている。このような評価をコンピュータが行う場合、以下の手法が想定される。コンピュータは、主題とそれに対する回答とのそれぞれを単語ベクトルで表現する。次に、両単語ベクトルのコサイン距離を算出する。そして、コサイン距離が大きいほど、その回答の独自性は高い判断する。
(Foundational knowledge of the present disclosure)
The inventors are conducting research into having a computer evaluate the degree of originality of a user's answer when a certain topic is given to the user. When a computer performs such an evaluation, the following method is assumed. The computer represents each of the topic and the answer to that topic as a word vector. Next, the computer calculates the cosine distance between both word vectors. The larger the cosine distance, the more original the answer is determined to be.

 以下、具体例を示す。主題として「布団の使い方」をユーザに与え、「床に敷く」との回答が得られたとする。この主題に対する回答「床に敷く」は多くの人にとってありきたりと考えられており、独自性が低いと認識されている。しかしながら、「布団の使い方」と「床に敷く」とのそれぞれを単語ベクトルで表現した場合、両単語ベクトルのコサイン距離は離れてしまうことが起こり得る。この場合、回答「床に敷く」は独自性が高いと判断されてしまう。これでは、主題に対する回答の独自性を正確に評価できない。 Below is a concrete example. Suppose the topic "how to use a futon" is given to a user, and the answer "lay it on the floor" is obtained. The answer to this topic, "lay it on the floor," is considered commonplace by many people, and is recognized as having low uniqueness. However, if "how to use a futon" and "lay it on the floor" are each expressed as word vectors, the cosine distance between the two word vectors may be far apart. In this case, the answer "lay it on the floor" will be determined to be highly unique. This makes it impossible to accurately evaluate the uniqueness of answers to a topic.

 そこで、本発明者らは、ある主題を基準にするのではなくその主題に対して独自性の低い回答を事前に収集しておき、収集した独自性の低い回答を基準にユーザが創作した回答を評価すれば、かかる回答を正確に評価し得るとの知見を得て本開示を想到するに至った。 The inventors therefore came up with the idea of the present disclosure, based on the knowledge that, rather than using a certain topic as a standard, answers with low originality on that topic are collected in advance, and answers created by users are evaluated based on the collected answers with low originality.

 (1)本開示の一態様における情報処理方法は、コンピュータにおける情報処理方法であって、主題に対する回答を示す第1回答テキストに対応する第1ベクトルを取得し、前記主題から連想される回答を示す第2回答テキストに対応する第2ベクトルを取得し、前記第1ベクトル及び前記第2ベクトルの類似度の大きさに基づき、前記第1回答テキストが示す前記回答の独自性を評価する。 (1) An information processing method according to one aspect of the present disclosure is an information processing method in a computer, which obtains a first vector corresponding to a first answer text indicating an answer to a topic, obtains a second vector corresponding to a second answer text indicating an answer associated with the topic, and evaluates the uniqueness of the answer indicated by the first answer text based on the degree of similarity between the first vector and the second vector.

 この構成によれば、主題と主題に対する回答を示す第1回答テキストとのベクトル類似度ではなく、主題から連想される、独自性が低いと考えられる回答を示す第2回答テキストと第1回答テキストとのベクトルの類似度を用いて回答の独自性が評価されている。そのため、独自性が低いと考えられる回答が、誤って独自性が高いと評価されるリスクを低減できる。その結果、ある主題から創作された回答の独自性をより精度よく評価できる。 With this configuration, the originality of an answer is evaluated using the vector similarity between the first answer text and a second answer text indicating an answer that is associated with the topic and is considered to be less original, rather than the vector similarity between the topic and a first answer text indicating an answer to the topic. This reduces the risk that an answer that is considered to be less original will be erroneously evaluated as being more original. As a result, the originality of an answer created from a certain topic can be evaluated more accurately.

 (2)上記(1)記載の情報処理方法において、前記第2回答テキストは、複数のテキストを含み、前記第2ベクトルは、前記複数のテキストの各々をベクトルに変換することによって生成される複数のベクトルから決定される重心位置に対応するベクトルであってもよい。 (2) In the information processing method described in (1) above, the second answer text may include a plurality of texts, and the second vector may be a vector corresponding to a center of gravity position determined from a plurality of vectors generated by converting each of the plurality of texts into a vector.

 この構成によれば、第2ベクトルは、第2回答テキストの特徴を表す1つのベクトルで構成されるので、第1ベクトルと第2ベクトルとの類似度の演算が容易になる。 With this configuration, the second vector is composed of a single vector that represents the characteristics of the second answer text, making it easy to calculate the similarity between the first vector and the second vector.

 (3)上記(1)又は(2)記載の情報処理方法において、前記類似度は、前記第1ベクトルと前記第2ベクトルに基づいて導出されたコサイン類似度であってもよい。 (3) In the information processing method described in (1) or (2) above, the similarity may be a cosine similarity derived based on the first vector and the second vector.

 この構成によれば、第1ベクトルと第2ベクトルとの類似度を正確に数値化でき、第1回答テキストの独自性をより正確に評価できる。 With this configuration, the similarity between the first vector and the second vector can be accurately quantified, and the uniqueness of the first answer text can be more accurately evaluated.

 (4)上記(1)~(3)のいずれか1つに記載の情報処理方法において、前記第1回答テキストは、ユーザの発話に基づく音声データをテキストデータに変換することにより取得されてもよい。 (4) In the information processing method described in any one of (1) to (3) above, the first answer text may be obtained by converting voice data based on the user's speech into text data.

 この構成によれば、第1回答テキストを音声で入力できるので、第1回答テキストの入力負担が軽減される。 With this configuration, the first answer text can be input by voice, reducing the input burden of the first answer text.

 (5)上記(1)~(4)のいずれか1つに記載の情報処理方法において、前記回答の独自性を評価することは、前記回答の独自性の高さに応じたスコアを導出することを含んでもよい。 (5) In the information processing method described in any one of (1) to (4) above, evaluating the uniqueness of the answer may include deriving a score according to the degree of uniqueness of the answer.

 この構成によれば、独自性の高さに応じたスコアを用いて第1回答テキストが評価されているので、第1回答テキストの独自性の評価が容易になる。 With this configuration, the first answer text is evaluated using a score according to the level of uniqueness, making it easier to evaluate the uniqueness of the first answer text.

 (6)上記(5)記載の情報処理方法において、さらに、前記スコアが低いことを示す第1条件を満たす場合に、ユーザの付近を照明する照明装置に対して、照明光の色温度を下げ、かつ、照度を上げる制御を行ってもよい。 (6) In the information processing method described in (5) above, when a first condition indicating that the score is low is met, a lighting device that illuminates the vicinity of the user may be controlled to lower the color temperature of the illumination light and increase the illuminance.

 この構成によれば、スコアが低い場合、照明光の色温度が下げられ、かつ、照度が上がるように照明装置が制御されるので、ユーザの創造性を高めることができ、それによって独自性の高い第1回答テキストの創作を促すことができる。 With this configuration, if the score is low, the lighting device is controlled to lower the color temperature of the illumination light and increase the illuminance, which can enhance the user's creativity and encourage the creation of a highly original first answer text.

 (7)上記(6)記載の情報処理方法において、前記スコアが閾値よりも小さい場合に、前記第1条件を満たすと判定してもよい。 (7) In the information processing method described in (6) above, it may be determined that the first condition is satisfied if the score is smaller than a threshold value.

 この構成によれば、スコアが低いか否かの判断を正確に行うことができる。 This configuration allows for accurate determination of whether a score is low or not.

 (8)上記(6)記載の情報処理方法において、前記第1ベクトルの取得は、複数の第1回答テキストを取得することを含み、前記回答の独自性の評価は、ある期間において取得された前記複数の第1回答テキストのそれぞれに対応する複数の回答について複数のスコアを導出することを含み、前記第1条件の判定は、前記ある期間において、前記複数のスコアのうちスコアが閾値以下である回答が所定の数以上である場合に、前記第1条件を満たすと判定することを含んでもよい。 (8) In the information processing method described in (6) above, obtaining the first vector may include obtaining a plurality of first answer texts, evaluating the uniqueness of the answers may include deriving a plurality of scores for a plurality of answers corresponding to each of the plurality of first answer texts obtained during a certain period of time, and determining that the first condition is met when a predetermined number or more of the plurality of scores have a score equal to or less than a threshold during the certain period of time.

 この構成によれば、ある期間において創作された複数の回答の評価結果から照明装置が制御されるので、照明装置の制御変更が頻発する事態が回避される。 With this configuration, lighting devices are controlled based on the evaluation results of multiple answers created over a certain period of time, which helps to avoid frequent changes to lighting device control.

 (9)上記(1)~(8)のいずれか1つに記載の情報処理方法において、前記第1ベクトルは、前記第1回答テキストに含まれる複数の単語のそれぞれに対応する複数のベクトルの中で、最も前記第2ベクトルとの類似度が低いベクトルであってもよい。 (9) In the information processing method described in any one of (1) to (8) above, the first vector may be a vector that has the lowest similarity to the second vector among a plurality of vectors corresponding to a plurality of words included in the first answer text.

 この構成によれば、第1回答テキストの特徴を1つの指標で表すことができるので、第1回答テキストが長文であっても、その長文の特徴を捉えつつ類似度の算出が容易になる。 With this configuration, the characteristics of the first answer text can be expressed with a single index, so even if the first answer text is a long sentence, it is easy to calculate the similarity while capturing the characteristics of the long sentence.

 (10)上記(1)~(9)のいずれか1つに記載の情報処理方法において、さらに、前記回答に対する独自性の評価結果を表示する表示画面をディスプレイに表示してもよい。 (10) In the information processing method described in any one of (1) to (9) above, a display screen may be displayed on the display to display the evaluation results of the uniqueness of the answer.

 この構成によれば、回答に対する評価結果をユーザにフィードバックできる。 With this configuration, the evaluation results for the answers can be fed back to the user.

 (11)上記(1)~(10)のいずれか1つに記載の情報処理方法において、前記第1ベクトルは、複数のコーパスのそれぞれについて前記第1回答テキストに含まれる複数の単語ごとに算出された複数の単語ベクトルを含み、前記第2ベクトルは、複数のコーパスに対応する複数の基準点を含み、前記独自性の評価は、前記複数の単語ベクトルと、前記複数の基準点とを前記複数のコーパスごとに比較することで、前記複数のコーパスに対応する独自性に関する複数の評価値を算出することと、前記複数の評価値から最終評価値を算出することと、を含んでもよい。 (11) In the information processing method described in any one of (1) to (10) above, the first vector may include a plurality of word vectors calculated for each of a plurality of words included in the first answer text for each of a plurality of corpora, the second vector may include a plurality of reference points corresponding to the plurality of corpora, and the evaluation of the uniqueness may include calculating a plurality of evaluation values related to the uniqueness corresponding to the plurality of corpora by comparing the plurality of word vectors with the plurality of reference points for each of the plurality of corpora, and calculating a final evaluation value from the plurality of evaluation values.

 この構成によれば、複数のコーパスに対応する評価値から最終評価値が算出されているので、コーパスによるブレが抑制され、回答の独自性を高精度に算出できる。 With this configuration, the final evaluation value is calculated from evaluation values corresponding to multiple corpora, so variation due to corpora is suppressed and the uniqueness of the answer can be calculated with high accuracy.

 (12)本開示の別の一態様における情報処理システムは、プロセッサを有する情報処理システムであって、前記プロセッサは、主題に対する回答を示す第1回答テキストに対応する第1ベクトルを取得し、前記主題から連想される回答を示す第2回答テキストに対応する第2ベクトルを取得し、前記第1ベクトル及び前記第2ベクトルの類似度の大きさに基づき、前記第1回答テキストが示す前記回答の独自性を評価する、処理を実行する。 (12) An information processing system according to another aspect of the present disclosure is an information processing system having a processor, the processor executes a process of acquiring a first vector corresponding to a first answer text indicating an answer to a topic, acquiring a second vector corresponding to a second answer text indicating an answer associated with the topic, and evaluating the uniqueness of the answer indicated by the first answer text based on the degree of similarity between the first vector and the second vector.

 この構成によれば、ある主題から創作された回答の独自性をより精度よく評価できる情報処理システムを提供できる。 This configuration provides an information processing system that can more accurately evaluate the uniqueness of answers created from a certain topic.

 本開示は、このような情報処理システムに含まれる特徴的な各構成をコンピュータに実行させる情報処理プログラム、或いはこの情報処理プログラムによって動作する情報処理装置として実現することもできる。また、このようなコンピュータプログラムを、CD-ROM等のコンピュータ読取可能な非一時的な記録媒体あるいはインターネット等の通信ネットワークを介して流通させることができるのは、言うまでもない。 The present disclosure can also be realized as an information processing program that causes a computer to execute each of the characteristic components included in such an information processing system, or as an information processing device that operates by this information processing program. Needless to say, such a computer program can be distributed on a non-transitory computer-readable recording medium such as a CD-ROM or via a communication network such as the Internet.

 なお、以下で説明する実施の形態は、いずれも本開示の一具体例を示すものである。以下の実施の形態で示される数値、形状、構成要素、ステップ、ステップの順序などは、一例であり、本開示を限定する主旨ではない。また、以下の実施の形態における構成要素のうち、最上位概念を示す独立請求項に記載されていない構成要素については、任意の構成要素として説明される。また全ての実施の形態において、各々の内容を組み合わせることもできる。 Note that each of the embodiments described below represents a specific example of the present disclosure. The numerical values, shapes, components, steps, and order of steps shown in the following embodiments are merely examples and are not intended to limit the present disclosure. Furthermore, among the components in the following embodiments, those components that are not described in an independent claim that represents a top-level concept are described as optional components. Furthermore, in all of the embodiments, the respective contents can be combined.

 (実施の形態)
 図1は、本開示の実施の形態における情報処理システム1のネットワーク構成の一例を示す図である。情報処理システム1は、ユーザに主題を与え、その主題に対するユーザの回答を評価するシステムである。情報処理システム1は、端末100、基準計算サーバ200、記憶ユニット300、独自性評価サーバ400、照明制御サーバ500、及び照明装置600を含む。端末100~照明装置600は、ネットワークを介して相互に通信可能に接続されている。ネットワークは、例えばインターネット及び携帯電話通信網などを含む広域通信網である。図1において、端末100及び照明装置600は、ユーザが所在する空間に設置されている。空間の一例は図14で後述するブース1500である。
(Embodiment)
FIG. 1 is a diagram showing an example of a network configuration of an information processing system 1 according to an embodiment of the present disclosure. The information processing system 1 is a system that provides a topic to a user and evaluates the user's answer to the topic. The information processing system 1 includes a terminal 100, a reference calculation server 200, a storage unit 300, a uniqueness evaluation server 400, a lighting control server 500, and a lighting device 600. The terminals 100 to the lighting device 600 are connected to each other so as to be able to communicate with each other via a network. The network is a wide area communication network including, for example, the Internet and a mobile phone communication network. In FIG. 1, the terminal 100 and the lighting device 600 are installed in a space where a user is located. An example of the space is a booth 1500, which will be described later in FIG. 14.

 端末100は、ユーザにより使用されるコンピュータである。端末100は、例えば、スマートフォン及びタブレット型コンピュータ等の携帯型のコンピュータであってもよいし、デスクトップコンピュータであってもよい。端末100は、マイク、キーボード、USBメモリ、及びモニタなどを含む。マイクは主題に対する回答としてユーザに発話された音声データを取得する。キーボードは、主題に対する回答としてユーザにより入力された回答のテキストを取得する。以下、このテキストを対象テキストと呼ぶ。対象テキストは第1テキストの一例である。USBメモリは、対象テキストを記憶する。モニタは、ユーザの回答に対する評価結果を示す表示画面を表示する。 The terminal 100 is a computer used by a user. The terminal 100 may be, for example, a portable computer such as a smartphone or a tablet computer, or may be a desktop computer. The terminal 100 includes a microphone, a keyboard, a USB memory, and a monitor. The microphone acquires voice data spoken by the user as an answer to the topic. The keyboard acquires the text of the answer entered by the user as an answer to the topic. Hereinafter, this text is referred to as the target text. The target text is an example of the first text. The USB memory stores the target text. The monitor displays a display screen showing the evaluation results for the user's answer.

 基準計算サーバ200、独自性評価サーバ400、及び照明制御サーバ500は、例えばクラウドサーバなどのコンピュータで構成される。記憶ユニット300は、ハードディスクドライブ(HDD)及びソリッドステートドライブなどの書き換え可能な不揮発性の記憶装置を備えるコンピュータで構成されている。基準計算サーバ200は、後述する基準点を算出する。独自性評価サーバ400は、ユーザの回答の独自性を評価する。照明制御サーバ500は、独自性の評価結果に基づいて照明装置600を制御する。記憶ユニット300は、回答の独自性を評価するにあたり必要となるデータを記憶する。 The reference calculation server 200, the uniqueness evaluation server 400, and the lighting control server 500 are configured as computers such as a cloud server. The storage unit 300 is configured as a computer equipped with a rewritable non-volatile storage device such as a hard disk drive (HDD) and a solid state drive. The reference calculation server 200 calculates a reference point, which will be described later. The uniqueness evaluation server 400 evaluates the uniqueness of the user's answer. The lighting control server 500 controls the lighting device 600 based on the results of the uniqueness evaluation. The storage unit 300 stores data required to evaluate the uniqueness of the answer.

 照明装置600は、照明制御サーバ500の制御の下、図14に示すブース1500内を照明する。 The lighting device 600 illuminates the inside of the booth 1500 shown in FIG. 14 under the control of the lighting control server 500.

 図2は、情報処理システム1の詳細構成の一例を示すブロック図である。 FIG. 2 is a block diagram showing an example of a detailed configuration of the information processing system 1.

 端末100は、入力部101、通信部102、画面生成部103、画面制御部104、及び表示部105を含む。 The terminal 100 includes an input unit 101, a communication unit 102, a screen generation unit 103, a screen control unit 104, and a display unit 105.

 入力部101は、図1に示すマイク、キーボード、USBメモリなどで構成される。入力部101は、対象テキストを取得する。入力部101は、ユーザにより入力された主題を示す主題データを取得する。主題データは、テキストであってもよいし、音声データであってもよい。通信部102は、端末100をネットワークに接続する通信回路で構成される。画面生成部103はユーザによる回答の独自性の評価結果を示す表示画面を生成する。画面制御部104は、画面生成部103により生成された表示画面を表示部105に表示する。表示部105は、図1に示すモニタで構成され、画面制御部104の制御の下、表示画面を表示する。 The input unit 101 is composed of a microphone, keyboard, USB memory, etc., as shown in FIG. 1. The input unit 101 acquires target text. The input unit 101 acquires subject data indicating a subject input by the user. The subject data may be text or may be audio data. The communication unit 102 is composed of a communication circuit that connects the terminal 100 to a network. The screen generation unit 103 generates a display screen indicating the evaluation result of the originality of the user's answer. The screen control unit 104 displays the display screen generated by the screen generation unit 103 on the display unit 105. The display unit 105 is composed of the monitor shown in FIG. 1, and displays the display screen under the control of the screen control unit 104.

 基準計算サーバ200は、基準算出部201、通信部202、データ変換部203、及び主題設定部204を含む。 The reference calculation server 200 includes a reference calculation unit 201, a communication unit 202, a data conversion unit 203, and a theme setting unit 204.

 基準算出部201は、主題設定部204により設定された主題データに対応する基準ベクトルの代表値である基準点を算出する。基準点は、主題から連想される独自性が低い回答である基準回答を示す基準回答テキストに対応するベクトル空間上の点である。基準回答テキストは、第2回答テキストの一例である。基準点は第2ベクトルの一例である。基準ベクトルは、例えばWord2vecなどの単語をベクトル化する手法を用いて算出される。 The reference calculation unit 201 calculates a reference point that is a representative value of a reference vector corresponding to the theme data set by the theme setting unit 204. The reference point is a point in a vector space that corresponds to a reference answer text that indicates a reference answer that is an answer that is associated with the theme and has low uniqueness. The reference answer text is an example of a second answer text. The reference point is an example of a second vector. The reference vector is calculated using a method for vectorizing words, such as Word2vec.

 通信部202は、基準計算サーバ200をネットワークに接続する通信回路で構成される。通信部202は、端末100から送信された主題データを受信する。通信部202は、端末100から送信される基準回答の音声データ又は基準テキストを受信する。 The communication unit 202 is composed of a communication circuit that connects the reference calculation server 200 to a network. The communication unit 202 receives the subject data transmitted from the terminal 100. The communication unit 202 receives the voice data of the reference answer or the reference text transmitted from the terminal 100.

 データ変換部203は、通信部202により受信された主題データが音声データである場合、当該音声データをテキストデータに変換する。データ変換部203は、通信部202により受信された基準回答の音声データをテキストデータに変換することで基準テキストを取得する。 If the subject data received by the communication unit 202 is voice data, the data conversion unit 203 converts the voice data into text data. The data conversion unit 203 obtains the reference text by converting the voice data of the reference answer received by the communication unit 202 into text data.

 主題設定部204は、通信部202により受信された主題データを設定する。基準算出部201、データ変換部203、及び主題設定部204は例えばCPUなどのプロセッサが情報処理プログラムを実行することで実現されてもよいし、専用のハードウェア回路で構成されてもよい。 The subject setting unit 204 sets the subject data received by the communication unit 202. The criteria calculation unit 201, the data conversion unit 203, and the subject setting unit 204 may be realized by a processor such as a CPU executing an information processing program, or may be configured by a dedicated hardware circuit.

 独自性評価サーバ400は、独自性評価部401、通信部402、データ変換部403、及び主題設定部404を含む。 The originality evaluation server 400 includes an originality evaluation unit 401, a communication unit 402, a data conversion unit 403, and a theme setting unit 404.

 独自性評価部401は、データ変換部403により生成された対象ベクトルと基準計算サーバ200により算出された基準ベクトルとの類似度の大きさに基づき、対象テキストが示す回答の独自性を評価する評価値を算出する。例えば、類似度は、コサイン類似度で定義される。コサイン類似度は、対象ベクトルをa、基準ベクトルをbとすると、a・b/|a|・|b|で表される。コサイン類似度が大きいほど、対象ベクトルと基準ベクトルとの類似性が高まるので、回答の独自性は低くなる。したがって、評価値はコサイン類似度が小さいほど値が大きくなるように定義される。例えば、評価値は、(1-コサイン類似度)×50で定義される。 The uniqueness evaluation unit 401 calculates an evaluation value for evaluating the uniqueness of the answer indicated by the target text, based on the magnitude of similarity between the target vector generated by the data conversion unit 403 and the reference vector calculated by the reference calculation server 200. For example, the similarity is defined as cosine similarity. If the target vector is a and the reference vector is b, the cosine similarity is expressed as a·b/|a|·|b|. The larger the cosine similarity, the higher the similarity between the target vector and the reference vector, and therefore the lower the uniqueness of the answer. Therefore, the evaluation value is defined so that the smaller the cosine similarity, the higher the value. For example, the evaluation value is defined as (1-cosine similarity)×50.

 通信部402は、独自性評価サーバ400をネットワークに接続する通信回路で構成され、基準計算サーバ200により算出された基準点を記憶ユニット300から受信する。通信部402は、端末100から送信された主題データを受信する。通信部402は、端末100から送信された回答の音声データ又は対象テキストを受信する。 The communication unit 402 is composed of a communication circuit that connects the uniqueness evaluation server 400 to a network, and receives the reference points calculated by the reference calculation server 200 from the storage unit 300. The communication unit 402 receives the subject data transmitted from the terminal 100. The communication unit 402 receives the answer voice data or target text transmitted from the terminal 100.

 データ変換部403は、端末100から回答の音声データが送信された場合、当該音声データをテキストデータに変換することによって、対象テキストを取得する。データ変換部403は、対象テキストをベクトルで表現した対象ベクトルを算出する。対象ベクトルは、例えばWord2vecなどのベクトル化手法を用いてベクトル化される。対象ベクトルは第1ベクトルの一例である。 When answering voice data is transmitted from the terminal 100, the data conversion unit 403 obtains the target text by converting the voice data into text data. The data conversion unit 403 calculates a target vector that represents the target text as a vector. The target vector is vectorized using a vectorization method such as Word2vec. The target vector is an example of a first vector.

 主題設定部404は、通信部402で受信された主題データを設定する。 The subject setting unit 404 sets the subject data received by the communication unit 402.

 独自性評価部401、データ変換部403、及び主題設定部404は、CPUなどのプロセッサが情報処理プログラムを実行することで実現されてもよいし、専用のハードウェア回路により実現されてもよい。 The uniqueness evaluation unit 401, the data conversion unit 403, and the theme setting unit 404 may be realized by a processor such as a CPU executing an information processing program, or may be realized by a dedicated hardware circuit.

 照明制御サーバ500は、照明制御部501及び通信部502を含む。照明制御部501は、独自性評価サーバ400により算出された評価値に基づいて、照明装置600の照度及び色温度を設定するための制御コマンドを生成する。通信部502は、照明制御部501により生成された制御コマンドを照明装置600に送信する。通信部502は、独自性評価サーバ400により算出された評価値を記憶ユニット300から受信する。照明制御部501は、CPUなどのプロセッサが情報処理プログラムを実行することで実現されてもよいし、専用のハードウェア回路で構成されてもよい。通信部502は、通信回路で構成され、制御コマンドを照明装置600に送信する。 The lighting control server 500 includes a lighting control unit 501 and a communication unit 502. The lighting control unit 501 generates a control command for setting the illuminance and color temperature of the lighting device 600 based on the evaluation value calculated by the uniqueness evaluation server 400. The communication unit 502 transmits the control command generated by the lighting control unit 501 to the lighting device 600. The communication unit 502 receives the evaluation value calculated by the uniqueness evaluation server 400 from the storage unit 300. The lighting control unit 501 may be realized by a processor such as a CPU executing an information processing program, or may be configured as a dedicated hardware circuit. The communication unit 502 is configured as a communication circuit, and transmits a control command to the lighting device 600.

 記憶ユニット300は、記憶部301及び通信部302を含む。記憶部301は、書き換え可能な不揮発性の記憶装置で構成される。記憶部301は、図3に示す基準テキストテーブルT1及び図4に示す評価結果テーブルT2を記憶する。記憶部301は、基準計算サーバ200及び独自性評価サーバ400が基準ベクトル及び対象ベクトルを算出する際に使用する学習モデルを記憶する。学習モデルは、例えば、入力された単語のテキストからその単語のベクトルを算出するために予め機械学習技術により作成された学習モデルが採用される。本実施の形態では、学習モデルとしてWord2vecで単語をベクトル表現するモデルを採用する。 The storage unit 300 includes a storage section 301 and a communication section 302. The storage section 301 is configured as a rewritable non-volatile storage device. The storage section 301 stores a reference text table T1 shown in FIG. 3 and an evaluation result table T2 shown in FIG. 4. The storage section 301 stores a learning model used when the reference calculation server 200 and the uniqueness evaluation server 400 calculate a reference vector and a target vector. For example, a learning model that is created in advance by machine learning technology to calculate the vector of an input word from the text of that word is adopted as the learning model. In this embodiment, a model that expresses words as vectors in Word2vec is adopted as the learning model.

 図3は、基準テキストテーブルT1のデータ構成の一例を示す図である。基準テキストテーブルT1は、複数の主題のそれぞれについて独自性の低い回答を示す基準テキストを記憶する。 FIG. 3 is a diagram showing an example of the data configuration of the reference text table T1. The reference text table T1 stores reference texts that indicate answers with low originality for each of a number of topics.

 基準テキストテーブルT1は、主題、基準テキスト、及び1次元~300次元の基準ベクトルの要素を対応付けて記憶する。例えば主題「鍋」ついて「野菜を煮る」、「湯を沸かす」が基準テキストとして記憶されている。このように基準テキストは、主題からユーザが容易に連想できる独自性の低い回答を示す。ここでは、基準ベクトルは300次元の要素で構成されたが、これは一例であり、300以外の適宜の次元の要素で構成されてもよい。 The reference text table T1 stores themes, reference texts, and elements of reference vectors of 1 to 300 dimensions in association with each other. For example, for the theme "pot," "cook vegetables" and "boil water" are stored as reference texts. In this way, the reference texts indicate answers with low originality that the user can easily associate with the theme. Here, the reference vector is composed of 300-dimensional elements, but this is just one example, and it may be composed of elements of any dimension other than 300.

 図4は、評価結果テーブルT2のデータ構成の一例を示す図である。評価結果テーブルT2は、複数の主題のそれぞれについてユーザの回答を示す対象テキストと、その回答に対する評価値とを対応付けて記憶する。評価結果テーブルT2は、主題、対象テキスト、評価語、評価値、及びユーザの項目を含む。評価語は対象テキストの中から評価のために抽出された単語である。ユーザの項目にはユーザの識別情報(例えば名前)が記憶されている。例えば、1行目の評価結果を示すレコードには、主題「鍋」についてユーザ「Bob」は「ロボットの頭にする」と回答し、評価語として「ロボット」と「頭」とが抽出され、評価値として「75」が算出されたことが記録されている。 Figure 4 is a diagram showing an example of the data configuration of the evaluation result table T2. The evaluation result table T2 stores target texts indicating the user's answers for each of a number of topics, in association with the evaluation values for those answers. The evaluation result table T2 includes items for topic, target text, evaluation words, evaluation values, and user. The evaluation words are words extracted from the target text for evaluation. The user item stores the user's identification information (e.g., name). For example, the record showing the evaluation result in the first row records that the user "Bob" answered "to make it the head of the robot" for the topic "pot," and that "robot" and "head" were extracted as evaluation words and that "75" was calculated as the evaluation value.

 回答したユーザは、当該ユーザの端末100に付与された識別情報から推定できる。例えば、独自性評価部401は、ボブの端末100から「ロボットの頭にする」という回答を取得した場合、回答したユーザはボブと推定すればよい。この態様を採用する場合、記憶ユニット300は、端末100の識別情報とユーザの識別情報とを対応付けたユーザ情報を記憶すればよい。そして、独自性評価部401は、端末100の識別情報に対応するユーザの識別情報をユーザ情報から特定すればよい。 The user who answered can be inferred from the identification information assigned to the terminal 100 of that user. For example, if the uniqueness evaluation unit 401 obtains the answer "Make it the head of a robot" from Bob's terminal 100, it can infer that the user who answered is Bob. When this aspect is adopted, the storage unit 300 can store user information that associates the identification information of the terminal 100 with the identification information of the user. Then, the uniqueness evaluation unit 401 can identify the identification information of the user that corresponds to the identification information of the terminal 100 from the user information.

 また、回答したユーザは声紋照合によって推定されてもよい。例えば、記憶ユニット300が各ユーザの声紋情報を記憶している場合、独自性評価部401は、端末100から送信された回答の音声データと当該声紋情報とを照合することによって、ユーザを推定してもよい。この態様は、各ユーザが自身の端末100を用いて回答を入力する態様ではなく、1つの端末100が、複数のユーザの回答の音声データを取得する態様を採用する場合に適用される。 The user who answered may also be estimated by voiceprint matching. For example, if the storage unit 300 stores voiceprint information for each user, the uniqueness evaluation unit 401 may estimate the user by matching the voice data of the answer sent from the terminal 100 with the voiceprint information. This mode is applied when a mode is adopted in which one terminal 100 acquires voice data of answers from multiple users, rather than a mode in which each user inputs an answer using his or her own terminal 100.

 図5は、情報処理システム1の処理の流れを示すシーケンス図である。まず、ステップS201において、端末100が起動される。起動後、ユーザは、どういった主題にするかを決定し、決定した主題を示す主題データを端末100に入力する。ここでは、主題の音声データが主題データとして入力される。入力された主題データは基準計算サーバ200に送信される(ステップS202)。なお、主題データはテキストデータであってもよい。 FIG. 5 is a sequence diagram showing the processing flow of the information processing system 1. First, in step S201, the terminal 100 is started. After starting, the user decides what kind of theme to use, and inputs theme data indicating the decided theme into the terminal 100. Here, audio data of the theme is input as the theme data. The input theme data is sent to the reference calculation server 200 (step S202). Note that the theme data may be text data.

 次に、ユーザは決定した主題から連想される独自性の低い回答である基準回答の音声データを端末100に入力する。ここで、ユーザは思いつく限りの基準回答を入力する。入力された基準回答の音声データは基準計算サーバ200に送信される(ステップS203)。以下、図6を用いてステップS202~S208の処理の詳細を説明する。 Next, the user inputs voice data of reference answers, which are answers with low originality associated with the determined topic, into the terminal 100. Here, the user inputs as many reference answers as he or she can think of. The voice data of the input reference answers is sent to the reference calculation server 200 (step S203). The process of steps S202 to S208 will be described in detail below with reference to FIG. 6.

 図6は、基準回答を取得する際の情報処理システム1の処理の一例を示すフローチャートである。まず、ステップS202において、基準計算サーバ200の主題設定部204は、端末100から送信された主題データを通信部202を用いて取得し、取得した主題データが示す主題を対象となる主題として設定する。 FIG. 6 is a flowchart showing an example of the processing of the information processing system 1 when acquiring a reference answer. First, in step S202, the subject setting unit 204 of the reference calculation server 200 acquires the subject data transmitted from the terminal 100 using the communication unit 202, and sets the subject indicated by the acquired subject data as the target subject.

 次に、ステップS203において、基準計算サーバ200のデータ変換部203は、端末100から送信された基準回答の音声データを通信部202を用いて取得する。ここでは、基準回答の音声データは、これから独自性評価の対象となる回答を行うユーザの端末100から取得する例を示している。しかしながら、これは一例であり、当該ユーザとは異なる不特定多数の一般ユーザの端末100から基準回答の音声データは取得されてもよい。例えば、基準回答はクラウドソーシングにより募集された回答であってもよい。 Next, in step S203, the data conversion unit 203 of the reference calculation server 200 acquires the voice data of the reference answer transmitted from the terminal 100 using the communication unit 202. Here, an example is shown in which the voice data of the reference answer is acquired from the terminal 100 of the user who will provide the answer that will be the subject of the originality evaluation. However, this is just one example, and the voice data of the reference answer may be acquired from the terminals 100 of an unspecified number of general users who are different from the user in question. For example, the reference answer may be an answer collected through crowdsourcing.

 次に、ステップS204において、データ変換部203は、基準回答の音声データをテキストデータに変換する処理を行うことによって基準テキストを取得する。 Next, in step S204, the data conversion unit 203 obtains the reference text by performing a process of converting the voice data of the reference answer into text data.

 ここでは、基準回答の音声データが取得されたが、これは一例であり、基準回答のテキストデータが取得されてもよい。この場合、ステップS203、S204の処理に代えて、基準テキストを取得する処理が実行される。 Here, audio data of the reference answer is acquired, but this is just one example, and text data of the reference answer may be acquired. In this case, instead of the processes of steps S203 and S204, a process of acquiring the reference text is executed.

 次に、ステップS205において、データ変換部203は、基準テキストを記憶ユニット300に記憶された学習モデルに入力することで、基準テキストを基準ベクトルに変換する。この処理の詳細は図6を用いて後述する。 Next, in step S205, the data conversion unit 203 converts the reference text into a reference vector by inputting the reference text into the learning model stored in the storage unit 300. Details of this process will be described later with reference to FIG. 6.

 次に、ステップS206において、基準算出部201は、ステップS203で取得した全ての基準回答を示す全ての基準テキストが基準ベクトルに変換済みであるか否かを判定する。全ての基準テキストが基準ベクトルに変換済みでない場合(ステップS206でNO)、処理はステップS205に戻り、次の基準テキストが基準ベクトルに変換される。全ての基準テキストが基準ベクトルに変換済みである場合(ステップS206でYES)、基準算出部201は、全ての基準ベクトルを用いて基準点を算出する。基準点は、例えば、全ての基準ベクトルのベクトル空間上における重心位置である。 Next, in step S206, the reference calculation unit 201 determines whether or not all reference texts indicating all reference answers obtained in step S203 have been converted into reference vectors. If all reference texts have not been converted into reference vectors (NO in step S206), the process returns to step S205, and the next reference text is converted into a reference vector. If all reference texts have been converted into reference vectors (YES in step S206), the reference calculation unit 201 calculates a reference point using all the reference vectors. The reference point is, for example, the center of gravity position in the vector space of all the reference vectors.

 次に、ステップS208において、基準算出部201は、基準点を記憶ユニット300に保存する。このとき、ステップS202で取得された主題データ、ステップS203で取得された基準テキスト、ステップS205で変換された基準ベクトルは、基準テキストテーブルT1に保存される。これにより、基準テキストテーブルT1に基準テキストが蓄積されていく。 Next, in step S208, the reference calculation unit 201 stores the reference point in the storage unit 300. At this time, the subject data acquired in step S202, the reference text acquired in step S203, and the reference vector converted in step S205 are stored in the reference text table T1. As a result, the reference text is accumulated in the reference text table T1.

 次に、回答を評価する処理について説明する。図5に示すステップS401以降の処理は回答を評価する処理を示している。ステップS401、S402の処理は、ステップS201、S202と同じである。 Next, the process of evaluating the answers will be described. The process from step S401 onwards shown in FIG. 5 shows the process of evaluating the answers. The processes in steps S401 and S402 are the same as those in steps S201 and S202.

 ステップS403において、ユーザはステップS402で入力した課題データが示す課題に対する回答の音声データを端末100に入力する。なお、回答の音声データに代えて対象テキストが入力されてもよい。以下、図7を用いて、ステップS402~S409の処理の詳細を説明する。 In step S403, the user inputs voice data of the answer to the task indicated by the task data input in step S402 into the terminal 100. Note that target text may be input instead of the voice data of the answer. Details of the processing of steps S402 to S409 are explained below with reference to FIG. 7.

 図7は、回答を取得する際の情報処理システム1の処理の一例を示すフローチャートである。ステップS403において、独自性評価サーバ400のデータ変換部403は端末100から送信された回答の音声データを通信部202を用いて取得する。 FIG. 7 is a flowchart showing an example of the processing of the information processing system 1 when acquiring an answer. In step S403, the data conversion unit 403 of the uniqueness evaluation server 400 acquires the voice data of the answer sent from the terminal 100 using the communication unit 202.

 次に、ステップS404において、データ変換部403は、回答の音声データをテキストデータに変換する処理を行い、対象テキストを取得する。 Next, in step S404, the data conversion unit 403 performs a process of converting the voice data of the response into text data and obtains the target text.

 ここでは、回答の音声データが取得されたが、これは一例であり、対象テキストが取得されてもよい。この場合、ステップS403、S404の処理に代えて、対象テキストを取得する処理が実行される。 Here, audio data of the response is acquired, but this is just one example, and the target text may be acquired. In this case, instead of the processing of steps S403 and S404, processing to acquire the target text is executed.

 次に、ステップS405において、データ変換部403は、対象テキストを記憶ユニット300に記憶された学習モデルに入力することで、対象テキストを対象ベクトルに変換する。この処理の詳細は図6を用いて後述する。 Next, in step S405, the data conversion unit 403 converts the target text into a target vector by inputting the target text into the learning model stored in the storage unit 300. Details of this process will be described later with reference to FIG. 6.

 次に、ステップS406において、独自性評価部401は、記憶ユニット300からステップS402で取得された主題データが示す主題に対応する基準点を、通信部402を用いて取得する。 Next, in step S406, the uniqueness evaluation unit 401 uses the communication unit 402 to obtain a reference point corresponding to the subject indicated by the subject data obtained from the storage unit 300 in step S402.

 次に、ステップS407において、独自性評価部401は、基準点と対象ベクトルとのコサイン類似度を算出し、算出したコサイン類似度から回答の独自性を評価する最終評価値を算出する。この処理の詳細は図10を用いて後述する。 Next, in step S407, the uniqueness evaluation unit 401 calculates the cosine similarity between the reference point and the target vector, and calculates a final evaluation value for evaluating the uniqueness of the answer from the calculated cosine similarity. Details of this process will be described later with reference to FIG. 10.

 次に、ステップS408において、独自性評価部401は、算出した最終評価値を、対象テキスト、ユーザの識別情報、評価語、及び対象ベクトルと対応付けて評価結果テーブルT2に保存する。なお、最終評価値は評価結果テーブルT2の評価値の項目に記憶される。 Next, in step S408, the uniqueness evaluation unit 401 stores the calculated final evaluation value in the evaluation result table T2 in association with the target text, the user's identification information, the evaluation words, and the target vector. The final evaluation value is stored in the evaluation value field of the evaluation result table T2.

 次に、ステップS409において、独自性評価部401は、回答の評価結果を示す表示画面を表示するための表示データを生成し、生成した表示データを通信部402を用いて端末100に送信する。この表示データを受信した端末100は、表示部105に表示画面を表示する。 Next, in step S409, the uniqueness evaluation unit 401 generates display data for displaying a display screen showing the evaluation results of the answer, and transmits the generated display data to the terminal 100 using the communication unit 402. The terminal 100 that receives this display data displays the display screen on the display unit 105.

 図8は、図6に示すステップS205及び図7に示すステップS405の処理の詳細を示すフローチャートである。図8の説明において、基準テキスト及び対象テキストを総称してテキストと呼ぶ。また、データ変換部203及びデータ変換部403を総称してデータ変換部Aと呼ぶ。 FIG. 8 is a flowchart showing the details of the processing of step S205 shown in FIG. 6 and step S405 shown in FIG. 7. In the explanation of FIG. 8, the reference text and the target text are collectively referred to as text. Furthermore, the data conversion unit 203 and the data conversion unit 403 are collectively referred to as data conversion unit A.

 ステップS2051において、データ変換部Aは、テキストを取得する。次に、ステップS2052において、データ変換部Aは、読み込んだテキストを単語ごとに分かち書きする。分かち書きには例えば、形態素解析が用いられる。 In step S2051, data conversion unit A acquires text. Next, in step S2052, data conversion unit A separates the read text into words. For example, morphological analysis is used for separating words.

 次に、ステップS2053において、データ変換部Aは、除外品詞リストを参照し、評価対象外となる単語をテキストから削除する。これにより、テキストから評価語が抽出される。評価語は名詞、形容動詞、動詞、及び形容詞である。除外品詞リストはこれらの品詞以外の除外品詞を記憶する。品詞除外リストは例えば記憶ユニット300に記憶されている。除外品詞を持つ単語をテキストデータから削除するのは、除外品詞を持つ単語は、独自性の評価に寄与せず、ノイズとなる恐れがあるからである。 Next, in step S2053, data conversion unit A refers to the excluded parts of speech list and deletes from the text words that are not to be evaluated. This extracts evaluation words from the text. The evaluation words are nouns, adjectival verbs, verbs, and adjectives. The excluded parts of speech list stores excluded parts of speech other than these parts of speech. The excluded parts of speech list is stored, for example, in storage unit 300. Words with excluded parts of speech are deleted from the text data because words with excluded parts of speech do not contribute to the evaluation of uniqueness and may become noise.

 次に、ステップS2054において、データ変換部Aは、テキストから抽出した評価語のそれぞれを単語ベクトルに変換する。基準ベクトルは、評価語ごとに算出された単語ベクトルのデータセットで構成される。この場合、基準算出部201は、複数の基準ベクトルのそれぞれについて評価語ごとに算出された複数の単語ベクトルの重心位置を基準点として算出すればよい。対象ベクトルは、評価語ごとに算出された複数の単語ベクトルのデータセットで構成される。 Next, in step S2054, the data conversion unit A converts each of the evaluation words extracted from the text into a word vector. The reference vector is composed of a data set of word vectors calculated for each evaluation word. In this case, the reference calculation unit 201 may calculate the center of gravity of multiple word vectors calculated for each evaluation word for each of the multiple reference vectors as a reference point. The target vector is composed of a data set of multiple word vectors calculated for each evaluation word.

 図9は、図8の処理の説明図である。この例では、主題として「掃除機」が設定されている。ユーザは主題「掃除機」に反応してマイクを用いて回答の音声データを入力する。ここでは、回答として「柄の部分に跨って遊ぶ」が入力されている。 FIG. 9 is an explanatory diagram of the process of FIG. 8. In this example, "vacuum cleaner" is set as the subject. In response to the subject "vacuum cleaner," the user inputs voice data of the answer using a microphone. In this case, the answer "Play by straddling the handle" is input.

 データ変換部403は、回答の音声データをテキストデータに変換する。これにより、対象テキストD1が得られる。次に、データ変換部403は、対象テキストD1を分かち書きする。これにより、対象テキストD1は「柄/の/部分に/跨る/遊ぶ」というように分かち書きされる。 The data conversion unit 403 converts the voice data of the response into text data. As a result, the target text D1 is obtained. Next, the data conversion unit 403 separates the target text D1 into words. As a result, the target text D1 is separated into words such as "straddle/play/with/the/part/of/pattern."

 次に、データ変換部403は、分かち書きされた対象テキストD1から除外品詞を持つ単語である不要語を削除する。これにより、対象テキストD1から、「柄」、「部分」、「跨る」、「遊ぶ」のそれぞれが評価語として抽出される。 Then, the data conversion unit 403 deletes unnecessary words, which are words with excluded parts of speech, from the segmented target text D1. As a result, "pattern," "part," "straddle," and "play" are extracted as evaluation words from the target text D1.

 次に、データ変換部403は、各評価語をWord2vecなどのベクトル化手法を用いて単語ベクトルに変換する。これにより、評価語ごとの単語ベクトルが得られる。 Next, the data conversion unit 403 converts each evaluation word into a word vector using a vectorization method such as Word2vec. This results in a word vector for each evaluation word.

 図10は、図7のステップS407の処理の詳細を示すフローチャートである。ステップS4071において、独自性評価部401は、記憶ユニット300から基準点を取得する。次に、ステップS4072において、独自性評価部401は、評価語ごとに、基準点と、単語ベクトルとのコサイン類似度を算出する。 FIG. 10 is a flowchart showing the details of the processing of step S407 in FIG. 7. In step S4071, the uniqueness evaluation unit 401 acquires a reference score from the storage unit 300. Next, in step S4072, the uniqueness evaluation unit 401 calculates the cosine similarity between the reference score and the word vector for each evaluation word.

 次に、ステップS4073において、独自性評価部401は、評価語ごとに算出したコサイン類似度から評価語ごとの評価値を算出する。 Next, in step S4073, the uniqueness evaluation unit 401 calculates an evaluation value for each evaluation word from the cosine similarity calculated for each evaluation word.

 次に、ステップS4074において、独自性評価部401は、評価語ごとに算出した評価値の最大値を対象テキストが示す回答の最終評価値として算出する。 Next, in step S4074, the uniqueness evaluation unit 401 calculates the maximum evaluation value calculated for each evaluation word as the final evaluation value of the answer indicated by the target text.

 図11は、図10の処理の説明図である。分かち書きされた対象テキストD1から不要語が削除され、対象テキストD1から評価語が抽出される。ここでは、「柄」、「部分」、「跨る」、「遊ぶ」が評価語として抽出されている。 FIG. 11 is an explanatory diagram of the process in FIG. 10. Unnecessary words are deleted from the segmented target text D1, and evaluation words are extracted from the target text D1. Here, "pattern," "part," "straddle," and "play" are extracted as evaluation words.

 独自性評価部401は、これら4つの評価語のそれぞれについて単語ベクトルを算出する。マップ1101は、単語ベクトルがマッピングされたベクトル空間を示している。これら4つの評価語のうち、評価語「跨る」は基準点との距離が最大となるので、評価値が最大となる。そのため、独自性評価部401は、評価語「跨る」の評価値を最終評価値として算出する。評価値が最大の単語ベクトルは、第1回答テキストに含まれる複数の単語のそれぞれに対応する複数のベクトルの中で、最も第2ベクトルとの類似度が低いベクトルの一例である。 The uniqueness evaluation unit 401 calculates a word vector for each of these four evaluation words. Map 1101 shows the vector space into which the word vectors are mapped. Of these four evaluation words, the evaluation word "straddle" has the greatest distance from the reference point and therefore the greatest evaluation value. Therefore, the uniqueness evaluation unit 401 calculates the evaluation value of the evaluation word "straddle" as the final evaluation value. The word vector with the greatest evaluation value is an example of a vector that has the lowest similarity to the second vector among the multiple vectors corresponding to each of the multiple words included in the first answer text.

 このように、複数の評価語のうち基準点との距離が最大となる評価語を特定し、その評価語の評価値を用いて回答を評価することで、長文の回答に対する評価が容易となる。 In this way, by identifying the evaluation word that is the greatest distance from the reference point among multiple evaluation words and evaluating the answer using the evaluation value of that evaluation word, it becomes easier to evaluate long answers.

 次に、評価値に基づく照明制御について説明する。図12は、照明制御の処理の一例を示すフローチャートである。まず、ステップS501において、照明制御サーバ500の照明制御部501は、回答に対する最終評価値を記憶ユニット300から通信部502を用いて取得する。ここでは、ある主題に対して一定期間においてユーザが発案した複数の回答のそれぞれの最終評価値が取得される。 Next, lighting control based on evaluation values will be described. FIG. 12 is a flowchart showing an example of lighting control processing. First, in step S501, the lighting control unit 501 of the lighting control server 500 obtains the final evaluation value for the answer from the storage unit 300 using the communication unit 502. Here, the final evaluation value of each of multiple answers proposed by users for a certain topic over a certain period of time is obtained.

 次に、ステップS502において、照明制御部501は、最終評価値が閾値以下の回答数を算出する。最終評価値は、ある期間において取得された複数の回答について導出される複数のスコアの一例である。 Next, in step S502, the lighting control unit 501 calculates the number of responses whose final evaluation value is equal to or less than a threshold value. The final evaluation value is an example of multiple scores derived for multiple responses obtained during a certain period of time.

 次に、ステップS503において、照明制御部501は、回答数が所定数以上であるか否かを判定する。回答数が所定数以上の場合(ステップS503でYES)、照明制御部501は、照明光を変更する制御コマンドを生成し、生成した制御コマンドを照明装置600に、通信部502を用いて送信する。ユーザの創造性は、照明光の色温度を下げ、且つ、照度を上げると高まることが知られている(例えば、非特許文献:冨本浩一郎、他3名、「ワーカーの創造性の向上を目的としたオフィス照明システム-クリエイティブオフィスのシーンに適した照明条件の研究-」、日本人間工学会関西支部大会講演論文集 巻:2009 ページ:171-174 発行年:2009年12月05日)。そこで、照明制御部501は、現在の照明光の色温度を所定レベル下げ、且つ照度を所定レベル上げる制御コマンドを生成すればよい。回答数が所定数以上という条件は、スコアが低いことを示す第1条件の一例に相当する。 Next, in step S503, the lighting control unit 501 determines whether the number of answers is equal to or greater than a predetermined number. If the number of answers is equal to or greater than the predetermined number (YES in step S503), the lighting control unit 501 generates a control command to change the lighting light and transmits the generated control command to the lighting device 600 using the communication unit 502. It is known that a user's creativity is enhanced by lowering the color temperature of the lighting light and increasing the illuminance (for example, non-patent document: Koichiro Tomimoto and three others, "Office Lighting System Aimed at Improving Worker Creativity - Study of Lighting Conditions Suitable for Creative Office Scenes", Proceedings of the Kansai Branch Conference of the Japan Ergonomics Society, Volume: 2009, Pages: 171-174, Publication Year: December 5, 2009). Therefore, the lighting control unit 501 may generate a control command to lower the color temperature of the current lighting light by a predetermined level and increase the illuminance by a predetermined level. The condition that the number of answers is equal to or greater than a predetermined number corresponds to an example of the first condition indicating a low score.

 一方、回答数が所定数未満の場合(ステップS503でNO)、処理は終了する。この場合、次の一定期間が経過後、次の一定期間に創作された回答について図12の処理が実行されてもよい。図12では、一定期間単位で照明装置600が制御されたが、これは一例であり、ユーザが回答する都度、照明装置の制御を変更するか否かの判定が行われてもよい。例えば、照明制御部501は、ある回答に対する最終評価値が閾値以下であった場合、制御コマンドを生成し、その制御コマンドを照明装置600に送信してもよい。この場合、最終評価値が閾値以下であるという条件は、第1条件の別の一例である。 On the other hand, if the number of replies is less than the predetermined number (NO in step S503), the process ends. In this case, after the next fixed period has passed, the process of FIG. 12 may be executed for the replies created during the next fixed period. In FIG. 12, the lighting device 600 is controlled on a fixed period basis, but this is one example, and a determination may be made each time the user gives an answer as to whether or not to change the control of the lighting device. For example, if the final evaluation value for a certain answer is equal to or less than a threshold value, the lighting control unit 501 may generate a control command and send the control command to the lighting device 600. In this case, the condition that the final evaluation value is equal to or less than a threshold value is another example of the first condition.

 図12の処理によりユーザの回答の独自性が低い又は独自性が低下した場合にユーザの照明環境を制御することによって、独自性の高い回答の創作をユーザに促すことができる。 By controlling the lighting environment of the user when the uniqueness of the user's answer is low or has decreased as a result of the process in FIG. 12, the user can be encouraged to create a more unique answer.

 図13は、図7のステップS409において、ユーザU1の端末100に表示される表示画面G1の一例を示す図である。表示画面G1には、回答をしたユーザU1及び回答をした他のユーザを表示するユーザ表示欄R1と、ユーザU1及び他のユーザの回答に対する評価結果を表示する評価結果表示欄R2とを含む。 FIG. 13 is a diagram showing an example of a display screen G1 displayed on the terminal 100 of the user U1 in step S409 of FIG. 7. The display screen G1 includes a user display section R1 that displays the user U1 who has answered and the other users who have answered, and an evaluation result display section R2 that displays the evaluation results for the answers of the user U1 and the other users.

 ここでは、ユーザU1を含む4名のユーザが回答したので、ユーザ表示欄R1にはこれら4名のユーザの映像が表示されている。評価結果表示欄R2には各ユーザの回答と、回答に対する最終評価値が表示されている。各ユーザが複数の回答をした場合、評価結果表示欄R2には、複数の回答が表示されるとともに、各回答の最終評価値の平均値が表示される。或いは、評価結果表示欄R2には、回答ごとの最終評価値が表示されてもよい。 In this case, four users, including user U1, have provided answers, so images of these four users are displayed in the user display area R1. The evaluation result display area R2 displays each user's answer and the final evaluation value for the answer. If each user has provided multiple answers, the evaluation result display area R2 displays the multiple answers as well as the average of the final evaluation values for each answer. Alternatively, the evaluation result display area R2 may display the final evaluation value for each answer.

 これにより、ユーザU1は、他のユーザがどのような回答を行い、自分及び他のユーザの回答がどのように評価されているかを確認できる。 This allows user U1 to see what answers other users have given and how their answers and those of other users have been evaluated.

 図14は、ブース1500の一例を示す図である。ブース1500は、ユーザが所在する空間を外部空間から仕切る箱体で構成されている。ブース1500の天井には、照明装置600が設置されている。ブース1500の内部には、ユーザが座る椅子1501及びユーザが作業する机1502が設置されている。机1502には端末100及びマイク1503が設置されている。回答するユーザはマイク1503又はキーボードを使って回答を行う。この回答は独自性評価サーバ400により評価され、評価結果に応じて照明装置600が制御される。例えば、評価結果の低い回答が多く、評価結果が芳しくない場合、照明装置600の色温度を下げ且つ照度が上げられる。これにより、ユーザに対して独自性の高い回答の創作が促される。 FIG. 14 is a diagram showing an example of a booth 1500. The booth 1500 is composed of a box that separates the space in which the user is located from the outside space. A lighting device 600 is installed on the ceiling of the booth 1500. Inside the booth 1500, a chair 1501 on which the user sits and a desk 1502 on which the user works are installed. A terminal 100 and a microphone 1503 are installed on the desk 1502. The user who answers uses the microphone 1503 or a keyboard to answer. The answers are evaluated by the originality evaluation server 400, and the lighting device 600 is controlled according to the evaluation result. For example, if there are many answers with low evaluation results and the evaluation result is not good, the color temperature of the lighting device 600 is lowered and the illuminance is increased. This encourages the user to create highly original answers.

 このように本実施の形態によれば、主題と主題に対する回答を示す対象テキストとのベクトル類似度ではなく、主題から連想される独自性が低い回答の基準を示す基準点と対象テキストとのベクトルの類似度を用いて回答の独自性が評価されている。そのため、独自性の低い回答が、誤って独自性が高いと評価されるリスクを低減できる。その結果、ある主題から創作された回答の独自性をより精度よく評価できる。 In this way, according to this embodiment, the originality of an answer is evaluated using the vector similarity between the target text and a reference point indicating the standard for answers with low originality associated with the topic, rather than the vector similarity between the topic and the target text indicating the answer to the topic. This reduces the risk that an answer with low originality will be erroneously evaluated as being highly original. As a result, the originality of an answer created from a certain topic can be evaluated more accurately.

 本開示は以下の変形例が採用できる。 This disclosure can employ the following modifications:

 (変形例1)独自性評価部401は、複数のコーパスごとに算出された単語ベクトルを用いて回答を評価してもよい。例えば2つのコーパスC1、C2があるとする。この場合、学習モデルはコーパスC1に対応する学習モデルM1とコーパスC2に対応する学習モデルM2とを含む。 (Variation 1) The uniqueness evaluation unit 401 may evaluate answers using word vectors calculated for each of multiple corpora. For example, assume that there are two corpora C1 and C2. In this case, the learning models include a learning model M1 corresponding to corpus C1 and a learning model M2 corresponding to corpus C2.

 基準点は、コーパスC1、C2のそれぞれに対応する2つの基準点P1、P2を含む。データ変換部203は対象テキストに含まれる評価語を学習モデルM1、M2のそれぞれに入力し、コーパスC1、C2のそれぞれについて、評価語ごとに単語ベクトルV1、V2を算出する。独自性評価部401は、評価語ごとに、単語ベクトルV1と基準点P1との評価値を算出し、得られた評価値の最大値K1を決定する。同様に、独自性評価部401は、評価語ごとに単語ベクトルV2と基準点P2との評価値を算出し、得られた評価値の最大値K2を決定する。そして、独自性評価部401は、最大値K1と最大値K2との平均値を最終評価値として算出する。 The reference points include two reference points P1 and P2 corresponding to corpora C1 and C2, respectively. The data conversion unit 203 inputs the evaluation words contained in the target text into the learning models M1 and M2, respectively, and calculates word vectors V1 and V2 for each evaluation word for each corpus C1 and C2. The uniqueness evaluation unit 401 calculates an evaluation value between the word vector V1 and the reference point P1 for each evaluation word, and determines the maximum value K1 of the obtained evaluation values. Similarly, the uniqueness evaluation unit 401 calculates an evaluation value between the word vector V2 and the reference point P2 for each evaluation word, and determines the maximum value K2 of the obtained evaluation values. The uniqueness evaluation unit 401 then calculates the average value of the maximum values K1 and K2 as the final evaluation value.

 このように複数のコーパスを用いて最終評価値を算出することにより、コーパスによるブレが抑制され、最終評価値を精度よく算出できる。ここでは、説明の便宜上2つのコーパスのケースを例示したが、コーパスの数は3つ以上であってもよい。 By calculating the final evaluation value using multiple corpora in this way, the inconsistencies caused by the corpora are suppressed, and the final evaluation value can be calculated with high accuracy. Here, for the sake of convenience, the case of two corpora is illustrated, but the number of corpora may be three or more.

 (変形例2)対象ベクトルの特徴を表す1つの単語ベクトルを算出するものである。 (Variation 2) Calculates a single word vector that represents the characteristics of the target vector.

 以下、2つのコーパスC1、C2を用いる場合を例に挙げて説明する。コーパスC1、C2を用いて評価語ごとに単語ベクトルV1、V2を算出する処理までは変形例2は変形例1と同じである。ここでは、評価語は3つであるとする。したがって、コーパスC1に対応する3つの単語ベクトルV1を、V11、V12、V13と表し、コーパスC2に対応する3つの単語ベクトルV2をV21、V22、V23と表す。 Below, an example will be described in which two corpora C1 and C2 are used. Modification 2 is the same as modification 1 up to the process of calculating word vectors V1 and V2 for each evaluation word using corpora C1 and C2. Here, it is assumed that there are three evaluation words. Therefore, the three word vectors V1 corresponding to corpus C1 are represented as V11, V12, and V13, and the three word vectors V2 corresponding to corpus C2 are represented as V21, V22, and V23.

 独自性評価部401は、単語ベクトルV11~V13の要素積を算出することで1つの単語ベクトルV1´を算出するとともに、単語ベクトルV21~V23の要素積を算出することで1つの単語ベクトルV2´を算出する。 The uniqueness evaluation unit 401 calculates one word vector V1' by calculating the element product of the word vectors V11 to V13, and calculates one word vector V2' by calculating the element product of the word vectors V21 to V23.

 次に、独自性評価部401は、単語ベクトルV1´と基準点P1とのコサイン類似度を算出し、算出したコサイン類似度から、コーパスC1に対応する回答の評価値E1を算出するとともに、単語ベクトルV2´と基準点P2とのコサイン類似度を算出し、算出したコサイン類似度から、コーパスC2に対応する回答の評価値E2を算出する。 Next, the uniqueness evaluation unit 401 calculates the cosine similarity between the word vector V1' and the reference point P1, and calculates an evaluation value E1 of the answer corresponding to the corpus C1 from the calculated cosine similarity, and calculates the cosine similarity between the word vector V2' and the reference point P2, and calculates an evaluation value E2 of the answer corresponding to the corpus C2 from the calculated cosine similarity.

 次に、独自性評価部401は、評価値E1と評価値E2とを予め作成された共分散構造解析モデルに入力することで最終評価値を算出する。この共分散構造解析モデルは、人の評価に近い評価値を算出するコーパスの重み値を大きくし、人の評価と離れた評価値を算出するコーパスの重みが小さくなるように学習されたモデルである。共分散構造解析モデルを用いることで、人の評価から離れた評価を行うコーパスに対応する評価値の重みが小さくされる一方、人の評価に近い評価を行うコーパスに対応する評価値の重みが大きくされて、最終評価値が算出される。これにより、コーパスによる評価のブレを抑制しつつ、回答を高精度に評価できる。 Next, the uniqueness evaluation unit 401 calculates a final evaluation value by inputting evaluation value E1 and evaluation value E2 into a previously created covariance structure analysis model. This covariance structure analysis model is a model that is trained to increase the weighting value of corpora that calculate evaluation values close to human evaluations and decrease the weighting value of corpora that calculate evaluation values far from human evaluations. By using the covariance structure analysis model, the weighting value of evaluation values corresponding to corpora that produce evaluations far from human evaluations is decreased while the weighting value of evaluation values corresponding to corpora that produce evaluations close to human evaluations is increased to calculate the final evaluation value. This makes it possible to evaluate answers with high accuracy while suppressing evaluation deviations due to corpora.

 (変形例3)
 図1において、基準計算サーバ200、記憶ユニット300、独自性評価サーバ400、及び照明制御サーバ500は、別のコンピュータで構成されているが、1つのコンピュータで構成されてもよい。基準計算サーバ200、記憶ユニット300、独自性評価サーバ400、及び照明制御サーバ500が有する各ブロックは、端末100が備えていてもよい。
(Variation 3)
1, the reference calculation server 200, the storage unit 300, the uniqueness evaluation server 400, and the lighting control server 500 are configured as separate computers, but may be configured as one computer. Each block of the reference calculation server 200, the storage unit 300, the uniqueness evaluation server 400, and the lighting control server 500 may be provided in the terminal 100.

 (変形例4)
 上記実施の形態では、独自性評価部401は、データ変換部403が変換した対象ベクトルを取得していたが、これは一例であり、情報処理システム1の外部に設けられた装置からその装置が算出した対象ベクトルを取得してもよい。同様に、基準算出部201は、データ変換部203が変換した対象ベクトルを取得していたが、これは一例であり、情報処理システム1の外部に設けられた装置から、その装置が算出した対象ベクトルを取得してもよい。すなわち、本開示における第1ベクトル及び第2ベクトルの取得とは、第1ベクトル及び第2ベクトルを算出することに加えて、外部の装置から第1ベクトル及び第2ベクトルを取得する態様も含む。
(Variation 4)
In the above embodiment, the uniqueness evaluation unit 401 acquires the target vector converted by the data conversion unit 403, but this is just one example, and the target vector calculated by a device provided outside the information processing system 1 may be acquired from the device. Similarly, the reference calculation unit 201 acquires the target vector converted by the data conversion unit 203, but this is just one example, and the target vector calculated by a device provided outside the information processing system 1 may be acquired from the device. That is, the acquisition of the first vector and the second vector in the present disclosure includes a mode in which the first vector and the second vector are acquired from an external device in addition to calculating the first vector and the second vector.

 (変形例5)
 上記実施の形態では、類似度としてコサイン類似度が使用されたが、本開示はこれに限定されない。例えば、ピアソンの相関係数が類似度として用いられてもよい。
(Variation 5)
In the above embodiment, the cosine similarity is used as the similarity, but the present disclosure is not limited to this. For example, the Pearson correlation coefficient may be used as the similarity.

 本開示は、ユーザの作業効率を高める技術分野において有用である。 This disclosure is useful in the field of technology for improving user work efficiency.

1     :情報処理システム
100   :端末
101   :入力部
102   :通信部
103   :画面生成部
104   :画面制御部
105   :表示部
200   :基準計算サーバ
201   :基準算出部
202   :通信部
203   :データ変換部
204   :主題設定部
300   :記憶ユニット
301   :記憶部
302   :通信部
400   :独自性評価サーバ
401   :独自性評価部
402   :通信部
403   :データ変換部
404   :主題設定部
500   :照明制御サーバ
501   :照明制御部
502   :通信部
600   :照明装置
1: Information processing system 100: Terminal 101: Input unit 102: Communication unit 103: Screen generation unit 104: Screen control unit 105: Display unit 200: Reference calculation server 201: Reference calculation unit 202: Communication unit 203: Data conversion unit 204: Subject setting unit 300: Storage unit 301: Storage unit 302: Communication unit 400: Uniqueness evaluation server 401: Uniqueness evaluation unit 402: Communication unit 403: Data conversion unit 404: Subject setting unit 500: Lighting control server 501: Lighting control unit 502: Communication unit 600: Lighting device

Claims (12)

 コンピュータにおける情報処理方法であって、
 主題に対する回答を示す第1回答テキストに対応する第1ベクトルを取得し、
 前記主題から連想される回答を示す第2回答テキストに対応する第2ベクトルを取得し、
 前記第1ベクトル及び前記第2ベクトルの類似度の大きさに基づき、前記第1回答テキストが示す前記回答の独自性を評価する、
 情報処理方法。
An information processing method in a computer, comprising:
obtaining a first vector corresponding to a first answer text indicating an answer to the topic;
obtaining a second vector corresponding to a second answer text indicating an answer associated with the topic;
evaluating the uniqueness of the answer indicated by the first answer text based on a degree of similarity between the first vector and the second vector;
Information processing methods.
 前記第2回答テキストは、複数のテキストを含み、
 前記第2ベクトルは、前記複数のテキストの各々をベクトルに変換することによって生成される複数のベクトルから決定される重心位置に対応するベクトルである、
 請求項1に記載の情報処理方法。
the second answer text includes a plurality of texts;
the second vector is a vector corresponding to a center of gravity determined from a plurality of vectors generated by converting each of the plurality of texts into a vector;
The information processing method according to claim 1 .
 前記類似度は、前記第1ベクトルと前記第2ベクトルに基づいて導出されたコサイン類似度である、
 請求項1に記載の情報処理方法。
the similarity is a cosine similarity derived based on the first vector and the second vector;
The information processing method according to claim 1 .
 前記第1回答テキストは、ユーザの発話に基づく音声データをテキストデータに変換することにより取得される、
 請求項1に記載の情報処理方法。
The first answer text is obtained by converting voice data based on a user's speech into text data.
The information processing method according to claim 1 .
 前記回答の独自性を評価することは、前記回答の独自性の高さに応じたスコアを導出することを含む、
 請求項1に記載の情報処理方法。
Evaluating the uniqueness of the answer includes deriving a score corresponding to the uniqueness of the answer.
The information processing method according to claim 1 .
 さらに、前記スコアが低いことを示す第1条件を満たす場合に、ユーザの付近を照明する照明装置に対して、照明光の色温度を下げ、かつ、照度を上げる制御を行う、
 請求項5に記載の情報処理方法。
and when a first condition indicating that the score is low is satisfied, controlling an illumination device that illuminates the vicinity of the user to lower a color temperature of illumination light and to increase an illuminance.
The information processing method according to claim 5.
 前記スコアが閾値よりも小さい場合に、前記第1条件を満たすと判定する、
 請求項6に記載の情報処理方法。
If the score is smaller than a threshold, it is determined that the first condition is satisfied.
The information processing method according to claim 6.
 前記第1ベクトルの取得は、複数の第1回答テキストを取得することを含み、
 前記回答の独自性の評価は、ある期間において取得された前記複数の第1回答テキストのそれぞれに対応する複数の回答について複数のスコアを導出することを含み、
 前記第1条件の判定は、前記ある期間において、前記複数のスコアのうちスコアが閾値以下である回答が所定の数以上である場合に、前記第1条件を満たすと判定することを含む、
 請求項6に記載の情報処理方法。
obtaining the first vector includes obtaining a plurality of first answer texts;
evaluating the uniqueness of the answers includes deriving a plurality of scores for a plurality of answers corresponding to the plurality of first answer texts obtained during a certain period of time;
The determination of whether the first condition is satisfied includes determining that the first condition is satisfied when a predetermined number or more of the answers have a score equal to or less than a threshold among the plurality of scores during the certain period.
The information processing method according to claim 6.
 前記第1ベクトルは、前記第1回答テキストに含まれる複数の単語のそれぞれに対応する複数のベクトルの中で、最も前記第2ベクトルとの類似度が低いベクトルである、
 請求項1に記載の情報処理方法。
the first vector is a vector having the lowest similarity to the second vector among a plurality of vectors corresponding to a plurality of words included in the first answer text;
The information processing method according to claim 1 .
 さらに、前記回答に対する独自性の評価結果を表示する表示画面をディスプレイに表示する、
 請求項1記載の情報処理方法。
Furthermore, a display screen for displaying the evaluation result of the originality of the answer is displayed on a display.
2. The information processing method according to claim 1.
 前記第1ベクトルは、複数のコーパスのそれぞれについて前記第1回答テキストに含まれる複数の単語ごとに算出された複数の単語ベクトルを含み、
 前記第2ベクトルは、複数のコーパスに対応する複数の基準点を含み、
 前記独自性の評価は、前記複数の単語ベクトルと、前記複数の基準点とを前記複数のコーパスごとに比較することで、前記複数のコーパスに対応する独自性に関する複数の評価値を算出することと、
 前記複数の評価値から最終評価値を算出することと、を含む、
 請求項1記載の情報処理方法。
the first vector includes a plurality of word vectors calculated for each of a plurality of words included in the first answer text for each of a plurality of corpora;
the second vector includes a plurality of reference points corresponding to a plurality of corpora;
The evaluation of the uniqueness includes calculating a plurality of evaluation values related to uniqueness corresponding to the plurality of corpora by comparing the plurality of word vectors with the plurality of reference points for each of the plurality of corpora;
Calculating a final evaluation value from the plurality of evaluation values.
2. The information processing method according to claim 1.
 プロセッサを有する情報処理システムであって、
 前記プロセッサは、
 主題に対する回答を示す第1回答テキストに対応する第1ベクトルを取得し、
 前記主題から連想される回答を示す第2回答テキストに対応する第2ベクトルを取得し、
 前記第1ベクトル及び前記第2ベクトルの類似度の大きさに基づき、前記第1回答テキストが示す前記回答の独自性を評価する、処理を実行する、
 情報処理システム。
An information processing system having a processor,
The processor,
obtaining a first vector corresponding to a first answer text indicating an answer to the topic;
obtaining a second vector corresponding to a second answer text indicating an answer associated with the topic;
execute a process for evaluating the uniqueness of the answer indicated by the first answer text based on a degree of similarity between the first vector and the second vector;
Information processing system.
PCT/JP2024/016514 2023-05-23 2024-04-26 Information processing method and information processing system Pending WO2024241837A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2023-084805 2023-05-23
JP2023084805 2023-05-23

Publications (1)

Publication Number Publication Date
WO2024241837A1 true WO2024241837A1 (en) 2024-11-28

Family

ID=93590023

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2024/016514 Pending WO2024241837A1 (en) 2023-05-23 2024-04-26 Information processing method and information processing system

Country Status (1)

Country Link
WO (1) WO2024241837A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004021433A (en) * 2002-06-13 2004-01-22 Fujitsu Ltd Questionnaire implementation program, questionnaire implementation device and questionnaire implementation method
JP2020170460A (en) * 2019-04-05 2020-10-15 株式会社エナジード Information processing system, information processing method and program

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004021433A (en) * 2002-06-13 2004-01-22 Fujitsu Ltd Questionnaire implementation program, questionnaire implementation device and questionnaire implementation method
JP2020170460A (en) * 2019-04-05 2020-10-15 株式会社エナジード Information processing system, information processing method and program

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SUNAYAMA, WATARU: "Visual Support System for Report Distinctiveness Evaluation", TRANSACTIONS OF THE JAPANESE SOCIETY FOR ARTIFICIAL INTELLIGENCE., vol. 23, no. 6, 1 January 2008 (2008-01-01), pages 392 - 401, XP093241720 *

Similar Documents

Publication Publication Date Title
US11294953B2 (en) Similar face retrieval method, device and storage medium
CN111339246B (en) Query statement template generation method, device, equipment and medium
CN107818798B (en) Customer service quality evaluation method, device, equipment and storage medium
US12001465B2 (en) Response selecting apparatus, response selecting method, and response selecting program
WO2019184103A1 (en) Person ip-based human-computer interaction method and system, medium and device
CN110580516B (en) Interaction method and device based on intelligent robot
US20140351228A1 (en) Dialog system, redundant message removal method and redundant message removal program
JP2019053126A (en) Growth type interactive device
WO2018006727A1 (en) Method and apparatus for transferring from robot customer service to human customer service
CN107577689A (en) Decision tree generating means, decision tree generation method, non-transitory recording medium and enquirement system
CN109710935B (en) Museum navigation and knowledge recommendation method based on cultural relic knowledge graph
JP2022125222A (en) Information processing system, information processing method, and information processing apparatus
US20200357302A1 (en) Method for digital learning and non-transitory machine-readable data storage medium
CN119418725A (en) A multimodal classroom emotion recognition method and system based on modality adaptive learning
CN112860995B (en) Interaction method, device, client, server and storage medium
US10836044B2 (en) Robot control device and robot control method
US20120185417A1 (en) Apparatus and method for generating activity history
WO2020076179A1 (en) Method for determining tags for hotels and device for the implementation thereof
CN117670367A (en) Complaint work order processing method, complaint work order processing device, terminal equipment and storage medium
JP2022114906A (en) psychological state management device
KR101113938B1 (en) Community system for providing communication service with dead person and Community method thereof
KR20210071713A (en) Speech Skill Feedback System
WO2024241837A1 (en) Information processing method and information processing system
US20140095402A1 (en) System and Method of Scoring Candidate Audio Responses for a Hiring Decision
CN109344221B (en) Recording text generation method, device and equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24810845

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE