WO2018214859A1 - Method and device for analyzing man-machine dialogue data - Google Patents
Method and device for analyzing man-machine dialogue data Download PDFInfo
- Publication number
- WO2018214859A1 WO2018214859A1 PCT/CN2018/087768 CN2018087768W WO2018214859A1 WO 2018214859 A1 WO2018214859 A1 WO 2018214859A1 CN 2018087768 W CN2018087768 W CN 2018087768W WO 2018214859 A1 WO2018214859 A1 WO 2018214859A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- analysis
- query
- user
- query instruction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/338—Presentation of query results
Definitions
- the present disclosure relates to the field of data analysis technologies, and in particular, to a human-machine dialog data analysis method and apparatus.
- an object of the present disclosure is to provide a human-machine dialog data analysis method, which is applied to a data analysis system, and the method includes:
- Providing a data analysis interaction interface receiving a query instruction input by the user on the data analysis interaction interface, and displaying the query instruction in a dialog interaction manner, wherein the query instruction includes the query content;
- the visual analysis result is displayed in the form of a dialog interaction at the data analysis interaction interface.
- the step of determining data to be analyzed includes:
- the selected data is taken as the data to be analyzed.
- the receiving the query instruction input by the user in the data analysis interaction interface, and displaying the query instruction in the form of a dialog interaction including any one of the following:
- the voice information input by the user through the voice input device is received, the voice information is converted into a text command, and the text command is displayed in the form of a dialog interaction.
- the step of providing a data analysis interaction interface, receiving a query instruction input by the user on the data analysis interaction interface, and displaying the query instruction in the form of a dialog interaction includes:
- the step of displaying the visual analysis result in the form of a dialog interaction in the data analysis interaction interface includes:
- the query instruction includes a statement in a natural language form; and the step of performing statistical analysis on the data to be analyzed according to the query instruction to obtain corresponding statistical data, including:
- the step of determining data to be analyzed includes:
- the step of parsing the statement in the natural language form to obtain a database query analysis statement includes:
- the step of determining the target technical field of the data to be analyzed includes:
- Target technology area Extracting a file name of the data to be analyzed and a name of each data item in the data to be analyzed, performing statistics on the keyword in the file name and the data item name, and determining the data to be analyzed according to the statistical result.
- the step of determining a data presentation manner according to the query instruction, and generating a visual analysis result of the statistical data according to the data display manner includes:
- the visual analysis result of the statistical data is generated according to a default data display manner
- the visual analysis result of the statistical data is generated according to the specified data display manner.
- the method further includes:
- the default data display method receives and displaying a default data display mode configuration instruction input by the user in the data analysis interaction interface, where the default data display mode configuration instruction includes a default data display mode specified by the user, and configuring a visual analysis result according to the default data display mode configuration instruction.
- the data display manner includes a graphic display manner; the graphic display manner includes one or more of a graphic style, a color quantity, a color type, and a graphic size; and the graphic style includes: a line drawing, a histogram, Dashboard, pie, radar, heat, funnel, or scatter plots.
- the method further includes:
- the newly generated visual analysis result is displayed on the data analysis interactive interface.
- the method further includes:
- the statistical range corresponding to the current visual analysis result is re-determined or modified according to the statistical range update instruction, and a new visual analysis result is generated.
- the method further includes:
- the method further includes:
- the analytical interaction record is stored or transmitted in response to user operations.
- the method before the step of performing statistical analysis on the data to be analyzed according to the query instruction to obtain corresponding statistical data, the method further includes:
- the result of the visual analysis associated with the query hash value is not queried, performing a statistical analysis on the data to be analyzed according to the query instruction to obtain corresponding statistical data, and obtaining the corresponding statistical data according to the statistical data After visualizing the analysis result, the visual analysis result is stored, and the query hash value is associated with the obtained visual analysis result.
- the method further includes:
- Another object of the present invention is to provide a device for analyzing human-machine dialog data, which is applied to a data analysis system, and the device includes:
- a file determination module configured to determine data to be analyzed
- the instruction acquisition module is configured to provide a data analysis interaction interface, and receive and display a query instruction input by the user on the data analysis interaction interface, where the query instruction includes the query content;
- the statistical analysis module is configured to perform statistical analysis on the data to be analyzed according to the query instruction, to obtain corresponding statistical data;
- a result generating module configured to determine a data display manner according to the query instruction, and generate a visual analysis result of the statistical data according to the data display manner;
- the result display module is configured to display the visual analysis result on the data analysis interaction interface.
- the file determining module is specifically configured to provide a data selection interface; in response to the operation of the user in the data selection interface, the selected data is used as data to be analyzed.
- the instruction acquiring module is specifically configured to receive and display text information input by the user, and display the query instruction in a dialog interaction manner; or receive voice information input by the user through the voice input device, and the voice information is Convert to a text instruction and display the text instruction in the form of a dialog interaction.
- the instruction obtaining module is specifically configured to provide a data analysis interaction interface by using a third-party communication software configured on the user terminal, and receive a query instruction sent by the user terminal by using the third-party communication software;
- the result display module is specifically configured to send the generated visual analysis result to the user terminal, so that the user terminal displays the visual analysis result in the form of a dialog interaction through a data analysis interaction interface provided by the third-party communication software. .
- the present disclosure has at least the following beneficial effects:
- the human-machine dialog data analysis method and apparatus receive a query instruction of the user in the interface data by providing a friendly data analysis interaction interface, and perform data analysis according to the query instruction, in the form of human-machine dialogue.
- the analysis result is displayed on the analysis interaction interface.
- FIG. 1 is a schematic diagram of interaction between a data analysis device and a user terminal according to an embodiment of the present disclosure
- FIG. 2 is a schematic structural diagram of hardware of a data analysis device according to an embodiment of the present disclosure
- FIG. 3 is a schematic flowchart of steps of a human-machine dialog data analysis method according to an embodiment of the present disclosure
- FIG. 4 is a schematic diagram of a file selection interface according to an embodiment of the present disclosure.
- FIG. 5 is a schematic diagram of a data analysis interaction interface according to an embodiment of the present disclosure.
- FIG. 6 is a schematic diagram of a human-machine dialog data analyzing apparatus according to an embodiment of the present disclosure.
- Icon 100-data analysis device; 110-human-machine dialog data analysis device; 111-file determination module; 112-instruction acquisition module; 113-statistical analysis module; 114-result generation module; 115-result display module; Terminal; 300-network.
- the human-machine dialog data analysis method provided in this embodiment is applied to a data analysis device.
- the data analysis device 100 may be connected to the user terminal 200 through a network 300.
- the user terminal 200 receives a user input query analysis content and sends the data to the data analysis device 100 for data analysis. And receiving the analysis result fed back by the data analysis device 100.
- the data analysis device 100 can also independently receive the query analysis content input by the user, and analyze the result to feed back the analysis result to the user.
- FIG. 2 is a block diagram of the data analysis device 100 shown in FIG. 1.
- the data analysis device 100 includes a human-machine dialog data analysis device 110, a memory 120, and a processor 130.
- the components of the memory 120 and the processor 130 are electrically connected directly or indirectly to each other to implement data transmission or interaction.
- the components can be electrically connected to one another via one or more communication buses or signal lines.
- the human-machine dialog data analyzing device 110 includes at least one software function that can be stored in the memory 120 or solidified in an operating system (OS) of the data analyzing device 100 in the form of software or firmware. Module.
- the processor 130 is configured to execute an executable module stored in the memory 120, for example, a software function module, a computer program, and the like included in the human-machine dialog data analyzing device 110.
- the memory 120 may be, but not limited to, a random access memory (RAM), a read only memory (ROM), and a programmable read-only memory (PROM). Erasable Programmable Read-Only Memory (EPROM), Electric Erasable Programmable Read-Only Memory (EEPROM), and the like.
- RAM random access memory
- ROM read only memory
- PROM programmable read-only memory
- EPROM Erasable Programmable Read-Only Memory
- EEPROM Electric Erasable Programmable Read-Only Memory
- the memory 120 is configured to store a program, and the processor 130 executes the program after receiving an execution instruction.
- FIG. 2 is a flowchart of a human-machine dialog data analysis method applied to the data analysis device 100 shown in FIG. 1, and the steps including the steps will be described in detail below.
- step S110 the data to be analyzed is determined.
- the data analysis device 100 can provide a data selection interface.
- the data analysis device 100 can provide a web page, the data molecule device 100 further including a communication unit configured to perform data interaction with the user terminal 200, the data analysis device 100 through the communication The unit receives the access request sent by the user terminal 200, and sends the web page to the user terminal 200, so that the user terminal 200 loads the web page to display the data selection interface; or the data analysis device 100 can also A special APP installation package is provided, so that the user terminal 200 can obtain a special APP by downloading and installing the APP installation package, and display the data selection interface after launching the APP.
- the data analysis device 100 can include or be coupled to at least one display device through which the data analysis device 100 can display the data selection interface.
- the data analysis device 100 can display at least one data file on the data selection interface. For example, a plurality of data files uploaded by the user may be displayed; or a plurality of data files may be determined and displayed in the data analysis device 100 in response to user operations; or data interaction with other devices may be performed in response to user operations to obtain from other devices. At least one data file specified by the user.
- the data file may include, but is not limited to, an EXCEL data file, a relational database table, and the like.
- the data file may be arranged to display the plurality of data files by means of a chart and/or a text description, and at the same time, information such as the number of rows of data in each data file and the date of creation of the data file may be realized. In this way, the user can visually see the data file on the data selection interface.
- the processor 130 of the data analysis device 100 can then use the selected data as data to be analyzed in response to the user's operation at the data selection interface.
- the data analysis device 100 may use one or more data selected by the user as data to be analyzed.
- the data analysis device 100 may respond to a user's click on the data file on the file selection interface, and use the data file clicked by the user as the data to be analyzed.
- the data to be analyzed may also be determined for an instruction input by text or voice.
- Step S120 providing a data analysis interaction interface, receiving a query instruction input by the user on the data analysis interaction interface, and displaying the query instruction in the form of a dialog interaction, wherein the query instruction includes the query content.
- the data analysis interface displays a data analysis interaction interface.
- the data analysis device 100 may provide the data analysis interaction interface in the same or similar manner to provide the data selection interface. For details, refer to the description of the data selection interface in step S110, and details are not described herein again.
- the processor 130 of the data analysis device 100 can receive a query instruction input by the user at the data analysis interaction interface.
- the query instructions may include the need to query content, such as "showing revenue trends for each month of the most recent year.”
- the query instruction may also include a combination of the query content and the specified data presentation manner, such as "using a line chart to show the revenue trend of each month in the most recent year.”
- the manner of displaying the query instruction in the form of a dialog interaction including any one of the following:
- the voice information input by the user through the voice input device is received, the voice information is converted into a text command, and the text command is displayed in the form of a dialog interaction.
- a text input box may be displayed on the data analysis interaction interface, and the user may input text information in a text input box, and the data analysis device 100 displays the form in a dialogue interaction in the data analysis interaction interface. text information.
- a voice input button may also be displayed on the data analysis interface, the user may input voice information by clicking a voice input button, and the data analysis device 100 converts the voice information into text information and displays the voice information in the form of a dialog interaction. Data analysis on the interactive interface.
- Step S130 Perform statistical analysis on the data to be analyzed according to the query instruction, and obtain corresponding statistical data.
- the query instruction comprises a statement in the form of a natural language.
- the data analysis device 100 parses the received statement in the natural language form to obtain a structured database query analysis statement, for example, a SQL (Structured Query Language) query statement. Then, according to the database query analysis statement, statistical analysis is performed on the data to be analyzed, and corresponding statistical data is obtained.
- a structured database query analysis statement for example, a SQL (Structured Query Language) query statement.
- SQL Structured Query Language
- statistical analysis is performed on the data to be analyzed, and corresponding statistical data is obtained.
- users who do not understand the professional database query statement can also perform data analysis through the natural language to issue query instructions, which is convenient for the user to use.
- the processor 130 of the data analysis device 100 may obtain the database query statement by analyzing the statement in the natural language form by using a general semantic analysis algorithm, for example, using a semantic recognition system combined with a deep learning model. , will not repeat them here.
- a plurality of semantic analysis models for different technical fields may be pre-configured in the data analysis device 100.
- the data analysis device 100 may parse according to the technical domain of the data to be analyzed using different semantic analysis models. For example, for different fields such as meteorology, hydrology, finance, etc., a plurality of semantic analysis models are pre-configured in the data analysis device 100, and different semantic analysis models are configured to adopt different analysis algorithms or the thesaurus to the natural language of the corresponding technical field. Perform a quick analysis.
- step S110 after determining the data to be analyzed, the data analysis device 100 further needs to determine a target technical field to which the data to be analyzed belongs.
- the data analysis device 100 may display a plurality of selectable technical field options to the user through the mobile terminal, and respond to the user operation, and use the technical field selected by the user as the target technical field of the data to be analyzed.
- the data analysis device 100 displays a plurality of selection items of weather, hydrology, finance, etc. to the user through the mobile terminal after determining the data to be analyzed, and when the user selects the “financial” option, the data analysis device 100 The financial field is determined as the target technical field of the data to be analyzed.
- the data analysis device 100 may extract a file name of the data to be analyzed and a name of each data item in the data to be analyzed, and a keyword in the file name and the data item name. Statistics are performed, and the target technical field described in the data to be analyzed is determined according to the statistical result. For example, when the file name of the data to be analyzed selected by the user is “revenue and expenditure data statistics table”, or when the name of the data item in the data to be analyzed includes “revenue”, “expenditure”, “balance”, and the like, The data analysis device 100 may determine the target technology area described by the data to be analyzed as the financial field.
- step S130 the data analysis device 100 parses the natural language form query instruction according to the target technology domain determined in step S110 to obtain the database query analysis statement. .
- the data analysis device 100 uses the corresponding semantic analysis model for parsing, which can effectively improve the parsing speed of the query instruction in the natural language form, thereby reducing the waiting time of the user in the query process. Improve the user experience.
- the data analysis device 100 records the corresponding database query analysis statement according to the query instruction in the natural language form input by the user, and records the query instruction and the parsed database query analysis statement to a semantic comparison. table.
- the data analysis device 100 first searches whether the query instruction currently input by the user has been recorded in the semantic comparison table.
- the data analysis device 100 directly retrieves the query instruction directly from the semantic comparison table. Corresponding database query analysis statement.
- the data analysis device 100 adopts a preset semantic analysis model to query the natural language form. Analyze and get the corresponding structured database query analysis statement.
- the database analysis device 100 searches for the database query analysis statement corresponding to the query instruction in the semantic comparison table, and sequentially performs the query instruction currently input by the user and the query instruction in the semantic comparison table. Similarity analysis.
- the first preset threshold may be set to 95%.
- the query instruction in the semantic comparison table record is similar to the query instruction currently input by the user.
- the query instruction whose degree is greater than the second preset threshold is based on the database query analysis statement corresponding to the query instruction with the highest similarity among the query instructions having the similarity greater than the second threshold, and is adjusted according to the query instruction currently input by the user.
- the database query analysis statement uses the adjusted query analysis statement as a database query analysis statement corresponding to the query instruction currently input by the user.
- the second preset threshold may be set to 80%.
- the data analysis device uses the preset semantic analysis model to query the natural language form. Analyze and get the corresponding structured database query analysis statement.
- the query instruction currently input by the user is first compared with the previously interpreted query instruction, so as to directly use the structured database query analysis statement that has been parsed before or on the basis of the database query analysis statement obtained before parsing. Adjustment and modification can effectively avoid repeated analysis of the same "query command" and improve the operating efficiency of the system.
- the semantic comparison table further records a heat value used by each query instruction, and the database query analysis statement of the record in the semantic comparison table is selected in the query instruction similarity matching process. If yes, the heat value of the query instruction corresponding to the database query analysis statement is increased.
- the query instruction currently input by the user is matched and matched with the query instruction recorded in the semantic comparison table
- the query instruction of the record in the semantic comparison table and the current user are sequentially sequentially according to the order of the heat value.
- the input query instructions are compared. In this way, the query instruction with a higher frequency of use in the semantic comparison table can preferentially match the current input query instruction of the user, which can improve the matching speed.
- Step S140 Determine a data display manner according to the query instruction, and generate a visual analysis result of the statistical data according to the data display manner.
- the processor 130 of the data analysis device 100 determines whether the specified data display mode is included in the query instruction.
- the processor 130 When the specified data display mode is not included in the query instruction, the processor 130 generates a visual analysis result of the statistical data according to a default data display manner. For example, when the query instruction only includes “display sales in the last 10 months”, the default data display manner is used to display the data to be analyzed.
- the default data presentation mode is preset by the user, for example, a histogram, a line chart or a data table.
- the processor 130 When the query instruction includes the specified data display mode, the processor 130 generates a visual analysis result of the statistical data according to the specified data display manner. For example, when the query instruction includes “displaying the income trend of each month in the last year by using a line graph”, the data to be analyzed is generated as a line graph according to the specified data display manner.
- the data display manner includes a graphic display manner.
- the graphic display manner includes one or more of a graphic style, a color quantity, a color type, and a graphic size.
- Graphic styles include, but are not limited to, line charts, histograms, dashboard charts, pie charts, radar charts, heat maps, funnel charts, or scatter plots.
- Step S150 displaying the visual analysis result in the form of a dialog interaction in the data analysis interaction interface.
- the processor 130 of the data analysis device 100 displays the analysis result on the data analysis interaction page in a manner of replying to the dialogue.
- the input data or the query instruction of the user is received through the interactive interface of the human-machine dialogue, and the visualized query result is fed back to the user in a manner of replying to the dialogue, compared with the current In the technology, the user needs to use a special database query language to perform data query in a special database query interface.
- the solution provided in this embodiment provides a more friendly human-computer interaction mode, and does not require the user to have professional database query analysis knowledge. Convenient and intuitive
- the third-party communication software configured by the user terminal 200 provides the data selection interface or the data analysis interaction interface, and the information or instructions sent by the user are A server of the three-party communication software is sent to the data analysis device 100.
- the visual analysis result generated by the data analysis device 100 is sent to the user terminal 200 via a server of the third-party communication software, and the third-party communication software configured on the user terminal 200 provides data analysis interaction.
- the interface displays the analysis results of the visualization.
- the third party communication software may be instant messaging software such as WeChat.
- step S120 the data analysis device 100 provides a data analysis interaction interface through the third-party communication software configured on the user terminal, and receives the query instruction sent by the user terminal through the third-party communication software. Then, in step S150, the data analysis device 100 sends the generated visual analysis result to the user terminal, so that the user terminal displays the interaction in the form of a dialog interaction through the data analysis interaction interface provided by the third-party communication software.
- the third party communication software and its server should be considered to assist the data analysis device 100 in providing the data selection interface or data analysis interaction interface, and thus the above solution should also be considered as protection in the present disclosure.
- the method may further include: the data analysis device 100 receives and displays a display mode update instruction input by the user on the data analysis interaction interface, where the display mode update instruction includes a data display manner. And then, according to the display mode update instruction, the data display manner is regenerated, and the visual analysis result of the statistical data is regenerated.
- the user continues to input "convert to histogram", and the data analysis device 100 receives and displays the display mode update. instruction. Then, the data analysis device 100 generates the data to be analyzed into a histogram according to the re-determined data display manner based on the data to be analyzed that is queried in step S130. The newly generated visual analysis result is then displayed on the data analysis interactive interface.
- the method may further include the data analysis device 100 receiving and displaying a statistical range update instruction input by the user on the data analysis interaction interface, and then re-determining according to the statistical range update instruction or Modify the statistical range corresponding to the current visual analysis result and generate a new visual analysis result.
- the user continues to input "remove the first quarter data", and the data analysis device 100 receives and displays the statistical range. Update instructions. Then, the data analysis device 100 generates a new visual analysis result according to the re-determined statistical range on the basis of the data to be analyzed that is queried in step S130. The newly generated visual analysis result is then displayed on the data analysis interactive interface.
- the method may further include: the data analysis device 100 receiving, by the user, the selection of the visual analysis result on the data analysis interaction interface, and performing the selected visual analysis result on the selected Store or send.
- the data analysis device 100 receives a user click on the displayed line graph on the data analysis interaction interface to display at least one operational option (eg, save, send, etc.).
- the generated analysis result is stored or transmitted in response to the user selecting the operation option.
- the method may further include: the data analysis device 100 records a query instruction input by the user on the data analysis interaction interface, and the data analysis system generates a visual analysis in response to the query instruction. As a result, an analysis interaction record is generated.
- the analytical interaction record is stored or transmitted in response to user operations.
- the method may further include: the data analysis device 100 receiving and displaying a default data presentation mode configuration instruction input by the user on the data analysis interaction interface, where the default data display mode configuration instruction is The user-specified default data display mode is configured, and the default data display mode of the visual analysis result is configured according to the default data display mode configuration instruction.
- the user may input “default line graph display data” in the data analysis interaction interface, and the data analysis device 100 receives and displays the default data display mode configuration instruction, and visually analyzes the default data according to the instruction.
- the presentation mode is configured to be displayed by a line chart.
- the database analysis device 100 may display an error correction feedback option on the analysis interaction interface after displaying the visual analysis result.
- the error correction instruction may be sent to the data analysis device 100 by selecting the error correction feedback option.
- the database analysis device 100 re-analyzes the user input query instruction or re-displays another query instruction whose similarity is above the first preset threshold from the semantic comparison table. For the user to select, and after the user re-analyzes the semantics or the user reselects the query instruction, a new visual analysis result is generated and displayed to the user.
- the method provided by the embodiment provides a friendly human-machine dialogue interaction interface.
- the interaction process is intuitive and concise, so that users without professional knowledge can easily use the data. Analysis function.
- the data analysis device 100 may repeatedly perform the same query analysis action multiple times in a short time. Therefore, in this embodiment, before performing step S130, the file identifier of the data to be analyzed and the keyword in the query instruction are hashed to obtain a query hash value. Then find out if the visual analysis results associated with the query hash value have been generated.
- the file identifier of the data to be analyzed may be metadata of the file to be analyzed or other identifier information that identifies the unique identity of the data to be analyzed.
- step S150 If the result of the visual analysis associated with the query hash value is queried, the same query statistics operation has been performed for the same data to be analyzed, and the data analysis device 100 directly jumps without performing the query statistics operation again. Go to step S150, and display the queryed visual analysis result in the form of dialog interaction in the data analysis interaction interface.
- the query statistical operation required by the query instruction has not been executed for the same data to be analyzed, and the data analysis device 100 performs step S130 according to the The query instruction performs statistical analysis on the data to be analyzed to obtain corresponding statistical data.
- the query hash value is associated with the obtained visual analysis result. In this way, the next time the same statistical operation is performed, the generated visual analysis result can be directly queried according to the query hash value.
- the user sends a query instruction for "representing the income trend of each month of the most recent year in a line graph" for the data file A 20 minutes ago, then the data analysis device pairs the metadata according to the data file A, the keyword " The line chart, the keyword “every month of the most recent year” and the keyword “income trend” are hashed, get the query hash value X, and then find out whether it has been associated with the query hash value X. Visualize the results of the analysis.
- the analysis device 100 performs analysis according to the query instruction input by the user, generates a visual analysis result B and presents it to the user, and associates the visual analysis result B with the query hash value.
- the data analysis device pairs the metadata according to the data file A, the keyword " The line chart, the keyword “every month of the most recent year” and the keyword “income trend” are hashed, get the query hash value X, and then find out whether it has been associated with the query hash value X. Visualize the results of the analysis.
- the data analysis device 100 can query the visual analysis result B according to the query hash value X, and the data analysis device 100 directly displays the visual analysis result. B feeds back the user instead of performing the statistical analysis based on the query command again.
- the data analysis device 100 may clear the storage time longer than the second preset duration every interval of the first preset duration.
- the visual analysis result is described, wherein the first preset duration is greater than the second preset duration.
- the data analysis device 100 is aware of the visual analysis results stored on the data analysis device 100 for more than 20 minutes every 5 hours.
- the embodiment further provides a human-machine dialog data analyzing apparatus 110 applied to the data analyzing apparatus 100 shown in FIG. 1.
- the apparatus includes a file determining module 111, an instruction acquiring module 112, and a statistical analyzing module 113.
- the file determining module 111 is configured to determine data to be analyzed.
- the file determining module 111 may be configured to perform step S110 shown in FIG. 2, and the description of the step S110 may be referred to with a specific description of the file determining module 111.
- the instruction obtaining module 112 is configured to provide a data analysis interaction interface, and receive and display a query instruction input by the user on the data analysis interaction interface, where the query instruction includes the query content.
- the instruction obtaining module 112 may be configured to perform the step S120 shown in FIG. 2, and the description of the step S120 may be referred to in the specific description of the instruction acquiring module 112.
- the statistical analysis module 113 is configured to perform statistical analysis on the data to be analyzed according to the query instruction, to obtain corresponding statistical data.
- the statistical analysis module 113 may be configured to perform the step S130 shown in FIG. 2, and the specific description of the statistical analysis module 113 may refer to the description of the step S130.
- the result generating module 114 is configured to determine a data display manner according to the query instruction, and generate a visual analysis result of the statistical data according to the data display manner.
- the result generating module 114 may be configured to perform step S140 shown in FIG. 2, and a detailed description of the result generating module 114 may refer to the description of the step S140.
- the result display module 115 is configured to display the visual analysis result on the data analysis interaction interface.
- the result display module 115 can be configured to perform the sub-step S150 shown in FIG. 2, and a detailed description of the result display module 115 can refer to the description of the sub-step S150.
- the file determining module 111 is specifically configured to provide a data selection interface; in response to the user's operation in the data selection interface, the selected data is used as data to be analyzed.
- the instruction obtaining module 112 is specifically configured to receive and display text information input by the user, and display the query instruction in a dialog interaction manner; or receive the voice input by the user through the voice input device. Information, converting the voice information into a text instruction and displaying the text instruction in the form of a dialog interaction.
- the instruction obtaining module 112 is specifically configured to provide a data analysis interaction interface by using a third-party communication software configured on the user terminal, and receive the query sent by the user terminal by using the third-party communication software. instruction;
- the result display module is specifically configured to send the generated visual analysis result to the user terminal, so that the user terminal displays the visual analysis result in the form of a dialog interaction through a data analysis interaction interface provided by the third-party communication software. .
- the human-machine dialog data analysis method and apparatus receive a query instruction of the user in the interface data by providing a friendly data analysis interaction interface, and perform data analysis according to the query instruction, and then use the human-machine
- the form of the dialog displays the analysis results on the analysis interaction interface.
- the human-machine dialog data analysis method and apparatus receive a query instruction of the user in the interface data by providing a friendly data analysis interaction interface, and perform data analysis according to the query instruction, in the form of human-machine dialogue.
- the analysis result is displayed on the analysis interaction interface.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
相关申请的交叉引用Cross-reference to related applications
本公开要求于2017年05月22日提交中国专利局的申请号为2017103639701、名称为“人机对话数据分析方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。The present disclosure claims priority to Chinese Patent Application No. 2017103639701, entitled "Human-Machine Dialogue Data Analysis Method and Apparatus", filed on May 22, 2017, the entire contents of which are incorporated herein by reference. .
本公开涉及数据分析技术领域,具体而言,涉及一种人机对话数据分析方法及装置。The present disclosure relates to the field of data analysis technologies, and in particular, to a human-machine dialog data analysis method and apparatus.
随着数字信息技术的发展,对数据的处理分析变得越来越重要,数据分析在各行各业中都发挥着巨大的作用。同时,数据分析的方式也日趋智能化,数据处理设备可以根据用户输入的分析命令对指定的数据进行分析并反馈分析结果。但是,现有技术的数据分析方法长期存在一个问题,即,需要专业人士进行数据分析,而专业人士的招聘和培养都耗费了企业大量的时间成本及劳动力成本,其中需要在专门数据库软件使用专门的数据库查询语言进行数据查询分析,待分析目标的确认、分析内容的输入等操作较为繁琐,交互方式不人性化,可操作性不强。随着没有专业数据分析知识或者不了解数据库操作的普通用户对数据分析的需求也越来越强,怎样提供一种简单有效的数据分析交互方式长期困扰着本领域技术人员,这阻碍了数据分析方法的广泛应用。With the development of digital information technology, the analysis of data processing has become more and more important, and data analysis plays a huge role in all walks of life. At the same time, the way of data analysis is becoming more and more intelligent, and the data processing device can analyze the specified data according to the analysis command input by the user and feed back the analysis result. However, there is a long-standing problem with the prior art data analysis methods, that is, professionals need to perform data analysis, and the recruitment and training of professionals consumes a large amount of time and labor costs of the enterprise, which requires special use in specialized database software. The database query language performs data query analysis, and the operations of confirming the analysis target and inputting the analysis content are cumbersome, the interaction mode is not human, and the operability is not strong. As ordinary users who do not have professional data analysis knowledge or do not understand database operations are increasingly demanding data analysis, how to provide a simple and effective data analysis interaction method has long plagued the technical personnel in the field, which hinders data analysis. Wide application of methods.
发明内容Summary of the invention
为了克服现有技术中的至少一个不足,本公开的目的在于提供一种人机对话数据分析方法,应用于数据分析系统,所述方法包括:In order to overcome at least one of the deficiencies in the prior art, an object of the present disclosure is to provide a human-machine dialog data analysis method, which is applied to a data analysis system, and the method includes:
确定待分析数据;Determining the data to be analyzed;
提供一数据分析交互界面,接收用户在该数据分析交互界面输入的查询指令,并以对话交互的形式显示所述查询指令,其中,所述查询指令包括查询内容;Providing a data analysis interaction interface, receiving a query instruction input by the user on the data analysis interaction interface, and displaying the query instruction in a dialog interaction manner, wherein the query instruction includes the query content;
根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据;Performing statistical analysis on the data to be analyzed according to the query instruction, and obtaining corresponding statistical data;
根据所述查询指令确定数据展示方式,按照所述数据展示方式生成所述统计数据的可视化分析结果;Determining, according to the query instruction, a data display manner, and generating a visual analysis result of the statistical data according to the data display manner;
在所述数据分析交互界面以对话交互的形式显示所述可视化分析结果。The visual analysis result is displayed in the form of a dialog interaction at the data analysis interaction interface.
可选地,所述确定待分析数据的步骤,包括:Optionally, the step of determining data to be analyzed includes:
提供一数据选择界面;Providing a data selection interface;
响应用户在数据选择界面的操作,将选中的数据作为待分析数据。In response to the user's operation on the data selection interface, the selected data is taken as the data to be analyzed.
可选地,所述接收用户在该数据分析交互界面输入的查询指令,并以对话交互的形式显示所述查询指令的方式,包括以下任意之一:Optionally, the receiving the query instruction input by the user in the data analysis interaction interface, and displaying the query instruction in the form of a dialog interaction, including any one of the following:
接收并显示用户输入的文字信息,并以对话交互的形式显示所述查询指令;或Receiving and displaying text information input by the user, and displaying the query instruction in the form of a dialog interaction; or
接收用户通过语音输入设备输入的语音信息,将所述语音信息转换为文字指令并以对话交互的形式显示所述文字指令。The voice information input by the user through the voice input device is received, the voice information is converted into a text command, and the text command is displayed in the form of a dialog interaction.
可选地,所述提供一数据分析交互界面,接收用户在该数据分析交互界面输入的查询指令,并以对话交互的形式显示所述查询指令的步骤,包括:Optionally, the step of providing a data analysis interaction interface, receiving a query instruction input by the user on the data analysis interaction interface, and displaying the query instruction in the form of a dialog interaction, includes:
通过用户终端上配置的第三方通信软件提供一数据分析交互界面,接收所述用户终端通过该第三方通信软件发送的查询指令;Providing a data analysis interaction interface by using a third-party communication software configured on the user terminal, and receiving a query instruction sent by the user terminal through the third-party communication software;
所述在所述数据分析交互界面以对话交互的形式显示可视化分析结果的步骤,包括:The step of displaying the visual analysis result in the form of a dialog interaction in the data analysis interaction interface includes:
将生成的可视化分析结果发送给所述用户终端,使所述用户终端通过所述第三方通信软件提供的数据分析交互界面以对话交互的形式显示所述可视化分析结果。Sending the generated visual analysis result to the user terminal, so that the user terminal displays the visual analysis result in the form of a dialog interaction through a data analysis interaction interface provided by the third-party communication software.
可选地,所述查询指令包括自然语言形式的语句;所述根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据的步骤,包括:Optionally, the query instruction includes a statement in a natural language form; and the step of performing statistical analysis on the data to be analyzed according to the query instruction to obtain corresponding statistical data, including:
对所述自然语言形式的语句进行解析,得到数据库查询分析语句;Parsing the statement in the natural language form to obtain a database query analysis statement;
根据所述数据库查询分析语句在所述待分析数据中进行统计分析,得到相应的统计数据。Performing statistical analysis on the data to be analyzed according to the database query analysis statement to obtain corresponding statistical data.
可选地,所述确定待分析数据的步骤包括:Optionally, the step of determining data to be analyzed includes:
确定所述待分析数据,并确定所述待分析数据所属的目标技术领域;Determining the data to be analyzed, and determining a target technical field to which the data to be analyzed belongs;
所述对所述自然语言形式的语句进行解析,得到数据库查询分析语句的步骤包括:The step of parsing the statement in the natural language form to obtain a database query analysis statement includes:
根据所述目标技术领域确定相应语言解析模型对所述自然语言形式的查询指令进行解析,得到所述数据库查询分析语句。Determining, by the corresponding language analysis model, the query instruction of the natural language form according to the target technical field to obtain the database query analysis statement.
可选地,所述确定所述待分析数据所述的目标技术领域的步骤,包括:Optionally, the step of determining the target technical field of the data to be analyzed includes:
响应用户操作,将用户选取的技术领域作为所述目标技术领域;或Responding to user operations, using the technical field selected by the user as the target technical field; or
提取所述待分析数据的文件名称及所述待分析数据中各数据项的名称,对所述文件名称及数据项名称中的关键字进行统计,根据统计结果确定所述待分析数据所述的目标技术领域。Extracting a file name of the data to be analyzed and a name of each data item in the data to be analyzed, performing statistics on the keyword in the file name and the data item name, and determining the data to be analyzed according to the statistical result. Target technology area.
可选地,所述根据所述查询指令确定数据展示方式,按照所述数据展示方式生成所述统计数据的可视化分析结果的步骤,包括:Optionally, the step of determining a data presentation manner according to the query instruction, and generating a visual analysis result of the statistical data according to the data display manner, includes:
判断所述查询指令中是否包括指定的数据显示方式;Determining whether the specified data display mode is included in the query instruction;
当所述查询指令中不包括指定的数据显示方式时,按照默认的数据展示方式生成所述统计数据的可视化分析结果;When the specified data display mode is not included in the query instruction, the visual analysis result of the statistical data is generated according to a default data display manner;
当所述查询指令中包括指定的数据显示方式时,按照所述指定的数据展示方式生成所述统计数据的可视化分析结果。When the specified data display mode is included in the query instruction, the visual analysis result of the statistical data is generated according to the specified data display manner.
可选地,所述方法还包括:Optionally, the method further includes:
接收并显示用户在该数据分析交互界面输入的默认数据展示方式配置指令,所述默认数据展示方式配置指令中包括用户指定的默认数据展示方式,根据所述默认数据展示方式配置指令配置可视化分析结果默认的数据展示方式。Receiving and displaying a default data display mode configuration instruction input by the user in the data analysis interaction interface, where the default data display mode configuration instruction includes a default data display mode specified by the user, and configuring a visual analysis result according to the default data display mode configuration instruction. The default data display method.
可选地,所述数据显示方式包括图形显示方式;所述图形显示方式包括图形样式、颜色数量、颜色种类、图形尺寸中的一种或多种;图形样式,包括:折线图、柱状图、仪表盘图、饼图、雷达图、热力图、漏斗图或散点图。Optionally, the data display manner includes a graphic display manner; the graphic display manner includes one or more of a graphic style, a color quantity, a color type, and a graphic size; and the graphic style includes: a line drawing, a histogram, Dashboard, pie, radar, heat, funnel, or scatter plots.
可选地,所述方法还包括:Optionally, the method further includes:
接收并显示用户在该数据分析交互界面输入的显示方式更新指令,所述显示方式更新指令中包括数据显示方式;Receiving and displaying a display mode update instruction input by the user on the data analysis interaction interface, where the display mode update instruction includes a data display mode;
按照所述显示方式更新指令中的数据显示方式,重新生成所述统计数据的可视化分析结果;And re-generating the visual analysis result of the statistical data according to the data display manner in the display mode update instruction;
在所述数据分析交互界面对新生成的可视化分析结果进行显示。The newly generated visual analysis result is displayed on the data analysis interactive interface.
可选地,所述方法还包括:Optionally, the method further includes:
接收并显示用户在该数据分析交互界面输入的统计范围更新指令;Receiving and displaying a statistical range update instruction input by the user on the data analysis interaction interface;
按照所述统计范围更新指令重新确定或修改当前可视化分析结果对应的统计范围,并生成新的可视化分析结果。The statistical range corresponding to the current visual analysis result is re-determined or modified according to the statistical range update instruction, and a new visual analysis result is generated.
可选地,所述方法还包括:Optionally, the method further includes:
接收用户在所述数据分析交互界面上对所述可视化分析结果的选取,对选中的所述可视化分析结果进行存储或发送。Receiving, by the user, the selection of the visual analysis result on the data analysis interaction interface, and storing or transmitting the selected visual analysis result.
可选地,所述方法还包括:Optionally, the method further includes:
记录用户在所述数据分析交互界面上输入的查询指令及所述数据分析系统响应所述查询指令生成可视化分析结果,生成分析交互记录;Recording a query instruction input by the user on the data analysis interaction interface, and the data analysis system generates a visual analysis result in response to the query instruction, and generates an analysis interaction record;
响应用户操作,对所述分析交互记录进行存储或发送。The analytical interaction record is stored or transmitted in response to user operations.
可选地,所述根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据的步骤之前,所述方法还包括:Optionally, before the step of performing statistical analysis on the data to be analyzed according to the query instruction to obtain corresponding statistical data, the method further includes:
对所述待分析数据的文件标识及所述查询指令中的关键字进行哈希计算,得到查询哈希值;Performing a hash calculation on the file identifier of the data to be analyzed and the keyword in the query instruction to obtain a query hash value;
查找是否已生成过与该查询哈希值关联的可视化分析结果;Find if the visual analysis result associated with the query hash value has been generated;
若查询到与该查询哈希值关联的可视化分析结果,则执行在所述数据分析交互界面以对话交互的形式显示所述可视化分析结果的步骤;If the visual analysis result associated with the query hash value is queried, performing the step of displaying the visual analysis result in the form of a dialog interaction in the data analysis interaction interface;
若未查询到与该查询哈希值关联的可视化分析结果,则执行根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据的步骤,并在根据所述统计数据得到可视化分析结果后,存储所述可视化分析结果,将所述查询哈希值与得到的可视化分析结果关联。If the result of the visual analysis associated with the query hash value is not queried, performing a statistical analysis on the data to be analyzed according to the query instruction to obtain corresponding statistical data, and obtaining the corresponding statistical data according to the statistical data After visualizing the analysis result, the visual analysis result is stored, and the query hash value is associated with the obtained visual analysis result.
可选地,所述方法还包括:Optionally, the method further includes:
每间隔第一预设时长清除存储时间超过第二预设时长的所述可视化分析结果,其中,所述第一预设时长大于所述第二预设时长。And displaying, by the first preset duration, the visual analysis result that the storage time exceeds the second preset duration, wherein the first preset duration is greater than the second preset duration.
本实施例的另一目的在于提供一种人机对话数据分析方法装置,应用于数据分析系统,所述装置包括:Another object of the present invention is to provide a device for analyzing human-machine dialog data, which is applied to a data analysis system, and the device includes:
文件确定模块,配置成确定待分析数据;a file determination module configured to determine data to be analyzed;
指令获取模块,配置成提供一数据分析交互界面,接收并显示用户在该数据分析交互界面输入的查询指令,所述查询指令包括查询内容;The instruction acquisition module is configured to provide a data analysis interaction interface, and receive and display a query instruction input by the user on the data analysis interaction interface, where the query instruction includes the query content;
统计分析模块,配置成根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据;The statistical analysis module is configured to perform statistical analysis on the data to be analyzed according to the query instruction, to obtain corresponding statistical data;
结果生成模块,配置成根据所述查询指令确定数据展示方式,按照所述数据展示方式生成所述统计数据的可视化分析结果;a result generating module, configured to determine a data display manner according to the query instruction, and generate a visual analysis result of the statistical data according to the data display manner;
结果显示模块,配置成在所述数据分析交互界面对所述可视化分析结果进行显示。The result display module is configured to display the visual analysis result on the data analysis interaction interface.
可选地,所述文件确定模块具体用于提供一数据选择界面;响应用户在数据选择界面的操作,将选中的数据作为待分析数据。Optionally, the file determining module is specifically configured to provide a data selection interface; in response to the operation of the user in the data selection interface, the selected data is used as data to be analyzed.
可选地,所述指令获取模块具体用于接收并显示用户输入的文字信息,并以对话交互的形式显示所述查询指令;或接收用户通过语音输入设备输入的语音信息,将所述语音信息转换为文字指令并以对话交互的形式显示所述文字指令。Optionally, the instruction acquiring module is specifically configured to receive and display text information input by the user, and display the query instruction in a dialog interaction manner; or receive voice information input by the user through the voice input device, and the voice information is Convert to a text instruction and display the text instruction in the form of a dialog interaction.
可选地,所述指令获取模块具体用于通过用户终端上配置的第三方通信软件提供一数据分析交互界面,接收所述用户终端通过该第三方通信软件发送的查询指令;Optionally, the instruction obtaining module is specifically configured to provide a data analysis interaction interface by using a third-party communication software configured on the user terminal, and receive a query instruction sent by the user terminal by using the third-party communication software;
所述结果显示模块具体用于将生成的可视化分析结果发送给所述用户终端,使所述用户终端通过所述第三方通信软件提供的数据分析交互界面以对话交互的形式显示所述可视化分析结果。The result display module is specifically configured to send the generated visual analysis result to the user terminal, so that the user terminal displays the visual analysis result in the form of a dialog interaction through a data analysis interaction interface provided by the third-party communication software. .
本公开的另一目的在于提供一种可读存储介质,所述可读存储介质存储有机器可执行指令,所述机器可执行指令再被处理器执行时,促使所述处理器实现本公开提供的人机对话数据分析方法。It is another object of the present disclosure to provide a readable storage medium storing machine executable instructions that, when executed by a processor, cause the processor to implement the present disclosure Human-machine dialogue data analysis method.
相对于现有技术而言,本公开至少具有以下有益效果:Compared with the prior art, the present disclosure has at least the following beneficial effects:
本公开实施例提供的人机对话数据分析方法及装置,通过提供友好的数据分析交互界面,接收用户在该界面数据的查询指令,根据所述查询指令进行数据分析后,以人机对话的形式在所述分析交互界面显示分析结果。如此,通过人机对话的方式进行交互,使得进行数据分析时的操作更加简洁直观,交互方式更人性化,降低了数据分析功能的使用门槛。The human-machine dialog data analysis method and apparatus provided by the embodiments of the present disclosure receive a query instruction of the user in the interface data by providing a friendly data analysis interaction interface, and perform data analysis according to the query instruction, in the form of human-machine dialogue. The analysis result is displayed on the analysis interaction interface. In this way, interaction through human-machine dialogue makes the operation of data analysis more concise and intuitive, and the interaction mode is more user-friendly, which reduces the threshold for using data analysis functions.
为了更清楚地说明本公开实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本公开的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present disclosure, the drawings to be used in the embodiments will be briefly described below. It should be understood that the following drawings show only certain embodiments of the present disclosure, and thus It should be seen as a limitation on the scope, and those skilled in the art can obtain other related drawings according to these drawings without any creative work.
图1为本公开实施例提供的数据分析设备与用户终端的交互示意图;FIG. 1 is a schematic diagram of interaction between a data analysis device and a user terminal according to an embodiment of the present disclosure;
图2为本公开实施例提供的数据分析设备的硬件结构示意图;2 is a schematic structural diagram of hardware of a data analysis device according to an embodiment of the present disclosure;
图3为本公开实施例提供的人机对话数据分析方法的步骤流程示意图;FIG. 3 is a schematic flowchart of steps of a human-machine dialog data analysis method according to an embodiment of the present disclosure;
图4为本公开实施例提供的文件选择界面的示意图;FIG. 4 is a schematic diagram of a file selection interface according to an embodiment of the present disclosure;
图5为本公开实施例提供的数据分析交互界面的示意图之;FIG. 5 is a schematic diagram of a data analysis interaction interface according to an embodiment of the present disclosure;
图6为本公开实施例提供的人机对话数据分析装置的示意图。FIG. 6 is a schematic diagram of a human-machine dialog data analyzing apparatus according to an embodiment of the present disclosure.
图标:100-数据分析设备;110-人机对话数据分析装置;111-文件确定模块;112-指令获取模块;113-统计分析模块;114-结果生成模块;115-结果显示模块;200-用户终端;300-网络。Icon: 100-data analysis device; 110-human-machine dialog data analysis device; 111-file determination module; 112-instruction acquisition module; 113-statistical analysis module; 114-result generation module; 115-result display module; Terminal; 300-network.
为使本公开实施例的目的、技术方案和优点更加清楚,下面将结合本公开实施例中的附图,对本公开实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本公开一部分实施例,而不是全部的实施例。通常在此处附图中描述和示出的本公开实施例的组件可以以各种不同的配置来布置和设计。The technical solutions in the embodiments of the present disclosure will be clearly and completely described in conjunction with the drawings in the embodiments of the present disclosure. It is a partial embodiment of the present disclosure, and not all of the embodiments. The components of the disclosed embodiments, which are generally described and illustrated in the figures herein, can be arranged and designed in various different configurations.
因此,以下对在附图中提供的本公开的实施例的详细描述并非旨在限制要求保护的本公开的范围,而是仅仅表示本公开的选定实施例。基于本公开中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本公开保护的范围。The detailed description of the embodiments of the present disclosure, which is set forth in the claims All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present disclosure without departing from the inventive scope are the scope of the disclosure.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步定义和解释。It should be noted that similar reference numerals and letters indicate similar items in the following figures, and therefore, once an item is defined in a drawing, it is not necessary to further define and explain it in the subsequent drawings.
在本公开的描述中,还需要说明的是,除非另有明确的规定和限定,术语“设置”、“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本公开中的具体含义。In the description of the present disclosure, it should be further noted that the terms "set", "install", "connected", and "connected" are to be understood broadly, and may be fixed connections, for example, unless otherwise specifically defined and defined. It can also be a detachable connection, or an integral connection; it can be a mechanical connection or an electrical connection; it can be directly connected or indirectly connected through an intermediate medium, and can be internal communication between the two elements. The specific meanings of the above terms in the present disclosure can be understood in the specific circumstances by those skilled in the art.
本实施例提供的人机对话数据分析方法应用于数据分析设备。请参照图1,在一个例子中,所述数据分析设备100可以通过网络300与用户终端200连接,所述用户终端200接收用户输入查询分析内容并发送给所述数据分析设备100进行数据分析,并接收所述数据分析设备100反馈的分析结果。在另一个例子中,所述数据分析设备100也可以独立接收用户输入的查询分析内容,进行分析后将分析结果反馈给用户。The human-machine dialog data analysis method provided in this embodiment is applied to a data analysis device. Referring to FIG. 1 , in an example, the
请参照图2,图2为图1所示的数据分析设备100的方框示意图。所述数据分析设备100包括人机对话数据分析装置110、存储器120、处理器130。Please refer to FIG. 2. FIG. 2 is a block diagram of the
所述存储器120、处理器130各元件相互之间直接或间接地电性连接,以实现数据的传输或交互。例如,这些元件相互之间可通过一条或多条通讯总线或信号线实现电性连接。所述人机对话数据分析装置110包括至少一个可以软件或固件(firmware)的形式存储于所述存储器120中或固化在所述数据分析设备100的操作系统(operating system,OS)中的软件功能模块。所述处理器130用于执行所述存储器120中存储的可执行模块,例如所述人机对话数据分析装置110所包括的软件功能模块及计算机程序等。The components of the
其中,所述存储器120可以是,但不限于,随机存取存储器(Random Access Memory,RAM),只读存储器(Read Only Memory,ROM),可编程只读存储器(Programmable Read-Only Memory,PROM),可擦除只读存储器(Erasable Programmable Read-Only Memory,EPROM),电可擦除只读存储器(Electric Erasable Programmable Read-Only Memory,EEPROM)等。其中,存储器120用于存储程序,所述处理器130在接收到执行指令后,执行所述程序。The
请参照图2,图2为应用于图1所示的数据分析设备100的一种人机对话数据分析方法的流程图,以下将对所述方法包括各个步骤进行详细阐述。Please refer to FIG. 2. FIG. 2 is a flowchart of a human-machine dialog data analysis method applied to the
步骤S110,确定待分析数据。In step S110, the data to be analyzed is determined.
请参照图3,在本实施例中,所述数据分析设备100可以提供一数据选择界面。Referring to FIG. 3, in the embodiment, the
在一个例子中,所述数据分析设备100可以提供一个Web页面,所述数据分子设备100还包括配置成与所述用户终端200进行数据交互的通信单元,所述数据分析设备100通过 所述通信单元接收所述用户终端200发送的访问请求,并将所述Web页面发送给用户终端200,使用户终端200加载所述Web页面从而显示所述数据选择界面;或者所述数据分析设备100还可以提供专门的APP安装包,使所述用户终端200可以通过下载并安装所述APP安装包获得一专门的APP,并在启动该APP后显示所述数据选择界面。In one example, the
在另一个例子中,所述数据分析设备100可以包括至少一个显示装置或者与至少一个显示装置连接,所述数据分析设备100可以通过所述显示装置显示所述数据选择界面。In another example, the
所述数据分析设备100可以在该数据选择界面显示至少一个数据文件。例如,可以显示用户上传的多个数据文件;或者响应用户操作,在所述数据分析设备100中确定多个数据文件并显示;或者响应用户操作,与其他设备进行数据交互,以从其他设备获取用户指定的至少一个数据文件。其中,所述数据文件可以包括,但不仅限于EXCEL数据文件、关系型数据库表等。所述数据文件可以通过图表和/或文字描述的方式对所述多个数据文件进行排列显示,同时还可以现实各数据文件中数据的行数及数据文件的创建日期等信息。如此,用户可以在数据选择界面上直观看出所述数据文件。The
然后所述数据分析设备100的处理器130可以响应用户在数据选择界面的操作,将选中的数据作为待分析数据。其中,所述数据分析设备100可以将用户选中的一个或多个数据作为待分析数据。The
例如,所述数据分析设备100可以响应用户在所述文件选择界面上对数据文件的点击,将用户点击的数据文件作为所述待分析数据。也可以为通过文字或语音输入的指令确定所述待分析数据。For example, the
步骤S120,提供一数据分析交互界面,接收用户在该数据分析交互界面输入的查询指令,并以对话交互的形式显示所述查询指令,其中,所述查询指令包括查询内容。Step S120, providing a data analysis interaction interface, receiving a query instruction input by the user on the data analysis interaction interface, and displaying the query instruction in the form of a dialog interaction, wherein the query instruction includes the query content.
请参照图4,在所述数据分析界面在用户确定所述待分析数据后,显示一数据分析交互界面。所述数据分析设备100可以以提供所述数据选择界面相同或相似的方式提供所述数据分析交互界面,具体请参照步骤S110中对提供数据选择界面的描述,在此不再赘述。Referring to FIG. 4, after the user determines the data to be analyzed, the data analysis interface displays a data analysis interaction interface. The
所述数据分析设备100的处理器130可以接收用户在所述数据分析交互界面输入的查询指令。所述查询指令可以包括需要查询内容,如“展示最近一年每个月的收入趋势”。所述查询指令也可包括查询内容和指定的数据展示方式的组合,如“用折线图展示最近一年每个月的收入趋势”。The
所述以对话交互的形式显示所述查询指令的方式,包括以下任意之一:The manner of displaying the query instruction in the form of a dialog interaction, including any one of the following:
接收并显示用户输入的文字信息,并以对话交互的形式显示所述查询指令;或Receiving and displaying text information input by the user, and displaying the query instruction in the form of a dialog interaction; or
接收用户通过语音输入设备输入的语音信息,将所述语音信息转换为文字指令并以对话交互的形式显示所述文字指令。The voice information input by the user through the voice input device is received, the voice information is converted into a text command, and the text command is displayed in the form of a dialog interaction.
例如,在所述数据分析交互界面上可以显示有文字输入框,用户可以在文字输入框中输入文字信息,所述数据分析设备100在所述数据分析交互界面中一对话交互的形式显示所述文字信息。在所述数据分析界面上还可以显示有语音输入按钮,用户可以通过点击语音输入按钮然后输入语音信息,所述数据分析设备100将语音信息转换为文字信息并通过对话交互的形式显示在所述数据分析交互界面上。For example, a text input box may be displayed on the data analysis interaction interface, and the user may input text information in a text input box, and the
步骤S130,根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据。Step S130: Perform statistical analysis on the data to be analyzed according to the query instruction, and obtain corresponding statistical data.
可选地,所述查询指令包括自然语言形式的语句。所述数据分析设备100对接收到的所述自然语言形式的语句进行解析,得到结构化的数据库查询分析语句,例如,SQL(Structured Query Language,结构化查询语言)查询语句。然后根据所述数据库查询分析语句在所述待分析数据中进行统计分析,得到相应的统计数据。如此,使得不了解专业的数据库查询语句的用户也可以通过自然语言下达查询指令进行数据分析,方便了用户的使用。Optionally, the query instruction comprises a statement in the form of a natural language. The
在本实施例中,数据分析设备100的处理器130可以通过使用通用的语义分析算法对所述自然语言形式的语句进行分析得到所述数据库查询语句,例如采用例如结合深度学习模型的语义识别系统,在此不再赘述。In this embodiment, the
申请人注意到,不同的技术领域对数据分析的需求不同,相应地,用户输入的自然语言形式的查询指令及解析获得的数据库查询分析语句存在较大差异,若采用相同的语义解析算法或词库进行语义解析,会造成语义解析速度较慢,用户等待时间过长。Applicants have noticed that different technical fields have different requirements for data analysis. Correspondingly, the query instructions in the natural language form input by the user and the database query analysis statements obtained by the analysis are quite different, if the same semantic analysis algorithm or word is used. Semantic parsing of the library will result in slower semantic parsing and long waiting time for users.
因此,为了提高语义解析的速度,在本实施例中,所述数据分析设备100中可以预先配置有多个针对不同技术领域的语义解析模型。在将自然语言形式的查询指令解析成所述数据库查询分析语句时,所述数据分析设备100可以根据待分析数据的所述技术领域采用不同的语义解析模型进行解析。例如,针对气象、水文、金融等不同领域,所述数据分析设备100中预先配置了多个语义解析模型,不同的语义解析模型配置成采用不同的解析算法或词库对相应技术领域的自然语言进行快速地解析。Therefore, in order to improve the speed of the semantic analysis, in the embodiment, a plurality of semantic analysis models for different technical fields may be pre-configured in the
具体地,在步骤S110中,所述数据分析设备100在确定所述待分析数据后,还需要确定所述待分析数据所属的目标技术领域。Specifically, in step S110, after determining the data to be analyzed, the
在一个例子中,所述数据分析设备100可以通过所述移动终端向用户显示多个可选的技术领域选项,并响应用户操作,将用户选取的技术领域作为所述待分析数据的目标技术领域。例如,所述数据分析设备100在确定所述待分析数据后通过所述移动终端向用户显示气象、水文、金融等多个选择项,当用户选择“金融”选项时,所述数据分析设备100将金融领域确定为所述待分析数据的目标技术领域。In an example, the
或者,在另一个例子中,所述数据分析设备100可以提取所述待分析数据的文件名称及所述待分析数据中各数据项的名称,对所述文件名称及数据项名称中的关键字进行统计,根据统计结果确定所述待分析数据所述的目标技术领域。例如,当用户选取的待分析数据的文件名称为“收支数据统计表”,或者当所述待分析数据中数据项的名称包括“收入”、“支出”、“结余”等时,所述数据分析设备100可以将该待分析数据所述的目标技术领域确定为金融领域。Alternatively, in another example, the
然后,在步骤S130中,所述数据分析设备100根据在步骤S110中确定的所述目标技术领域采用相应语言解析模型对所述自然语言形式的查询指令进行解析,以得到所述数据库查询分析语句。Then, in step S130, the
如此,针对不同领域的待分析数据,所述数据分析设备100采用相应的语义解析模型进行解析,可以有效提高对自然语言形式的查询指令的解析速度,从而减少用户在查询过程中的等待时间,提高了用户体验。In this way, for the data to be analyzed in different fields, the
在某些实施例中,所述数据分析设备100根据用户输入的自然语言形式的查询指令解析出对应的数据库查询分析语句后,将该查询指令与解析出的数据库查询分析语句记录至一语义对照表。在其他用户输入自然语言形式的查询指令时,先查找用户当前输入的查询指令是否已经记录在所述语义对照表中。In some embodiments, the
若检查到用户当前输入的查询指令已记录在所述语义对照表中,表示该查询指令已经被解析过,则所述数据分析设备100直接从所述语义对照表中直接调取与该查询指令对应的数据库查询分析语句。If it is checked that the query instruction currently input by the user has been recorded in the semantic comparison table, indicating that the query instruction has been parsed, the
若检查到用户当前输入的查询指令未记录在所述语义对照表中,表示该查询指令尚未被解析过,则所述数据分析设备100采用预设的语义解析模型对该自然语言形式的查询指令进行解析,得到相应的结构化的数据库查询分析语句。If it is detected that the query instruction currently input by the user is not recorded in the semantic comparison table, indicating that the query instruction has not been parsed, the
优选地,所述数据库分析设备100在所述语义对照表中查找是否存在与该查询指令对应的数据库查询分析语句时,将用户当前输入的查询指令与所述语义对照表中的查询指令依次进行相似度分析。Preferably, the
在从所述语义对照表中查找到与用户当前输入的查询指令相似度大于第一预设阈值的查询指令时,则直接调取该查询指令对应的数据库查询分析语句。其中,所述第一预设阈值可以设置为95%。When the query instruction that is similar to the query instruction currently input by the user is greater than the first preset threshold from the semantic comparison table, the database query analysis statement corresponding to the query instruction is directly retrieved. The first preset threshold may be set to 95%.
若所述语义对照表记录中的查询指令与用户当前输入的查询指令相似度均未达到第一预设阈值,但所述语义对照表记录中的查询指令中存在与用户当前输入的查询指令相似度大于第二预设阈值的查询指令,则将相似度大于所述第二阈值的查询指令中相似度最大的查询指令对应的数据库查询分析语句作为基础,并根据用户当前输入的查询指令调整该数 据库查询分析语句,将调整后的查询分析语句作为与用户当前输入的查询指令对应的数据库查询分析语句。其中,所述第二预设阈值可以设置为80%If the similarity between the query instruction in the semantic comparison table record and the query instruction currently input by the user does not reach the first preset threshold, the query instruction in the semantic comparison table record is similar to the query instruction currently input by the user. The query instruction whose degree is greater than the second preset threshold is based on the database query analysis statement corresponding to the query instruction with the highest similarity among the query instructions having the similarity greater than the second threshold, and is adjusted according to the query instruction currently input by the user. The database query analysis statement uses the adjusted query analysis statement as a database query analysis statement corresponding to the query instruction currently input by the user. The second preset threshold may be set to 80%.
若所述语义对照表记录中的查询指令与用户当前输入的查询指令相似度均未达到第二预设阈值,则所述数据分析设备采用预设的语义解析模型对该自然语言形式的查询指令进行解析,得到相应的结构化的数据库查询分析语句。If the similarity between the query instruction in the semantic reference record and the query instruction currently input by the user does not reach the second preset threshold, the data analysis device uses the preset semantic analysis model to query the natural language form. Analyze and get the corresponding structured database query analysis statement.
如此,先将用户当前输入的查询指令与之前已解析过的查询指令进行比对,以直接使用之前已解析获得的结构化的数据库查询分析语句或在之前解析获得的数据库查询分析语句基础上进行调整修改,可以有效地避免重复的解析相同的“查询指令”,提高系统的操作效率等。In this way, the query instruction currently input by the user is first compared with the previously interpreted query instruction, so as to directly use the structured database query analysis statement that has been parsed before or on the basis of the database query analysis statement obtained before parsing. Adjustment and modification can effectively avoid repeated analysis of the same "query command" and improve the operating efficiency of the system.
在某些实施例中,所述语义对照表中还记录有各查询指令被使用的热度值,在所述语义对照表中的记录的数据库查询分析语句在查询指令相似度匹配过程中被选中使用过,则增加该数据库查询分析语句对应的查询指令的热度值。在将用户当前输入的查询指令与所述语义对照表中记录的查询指令进行比对匹配时,根据热度值从大到小的顺序依次将所述语义对照表中的记录的查询指令与用户当前输入的查询指令做比对。如此,所述语义对照表中被使用频率较高的查询指令可以优先与用户当前输入查询指令进行匹配,可以提高匹配的速度。In some embodiments, the semantic comparison table further records a heat value used by each query instruction, and the database query analysis statement of the record in the semantic comparison table is selected in the query instruction similarity matching process. If yes, the heat value of the query instruction corresponding to the database query analysis statement is increased. When the query instruction currently input by the user is matched and matched with the query instruction recorded in the semantic comparison table, the query instruction of the record in the semantic comparison table and the current user are sequentially sequentially according to the order of the heat value. The input query instructions are compared. In this way, the query instruction with a higher frequency of use in the semantic comparison table can preferentially match the current input query instruction of the user, which can improve the matching speed.
步骤S140,根据所述查询指令确定数据展示方式,按照所述数据展示方式生成所述统计数据的可视化分析结果。Step S140: Determine a data display manner according to the query instruction, and generate a visual analysis result of the statistical data according to the data display manner.
所述数据分析设备100的处理器130在获得待分析数据后,判断所述查询指令中是否包括指定的数据显示方式。After obtaining the data to be analyzed, the
当所述查询指令中不包括指定的数据显示方式时,所述处理器130按照默认的数据展示方式生成所述统计数据的可视化分析结果。例如,所述查询指令仅包括“显示最近10个月的销售额”时,采用默认的数据展示方式进行待分析数据的展示。所述默认的数据展示方式由用户预先设置得到,例如,柱状图、折线图或数据表格等。When the specified data display mode is not included in the query instruction, the
当所述查询指令中包括指定的数据显示方式时,所述处理器130按照所述指定的数据展示方式生成所述统计数据的可视化分析结果。例如,所述查询指令包括“用折线图展示最近一年每个月的收入趋势”时,将待分析数据按照指定的数据展示方式生成为折线图。When the query instruction includes the specified data display mode, the
可选地,所述数据显示方式包括图形显示方式。所述图形显示方式包括图形样式、颜色数量、颜色种类、图形尺寸中的一种或多种。图形样式包括,但不仅限于折线图、柱状图、仪表盘图、饼图、雷达图、热力图、漏斗图或散点图。Optionally, the data display manner includes a graphic display manner. The graphic display manner includes one or more of a graphic style, a color quantity, a color type, and a graphic size. Graphic styles include, but are not limited to, line charts, histograms, dashboard charts, pie charts, radar charts, heat maps, funnel charts, or scatter plots.
步骤S150,在所述数据分析交互界面以对话交互的形式显示所述可视化分析结果。Step S150, displaying the visual analysis result in the form of a dialog interaction in the data analysis interaction interface.
请再次参照图4,所述数据分析设备100的处理器130在生成所述可视化分析结果后, 以回复对话的方式在所述数据分析交互页面对分析结果进行显示。Referring to FIG. 4 again, after generating the visual analysis result, the
基于上述设计,本实施例提供的人机对话数据方案中,通过人机对话的交互界面接收用户的输入数据或查询指令,并以回复对话的方式反馈可视化的查询结果给用户,相较于现有技术中需要用户在专门的数据库查询界面使用专门数据库查询语言进行数据查询的方式,本实施例提供的方案提供了更友好的人机交互方式,不需要用户具备专业的数据库查询分析知识就能方便直观地进行Based on the above design, in the human-machine dialog data solution provided by the embodiment, the input data or the query instruction of the user is received through the interactive interface of the human-machine dialogue, and the visualized query result is fed back to the user in a manner of replying to the dialogue, compared with the current In the technology, the user needs to use a special database query language to perform data query in a special database query interface. The solution provided in this embodiment provides a more friendly human-computer interaction mode, and does not require the user to have professional database query analysis knowledge. Convenient and intuitive
值得说明的是,本实施例其他可用的变形方案还包括,由所述用户终端200配置的第三方通信软件提供所述数据选择界面或数据分析交互界面,用户发送的信息或指令经所述第三方通信软件的服务器发送给所述数据分析设备100。所述数据分析设备100生成的所述可视化分析结果经所述第三方通信软件的服务器发送至所述用户终端200,再由所述用户终端200上配置的所述第三方通信软件提供数据分析交互界面显示所述可视化的分析结果。其中,所述第三方通信软件可以为诸如微信的即时通信软件。It is to be noted that other available variants of the embodiment further include that the third-party communication software configured by the
例如,在步骤S120中,所述数据分析设备100通过用户终端上配置的第三方通信软件提供一数据分析交互界面,接收所述用户终端通过该第三方通信软件发送的查询指令。然后在步骤S150中,所述数据分析设备100将生成的可视化分析结果发送给所述用户终端,使所述用户终端通过所述第三方通信软件提供的数据分析交互界面以对话交互的形式显示所述可视化分析结果。在这种方案中,所述第三方通信软件及其服务器应视为协助所述数据分析设备100提供了所述数据选择界面或数据分析交互界面,因此上述方案也应视为在本公开的保护范围之内。For example, in step S120, the
可选地,在本实施例中,所述方法还可以包括所述数据分析设备100接收并显示用户在该数据分析交互界面输入的显示方式更新指令,所述显示方式更新指令中包括数据显示方式,然后按照所述显示方式更新指令中的数据显示方式,重新生成所述统计数据的可视化分析结果。Optionally, in this embodiment, the method may further include: the
例如,在步骤S120-S150举例的情形中通过折线图展示最近一年每个月的收入趋势后,用户继续输入“转换为柱状图”,所述数据分析设备100接收并显示这条显示方式更新指令。然后,所述数据分析设备100在步骤S130查询到的待分析数据的基础上,按照重新确定的数据显示方式,将待分析数据生成为柱状图。然后在所述数据分析交互界面对新生成的可视化分析结果进行显示。For example, after the income trend of each month of the most recent year is displayed by a line graph in the case exemplified in steps S120-S150, the user continues to input "convert to histogram", and the
可选地,在本实施例中,所述方法还可以包括所述数据分析设备100接收并显示用户在该数据分析交互界面输入的统计范围更新指令,然后按照所述统计范围更新指令重新确定或修改当前可视化分析结果对应的统计范围,并生成新的可视化分析结果。Optionally, in this embodiment, the method may further include the
例如,在步骤S120-S150举例的情形中通过折线图展示最近一年每个月的收入趋势后, 用户继续输入“去掉第一季度数据”,所述数据分析设备100接收并显示这条统计范围更新指令。然后,所述数据分析设备100在步骤S130查询到的待分析数据的基础上,按照重新确定的统计范围,生成新的可视化分析结果。然后在所述数据分析交互界面对新生成的可视化分析结果进行显示。For example, after the income trend of each month of the most recent year is displayed by the line graph in the case exemplified in steps S120-S150, the user continues to input "remove the first quarter data", and the
可选地,在本实施例中,所述方法还可以包括所述数据分析设备100接收用户在所述数据分析交互界面上对所述可视化分析结果的选取,对选中的所述可视化分析结果进行存储或发送。例如,所述数据分析设备100接收用户在所述数据分析交互界面上对显示的折线图的点击,显示至少一个操作选项(如,保存、发送等)。响应用户对所述操作选项的选取,对生成的分析结果进行存储或发送。Optionally, in this embodiment, the method may further include: the
可选地,在本实施例中,所述方法还可以包括所述数据分析设备100记录用户在所述数据分析交互界面上输入的查询指令及所述数据分析系统响应所述查询指令生成可视化分析结果,生成分析交互记录。响应用户操作,对所述分析交互记录进行存储或发送。Optionally, in this embodiment, the method may further include: the
可选地,在本实施例中,所述方法还可以包括所述数据分析设备100接收并显示用户在该数据分析交互界面输入的默认数据展示方式配置指令,所述默认数据展示方式配置指令中包括用户指定的默认数据展示方式,根据所述默认数据展示方式配置指令配置可视化分析结果默认的数据展示方式。例如,用户可以在所述数据分析交互界面输入“默认使用折线图展示数据”,所述数据分析设备100接收并显示这条默认数据展示方式配置指令,并根据该指令将可视化分析结果默认的数据展示方式配置为通过折线图展示。Optionally, in the embodiment, the method may further include: the
在某些实施例中,所述数据库分析设备100在显示所述可视化分析结果后,还可以在所述分析交互界面显示一纠错反馈选项。当用户发现所述数据库分析设备100反馈的可视化分析结果不理想时可以通过选取所述纠错反馈选项向所述数据分析设备100发送纠错指令。所述数据库分析设备100在接收到所述纠错指令后,重新对用户输入查询指令进行语义解析或者重新从所述语义对照表中显示另一个相似度在第一预设阈值之上的查询指令以供用户选择,并在用户重新进行语义解析或者用户重新选择查询指令后生成新的可视化分析结果显示给用户。In some embodiments, the
基于上述设计,本实施例提供的方法,提供了一种友好的人机对话交互界面,在进行数据分析的过程中,交互过程直观简洁,可以使不具有专业知识的用户也能轻松地使用数据分析功能。Based on the above design, the method provided by the embodiment provides a friendly human-machine dialogue interaction interface. In the process of data analysis, the interaction process is intuitive and concise, so that users without professional knowledge can easily use the data. Analysis function.
如果用户在短时间内针对某一个待分析数据进行多次同样的查询动作,可能会造成所述数据分析设备100短时间内重复执行多次相同的查询分析动作。故在本实施例中,在执行步骤S130之前,先对所述待分析数据的文件标识及所述查询指令中的关键字进行哈 希计算,得到查询哈希值。然后查找是否已生成过与该查询哈希值关联的可视化分析结果。其中,所述待分析数据的文件标识可以为所述待分析数据的文件的元数据或者其他标识该待分析数据唯一身份的标识信息。If the user performs the same query action for a certain data to be analyzed in a short time, the
若查询到与该查询哈希值关联的可视化分析结果,则标识针对相同的待分析数据已执行过相同的查询统计操作,不需要再次重复执行查询统计操作,则所述数据分析设备100直接跳转执行步骤S150,在所述数据分析交互界面以对话交互的形式显示查询到的可视化分析结果。If the result of the visual analysis associated with the query hash value is queried, the same query statistics operation has been performed for the same data to be analyzed, and the
若未查询到与该查询哈希值关联的可视化分析结果,则针对相同的待分析数据尚未执行过所述查询指令要求的查询统计操作,则所述数据分析设备100执行步骤S130,根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据。并在步骤S150根据所述统计数据得到可视化分析结果后,将所述查询哈希值与得到的可视化分析结果关联。如此,下一次执行相同统计操作时可以直接根据所述查询哈希值查询到已生成过的可视化分析结果。If the result of the visual analysis associated with the query hash value is not queried, the query statistical operation required by the query instruction has not been executed for the same data to be analyzed, and the
例如,用户在20分钟前针对数据文件A发送了“用折线图展示最近一年每个月的收入趋势”的查询指令,则所述数据分析设备对根据数据文件A的元数据、关键字“折线图”、关键字“最近一年每个月”及关键字“收入趋势”几个元素进行哈希,得到查询哈希值X,然后查找是否已经生成过与该查询哈希值X关联的可视化分析结果。For example, the user sends a query instruction for "representing the income trend of each month of the most recent year in a line graph" for the data file A 20 minutes ago, then the data analysis device pairs the metadata according to the data file A, the keyword " The line chart, the keyword "every month of the most recent year" and the keyword "income trend" are hashed, get the query hash value X, and then find out whether it has been associated with the query hash value X. Visualize the results of the analysis.
由于之前用户没有针对数据文件A发送了“用折线图展示最近一年每个月的收入趋势”的查询指令,也就没有生成过与查询哈希值A对应的可视化分析结果,则所述数据分析设备100根据用户输入的查询指令进行分析,生成了可视化分析结果B并展示给了用户,并将可视化分析结果B与查询哈希值关联。Since the user did not send a query instruction for displaying the income trend of each month in the last year by using a line graph A, the data analysis result corresponding to the query hash value A is not generated, and the data is not generated. The
然后用户在20分钟以后再次针对数据文件A发送了“用折线图展示最近一年每个月的收入趋势”的查询指令,则所述数据分析设备对根据数据文件A的元数据、关键字“折线图”、关键字“最近一年每个月”及关键字“收入趋势”几个元素进行哈希,得到查询哈希值X,然后查找是否已经生成过与该查询哈希值X关联的可视化分析结果。Then, after 20 minutes, the user again sends a query instruction for "displaying the income trend of each month of the most recent year with a line graph" for the data file A, then the data analysis device pairs the metadata according to the data file A, the keyword " The line chart, the keyword "every month of the most recent year" and the keyword "income trend" are hashed, get the query hash value X, and then find out whether it has been associated with the query hash value X. Visualize the results of the analysis.
由于之前已经生成了与查询哈希值X关联的可视化查询结果B,所述数据分析设备100根据查询哈希值X可以查询到可视化分析结果B,则所述数据分析设备100直接将可视化分析结果B反馈该用户,而不是再次执行根据查询指令进行统计分析的动作。Since the visual query result B associated with the query hash value X has been previously generated, the
进一步地,在本实施例中,为了方式所述数据分析设备100存储过多的可视化分析结果,所述数据分析设备100可以每间隔第一预设时长清除存储时间超过第二预设时长的所述可视化分析结果,其中,所述第一预设时长大于所述第二预设时长。例如,所述数据分析设备100每间隔5个小时清楚该数据分析设备100上存储时长超过20分钟的 可视化分析结果。Further, in this embodiment, in order to store the excessive visual analysis result, the
请参照图5,本实施例还提供一种应用于图1所示数据分析设备100的人机对话数据分析装置110,所述装置包括文件确定模块111、指令获取模块112、统计分析模块113、结果生成模块114及结果显示模块115。Referring to FIG. 5, the embodiment further provides a human-machine dialog
所述文件确定模块111,配置成确定待分析数据。The
本实施例中,所述文件确定模块111可配置成执行图2所示的步骤S110,关于所述文件确定模块111的具体描述可参对所述步骤S110的描述。In this embodiment, the
所述指令获取模块112,配置成提供一数据分析交互界面,接收并显示用户在该数据分析交互界面输入的查询指令,所述查询指令包括查询内容。The
本实施例中,所述指令获取模块112可配置成执行图2所示的步骤S120,关于所述指令获取模块112的具体描述可参对所述步骤S120的描述。In this embodiment, the
所述统计分析模块113,配置成根据所述查询指令在所述待分析数据中进行统计分析,得到相应的统计数据。The
本实施例中,所述统计分析模块113可配置成执行图2所示的步骤S130,关于所述统计分析模块113的具体描述可参对所述步骤S130的描述。In this embodiment, the
所述结果生成模块114,配置成根据所述查询指令确定数据展示方式,按照所述数据展示方式生成所述统计数据的可视化分析结果。The
本实施例中,所述结果生成模块114可配置成执行图2所示的步骤S140,关于所述结果生成模块114的具体描述可参对所述步骤S140的描述。In this embodiment, the
所述结果显示模块115,配置成在所述数据分析交互界面对所述可视化分析结果进行显示。The
本实施例中,所述结果显示模块115可配置成执行图2所示的子步骤S150,关于所述结果显示模块115的具体描述可参对所述子步骤S150的描述。In this embodiment, the
可选地,在本实施例中,所述文件确定模块111具体用于提供一数据选择界面;响应用户在数据选择界面的操作,将选中的数据作为待分析数据。Optionally, in this embodiment, the
可选地,在本实施例中,所述指令获取模块112具体用于接收并显示用户输入的文字信息,并以对话交互的形式显示所述查询指令;或接收用户通过语音输入设备输入的语音信息,将所述语音信息转换为文字指令并以对话交互的形式显示所述文字指令。Optionally, in the embodiment, the
可选地,在本实施例中,所述指令获取模块112具体用于通过用户终端上配置的第三方通信软件提供一数据分析交互界面,接收所述用户终端通过该第三方通信软件发送的查询指令;Optionally, in the embodiment, the
所述结果显示模块具体用于将生成的可视化分析结果发送给所述用户终端,使所述用户终端通过所述第三方通信软件提供的数据分析交互界面以对话交互的形式显示所述可视化分析结果。The result display module is specifically configured to send the generated visual analysis result to the user terminal, so that the user terminal displays the visual analysis result in the form of a dialog interaction through a data analysis interaction interface provided by the third-party communication software. .
综上所述,本公开提供的人机对话数据分析方法及装置,通过提供友好的数据分析交互界面,接收用户在该界面数据的查询指令,根据所述查询指令进行数据分析后,以人机对话的形式在所述分析交互界面显示分析结果。如此,通过人机对话的方式进行交互,使得进行数据分析时的操作更加简洁直观,交互方式更人性化,降低了数据分析功能的使用门槛。In summary, the human-machine dialog data analysis method and apparatus provided by the present disclosure receive a query instruction of the user in the interface data by providing a friendly data analysis interaction interface, and perform data analysis according to the query instruction, and then use the human-machine The form of the dialog displays the analysis results on the analysis interaction interface. In this way, interaction through human-machine dialogue makes the operation of data analysis more concise and intuitive, and the interaction mode is more user-friendly, which reduces the threshold for using data analysis functions.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It is to be understood that the term "comprises", "comprising" or any other variations thereof is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device that comprises a It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, item, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, method, item, or device that comprises the element.
以上所述,仅为本公开的具体实施方式,但本公开的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本公开揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本公开的保护范围之内。因此,本公开的保护范围应所述以权利要求的保护范围为准。The above is only the specific embodiment of the present disclosure, but the scope of the present disclosure is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the disclosure. It should be covered within the scope of protection of the present disclosure. Therefore, the scope of protection of the present disclosure should be determined by the scope of the claims.
本公开实施例提供的人机对话数据分析方法及装置,通过提供友好的数据分析交互界面,接收用户在该界面数据的查询指令,根据所述查询指令进行数据分析后,以人机对话的形式在所述分析交互界面显示分析结果。如此,通过人机对话的方式进行交互,使得进行数据分析时的操作更加简洁直观,交互方式更人性化,降低了数据分析功能的使用门槛。The human-machine dialog data analysis method and apparatus provided by the embodiments of the present disclosure receive a query instruction of the user in the interface data by providing a friendly data analysis interaction interface, and perform data analysis according to the query instruction, in the form of human-machine dialogue. The analysis result is displayed on the analysis interaction interface. In this way, interaction through human-machine dialogue makes the operation of data analysis more concise and intuitive, and the interaction mode is more user-friendly, which reduces the threshold for using data analysis functions.
Claims (20)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710363970.1A CN107193948B (en) | 2017-05-22 | 2017-05-22 | Human-computer dialogue data analysing method and device |
| CN201710363970.1 | 2017-05-22 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2018214859A1 true WO2018214859A1 (en) | 2018-11-29 |
Family
ID=59874378
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2018/087768 Ceased WO2018214859A1 (en) | 2017-05-22 | 2018-05-22 | Method and device for analyzing man-machine dialogue data |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN107193948B (en) |
| WO (1) | WO2018214859A1 (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113743622A (en) * | 2021-08-04 | 2021-12-03 | 珠海许继芝电网自动化有限公司 | Power distribution network analysis system and method based on N-1 criterion |
| CN115712455A (en) * | 2022-11-23 | 2023-02-24 | 南方电网调峰调频发电有限公司检修试验分公司 | Data analysis method, device, equipment and storage medium |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107193948B (en) * | 2017-05-22 | 2018-04-20 | 邢加和 | Human-computer dialogue data analysing method and device |
| CN107748784B (en) * | 2017-10-26 | 2021-05-25 | 江苏赛睿信息科技股份有限公司 | Method for realizing structured data search through natural language |
| CN108108391A (en) * | 2017-11-21 | 2018-06-01 | 众安信息技术服务有限公司 | For the processing method and device of the information of data visualization |
| CN108241735A (en) * | 2017-12-05 | 2018-07-03 | 新华智云科技有限公司 | A kind of data analysing method and equipment |
| CN109508441B (en) * | 2018-08-21 | 2023-12-08 | 江苏赛睿信息科技股份有限公司 | Method and device for realizing data statistical analysis through natural language and electronic equipment |
| CN109669966A (en) * | 2018-12-12 | 2019-04-23 | 江苏赛睿信息科技股份有限公司 | Multi-stage data analysis method, device and electronic equipment |
| CN111161804B (en) * | 2019-12-27 | 2024-03-08 | 北京百迈客生物科技有限公司 | Query method and system for species genomics database |
| CN112883168A (en) * | 2021-03-19 | 2021-06-01 | 数预智能科技(上海)有限公司 | Intelligent reading method based on data |
| CN117076881A (en) * | 2023-08-07 | 2023-11-17 | 金蝶软件(中国)有限公司 | Service data processing method, device and computer equipment |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6766320B1 (en) * | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
| CN105247511A (en) * | 2013-06-07 | 2016-01-13 | 苹果公司 | Intelligent automated assistant |
| CN105677835A (en) * | 2016-01-04 | 2016-06-15 | 浪潮软件集团有限公司 | Method and device for making data visualization chart |
| CN107193948A (en) * | 2017-05-22 | 2017-09-22 | 邢加和 | Human-computer dialogue data analysing method and device |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN100578539C (en) * | 2006-02-28 | 2010-01-06 | 腾讯科技(深圳)有限公司 | Automatic Question Answering Method and System |
| US9335911B1 (en) * | 2014-12-29 | 2016-05-10 | Palantir Technologies Inc. | Interactive user interface for dynamic data analysis exploration and query processing |
| CN105786772A (en) * | 2016-01-25 | 2016-07-20 | 山东省科技统计分析研究中心 | Statistic analysis system in cloud platform |
| CN106202476B (en) * | 2016-07-14 | 2017-06-06 | 广州安望信息科技有限公司 | A kind of interactive method and device of knowledge based collection of illustrative plates |
-
2017
- 2017-05-22 CN CN201710363970.1A patent/CN107193948B/en active Active
-
2018
- 2018-05-22 WO PCT/CN2018/087768 patent/WO2018214859A1/en not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6766320B1 (en) * | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
| CN105247511A (en) * | 2013-06-07 | 2016-01-13 | 苹果公司 | Intelligent automated assistant |
| CN105677835A (en) * | 2016-01-04 | 2016-06-15 | 浪潮软件集团有限公司 | Method and device for making data visualization chart |
| CN107193948A (en) * | 2017-05-22 | 2017-09-22 | 邢加和 | Human-computer dialogue data analysing method and device |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113743622A (en) * | 2021-08-04 | 2021-12-03 | 珠海许继芝电网自动化有限公司 | Power distribution network analysis system and method based on N-1 criterion |
| CN113743622B (en) * | 2021-08-04 | 2024-05-03 | 珠海许继芝电网自动化有限公司 | N-1 criterion-based power distribution network analysis system and method |
| CN115712455A (en) * | 2022-11-23 | 2023-02-24 | 南方电网调峰调频发电有限公司检修试验分公司 | Data analysis method, device, equipment and storage medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN107193948B (en) | 2018-04-20 |
| CN107193948A (en) | 2017-09-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2018214859A1 (en) | Method and device for analyzing man-machine dialogue data | |
| US11698900B2 (en) | Leveraging search query history in a search interface | |
| US10824691B2 (en) | Page rendering method, device, and data storage medium | |
| US11886430B1 (en) | Intent-based natural language processing system | |
| US9043717B2 (en) | Multi-lane time-synched visualizations of machine data events | |
| WO2020253399A1 (en) | Log classification rule generation method, device, apparatus, and readable storage medium | |
| CN110659349A (en) | Log query method, apparatus, device, and computer-readable storage medium | |
| CN104778241A (en) | Report generation method and system | |
| CN109491736B (en) | Display method and device of pop-up frame window | |
| CN110955674A (en) | Asynchronous export method and component based on java service | |
| CN109360557A (en) | The method, apparatus and computer equipment of voice control application program | |
| CN113515564B (en) | J2 EE-based data access method, device, equipment and storage medium | |
| WO2019076050A1 (en) | Business report query method and apparatus, computer device and storage medium | |
| CN111399941A (en) | Batch configuration method, device and equipment for BMC (baseboard management controller) functions | |
| CN111026775A (en) | Determination method, device, server and storage medium for associated indicators | |
| CN104572813A (en) | Report generation method and device | |
| CN112306312A (en) | Data processing method, data processing system, information creation method and device | |
| WO2017088702A1 (en) | Editing method and device for database editor | |
| WO2020037921A1 (en) | Expression picture prompting method and apparatus, computer device, and storage medium | |
| CN114297443A (en) | Processing method, device and equipment for graph data query statement and storage medium | |
| CN109542408B (en) | Intelligent template configuration method and device | |
| CN116186123A (en) | A system and method for generating BI charts based on big data applications | |
| CN108694172A (en) | Information output method and device | |
| CN103092942B (en) | Query Information obtains and the client of displaying, method and system | |
| CN116303428A (en) | Data processing method and device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18806668 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 18806668 Country of ref document: EP Kind code of ref document: A1 |
|
| 32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 12.06.2020.) |