CN115701633B - Voice control method and device of application program, storage medium and electronic equipment - Google Patents
Voice control method and device of application program, storage medium and electronic equipmentInfo
- Publication number
- CN115701633B CN115701633B CN202110881207.4A CN202110881207A CN115701633B CN 115701633 B CN115701633 B CN 115701633B CN 202110881207 A CN202110881207 A CN 202110881207A CN 115701633 B CN115701633 B CN 115701633B
- Authority
- CN
- China
- Prior art keywords
- application program
- voice
- target application
- determining
- voice control
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Landscapes
- Telephonic Communication Services (AREA)
- Stored Programmes (AREA)
Abstract
The invention provides a voice control method and device of an application program, a computer readable storage medium and electronic equipment, wherein the method comprises the steps of identifying an acquired voice instruction based on the voice control program and determining a semantic identification result corresponding to the voice instruction; the method comprises the steps of determining a target application program based on a semantic recognition result, determining a communication execution mode between the voice control program and the target application program based on voice service registration information corresponding to the target application program and a preset configuration table, and controlling the target application program to execute operation corresponding to the voice instruction based on the communication execution mode. The technical scheme provided by the invention can realize voice control on the downloaded target application program, and is convenient for users to use.
Description
Technical Field
The present invention relates to the field of speech control technology, and more particularly, to a speech control method and apparatus for an application program, a storage medium, and an electronic device.
Background
With the continuous progress of man-machine interaction technology, voice control functions on intelligent terminals are becoming more and more common. At present, the voice control function on the intelligent terminal is usually finished through factory debugging, in daily life, a user often downloads and installs various application programs in the application market of the intelligent terminal according to own requirements, and the intelligent terminal cannot perform voice control on the application programs newly downloaded from the application market, so that the voice control method for determining an application program has important significance.
Disclosure of Invention
The invention provides a voice control method and device of an application program, a computer readable storage medium and electronic equipment, and aims to solve the technical problem that an existing intelligent terminal cannot perform voice control on the application program newly downloaded from an application market.
In a first aspect, a method for controlling voice of an application program is provided, which specifically includes the following steps:
identifying the acquired voice command based on a voice control program, and determining a semantic identification result corresponding to the voice command;
Determining a target application program based on the semantic recognition result;
Determining a communication execution mode between the voice control program and the target application program based on voice service registration information corresponding to the target application program and a preset configuration table;
and controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode.
In a second aspect, a voice control device for an application program is provided, which specifically includes the following modules:
the recognition processing module is used for recognizing the acquired voice command based on the voice control program and determining a semantic recognition result corresponding to the voice command;
The program determining module is used for determining a target application program based on the semantic recognition result;
the mode determining module is used for determining the communication execution mode between the voice control program and the target application program based on the voice service registration information corresponding to the target application program and a preset configuration table;
and the operation processing module is used for controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode.
In a third aspect, a computer-readable storage medium storing a computer program for executing the voice control method of the application program described above is provided.
In a fourth aspect, there is provided an electronic device comprising:
A processor;
A memory for storing the processor-executable instructions;
the processor is configured to read the executable instruction from the memory, and execute the instruction to implement the voice control method of the application program.
Compared with the prior art, the voice control method and device for the application program, the computer readable storage medium and the electronic equipment provided by the invention at least comprise the following beneficial effects:
After a user gives a voice command, the technical scheme provided by the invention identifies the acquired voice command through a voice control program supporting a voice function in the system, determines a semantic recognition result corresponding to the voice command, and determines a target application program which the user intends to control through the voice command according to the semantic recognition result. Further determining voice service registration information corresponding to the target application program, wherein the existence of the voice service registration information indicates that the target application program has performed voice service registration to the voice control program, namely that the target application program has the authority of the voice control program to support voice service, namely that the downloaded application program is allowed to perform voice service registration in the technical scheme provided by the invention, so that a precondition is provided for voice control of the downloaded application program. Because more application programs are in the market, in order to realize information interaction between the voice control program and various downloaded application programs, different communication execution modes are determined for different downloaded application programs, so that when corresponding voice service registration information exists in a target application program, the communication execution mode between the voice control program and the target application program is determined by combining a preset configuration table acquired in advance, namely, the voice control program and the target application program can perform information interaction through the determined communication execution mode, the target application program can acquire information related to voice instructions, so that the target application program can execute operations corresponding to the voice instructions, voice control of a user on the target application program is realized, and convenience is brought to the user.
Drawings
In order to more clearly illustrate the technical solutions of the present invention, the drawings that are needed in the description of the present invention will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort to a person skilled in the art.
FIG. 1 is a flowchart illustrating a method for controlling speech of an application according to an exemplary embodiment of the present invention;
FIG. 2 is a flowchart illustrating a method for controlling voice of an application according to an exemplary embodiment of the present invention;
FIG. 3 is a flowchart illustrating a method for controlling speech of an application according to an exemplary embodiment of the present invention;
FIG. 4 is a flowchart illustrating a method for controlling voice of an application according to an exemplary embodiment of the present invention;
FIG. 5 is a flowchart of a method for controlling voice of an application according to an exemplary embodiment of the present invention;
fig. 6 is a schematic structural diagram of a voice control apparatus for an application according to an exemplary embodiment of the present invention;
fig. 7 is a block diagram of an electronic device according to an exemplary embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made more apparent and fully hereinafter with reference to the accompanying drawings, in which some, but not all embodiments of the invention are shown. All other embodiments, which can be made by one of ordinary skill in the art without creative efforts, are within the scope of protection of the embodiments of the present invention according to the embodiments of the present invention.
Exemplary method
Fig. 1 is a flowchart of a voice control method of an application program according to an exemplary embodiment of the present invention, which specifically includes the following steps:
step 10, based on the voice control program, the acquired voice command is identified, and a semantic identification result corresponding to the voice command is determined.
In one embodiment, the voice control program is a voice control program supporting voice functions installed in the intelligent terminal system, wherein the intelligent terminal includes, but is not limited to, a mobile phone, a computer, a tablet computer, and a car machine. Specifically, a voice control program A is installed in the vehicle-mounted system, after a user gives a voice instruction, the voice acquisition device acquires the voice instruction and transmits the voice instruction to the voice control program A, the voice control program A identifies the received voice instruction, and a semantic identification result corresponding to the voice instruction is determined.
And step 20, determining a target application program based on the semantic recognition result.
In one embodiment, the target application refers to a program object that the user intends to control through voice instructions. For example, if the determined semantic recognition result is "application B" and "play song", it is determined that the target application is application B. Of course, when the user issues a voice command, the application program B may not be explicitly mentioned, for example, the voice command is "open news C", at this time, "news C" in the semantic recognition result is extracted, and if the news C is the content corresponding to the application program D, the application program D is the target application program.
And step 30, determining a communication execution mode between the voice control program and the target application program based on the voice service registration information corresponding to the target application program and a preset configuration table.
In an embodiment, after determining the target application program, determining voice service registration information corresponding to the target application program, if the target application program performs voice service registration to the voice control program, the target application program has the corresponding voice service registration information, and if the target application program does not perform voice service registration to the voice control program, the target application program does not have the corresponding voice service registration information, that is, in the technical scheme provided by the invention, the downloaded application program is allowed to perform voice service registration, thereby providing preconditions for voice control of the downloaded application program. In addition, because more application programs are available in the market, in order to realize information interaction between the voice control program and various downloaded application programs, different communication execution modes are determined for different downloaded application programs, and after corresponding voice service registration information exists in the target application program, the communication execution mode between the voice control program and the target application program can be determined according to a preset configuration table. Wherein the communication execution mode includes, but is not limited to, a calling mode and a virtual clicking mode.
Specifically, whether the target application program has corresponding voice service registration information is judged, if so, the communication execution mode between the voice control program and the target application program is determined based on a preset configuration table, and if not, the current flow is stopped, namely, the target application program cannot be controlled through voice.
And step 40, controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode.
In an embodiment, through the determined communication execution mode, information interaction between the voice control program and the target application program can be achieved, and then the target application program can acquire information related to the voice instruction, so that the target application program can execute an operation corresponding to the voice instruction.
In the above embodiment, after the user issues the voice command, the voice control program supporting the voice function in the system identifies the acquired voice command, determines the semantic recognition result corresponding to the voice command, and determines the target application program that the user intends to control through the voice command according to the semantic recognition result. Further determining voice service registration information corresponding to the target application program, wherein the presence of the voice service registration information indicates that the target application program has performed voice service registration with the voice control program, that is, the target application program has permission of the voice control program to support voice service, that is, in the technical scheme provided by the embodiment, the downloaded application program is allowed to perform voice service registration, thereby providing a precondition for performing voice control on the downloaded application program. Because more application programs are in the market, in order to realize information interaction between the voice control program and various downloaded application programs, different communication execution modes are determined for different downloaded application programs, so that when corresponding voice service registration information exists in a target application program, the communication execution mode between the voice control program and the target application program is determined by combining a preset configuration table acquired in advance, namely, the voice control program and the target application program can perform information interaction through the determined communication execution mode, the target application program can acquire information related to voice instructions, so that the target application program can execute operations corresponding to the voice instructions, voice control of a user on the target application program is realized, and convenience is brought to the user.
As shown in fig. 2, in an exemplary embodiment of the present invention based on the embodiment shown in fig. 1, step 30 is based on the voice service registration information and the preset configuration table corresponding to the target application, the determining the communication execution mode between the voice control program and the target application program specifically comprises the following steps:
step 301, if the target application program has voice service registration information, determining a call relationship between the voice control program and the target application program based on a preset configuration table.
Step 302, determining a communication execution mode based on the calling relation.
In the above embodiment, the preset configuration table is recorded with configuration related information of the application program, and according to the preset configuration table, it can be determined whether the call relationship between the voice control program and each application program, i.e. the downloaded application program is actively called by the voice control program, or the downloaded application program actively calls the voice control program, where different call modes correspond to different communication execution modes. Therefore, when the target application program has voice service registration information, the calling relation between the voice control program and the target application program is further determined according to the preset configuration table, and further the communication execution mode is determined. With the increase of application programs in the market, a part of downloaded application programs can actively call a voice control program, and meanwhile, the situation that the protocol of the part of downloaded application programs is not adaptive to that of an intelligent terminal system exists, so that the voice control program is required to actively call the part of application programs, and in the embodiment, two possible calling relations are considered, so that conditions are provided for realizing voice control on various different application programs.
As shown in fig. 3, based on the embodiment shown in fig. 2, in an exemplary embodiment of the present invention, step 302 determines a communication execution mode based on the call relationship, which includes step 3021, if the call relationship is that the voice control program calls the target application program, determining that the communication execution mode is middleware communication.
In an embodiment, when the protocol of the downloaded application program is not matched with the protocol of the intelligent terminal system, in order to meet the requirement of the user on voice control of the part of application program, a developer needs to adapt according to the externally checked SDK of the part of application program, and configures middleware, so that the voice control program can call the part of application program through the middleware, and therefore when the call relation is that the voice control program calls the target application program, the communication execution mode is middleware communication. Specifically, for downloading an application program commonly used in an application market, downloading an application program with the downloading capacity of the application market being the first 100, if the application program is not matched with the protocol of the intelligent terminal system, a developer can perform the adaptation work, after the developer performs the adaptation, the packet name corresponding to the application program is added into a preset configuration table, so that a list of the application program which can be actively invoked by the voice control program can be obtained from the preset configuration table.
Step 40, based on the communication execution mode, controlling the target application program to execute the operation corresponding to the voice instruction, including:
Step 401, if the communication execution mode is middleware communication, determining target configuration information in the middleware based on the semantic recognition result.
Step 402, calling a target interface of the target application program based on the target configuration information, so that the target application program executes the operation corresponding to the voice instruction.
In the above embodiment, different configuration information exists for different application programs, so after determining that the communication execution mode is middleware communication, the target configuration information in the middleware is further determined according to the semantic recognition result, a target interface of the target application program can be called according to the target configuration information, and an operation action corresponding to the voice instruction can be realized through the target interface. In this embodiment, communication between the voice control program and the target application program can be effectively implemented by adopting a middleware communication manner, so that conditions are provided for voice control of the commonly used downloaded application program, so that a user can perform voice control on more downloaded application programs.
For example, a developer performs adaptation on the application program E, and configures information in the middleware, so that the application program E can be called by the middleware, a control function in the SDK in the application program E is implemented, and a packet name corresponding to the application program E is added to a preset configuration table, that is, it can be determined that the voice control program actively calls the application program E through the preset configuration table. When the voice command of the user is "play music with application program E", it can determine that the target application program is application program E, determine whether the application program E has voice service registration information, if the application program E has voice service registration information, further determine whether the application program E is in a preset configuration table, if the application program E is in the preset configuration table, determine a call relationship between the application program E and the voice control program, if it is determined that the call relationship is that the voice control program calls the target application program, it is known that the communication execution mode is middleware communication, according to the determined target application program, i.e. application program E, and the target action, it determines target configuration information, and calls a target interface for realizing a play function in the target application program according to the target configuration information, so that the target interface executes the play action, thereby realizing voice control of the downloaded target application program by voice.
When the call relation between the voice control program and the application program is that the voice control program actively calls the application program, the call relation is completed through middleware when the application program registers voice service with the voice control program. In one possible implementation manner, a first application program is acquired, if the voice control program is determined to call the first application program based on a preset configuration table, a first voice service registration request sent by a middleware is received, and voice service registration information corresponding to the first application program is determined based on the first voice service registration request. The middleware can be controlled to send a first voice service registration request to the voice control program if the voice control program is determined to call the first application program based on the preset configuration table after the first application program is acquired in a downloading mode, so that the voice control program determines the authority of the voice service of the first application program according to the first voice service registration request. After the first application program is downloaded, the system will broadcast the new downloaded application program, and the middleware will send the first voice service registration request to the voice control program based on the broadcast, so as to register the voice service for the first application program in time after the first application program is downloaded. The first application program can be acquired through a mode selected by a user, namely, when the user wants to realize voice control on the first application program, the first application program is actively selected on a voice function registration interface to acquire the first application program, and further, the voice service registration of the downloaded first application program is realized.
As shown in fig. 4, based on the embodiment shown in fig. 2, in an exemplary embodiment of the present invention, the step 302 of determining the communication execution mode based on the call relationship includes step 3022 of determining that the communication execution mode is direct call communication if the call relationship is that the target application call the voice control program.
In an embodiment, if the call relationship is that the target application program calls the voice control program, that is, the target application program performs active call, information interaction can be directly realized by the way that the target application program calls the voice control program, without using middleware. Specifically, in order to popularize and use a part of application programs, the application programs can configure the application programs according to the SDK checked by the outside of the voice control program, so that the protocol of the application programs is matched with the protocol of the intelligent terminal system, namely after the application programs are downloaded, the application programs can call the voice control program, and the part of application programs can be recorded in a preset configuration table, so that the call relation can be determined to call the voice control program for the target application program through the preset configuration table.
Step 40, based on the communication execution mode, controlling the target application program to execute the operation corresponding to the voice instruction, including:
Step 403, if the communication execution mode is direct call communication, receiving a call request of the target application program;
And step 404, returning the logic result determined based on the call request to the target application program so that the target application program executes the operation corresponding to the voice instruction.
In the above embodiment, if the communication execution mode is direct call communication, the voice control program directly receives a call request of the target application program, calls the call request according to the call request of the target application program, and returns a logic result determined according to the call request to the target application program, so that the target application program executes an operation corresponding to the voice instruction. In one possible implementation, after the speech control program identifies the target application program, notification information is passed to the target application program, and the target application program passes the call request to the target application program based on the notification information. In this embodiment, the voice control program can be interacted through the mode of actively calling the target application program, so as to realize voice control on the target application program, and realize voice control on more downloaded application programs by a user.
When the call relation between the voice control program and the application program is that the application program actively calls the voice control program, the call relation is completed by the application program when the application program registers voice service with the voice control program. In one possible implementation, a second application program is acquired, if the second application program is determined to call the voice control program based on a preset configuration table, a second voice service registration request of the second application program is received, and voice service registration information of the second application program is determined based on the second voice service registration request. The second application program directly transmits a second voice service registration request to the voice control program when the second application program starts to be installed after the downloading is completed if the second application program is determined to call the voice control program based on the preset configuration table after the second application program is obtained in a downloading mode, so that the voice control program determines the authority of the voice service of the second application program according to the second voice service registration request, namely, the voice service registration is performed for the second application program in time after the second application program is downloaded. The second application program can be acquired through a mode selected by a user, namely, when the user wants to realize voice control on the second application program for the downloaded and installed second application program, the user actively selects on a voice function registration interface to acquire the second application program, and further, the voice service registration of the downloaded second application program is realized.
As shown in fig. 5, based on the embodiment shown in fig. 1, in an exemplary embodiment of the present invention, step 30 is based on the voice service registration information and the preset configuration table corresponding to the target application, the determining the communication execution mode between the voice control program and the target application program specifically comprises the following steps:
step 303, if the target application program has voice service registration information and the target application program is not in the preset configuration table, determining that the communication execution mode between the voice control program and the target application program is virtual click.
Step 40, based on the communication execution mode, controlling the target application program to execute the operation corresponding to the voice instruction, including:
step 405, if the communication execution mode is virtual click, determining a target key corresponding to the semantic recognition result in the target application program;
step 406, triggering the target key to enable the target application program to execute the operation corresponding to the voice instruction.
In the above embodiment, if the target application program has voice service registration information, but the target application program is not in the preset configuration table, the target application program and the voice control program cannot be called each other due to interface difference, and in order to satisfy voice control of such application program, a communication execution mode of virtual clicking is adopted in this embodiment. Specifically, the system side can acquire the display text of the application program, register the display text as corresponding voice, determine a target key in the target application program according to the registered content after a certain display text exists in the semantic recognition result, and trigger the target key to enable the target application program to execute the operation corresponding to the voice instruction. In the embodiment, the voice control can be performed on the application programs in a virtual click mode, so that the requirement of a user for controlling more application programs is met.
When the call relationship does not exist between the voice control program and the application program, the voice service registration can be performed on the application program of the part by acquiring a third application program, if the third application program is not in the preset configuration table, determining the association relationship between the display text and the keys, and extracting the display text of the third application program to perform the voice service registration. The third application program can be acquired through a downloading mode, namely the third application program is downloaded, if the third application program is not in a preset configuration table, the display characters of the third application program are acquired, keys corresponding to each display character are determined, namely the association relation between the display characters and the keys is determined, and then the display characters are registered in the voice control program for voice service, so that the voice control program determines voice service registration information of the third application program according to the voice service registration of the display characters, confirms the authority of voice service of the third application program, and realizes the voice service registration for the third application program in time. The third application program can be acquired through a mode selected by a user, namely, when the user wants to realize control over the third application program for the third application program which is downloaded and installed, the user actively selects on a voice function registration interface, so that the third application program is acquired, and further, voice service registration of the third application program which is downloaded is realized.
In an exemplary embodiment of the present invention, the extracting the display text of the third application program for voice service registration includes extracting the display text of the third application program, determining a display position of the display text, and performing voice service registration based on the display text and the display position.
In the above embodiment, the display text of the third application program is extracted, and the display position corresponding to the display text is further determined, so that when the voice service registration is performed, not only the specific content of the display text but also the display position is considered. When the same display text exists at different positions of the application program, if voice service registration is performed only according to the display text, voice control may not be performed accurately. Therefore, in order to achieve more accurate voice control, the display text and the display position are considered at the same time when voice service registration is performed, so that the target key is accurately determined by displaying the content of the text and the display position.
In an exemplary embodiment of the invention, the method further comprises updating the preset configuration table based on the received configuration information. In the above embodiment, along with the update of the application program in the market, the advancement of the adaptation work of the developer, and the popularization requirement of the application program, the content of the preset configuration table needs to be updated, so that the preset configuration table can be managed through the configuration information, so that the information in the preset configuration table has real-time performance, and the user can perform voice control on the latest and hottest downloaded application program, thereby having better convenience.
Exemplary apparatus
According to the same concept as the method embodiment of the present invention, as shown in fig. 6, the embodiment of the present invention further provides a voice control device for an application program, which specifically includes the following modules:
The recognition processing module 61 is configured to recognize the acquired voice command based on a voice control program, and determine a semantic recognition result corresponding to the voice command;
a program determining module 62, configured to determine a target application program based on the semantic recognition result;
A mode determining module 63, configured to determine a communication execution mode between the voice control program and the target application program based on the voice service registration information corresponding to the target application program and a preset configuration table;
And the operation processing module 64 is configured to control the target application program to execute an operation corresponding to the voice instruction based on the communication execution mode.
In an exemplary embodiment of the invention, the mode determining module comprises
The relation determining unit is used for determining the calling relation between the voice control program and the target application program based on a preset configuration table if the target application program has voice service registration information;
and the mode determining unit is used for determining a communication execution mode based on the calling relation.
In an exemplary embodiment of the present invention, the manner determining unit is configured to determine that the communication execution manner is middleware communication if the calling relationship is that the voice control program calls the target application program;
The operation processing module includes:
The information determining unit is used for determining target configuration information in the middleware based on the semantic identification result if the communication execution mode is middleware communication;
and the first operation unit is used for calling a target interface of the target application program based on the target configuration information so that the target application program executes the operation corresponding to the voice instruction.
In an exemplary embodiment of the invention, the apparatus further comprises:
the first acquisition module is used for acquiring a first application program, and if the voice control program is determined to call the first application program based on a preset configuration table, a first voice service registration request sent by the middleware is received;
And the first registration module is used for determining voice service registration information corresponding to the first application program based on the first voice service registration request.
In an exemplary embodiment of the present invention, the manner determining unit is configured to determine that the communication execution manner is direct call communication if the call relationship is that the target application calls the voice control program.
The operation processing module includes:
The call processing unit is used for receiving a call request of the target application program if the communication execution mode is direct call communication;
And the second operation unit is used for returning the logic result determined based on the call request to the target application program so that the target application program executes the operation corresponding to the voice instruction.
In an exemplary embodiment of the invention, the apparatus further comprises:
the second acquisition module is used for acquiring a second application program, and if the second application program is determined to call the voice control program based on a preset configuration table, a second voice service registration request of the second application program is received;
And the second registration module is used for determining the voice service registration information of the second application program based on the second voice service registration request.
In an exemplary embodiment of the present invention, the mode determining module includes:
The virtual processing unit is used for determining that the communication execution mode between the voice control program and the target application program is virtual clicking if the voice service registration information exists in the target application program and the target application program is not in a preset configuration table;
The operation processing module includes:
the key determining unit is used for determining a target key corresponding to the semantic recognition result in the target application program if the communication execution mode is virtual clicking;
And the third operation unit is used for triggering the target key so that the target application program executes the operation corresponding to the voice instruction.
In an exemplary embodiment of the invention, the apparatus further comprises:
and the third registration module is used for acquiring a third application program, determining the association relation between the display text and the keys if the third application program is not in the preset configuration table, and extracting the display text of the third application program for voice service registration.
In an exemplary embodiment of the present invention, the third registration module is configured to obtain a third application, determine an association between the display text and the key if the third application is not in the preset configuration table, extract the display text of the third application, determine a display position of the display text, and perform voice service registration based on the display text and the display position.
In an exemplary embodiment of the invention, the apparatus further comprises:
and the updating processing module is used for updating the preset configuration table based on the received configuration information.
Exemplary electronic device
Fig. 7 illustrates a block diagram of an electronic device according to an embodiment of the invention.
As shown in fig. 7, the electronic device 100 includes one or more processors 101 and memory 102.
The processor 101 may be a Central Processing Unit (CPU) or other form of processing unit having data processing and/or instruction execution capabilities and may control other components in the electronic device 100 to perform desired functions.
Memory 102 may include one or more computer program products that may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, random Access Memory (RAM) and/or cache memory (cache), and the like. The non-volatile memory may include, for example, read Only Memory (ROM), hard disk, flash memory, and the like. One or more computer program instructions may be stored on the computer readable storage medium that can be executed by the processor 101 to implement the voice control method and/or other desired functions of the application of the various embodiments of the present invention described above.
In one example, electronic device 100 may also include input device 103 and output device 104, which are interconnected by a bus system and/or other forms of connection mechanisms (not shown).
Of course, only some of the components of the electronic device 100 relevant to the present invention are shown in fig. 7 for simplicity, components such as buses, input/output interfaces, etc. being omitted. In addition, the electronic device 100 may include any other suitable components depending on the particular application.
Exemplary computer program product and computer readable storage Medium
In addition to the methods and apparatus described above, embodiments of the invention may also be a computer program product comprising computer program instructions which, when executed by a processor, cause the processor to perform steps in a speech control method according to an application of various embodiments of the invention described in the "exemplary methods" section of this specification.
The computer program product may write program code for performing operations of embodiments of the present invention in any combination of one or more programming languages, including an object oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device, partly on a remote computing device, or entirely on the remote computing device or server.
Furthermore, embodiments of the present invention may also be a computer-readable storage medium, on which computer program instructions are stored, which, when being executed by a processor, cause the processor to perform the steps in a speech control method according to an application program of various embodiments of the present invention described in the above section "exemplary method" of the present specification.
The computer readable storage medium may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. The readable storage medium may include, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of a readable storage medium include an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The basic principles of the present invention have been described above in connection with specific embodiments, but it should be noted that the advantages, benefits, effects, etc. mentioned in the present invention are merely examples and not intended to be limiting, and these advantages, benefits, effects, etc. are not to be construed as necessarily possessed by the various embodiments of the invention. Furthermore, the specific details of the invention described above are for purposes of illustration and understanding only, and are not intended to be limiting, as the invention may be practiced with the specific details described above.
The block diagrams of the devices, apparatuses, devices, systems referred to in the present invention are only illustrative examples and are not intended to require or imply that the connections, arrangements, configurations must be made in the manner shown in the block diagrams. As will be appreciated by one of skill in the art, the devices, apparatuses, devices, systems may be connected, arranged, configured in any manner. Words such as "including," "comprising," "having," and the like are words of openness and mean "including but not limited to," and are used interchangeably therewith. The terms "or" and "as used herein refer to and are used interchangeably with the term" and/or "unless the context clearly indicates otherwise. The term "such as" as used herein refers to, and is used interchangeably with, the phrase "such as, but not limited to.
It is also noted that in the apparatus, devices and methods of the present invention, the components or steps may be disassembled and/or assembled. Such decomposition and/or recombination should be considered as equivalent aspects of the present invention.
The previous description of the inventive aspects is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects without departing from the scope of the invention. Thus, the present invention is not intended to be limited to the aspects shown herein but is to be accorded the widest scope consistent with the principles and novel features herein.
The foregoing description has been presented for purposes of illustration and description. Furthermore, this description is not intended to limit embodiments of the invention to the form disclosed herein. Although a number of example aspects and embodiments have been discussed above, a person of ordinary skill in the art will recognize certain variations, modifications, alterations, additions, and subcombinations thereof.
Claims (12)
1. The voice control method of the application program is characterized by comprising the following steps:
identifying the acquired voice command based on a voice control program, and determining a semantic identification result corresponding to the voice command;
Determining a target application program based on the semantic recognition result;
Determining a communication execution mode between the voice control program and the target application program based on voice service registration information corresponding to the target application program and a preset configuration table;
Based on the communication execution mode, controlling the target application program to execute the operation corresponding to the voice instruction;
the determining, based on the voice service registration information and the preset configuration table corresponding to the target application program, a communication execution mode between the voice control program and the target application program includes:
If the target application program has voice service registration information, determining a calling relationship between the voice control program and the target application program based on a preset configuration table;
Determining a communication execution mode based on the calling relation;
the determining the communication execution mode based on the calling relation comprises the following steps:
If the calling relation is that the voice control program calls the target application program, determining that the communication execution mode is middleware communication;
and if the calling relation is that the target application program calls the voice control program, determining that the communication execution mode is direct calling communication.
2. The method according to claim 1, wherein the controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode includes:
If the communication execution mode is middleware communication, determining target configuration information in the middleware based on the semantic identification result;
And calling a target interface of the target application program based on the target configuration information so that the target application program executes the operation corresponding to the voice instruction.
3. The method of claim 2, the method further comprising:
acquiring a first application program, and if the voice control program is determined to call the first application program based on a preset configuration table, receiving a first voice service registration request sent by a middleware;
And determining voice service registration information corresponding to the first application program based on the first voice service registration request.
4. The method according to claim 1, wherein the controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode includes:
If the communication execution mode is direct call communication, receiving a call request of the target application program;
And returning a logic result determined based on the call request to the target application program so that the target application program executes the operation corresponding to the voice instruction.
5. The method of claim 4, the method further comprising:
Acquiring a second application program, and if the second application program is determined to call the voice control program based on a preset configuration table, receiving a second voice service registration request of the second application program;
and determining voice service registration information of the second application program based on the second voice service registration request.
6. The method according to claim 1, wherein the determining, based on the voice service registration information corresponding to the target application program and a preset configuration table, a communication execution manner between the voice control program and the target application program includes:
If the target application program has voice service registration information and the target application program is not in the preset configuration table, determining that the communication execution mode between the voice control program and the target application program is virtual click;
the controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode comprises the following steps:
if the communication execution mode is virtual clicking, determining a target key corresponding to the semantic identification result in the target application program;
And triggering the target key to enable the target application program to execute the operation corresponding to the voice instruction.
7. The method of claim 6, the method further comprising:
And acquiring a third application program, if the third application program is not in the preset configuration table, determining the association relation between the display text and the keys, and extracting the display text of the third application program for voice service registration.
8. The method of claim 7, wherein extracting the display text of the third application for voice service registration comprises:
extracting display characters of the third application program;
determining the display position of the display text;
And registering voice service based on the display text and the display position.
9. The method of any one of claims 1-8, further comprising:
And updating the preset configuration table based on the received configuration information.
10. The voice control device for the application program is characterized by comprising the following modules:
the recognition processing module is used for recognizing the acquired voice command based on the voice control program and determining a semantic recognition result corresponding to the voice command;
The program determining module is used for determining a target application program based on the semantic recognition result;
the mode determining module is used for determining the communication execution mode between the voice control program and the target application program based on the voice service registration information corresponding to the target application program and a preset configuration table;
the operation processing module is used for controlling the target application program to execute the operation corresponding to the voice instruction based on the communication execution mode;
The mode determining module comprises:
the relation determining unit is used for determining the calling relation between the voice control program and the target application program based on a preset configuration table if the target application program has voice service registration information;
The mode determining unit is used for determining a communication execution mode based on the calling relation;
The mode determining unit is used for determining that the communication execution mode is middleware communication if the calling relation is that the voice control program calls the target application program;
And the mode determining unit is used for determining that the communication execution mode is direct call communication if the call relation is that the target application program calls the voice control program.
11. A computer-readable storage medium storing a computer program for executing the speech control method of the application program according to any one of the preceding claims 1-9.
12. An electronic device, the electronic device comprising:
A processor;
A memory for storing the processor-executable instructions;
The processor being configured to read the executable instructions from the memory and execute the instructions to implement the method of speech control of an application program according to any one of the preceding claims 1-9.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110881207.4A CN115701633B (en) | 2021-08-02 | 2021-08-02 | Voice control method and device of application program, storage medium and electronic equipment |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110881207.4A CN115701633B (en) | 2021-08-02 | 2021-08-02 | Voice control method and device of application program, storage medium and electronic equipment |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN115701633A CN115701633A (en) | 2023-02-10 |
| CN115701633B true CN115701633B (en) | 2025-11-11 |
Family
ID=85142475
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202110881207.4A Active CN115701633B (en) | 2021-08-02 | 2021-08-02 | Voice control method and device of application program, storage medium and electronic equipment |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN115701633B (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119028333A (en) * | 2023-05-25 | 2024-11-26 | 荣耀终端有限公司 | A voice input method, device, electronic device and storage medium |
| CN119446146A (en) * | 2024-10-28 | 2025-02-14 | 芜湖雄狮汽车科技有限公司 | Semantic data processing method, device, middleware, storage medium and product |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106910499A (en) * | 2015-12-23 | 2017-06-30 | 北京奇虎科技有限公司 | The control method and device of application program |
Family Cites Families (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6931376B2 (en) * | 2000-07-20 | 2005-08-16 | Microsoft Corporation | Speech-related event notification system |
| US10224034B2 (en) * | 2016-02-03 | 2019-03-05 | Hua Xu | Voice recognition system and construction method thereof |
| CN107277588A (en) * | 2017-05-27 | 2017-10-20 | 深圳酷旗互联网有限公司 | A kind of voice key realization method of intelligent terminal |
| CN107562407B (en) * | 2017-09-15 | 2021-05-18 | 微鲸科技有限公司 | Device control method and electronic device |
| CN110100447B (en) * | 2017-11-30 | 2021-06-11 | 腾讯科技(深圳)有限公司 | Information processing method and device, multimedia device and storage medium |
| CN110875940B (en) * | 2018-08-31 | 2021-08-10 | 比亚迪股份有限公司 | Application program calling method, device and equipment based on virtual robot |
| CN109324840A (en) * | 2018-09-29 | 2019-02-12 | 五八有限公司 | A kind of operation processing method, device, equipment and storage medium |
| CN112825030B (en) * | 2020-02-28 | 2023-09-19 | 腾讯科技(深圳)有限公司 | Application program control method, device, equipment and storage medium |
| EP4170650B1 (en) * | 2020-06-29 | 2025-07-09 | Baidu Online Network Technology (Beijing) Co., Ltd. | Speech control method for mini-program, and devices and storage medium |
| CN112035872A (en) * | 2020-08-12 | 2020-12-04 | 博泰车联网(南京)有限公司 | Application management method, terminal and computer storage medium |
| CN112634896B (en) * | 2020-12-30 | 2023-04-11 | 智道网联科技(北京)有限公司 | Operation method of application program on intelligent terminal and intelligent terminal |
-
2021
- 2021-08-02 CN CN202110881207.4A patent/CN115701633B/en active Active
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106910499A (en) * | 2015-12-23 | 2017-06-30 | 北京奇虎科技有限公司 | The control method and device of application program |
Also Published As
| Publication number | Publication date |
|---|---|
| CN115701633A (en) | 2023-02-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10074365B2 (en) | Voice control method, mobile terminal device, and voice control system | |
| US20180183651A1 (en) | Content push method and server, and terminal | |
| CN106406940B (en) | System upgrading method, device and terminal | |
| CN111192578B (en) | Application control method and device, storage medium and electronic equipment | |
| CN104281478B (en) | The method and device of more new application | |
| CN103732452B (en) | Method for controlling functional devices in a vehicle during voice command operation | |
| WO2012109781A1 (en) | App icon processing method and communication terminal | |
| CN110688232A (en) | Application program calling method, terminal device and computer readable storage medium | |
| CN115701633B (en) | Voice control method and device of application program, storage medium and electronic equipment | |
| WO2023137909A1 (en) | Application compiling method and apparatus, application running method and apparatus, and storage medium | |
| CN110309006A (en) | A kind of funcall method, apparatus, terminal device and storage medium | |
| CN107748690A (en) | Using jump method, device and computer-readable storage medium | |
| CN108090345B (en) | Linux system external command execution method and device | |
| CN108091333A (en) | Sound control method and Related product | |
| CN114756306A (en) | Service calling method, device, equipment and storage medium | |
| CN109033302A (en) | A kind of method for page jump, device, terminal and storage medium | |
| CN111143010A (en) | Terminal device control method, terminal device, and storage medium | |
| CN112825030B (en) | Application program control method, device, equipment and storage medium | |
| CN107590137A (en) | Interpretation method, device and computer-readable recording medium | |
| CN106648875A (en) | Application switching method and device | |
| CN111243580A (en) | Voice control method, device and computer readable storage medium | |
| CN111610863B (en) | Information input method, device, terminal equipment and storage medium | |
| CN113590242A (en) | Method, device, terminal and storage medium for switching file language of application program | |
| CN109976790B (en) | Application updating method, device, terminal and storage medium | |
| CN111552537A (en) | Theme switching method and device, computer equipment and storage medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |