WO2019015613A1 - E-book voice playback method, apparatus, and terminal device - Google Patents
E-book voice playback method, apparatus, and terminal device
- Publication number
- WO2019015613A1 (application PCT/CN2018/096162)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- content
- book
- voice
- played
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
- G11B2020/10537—Audio or video recording
- G11B2020/10546—Audio or video recording specifically adapted for audio data
Definitions
- Embodiments of the present invention relate to the field of electronic book data processing technologies, and in particular, to an electronic book voice playing method, apparatus, and terminal device.
- An e-book is a publication that digitizes information such as text, pictures, sounds, and images using computer technology.
- Traditional paper reading has gradually been replaced by e-books.
- People increasingly use the Internet and computer technology to download e-books through e-book reading applications and read them there.
- The embodiments of the present invention provide an e-book voice playing method, apparatus, and terminal device, to address the difficulty of reading an e-book when the user's eyes are tired or the lighting is poor.
- An e-book voice playing method includes: determining the e-book content to be played by voice according to a voice play instruction that instructs voice playback of an e-book; and obtaining real vocal audio corresponding to the e-book content and playing the real vocal audio.
- An e-book voice playing apparatus includes: a content determining module, configured to determine the e-book content to be played by voice according to a voice play instruction that instructs voice playback of an e-book; and an audio playing module, configured to obtain real vocal audio corresponding to the e-book content and play the real vocal audio.
- A terminal device includes a processor, a memory, a communication interface, and a communication bus, where the processor, the memory, and the communication interface communicate with each other through the communication bus; the memory stores at least one executable instruction that causes the processor to perform the operations of the e-book voice playing method described above.
- With the e-book voice playing solution provided by the embodiments of the present invention, when the user's eyes are tired or the lighting is poor, the corresponding e-book content can be played by voice in response to a voice play instruction, giving the e-book reading application a "listen to the book" function. Moreover, the embodiments use real vocal audio. Because it is recorded from real human voices, real vocal audio is far superior to machine-synthesized audio in intonation and fluency, so the user gets a better "listening" experience.
- FIG. 1 is a flow chart of the steps of an e-book voice playing method according to a first embodiment of the present invention.
- FIG. 2 is a flow chart of the steps of an e-book voice playing method according to a second embodiment of the present invention.
- FIG. 3 is a block diagram of the structure of an e-book voice playing apparatus according to a third embodiment of the present invention.
- FIG. 4 is a block diagram of the structure of an e-book voice playing apparatus according to a fourth embodiment of the present invention.
- FIG. 5 is a schematic structural diagram of a terminal device according to a fifth embodiment of the present invention.
- Referring to FIG. 1, a flow chart of the steps of an e-book voice playing method according to a first embodiment of the present invention is shown.
- Step S102: Determine the e-book content to be played by voice according to a voice play instruction that instructs voice playback of the e-book.
- The voice play instruction may be generated in any suitable manner, including but not limited to: when the user operates a voice play button or option displayed in the e-book interface; when the user performs a set operation (such as double-clicking, clicking, or long-pressing) on a displayed e-book page; or after the user configures voice playback through a corresponding settings menu. The embodiments of the present invention do not limit this.
- The e-book content to be played by voice may be content set by the e-book reading application, such as the entire content of the currently displayed e-book, or it may be one or more paragraphs, lines, or sentences selected by the user, and so on.
- Step S104: Obtain real vocal audio corresponding to the e-book content to be played by voice, and play the real vocal audio.
- Once the e-book content to be played by voice has been determined, the corresponding real vocal audio can be obtained and then played.
- Real vocal audio is audio produced from a real person's voice, such as audio of a real person reading aloud, audio of real people in dialogue, or audio generated by processing real human voice (for example, audio produced by splitting and re-synthesizing sentences that were read by a real person).
- With this embodiment, when the user's eyes are tired or the lighting is poor, the user can have the corresponding e-book content played by voice through a voice play instruction, giving the e-book reading application a "listen to the book" function.
- Moreover, real vocal audio is used. Because it is recorded from real human voices, real vocal audio is far superior to machine-synthesized audio in intonation and fluency, so the user gets a better "listening" experience.
- The e-book voice playing method of this embodiment may be executed by any suitable device with data processing capability, including but not limited to various terminal devices (PCs, tablets, mobile terminals, and so on) and servers.
- Referring to FIG. 2, a flow chart of the steps of an e-book voice playing method according to a second embodiment of the present invention is shown.
- Step S202: Determine the e-book content to be played by voice according to a voice play instruction that instructs voice playback of the e-book and a selection operation on the displayed content of the e-book.
- When the device on which the e-book application runs receives the corresponding user operation, a corresponding voice play instruction is generated to indicate that the corresponding e-book content is to be played by voice.
- The e-book content to be played by voice may be content set by default by the e-book reading application, or content selected by the user.
- This embodiment describes the e-book voice playing solution using user selection as an example.
- When selecting the e-book content to be played by voice, the user can select one or more paragraphs, one or more lines, or one or more sentences of the e-book content. This increases the flexibility of what the user "listens to" and improves the "listening" experience.
- The approach described in the first embodiment, in which the e-book reading application sets the content to be played by default, also applies to this embodiment.
- The operation by which the user instructs voice playback and the operation by which the user selects e-book content may occur in any suitable order.
- For example, voice playback may be instructed first in an appropriate way and the e-book content selected afterwards; or the e-book content may be selected first and voice playback of the selected content instructed afterwards.
- This embodiment is described using the latter order as an example.
- Those skilled in the art can implement the e-book voice playing solution for the former order by referring to this embodiment.
- The selection operation on the displayed content of the e-book may be received first, and the e-book content to be played by voice is determined according to that selection operation.
- For example, a first operation by the user on the displayed content of the e-book may be received and a first action point of the first operation in the displayed content determined; a second operation by the user on the displayed content is then received and a second action point of the second operation in the displayed content determined; the displayed content between the first action point and the second action point is determined as the e-book content to be played by voice.
- The first operation and the second operation include, but are not limited to, click operations.
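- The two-action-point selection above can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes action points are modeled as character offsets into the displayed text, and the name `select_between` is hypothetical.

```python
def select_between(display_text: str, first_point: int, second_point: int) -> str:
    """Return the displayed content between two action points.

    Assumption: an action point is a character offset into the displayed text.
    """
    # The two clicks may arrive in either order, so sort them first.
    start, end = sorted((first_point, second_point))
    # Clamp both offsets to the bounds of the displayed text.
    start = max(0, start)
    end = min(len(display_text), end)
    return display_text[start:end]


page = "It was a dark night. The wind howled. Nobody slept."
# The second click lands before the first; the selection is the same either way.
selected = select_between(page, 37, 21)
print(selected)
```

- Sorting the two offsets means the user can click the end of the passage first and the start second without changing the result.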
- Alternatively, a third operation by the user on the displayed content of the e-book may be received and a third action point of the third operation in the displayed content determined. Using the third action point as a reference point, one of the following is determined as the e-book content to be played by voice: the displayed content within a first setting range that includes the third action point; the displayed content within a second setting range that starts at the third action point; or the displayed content within a third setting range that ends at the third action point.
- The first, second, and third setting ranges may be the same or different, and may be set by those skilled in the art according to actual needs.
- For example, the displayed content within the first setting range is determined as the e-book content to be played by voice; this is not limiting, and the third action point need not be an end point of the range.
- The third operation includes, but is not limited to, a click operation. Determining the content from a single operation simplifies user operations and reduces the processing burden on the system.
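- The three single-point variants above can be sketched as follows. This is a rough illustration under stated assumptions: the action point is a character offset, each setting range is a character count, and for the "containing" case the range is centered on the point (the source does not fix where the point sits inside the range). The name `select_around` and the mode strings are hypothetical.

```python
def select_around(display_text: str, point: int, size: int, mode: str) -> str:
    """Select content relative to a single action point.

    mode "containing": a range of `size` characters roughly centered on the point
    mode "starting":   a range of `size` characters beginning at the point
    mode "ending":     a range of `size` characters ending at the point
    """
    if mode == "containing":
        start = point - size // 2
        end = start + size
    elif mode == "starting":
        start, end = point, point + size
    elif mode == "ending":
        start, end = point - size, point
    else:
        raise ValueError(f"unknown mode: {mode}")
    # Clamp to the bounds of the displayed text.
    return display_text[max(0, start):min(len(display_text), end)]


text = "abcdefghij"
print(select_around(text, 5, 4, "containing"))
print(select_around(text, 5, 4, "starting"))
print(select_around(text, 5, 4, "ending"))
```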
- Alternatively, a selection operation by the user on the displayed content of the e-book may be received, the content tag corresponding to the displayed content selected by the operation determined, and the content marked by that content tag determined as the e-book content to be played by voice.
- Corresponding content tags are preset in the e-book content and can be set by those skilled in the art according to actual needs: for example, one content tag for each chapter or section, for each page, or for each paragraph; or, based on an analysis of the e-book content, one content tag for each complete episode (such as a dialogue between teacher and students in a classroom) or each complete scene (such as a sea scene), and so on.
- Given a selection operation (for example, a portion of the e-book content selected by the first and second operations; a click at any position in the currently displayed e-book content, as in the third operation; or, where content tags are shown to the user through suitable prompts in the e-book, an operation on the corresponding prompt), the e-book reading application first determines the corresponding content tag and then determines the entire portion of e-book content marked by that tag as the e-book content to be played by voice.
- The determination is not limited to the manners above.
- Other suitable manners of determining the e-book content to be played by voice also apply to the embodiments of the present invention, such as determining the entire content of the page currently displayed by the e-book as the e-book content to be played by voice.
- Step S204: Obtain real vocal audio corresponding to the e-book content to be played by voice, and play the real vocal audio.
- The real vocal audio includes at least one of the following: film and television dialogue audio obtained from a film or television drama corresponding to the e-book; reading audio corresponding to the content of the e-book; and user audio recorded by users of the e-book reading application in which the e-book is located.
- For example, the e-book "Three Kingdoms" has corresponding original dialogue audio from its film or television adaptation. In this case, the start position of the audio corresponding to the e-book content to be played by voice can be determined, and playback starts from that position.
- As another example, a user of the e-book reading application reads all or part of the e-book content aloud and records it as audio, or records voice commentary on the e-book content and saves it as audio. Where that audio is available for use (for example, the user has set it as shared, sent it to others, or published it in the e-book reading application through e-book comments, sharing, or another appropriate means), it can be used to implement "listening to the book".
- Before the e-book content to be played by voice is determined according to the voice play instruction, reading audio recorded by a user through the e-book reading application for content of the e-book may be received and stored in association with the corresponding e-book content; and/or comment audio recorded by a user through the e-book reading application for content of the e-book may be received and stored in association with the corresponding e-book content.
- Implementing the "listening" function with user-recorded audio stored in association in this way further enhances the user's experience of the e-book reading application.
- The real vocal audio described above may be further processed, for example split and re-synthesized, to meet real vocal audio playback needs in certain situations, such as video
- The real vocal audio may also be synthesized with background audio and/or service audio to generate synthesized audio. In this case, the synthesized audio corresponding to the e-book content to be played by voice is obtained, where the synthesized audio includes background audio and/or service audio in addition to the real vocal audio, and the synthesized audio is played.
- The background audio may be background music; it can reinforce the mood so that the user feels the atmosphere of that part of the e-book content.
- The service audio may be service audio recorded by the speaker of the current real vocal audio, or service audio related to the e-book content to be played by voice, such as story-related service audio.
- The service audio may be inserted at the beginning, at the end, or at any appropriate position in between in the current real vocal audio.
- The service audio may be implemented as advertising audio.
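- One way to picture the synthesis above is as sample-level mixing plus splicing. The sketch below is an assumption, not something the source specifies: audio is modeled as lists of mono float samples in [-1, 1], background audio is looped and added at reduced gain, and service audio is spliced in at a chosen sample index. The names `mix_background` and `insert_service` are hypothetical.

```python
def mix_background(voice, background, bg_gain=0.3):
    """Overlay looped background samples on the voice samples, clipping to [-1, 1]."""
    if not background:
        return list(voice)
    mixed = []
    for i, sample in enumerate(voice):
        # Loop the background so it covers the whole voice track.
        bg = background[i % len(background)]
        value = sample + bg_gain * bg
        mixed.append(max(-1.0, min(1.0, value)))
    return mixed


def insert_service(voice, service, position):
    """Splice service audio into the voice track at a sample index."""
    return voice[:position] + service + voice[position:]


voice = [0.5, -0.5, 0.9]
mixed = mix_background(voice, [1.0, -1.0], bg_gain=0.5)
with_ad_at_start = insert_service(voice, [0.0, 0.0], 0)
with_ad_at_end = insert_service(voice, [0.0, 0.0], len(voice))
print(mixed, with_ad_at_start, with_ad_at_end)
```

- Position 0 corresponds to inserting the service audio at the beginning, `len(voice)` to the end, and any index in between to a mid-track insertion.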
- Content tags may be preset for the e-book and audio tags preset for the real vocal audio. That is, at least one content tag for marking e-book content is preset in the e-book, and at least one audio tag for marking audio content is preset in the real vocal audio. On this basis, the real vocal audio corresponding to the e-book content can be obtained from the correspondence between content tags and audio tags.
- Specifically, the content tag corresponding to the e-book content to be played by voice is determined; the audio tag corresponding to that content tag is determined according to the pre-stored correspondence between content tags and audio tags; and the audio content corresponding to the determined audio tag is obtained.
- With content tags and audio tags, the real vocal audio corresponding to the e-book content can be obtained quickly and accurately, which improves the response speed of the "listening" function to user operations.
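- The two-step lookup above (content tag to audio tag, then audio tag to audio content) can be sketched with two dictionaries. All tag names, file names, and time offsets below are made-up illustrations, and the function name `audio_for_content` is hypothetical.

```python
from typing import Optional, Tuple

# Hypothetical pre-stored correspondence between content tags and audio tags.
CONTENT_TO_AUDIO_TAG = {
    "ch1-sec1": "a-001",
    "ch1-sec2": "a-002",
}

# Hypothetical index from audio tag to (audio file, start seconds, end seconds).
AUDIO_INDEX = {
    "a-001": ("episode_01.mp3", 0.0, 92.5),
    "a-002": ("episode_01.mp3", 92.5, 180.0),
}


def audio_for_content(content_tag: str) -> Optional[Tuple[str, float, float]]:
    """Resolve a content tag to the audio segment marked by its audio tag."""
    audio_tag = CONTENT_TO_AUDIO_TAG.get(content_tag)
    if audio_tag is None:
        return None  # no audio is associated with this content
    return AUDIO_INDEX.get(audio_tag)


print(audio_for_content("ch1-sec2"))
```

- Both steps are constant-time dictionary lookups, which is consistent with the quick response the text attributes to the tag-based approach.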
- Alternatively, in advance (for example, before the step of determining the e-book content to be played by voice according to the voice play instruction), speech recognition may be performed on existing or acquired real vocal audio to obtain the corresponding text content; the e-book content in the e-book that matches the text content is determined; and the correspondence between the real vocal audio corresponding to the text content and the determined e-book content is established and stored.
- For example, speech recognition is performed on a 30-minute piece of film and television audio to obtain multiple segments of text content. The segments are each matched against the e-book content, and the correspondence between the text segments and multiple parts of the e-book content is determined from the matching result. From that, the correspondence between the parts of the real vocal audio that the recognized text segments came from and the parts of the e-book content can be established and stored. When the real vocal audio corresponding to the e-book content to be played by voice is later needed, it can be obtained according to this correspondence.
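- The matching step above can be sketched as follows. The sketch assumes speech recognition has already produced timed text segments (no recognizer is called here) and uses fuzzy string similarity from Python's standard `difflib` as a stand-in for whatever matching the actual system uses. The function name `build_correspondence`, the threshold value, and the sample data are hypothetical.

```python
from difflib import SequenceMatcher


def build_correspondence(recognized, paragraphs, threshold=0.6):
    """Map e-book paragraph indexes to audio time ranges.

    recognized: list of (start_seconds, end_seconds, recognized_text)
    paragraphs: list of e-book paragraph strings
    """
    mapping = {}
    for start, end, text in recognized:
        best_index, best_score = None, 0.0
        for index, paragraph in enumerate(paragraphs):
            score = SequenceMatcher(None, text.lower(), paragraph.lower()).ratio()
            if score > best_score:
                best_index, best_score = index, score
        # Keep only confident matches; unmatched audio is left out.
        if best_index is not None and best_score >= threshold:
            mapping[best_index] = (start, end)
    return mapping


paragraphs = [
    "The oath was sworn in the peach garden.",
    "The army marched north at dawn.",
]
recognized = [
    (0.0, 4.2, "the oath was sworn in the peach garden"),
    (4.2, 8.0, "the army marched north at dawn"),
    (8.0, 9.0, "zzz qqq"),  # noise that matches nothing
]
correspondence = build_correspondence(recognized, paragraphs)
print(correspondence)
```

- The stored mapping can then be consulted at playback time: given the paragraph to be played, look up its time range and start the audio there.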
- When the real vocal audio includes multiple types, for example at least two of film and television dialogue audio, e-book content reading audio, and user audio, the real vocal audio corresponding to the e-book content may be obtained from those types according to a preset priority; or a selection operation by the user on the options corresponding to those types may be received and the real vocal audio selected by the operation obtained; or the user's audio type preference may be determined from historical data of the user's real vocal audio playback, and the real vocal audio corresponding to the e-book content to be played by voice obtained from those types according to that preference.
- For example, the user's historical data shows ten voice playback records, eight of which used film and television dialogue audio. When the user next starts voice playback, film and television dialogue audio can be used directly to play the corresponding e-book content.
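- The history-based preference above can be sketched as a frequency count. This is a minimal illustration that assumes "preference" means the most frequently played type among those currently available; the type labels and the name `preferred_type` are hypothetical.

```python
from collections import Counter


def preferred_type(history, available):
    """Return the most frequently played audio type that is currently available."""
    counts = Counter(kind for kind in history if kind in available)
    if not counts:
        return None  # no usable history; the caller can fall back to another rule
    return counts.most_common(1)[0][0]


# Ten playback records, eight of them film and television dialogue audio.
history = ["film_tv"] * 8 + ["reading"] * 2
choice = preferred_type(history, {"film_tv", "reading", "user"})
print(choice)
```

- Filtering by `available` matters because the preferred type may not exist for every part of the e-book.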
- As an example of priority, the three types are ranked from high to low as: user audio, film and television dialogue audio, and e-book content reading audio.
- If a part of the e-book text has all three types of audio, the user audio is played. If the part has only some of the types, for example film and television dialogue audio and e-book content reading audio, the film and television dialogue audio is played. If the part has only e-book content reading audio, that reading audio is played.
- This priority setting is only illustrative; those skilled in the art may set it as appropriate according to actual needs, and the embodiments of the present invention do not limit it. Setting a priority helps ensure, as far as possible, that e-book text has corresponding audio and that the audio takes diverse forms.
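- The priority rule in this example can be sketched as a first-match scan over an ordered list. The type labels, file names, and the name `pick_by_priority` are hypothetical; the order mirrors the example (user audio, then film and television dialogue audio, then reading audio).

```python
# Example priority from high to low, as in the text above.
PRIORITY = ["user", "film_tv", "reading"]


def pick_by_priority(available_audio, priority=PRIORITY):
    """Return (type, audio) for the highest-priority type present, or None.

    available_audio: dict mapping audio type to the audio for one part of the text
    """
    for kind in priority:
        if kind in available_audio:
            return kind, available_audio[kind]
    return None


all_three = {"reading": "r.mp3", "film_tv": "f.mp3", "user": "u.mp3"}
two_types = {"reading": "r.mp3", "film_tv": "f.mp3"}
only_reading = {"reading": "r.mp3"}
print(pick_by_priority(all_three), pick_by_priority(two_types), pick_by_priority(only_reading))
```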
- Letting the user choose gives greater flexibility in selecting the real vocal audio corresponding to the e-book content: the user selects the audio and it is played.
- The options corresponding to film and television dialogue audio, e-book content reading audio, and user audio may be set by those skilled in the art according to actual needs.
- For example, the options for film and television dialogue audio, e-book content reading audio, and user audio may be displayed through a pop-up window or a transparent overlay.
- After receiving a voice play instruction for a part of the e-book content, the e-book application presents the corresponding audio options to the user through a pop-up window or a transparent overlay. After receiving the user's selection, it plays the real vocal audio corresponding to the selection; for example, if the user selects film and television dialogue audio, the film and television dialogue audio corresponding to that part of the e-book content is played. Displaying the audio options through a pop-up window or transparent overlay on top of the e-book content interface makes operation convenient and improves the user experience.
- Step S206 or step S208 may further be performed.
- Step S206: While the real vocal audio is playing, if a page-turning operation on the e-book is received, suspend playback of the real vocal audio; re-determine the e-book content to be played by voice according to the page-turning operation; and obtain and play the real vocal audio corresponding to the re-determined e-book content.
- That is, the audio has not finished playing and the user performs an operation such as turning pages forward or back. When the e-book reading application detects such an operation during audio playback, it automatically suspends playback; it then re-determines the e-book content to be played by voice according to the page-turning operation, for example by determining the final target page of the operation and re-determining the content to be played from the content of that page.
- For example, the real vocal audio is currently playing the first sentence of the third paragraph on page 5 of the e-book when the user turns pages continuously and finally stops at page 10. In this case, the previous audio can be stopped and the real vocal audio for the e-book content on page 10 played (such as the audio corresponding to the first content tag on page 10, the audio corresponding to the starting text of page 10, or the audio corresponding to the episode or scene on page 10); alternatively, the previous audio can be stopped, the user's selection of e-book content on page 10 received, and the real vocal audio corresponding to the selected content played.
- Turning pages in the other direction is handled similarly and is not described again here.
- For page 5, the previous audio may be stopped and the real vocal audio corresponding to the e-book content of page 5 re-determined, for example the audio corresponding to the first content tag on page 5, the audio corresponding to the starting text of page 5, or the audio corresponding to the episode or scene on page 5.
- Compared with the other approaches, continuing playback of the real vocal audio from where it was interrupted is closer to what users actually need when "listening to the book" and improves the "listening" experience.
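- The page-turn handling in step S206 can be sketched as a small state holder. This is an illustrative model only: the class name `BookPlayer`, the per-page audio dictionary, and the event log are hypothetical, and "playing" is simulated by recording events rather than driving a real audio backend.

```python
class BookPlayer:
    """Toy model of suspending and re-targeting playback on a page turn."""

    def __init__(self, audio_by_page):
        self.audio_by_page = audio_by_page  # hypothetical page -> audio clip map
        self.state = "stopped"
        self.now_playing = None
        self.log = []

    def play_page(self, page):
        clip = self.audio_by_page.get(page)
        self.now_playing = clip
        self.state = "playing" if clip is not None else "stopped"
        if clip is not None:
            self.log.append(("play", clip))

    def on_page_turn(self, target_page):
        # Suspend the current audio first, then re-determine content by target page.
        if self.state == "playing":
            self.state = "paused"
            self.log.append(("pause", self.now_playing))
        self.play_page(target_page)


player = BookPlayer({5: "page5.mp3", 10: "page10.mp3"})
player.play_page(5)
player.on_page_turn(10)  # the user stops on page 10 after continuous page turns
print(player.state, player.now_playing)
```

- Only the final target page is passed in, matching the text's point that the content is re-determined from where the page-turning operation ends.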
- Step S208: While the real vocal audio is playing, receive an audio processing instruction for the real vocal audio being played, and perform the operation indicated by the audio processing instruction on the real vocal audio.
- The audio processing instruction includes, but is not limited to, at least one of: a pause instruction for pausing playback of the real vocal audio; a first adjustment instruction for adjusting the playback speed of the real vocal audio; a second adjustment instruction for adjusting the playback progress of the real vocal audio; an exit instruction for exiting real vocal audio playback; and a switching instruction for switching the type of real vocal audio.
- The user may send a pause instruction to the e-book reading application by operating a "pause" option or similar to pause playback of the current audio; or, when it is detected that the user has left the e-book reading application to use another application, the e-book reading application can automatically generate a pause instruction to suspend playback of the current audio.
- The user may send an exit instruction to the e-book reading application by operating a "stop" option or similar to stop playback of the current real vocal audio.
- The user may send a switching instruction to the e-book reading application by selecting one of the other displayed audio types, or by operating a "switch voice" option or similar.
- For example, if the current real vocal audio is user audio, the user selects one of the displayed audio types through the "switch voice" option, switching from user audio to film and television dialogue audio or e-book content reading audio.
- A first adjustment instruction for adjusting the playback speed of the real vocal audio may be sent to the e-book reading application through a corresponding playback speed option to adjust the playback speed of the current audio.
- For example, the playback speed of the current real vocal audio is adjusted to 2 times the original speed.
- A second adjustment instruction for adjusting the playback progress of the real vocal audio may be sent to the e-book reading application through a corresponding playback progress option.
- For example, the user can adjust the playback progress of the current real vocal audio by clicking a "fast forward" option or similar, or by dragging the audio progress bar.
- The options for the audio processing instructions above may be implemented in any suitable way by those skilled in the art.
- For example, they may be displayed through a floating icon, a floating window, or a transparent overlay.
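- The instruction handling in step S208 can be sketched as a dispatcher over a small playback state. The instruction names, the `Playback` fields, and the function `apply_instruction` are hypothetical labels for the five instruction kinds listed above; no real audio backend is involved.

```python
from dataclasses import dataclass


@dataclass
class Playback:
    playing: bool = True
    active: bool = True        # False once playback has been exited
    speed: float = 1.0
    position_s: float = 0.0
    audio_type: str = "user"


def apply_instruction(pb: Playback, kind: str, value=None) -> Playback:
    """Apply one audio processing instruction to the playback state."""
    if kind == "pause":
        pb.playing = False
    elif kind == "set_speed":      # first adjustment instruction
        pb.speed = float(value)
    elif kind == "seek":           # second adjustment instruction
        pb.position_s = max(0.0, float(value))
    elif kind == "exit":
        pb.playing = False
        pb.active = False
    elif kind == "switch_type":
        pb.audio_type = value
    else:
        raise ValueError(f"unknown instruction: {kind}")
    return pb


pb = Playback()
apply_instruction(pb, "set_speed", 2)         # play at 2 times the original speed
apply_instruction(pb, "seek", 95.0)           # drag the progress bar
apply_instruction(pb, "switch_type", "film_tv")
apply_instruction(pb, "pause")
print(pb)
```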
- With this embodiment, when the user's eyes are tired or the lighting is poor, the user can have the corresponding e-book content played by voice through a voice play instruction, giving the e-book reading application a "listen to the book" function.
- Moreover, real vocal audio is used. Because it is recorded from real human voices, real vocal audio is far superior to machine-synthesized audio in intonation and fluency, so the user gets a better "listening" experience.
- The e-book voice playing method of this embodiment may be executed by any suitable device with data processing capability, including but not limited to various terminal devices (PCs, tablets, mobile terminals, and so on) and servers.
- Referring to FIG. 3, a block diagram of the structure of an e-book voice playing apparatus according to a third embodiment of the present invention is shown.
- The e-book voice playing apparatus of this embodiment includes: a content determining module 302, configured to determine the e-book content to be played by voice according to a voice play instruction that instructs voice playback of the e-book; and an audio playing module 304, configured to obtain real vocal audio corresponding to the e-book content and play the real vocal audio.
- With this embodiment, when the user's eyes are tired or the lighting is poor, the user can have the corresponding e-book content played by voice through a voice play instruction, giving the e-book reading application a "listen to the book" function.
- Moreover, real vocal audio is used. Because it is recorded from real human voices, real vocal audio is far superior to machine-synthesized audio in intonation and fluency, so the user gets a better "listening" experience.
- Referring to FIG. 4, a block diagram of the structure of an e-book voice playing apparatus according to a fourth embodiment of the present invention is shown.
- The e-book voice playing apparatus of this embodiment includes: a content determining module 402, configured to determine the e-book content to be played by voice according to a voice play instruction that instructs voice playback of the e-book; and an audio playing module 404, configured to obtain real vocal audio corresponding to the e-book content and play the real vocal audio.
- The real vocal audio includes at least one of: film and television dialogue audio obtained from a film or television drama corresponding to the e-book; reading audio corresponding to the content of the e-book; and user audio recorded by users of the e-book reading application in which the e-book is located.
- The audio playing module 404 is configured to obtain synthesized audio corresponding to the e-book content to be played by voice, where the synthesized audio includes background audio and/or service audio in addition to the real vocal audio, and to play the synthesized audio.
- At least one content tag for marking e-book content is preset in the e-book, and at least one audio tag for marking audio content is preset in the real vocal audio; the audio playing module 404 is configured to obtain real vocal audio corresponding to the e-book content according to the correspondence between the content tags and the audio tags, and play the real vocal audio.
- The audio playing module 404 is configured to determine the content tag corresponding to the e-book content to be played by voice; determine the audio tag corresponding to that content tag according to the pre-stored correspondence between content tags and audio tags; and obtain the audio content corresponding to the determined audio tag and play it.
- the e-book voice playing device of the embodiment further includes: a relationship establishing module 406, configured to determine, according to the voice playing instruction for instructing the e-book to perform voice playing, the content of the e-book to be played by the voice Previously, speech recognition is performed on real vocal audio to obtain corresponding text content; e-book content matching the text content in the e-book is determined; real vocal audio corresponding to the text content and determined electronic are established and stored Corresponding relationship between the contents of the book; the audio playing module 404 is configured to obtain, according to the correspondence between the real vocal audio corresponding to the text content and the determined content of the electronic book, corresponding to the content of the electronic book to be played by the voice Real vocal audio and play the real vocal audio.
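A rough Python sketch of the matching step follows. It assumes speech recognition has already produced a transcript for each audio clip (the recognizer itself is out of scope here), and it uses fuzzy string similarity from the standard library as a stand-in for whatever matching the actual system performs. The function names and the 0.6 threshold are illustrative assumptions.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Optional


def best_matching_paragraph(transcript: str, paragraphs: List[str],
                            threshold: float = 0.6) -> Optional[int]:
    """Return the index of the paragraph most similar to the transcript,
    or None when no paragraph reaches the (assumed) similarity threshold."""
    best_index, best_score = None, threshold
    for index, paragraph in enumerate(paragraphs):
        score = SequenceMatcher(None, transcript.lower(), paragraph.lower()).ratio()
        if score >= best_score:
            best_index, best_score = index, score
    return best_index


def build_correspondence(transcripts: Dict[str, str],
                         paragraphs: List[str]) -> Dict[int, str]:
    """Map paragraph index -> audio clip id for every clip that matches."""
    correspondence: Dict[int, str] = {}
    for clip_id, transcript in transcripts.items():
        index = best_matching_paragraph(transcript, paragraphs)
        if index is not None:
            correspondence[index] = clip_id
    return correspondence
```

The stored mapping can then be consulted at playback time, so recognition and matching only need to run once per audio clip.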
- When the real human-voice audio corresponding to the e-book content corresponds to at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio, the audio playing module 404 is configured to obtain the real human-voice audio corresponding to the e-book content according to a preset priority, and to play the real human-voice audio; or, the audio playing module 404 is configured to receive the user's selection operation on options corresponding to at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio, obtain the real human-voice audio selected by the selection operation and corresponding to the e-book content, and play the real human-voice audio; or, the audio playing module 404 is configured to determine the user's audio type preference according to historical data of the user playing real human-voice audio and, according to that preference, obtain the real human-voice audio corresponding to the e-book content to be played by voice from at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio.
- The e-book voice playing apparatus of this embodiment further includes a display module 408, configured to, before the audio playing module 404 receives the user's selection operation on the options corresponding to at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio, display those options through a pop-up window or a transparent overlay.
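The three ways of choosing among several available audio types (preset priority, explicit user selection, preference inferred from playback history) are described as alternatives. The Python sketch below combines them into one fallback chain purely for illustration; that ordering, the type names, and the function `choose_audio_type` are assumptions, not part of the description.

```python
from collections import Counter
from typing import List, Optional, Sequence

# Assumed names for the three kinds of real human-voice audio.
DEFAULT_PRIORITY = ("film_tv_dialogue", "reading", "user")


def choose_audio_type(available: Sequence[str],
                      user_choice: Optional[str] = None,
                      history: Optional[List[str]] = None,
                      priority: Sequence[str] = DEFAULT_PRIORITY) -> Optional[str]:
    """Pick one audio type among those available for the content.

    Illustrative order: an explicit user choice wins, then the type played
    most often in the history, then the preset priority.
    """
    if user_choice in available:
        return user_choice
    if history:
        counts = Counter(t for t in history if t in available)
        if counts:
            return counts.most_common(1)[0][0]
    for audio_type in priority:
        if audio_type in available:
            return audio_type
    return None
```

In a real application the options would be shown to the user first (for example in a pop-up window) and `user_choice` would carry the result of that selection.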
- The content determining module 402 is configured to determine the e-book content to be played by voice according to the voice play instruction instructing voice playback of the e-book and a selection operation on the displayed content of the e-book.
- The e-book voice playing apparatus of this embodiment further includes a content selection module 410, configured to, before the content determining module 402 determines the e-book content to be played by voice according to the voice play instruction and the selection operation on the displayed content of the e-book, receive a selection operation on the displayed content of the e-book and determine the e-book content to be played by voice according to the selection operation.
- The content selection module 410 includes a first selection module 4102, configured to: receive a first operation by the user on the displayed content of the e-book and determine a first action point of the first operation in the displayed content; receive a second operation by the user on the displayed content and determine a second action point of the second operation in the displayed content; and determine the displayed content between the first action point and the second action point as the e-book content to be played by voice.
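As a small illustration of selecting content between two action points, the Python sketch below treats each action point as a character offset into the displayed text. Representing action points as offsets, and the function name `content_between`, are assumptions made for the example.

```python
def content_between(text: str, first_point: int, second_point: int) -> str:
    """Return the displayed text between two action points.

    The points are character offsets and may arrive in either order;
    offsets outside the text are clamped to its bounds.
    """
    start, end = sorted((first_point, second_point))
    start = max(0, start)
    end = min(len(text), end)
    return text[start:end]
```

Sorting the two points means the user can tap the end of the passage first and the start second without changing the result.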
- The content selection module 410 includes a second selection module 4104, configured to: receive a third operation by the user on the displayed content of the e-book and determine a third action point of the third operation in the displayed content; and, using the third action point as a reference point, determine the displayed content within a first set range that includes the third action point as the e-book content to be played by voice; or determine the displayed content within a second set range starting from the third action point as the e-book content to be played by voice; or determine the displayed content within a third set range ending at the third action point as the e-book content to be played by voice.
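The three range variants around a single action point can be sketched in Python as follows. The action point is again assumed to be a character offset, the "set range" is assumed to be a fixed number of characters, and the mode names are hypothetical labels for the first, second, and third set ranges.

```python
def content_from_point(text: str, point: int, size: int, mode: str = "containing") -> str:
    """Select `size` characters of displayed text relative to one action point.

    mode="containing": a window that includes the point (roughly centered on it)
    mode="starting":   a window that starts at the point
    mode="ending":     a window that ends at the point
    """
    point = max(0, min(len(text), point))
    if mode == "containing":
        start = max(0, point - size // 2)
        end = min(len(text), start + size)
    elif mode == "starting":
        start, end = point, min(len(text), point + size)
    elif mode == "ending":
        start, end = max(0, point - size), point
    else:
        raise ValueError(f"unknown mode: {mode}")
    return text[start:end]
```

A production reader would more likely snap the range to sentence or paragraph boundaries; fixed character counts keep the sketch short.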
- The content selection module 410 includes a third selection module 4106, configured to receive the user's selection operation on the displayed content of the e-book, determine the content tag corresponding to the displayed content selected by the selection operation, and determine the content marked by that content tag as the e-book content to be played by voice.
- The e-book voice playing apparatus of this embodiment further includes a recording storage module 412, configured to, before the content determining module 402 determines the e-book content to be played by voice according to the voice play instruction instructing voice playback of the e-book: receive reading audio recorded by the user through the e-book reading application for the content of the e-book, and store the recorded reading audio in association with the corresponding e-book content; and/or receive comment audio recorded by the user through the e-book reading application for the content of the e-book, and store the comment audio in association with the corresponding e-book content.
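A minimal in-memory Python sketch of storing user recordings in association with e-book content is given below. The class `UserAudioStore`, the content identifiers, and the two kind labels are hypothetical; a real application would persist the clips rather than hold them in a dictionary.

```python
from typing import Dict, List

VALID_KINDS = ("reading", "comment")


class UserAudioStore:
    """Associates user-recorded clips with e-book content identifiers."""

    def __init__(self) -> None:
        # content id -> kind ("reading" or "comment") -> list of clips
        self._clips: Dict[str, Dict[str, List[bytes]]] = {}

    def add(self, content_id: str, kind: str, clip: bytes) -> None:
        if kind not in VALID_KINDS:
            raise ValueError(f"unknown audio kind: {kind}")
        self._clips.setdefault(content_id, {}).setdefault(kind, []).append(clip)

    def get(self, content_id: str, kind: str) -> List[bytes]:
        # Return a copy so callers cannot mutate the stored list.
        return list(self._clips.get(content_id, {}).get(kind, []))
```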
- The e-book voice playing apparatus of this embodiment further includes an audio processing module 414, configured to receive an audio processing instruction for the real human-voice audio being played, and to perform on the real human-voice audio the operation indicated by the audio processing instruction.
- The audio processing instruction includes at least one of: a pause instruction for instructing that playback of the real human-voice audio be paused; a first adjustment instruction for instructing that the playback speed of the real human-voice audio be adjusted; a second adjustment instruction for instructing that the playback progress of the real human-voice audio be adjusted; an exit instruction for instructing that playback of the real human-voice audio be exited; and a switching instruction for instructing that the type of the real human-voice audio be switched.
- the display module 408 is further configured to display the audio processing instruction by using a floating icon or a floating window or a transparent overlay.
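The five audio processing instructions can be illustrated as a small dispatcher over a playback state. This Python sketch is only a model: the `PlaybackState` fields, the instruction names, and `apply_instruction` are hypothetical, and no real audio is played.

```python
from dataclasses import dataclass


@dataclass
class PlaybackState:
    playing: bool = True
    speed: float = 1.0
    position_s: float = 0.0
    audio_type: str = "reading"
    exited: bool = False


def apply_instruction(state: PlaybackState, name: str, value=None) -> PlaybackState:
    """Apply one audio processing instruction to the playback state."""
    if name == "pause":
        state.playing = False
    elif name == "set_speed":        # first adjustment instruction
        state.speed = float(value)
    elif name == "seek":             # second adjustment instruction
        state.position_s = max(0.0, float(value))
    elif name == "exit":
        state.playing = False
        state.exited = True
    elif name == "switch_type":      # e.g. from reading audio to user audio
        state.audio_type = str(value)
    else:
        raise ValueError(f"unknown audio processing instruction: {name}")
    return state
```

Each control shown in a floating icon, floating window, or transparent overlay would map to one of these instruction names.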
- The e-book voice playing apparatus of this embodiment further includes a re-determination module 416, configured to: receive a page turning operation on the e-book while the real human-voice audio is being played, and pause playback of the real human-voice audio; re-determine the e-book content to be played by voice according to the page turning operation; and obtain and play the real human-voice audio corresponding to the re-determined e-book content.
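The page-turn handling (pause, re-determine the content, obtain the matching audio, play) can be sketched in Python as below. The class `VoicePlayback`, the page list, and the `find_audio` callback are hypothetical, and playback is modelled as an event log instead of real audio output.

```python
from typing import Callable, List, Optional, Tuple


class VoicePlayback:
    """Models voice playback that follows the page currently displayed."""

    def __init__(self, pages: List[str], find_audio: Callable[[str], str]) -> None:
        self.pages = pages
        self.find_audio = find_audio      # maps page content to an audio id
        self.page_index = 0
        self.playing = False
        self.current_audio: Optional[str] = None
        self.events: List[Tuple[str, Optional[str]]] = []

    def play_current_page(self) -> None:
        content = self.pages[self.page_index]
        self.current_audio = self.find_audio(content)
        self.playing = True
        self.events.append(("play", self.current_audio))

    def on_page_turn(self, delta: int) -> None:
        # 1. Pause whatever is playing.
        if self.playing:
            self.playing = False
            self.events.append(("pause", self.current_audio))
        # 2. Re-determine the content from the new page (clamped to the book).
        self.page_index = max(0, min(len(self.pages) - 1, self.page_index + delta))
        # 3. Obtain and play the audio for the re-determined content.
        self.play_current_page()
```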
- the e-book voice playback device of the present embodiment is used to implement the corresponding e-book voice playback method in the foregoing multiple method embodiments, and has the beneficial effects of the corresponding method embodiments, and details are not described herein again.
- FIG. 5 is a schematic structural diagram of a terminal device according to Embodiment 5 of the present invention.
- The specific embodiments of the present invention do not limit the specific implementation of the terminal device.
- the terminal device may include a processor 502, a communications interface 504, a memory 506, and a communication bus 508.
- Processor 502, communication interface 504, and memory 506 complete communication with one another via communication bus 508.
- the communication interface 504 is configured to communicate with network elements of other devices, such as other terminal devices or servers.
- The processor 502 is configured to execute the program 510, and may specifically perform the related steps in the foregoing embodiments of the e-book voice playing method.
- program 510 can include program code, the program code including computer operating instructions.
- The processor 502 may be a central processing unit (CPU), an Application Specific Integrated Circuit (ASIC), or one or more integrated circuits configured to implement embodiments of the present invention.
- the one or more processors included in the terminal device may be the same type of processor, such as one or more CPUs; or may be different types of processors, such as one or more CPUs and one or more ASICs.
- the memory 506 is configured to store the program 510.
- Memory 506 may include high speed RAM memory and may also include non-volatile memory, such as at least one disk memory.
- The program 510 may be specifically configured to cause the processor 502 to: determine the e-book content to be played by voice according to a voice play instruction instructing voice playback of the e-book; obtain the real human-voice audio corresponding to the e-book content; and play the real human-voice audio.
- The real human-voice audio includes at least one of: film and television dialogue audio obtained from a film or television drama corresponding to the e-book; reading audio corresponding to the e-book content of the e-book; and user audio recorded by a user of the e-book reading application.
- The program 510 is further configured to cause the processor 502 to obtain the real human-voice audio corresponding to the e-book content to be played by voice, and to play the real human-voice audio.
- At least one content tag for marking the content of the e-book is preset in the e-book, and at least one audio tag for marking audio content is preset in the real human-voice audio. The program 510 is further configured to cause the processor 502, when obtaining the real human-voice audio corresponding to the e-book content, to obtain it according to the correspondence between the content tags and the audio tags.
- The program 510 is further configured to cause the processor 502, when obtaining the real human-voice audio corresponding to the e-book content according to the correspondence between the content tags and the audio tags, to: determine the content tag corresponding to the e-book content to be played by voice; determine the audio tag corresponding to that content tag according to the pre-stored correspondence between content tags and audio tags; and acquire the audio content corresponding to the determined audio tag.
- The program 510 is further configured to cause the processor 502, before determining the e-book content to be played by voice according to the voice play instruction instructing voice playback of the e-book, to: perform speech recognition on the real human-voice audio to obtain the corresponding text content; determine the e-book content in the e-book that matches the text content; and establish and store a correspondence between the real human-voice audio corresponding to the text content and the determined e-book content. The program 510 is further configured to cause the processor 502, when obtaining the real human-voice audio corresponding to the e-book content to be played, to obtain it according to that correspondence.
- The program 510 is further configured to cause the processor 502, when the real human-voice audio corresponding to the e-book content corresponds to at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio, to: obtain the real human-voice audio corresponding to the e-book content according to a preset priority; or receive the user's selection operation on options corresponding to at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio, and obtain the real human-voice audio selected by the selection operation and corresponding to the e-book content; or determine the user's audio type preference according to historical data of the user playing real human-voice audio and, according to that preference, obtain the real human-voice audio corresponding to the e-book content to be played from at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio.
- The program 510 is further configured to cause the processor 502, before receiving the user's selection operation on the options corresponding to at least two of the film and television dialogue audio, the e-book content reading audio, and the user audio, to display those options through a pop-up window or a transparent overlay.
- The program 510 is further configured to cause the processor 502, when determining the e-book content to be played by voice according to the voice play instruction, to determine that content according to both the voice play instruction instructing voice playback of the e-book and a selection operation on the displayed content of the e-book.
- The program 510 is further configured to cause the processor 502, before determining the e-book content to be played by voice according to the voice play instruction and the selection operation on the displayed content of the e-book, to receive a selection operation on the displayed content of the e-book and determine the e-book content to be played by voice according to the selection operation.
- The program 510 is further configured to cause the processor 502, when receiving the selection operation on the displayed content of the e-book and determining the e-book content to be played by voice according to the selection operation, to: receive a first operation by the user on the displayed content of the e-book and determine a first action point of the first operation in the displayed content; receive a second operation by the user on the displayed content and determine a second action point of the second operation in the displayed content; and determine the displayed content between the first action point and the second action point as the e-book content to be played by voice.
- The program 510 is further configured to cause the processor 502, when receiving the selection operation on the displayed content of the e-book and determining the e-book content to be played by voice according to the selection operation, to: receive a third operation by the user on the displayed content of the e-book and determine a third action point of the third operation in the displayed content; and, using the third action point as a reference point, determine the displayed content within a first set range that includes the third action point as the e-book content to be played by voice; or determine the displayed content within a second set range starting from the third action point as the e-book content to be played by voice; or determine the displayed content within a third set range ending at the third action point as the e-book content to be played by voice.
- The program 510 is further configured to cause the processor 502, when receiving the selection operation on the displayed content of the e-book and determining the e-book content to be played by voice according to the selection operation, to: receive the user's selection operation on the displayed content of the e-book and determine the content tag corresponding to the displayed content selected by the selection operation; and determine the content marked by that content tag as the e-book content to be played by voice.
- The program 510 is further configured to cause the processor 502, before determining the e-book content to be played by voice according to the voice play instruction for the e-book, to: receive reading audio recorded by the user through the e-book reading application for the content of the e-book, and store the recorded reading audio in association with the corresponding e-book content; and/or receive comment audio recorded by the user through the e-book reading application for the content of the e-book, and store the comment audio in association with the corresponding e-book content.
- The program 510 is further configured to cause the processor 502 to receive an audio processing instruction for the real human-voice audio being played, and to perform on the real human-voice audio the operation indicated by the audio processing instruction.
- The audio processing instruction includes at least one of: a pause instruction for instructing that playback of the real human-voice audio be paused; a first adjustment instruction for instructing that the playback speed of the real human-voice audio be adjusted; a second adjustment instruction for instructing that the playback progress of the real human-voice audio be adjusted; an exit instruction for instructing that playback of the real human-voice audio be exited; and a switching instruction for instructing that the type of the real human-voice audio be switched.
- the program 510 is further configured to cause the processor 502 to display the audio processing instructions via a floating icon or a floating window or a transparent overlay.
- The program 510 is further configured to cause the processor 502 to: receive a page turning operation on the e-book while the real human-voice audio is being played, and pause playback of the real human-voice audio; re-determine the e-book content to be played by voice according to the page turning operation; and obtain and play the real human-voice audio corresponding to the re-determined e-book content.
- Voice playback of the corresponding e-book content can be triggered by the voice play instruction, which realizes the “book listening” function of the e-book reading application.
- Moreover, real human-voice audio is used. Because it is recorded by real human voices, it is far superior to machine-synthesized audio in intonation and fluency, so the user gets a better "book listening" experience.
- The above method according to an embodiment of the present invention may be implemented in hardware or firmware, or implemented as software or computer code that can be stored in a recording medium such as a CD-ROM, a RAM, a floppy disk, a hard disk, or a magneto-optical disk, or implemented as computer code that is downloaded over a network, was originally stored in a remote recording medium or a non-transitory machine-readable medium, and is to be stored in a local recording medium, so that the methods described herein can be processed by such software stored on a recording medium using a general-purpose computer, a dedicated processor, or programmable or dedicated hardware such as an ASIC or an FPGA.
- It can be understood that a computer, processor, microprocessor controller, or programmable hardware includes storage components (e.g., RAM, ROM, flash memory, etc.) that can store or receive software or computer code; when the software or computer code is accessed and executed by the computer, processor, or hardware, the e-book voice playing method described herein is implemented. Moreover, when a general-purpose computer accesses code for implementing the e-book voice playing method shown herein, execution of the code converts the general-purpose computer into a special-purpose computer for executing the e-book voice playing method shown herein.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
The invention relates to an e-book voice playing method, apparatus, and terminal device. According to a voice play instruction instructing an e-book to perform voice playback, the e-book content to be played is determined (S102); real human-voice audio corresponding to the e-book content to be played is obtained, and the real human-voice audio is played (S104). The user thus gets a better "book listening" experience.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710601433.6A CN107369462B (zh) | 2017-07-21 | 2017-07-21 | 电子书语音播放方法、装置及终端设备 |
| CN201710601433.6 | 2017-07-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2019015613A1 (fr) | 2019-01-24 |
Family
ID=60307242
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2018/096162 Ceased WO2019015613A1 (fr) | 2017-07-21 | 2018-07-18 | Procédé, appareil et dispositif terminal de lecture vocale de livre électronique |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN107369462B (fr) |
| WO (1) | WO2019015613A1 (fr) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12316000B2 (en) | 2021-11-11 | 2025-05-27 | Harada Industry Co., Ltd. | Low-profile antenna device |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107369462B (zh) * | 2017-07-21 | 2020-06-26 | 阿里巴巴(中国)有限公司 | 电子书语音播放方法、装置及终端设备 |
| CN107992250A (zh) * | 2017-12-20 | 2018-05-04 | 维沃移动通信有限公司 | 一种电子书文件内容的显示方法、移动终端 |
| CN108509605A (zh) * | 2018-04-03 | 2018-09-07 | 优视科技有限公司 | 一种新闻信息的语音播放方法、装置和终端设备 |
| CN108874266A (zh) * | 2018-06-27 | 2018-11-23 | 北京微播视界科技有限公司 | 文本播放方法、客户端、终端和存储介质 |
| CN110797001B (zh) * | 2018-07-17 | 2022-04-12 | 阿里巴巴(中国)有限公司 | 电子书语音音频的生成方法、装置及可读存储介质 |
| TWI717627B (zh) * | 2018-08-09 | 2021-02-01 | 台灣大哥大股份有限公司 | 電子書語音朗讀裝置及其方法 |
| CN109189983A (zh) * | 2018-09-18 | 2019-01-11 | 王全志 | 用于学习的语音播放方法及装置 |
| CN110032355B (zh) * | 2018-12-24 | 2022-05-17 | 阿里巴巴集团控股有限公司 | 语音播放方法、装置、终端设备及计算机存储介质 |
| CN109828711A (zh) * | 2019-01-25 | 2019-05-31 | 努比亚技术有限公司 | 一种移动终端的阅读管理方法、移动终端及存储介质 |
| CN111833903B (zh) * | 2019-04-22 | 2024-06-18 | 珠海金山办公软件有限公司 | 一种执行操作任务的方法及装置 |
| CN111324330B (zh) * | 2020-02-07 | 2021-04-30 | 掌阅科技股份有限公司 | 电子书的播放处理方法、计算设备及计算机存储介质 |
| CN111459446B (zh) * | 2020-03-27 | 2021-08-17 | 掌阅科技股份有限公司 | 电子书的资源处理方法、计算设备及计算机存储介质 |
| CN113779204B (zh) * | 2020-06-09 | 2024-06-11 | 浙江未来精灵人工智能科技有限公司 | 数据处理方法、装置、电子设备及计算机存储介质 |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1412687A (zh) * | 2001-10-17 | 2003-04-23 | 英业达集团(南京)电子技术有限公司 | 可播放背景音乐及朗读电子书的装置及方法 |
| CN1653517A (zh) * | 2002-05-09 | 2005-08-10 | 汤姆森特许公司 | 用于手持设备的文本语音转换 |
| US20110119590A1 (en) * | 2009-11-18 | 2011-05-19 | Nambirajan Seshadri | System and method for providing a speech controlled personal electronic book system |
| CN102576251A (zh) * | 2009-09-02 | 2012-07-11 | 亚马逊技术股份有限公司 | 触摸屏用户界面 |
| CN102723004A (zh) * | 2011-03-29 | 2012-10-10 | 汉王科技股份有限公司 | 电子文档点读控制方法及装置 |
| CN105869446A (zh) * | 2016-03-29 | 2016-08-17 | 广州阿里巴巴文学信息技术有限公司 | 一种电子阅读装置和语音阅读加载方法 |
| CN107369462A (zh) * | 2017-07-21 | 2017-11-21 | 广州阿里巴巴文学信息技术有限公司 | 电子书语音播放方法、装置及终端设备 |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101968969B (zh) * | 2010-10-22 | 2015-01-21 | 康佳集团股份有限公司 | 电子书移动装置及电子书的背景音乐播放方法 |
| CN106960051B (zh) * | 2017-03-31 | 2019-12-10 | 掌阅科技股份有限公司 | 基于电子书的音频播放方法、装置和终端设备 |
- 2017-07-21: CN application CN201710601433.6A, patent CN107369462B (zh), status Active
- 2018-07-18: WO application PCT/CN2018/096162, patent WO2019015613A1 (fr), status Ceased (not active)
Also Published As
| Publication number | Publication date |
|---|---|
| CN107369462A (zh) | 2017-11-21 |
| CN107369462B (zh) | 2020-06-26 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| WO2019015613A1 (fr) | Procédé, appareil et dispositif terminal de lecture vocale de livre électronique | |
| JP7065740B2 (ja) | アプリケーション機能情報表示方法、装置、及び端末装置 | |
| US9213705B1 (en) | Presenting content related to primary audio content | |
| CN110992993B (zh) | 视频编辑方法、视频编辑装置、终端和可读存储介质 | |
| US20150213727A1 (en) | Custom Narration of Electronic Books | |
| CN112068750A (zh) | 一种房源的处理方法和装置 | |
| CN112799630B (zh) | 使用网络可寻址设备创建电影化的讲故事体验 | |
| CN112231021B (zh) | 软件新功能的引导方法和装置 | |
| JP2015517684A (ja) | コンテンツのカスタマイズ | |
| CN104301771A (zh) | 视频文件播放进度的调整方法及装置 | |
| CN107403011B (zh) | 虚拟现实环境语言学习实现方法和自动录音控制方法 | |
| US20170194031A1 (en) | Method and device for generating video slides | |
| US11511200B2 (en) | Game playing method and system based on a multimedia file | |
| CN109634501B (zh) | 电子书批注添加方法、电子设备及计算机存储介质 | |
| WO2014154097A1 (fr) | Méthode de lecture à haute voix automatique de contenu de page et dispositif associé | |
| CN116366917A (zh) | 视频编辑方法、装置、电子设备及存储介质 | |
| WO2016202176A1 (fr) | Procédé, dispositif et appareil de synthèse de fichier multimédia | |
| CN118784942B (zh) | 视频生成方法、电子设备、存储介质及产品 | |
| WO2018094952A1 (fr) | Procédé et appareil de recommandation de contenu | |
| WO2025067501A1 (fr) | Procédé et appareil d'édition vidéo, dispositif, et support | |
| WO2025137255A1 (fr) | Utilisation d'un modèle génératif dans la génération d'un résumé de contenu de forme longue | |
| US20240394077A1 (en) | Digital Character Interactions with Media Items in a Conversational Session | |
| US20140297285A1 (en) | Automatic page content reading-aloud method and device thereof | |
| CN112114770A (zh) | 基于语音交互的界面引导方法、装置及设备 | |
| KR101832464B1 (ko) | 동영상 제공 장치, 동영상 제공 방법, 및 컴퓨터 프로그램 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 18835085; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 18835085; Country of ref document: EP; Kind code of ref document: A1 |