
CN111935537A - Music video generation method and device, electronic equipment and storage medium - Google Patents


Info

Publication number
CN111935537A
Authority
CN
China
Prior art keywords
video
lyrics
acquiring
music
song audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010611868.0A
Other languages
Chinese (zh)
Inventor
陈明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202010611868.0A
Publication of CN111935537A


Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439: Processing of audio elementary streams
    • H04N21/4398: Processing of audio elementary streams involving reformatting operations of audio signals
    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00: Handling natural language data
    • G06F40/30: Semantic analysis
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435: Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355: Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44: Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402: Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The application discloses a music short video generation method and device, an electronic device and a storage medium, and relates to the fields of artificial intelligence and video processing. The specific implementation scheme is as follows: acquiring a song audio to be processed; acquiring a lyric text corresponding to the song audio; performing semantic analysis on the lyrics in the lyric text to generate content keywords of the lyrics; acquiring a video clip associated with the lyrics according to the content keywords of the lyrics; and synthesizing the song audio and the video clip associated with the lyrics to generate a corresponding music short video. By analyzing the video material and the song audio content, the application establishes tag information that associates the two, so that video material associated with a song is found automatically according to the tags and is automatically synthesized with the song audio. This reduces the difficulty of producing a music short video (MV) and makes its production automated and intelligent.

Description

Music video generation method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of computer technologies, in particular to the fields of artificial intelligence and video processing, and more particularly to a method and an apparatus for generating a music short video, an electronic device, and a storage medium.
Background
Human vision and hearing are connected: people can have similar feelings and emotional experiences when listening to music and when viewing pictures or videos. The relation between music and video plays a key role in the production of music short videos. At the same time, because video and image databases are massive, the producer of a music short video usually has to spend a great deal of time and energy finding or producing images or videos related to the music, which wastes manpower and material resources. Professional knowledge is also required, so amateurs find it difficult to produce the high-quality music short videos they want.
Disclosure of Invention
The disclosure provides a music short video generation method, a music short video generation device, an electronic device and a storage medium.
According to a first aspect of the present disclosure, there is provided a music short video generating method, including:
acquiring a song audio to be processed;
acquiring a lyric text corresponding to the song audio;
performing semantic analysis on the lyrics in the lyric text to generate content keywords of the lyrics;
acquiring a video clip associated with the lyrics according to the content keywords of the lyrics;
and synthesizing the song audio and the video clip associated with the lyrics to generate a corresponding music short video.
According to a second aspect of the present disclosure, there is provided a music short video generating apparatus comprising:
the first acquisition module is used for acquiring the audio of the song to be processed;
the lyric text acquisition module is used for acquiring a lyric text corresponding to the song audio;
the first generation module is used for performing semantic analysis on the lyrics in the lyric text to generate content keywords of the lyrics;
the second acquisition module is used for acquiring a video clip associated with the lyrics according to the content keywords of the lyrics;
and the second generation module is used for synthesizing the song audio and the video segment associated with the lyrics to generate a corresponding music short video.
According to a third aspect of the present disclosure, there is provided an electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of generating music short videos of the first aspect.
According to a fourth aspect of the present disclosure, there is provided a non-transitory computer readable storage medium storing computer instructions for causing a computer to execute the music short video generation method of the first aspect described above.
It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not intended to limit the present application. Wherein:
FIG. 1 is a flow diagram of a music short video generation method according to one embodiment of the present application;
FIG. 2 is an exemplary diagram of obtaining textual content of lyrics according to an embodiment of the present application;
FIG. 3 is an exemplary diagram of acquiring a video clip material tag according to an embodiment of the present application;
FIG. 4 is a flow diagram of a music short video generation method according to another embodiment of the present application;
FIG. 5 is an exemplary diagram of music short video composition according to an embodiment of the present application;
FIG. 6 is a block diagram of a music short video generating apparatus according to an embodiment of the present application;
FIG. 7 is a block diagram of a music short video generating apparatus according to another embodiment of the present application;
FIG. 8 is a block diagram of a music short video generating apparatus according to still another embodiment of the present application;
FIG. 9 is a block diagram of an electronic device for implementing a music short video generation method according to an embodiment of the present application.
Detailed Description
The following description of the exemplary embodiments of the present application, taken in conjunction with the accompanying drawings, includes various details of the embodiments to aid understanding; these details are to be considered exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
FIG. 1 is a flow chart of a music short video generation method according to one embodiment of the present application. It should be noted that the music short video generation method according to the embodiment of the present application may be performed by the music short video generating apparatus according to the embodiment of the present application, and the apparatus may be configured in an electronic device. The electronic device may be a mobile terminal, for example, a mobile phone, a tablet computer, a personal digital assistant, or another hardware device running any of various operating systems.
It should be further noted that, in the embodiment of the present application, the generation of the music short video may be realized through artificial intelligence technology. Artificial intelligence (AI) is a new technical science that researches and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. It is a branch of computer science that attempts to understand the essence of intelligence and to produce new intelligent machines that can react in a manner similar to human intelligence; research in the field includes robotics, language recognition, image recognition, natural language processing, and expert systems, among others. Since the birth of artificial intelligence, its theories and technologies have matured steadily and its application fields have expanded continuously, and it is conceivable that the technological products it brings in the future will be 'containers' of human intelligence. Artificial intelligence can simulate the information processes of human consciousness and thinking.
As shown in fig. 1, the music short video generating method may include:
Step 101, acquiring a song audio to be processed.
For example, assume that the music short video generation method of the embodiment of the present application is applied to an electronic device that provides an application program with which a user can produce a music video (MV). The application program may provide an upload interface through which a user may upload the song audio that needs to be processed. Optionally, the application may also provide a song audio database from which the user may select one audio as the song audio to be processed.
Step 102, acquiring a lyric text corresponding to the song audio.
In some embodiments of the present application, the lyric text corresponding to the song audio may be captured from the internet or locally on the electronic device according to the identification information of the song audio. In other embodiments of the present application, a lyric text corresponding to a song audio may also be obtained by performing speech recognition on the song audio.
Optionally, the song audio is subjected to speech recognition through an artificial intelligence algorithm to analyze and understand the content of the lyrics, so that a lyric text corresponding to the song audio is obtained. For example, a speech recognition model may be pre-established by using an artificial intelligence technique, a natural language processing technique, and a speech recognition technique, and the speech recognition model may be used to perform speech recognition on the song audio, so as to obtain the text content of the lyrics corresponding to the song audio.
For example, as shown in FIG. 2, a speech recognition model may be pre-established by using a deep learning algorithm, and the song audio may be input into the speech recognition model for frame-by-frame analysis, so as to obtain the lyric text content corresponding to the song audio.
Step 103, performing semantic analysis on the lyrics in the lyric text to generate content keywords of the lyrics.
Optionally, semantic analysis is performed on the lyrics in the lyric text by using a natural language processing technology to obtain content keywords of the lyrics. For example, taking the lyric "lock door and close window and turn off light" as an example, semantic analysis is performed on the lyric to obtain the content keywords "lock door" and "close window". For another example, taking the lyric "pedestrian lanes on a city on a road" as an example, semantic analysis is performed on the lyric to obtain the content keyword "city".
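The keyword step can be illustrated with a deliberately simple stand-in. The sketch below is not the semantic analysis the application describes: it assumes a hypothetical, hand-written vocabulary of candidate keywords (`VOCABULARY`) and merely reports which of them occur in a lyric as whole words, in order of appearance. The function name `extract_keywords` is also an assumption made for illustration.

```python
from typing import List

# Hypothetical vocabulary of candidate content keywords (an assumption,
# not part of the application). A real system would use an NLP model.
VOCABULARY = ["lock door", "close window", "city", "seaside"]


def extract_keywords(lyric: str, vocabulary: List[str]) -> List[str]:
    """Return vocabulary terms that occur in the lyric as whole words."""
    # Normalise case and whitespace, then pad with spaces so that a term
    # only matches on word boundaries. Punctuation is not handled here.
    padded = " " + " ".join(lyric.lower().split()) + " "
    hits = []
    for term in vocabulary:
        position = padded.find(" " + term.lower() + " ")
        if position != -1:
            hits.append((position, term))
    # Report keywords in the order in which they appear in the lyric.
    return [term for _, term in sorted(hits)]
```

With the two lyrics quoted above, this toy matcher yields "lock door" and "close window" for the first and "city" for the second, but only because those phrases were placed in the assumed vocabulary.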
In order to make the generated MV content fit the song better and be more diverse, in some embodiments of the application, semantic analysis can be performed on each lyric in the lyric text to generate content keywords for each lyric. A video clip associated with each lyric can then be obtained according to the content keywords of that lyric, and the MV of the song is generated based on the video clips associated with the individual lyrics.
It should be noted that, in some embodiments of the present application, the content keyword may further include an emotion type. Optionally, in order to find a video clip that better matches the song audio, that is, one that better represents the emotion expressed by the song audio, emotion analysis may also be performed on each lyric when semantic analysis is performed on it, to determine the emotion type expressed by the lyric. A video clip associated with each lyric can subsequently be found based on the emotion type of that lyric, so that the video clip better represents the emotion the lyric expresses. For example, taking the lyric "how i can make you die away" as an example, semantic analysis and emotion analysis can be performed on the lyric to obtain the content keywords "leave" and "pain", wherein the keyword "leave" is obtained based on the semantic analysis, and the keyword "pain" is obtained based on the emotion analysis.
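As a rough illustration of attaching an emotion type to a lyric, the sketch below uses a tiny word-to-emotion lexicon and a majority vote. The lexicon contents, the emotion labels, and the name `emotion_type` are all assumptions made for this example; the application does not specify how emotion analysis is performed, and a practical system would more likely use a trained classifier.

```python
from collections import Counter
from typing import Dict

# Hypothetical lexicon mapping words to emotion types (an assumption).
EMOTION_LEXICON: Dict[str, str] = {
    "leave": "pain",
    "tears": "pain",
    "alone": "pain",
    "smile": "joy",
    "sunshine": "joy",
}


def emotion_type(lyric: str,
                 lexicon: Dict[str, str] = EMOTION_LEXICON,
                 default: str = "neutral") -> str:
    """Return the emotion type voted for by most lexicon words in the lyric."""
    votes = Counter(lexicon[word] for word in lyric.lower().split()
                    if word in lexicon)
    if not votes:
        # No lexicon word occurs in the lyric.
        return default
    return votes.most_common(1)[0][0]
```

The resulting label could then be appended to the lyric's content keywords, so that it can be matched against emotion tags on the video clips.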
Step 104, acquiring a video clip associated with the lyrics according to the content keywords of the lyrics.
Optionally, after the content keywords of each lyric are obtained, the video clip associated with each lyric may be obtained according to those keywords. It should be noted that the video clips may be provided by the user in advance. For example, the user provides a video together with the song audio to be processed, intending some or all of the clips in the video to be synthesized with the song audio to generate the MV of the song. Alternatively, the embodiment of the application can establish a video material library in advance and acquire the video clip associated with each lyric from the video material library based on the content keywords of the lyric.
In some embodiments of the present application, before acquiring the audio of the song to be processed, a plurality of video clip materials may be acquired, the tag information of each video clip material may be acquired, and a video material library may be established according to the plurality of video clip materials and the tag information of each video clip material.
Optionally, some video segment materials are obtained in advance, content understanding and analysis are performed on the video segment materials by using an artificial intelligence technology, tags are distributed to the video segment materials according to the analyzed content to obtain tag information of each video segment material, and then a video material library can be established according to the video segment materials and the tag information thereof.
For example, as shown in FIG. 3, some video clip materials may be crawled from the internet and analyzed frame by frame through an artificial intelligence algorithm to obtain the element information displayed in each frame, and the tag information of each video clip material is obtained by aggregating the element information over its frames. For example, the video clip material may be analyzed frame by frame through an image recognition algorithm to obtain the element information displayed in each frame, and that information is then counted: if most of the frames in the video clip material show seaside and beauty, the tag information of the video clip material may be determined to be seaside and beauty.
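The counting step in this example can be sketched as follows. The sketch assumes that some image recognizer has already produced a list of element names for every frame; the recognizer itself is out of scope. The function name `clip_tags` and the `min_share` parameter (here, "more than half of the frames") are assumptions, since the text only says "most of the frames".

```python
from collections import Counter
from typing import List, Sequence


def clip_tags(frame_elements: Sequence[Sequence[str]],
              min_share: float = 0.5) -> List[str]:
    """Return elements shown in more than `min_share` of the frames."""
    if not frame_elements:
        return []
    counts = Counter()
    for elements in frame_elements:
        # Count each element at most once per frame.
        counts.update(set(elements))
    needed = min_share * len(frame_elements)
    return sorted(tag for tag, n in counts.items() if n > needed)
```

For instance, if "seaside" appears in four of four frames, "beauty" in three and "bird" in one, the clip would be tagged with "beauty" and "seaside" only.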
In the embodiment of the application, the tag information matched with the content keyword is found from the video material library, and the video segment material corresponding to the found tag information is determined as the video segment associated with the lyric corresponding to the content keyword. That is, the content keyword of the lyric can be used to find out the tag information matching with the content keyword from the video material library, and find out the video segment material corresponding to the tag information in the video material library, and determine the video segment material as the video segment associated with the lyric corresponding to the content keyword. For example, assuming that the content keyword of a certain lyric is "seaside", the tag information "seaside" matching with the content keyword "seaside" can be found from the video material library, and the video segment material corresponding to the tag information is used as the video segment associated with the certain lyric. It should be noted that the tag information of the video clip material in the video material library may further include tags of emotion types, so that the obtained video clip can better present the emotion expressed by the lyrics.
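One plausible way to organise such a library is an inverted index from tag to clip identifiers, so that a keyword lookup is a dictionary access. This is a minimal sketch under that assumption; the application does not describe the library's data structure, and the names `build_library` and `find_clips` as well as the clip identifiers are hypothetical. Matching here is exact string equality between keyword and tag.

```python
from collections import defaultdict
from typing import Dict, List


def build_library(clips: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Build an inverted index: tag -> list of clip ids carrying that tag."""
    index = defaultdict(list)
    for clip_id, tags in clips.items():
        for tag in tags:
            index[tag].append(clip_id)
    return dict(index)


def find_clips(index: Dict[str, List[str]], keywords: List[str]) -> List[str]:
    """Return clip ids whose tags exactly match any keyword, without repeats."""
    found: List[str] = []
    for keyword in keywords:
        for clip_id in index.get(keyword, []):
            if clip_id not in found:
                found.append(clip_id)
    return found
```

Because emotion types are stored as ordinary tags in this sketch, a keyword list such as `["seaside", "pain"]` retrieves clips matching either the content or the emotion.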
Step 105, synthesizing the song audio and the video clip associated with each lyric to generate a corresponding music short video.
Optionally, the audio corresponding to each lyric in the song audio is synthesized with the video clip associated with that lyric to generate a combined audio/video segment, and the combined segments are then spliced in the order of the lyrics to obtain the music short video of the song.
In some embodiments of the application, the playing time of each lyric can be obtained according to the song audio, the playing time of the video segment associated with each lyric is adjusted according to the playing time, then the video segments associated with each lyric are synthesized into one video according to the sequence of each lyric in the song audio, and the synthesized video and the song audio are synthesized to generate the corresponding music video.
In the embodiment of the present application, the playing duration of the video clip associated with each lyric may be adjusted by, but not limited to, fast playback, slow playback, and the like. For example, if the video clip is too long, a portion whose playing duration equals that of the lyric audio may be cut from it. If the video clip is too short, several video clips associated with the lyric may be spliced together so that the playing duration of the spliced clip matches the playing duration of the lyric audio.
According to the music short video generation method of the embodiment of the application, a song audio to be processed is obtained, speech recognition is performed on the song audio to obtain the corresponding lyric text, semantic analysis is performed on the lyrics in the lyric text to generate content keywords of the lyrics, video clips associated with the lyrics are obtained according to the content keywords, and the song audio and the video clips associated with the lyrics are synthesized to generate the corresponding music short video. Throughout this process, tag information is established through analysis of the video material and the song audio content and is used to associate the two, so that the video material associated with a song can be found automatically according to the tags and automatically synthesized with the song audio. The user does not need professional video production skills, which reduces the difficulty of producing a music short video, improves the user experience, makes the whole process simpler and more intelligent, and saves labor and time.
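The duration adjustments just described reduce to simple arithmetic, sketched below. `speed_factor` gives the playback rate that would make a clip last exactly as long as the lyric, and `fit_segments` plans a trim-or-splice: it takes the clips associated with a lyric in order and trims the last one used. Reusing the clips cyclically when they run out is an assumption of this sketch, as are both function names; the text does not say what to do in that case.

```python
from typing import List, Sequence, Tuple


def speed_factor(clip_seconds: float, lyric_seconds: float) -> float:
    """Playback rate (>1 is faster) that fits the clip to the lyric."""
    if clip_seconds <= 0 or lyric_seconds <= 0:
        raise ValueError("durations must be positive")
    return clip_seconds / lyric_seconds


def fit_segments(clip_seconds: Sequence[float],
                 lyric_seconds: float) -> List[Tuple[int, float]]:
    """Plan (clip_index, seconds_used) pairs that add up to lyric_seconds."""
    if lyric_seconds <= 0 or not clip_seconds or min(clip_seconds) <= 0:
        raise ValueError("need positive durations and at least one clip")
    plan: List[Tuple[int, float]] = []
    remaining = lyric_seconds
    i = 0
    while remaining > 1e-9:
        index = i % len(clip_seconds)            # cycle through the clips
        used = min(clip_seconds[index], remaining)  # trim the last one
        plan.append((index, used))
        remaining -= used
        i += 1
    return plan
```

A 10-second clip for a 4-second lyric gives the plan `[(0, 4.0)]` (a trim), while clips of 3 and 2 seconds for a 6-second lyric give three segments, the last being the first clip trimmed to 1 second.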
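Steps 101 to 105 can be summarised as a small orchestration function. This is only a structural sketch: each stage is passed in as a callable, and all names (`generate_music_video`, `get_lyrics`, `extract_keywords`, `find_clip`, `compose`) are hypothetical placeholders for the speech recognition, semantic analysis, retrieval and synthesis components that the text describes but does not specify.

```python
from typing import Any, Callable, List


def generate_music_video(
    song_audio: Any,                                        # step 101
    get_lyrics: Callable[[Any], List[str]],
    extract_keywords: Callable[[str], List[str]],
    find_clip: Callable[[List[str]], Any],
    compose: Callable[[Any, List[str], List[Any]], Any],
) -> Any:
    """Run the five steps in order, one associated clip per lyric line."""
    lyrics = get_lyrics(song_audio)                          # step 102
    keywords = [extract_keywords(line) for line in lyrics]   # step 103
    clips = [find_clip(kws) for kws in keywords]             # step 104
    return compose(song_audio, lyrics, clips)                # step 105
```

Passing the stages as parameters keeps the flow independent of any particular recognition model or video library, which matches the way the embodiments allow either a pre-built material library or a user-provided video as the clip source.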
In order to further improve the user experience, increase the participation of the user, and meet the personalized requirements of the user, in some embodiments of the present application, as shown in fig. 4, the specific implementation process of obtaining the video clip associated with each lyric according to the content keyword of each lyric may include:
Step 401, acquiring a video provided by a user, and segmenting the video according to scenes to obtain a plurality of video clips.
Optionally, the user selects in advance a video to be combined with the song audio to be processed in order to generate the MV of the song. Specifically, after the video provided by the user is acquired, the video can be segmented according to scenes to obtain a plurality of video clips. It will be appreciated that, in addition to segmenting the video by scene, the video may be segmented in other manners, such as based on content, which is not specifically limited in this application.
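The application does not say how scenes are detected. One common heuristic, used here purely as an assumption, is to cut wherever consecutive frames differ by more than a threshold. The sketch below works on per-frame feature vectors (for example, normalised colour histograms computed elsewhere) and returns half-open frame ranges; the name `split_scenes` and the `cut_threshold` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple


def split_scenes(frame_features: Sequence[Sequence[float]],
                 cut_threshold: float) -> List[Tuple[int, int]]:
    """Split frames into scenes, returned as (start, end) with end exclusive."""
    if not frame_features:
        return []
    scenes: List[Tuple[int, int]] = []
    start = 0
    for i in range(1, len(frame_features)):
        # L1 distance between the features of consecutive frames.
        difference = sum(abs(a - b) for a, b in
                         zip(frame_features[i - 1], frame_features[i]))
        if difference > cut_threshold:
            scenes.append((start, i))
            start = i
    scenes.append((start, len(frame_features)))
    return scenes
```

Each returned range corresponds to one video clip whose tag information is then obtained in step 402.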
Step 402, obtaining the label information of each video clip.
Optionally, for each video clip, performing frame-by-frame analysis on the video clip to obtain element information displayed by each frame of picture in the video clip; and acquiring label information of the video clip according to the element information displayed by each frame of picture.
Step 403, calculating the similarity between the content keyword of the lyric and the tag information of each video clip.
Optionally, a similarity measure algorithm is used to calculate the similarity between the content keyword of each lyric and the tag information of each video clip. For example, the content keyword and the tag information may be converted into corresponding vector features, respectively, and the similarity between the content keyword and the tag information may be calculated using the vector features. The similarity metric algorithm may include, but is not limited to, a cosine similarity algorithm, a manhattan distance algorithm, a euclidean distance algorithm, etc.
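The three measures named above can be written out directly. The sketch assumes that keywords and tags have already been converted to equal-length numeric vectors by some embedding step that is not shown; the function names are chosen for illustration. Note that cosine similarity grows with similarity, whereas the Manhattan and Euclidean functions are distances that shrink with it, so a threshold test has to be oriented accordingly.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between u and v; 0.0 if either vector is zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def manhattan_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Sum of absolute coordinate differences (L1 distance)."""
    return sum(abs(a - b) for a, b in zip(u, v))


def euclidean_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Straight-line (L2) distance between u and v."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
```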
Step 404, obtaining target tag information with similarity greater than a preset threshold and corresponding target content keywords from the tag information of each video clip and the content keywords of the lyrics.
Step 405, determining the video segment corresponding to the target tag information as the video segment associated with the lyric corresponding to the target content keyword.
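Steps 403 to 405 can be sketched as a thresholded match over vectors. The text only requires the similarity to exceed a preset threshold; choosing the single best-scoring clip for each keyword, as done below, is an additional assumption, as are the names `match_clips` and `cosine` and the example vectors. Cosine similarity is used as the measure.

```python
import math
from typing import Dict, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; 0.0 if either vector is zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norms = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norms if norms else 0.0


def match_clips(keyword_vectors: Dict[str, Sequence[float]],
                tag_vectors: Dict[str, Sequence[float]],
                threshold: float) -> Dict[str, str]:
    """Map each keyword to the clip whose tag vector is most similar,
    keeping only pairs whose similarity exceeds the threshold."""
    matches: Dict[str, str] = {}
    for keyword, k_vec in keyword_vectors.items():
        best_clip, best_score = None, threshold
        for clip_id, t_vec in tag_vectors.items():
            score = cosine(k_vec, t_vec)             # step 403
            if score > best_score:                   # step 404
                best_clip, best_score = clip_id, score
        if best_clip is not None:
            matches[keyword] = best_clip             # step 405
    return matches
```

Keywords with no clip above the threshold are simply left out of the result, so the caller can fall back to another source such as a pre-built material library.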
For example, suppose that a video provided by a user is divided by scene into a plurality of video clips and the tag information of each video clip is obtained. The content keywords of each lyric in the song audio to be processed are matched against the tag information of the video clips to obtain the video clip associated with each lyric. As shown in FIG. 5, the playing duration of the video clip associated with each lyric is adjusted according to the playing duration of that lyric, the video clips associated with the lyrics are then synthesized into one video according to the order of the lyrics in the song audio, and the synthesized video is combined with the song audio to generate the complete MV.
Therefore, by providing the user with an interface for supplying a video, the user can freely choose the video, which increases the user's sense of participation, meets the user's personalized requirements, and further improves the user experience.
Fig. 6 is a block diagram of a music short video generating apparatus according to an embodiment of the present application. As shown in fig. 6, the music short video generating apparatus 600 may include: a first obtaining module 610, a lyric text obtaining module 620, a first generating module 630, a second obtaining module 640, and a second generating module 650.
Specifically, the first obtaining module 610 is configured to obtain song audio to be processed.
The lyric text obtaining module 620 is configured to obtain a lyric text corresponding to the song audio.
The first generating module 630 is configured to perform semantic analysis on the lyrics in the lyric text, and generate content keywords of the lyrics.
The second obtaining module 640 is configured to obtain a video clip associated with the lyrics according to the content keywords of the lyrics.
The second generating module 650 is configured to synthesize the song audio and the video clip associated with the lyrics to generate the corresponding music short video. In some embodiments of the present application, the second generating module 650 is specifically configured to: acquire the playing duration of the lyrics according to the song audio; adjust the playing duration of the video clip associated with the lyrics according to that playing duration; synthesize the video clips associated with the lyrics into a video according to the order of the lyrics in the song audio; and synthesize the synthesized video and the song audio to generate a corresponding music short video.
In some embodiments of the present application, as shown in fig. 7, the second obtaining module 640 may include: a first acquisition unit 641, a video division unit 642, a second acquisition unit 643, a similarity calculation unit 644, a third acquisition unit 645, and a determination unit 646. The first obtaining unit 641 is configured to obtain a video provided by a user; the video segmentation unit 642 is configured to segment the video according to the scene to obtain a plurality of video segments; the second obtaining unit 643 is configured to obtain label information of each video clip; the similarity calculation unit 644 is configured to calculate a similarity between a content keyword of the lyrics and the tag information of each video clip; the third obtaining unit 645 is configured to obtain target tag information with a similarity greater than a preset threshold and a target content keyword corresponding to the target tag information from the tag information of each video clip and the content keyword of the lyric; the determining unit 646 is configured to determine a video segment corresponding to the target tag information as a video segment associated with the lyric corresponding to the target content keyword.
In some embodiments of the present application, the second obtaining unit 643 is specifically configured to: for each video clip, analyze the video clip frame by frame to obtain the element information displayed in each frame of the video clip; and acquire the tag information of the video clip according to the element information displayed in each frame.
In some embodiments of the present application, as shown in FIG. 8, the music short video generating apparatus 600 may further include: a third obtaining module 660, a fourth obtaining module 670, and an establishing module 680. The third obtaining module 660 is configured to obtain a plurality of video clip materials; the fourth obtaining module 670 is configured to obtain the tag information of each video clip material; the establishing module 680 is configured to establish a video material library according to the plurality of video clip materials and the tag information of each video clip material.
In this embodiment of the application, the second obtaining module 640 finds out tag information matched with the content keyword from the video material library; and determining the video segment material corresponding to the found label information as the video segment associated with the lyric corresponding to the content keyword.
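As a non-limiting illustration, a video material library and its keyword lookup might be sketched as an in-memory index from tag to material. The class name `VideoMaterialLibrary`, its methods, and the choice of exact matching after case normalization are assumptions; the application does not state how "matching" tag information is determined or how the library is stored.

```python
from collections import defaultdict


class VideoMaterialLibrary:
    """Hypothetical in-memory library indexing segment materials by tag."""

    def __init__(self):
        self._by_tag = defaultdict(list)

    def add(self, material_id, tags):
        """Register one video segment material under each of its tags."""
        for tag in tags:
            key = tag.strip().lower()
            if material_id not in self._by_tag[key]:
                self._by_tag[key].append(material_id)

    def find(self, keyword):
        """Return materials whose tag matches the content keyword exactly."""
        return list(self._by_tag.get(keyword.strip().lower(), []))


# Hypothetical example data.
library = VideoMaterialLibrary()
library.add("m1", ["Snow", "mountain"])
library.add("m2", ["snow"])
print(library.find("snow"))
```

Because the library is built before any song is processed, the lookup at generation time reduces to a dictionary access per keyword.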
According to the music video clip generation device provided by the embodiment of the application, the tag information can be established for association through the analysis of the video material and the song audio content, so that the video material associated with the song can be automatically found according to the tag, the song audio and the video material associated with the song audio can be automatically synthesized, the user does not need to be required to have the professional technology of video production, the production difficulty of the music video clip is reduced, the user experience can be greatly increased, the whole process is more concise and intelligent, and the labor and time cost are saved.
According to an embodiment of the present application, an electronic device and a readable storage medium are also provided.
Fig. 9 is a block diagram of an electronic device for implementing the music short video generation method according to an embodiment of the present application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the present application that are described and/or claimed herein.
As shown in fig. 9, the electronic device includes: one or more processors 901, a memory 902, and interfaces for connecting the various components, including a high-speed interface and a low-speed interface. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions for execution within the electronic device, including instructions stored in or on the memory to display graphical information of a GUI on an external input/output apparatus (such as a display device coupled to the interface). In other embodiments, multiple processors and/or multiple buses may be used, along with multiple memories, as desired. Also, multiple electronic devices may be connected, with each device providing portions of the necessary operations (e.g., as a server array, a group of blade servers, or a multi-processor system). Fig. 9 takes one processor 901 as an example.
Memory 902 is a non-transitory computer readable storage medium as provided herein. Wherein the memory stores instructions executable by at least one processor to cause the at least one processor to perform the music short video generation method provided herein. The non-transitory computer-readable storage medium of the present application stores computer instructions for causing a computer to execute the music short video generation method provided by the present application.
The memory 902, which is a non-transitory computer readable storage medium, may be used to store non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules corresponding to the music short video generation method in the embodiment of the present application (for example, the first obtaining module 610, the speech recognition module 620, the first generation module 630, the second obtaining module 640, and the second generation module 650 shown in fig. 6). The processor 901 executes various functional applications of the server and data processing by running non-transitory software programs, instructions, and modules stored in the memory 902, that is, implements the music short video generation method in the above-described method embodiment.
The memory 902 may include a program storage area and a data storage area, wherein the program storage area may store an operating system and an application program required for at least one function, and the data storage area may store data created through use of the electronic device that implements the music short video generation method, and the like. Further, the memory 902 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory 902 may optionally include memory located remotely from the processor 901, and such remote memory may be connected via a network to the electronic device that implements the music short video generation method. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The electronic device to implement the music short video generating method may further include: an input device 903 and an output device 904. The processor 901, the memory 902, the input device 903 and the output device 904 may be connected by a bus or other means, and fig. 9 illustrates the connection by a bus as an example.
The input device 903 may receive input numeric or character information and generate key signal inputs related to user settings and function control of the electronic device that implements the music short video generation method. Examples of the input device include a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointing stick, one or more mouse buttons, a track ball, a joystick, and the like. The output device 904 may include a display device, auxiliary lighting devices (e.g., LEDs), tactile feedback devices (e.g., vibrating motors), and the like. The display device may include, but is not limited to, a Liquid Crystal Display (LCD), a Light Emitting Diode (LED) display, and a plasma display. In some implementations, the display device can be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, ASICs (application-specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software applications, or code) include machine instructions for a programmable processor, and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
According to the technical solution of the embodiment of the application, tag information is established by analyzing the video material and the song audio content, and the two are associated through the tags, so that video material associated with a song can be found automatically according to the tags, and the song audio and its associated video material are synthesized automatically. The user is not required to have professional video production skills, which reduces the difficulty of producing a music short video and can greatly improve the user experience; the whole process is simpler and more intelligent, and labor and time costs are saved.
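As a non-limiting illustration, the timing side of the automatic synthesis summarized above might be sketched as follows: each lyric line's playing duration is taken as the interval until the next line starts, and the associated video segment is scheduled for that duration in lyric order. The function name `build_timeline`, the input layout, and the rule of ending the last line at the end of the song are assumptions; actual trimming, concatenation, and muxing with the song audio would be delegated to a video processing tool and are not shown.

```python
def build_timeline(lyric_starts, song_duration, segment_for_line):
    """Schedule one video segment per lyric line, in singing order.

    lyric_starts:     [(line_text, start_seconds), ...] sorted by start time
    song_duration:    total length of the song audio in seconds
    segment_for_line: {line_text: segment_id} from the association step
    """
    timeline = []
    for i, (line, start) in enumerate(lyric_starts):
        # A line plays until the next line starts; the last line plays
        # until the end of the song (an assumption of this sketch).
        if i + 1 < len(lyric_starts):
            end = lyric_starts[i + 1][1]
        else:
            end = song_duration
        timeline.append({
            "line": line,
            "segment": segment_for_line[line],
            "start": start,
            "duration": end - start,  # target duration for the segment
        })
    return timeline


# Hypothetical example data.
starts = [("walking by the sea", 0.0), ("rain over the city", 4.0)]
segments = {"walking by the sea": "seg_0", "rain over the city": "seg_1"}
for entry in build_timeline(starts, 10.0, segments):
    print(entry)
```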
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present application may be executed in parallel, sequentially, or in different orders, and the present invention is not limited thereto as long as the desired results of the technical solutions disclosed in the present application can be achieved.
The above-described embodiments should not be construed as limiting the scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (14)

1. A music short video generation method comprises the following steps:
acquiring a song audio to be processed;
acquiring a lyric text corresponding to the song audio;
performing semantic analysis on the lyrics in the lyric text to generate content keywords of the lyrics;
acquiring a video segment associated with the lyrics according to the content keywords of the lyrics;
and synthesizing the song audio and the video segment associated with the lyrics to generate a corresponding music short video.
2. The music short video generation method of claim 1, wherein the acquiring a video segment associated with the lyrics according to the content keywords of the lyrics comprises:
acquiring a video, and segmenting the video according to scenes to obtain a plurality of video segments;
acquiring tag information of each video segment;
calculating the similarity between the content keywords of the lyrics and the tag information of each video segment;
acquiring, from the tag information of each video segment and the content keywords of the lyrics, target tag information whose similarity is greater than a preset threshold and the target content keyword corresponding to the target tag information;
and determining the video segment corresponding to the target tag information as the video segment associated with the lyrics corresponding to the target content keyword.
3. The music short video generation method of claim 2, wherein the acquiring tag information of each video segment comprises:
for each video segment, analyzing the video segment to obtain element information displayed in each frame of the video segment;
and acquiring the tag information of the video segment according to the element information displayed in each frame.
4. The music short video generation method of claim 1, wherein, prior to acquiring the song audio to be processed, the method further comprises:
acquiring a plurality of video segment materials;
acquiring tag information of each video segment material;
and establishing a video material library according to the plurality of video segment materials and the tag information of each video segment material.
5. The music short video generation method of claim 4, wherein the acquiring a video segment associated with the lyrics according to the content keywords of the lyrics comprises:
finding, in the video material library, tag information that matches the content keyword;
and determining the video segment material corresponding to the found tag information as the video segment associated with the lyrics corresponding to the content keyword.
6. The music short video generation method of any one of claims 1 to 5, wherein the synthesizing the song audio and the video segment associated with the lyrics to generate a corresponding music short video comprises:
acquiring the playing duration of the lyrics according to the song audio;
adjusting the playing duration of the video segment associated with the lyrics according to the playing duration of the lyrics;
synthesizing the video segments associated with the lyrics into a video according to the order of the lyrics in the song audio;
and synthesizing the synthesized video and the song audio to generate the corresponding music short video.
7. A music short video generating apparatus, comprising:
a first obtaining module, configured to acquire a song audio to be processed;
a lyric text obtaining module, configured to acquire a lyric text corresponding to the song audio;
a first generation module, configured to perform semantic analysis on the lyrics in the lyric text to generate content keywords of the lyrics;
a second obtaining module, configured to acquire a video segment associated with the lyrics according to the content keywords of the lyrics;
and a second generation module, configured to synthesize the song audio and the video segment associated with the lyrics to generate a corresponding music short video.
8. The music short video generating apparatus according to claim 7, wherein the second obtaining module comprises:
a first obtaining unit, configured to acquire a video;
a video segmentation unit, configured to segment the video according to scenes to obtain a plurality of video segments;
a second obtaining unit, configured to acquire tag information of each video segment;
a similarity calculation unit, configured to calculate a similarity between the content keywords of the lyrics and the tag information of each video segment;
a third obtaining unit, configured to acquire, from the tag information of each video segment and the content keywords of the lyrics, target tag information whose similarity is greater than a preset threshold and the target content keyword corresponding to the target tag information;
and a determining unit, configured to determine the video segment corresponding to the target tag information as the video segment associated with the lyrics corresponding to the target content keyword.
9. The music short video generating apparatus according to claim 8, wherein the second obtaining unit is specifically configured to:
for each video segment, analyze the video segment to obtain element information displayed in each frame of the video segment;
and acquire the tag information of the video segment according to the element information displayed in each frame.
10. The music short video generating apparatus according to claim 7, further comprising:
a third obtaining module, configured to acquire a plurality of video segment materials;
a fourth obtaining module, configured to acquire tag information of each video segment material;
and an establishing module, configured to establish a video material library according to the plurality of video segment materials and the tag information of each video segment material.
11. The music short video generating apparatus according to claim 10, wherein the second obtaining module is specifically configured to:
find, in the video material library, tag information that matches the content keyword;
and determine the video segment material corresponding to the found tag information as the video segment associated with the lyrics corresponding to the content keyword.
12. The music short video generating apparatus according to any one of claims 7 to 11, wherein the second generation module is specifically configured to:
acquire the playing duration of the lyrics according to the song audio;
adjust the playing duration of the video segment associated with the lyrics according to the playing duration of the lyrics;
synthesize the video segments associated with the lyrics into a video according to the order of the lyrics in the song audio;
and synthesize the synthesized video and the song audio to generate the corresponding music short video.
13. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the music short video generation method of any one of claims 1 to 6.
14. A non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the music short video generation method of any one of claims 1 to 6.
CN202010611868.0A 2020-06-30 2020-06-30 Music video generation method and device, electronic equipment and storage medium Pending CN111935537A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010611868.0A CN111935537A (en) 2020-06-30 2020-06-30 Music video generation method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111935537A true CN111935537A (en) 2020-11-13

Family

ID=73317506

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010611868.0A Pending CN111935537A (en) 2020-06-30 2020-06-30 Music video generation method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111935537A (en)

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112423107A (en) * 2020-11-18 2021-02-26 北京字跳网络技术有限公司 Lyric video display method and device, electronic equipment and computer readable medium
CN112487248A (en) * 2020-12-01 2021-03-12 深圳市易平方网络科技有限公司 Video file label generation method and device, intelligent terminal and storage medium
CN112541353A (en) * 2020-12-24 2021-03-23 北京百度网讯科技有限公司 Video generation method, device, equipment and medium
CN112632326A (en) * 2020-12-24 2021-04-09 北京风平科技有限公司 Video production method and device based on video script semantic recognition
CN112784056A (en) * 2020-12-31 2021-05-11 北京视连通科技有限公司 Short video generation method based on video intelligent identification and intelligent semantic search
CN112800263A (en) * 2021-02-03 2021-05-14 上海艾麒信息科技股份有限公司 Video synthesis system, method and medium based on artificial intelligence
CN112911379A (en) * 2021-01-15 2021-06-04 北京字跳网络技术有限公司 Video generation method and device, electronic equipment and storage medium
CN113050857A (en) * 2021-03-26 2021-06-29 北京字节跳动网络技术有限公司 Music sharing method and device, electronic equipment and storage medium
CN113329258A (en) * 2021-06-10 2021-08-31 王之华 Song video synthesis method and player
CN113365134A (en) * 2021-06-02 2021-09-07 北京字跳网络技术有限公司 Audio sharing method, device, equipment and medium
CN113377971A (en) * 2021-05-31 2021-09-10 北京达佳互联信息技术有限公司 Multimedia resource generation method and device, electronic equipment and storage medium
CN113434733A (en) * 2021-06-28 2021-09-24 平安科技(深圳)有限公司 Text-based video file generation method, device, equipment and storage medium
CN113518160A (en) * 2021-01-12 2021-10-19 腾讯科技(深圳)有限公司 Video generation method, device, equipment and storage medium
CN113572977A (en) * 2021-07-06 2021-10-29 上海哔哩哔哩科技有限公司 Video production method and device
CN113628637A (en) * 2021-07-02 2021-11-09 北京达佳互联信息技术有限公司 Audio identification method, device, equipment and storage medium
CN113676772A (en) * 2021-08-16 2021-11-19 上海哔哩哔哩科技有限公司 Video generation method and device
CN113709548A (en) * 2021-08-09 2021-11-26 北京达佳互联信息技术有限公司 Image-based multimedia data synthesis method, device, equipment and storage medium
CN113709529A (en) * 2021-04-13 2021-11-26 腾讯科技(深圳)有限公司 Video synthesis method and device, electronic equipment and computer readable medium
CN113792178A (en) * 2021-08-31 2021-12-14 北京达佳互联信息技术有限公司 A song generation method, device, electronic device and storage medium
CN114245171A (en) * 2021-12-15 2022-03-25 百度在线网络技术(北京)有限公司 Video editing method, video editing device, electronic equipment and media
CN114242070A (en) * 2021-12-20 2022-03-25 阿里巴巴(中国)有限公司 Video generation method, device, equipment and storage medium
CN114286169A (en) * 2021-08-31 2022-04-05 腾讯科技(深圳)有限公司 Video generation method, device, terminal, server and storage medium
CN114513706A (en) * 2022-03-22 2022-05-17 中国平安人寿保险股份有限公司 Video generation method and device, computer equipment and storage medium
CN115442540A (en) * 2022-08-31 2022-12-06 中国联合网络通信集团有限公司 Music video generation method, device, computer equipment and storage medium
CN115599951A (en) * 2021-07-09 2023-01-13 广州酷狗计算机科技有限公司(Cn) Video display method and device, storage medium and electronic equipment
CN115964531A (en) * 2022-11-30 2023-04-14 海尔优家智能科技(北京)有限公司 Audio file processing method and device, storage medium and electronic device
CN116226453A (en) * 2023-05-10 2023-06-06 北京小糖科技有限责任公司 Method, device and terminal equipment for identifying dancing teaching video clips
CN116386659A (en) * 2023-02-02 2023-07-04 北京达佳互联信息技术有限公司 A music video generation method, device, electronic equipment and storage medium
CN116489478A (en) * 2023-04-10 2023-07-25 杭州网易云音乐科技有限公司 Video generation method, device, medium and computing device
CN117041426A (en) * 2023-09-19 2023-11-10 天翼爱音乐文化科技有限公司 Video color ring optimization manufacturing method, system, equipment and storage medium
WO2024046484A1 (en) * 2022-09-02 2024-03-07 北京字跳网络技术有限公司 Video generation method and apparatus, device, storage medium, and program product
CN117956247A (en) * 2023-12-27 2024-04-30 北京信息科技大学 Music-driven video automatic generation method, system, equipment and medium
CN119071389A (en) * 2024-08-08 2024-12-03 咪咕文化科技有限公司 Method, device, equipment, medium and product for generating digital human video ringback tone
WO2024251224A1 (en) * 2023-06-08 2024-12-12 北京字跳网络技术有限公司 Music video generation method and apparatus, and electronic device and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103793446A (en) * 2012-10-29 2014-05-14 汤晓鸥 Music video generation method and system
CN105224581A (en) * 2014-07-03 2016-01-06 北京三星通信技术研究有限公司 The method and apparatus of picture is presented when playing music
US20160134855A1 (en) * 2013-06-26 2016-05-12 Kddi Corporation Scenario generation system, scenario generation method and scenario generation program
CN105930485A (en) * 2016-04-28 2016-09-07 深圳市金立通信设备有限公司 Audio media playing method, communication device and network system
CN107610725A (en) * 2017-09-19 2018-01-19 广东小天才科技有限公司 Video production method and terminal
CN110121107A (en) * 2018-02-06 2019-08-13 上海全土豆文化传播有限公司 Video material collection method and device
CN110619673A (en) * 2018-06-19 2019-12-27 阿里巴巴集团控股有限公司 Method for generating and playing sound chart, method, system and equipment for processing data
US20200051536A1 (en) * 2017-09-30 2020-02-13 Tencent Technology (Shenzhen) Company Limited Method and apparatus for generating music

Cited By (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112423107B (en) * 2020-11-18 2022-05-17 北京字跳网络技术有限公司 Lyric video display method and device, electronic equipment and computer readable medium
CN112423107A (en) * 2020-11-18 2021-02-26 北京字跳网络技术有限公司 Lyric video display method and device, electronic equipment and computer readable medium
US12061647B2 (en) 2020-11-18 2024-08-13 Beijing Zitiao Network Technology Co., Ltd. Method and apparatus for lyric video display, electronic device, and computer-readable medium
CN112487248A (en) * 2020-12-01 2021-03-12 深圳市易平方网络科技有限公司 Video file label generation method and device, intelligent terminal and storage medium
CN112541353A (en) * 2020-12-24 2021-03-23 北京百度网讯科技有限公司 Video generation method, device, equipment and medium
CN112632326A (en) * 2020-12-24 2021-04-09 北京风平科技有限公司 Video production method and device based on video script semantic recognition
CN112784056A (en) * 2020-12-31 2021-05-11 北京视连通科技有限公司 Short video generation method based on video intelligent identification and intelligent semantic search
CN112784056B (en) * 2020-12-31 2021-11-23 北京视连通科技有限公司 Short video generation method based on video intelligent identification and intelligent semantic search
CN113518160A (en) * 2021-01-12 2021-10-19 腾讯科技(深圳)有限公司 Video generation method, device, equipment and storage medium
CN113518160B (en) * 2021-01-12 2025-07-25 腾讯科技(深圳)有限公司 Video generation method, device, equipment and storage medium
CN112911379A (en) * 2021-01-15 2021-06-04 北京字跳网络技术有限公司 Video generation method and device, electronic equipment and storage medium
US12033671B2 (en) 2021-01-15 2024-07-09 Beijing Zitiao Network Technology Co., Ltd. Video generation method and apparatus, electronic device, and storage medium
CN112800263A (en) * 2021-02-03 2021-05-14 上海艾麒信息科技股份有限公司 Video synthesis system, method and medium based on artificial intelligence
CN113050857A (en) * 2021-03-26 2021-06-29 北京字节跳动网络技术有限公司 Music sharing method and device, electronic equipment and storage medium
US12293062B2 (en) 2021-03-26 2025-05-06 Beijing Bytedance Network Technology Co., Ltd. Music sharing method and apparatus, electronic device, and storage medium
US11914845B2 (en) 2021-03-26 2024-02-27 Beijing Bytedance Network Technology Co., Ltd. Music sharing method and apparatus, electronic device, and storage medium
CN113709529A (en) * 2021-04-13 2021-11-26 腾讯科技(深圳)有限公司 Video synthesis method and device, electronic equipment and computer readable medium
CN113377971A (en) * 2021-05-31 2021-09-10 北京达佳互联信息技术有限公司 Multimedia resource generation method and device, electronic equipment and storage medium
CN113377971B (en) * 2021-05-31 2024-02-27 北京达佳互联信息技术有限公司 Multimedia resource generation method and device, electronic equipment and storage medium
US12271578B2 (en) 2021-06-02 2025-04-08 Beijing Zitiao Network Technology Co., Ltd. Audio sharing method and apparatus, device and medium
CN113365134A (en) * 2021-06-02 2021-09-07 北京字跳网络技术有限公司 Audio sharing method, device, equipment and medium
CN113365134B (en) * 2021-06-02 2022-11-01 北京字跳网络技术有限公司 Audio sharing method, device, equipment and medium
CN113329258A (en) * 2021-06-10 2021-08-31 王之华 Song video synthesis method and player
CN113329258B (en) * 2021-06-10 2023-02-17 王之华 Song video synthesis method and player
CN113434733A (en) * 2021-06-28 2021-09-24 平安科技(深圳)有限公司 Text-based video file generation method, device, equipment and storage medium
CN113628637A (en) * 2021-07-02 2021-11-09 北京达佳互联信息技术有限公司 Audio identification method, device, equipment and storage medium
CN113572977B (en) * 2021-07-06 2024-02-27 上海哔哩哔哩科技有限公司 Video production method and device
CN113572977A (en) * 2021-07-06 2021-10-29 上海哔哩哔哩科技有限公司 Video production method and device
CN115599951A (en) * 2021-07-09 2023-01-13 广州酷狗计算机科技有限公司(Cn) Video display method and device, storage medium and electronic equipment
CN113709548B (en) * 2021-08-09 2023-08-25 北京达佳互联信息技术有限公司 Image-based multimedia data synthesis method, device, equipment and storage medium
CN113709548A (en) * 2021-08-09 2021-11-26 北京达佳互联信息技术有限公司 Image-based multimedia data synthesis method, device, equipment and storage medium
CN113676772B (en) * 2021-08-16 2023-08-08 上海哔哩哔哩科技有限公司 Video generation method and device
CN113676772A (en) * 2021-08-16 2021-11-19 上海哔哩哔哩科技有限公司 Video generation method and device
CN113792178A (en) * 2021-08-31 2021-12-14 北京达佳互联信息技术有限公司 A song generation method, device, electronic device and storage medium
CN114286169A (en) * 2021-08-31 2022-04-05 腾讯科技(深圳)有限公司 Video generation method, device, terminal, server and storage medium
CN114286169B (en) * 2021-08-31 2023-06-20 腾讯科技(深圳)有限公司 Video generation method, device, terminal, server and storage medium
CN114245171B (en) * 2021-12-15 2023-08-29 百度在线网络技术(北京)有限公司 Video editing method and device, electronic equipment and medium
CN114245171A (en) * 2021-12-15 2022-03-25 百度在线网络技术(北京)有限公司 Video editing method, video editing device, electronic equipment and media
US12387762B2 (en) 2021-12-15 2025-08-12 Baidu Online Network Technology (Beijing) Co., Ltd. Video editing method and apparatus, electronic device and medium
CN114242070A (en) * 2021-12-20 2022-03-25 阿里巴巴(中国)有限公司 Video generation method, device, equipment and storage medium
CN114513706A (en) * 2022-03-22 2022-05-17 中国平安人寿保险股份有限公司 Video generation method and device, computer equipment and storage medium
CN115442540B (en) * 2022-08-31 2024-05-03 中国联合网络通信集团有限公司 Music video generation method, device, computer equipment and storage medium
CN115442540A (en) * 2022-08-31 2022-12-06 中国联合网络通信集团有限公司 Music video generation method, device, computer equipment and storage medium
WO2024046484A1 (en) * 2022-09-02 2024-03-07 北京字跳网络技术有限公司 Video generation method and apparatus, device, storage medium, and program product
CN115964531A (en) * 2022-11-30 2023-04-14 海尔优家智能科技(北京)有限公司 Audio file processing method and device, storage medium and electronic device
CN116386659A (en) * 2023-02-02 2023-07-04 北京达佳互联信息技术有限公司 A music video generation method, device, electronic equipment and storage medium
CN116489478A (en) * 2023-04-10 2023-07-25 杭州网易云音乐科技有限公司 Video generation method, device, medium and computing device
CN116226453B (en) * 2023-05-10 2023-09-26 北京小糖科技有限责任公司 Method, device and terminal equipment for identifying dancing teaching video clips
CN116226453A (en) * 2023-05-10 2023-06-06 北京小糖科技有限责任公司 Method, device and terminal equipment for identifying dancing teaching video clips
WO2024251224A1 (en) * 2023-06-08 2024-12-12 北京字跳网络技术有限公司 Music video generation method and apparatus, and electronic device and storage medium
CN117041426A (en) * 2023-09-19 2023-11-10 天翼爱音乐文化科技有限公司 Video color ring optimization manufacturing method, system, equipment and storage medium
CN117956247A (en) * 2023-12-27 2024-04-30 北京信息科技大学 Music-driven video automatic generation method, system, equipment and medium
CN119071389A (en) * 2024-08-08 2024-12-03 咪咕文化科技有限公司 Method, device, equipment, medium and product for generating digital human video ringback tone

Similar Documents

Publication Publication Date Title
CN111935537A (en) Music video generation method and device, electronic equipment and storage medium
KR102510317B1 (en) Method for generating tag of video, electronic device, and storage medium
KR102346046B1 (en) 3d virtual figure mouth shape control method and device
CN114578969B (en) Methods, devices, equipment and media for human-computer interaction
CN111476871B (en) Method and apparatus for generating video
CN109688463B (en) Clip video generation method and device, terminal equipment and storage medium
CN110519636B (en) Voice information playing method and device, computer equipment and storage medium
JP7240505B2 (en) Voice packet recommendation method, device, electronic device and program
JP2021192222A (en) Video image interactive method and apparatus, electronic device, computer readable storage medium, and computer program
CN111221984A (en) Multimodal content processing method, device, equipment and storage medium
WO2024235271A1 (en) Movement generation method and apparatus for virtual character, and construction method and apparatus for movement library of virtual avatar
CN111862277A (en) Method, apparatus, device and storage medium for generating animation
CN111883101B (en) A model training and speech synthesis method, device, equipment and medium
CN113572976B (en) Video processing method, device, electronic device and readable storage medium
CN113572981A (en) Video dubbing method and device, electronic equipment and storage medium
CN111414506A (en) Emotion processing method and device based on artificial intelligence, electronic equipment and storage medium
EP4546162A1 (en) Video generation method, apparatus, device and storage medium
CN115861491A (en) Generation method, electronic device, storage medium and program product of dance animation
CN112685592B (en) Method and device for generating sports video soundtrack
CN114422824A (en) Data processing method, video processing method, display method and device
CN117373455B (en) Audio and video generation method, device, equipment and storage medium
CN118781239A (en) A method, device, equipment and storage medium for generating dynamic speaking face
CN118692142A (en) Method and device for detecting sequential motion
CN118690049A (en) Method, device and electronic device for generating highlights video based on artificial intelligence
CN116939288A (en) Video generation method and device and computer equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201113
