WO2023174066A1 - Video generation method and apparatus, electronic device, and storage medium - Google Patents
Video generation method and apparatus, electronic device, and storage medium
- Publication number
- WO2023174066A1 PCT/CN2023/079301 CN2023079301W
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- target
- picture
- video
- music
- target picture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/265—Mixing
Definitions
- the present disclosure relates to the field of information technology, and in particular, to a video generation method, device, electronic device and storage medium.
- Embodiments of the present disclosure provide a video generation method, which includes:
- the target music and the target picture are synthesized to obtain a target video, and the switching rhythm of the image material in the target video matches the rhythm of the target music.
- An embodiment of the present disclosure also provides a video generation device, which includes:
- the first determination module is used to determine the content characteristics of the target image
- a second determination module configured to determine target music based on the content characteristics, and the rhythm of the target music matches the content characteristics
- a processing module configured to synthesize the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
- An embodiment of the present disclosure also provides an electronic device, where the electronic device includes:
- one or more processors;
- a storage device for storing one or more programs;
- when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the video generation method as described above.
- Embodiments of the present disclosure also provide a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the video generation method as described above is implemented.
- An embodiment of the present disclosure also provides a computer program, including: instructions that, when executed by a processor, cause the processor to perform the video generation method as described above.
- Figure 1 is a flow chart of a video generation method in an embodiment of the present disclosure
- Figure 2 is a schematic diagram of a video track and an audio track in an embodiment of the present disclosure
- Figure 3 is a schematic diagram of a video track in an embodiment of the present disclosure.
- Figure 4 is a schematic diagram of an audio track in an embodiment of the present disclosure.
- Figure 5 is a schematic diagram of a video track in an embodiment of the present disclosure.
- Figure 6 is a schematic diagram of an audio track and a video track in an embodiment of the present disclosure
- Figure 7 is a flow chart of a video generation method in an embodiment of the present disclosure.
- Figure 8 is a schematic structural diagram of a video generation device in an embodiment of the present disclosure.
- FIG. 9 is a schematic structural diagram of an electronic device in an embodiment of the present disclosure.
- the term “include” and its variations are open-ended, i.e., “including but not limited to.”
- the term “based on” means “based at least in part on.”
- the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; and the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
- Figure 1 is a flow chart of a video generation method in an embodiment of the present disclosure.
- the method can be executed by a video generation device.
- the device can be implemented in software and/or hardware.
- the device can be configured in electronic equipment such as a terminal. Terminals specifically include, but are not limited to, smartphones, PDAs, tablets, wearable devices with displays, desktops, laptops, all-in-one computers, smart home devices, etc.
- the method specifically includes the following steps.
- Step 110: Determine the content characteristics of the target image.
- the target picture can be an image material captured by the user in real time through the camera, an image material imported locally from the terminal, or an image material downloaded from the Internet.
- the content characteristics of the target picture may include the main color of the target picture, the object category in the target picture (such as people or scenery), the actions of the object (such as running, jumping, sleeping, opening mouth, etc.), and so on.
- determining the content characteristics of the target image includes:
- the picture features include at least one of the color of the target picture, the object category in the target picture (such as people or scenery), and the actions of the objects in the target picture (such as running, jumping, sleeping, opening mouth, etc.).
- the color recognition algorithm is used to identify the color of the target picture; the target picture is processed through a neural network model based on machine learning, and the object category and object action in the target picture are identified.
- the color of the target image specifically refers to the main color of the target image, such as red, yellow, blue, etc.
- the main color of the target image can be determined by counting the number of pixels of each color.
- the target image includes two colors: red and blue.
- the number of red pixels is 10,000 and the number of blue pixels is 20,000.
- the main color of the target image is blue; that is, the picture feature is blue.
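The pixel-counting rule above can be illustrated with a minimal Python sketch. It assumes each pixel has already been assigned a named color label (the source does not say how colors are quantized), and the function name `main_color` is hypothetical.

```python
from collections import Counter


def main_color(pixel_colors):
    """Return the color label that covers the largest number of pixels.

    Assumption: every pixel has already been mapped to a named color.
    """
    counts = Counter(pixel_colors)
    color, _ = counts.most_common(1)[0]
    return color


# The example from the text: 10,000 red pixels and 20,000 blue pixels.
pixels = ["red"] * 10_000 + ["blue"] * 20_000
print(main_color(pixels))  # blue
```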
- determining the emotional characteristics of the target picture based on the picture characteristics of the target picture includes:
- the emotional features that have a mapping relationship with the picture features of the target picture are determined.
- the mapping relationship may be set before determining the picture characteristics of the target picture, for example, manually or automatically by a device or algorithm.
- for example, there is a mapping relationship between the "green" picture feature and the "calm and indifferent" emotional feature, and a mapping relationship between the "gray" picture feature and the "dark and low" emotional feature.
- there is a mapping relationship between the picture feature that the object's action in the target picture is "jumping" and the emotional feature "bright and brisk".
- there is a mapping relationship between the picture feature that the object's action in the target picture is "sleeping" and the emotional feature "gentle and graceful", and so on.
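A preset mapping of this kind could be held in a simple lookup table. The sketch below uses only the four pairs named in the text; keeping colors and actions in one flat dictionary, and the names `PICTURE_TO_EMOTION` and `emotional_feature`, are assumptions made for illustration.

```python
# Picture feature -> emotional feature, using only the pairs given in the text.
PICTURE_TO_EMOTION = {
    "green": "calm and indifferent",
    "gray": "dark and low",
    "jumping": "bright and brisk",
    "sleeping": "gentle and graceful",
}


def emotional_feature(picture_feature):
    """Look up the emotional feature mapped to a picture feature.

    Returns None when no mapping has been preset for that feature.
    """
    return PICTURE_TO_EMOTION.get(picture_feature)


print(emotional_feature("jumping"))
```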
- determining the content characteristics of the target picture according to the emotional characteristics of the target picture includes:
- the emotional characteristics of the target picture are determined as the content characteristics of the target picture.
- determining the content characteristics of the target picture based on the emotional characteristics of the target picture includes:
- the emotional characteristics of the two target pictures are determined as the content characteristics of the target picture
- if the emotional characteristics of the two target pictures are not the same, the emotional characteristics of either target picture are determined as the content characteristics of the target picture.
- determining the content characteristics of the target picture according to the emotional characteristics of the target picture includes:
- the emotional characteristics of any target picture are determined as the content characteristics of the target picture.
- the number of target pictures is three, which are respectively recorded as the first target picture, the second target picture and the third target picture.
- the emotional characteristics of the first target picture are "strong and bold".
- the emotional characteristics of the second target picture are "bright and brisk".
- the emotional characteristics of the third target picture are "gentle and graceful".
- any one of "strong and bold", "bright and brisk" and "gentle and graceful" can be determined as the content feature of the three target pictures; that is, the content feature of the target picture can be "strong and bold", "bright and brisk" or "gentle and graceful".
- determining the content characteristics of the target picture according to the emotional characteristics of the target picture includes:
- the emotional feature that appears most frequently will be determined as the content feature of the target picture.
- the number of target pictures is three, which are respectively recorded as the first target picture, the second target picture and the third target picture. Assume that the emotional characteristics of the first target picture are "strong and bold", the emotional characteristics of the second target picture are "strong and bold", and the emotional characteristics of the third target picture are "gentle and graceful"; then the most frequent feature, "strong and bold", is determined as the content feature of the target picture.
- the number of target pictures is 6. Assume that the emotional features of 4 of the target pictures are all "gentle and graceful"; then it can be determined that the most frequent emotional feature is "gentle and graceful", that is, "gentle and graceful" is determined as the content feature of these six target pictures.
- each target picture corresponds to an emotional feature
- the content features of the target picture are determined based on the emotional features of the target picture, including: selecting the emotional feature with the largest number of corresponding target pictures as the content feature.
- the number of target pictures is 3, which are marked as the first target picture, the second target picture and the third target picture respectively.
- the emotional feature corresponding to the first target picture is A
- the emotional feature corresponding to the second target picture is B
- the emotional feature corresponding to the third target picture is A, that is, the number of target pictures corresponding to emotional feature A is 2, and the number of target pictures corresponding to emotional feature B is 1, so emotional feature A is determined as the content feature.
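The selection rule in the A/B example above amounts to a majority vote over per-picture emotional features. A minimal sketch follows; the function name `content_feature` is hypothetical, and resolving ties in favor of the first-seen feature is an assumption (the text only says that any picture's feature may be used when they all differ).

```python
from collections import Counter


def content_feature(emotional_features):
    """Pick the emotional feature shared by the most target pictures.

    Assumption: on a tie, the feature encountered first is returned.
    """
    counts = Counter(emotional_features)
    feature, _ = counts.most_common(1)[0]
    return feature


# First and third pictures map to A, the second to B.
print(content_feature(["A", "B", "A"]))  # A
```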
- Step 120: Determine target music based on the content characteristics, and the rhythm of the target music matches the content characteristics.
- the rhythm of the target music matches the content characteristics, so as to control the carousel of pictures and the switching rhythm of the target pictures through the rhythm of the target music.
- the target music can be passionate and fast-paced music; when the content feature is "dark and low”, the target music can be music with slow-paced changes.
- if the target picture includes people, and further includes smiling faces, the target music can be relatively cheerful music; for example, if the target picture includes a couple, the target music can be music about sweet emotions, etc.
- a mapping relationship between content features and target music can be preset. After obtaining the content features of the target picture, the corresponding target music can be determined by searching for the mapping relationship.
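As a rough illustration of searching a preset mapping, the sketch below resolves a content feature to a music track. The track file names and the specific feature-to-track pairings are hypothetical placeholders, not values from the source; only the idea of a lookup comes from the text.

```python
# Content feature -> target music. File names are hypothetical placeholders.
MUSIC_BY_CONTENT_FEATURE = {
    "dark and low": "slow_paced_track.mp3",      # slow-paced music (assumed file)
    "bright and brisk": "fast_paced_track.mp3",  # faster music (assumed pairing)
}


def target_music(content_feature, default=None):
    """Return the preset music for a content feature, or `default` if none is set."""
    return MUSIC_BY_CONTENT_FEATURE.get(content_feature, default)


print(target_music("dark and low"))
```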
- Step 130: Synthesize the target music and the target picture to obtain a target video, and the switching rhythm of the image material in the target video matches the rhythm of the target music.
- the image material in the target video can be a target picture, or it can also be a special effect, such as a fast-paced flash cut effect, a slow-paced blur to clear transition effect, etc.
- synthesizing the target music and the target picture to obtain a target video includes: determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music; and generating the target video based on the switching moment.
- determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of the target pictures, or the rhythm of the target music includes: The playback duration of the target music is evenly divided according to the number of target pictures; each split point is determined as the switching moment of the target picture in the target video, so that the playback duration of each target picture in the target video is the same.
- each target picture is switched and displayed at a constant speed according to the playing time of the target music.
- the playing time of the target music is 3 seconds
- the number of target pictures is three, which are recorded as target picture A, target picture B and target picture C respectively.
- the target music and the target picture are synthesized to obtain the target video.
- the target video is a 3s video including target picture A, target picture B and target picture C.
- Target picture A is played during 0-1s of the target video; that is, 0-1s of the target video corresponds to the video clip of target picture A.
- When the target video is played to 1s, the video picture is switched from target picture A to target picture B. Target picture B is played during 1-2s of the target video; that is, 1-2s of the target video corresponds to the video clip of target picture B. When the target video is played to 2s, the video picture is switched from target picture B to target picture C. Target picture C is played during 2-3s of the target video; that is, 2-3s of the target video corresponds to the video clip of target picture C.
- Each target picture plays for the same duration in the target video, that is, each target picture is played at a constant speed. For example, the target music is relatively soft slow-tempo music.
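The even-division rule can be sketched in a few lines of Python. The function name `even_switch_moments` is hypothetical; the sketch returns only the interior split points, since those are the moments at which one picture gives way to the next.

```python
def even_switch_moments(music_duration, picture_count):
    """Split the music duration evenly and return the interior split points.

    Each split point is the moment at which the video switches to the next picture.
    """
    if picture_count < 1:
        raise ValueError("at least one target picture is required")
    step = music_duration / picture_count
    return [step * i for i in range(1, picture_count)]


# The example from the text: 3 seconds of music and three target pictures.
print(even_switch_moments(3.0, 3))  # [1.0, 2.0]
```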
- the order in which target picture A, target picture B and target picture C are played in the target video can be pre-specified by the user, determined randomly, or determined dynamically based on some strategy, for example by determining the playback order according to the shooting time of each target picture.
- determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of the target pictures, or the rhythm of the target music includes:
- the switching moment of each target picture in the target video is determined so that the playing duration of each target picture in the target video is the target duration.
- the target duration can be preset. That is, based on the target duration for which each target picture is played in the target video, the switching moment of each target picture in the target video is determined so that the playing duration of each target picture in the target video is the target duration. For example, pre-specify that the playback time of each target picture in the target video is 1s. Assume that there are 5 target pictures in total, namely target picture A, target picture B, target picture C, target picture D and target picture E. As shown in Figure 3, target picture A is played during 0-1s of the target video.
- When the target video is played to 1s, the video picture is switched from target picture A to target picture B; target picture B is played during 1-2s of the target video.
- When the target video is played to 2s, the video picture is switched from target picture B to target picture C; target picture C is played during 2-3s of the target video.
- When the target video is played to 3s, the video picture is switched from target picture C to target picture D; target picture D is played during 3-4s of the target video.
- When the target video is played to 4s, the video picture is switched from target picture D to target picture E; target picture E is played during 4-5s of the target video.
- the total duration of the target video is 5s. If the total duration of the target music is greater than 5s, the music automatically ends at the 5th second. If the total duration of the target music is less than 5s, for example 3s, the music automatically repeats from the beginning at the 3rd second.
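A minimal sketch of the fixed-duration case follows, covering both the picture switching moments and the trim-or-repeat behavior of the music. The function name `fixed_duration_plan` and the representation of music playback as `(start, end)` spans on the video timeline are assumptions made for illustration.

```python
def fixed_duration_plan(picture_count, target_duration, music_duration):
    """Plan switching moments and music spans for a fixed per-picture duration.

    Returns (switch_moments, music_spans). Each music span is a (start, end)
    pair on the video timeline during which the track plays from its beginning.
    """
    if music_duration <= 0:
        raise ValueError("music duration must be positive")
    video_duration = picture_count * target_duration
    switch_moments = [target_duration * i for i in range(1, picture_count)]

    music_spans = []
    start = 0.0
    while start < video_duration:
        # Cut the music at the end of the video, or restart it when it runs out.
        end = min(start + music_duration, video_duration)
        music_spans.append((start, end))
        start = end
    return switch_moments, music_spans


# Five pictures at 1s each with 3s of music: the music restarts at the 3rd second.
print(fixed_duration_plan(5, 1.0, 3.0))
```

With music longer than the video (for example 8s), the same call yields a single span that is cut at the 5th second.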
- determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of the target pictures, or the rhythm of the target music includes: determining the accent rhythm point of the target music according to the rhythm of the target music; and determining the accent rhythm point as the switching moment of the target picture in the target video. That is, the switching moment of each target picture in the target video is determined according to the stuck moment of the target music, so that the switching moment of the target picture in the target video is the stuck moment.
- the target music includes five accent rhythm points, that is, stuck moments, which are 1.5s, 2.3s, 3.0s, 3.9s and 4.4s respectively.
- Figure 4 shows a schematic diagram of an audio track.
- FIG. 5 shows a schematic diagram of a video track.
- the target video starts playing from 0s, and the track information on the corresponding timeline, that is, pictures and music, is read at the same time; the start and end times of playback are based on the main video track. Therefore, the video ends when playback reaches 4.4s, and the overall presentation effect is that the pictures switch on the accent rhythm points and the rhythm changes briskly.
- the target music may be faster tempo music.
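The accent-point case can be sketched as below, using the five accent rhythm points from the example. It assumes that each accent point closes one picture's clip and that the last accent point ends the video (the text states only that the video ends at 4.4s); detecting accent points in real audio is outside this sketch, and the function name `accent_segments` is hypothetical.

```python
def accent_segments(accent_points):
    """Turn accent rhythm points into (start, end) display intervals.

    Assumption: the first picture starts at 0s and each accent point ends
    one picture's clip, so the last accent point is the end of the video.
    """
    starts = [0.0] + list(accent_points[:-1])
    return list(zip(starts, accent_points))


# Accent rhythm points from the text, in seconds.
points = [1.5, 2.3, 3.0, 3.9, 4.4]
for index, (start, end) in enumerate(accent_segments(points), start=1):
    print(f"picture {index}: {start}s to {end}s")
```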
- synthesizing the target music and the target picture to obtain the target video includes, for example: assuming that the number of target pictures is 5, the target music is a 12s slow-tempo piece, and the playback duration of each target picture is specified as 2s, a new audio track is generated inside the video player for management, consistent with the starting time of the video. That is, the video starts from 0s and plays to the 10th second, and the target music also plays to the 10th second. The overall effect is a uniform slow-tempo playback.
- synthesizing the target music and the target picture to obtain the target video also includes: determining the switching moment of the special effects in the target video based on the rhythm of the target music; and generating the target video based on the switching moment of the target picture in the target video and the switching moment of the special effects.
- the special effects include, for example, fast-paced flash-cut special effects, slow-paced blur-to-clear transition special effects, etc.
- the video generation method provided by the embodiment of the present disclosure realizes personalized control of the playback rhythm of the target picture in the target video according to the content characteristics of the target picture, so that the presentation effect of the target video is better and the user experience is improved. For example, if the content feature of the target image is bright and cheerful, a video with a strong rhythm will eventually be generated; if the content feature of the target image is dark and gloomy, a video with a slow rhythm will be generated. As a result, the playback effect of the target video is associated with the content characteristics of the target picture, and the generation of personalized target videos is achieved. Moreover, it realizes the automation of video generation, simplifies the operation of users to generate videos based on pictures, and improves the efficiency and user experience of video generation.
- the content characteristics are determined based on the target picture, the target music is then determined based on the content characteristics, and the rhythm change of the target video including the target picture is controlled based on the target music.
- FIG. 8 is a schematic structural diagram of a video generation device in an embodiment of the present disclosure.
- the video generation device specifically includes: a first determination module 810, a second determination module 820 and a processing module 830.
- the first determination module 810 is used to determine the content characteristics of the target picture; the second determination module 820 is used to determine the target music based on the content characteristics, where the rhythm of the target music matches the content characteristics; the processing module 830 is used to synthesize the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
- the first determination module 810 includes: a first determination unit, configured to determine the picture characteristics of the target picture through a picture characteristics determination algorithm; a second determination unit, configured to determine the emotional characteristics of the target picture based on the picture characteristics of the target picture; and a third determination unit, configured to determine the content characteristics of the target picture according to the emotional characteristics of the target picture.
- the picture characteristics include at least one of the color of the target picture, the category of the object in the target picture, or the action of the object in the target picture.
- the second determination unit is specifically configured to: based on the mapping relationship between the picture features and the emotional features, determine the emotional features that have a mapping relationship with the picture features of the target picture.
- each target picture corresponds to an emotional feature
- the third determination unit is specifically configured to: select the emotional feature with the largest number of corresponding target pictures as the content feature.
- the processing module 830 includes: a determining unit configured to determine, based on at least one of the playing duration of the target music, the number of the target pictures, or the rhythm of the target music, that each of the target pictures is in the target A switching moment in the video; a generating unit configured to generate the target video according to the switching moment.
- the determining unit is specifically configured to: evenly divide the playback duration of the target music according to the number of the target pictures; and determine each dividing point as the switching moment of the target picture in the target video, so that the playback duration of each target picture in the target video is the same.
- the determination unit is specifically configured to: determine the switching moment of each target picture in the target video according to the target duration of playback of each target picture in the target video, so that the playback duration of each target picture in the target video is the target duration.
- the determining unit is specifically configured to: determine the accent rhythm point of the target music according to the rhythm of the target music; and determine the accent rhythm point as the switching moment of the target picture in the target video.
- the determining unit is further configured to determine the switching moment of the special effects in the target video based on the rhythm of the target music; and the generating unit is configured to generate the target video based on the switching moment of the target picture in the target video and the switching moment of the special effects.
- the video generation device provided by the embodiments of the present disclosure can execute the steps in the method provided by the method embodiments of the present disclosure. The execution steps and beneficial effects will not be described again here.
- FIG. 9 is a schematic structural diagram of an electronic device in an embodiment of the present disclosure.
- the electronic device 500 in the embodiment of the present disclosure may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), vehicle-mounted terminals (such as car navigation terminals) and wearable electronic devices, as well as fixed terminals such as digital TVs, desktop computers and smart home devices.
- the electronic device shown in FIG. 9 is only an example and should not impose any limitations on the functions and scope of use of the embodiments of the present disclosure.
- the electronic device 500 may include a processing device (e.g., central processing unit, graphics processor, etc.) 501, which may perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 502 or a program loaded from a storage device 508 into a random access memory (RAM) 503, so as to implement the video generation method according to the embodiments of the present disclosure.
- in the RAM 503, various programs and data required for the operation of the electronic device 500 are also stored.
- the processing device 501, ROM 502 and RAM 503 are connected to each other via a bus 504.
- An input/output (I/O) interface 505 is also connected to bus 504.
- input devices 506 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.
- output devices 507 including, for example, a liquid crystal display (LCD), speakers, a vibrator, etc.
- a storage device 508 including, for example, a magnetic tape, a hard disk, etc.
- Communication device 509 may allow electronic device 500 to communicate wirelessly or wiredly with other devices to exchange data.
- Although FIG. 9 illustrates the electronic device 500 with various means, it should be understood that implementation or availability of all illustrated means is not required. More or fewer means may alternatively be implemented or provided.
- embodiments of the present disclosure include a computer program product, which includes a computer program carried on a non-transitory computer-readable medium, the computer program including program code for executing the method shown in the flowchart, thereby achieving the video generation method described above.
- the computer program may be downloaded and installed from the network via communication device 509, or from storage device 508, or from ROM 502.
- When the computer program is executed by the processing device 501, the above-mentioned functions defined in the method of the embodiment of the present disclosure are performed.
- the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
- the computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), fiber optics, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
- a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.
- a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
- a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device.
- Program code embodied on a computer-readable medium may be transmitted using any suitable medium, including but not limited to: wire, optical cable, RF (radio frequency), etc., or any suitable combination of the above.
- the client and server can communicate using any currently known or future developed network protocol such as HTTP (HyperText Transfer Protocol), and can be interconnected with digital data communication in any form or medium (e.g., a communication network).
- examples of communication networks include local area networks (“LAN”), wide area networks (“WAN”), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed network.
- the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; it may also exist independently without being assembled into the electronic device.
- the computer-readable medium carries one or more programs.
- when the one or more programs are executed by the electronic device, the electronic device: determines the content characteristics of the target picture; determines the target music based on the content characteristics, where the rhythm of the target music matches the content characteristics; and synthesizes the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
- the electronic device may also perform other steps described in the above embodiments.
- Computer program code for performing the operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages such as Java, Smalltalk and C++, as well as conventional procedural programming languages such as "C" or similar programming languages.
- the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
- the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, through the Internet using an Internet service provider).
- LAN local area network
- WAN wide area network
- each block in the flowchart or block diagram may represent a module, segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions.
- the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved.
- each block of the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, can be implemented by special-purpose hardware-based systems that perform the specified functions or operations, or can be implemented using a combination of special-purpose hardware and computer instructions.
- the units involved in the embodiments of the present disclosure can be implemented in software or hardware. In some cases, the name of a unit does not constitute a limitation on the unit itself.
- FPGAs Field Programmable Gate Arrays
- ASICs Application Specific Integrated Circuits
- ASSPs Application Specific Standard Products
- SOCs Systems on Chips
- CPLDs Complex Programmable Logic Devices
- a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
- the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
- Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatuses, or devices, or any suitable combination of the foregoing.
- more specific examples of machine-readable storage media include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
- RAM random access memory
- ROM read only memory
- EPROM or flash memory erasable programmable read only memory
- CD-ROM portable compact disk read-only memory
- the present disclosure provides a video generation method, including: determining content features of a target picture; determining target music based on the content features, where the rhythm of the target music matches the content features; and synthesizing the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
- determining the content features of the target picture includes: determining the picture features of the target picture through a picture feature determination algorithm; determining the emotional features of the target picture based on the picture features of the target picture; and determining the content features of the target picture based on the emotional features of the target picture.
- the picture features include at least one of the color of the target picture, the object category in the target picture, or the action of the object in the target picture.
- determining the emotional features of the target picture based on the picture features of the target picture includes: based on the mapping relationship between picture features and emotional features, determining the emotional features that have a mapping relationship with the picture features of the target picture.
- each target picture corresponds to an emotional feature
- determining the content features of the target picture according to the emotional features of the target picture includes: selecting the emotional feature with the largest number of corresponding target pictures as the content feature.
- synthesizing the target music and the target picture to obtain a target video includes: determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music; and generating the target video according to the switching moments.
- determining the switching moment of each target picture in the target video includes: evenly dividing the playback duration of the target music according to the number of target pictures; and determining each dividing point as a switching moment of a target picture in the target video, so that every target picture has the same playback duration in the target video.
- determining the switching moment of each target picture in the target video includes: determining the switching moment of each target picture in the target video according to the target duration for which each target picture is played in the target video, so that the playback duration of each target picture in the target video is the target duration.
- determining the switching moment of each target picture in the target video includes: determining the accent rhythm points of the target music according to the rhythm of the target music; and determining the accent rhythm points as the switching moments of the target pictures in the target video.
- synthesizing the target music and the target picture to obtain the target video includes: determining the switching moments of the special effects in the target video based on the rhythm of the target music; and generating the target video according to the switching moments of the target pictures in the target video and the switching moments of the special effects.
- the present disclosure provides a video generation device, including: a first determination module for determining content features of a target picture; a second determination module for determining target music based on the content features, where the rhythm of the target music matches the content features; and a processing module for synthesizing the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
- the present disclosure provides an electronic device, including:
- one or more processors
- a memory for storing one or more programs
- when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement any of the video generation methods provided by this disclosure.
- the present disclosure provides a computer-readable storage medium having a computer program stored thereon.
- when the program is executed by a processor, any of the video generation methods provided by the present disclosure is implemented.
- the present disclosure provides a computer program including: instructions that, when executed by a processor, cause the processor to perform the video generation method as described above.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
Description
Cross-reference to related applications
This application is based on, and claims priority to, Chinese application No. 202210270283.6, filed on March 18, 2022, the disclosure of which is hereby incorporated into this application in its entirety.
The present disclosure relates to the field of information technology, and in particular to a video generation method and apparatus, an electronic device, and a storage medium.
With the rapid development of network technology and mobile terminal technology, more and more applications have emerged, for example applications related to picture editing. Through such applications, users can edit pictures, including generating videos related to the pictures.
Currently, when pictures are edited to generate a related video, the user has to manually select different music to control the playback rhythm of the video, that is, of the pictures.
Summary
Embodiments of the present disclosure provide a video generation method, which includes:
determining the content features of a target picture;
determining target music based on the content features, where the rhythm of the target music matches the content features;
synthesizing the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
Embodiments of the present disclosure also provide a video generation device, which includes:
a first determination module, configured to determine the content features of a target picture;
a second determination module, configured to determine target music based on the content features, where the rhythm of the target music matches the content features;
a processing module, configured to synthesize the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
Embodiments of the present disclosure also provide an electronic device, which includes:
one or more processors;
a storage device for storing one or more programs;
when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the video generation method described above.
Embodiments of the present disclosure also provide a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the video generation method described above is implemented.
Embodiments of the present disclosure also provide a computer program, including instructions that, when executed by a processor, cause the processor to perform the video generation method described above.
The above and other features, advantages, and aspects of the embodiments of the present disclosure will become more apparent from the following detailed description taken in conjunction with the accompanying drawings. Throughout the drawings, the same or similar reference numerals denote the same or similar elements. It should be understood that the drawings are schematic and that components and elements are not necessarily drawn to scale.
Figure 1 is a flowchart of a video generation method in an embodiment of the present disclosure;
Figure 2 is a schematic diagram of a video track and an audio track in an embodiment of the present disclosure;
Figure 3 is a schematic diagram of a video track in an embodiment of the present disclosure;
Figure 4 is a schematic diagram of an audio track in an embodiment of the present disclosure;
Figure 5 is a schematic diagram of a video track in an embodiment of the present disclosure;
Figure 6 is a schematic diagram of an audio track and a video track in an embodiment of the present disclosure;
Figure 7 is a flowchart of a video generation method in an embodiment of the present disclosure;
Figure 8 is a schematic structural diagram of a video generation device in an embodiment of the present disclosure;
Figure 9 is a schematic structural diagram of an electronic device in an embodiment of the present disclosure.
Embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be implemented in various forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for illustrative purposes only and are not intended to limit the scope of protection of the present disclosure.
It should be understood that the steps described in the method embodiments of the present disclosure may be performed in a different order and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit some of the illustrated steps. The scope of the present disclosure is not limited in this regard.
As used herein, the term "include" and its variations are open-ended, that is, "including but not limited to". The term "based on" means "based at least in part on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms are given in the description below.
It should be noted that concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules, or units, and are not used to limit the order of, or the interdependence between, the functions performed by these devices, modules, or units.
It should be noted that the modifiers "one" and "multiple" mentioned in the present disclosure are illustrative rather than restrictive. Those skilled in the art should understand that, unless the context clearly indicates otherwise, they should be read as "one or more".
The names of messages or information exchanged between multiple devices in the embodiments of the present disclosure are for illustrative purposes only and are not intended to limit the scope of these messages or information.
After analysis, the inventors found that the related technology suffers from cumbersome user operations and a low degree of intelligence.
Figure 1 is a flowchart of a video generation method in an embodiment of the present disclosure. The method can be executed by a video generation device, which can be implemented in software and/or hardware and can be configured in an electronic device such as a terminal, including but not limited to smartphones, handheld computers, tablets, wearable devices with displays, desktop computers, laptops, all-in-one computers, and smart home devices.
As shown in Figure 1, the method includes the following steps.
Step 110: Determine the content features of the target picture.
The target picture can be image material captured by the user in real time with a camera, image material imported locally from the terminal, or image material downloaded from the Internet.
The content features of the target picture may include the main color of the target picture, the object category in the target picture (such as people or scenery), and the actions of the object (such as running, jumping, sleeping, or opening the mouth).
Optionally, determining the content features of the target picture includes:
determining the picture features of the target picture through a picture feature determination algorithm; determining the emotional features of the target picture based on the picture features of the target picture; and determining the content features of the target picture based on the emotional features of the target picture.
The picture features include at least one of the color of the target picture, the object category in the target picture (such as people or scenery), and the action of the object in the target picture (such as running, jumping, sleeping, or opening the mouth). Optionally, the color of the target picture can be identified by a color recognition algorithm, and the object category and the object's action in the target picture can be identified by processing the target picture with a machine-learning-based neural network model.
The color of the target picture refers specifically to its main color, such as red, yellow, or blue. The main color can be determined by counting the number of pixels of each color. For example, if the target picture contains two colors, red and blue, with 10,000 red pixels and 20,000 blue pixels, the main color of the target picture is determined to be blue, that is, the picture feature is blue.
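The pixel-counting rule above can be sketched as follows. This is a minimal illustration rather than the method of the disclosure: it assumes each pixel has already been quantized to a named color label, and the function name `dominant_color` is hypothetical.

```python
from collections import Counter


def dominant_color(pixel_colors):
    """Return the color label that covers the most pixels."""
    if not pixel_colors:
        raise ValueError("the picture has no pixels")
    counts = Counter(pixel_colors)
    # most_common(1) yields the (label, count) pair with the highest count.
    color, _count = counts.most_common(1)[0]
    return color


# Example from the description: 10,000 red pixels and 20,000 blue pixels.
pixels = ["red"] * 10_000 + ["blue"] * 20_000
print(dominant_color(pixels))  # blue
```

In practice the quantization step (mapping raw RGB values to labels such as "red" or "blue") would come first; it is omitted here because the description does not specify it.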
Further, determining the emotional features of the target picture based on the picture features of the target picture includes:
based on the mapping relationship between picture features and emotional features, determining the emotional features that have a mapping relationship with the picture features of the target picture. The mapping relationship may be set before the picture features of the target picture are determined, for example manually, or automatically by a device or algorithm.
For example, the picture feature "red" maps to the emotional feature "strong and bold"; "yellow" maps to "bright and brisk"; "blue" maps to "gentle and graceful"; "green" maps to "calm and indifferent"; and "gray" maps to "dark and low". After the picture features of the target picture are determined, the emotional features mapped to them can be determined from this preset mapping relationship.
Alternatively, for example, the picture feature in which the action of the object in the target picture is "jump" maps to the emotional feature "bright and brisk", and the picture feature in which the action of the object is "sleep" maps to the emotional feature "gentle and graceful", and so on.
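A preset mapping of this kind can be sketched as a lookup table. The feature/emotion pairs below come from the examples in the description; the dictionary layout and the name `emotional_feature` are assumptions made for illustration only.

```python
# Pairs taken from the examples in the description; the table itself is illustrative.
FEATURE_TO_EMOTION = {
    "red": "strong and bold",
    "yellow": "bright and brisk",
    "blue": "gentle and graceful",
    "green": "calm and indifferent",
    "gray": "dark and low",
    "jump": "bright and brisk",
    "sleep": "gentle and graceful",
}


def emotional_feature(picture_feature):
    """Look up the emotional feature mapped to a picture feature, or None."""
    return FEATURE_TO_EMOTION.get(picture_feature)


print(emotional_feature("blue"))  # gentle and graceful
```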
In some embodiments, when there is one target picture, determining the content features of the target picture according to the emotional features of the target picture includes:
taking the emotional feature of the target picture as the content feature of the target picture.
When there are two target pictures, determining the content features of the target pictures according to the emotional features of the target pictures includes:
if the two target pictures have the same emotional feature, taking that emotional feature as the content feature of the target pictures;
if the two target pictures have different emotional features, taking the emotional feature of either target picture as the content feature of the target pictures.
When there are at least three target pictures, determining the content features of the target pictures according to the emotional features of the target pictures includes:
if the emotional features of the target pictures are all different from one another, taking the emotional feature of any one target picture as the content feature of the target pictures. For example, there are three target pictures, denoted the first, second, and third target pictures. Suppose the emotional feature of the first target picture is "strong and bold", that of the second is "bright and brisk", and that of the third is "gentle and graceful". Then any one of "strong and bold", "bright and brisk", and "gentle and graceful" can be taken as the content feature of these three target pictures.
When there are at least three target pictures, determining the content features of the target pictures according to the emotional features of the target pictures also includes:
if different target pictures share the same emotional feature, taking the emotional feature that occurs most often as the content feature of the target pictures. For example, there are three target pictures, denoted the first, second, and third target pictures. Suppose the emotional feature of the first target picture is "strong and bold", that of the second is "strong and bold", and that of the third is "gentle and graceful". Then "strong and bold", which occurs most often, is taken as the content feature of the target pictures. As another example, there are six target pictures, and four of them have the emotional feature "gentle and graceful". The most frequent emotional feature is therefore "gentle and graceful", which is taken as the content feature of these six target pictures.
In summary, when there are multiple target pictures, each target picture has a corresponding emotional feature, and determining the content features of the target pictures according to their emotional features includes selecting the emotional feature that corresponds to the largest number of target pictures as the content feature. For example, there are three target pictures, denoted the first, second, and third target pictures. The emotional feature of the first target picture is A, that of the second is B, and that of the third is A. Emotional feature A thus corresponds to two target pictures and emotional feature B to one, so emotional feature A is taken as the content feature.
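The selection rule summarized above amounts to a majority vote over per-picture emotional features. The sketch below is one way to write it; the function name `content_feature` is hypothetical, and breaking ties by first occurrence is an assumption (the description allows any of the tied features to be chosen).

```python
from collections import Counter


def content_feature(emotional_features):
    """Pick the emotional feature shared by the most target pictures."""
    if not emotional_features:
        raise ValueError("at least one target picture is required")
    # On a tie, the feature seen first wins; the description allows any of
    # the tied features, so this tie-break is only one possible choice.
    feature, _count = Counter(emotional_features).most_common(1)[0]
    return feature


# Example from the description: pictures 1 and 3 map to A, picture 2 to B.
print(content_feature(["A", "B", "A"]))  # A
```

The single-picture and two-picture cases fall out of the same function: one picture returns its own feature, and two pictures with different features return the first one.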
Step 120: Determine target music based on the content features, where the rhythm of the target music matches the content features.
Optionally, the rhythm of the target music matches the content features, so that the rhythm of the target music controls the carousel of pictures, that is, the switching rhythm of the target pictures. For example, when the content feature is "strong and bold", the target music can be passionate, fast-paced music; when the content feature is "dark and low", the target music can be slow-paced music.
Optionally, if the target picture includes people, and in particular smiling faces, the target music can be relatively cheerful music; if the target picture includes a couple, the target music can be music about sweet emotions, and so on.
Optionally, a mapping relationship between content features and target music can be preset. Once the content features of the target picture are obtained, the corresponding target music can be determined by looking up the mapping relationship.
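As a rough sketch of such a preset lookup, the table below maps two content features from the description to music entries. The track file names, the `tempo` field, and the function name `target_music` are all hypothetical placeholders, not part of the disclosure.

```python
# Hypothetical preset table: content feature -> music entry.
CONTENT_FEATURE_TO_MUSIC = {
    "strong and bold": {"track": "fast_track.mp3", "tempo": "fast"},
    "dark and low": {"track": "slow_track.mp3", "tempo": "slow"},
}


def target_music(content_feature):
    """Return the preset music entry for a content feature."""
    try:
        return CONTENT_FEATURE_TO_MUSIC[content_feature]
    except KeyError:
        # No preset exists for this feature; the caller decides on a fallback.
        raise LookupError(f"no music preset for {content_feature!r}") from None


print(target_music("strong and bold")["tempo"])  # fast
```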
Step 130: Synthesize the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
The image material in the target video can be target pictures or special effects, such as a fast-paced flash-cut effect or a slow-paced blur-to-clear transition effect.
As an example, synthesizing the target music and the target picture to obtain a target video includes: determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music; and generating the target video according to the switching moments.
Optionally, determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music includes: evenly dividing the playback duration of the target music according to the number of target pictures; and determining each dividing point as a switching moment of a target picture in the target video, so that every target picture has the same playback duration in the target video.
In other words, the target pictures are switched at a constant pace over the playback duration of the target music. For example, the playback duration of the target music is 3 s and there are three target pictures, denoted target picture A, target picture B, and target picture C. Synthesizing the target music and the target pictures yields a 3 s target video containing target pictures A, B, and C. As an example, see the schematic diagram of a video track and an audio track in Figure 2. Target picture A is played during 0-1 s of the target video, that is, 0-1 s of the target video corresponds to the video clip of target picture A; when playback reaches 1 s, the video frame switches from target picture A to target picture B. Target picture B is played during 1-2 s, which corresponds to the video clip of target picture B; when playback reaches 2 s, the video frame switches from target picture B to target picture C. Target picture C is played during 2-3 s, which corresponds to the video clip of target picture C. Every target picture has the same playback duration in the target video, that is, the target pictures are played at a constant pace. In this example, the target music is relatively soft, slow-paced music.
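The even split described above reduces to simple arithmetic: with N pictures and music of duration T, picture i gives way to picture i+1 at time i * T / N. A minimal sketch, with the hypothetical function name `uniform_switch_moments`:

```python
def uniform_switch_moments(music_duration, num_pictures):
    """Return the moments (in seconds) at which the picture changes."""
    if num_pictures < 1:
        raise ValueError("at least one target picture is required")
    slot = music_duration / num_pictures  # playback time per picture
    # N pictures need N - 1 switches; the last picture runs to the end.
    return [i * slot for i in range(1, num_pictures)]


# Example from the description: 3 s of music and pictures A, B, C.
print(uniform_switch_moments(3.0, 3))  # [1.0, 2.0]
```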
需要说明的是,目标图片A、目标图片B和目标图片C在目标视频中的播放顺序可以由用户预先指定,也可以随机确定,还可以基于一些策略动态化确定,例如按照各目标图片的拍摄时间确定播放顺序。It should be noted that the order in which target picture A, target picture B and target picture C are played in the target video can be pre-specified by the user, can be determined randomly, or can be determined dynamically based on some strategies, such as according to the shooting order of each target picture. The time determines the playback order.
In some embodiments, determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music includes:
The switching moment of each target picture in the target video is determined according to the target duration for which each target picture is to be played in the target video, so that each target picture is played for exactly the target duration. The target duration may be preset. For example, suppose each target picture is specified in advance to play for 1 s and there are five target pictures in total: target picture A, target picture B, target picture C, target picture D and target picture E. By way of example, as shown in Figure 3, target picture A is played during seconds 0-1 of the target video, and when playback reaches the 1 s mark the video picture switches from target picture A to target picture B; target picture B is played during seconds 1-2, and at the 2 s mark the video picture switches from target picture B to target picture C; target picture C is played during seconds 2-3, and at the 3 s mark the video picture switches from target picture C to target picture D; target picture D is played during seconds 3-4, and at the 4 s mark the video picture switches from target picture D to target picture E; target picture E is played during seconds 4-5. The total duration of the target video is therefore 5 s. If the target music is longer than 5 s, it stops automatically at the 5 s mark; if the target music is shorter than 5 s, say 3 s, it automatically restarts from the beginning when playback reaches the 3 s mark.
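The fixed-duration case can be sketched as follows. The function name `fixed_duration_timeline` and the tuple it returns are illustrative assumptions. The sketch also assumes that music shorter than the video keeps looping until the video ends, which the text only states for the first restart.

```python
def fixed_duration_timeline(picture_count: int, per_picture: float, music_duration: float):
    """Return (video_duration, switch_times, music_passes) for pictures that
    each play for `per_picture` seconds.

    music_passes lists the (start, end) video intervals covered by each pass
    of the music: one truncated pass if the music is long enough, several
    passes if it has to loop.
    """
    if picture_count < 1 or per_picture <= 0 or music_duration <= 0:
        raise ValueError("counts and durations must be positive")
    video_duration = picture_count * per_picture
    switch_times = [per_picture * i for i in range(1, picture_count)]

    music_passes = []
    start = 0.0
    while start < video_duration:
        # A pass ends when the music runs out or the video ends, whichever is first.
        end = min(start + music_duration, video_duration)
        music_passes.append((start, end))
        start = end
    return video_duration, switch_times, music_passes


# Five pictures at 1 s each with 3 s of music: the music restarts at 3 s.
print(fixed_duration_timeline(5, 1.0, 3.0))
# Five pictures at 1 s each with 8 s of music: the music is cut off at 5 s.
print(fixed_duration_timeline(5, 1.0, 8.0))
```

The video track governs the total length here: the music is either truncated or repeated to fit it, never the other way around.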
In some embodiments, determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music includes: determining the accent rhythm points of the target music according to the rhythm of the target music; and taking the accent rhythm points as the switching moments of the target pictures in the target video. That is, the switching moment of each target picture in the target video is determined from the beat-sync moments of the target music, so that every picture switch falls on a beat-sync moment.
For example, the target music contains five accent rhythm points (beat-sync moments), at 1.5 s, 2.3 s, 3.0 s, 3.9 s and 4.4 s. Figure 4 shows a schematic diagram of an audio track. When this target music is applied to generate a target video from the target pictures (assume there are five), target picture A is played from 0 s to 1.5 s of the target video, and at 1.5 s the video picture switches from target picture A to target picture B; target picture B is played from 1.5 s to 2.3 s, and at 2.3 s the video picture switches from target picture B to target picture C; target picture C is played from 2.3 s to 3.0 s, and at 3.0 s the video picture switches from target picture C to target picture D; target picture D is played from 3.0 s to 3.9 s, and at 3.9 s the video picture switches from target picture D to target picture E; target picture E is played from 3.9 s to 4.4 s, and when 4.4 s is reached the target video ends. Figure 5 shows a schematic diagram of a video track. The target video starts playing at 0 s while the track information on the corresponding timeline (pictures plus music) is read, and the start and end of playback are governed by the main video track. The video therefore ends at 4.4 s, and the overall effect is that the cuts land on the beat with a brisk change of rhythm. The target music may be relatively fast-tempo music.
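The beat-synced case can be sketched as below. The accent rhythm points are taken as given input; how they are detected from the audio is outside this sketch. The function name `beat_synced_segments` is an assumption made for illustration.

```python
def beat_synced_segments(accent_points: list[float], pictures: list[str]):
    """Assign each picture the interval that ends at the next accent rhythm
    point, so that every picture switch lands on a beat.

    Returns a list of (picture, start, end) tuples; the video ends at the
    accent point that closes the last picture.
    """
    points = sorted(accent_points)
    if len(points) < len(pictures):
        raise ValueError("need at least one accent point per picture")
    segments = []
    start = 0.0
    for picture, end in zip(pictures, points):
        segments.append((picture, start, end))
        start = end
    return segments


# The example from the text: five accent points and five pictures.
for segment in beat_synced_segments([1.5, 2.3, 3.0, 3.9, 4.4], ["A", "B", "C", "D", "E"]):
    print(segment)
```

For the five accent points in the example, picture A occupies 0-1.5 s and picture E occupies 3.9-4.4 s, so the video ends at 4.4 s as described.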
Further, when the video rhythm is controlled by uniform-rate playback, synthesizing the target music with the target pictures to obtain the target video works as follows. Suppose there are five target pictures, the target music is a 12 s piece of slow-tempo music, and each target picture is specified to play for 2 s. The video player then creates a new audio track internally to manage the music, aligned with the start of the video: playback starts at 0 s and ends at the 10 s mark, at which point the target music also stops. The overall effect is uniform, slow-paced playback. For the corresponding illustration, refer to the schematic diagram of an audio track and a video track shown in Figure 6.
Optionally, synthesizing the target music with the target pictures to obtain the target video further includes: determining the switching moments of special effects in the target video based on the rhythm of the target music; and generating the target video according to the switching moments of the target pictures in the target video and the switching moments of the special effects. The special effects are, for example, a flash-cut effect for a fast tempo, or a blur-to-sharp transition effect for a slow tempo.
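One way to picture the combination of picture switches and effect switches is a single time-ordered event list that a renderer could walk through. This is only a sketch under that assumption; the function name `merge_timeline` and the event labels are hypothetical and do not come from the disclosure.

```python
def merge_timeline(picture_switches: list[float], effect_switches: list[float]):
    """Merge picture-switch and effect-switch moments into one list of
    (time, kind) events sorted by time."""
    events = [(t, "picture") for t in picture_switches]
    events += [(t, "effect") for t in effect_switches]
    # Tuples sort by time first; at equal times "effect" sorts before "picture".
    return sorted(events)


print(merge_timeline([1.5, 2.3], [1.5, 2.0]))
# [(1.5, 'effect'), (1.5, 'picture'), (2.0, 'effect'), (2.3, 'picture')]
```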
The video generation method provided by the embodiments of the present disclosure controls the playback rhythm of the target pictures in the target video individually according to the content features of the target pictures, so that the target video is presented better and the user experience is improved. For example, if the content feature of the target pictures is bright and cheerful, a video with a strong sense of rhythm is generated; if the content feature of the target pictures is dark and gloomy, a video with a slow rhythm is generated. The playback effect of the target video is thereby tied to the content features of the target pictures, achieving the generation of a personalized target video. In addition, video generation is automated, which simplifies the user's operations for generating a video from pictures and improves both the efficiency of video generation and the user experience.
On the basis of the above embodiments, refer to the schematic flowchart of a video generation method shown in Figure 7. Specifically, the content features are determined from the target pictures, the target music is then determined from the content features, and the rhythm changes of the video are controlled according to the target music, yielding a target video that includes the target pictures.
Figure 8 is a schematic structural diagram of a video generation apparatus in an embodiment of the present disclosure. The video generation apparatus specifically includes a first determination module 810, a second determination module 820 and a processing module 830.
The first determination module 810 is configured to determine the content features of the target picture. The second determination module 820 is configured to determine the target music based on the content features, the rhythm of the target music matching the content features. The processing module 830 is configured to synthesize the target music with the target picture to obtain the target video, the switching rhythm of the image material in the target video matching the rhythm of the target music.
Optionally, the first determination module 810 includes: a first determination unit, configured to determine the picture features of the target picture by means of a picture-feature determination algorithm; a second determination unit, configured to determine the emotional features of the target picture based on the picture features of the target picture; and a third determination unit, configured to determine the content features of the target picture according to the emotional features of the target picture.
Optionally, the picture features include at least one of the color of the target picture, the category of an object in the target picture, or the action of an object in the target picture.
Optionally, the second determination unit is specifically configured to: determine, based on a mapping relationship between picture features and emotional features, the emotional feature that is mapped to the picture features of the target picture.
Optionally, when there are multiple target pictures, each target picture has its own corresponding emotional feature, and the third determination unit is specifically configured to: select, as the content feature, the emotional feature that corresponds to the largest number of target pictures.
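The selection rule above amounts to a majority vote over per-picture emotional features. A minimal sketch follows, assuming the emotional features are plain string labels. The function name `dominant_emotion` and the labels are hypothetical, and the tie-breaking behavior is an assumption because the text does not specify one.

```python
from collections import Counter


def dominant_emotion(emotions: list[str]) -> str:
    """Return the emotional feature shared by the most target pictures.

    On a tie, the label that appears first in the input wins, because
    Counter.most_common keeps first-encountered order for equal counts.
    """
    if not emotions:
        raise ValueError("at least one emotional feature is required")
    return Counter(emotions).most_common(1)[0][0]


# Three pictures: two judged cheerful, one judged gloomy.
print(dominant_emotion(["cheerful", "gloomy", "cheerful"]))  # cheerful
```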
Optionally, the processing module 830 includes: a determination unit, configured to determine the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music; and a generation unit, configured to generate the target video according to the switching moments.
Optionally, the determination unit is specifically configured to: divide the playback duration of the target music evenly according to the number of target pictures; and take each division point as a switching moment of the target pictures in the target video, so that every target picture is played for the same duration in the target video.
Optionally, the determination unit is specifically configured to: determine the switching moment of each target picture in the target video according to the target duration for which each target picture is to be played in the target video, so that each target picture is played for the target duration in the target video.
Optionally, the determination unit is specifically configured to: determine the accent rhythm points of the target music according to the rhythm of the target music; and take the accent rhythm points as the switching moments of the target pictures in the target video.
Optionally, the determination unit is further configured to determine the switching moments of special effects in the target video based on the rhythm of the target music; and the generation unit is configured to generate the target video according to the switching moments of the target pictures in the target video and the switching moments of the special effects.
The video generation apparatus provided by the embodiments of the present disclosure can perform the steps of the method provided by the method embodiments of the present disclosure; the specific steps and beneficial effects are not repeated here.
Figure 9 is a schematic structural diagram of an electronic device in an embodiment of the present disclosure. Referring specifically to Figure 9, it shows a schematic structural diagram of an electronic device 500 suitable for implementing the embodiments of the present disclosure. The electronic device 500 in the embodiments of the present disclosure may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), vehicle-mounted terminals (for example, vehicle navigation terminals) and wearable electronic devices, as well as fixed terminals such as digital TVs, desktop computers and smart home devices. The electronic device shown in Figure 9 is only an example and should not impose any limitation on the functions or scope of use of the embodiments of the present disclosure.
As shown in Figure 9, the electronic device 500 may include a processing device 501 (for example, a central processing unit or a graphics processor), which can perform various appropriate actions and processing according to a program stored in a read-only memory (ROM) 502 or a program loaded from a storage device 508 into a random access memory (RAM) 503, so as to implement the video generation method of the embodiments described in the present disclosure. The RAM 503 also stores various programs and data required for the operation of the electronic device 500. The processing device 501, the ROM 502 and the RAM 503 are connected to one another via a bus 504. An input/output (I/O) interface 505 is also connected to the bus 504.
Generally, the following devices may be connected to the I/O interface 505: an input device 506 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer or gyroscope; an output device 507 including, for example, a liquid crystal display (LCD), speaker or vibrator; a storage device 508 including, for example, a magnetic tape or hard disk; and a communication device 509. The communication device 509 may allow the electronic device 500 to communicate with other devices, wirelessly or by wire, to exchange data. Although Figure 9 shows the electronic device 500 with various devices, it should be understood that it is not required to implement or provide all of the devices shown. More or fewer devices may alternatively be implemented or provided.
In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer-readable medium, the computer program containing program code for performing the method shown in the flowchart, thereby implementing the video generation method described above. In such embodiments, the computer program may be downloaded and installed from a network via the communication device 509, installed from the storage device 508, or installed from the ROM 502. When the computer program is executed by the processing device 501, the above functions defined in the method of the embodiments of the present disclosure are performed.
It should be noted that the computer-readable medium described above in the present disclosure may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. A computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared or semiconductor system, apparatus or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in connection with an instruction execution system, apparatus or device. In the present disclosure, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate or transmit a program for use by or in connection with an instruction execution system, apparatus or device. Program code contained on a computer-readable medium may be transmitted using any appropriate medium, including but not limited to wire, optical cable, RF (radio frequency), or any suitable combination of the above.
In some implementations, the client and the server may communicate using any currently known or future-developed network protocol, such as HTTP (HyperText Transfer Protocol), and may be interconnected with digital data communication in any form or medium (for example, a communication network). Examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), an internetwork (for example, the Internet) and peer-to-peer networks (for example, ad hoc peer-to-peer networks), as well as any currently known or future-developed network.
The above computer-readable medium may be included in the above electronic device, or it may exist separately without being assembled into the electronic device.
The above computer-readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: determine the content features of a target picture; determine target music based on the content features, the rhythm of the target music matching the content features; and synthesize the target music with the target picture to obtain a target video, the switching rhythm of the image material in the target video matching the rhythm of the target music.
Optionally, when the one or more programs are executed by the electronic device, the electronic device may also perform the other steps described in the above embodiments.
Computer program code for performing the operations of the present disclosure may be written in one or more programming languages or a combination thereof, including but not limited to object-oriented programming languages such as Java, Smalltalk and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
The flowcharts and block diagrams in the accompanying drawings illustrate the possible architecture, functions and operations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, a program segment, or a portion of code that contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur in an order different from that noted in the drawings. For example, two blocks shown in succession may in fact be executed substantially in parallel, or they may sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, may be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The units described in the embodiments of the present disclosure may be implemented in software or in hardware. In some cases, the name of a unit does not constitute a limitation on the unit itself.
The functions described above herein may be performed, at least in part, by one or more hardware logic components. For example, and without limitation, exemplary types of hardware logic components that may be used include field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chip (SOCs), complex programmable logic devices (CPLDs), and so on.
In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with an instruction execution system, apparatus or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared or semiconductor system, apparatus or device, or any suitable combination of the above. More specific examples of machine-readable storage media include an electrical connection based on one or more wires, a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
According to one or more embodiments of the present disclosure, the present disclosure provides a video generation method, including: determining the content features of a target picture; determining target music based on the content features, the rhythm of the target music matching the content features; and synthesizing the target music with the target picture to obtain a target video, the switching rhythm of the image material in the target video matching the rhythm of the target music.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, determining the content features of the target picture includes: determining the picture features of the target picture by means of a picture-feature determination algorithm; determining the emotional features of the target picture based on the picture features of the target picture; and determining the content features of the target picture according to the emotional features of the target picture.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, the picture features include at least one of the color of the target picture, the category of an object in the target picture, or the action of an object in the target picture.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, determining the emotional features of the target picture based on the picture features of the target picture includes: determining, based on a mapping relationship between picture features and emotional features, the emotional feature that is mapped to the picture features of the target picture.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, when there are multiple target pictures, each target picture has its own corresponding emotional feature, and determining the content features of the target picture according to the emotional features of the target picture includes: selecting, as the content feature, the emotional feature that corresponds to the largest number of target pictures.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, synthesizing the target music with the target picture to obtain the target video includes: determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music; and generating the target video according to the switching moments.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music includes: dividing the playback duration of the target music evenly according to the number of target pictures; and taking each division point as a switching moment of the target pictures in the target video, so that every target picture is played for the same duration in the target video.
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music includes: determining the switching moment of each target picture in the target video according to the target duration for which each target picture is to be played in the target video, so that the playback duration of each target picture in the target video equals its target duration.
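When each picture carries its own target duration, the switching moments are the running totals of those durations. The following is a minimal sketch under that reading; the function name `switch_times_from_durations` and the list-of-seconds input format are hypothetical.

```python
from itertools import accumulate


def switch_times_from_durations(target_durations):
    """Turn per-picture target durations (seconds) into switching moments."""
    if any(d <= 0 for d in target_durations):
        raise ValueError("every target duration must be positive")
    # The running total after each picture marks where the next one starts;
    # the final total is the end of the video, not a switch, so it is dropped.
    return list(accumulate(target_durations))[:-1]


print(switch_times_from_durations([2.0, 3.0, 1.5]))  # [2.0, 5.0]
```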
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, determining the switching moment of each target picture in the target video based on at least one of the playback duration of the target music, the number of target pictures, or the rhythm of the target music includes: determining the accent rhythm points of the target music according to the rhythm of the target music; and determining the accent rhythm points as the switching moments of the target pictures in the target video.
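The disclosure does not state how accent rhythm points are detected. Purely as an illustration, the sketch below assumes a constant tempo and treats the first beat of every bar as the accent; the function name `accent_points` and the parameters `bpm` and `beats_per_bar` are assumptions, and a real system might instead rely on audio beat tracking.

```python
def accent_points(bpm, beats_per_bar, music_duration):
    """Return the bar-start times (seconds) that fall strictly inside the music.

    Assumes a constant tempo and that the accent is the first beat of each bar.
    """
    if bpm <= 0 or beats_per_bar < 1:
        raise ValueError("bpm must be positive and beats_per_bar at least 1")
    bar_length = 60.0 / bpm * beats_per_bar
    points = []
    k = 1  # skip time 0: the first picture is already on screen there
    while k * bar_length < music_duration:
        points.append(k * bar_length)
        k += 1
    return points


print(accent_points(120, 4, 9.0))  # [2.0, 4.0, 6.0, 8.0]
```

At 120 beats per minute in 4/4 time, a bar lasts 2 seconds, so a picture switch lands on each bar line.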
According to one or more embodiments of the present disclosure, in the video generation method provided by the present disclosure, optionally, synthesizing the target music and the target picture to obtain the target video includes: determining the switching moments of special effects in the target video based on the rhythm of the target music; and generating the target video according to the switching moments of the target pictures in the target video and the switching moments of the special effects.
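One way to combine the two sets of switching moments before rendering is a single time-ordered event list. The sketch below is an assumption about how such a step might look, not the disclosed implementation; `build_timeline` and the event labels `"picture"` and `"effect"` are hypothetical names.

```python
def build_timeline(picture_times, effect_times):
    """Merge picture and special-effect switching moments into one ordered list.

    Each event is a (time_in_seconds, kind) tuple. Events at the same time are
    ordered alphabetically by kind, which is an arbitrary choice of this sketch.
    """
    events = [(t, "picture") for t in picture_times]
    events += [(t, "effect") for t in effect_times]
    return sorted(events)


for time, kind in build_timeline([4.0, 8.0], [2.0, 4.0]):
    print(f"{time:4.1f}s  switch {kind}")
```

A renderer could then walk this list once, changing the picture or the effect at each event.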
According to one or more embodiments of the present disclosure, the present disclosure provides a video generation apparatus, including: a first determination module configured to determine a content feature of a target picture; a second determination module configured to determine target music based on the content feature, where the rhythm of the target music matches the content feature; and a processing module configured to synthesize the target music and the target picture to obtain a target video, where the switching rhythm of the image material in the target video matches the rhythm of the target music.
According to one or more embodiments of the present disclosure, the present disclosure provides an electronic device, including:
one or more processors; and
a memory configured to store one or more programs;
where the one or more programs, when executed by the one or more processors, cause the one or more processors to implement any of the video generation methods provided by the present disclosure.
According to one or more embodiments of the present disclosure, the present disclosure provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements any of the video generation methods provided by the present disclosure.
According to one or more embodiments of the present disclosure, the present disclosure provides a computer program including instructions that, when executed by a processor, cause the processor to perform the video generation method described above.
The above description merely presents preferred embodiments of the present disclosure and explains the technical principles applied. Those skilled in the art should understand that the scope of the present disclosure is not limited to technical solutions formed by the specific combination of the above technical features, and also covers other technical solutions formed by any combination of the above technical features or their equivalents without departing from the disclosed concept, for example, technical solutions formed by replacing the above features with (but not limited to) technical features having similar functions disclosed in the present disclosure.
Furthermore, although operations are depicted in a specific order, this should not be understood as requiring that the operations be performed in the specific order shown or in sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, although several specific implementation details are included in the above discussion, they should not be construed as limiting the scope of the present disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims.
Claims (15)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210270283.6 | 2022-03-18 | ||
| CN202210270283.6A CN116800908A (en) | 2022-03-18 | 2022-03-18 | A video generation method, device, electronic device and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023174066A1 (en) | 2023-09-21 |
Family
ID=88022347
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2023/079301 (WO2023174066A1 (en), Ceased) | 2022-03-18 | 2023-03-02 | Video generation method and apparatus, electronic device, and storage medium |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN116800908A (en) |
| WO (1) | WO2023174066A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025256503A1 (en) * | 2024-06-14 | 2025-12-18 | 北京字跳网络技术有限公司 | Method and apparatus for determining beat synchronization point of audio, device, and product |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20140212106A1 (en) * | 2011-02-08 | 2014-07-31 | Google Inc. | Music soundtrack recommendation engine for videos |
| CN105142017A (en) * | 2015-08-12 | 2015-12-09 | 北京金山安全软件有限公司 | Picture switching method and picture switching device during picture video playing |
| CN108882015A (en) * | 2018-06-27 | 2018-11-23 | Oppo广东移动通信有限公司 | Method for adjusting playing speed of memory video and related equipment |
| CN109615682A (en) * | 2018-12-07 | 2019-04-12 | 北京微播视界科技有限公司 | Animation producing method, device, electronic equipment and computer readable storage medium |
| CN111683209A (en) * | 2020-06-10 | 2020-09-18 | 北京奇艺世纪科技有限公司 | Mixed-cut video generation method and device, electronic equipment and computer-readable storage medium |
| CN112153460A (en) * | 2020-09-22 | 2020-12-29 | 北京字节跳动网络技术有限公司 | Video dubbing method and device, electronic equipment and storage medium |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110278388B (en) * | 2019-06-19 | 2022-02-22 | 北京字节跳动网络技术有限公司 | Display video generation method, device, equipment and storage medium |
| CN111831615B (en) * | 2020-05-28 | 2024-03-12 | 北京达佳互联信息技术有限公司 | Method, device and system for generating video file |
| CN112040273B (en) * | 2020-09-11 | 2022-02-25 | 腾讯科技(深圳)有限公司 | Video synthesis method and device |
- 2022-03-18: CN application CN202210270283.6A, published as CN116800908A (en), status: Pending (active)
- 2023-03-02: WO application PCT/CN2023/079301, published as WO2023174066A1 (en), status: Ceased (not active)
Also Published As
| Publication number | Publication date |
|---|---|
| CN116800908A (en) | 2023-09-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110677711B (en) | | Video dubbing method and device, electronic equipment and computer readable medium |
| JP7199527B2 (en) | | Image processing method, device, hardware device |
| JP2023553101A (en) | | Live streaming interaction methods, apparatus, devices and media |
| CN113207025B (en) | | Video processing method and device, electronic equipment and storage medium |
| CN113365134A (en) | | Audio sharing method, device, equipment and medium |
| CN114422698B (en) | | Video generation method, device, equipment and storage medium |
| WO2021057740A1 (en) | | Video generation method and apparatus, electronic device, and computer readable medium |
| WO2022007565A1 (en) | | Image processing method and apparatus for augmented reality, electronic device and storage medium |
| JP2023501813A (en) | | VIDEO GENERATION METHOD AND APPARATUS, ELECTRONIC DEVICE, AND COMPUTER READABLE MEDIUM |
| CN115062168A (en) | | Media content display method, device, device and storage medium |
| WO2023016349A1 (en) | | Text input method and apparatus, and electronic device and storage medium |
| WO2023109813A1 (en) | | Video generation method and apparatus, and electronic device and medium |
| US20240348914A1 (en) | | Photographing method and apparatus, electronic device, and storage medium |
| CN115981769A (en) | | Page display method, device, equipment, computer readable storage medium and product |
| WO2023088484A1 (en) | | Method and apparatus for editing multimedia resource scene, device, and storage medium |
| JP2023538943A (en) | | Audio data processing methods, devices, equipment and storage media |
| WO2024082948A1 (en) | | Multimedia data processing method, apparatus, device and medium |
| WO2024046360A1 (en) | | Media content processing method and apparatus, device, readable storage medium, and product |
| WO2021160141A1 (en) | | Video processing method and apparatus, readable medium and electronic device |
| CN117596452A (en) | | Video generation method, device, medium and electronic equipment |
| WO2023169356A1 (en) | | Image processing method and apparatus, and device and storage medium |
| CN113806306B (en) | | Media file processing method, device, equipment, readable storage medium and product |
| WO2023174066A1 (en) | | Video generation method and apparatus, electronic device, and storage medium |
| WO2024078293A1 (en) | | Image processing method and apparatus, electronic device, and storage medium |
| WO2024046484A1 (en) | | Video generation method and apparatus, device, storage medium, and program product |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 23769572; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 08/01/2025) |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 23769572; Country of ref document: EP; Kind code of ref document: A1 |