CN104038832A - Video playing method and device - Google Patents
Video playing method and device Download PDFInfo
- Publication number
- CN104038832A CN104038832A CN201410063134.8A CN201410063134A CN104038832A CN 104038832 A CN104038832 A CN 104038832A CN 201410063134 A CN201410063134 A CN 201410063134A CN 104038832 A CN104038832 A CN 104038832A
- Authority
- CN
- China
- Prior art keywords
- frame
- frames
- target video
- index map
- key
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 76
- 230000015654 memory Effects 0.000 claims description 54
- 230000008859 change Effects 0.000 claims description 19
- 238000009826 distribution Methods 0.000 claims description 14
- 230000005540 biological transmission Effects 0.000 claims description 5
- 230000006870 function Effects 0.000 description 20
- 238000004891 communication Methods 0.000 description 19
- 238000010586 diagram Methods 0.000 description 11
- 230000008569 process Effects 0.000 description 10
- 238000004590 computer program Methods 0.000 description 8
- 238000001514 detection method Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 230000001360 synchronised effect Effects 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 4
- 239000000835 fiber Substances 0.000 description 4
- 238000010219 correlation analysis Methods 0.000 description 3
- 230000001133 acceleration Effects 0.000 description 2
- 230000005484 gravity Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 238000010079 rubber tapping Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000010897 surface acoustic wave method Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Landscapes
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The disclosure discloses a video playing method and device, and belongs to the technical field of videos. The method comprises that an index graph of a target video is generated according to images in multiple frames of the target video. The index graph comprises multiple areas which are corresponding to the images in the multiple frames. A request of a terminal for a target video index is received, and the index graph is sent to the terminal. When an instruction, which is returned by the terminal, for selecting target areas in the multiple areas of the index graph is received, the target video is transmitted to the terminal according to the frames corresponding to the target areas. The index graph for the target video is generated according to the images corresponding to the multiple frames in the target video; and when a user clicks and selects the target areas in the index graph, the target video is played according to the frames corresponding to the target areas. As for the generated index graph, contents in the video can be completely displayed so that video index efficiency is enhanced.
Description
Technical Field
The present disclosure relates to the field of video technologies, and in particular, to a method and an apparatus for playing a video.
Background
With the development of the internet, more and more video contents are stored in the network for people to learn and work.
When the video is played, the index of the video is loaded, so that a user can select the more popular playing time point in the video according to the index. The index is displayed in a video playing progress bar, a specific identifier is set at the hot playing time point, and the hot playing time point is described in a text form. And the user selects the preferred watching content according to the description corresponding to the popular playing time point and clicks the corresponding specific identification to set the playing progress of the video.
The inventors found that the related art has at least the following problems:
the index is displayed through the progress bar of the video, so that only the literal description of the content in the video can be obtained, and the popular playing time points corresponding to the indexes are set according to the preference of the user or the video producer, so that the obtained indexes of the video are not comprehensive, and the complete video content cannot be covered.
Disclosure of Invention
In order to solve the problems of the related art, the embodiments of the present disclosure provide a method and an apparatus for playing a video. The technical scheme is as follows:
according to a first aspect of the embodiments of the present disclosure, there is provided a method for playing a video, including:
generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
receiving a request of a terminal for requesting the target video index, and sending the index map to the terminal;
and when a selection command of a target area in a plurality of areas of the index map returned by the terminal is received, starting to transmit the target video to the terminal according to a frame corresponding to the target area.
According to a second aspect of the embodiments of the present disclosure, there is provided a method of playing a video, including:
acquiring an index map of a target video, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
and when a selection command of a target area in a plurality of areas of the index map is received, starting to play the target video according to a frame corresponding to the target area.
According to a third aspect of the embodiments of the present disclosure, there is provided an apparatus for playing a video, including:
the generating module is used for generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
the receiving module is used for receiving a request of a terminal for requesting the target video index and sending the index map to the terminal;
and the transmission module is used for starting to transmit the target video to the terminal according to the frame corresponding to the target area when receiving a selection command of the target area in the plurality of areas of the index map returned by the terminal.
According to a fourth aspect of the embodiments of the present disclosure, there is provided an apparatus for playing a video, including:
the acquisition module is used for acquiring an index map of a target video, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
and the playing module is used for starting to play the target video according to the frame corresponding to the target area when receiving a selection command of the target area in the plurality of areas of the index map.
According to a fifth aspect of embodiments of the present disclosure, there is provided a server comprising a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for:
generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
receiving a request of a terminal for requesting the target video index, and sending the index map to the terminal;
and when a selection command of a target area in a plurality of areas of the index map returned by the terminal is received, starting to transmit the target video to the terminal according to a frame corresponding to the target area.
According to a sixth aspect of embodiments of the present disclosure, there is provided a terminal comprising a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured for execution by the one or more processors to include instructions for:
acquiring an index map of a target video, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
and when a selection command of a target area in a plurality of areas of the index map is received, starting to play the target video according to a frame corresponding to the target area.
The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects:
generating an index map of the target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a flow chart illustrating a method of playing a video according to an exemplary embodiment of the present disclosure;
FIG. 2 is a flow chart illustrating a method of playing a video according to an exemplary embodiment of the present disclosure;
FIG. 3 is a flow chart illustrating a method of playing a video according to an exemplary embodiment of the present disclosure;
FIG. 4 is a diagram illustrating an index map in a method of playing a video according to an illustrative embodiment;
fig. 5 is a schematic diagram illustrating a terminal playing a key frame corresponding to a target area after a user clicks the target area in an index map in a method for playing a video according to an exemplary embodiment of the disclosure;
FIG. 6 is a flow chart illustrating a method of playing a video according to an exemplary embodiment of the present disclosure;
FIG. 7 is a schematic diagram illustrating an apparatus for playing video according to an exemplary embodiment of the present disclosure;
FIG. 8 is a schematic diagram illustrating an apparatus for playing video according to an exemplary embodiment of the present disclosure;
FIG. 9 is a schematic block diagram illustrating a server device according to an exemplary embodiment of the present disclosure;
fig. 10 is a schematic structural diagram of a terminal device shown in accordance with an exemplary embodiment of the present disclosure.
Detailed Description
To make the objects, technical solutions and advantages of the present disclosure more apparent, embodiments of the present disclosure will be described in detail with reference to the accompanying drawings.
Example one
The embodiment of the present disclosure provides a method for playing a video, and referring to fig. 1, the method flow includes:
in step 101, generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames, and the method of the embodiment is used in a server;
in step 102, a request of a terminal for requesting a target video index is received, and an index map is sent to the terminal, wherein the method of the embodiment is used in a server;
in step 103, when a selection command for a target area in a plurality of areas of the index map returned by the terminal is received, the target video starts to be transmitted to the terminal according to a frame corresponding to the target area.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
Example two
The embodiment of the present disclosure provides a method for playing a video, and referring to fig. 2, the method flow includes:
in step 201, acquiring an index map of a target video, the index map being generated according to images in a plurality of frames in the target video, the index map including a plurality of regions corresponding to the images in the plurality of frames, the method of the embodiment being used in a terminal;
in step 202, when a selection command for a target area in a plurality of areas of the index map is received, a target video is played according to a frame corresponding to the target area.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
EXAMPLE III
An embodiment of the present disclosure provides a method for playing a video, referring to fig. 3, the method includes:
in step 301, an index map of the target video is generated according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames, and the method of the embodiment is used in a server.
The process of generating the index map according to the target video comprises the steps of extracting a plurality of key frames in the target video, splicing images in the key frames into a picture according to a time sequence, wherein the image of one key frame corresponds to one region in the picture.
The key frames can be understood as representing more prominent images in different scenes, and the video is composed of different images in a plurality of scenes, so that one key frame is selected from each scene, and the images in each key frame are spliced to be called an index map, so that a user can completely know the content of the target video, and can select the playing time of the target video according to the preference of the user.
The process of extracting the key frame comprises the following steps:
in step 3011, the correlation between frames in the target video is analyzed, and the frames with the correlation are grouped into a frame set.
The correlation analysis is to calculate correlation values among frames in the target video through a correlation algorithm, and judge whether the frames have correlation through a preset threshold. The combination of frames with correlation is referred to as a frame set, that is, the frames belong to the same scene to form a frame set.
The process of analyzing the correlation between frames in the target video in step 3011 may be implemented in the following two ways, which may be one of the two ways, or may include a combination of the two ways, to obtain a final correlation analysis result. Optionally, step 3011 may further be:
in step 30111, a first frame and a second frame are selected from the target video, and the method of this embodiment is used in a server;
in step 30112, when the similarity of the image color distributions of the first frame and the second frame is greater than the preset similarity, it is determined that the first frame and the second frame have the similarity.
Wherein the similarity is determined by the image color distribution between frames. The image color distribution can be obtained through a histogram algorithm, and the image color distribution is compared to obtain the image correlation among frames.
In step 30113, when the fluctuation amplitude of the audio frequency corresponding to the first frame and the second frame is smaller than the preset amplitude and the time between the first frame and the second frame is continuous, it is determined that the first frame and the second frame have similarity.
The correlation between frames is determined by the frequency of the sound waveform corresponding to each frame in time, wherein the frames with the correlation are required to be kept continuous in time. The condition for determining the frames with the correlation is as follows: and determining the frames with similarity, wherein the audio frequency fluctuation amplitude in the continuous time is smaller than the preset amplitude.
In step 3012, a key frame is selected from each frame set, and an image in the key frame of each frame set is obtained.
After the correlation between the frames is analyzed, after the renting of the frames with the correlation is called a frame set, at least one key frame in each frame set needs to be selected to represent the frame set. In this step, the process of selecting the key frame may be calculated by an image saliency detection algorithm to obtain the key frame in each frame set.
Optionally, the process of selecting the key frame in step 3012 may be:
in step 30121, selecting a plurality of frames arranged in time series in each frame set; when the change rate between image pixels in a plurality of frames which are continuously arranged in time is smaller than a preset change rate, one of the plurality of frames which are continuously arranged in time is randomly selected as a key frame.
The frames in each frame set may not be completely continuous in time, that is, the same scene may appear repeatedly at different times, so that the temporally continuous frames in each frame set need to be selected as the condition for selecting the key frame. In addition, if the change rate between the image pixels in each frame is smaller than the preset change rate, it means that the images in the frames are relatively stable and do not change much within a certain time, so the frame in such a case is selected as the key frame.
In step 30122, a complete image in the key frame is obtained, a key region higher than the average value of pixels of the complete image is obtained in the complete image, and the image in the key region is determined as the image of the key frame.
In the step, a key area in the image is further determined in the image of the selected key frame, and the key area is used as the image of the key frame and spliced into the index map. The algorithm for further extracting the key area can be further obtained through an image saliency detection algorithm, and the method can highlight the theme of the image in the key frame and more accurately position the content which the image in the key frame wants to express.
For example: the image in the key frame is a person standing on a piece of grassland, the image saliency detection algorithm can calculate the average pixel value of the image, and the overall average pixel of the area where the person in the image is located is higher than the average pixel of the whole image, so that the area where the person is located can be used as a key area to extract the image as the key frame.
In step 3013, according to the playing time sequence of the key frames of each frame set, the images of the key frames of each frame set are spliced into an index map of the target video, and the correspondence of the images of the key frames of each frame set in each area in the index map is recorded.
When the pictures are spliced, images corresponding to the key frames are arranged according to the time sequence corresponding to each extracted key frame, the playing time corresponding to the key frames is recorded, meanwhile, the address corresponding relation between the index map and the target video is set, so that when a user clicks a target area in the index map in a terminal, the address of the target video and the playing time of the key frame corresponding to the target area can be transmitted to a server, and the server starts to transmit a video stream of the target video to the terminal from the key frame corresponding to the target area.
Alternatively, due to the long-time videos such as movies, and the limited resolution of the index map, the contents in the videos cannot be completely displayed in one index map. Therefore, the complete video can be segmented into multiple segments of videos according to a preset time period, and each segment of segmented video is completely executed through each step of step 301 in the embodiment of the present disclosure to obtain an index map corresponding to each segment of segmented video, so that a user can obtain more detailed video index content according to the index map corresponding to each segment of segmented video.
As shown in fig. 4, the figure shows a schematic diagram of an index map, which includes scene pictures corresponding to six key frames, and stores the scene pictures corresponding to the six key frames in an index from top to bottom in time order.
In step 302, a request for obtaining an index map of a target video is sent to a server, and the index map returned by the server is received, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames.
The step is carried out when the user watches the video at the terminal.
In step 303, a request for requesting a target video index from a terminal is received, and an index map is sent to the terminal.
In step 304, a selection command for a target area in the plurality of areas of the index map is sent to the server, and the method of the embodiment is used in the terminal.
As shown in fig. 4, when the user clicks the area corresponding to the scene picture corresponding to the third key frame in the index map, the area of the scene picture corresponding to the third key frame is the target area, and a selection command for acquiring the video stream from the third key frame is sent to the server.
In step 305, when a selection command for a target area in a plurality of areas of the index map returned by the terminal is received, the transmission of the target video to the terminal is started according to a frame corresponding to the target area.
In step 306, a target video transmitted by the server from a frame corresponding to the target area is received and played, and the method of the embodiment is used in the terminal.
As shown in fig. 5, the diagram shows a schematic diagram that the terminal plays the key frame corresponding to the target area after the user clicks the target area in the index map, that is, after the user clicks the area corresponding to the scene picture corresponding to the third key frame in the index map in the terminal, the server returns the video stream starting from the third key frame to the terminal, and plays from the third key frame.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
Example four
An embodiment of the present disclosure provides a method for playing a video, referring to fig. 6, the method includes:
in step 601, an index map of the target video is generated according to images in a plurality of frames in the target video, and the method of the embodiment is used in the terminal.
The method of the embodiment is used in the terminal.
Since the target video is stored locally in the terminal in the embodiment of the disclosure, the process of generating the target video can be performed when the user views and prepares to play the target video; and an index map corresponding to the target video can be generated in advance, and the generated index map is directly displayed when the user views and prepares to play the target video.
Optionally, the process of generating the index map of the target video may be:
in step 6011, the correlation between frames in the target video is analyzed, and the frames with the correlation are combined into a frame set.
The process of analyzing the correlation between frames in the target video in step 6011 may be implemented in the following two ways, one of the two ways may be used separately, or the two ways may be used in combination to obtain a final correlation analysis result. Optionally, step 3011 may further be:
in step 60111, a first frame and a second frame are selected from the target video, and the method of this embodiment is used in a terminal;
in step 60112, when the similarity of the image color distributions of the first frame and the second frame is greater than the preset similarity, it is determined that the first frame and the second frame have the similarity.
Wherein the similarity is determined by the image color distribution between frames. The image color distribution can be obtained through a histogram algorithm, and the image color distribution is compared to obtain the image correlation among frames.
In step 60113, when the fluctuation amplitude of the audio frequency corresponding to the first frame and the second frame is smaller than the preset amplitude, and the time between the first frame and the second frame is continuous, it is determined that the first frame and the second frame have similarity.
The correlation between frames is determined by the frequency of the sound waveform corresponding to each frame in time, wherein the frames with the correlation are required to be kept continuous in time. The condition for determining the frames with the correlation is as follows: and determining the frames with similarity, wherein the audio frequency fluctuation amplitude in the continuous time is smaller than the preset amplitude.
In step 6012, a key frame is selected from each frame set, and an image in the key frame of each frame set is acquired.
Optionally, the process of acquiring the key frame and acquiring the image in the key frame may further be:
in step 60121, a plurality of frames arranged in succession in time are selected in each frame set; when the change rate between image pixels in a plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame, wherein the method of the embodiment is used in the terminal;
in step 60122, acquiring a complete image in the key frame, acquiring a key area in the complete image higher than the average value of pixels of the complete image, and determining the image in the key area as the image of the key frame.
In step 6013, according to the playing time sequence of the key frames of each frame set, the images of the key frames of each frame set are merged into an index map of the target video, and a correspondence relationship of the images of the key frames of each frame set in each area in the index map is recorded.
In step 602, when a selection command for a target area in a plurality of areas of an index map is received, the target video is played from a frame corresponding to the target area in a locally stored target video.
In this step, the user performs an operation of playing the target video by clicking the target area in the index map.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
EXAMPLE five
An embodiment of the present disclosure provides an apparatus for playing a video, and referring to fig. 7, the apparatus includes:
a generating module 701, configured to generate an index map of a target video according to images in multiple frames in the target video, where the index map includes multiple regions corresponding to the images in the multiple frames;
a receiving module 702, configured to receive a request for requesting a target video index from a terminal, and send an index map to the terminal;
the transmission module 703 is configured to start to transmit the target video to the terminal according to the frame corresponding to the target area when a selection command for the target area in the multiple areas of the index map returned by the terminal is received.
The generating module 701 includes:
an analyzing unit 7011, configured to analyze correlations between frames in the target video, and form a frame set with the frames having the correlations;
a first obtaining unit 7012, configured to select a key frame in each frame set, and obtain an image in the key frame of each frame set;
the generating unit 7013 is configured to splice the images of the key frames of each frame set into an index map of the target video according to the playing time sequence of the key frames of each frame set, and record a corresponding relationship of the images of the key frames of each frame set in each area in the index map.
Among them, the analysis unit 7011 includes:
the first selection subunit is used for selecting a first frame and a second frame in the target video;
the first determining subunit is used for determining that the first frame and the second frame have similarity when the similarity of the image color distribution of the first frame and the second frame is greater than the preset similarity;
and the second determining subunit is used for determining that the first frame and the second frame have similarity when the audio frequency fluctuation amplitude corresponding to the first frame and the second frame is smaller than the preset amplitude and the time between the first frame and the second frame is continuous.
The first obtaining unit 7012 includes:
a second selecting subunit, configured to select, in each frame set, a plurality of frames that are arranged temporally consecutively; when the change rate among image pixels in a plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame;
and the first acquisition subunit is used for acquiring the complete image in the key frame, acquiring a key area higher than the average value of pixels of the complete image in the complete image, and determining the image in the key area as the image of the key frame.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
EXAMPLE six
An embodiment of the present disclosure provides an apparatus for playing a video, and referring to fig. 8, the apparatus includes:
an obtaining module 801, configured to obtain an index map of a target video, where the index map is generated according to images in multiple frames in the target video, and the index map includes multiple regions corresponding to the images in the multiple frames;
the playing module 802 is configured to start playing a target video according to a frame corresponding to a target area when a selection command for the target area in a plurality of areas of the index map is received.
The obtaining module 801 includes:
a second obtaining unit 8011, configured to send a request to obtain an index map of a target video to a server, and receive an index map returned by the server;
a third obtaining unit 8012, configured to generate an index map of the target video according to images in multiple frames of the target video;
the third capturing unit 8012 includes:
the analysis subunit is used for analyzing the correlation among the frames in the target video and forming a frame set by the frames with the correlation;
the second acquisition subunit is used for selecting a key frame from each frame set and acquiring an image in the key frame of each frame set;
and the generating subunit is configured to splice the images of the key frames of each frame set into an index map of the target video according to the playing time sequence of the key frames of each frame set, and record a corresponding relationship of the images of the key frames of each frame set in each area in the index map.
Wherein the analysis subunit is configured to:
selecting a third frame and a fourth frame in the target video;
when the similarity of the image color distribution of the third frame and the fourth frame is greater than the preset similarity, determining that the third frame and the fourth frame have the similarity;
and when the audio frequency fluctuation amplitude corresponding to the third frame and the fourth frame is smaller than the preset amplitude and the time between the third frame and the fourth frame is continuous, determining that the third frame and the fourth frame have similarity.
Wherein the second obtaining subunit is configured to:
selecting a plurality of frames which are arranged in succession in time in each frame set; when the change rate among image pixels in a plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame;
acquiring a complete image in the key frame, acquiring a key area higher than the average value of pixels of the complete image in the complete image, and determining the image in the key area as the image of the key frame.
The played module 802 includes:
a first playing unit 8021, configured to send a command for selecting a target area in multiple areas of the index map to the server, and receive and play a target video that is transmitted by the server from a frame corresponding to the target area;
the second playing unit 8022 is configured to play the target video from a frame corresponding to the target area in the target video stored locally.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
EXAMPLE seven
The embodiment of the present disclosure provides a server device, and refer to fig. 9, which is a schematic structural diagram of the server device in the embodiment of the present disclosure. Referring to fig. 9, the server may be used to implement the methods provided in the above embodiments.
The server device 1900, which may vary widely in configuration or performance, may include one or more Central Processing Units (CPUs) 1922 (e.g., one or more processors) and memory 1932, one or more storage media 1930 (e.g., one or more mass storage devices) storing applications 1942 or data 1944. Memory 1932 and storage medium 1930 can be, among other things, transient or persistent storage. The program stored in the storage medium 1930 may include one or more modules (not shown), each of which may include a series of instructions operating on a server. Still further, the central processor 1922 may be arranged to communicate with the storage medium 1930 to perform a series of instruction operations in the storage medium 1930 on the server device 1900.
Server device 1900 may also include one or more power supplies 1926, one or more wired or wireless network interfaces 1950, one or more input-output interfaces 1958, one or more keyboards 1956, and/or one or more operating systems 1941, such as Windows Server, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
Further, the terminal of the present disclosure may typically be a variety of handheld terminal devices, such as cell phones, Personal Digital Assistants (PDAs), etc., and thus the scope of the present disclosure should not be limited to a particular type of terminal.
Furthermore, the method according to the present disclosure may also be implemented as a computer program executed by a CPU. The computer program, when executed by the CPU, performs the above-described functions defined in the method of the present disclosure.
Further, the above method steps and system elements may also be implemented using a controller and a computer readable storage device for storing a computer program for causing the controller to implement the functions of the above steps or elements.
Further, it should be appreciated that the computer-readable storage devices (e.g., memories) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. By way of example, and not limitation, nonvolatile memory can include Read Only Memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM), which can act as external cache memory. By way of example and not limitation, RAM is available in a variety of forms such as synchronous RAM (DRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The storage devices of the disclosed aspects are intended to comprise, without being limited to, these and other suitable types of memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with the following components designed to perform the functions herein: a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination of these components. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary designs, the functions may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk, blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
Example eight
The embodiment of the present disclosure provides a terminal device, and referring to fig. 10, it is a schematic structural diagram of the terminal device in the embodiment of the present disclosure. Referring to fig. 10, the terminal may be used to implement the methods provided in the above embodiments. Preferably:
terminal device 1000 can include components such as a communication unit 110, a memory 120 including one or more computer-readable storage media, an input unit 130, a display unit 140, a sensor 150, an audio circuit 160, a WIFI (Wireless Fidelity) module 170, a processor 180 including one or more processing cores, and a power supply 190. Those skilled in the art will appreciate that the terminal device configurations shown in the figures are not intended to be limiting of terminal devices and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components. Wherein:
the communication unit 110 may be used for receiving and transmitting information or signals during a call, and the communication unit 110 may be an RF (Radio Frequency) circuit, a router, a modem, or other network communication devices. In particular, when the communication unit 110 is an RF circuit, downlink information of the base station is received and then processed by the one or more processors 180; in addition, data relating to uplink is transmitted to the base station. Generally, the RF circuit as a communication unit includes, but is not limited to, an antenna, at least one Amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM) card, a transceiver, a coupler, an LNA (Low Noise Amplifier), a duplexer, and the like. In addition, the communication unit 110 may also communicate with a network and other devices through wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System for Mobile communications), GPRS (General Packet Radio Service), CDMA (Code Division Multiple Access), WCDMA (Wideband Code Division Multiple Access), LTE (Long Term Evolution), email, SMS (Short Messaging Service), etc. The memory 120 may be used to store software programs and modules, and the processor 180 executes various functional applications and data processing by operating the software programs and modules stored in the memory 120. The memory 120 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the terminal device 1000, and the like. Further, the memory 120 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, the memory 120 may further include a memory controller to provide the processor 180 and the input unit 130 with access to the memory 120.
The input unit 130 may be used to receive input numeric or character information and generate keyboard, mouse, joystick, optical or trackball signal inputs related to user settings and function control. Preferably, the input unit 130 may include a touch-sensitive surface 131 and other input devices 132. The touch-sensitive surface 131, also referred to as a touch display screen or a touch pad, may collect touch operations by a user on or near the touch-sensitive surface 131 (e.g., operations by a user on or near the touch-sensitive surface 131 using any suitable object or attachment such as a finger, a stylus, etc.) and drive the corresponding connection device according to a predetermined program. Alternatively, the touch sensitive surface 131 may comprise two parts, a touch detection means and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 180, and can receive and execute commands sent by the processor 180. Additionally, the touch-sensitive surface 131 may be implemented using various types of resistive, capacitive, infrared, and surface acoustic waves. In addition to the touch-sensitive surface 131, the input unit 130 may also include other input devices 132. Preferably, other input devices 132 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.
The display unit 140 may be used to display information input by a user or information provided to the user and various graphic user interfaces of the terminal apparatus 1000, which may be configured by graphics, text, icons, video, and any combination thereof. The Display unit 140 may include a Display panel 141, and optionally, the Display panel 141 may be configured in the form of an LCD (Liquid Crystal Display), an OLED (Organic Light-Emitting Diode), or the like. Further, the touch-sensitive surface 131 may cover the display panel 141, and when a touch operation is detected on or near the touch-sensitive surface 131, the touch operation is transmitted to the processor 180 to determine the type of the touch event, and then the processor 180 provides a corresponding visual output on the display panel 141 according to the type of the touch event. Although in FIG. 10, touch-sensitive surface 131 and display panel 141 are shown as two separate components to implement input and output functions, in some embodiments, touch-sensitive surface 131 may be integrated with display panel 141 to implement input and output functions.
Terminal device 1000 can also include at least one sensor 150, such as a light sensor, motion sensor, and other sensors. The light sensor may include an ambient light sensor that adjusts the brightness of the display panel 141 according to the brightness of ambient light, and a proximity sensor that turns off the display panel 141 and/or a backlight when the terminal device 1000 moves to the ear. As one of the motion sensors, the gravity acceleration sensor can detect the magnitude of acceleration in each direction (generally, three axes), can detect the magnitude and direction of gravity when the mobile phone is stationary, and can be used for applications of recognizing the posture of the mobile phone (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer and tapping), and the like; as for other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which can be configured on the terminal device 1000, further description is omitted here.
Audio circuitry 160, speaker 161, and microphone 162 may provide an audio interface between a user and terminal device 1000. The audio circuit 160 may transmit the electrical signal converted from the received audio data to the speaker 161, and convert the electrical signal into a sound signal for output by the speaker 161; on the other hand, the microphone 162 converts the collected sound signal into an electric signal, converts the electric signal into audio data after being received by the audio circuit 160, and outputs the audio data to the processor 180 for processing, and then transmits the audio data to, for example, another terminal device via the RF circuit 110, or outputs the audio data to the memory 120 for further processing. The audio circuit 160 may also include an earbud jack to provide communication of peripheral headphones with the terminal device 1000.
In order to realize wireless communication, a wireless communication unit 170 may be configured on the terminal device, and the wireless communication unit 170 may be a WIFI module. WIFI belongs to a short-distance wireless transmission technology, and the terminal device 1000 can help a user to send and receive e-mails, browse webpages, access streaming media, and the like through the wireless communication unit 170, and provides wireless broadband internet access for the user. Although the wireless communication unit 170 is shown in the drawing, it is understood that it does not belong to the essential constitution of the terminal device 1000 and may be omitted entirely as needed within the scope not changing the essence of the invention.
The processor 180 is a control center of the terminal device 1000, connects various parts of the entire mobile phone by using various interfaces and lines, and performs various functions of the terminal device 1000 and processes data by operating or executing software programs and/or modules stored in the memory 120 and calling data stored in the memory 120, thereby performing overall monitoring of the mobile phone. Optionally, processor 180 may include one or more processing cores; preferably, the processor 180 may integrate an application processor, which mainly handles operating systems, user interfaces, application programs, etc., and a modem processor, which mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 180.
Terminal device 1000 can also include a power supply 190 (e.g., a battery) for powering the various components, which may preferably be logically coupled to processor 180 via a power management system to provide management of charging, discharging, and power consumption via the power management system. The power supply 190 may also include any component including one or more of a dc or ac power source, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
Although not shown, the terminal device 1000 may further include a camera, a bluetooth module, and the like, which are not described herein. In this embodiment, the terminal device further includes a memory, and one or more programs, where the one or more programs are stored in the memory, and the one or more programs are configured to be executed by the one or more processors and include instructions for performing the method provided by the embodiments of the present invention.
The method comprises the steps of generating an index map of a target video according to images corresponding to a plurality of frames in the target video; and when the user clicks the target area in the index map, starting to play the target video according to the frame corresponding to the target area. The generated index graph can completely reflect the content in the video, and the efficiency of indexing the video is improved.
Furthermore, the mobile terminal of the present disclosure may typically be a variety of handheld terminal devices, such as cell phones, Personal Digital Assistants (PDAs), etc., and thus the scope of the present disclosure should not be limited to a particular type of mobile terminal.
Furthermore, the method according to the present disclosure may also be implemented as a computer program executed by a CPU. The computer program, when executed by the CPU, performs the above-described functions defined in the method of the present disclosure.
Further, the above method steps and system elements may also be implemented using a controller and a computer readable storage device for storing a computer program for causing the controller to implement the functions of the above steps or elements.
Further, it should be appreciated that the computer-readable storage devices (e.g., memories) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. By way of example, and not limitation, nonvolatile memory can include Read Only Memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM), which can act as external cache memory. By way of example and not limitation, RAM is available in a variety of forms such as synchronous RAM (DRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The storage devices of the disclosed aspects are intended to comprise, without being limited to, these and other suitable types of memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with the following components designed to perform the functions herein: a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination of these components. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary designs, the functions may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk, blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
While the foregoing disclosure shows illustrative embodiments of the disclosure, it should be noted that various changes and modifications could be made herein without departing from the scope of the disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosure may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
The above-mentioned embodiments, objects, technical solutions and advantages of the present disclosure are described in further detail, it should be understood that the above-mentioned embodiments are merely illustrative of the present disclosure and are not intended to limit the scope of the present disclosure, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present disclosure should be included in the scope of the present disclosure.
Claims (22)
1. A method of playing a video, the method comprising:
generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
receiving a request of a terminal for requesting the target video index, and sending the index map to the terminal;
and when a selection command of a target area in a plurality of areas of the index map returned by the terminal is received, starting to transmit the target video to the terminal according to a frame corresponding to the target area.
2. The method of claim 1, wherein generating an index map of a target video from images in a plurality of frames in the target video comprises:
analyzing the correlation among frames in the target video, and forming a frame set by the frames with the correlation;
selecting a key frame in each frame set, and acquiring an image in the key frame of each frame set;
according to the playing time sequence of the key frames of each frame set, splicing the images of the key frames of each frame set into an index map of the target video, and recording the corresponding relation of the images of the key frames of each frame set in each area of the index map.
3. The method of claim 2, wherein analyzing the correlation between frames in the target video comprises:
selecting a first frame and a second frame in the target video;
when the similarity of the image color distribution of the first frame and the second frame is greater than a preset similarity, determining that the first frame and the second frame have the similarity;
and when the audio frequency fluctuation amplitude corresponding to the first frame and the second frame is smaller than a preset amplitude and the time between the first frame and the second frame is continuous, determining that the first frame and the second frame have similarity.
4. The method of claim 2, wherein selecting a key frame in each frame set and acquiring an image in each key frame comprises:
selecting a plurality of frames which are arranged in succession in time in each frame set; when the change rate among the image pixels in the plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame;
acquiring a complete image in the key frame, acquiring a key area higher than the average value of pixels of the complete image in the complete image, and determining the image in the key area as the image of the key frame.
5. A method of playing a video, the method comprising:
acquiring an index map of a target video, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
and when a selection command of a target area in a plurality of areas of the index map is received, starting to play the target video according to a frame corresponding to the target area.
6. The method of claim 5, wherein obtaining the index map of the target video comprises:
sending a request for acquiring an index map of a target video to a server, and receiving the index map returned by the server; or,
and generating an index map of the target video according to images in a plurality of frames in the target video.
7. The method of claim 6, wherein generating an index map of a target video from images in a plurality of frames in the target video comprises:
analyzing the correlation among frames in the target video, and forming a frame set by the frames with the correlation;
selecting a key frame in each frame set, and acquiring an image in the key frame of each frame set;
according to the playing time sequence of the key frames of each frame set, splicing the images of the key frames of each frame set into an index map of the target video, and recording the corresponding relation of the images of the key frames of each frame set in each area of the index map.
8. The method of claim 7, wherein analyzing the correlation between frames in the target video and grouping the frames with correlation into a frame set comprises:
selecting a third frame and a fourth frame in the target video;
when the similarity of the image color distribution of the third frame and the fourth frame is greater than a preset similarity, determining that the third frame and the fourth frame have the similarity;
and when the audio frequency fluctuation amplitude corresponding to the third frame and the fourth frame is smaller than a preset amplitude and the time between the third frame and the fourth frame is continuous, determining that the third frame and the fourth frame have similarity.
9. The method of claim 7, wherein selecting a key frame in each frame set and acquiring an image in each key frame comprises:
selecting a plurality of frames which are arranged in succession in time in each frame set; when the change rate among the image pixels in the plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame;
acquiring a complete image in the key frame, acquiring a key area higher than the average value of pixels of the complete image in the complete image, and determining the image in the key area as the image of the key frame.
10. The method according to claim 5, wherein the starting of playing the target video according to the frame corresponding to the target area comprises:
sending a selection command of a target area in a plurality of areas of the index map to a server, and receiving and playing the target video transmitted by the server from a frame corresponding to the target area; or,
and in the locally stored target video, starting to play the target video from a frame corresponding to the target area.
11. An apparatus for playing video, the apparatus comprising:
the generating module is used for generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
the receiving module is used for receiving a request of a terminal for requesting the target video index and sending the index map to the terminal;
and the transmission module is used for starting to transmit the target video to the terminal according to the frame corresponding to the target area when receiving a selection command of the target area in the plurality of areas of the index map returned by the terminal.
12. The apparatus of claim 11, wherein the generating module comprises:
the analysis unit is used for analyzing the correlation among the frames in the target video and forming the frames with the correlation into a frame set;
the first acquisition unit is used for selecting a key frame in each frame set and acquiring an image in the key frame of each frame set;
and the generating unit is used for splicing the images of the key frames of each frame set into an index map of the target video according to the playing time sequence of the key frames of each frame set, and recording the corresponding relation of the images of the key frames of each frame set in each area in the index map.
13. The apparatus of claim 12, wherein the analysis unit comprises:
the first selecting subunit is used for selecting a first frame and a second frame in the target video;
a first determining subunit, configured to determine that the first frame and the second frame have similarity when similarity of image color distributions of the first frame and the second frame is greater than a preset similarity;
and the second determining subunit is configured to determine that the first frame and the second frame have similarity when the audio frequency fluctuation amplitude corresponding to the first frame and the second frame is smaller than a preset amplitude and the time between the first frame and the second frame is continuous.
14. The apparatus of claim 12, wherein the first obtaining unit comprises:
a second selecting subunit, configured to select, in each frame set, a plurality of frames that are arranged temporally consecutively; when the change rate among the image pixels in the plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame;
the first acquiring subunit is configured to acquire a complete image in the key frame, acquire a key region higher than the average value of pixels of the complete image in the complete image, and determine an image in the key region as an image of the key frame.
15. An apparatus for playing video, the apparatus comprising:
the acquisition module is used for acquiring an index map of a target video, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
and the playing module is used for starting to play the target video according to the frame corresponding to the target area when receiving a selection command of the target area in the plurality of areas of the index map.
16. The apparatus of claim 15, wherein the obtaining module comprises:
the second acquisition unit is used for sending a request for acquiring the index map of the target video to the server and receiving the index map returned by the server;
and the third acquisition unit is used for generating an index map of the target video according to the images in the frames in the target video.
17. The apparatus of claim 16, wherein the third obtaining unit comprises:
the analysis subunit is used for analyzing the correlation among the frames in the target video and forming a frame set by the frames with the correlation;
the second acquisition subunit is used for selecting a key frame from each frame set and acquiring an image in the key frame of each frame set;
and the generating subunit is configured to splice the images of the key frames of each frame set into an index map of the target video according to the playing time sequence of the key frames of each frame set, and record a corresponding relationship of the images of the key frames of each frame set in each area in the index map.
18. The apparatus of claim 17, wherein the analysis subunit is configured to:
selecting a third frame and a fourth frame in the target video;
when the similarity of the image color distribution of the third frame and the fourth frame is greater than a preset similarity, determining that the third frame and the fourth frame have the similarity;
and when the audio frequency fluctuation amplitude corresponding to the third frame and the fourth frame is smaller than a preset amplitude and the time between the third frame and the fourth frame is continuous, determining that the third frame and the fourth frame have similarity.
19. The apparatus of claim 17, wherein the second obtaining subunit is configured to:
selecting a plurality of frames which are arranged in succession in time in each frame set; when the change rate among the image pixels in the plurality of frames which are continuously arranged in time is smaller than a preset change rate, randomly selecting one frame from the plurality of frames which are continuously arranged in time as a key frame;
acquiring a complete image in the key frame, acquiring a key area higher than the average value of pixels of the complete image in the complete image, and determining the image in the key area as the image of the key frame.
20. The apparatus of claim 15, wherein the played module comprises:
the first playing unit is used for sending a selection command of a target area in a plurality of areas of the index map to a server, and receiving and playing the target video transmitted by the server from a frame corresponding to the target area;
and the second playing unit is used for playing the target video from the frame corresponding to the target area in the locally stored target video.
21. A server comprising a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured for execution by one or more processors the one or more programs including instructions for:
generating an index map of a target video according to images in a plurality of frames in the target video, wherein the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
receiving a request of a terminal for requesting the target video index, and sending the index map to the terminal;
and when a selection command of a target area in a plurality of areas of the index map returned by the terminal is received, starting to transmit the target video to the terminal according to a frame corresponding to the target area.
22. A terminal comprising a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured for execution by one or more processors to perform the one or more programs including instructions for:
acquiring an index map of a target video, wherein the index map is generated according to images in a plurality of frames in the target video, and the index map comprises a plurality of areas corresponding to the images in the plurality of frames;
and when a selection command of a target area in a plurality of areas of the index map is received, starting to play the target video according to a frame corresponding to the target area.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410063134.8A CN104038832A (en) | 2014-02-22 | 2014-02-22 | Video playing method and device |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410063134.8A CN104038832A (en) | 2014-02-22 | 2014-02-22 | Video playing method and device |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN104038832A true CN104038832A (en) | 2014-09-10 |
Family
ID=51469399
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201410063134.8A Pending CN104038832A (en) | 2014-02-22 | 2014-02-22 | Video playing method and device |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN104038832A (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104219535A (en) * | 2014-09-12 | 2014-12-17 | 无锡天脉聚源传媒科技有限公司 | Program system and program broadcasting method |
| CN106326098A (en) * | 2015-07-01 | 2017-01-11 | 上海云双信息科技有限公司 | Method and device for testing page perceived performance |
| CN109976646A (en) * | 2019-03-22 | 2019-07-05 | 上海沈德医疗器械科技有限公司 | Magnetic resonance imaging control and image transmission method, server-side and program |
| CN112383819A (en) * | 2020-11-06 | 2021-02-19 | 平安科技(深圳)有限公司 | Video frame extraction method and related equipment |
| CN113259780A (en) * | 2021-07-15 | 2021-08-13 | 中国传媒大学 | Holographic multidimensional audio and video playing progress bar generating, displaying and playing control method |
| CN114117120A (en) * | 2021-11-23 | 2022-03-01 | 上海赛连信息科技有限公司 | Video file intelligent index generation system and method based on content analysis |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030061610A1 (en) * | 2001-03-27 | 2003-03-27 | Errico James H. | Audiovisual management system |
| CN101106637A (en) * | 2006-07-13 | 2008-01-16 | 中兴通讯股份有限公司 | Method for playing media files on external storage device through set-top box |
| CN101257615A (en) * | 2007-10-25 | 2008-09-03 | 复旦大学 | Streaming media distribution and user VCR operation method based on video segmentation technology |
| CN102077580A (en) * | 2009-04-30 | 2011-05-25 | 索尼公司 | Display control device, display control method, and program |
| CN102932679A (en) * | 2012-11-21 | 2013-02-13 | 合一网络技术(北京)有限公司 | Network video previewing system and method |
-
2014
- 2014-02-22 CN CN201410063134.8A patent/CN104038832A/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030061610A1 (en) * | 2001-03-27 | 2003-03-27 | Errico James H. | Audiovisual management system |
| CN101106637A (en) * | 2006-07-13 | 2008-01-16 | 中兴通讯股份有限公司 | Method for playing media files on external storage device through set-top box |
| CN101257615A (en) * | 2007-10-25 | 2008-09-03 | 复旦大学 | Streaming media distribution and user VCR operation method based on video segmentation technology |
| CN102077580A (en) * | 2009-04-30 | 2011-05-25 | 索尼公司 | Display control device, display control method, and program |
| CN102932679A (en) * | 2012-11-21 | 2013-02-13 | 合一网络技术(北京)有限公司 | Network video previewing system and method |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104219535A (en) * | 2014-09-12 | 2014-12-17 | 无锡天脉聚源传媒科技有限公司 | Program system and program broadcasting method |
| CN104219535B (en) * | 2014-09-12 | 2017-09-15 | 无锡天脉聚源传媒科技有限公司 | A kind of performance system and program broadcasting method |
| CN106326098A (en) * | 2015-07-01 | 2017-01-11 | 上海云双信息科技有限公司 | Method and device for testing page perceived performance |
| CN106326098B (en) * | 2015-07-01 | 2020-09-01 | 上海云双信息科技有限公司 | Method and device for testing page perception performance |
| CN109976646A (en) * | 2019-03-22 | 2019-07-05 | 上海沈德医疗器械科技有限公司 | Magnetic resonance imaging control and image transmission method, server-side and program |
| CN109976646B (en) * | 2019-03-22 | 2020-11-10 | 上海沈德医疗器械科技有限公司 | Magnetic resonance scanning control and image transmission method, server and program |
| CN112383819A (en) * | 2020-11-06 | 2021-02-19 | 平安科技(深圳)有限公司 | Video frame extraction method and related equipment |
| CN112383819B (en) * | 2020-11-06 | 2022-03-11 | 平安科技(深圳)有限公司 | Video frame extraction method and related equipment |
| CN113259780A (en) * | 2021-07-15 | 2021-08-13 | 中国传媒大学 | Holographic multidimensional audio and video playing progress bar generating, displaying and playing control method |
| CN113259780B (en) * | 2021-07-15 | 2021-11-05 | 中国传媒大学 | Holographic multidimensional audio and video playing progress bar generating, displaying and playing control method |
| CN114117120A (en) * | 2021-11-23 | 2022-03-01 | 上海赛连信息科技有限公司 | Video file intelligent index generation system and method based on content analysis |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110278449B (en) | Video detection method, device, equipment and medium | |
| RU2612362C1 (en) | Method of recording, method of playback, device, terminal and system | |
| CN104852885B (en) | Method, device and system for verifying verification code | |
| CN103871051A (en) | Image processing method, device and electronic equipment | |
| CN104967896A (en) | Method for displaying bulletscreen comment information, and apparatus thereof | |
| CN103596017B (en) | Video downloading method and system | |
| US9760998B2 (en) | Video processing method and apparatus | |
| CN103873883B (en) | Video playing method and device and terminal equipment | |
| CN106203235B (en) | Living body identification method and apparatus | |
| CN104618794A (en) | Method and device for playing video | |
| CN106231433B (en) | A kind of methods, devices and systems playing network video | |
| CN104038832A (en) | Video playing method and device | |
| WO2020048392A1 (en) | Application virus detection method, apparatus, computer device, and storage medium | |
| CN107124555A (en) | Method, device, computer equipment and computer-readable storage medium for controlling focusing | |
| CN107644016A (en) | A kind of multimedia titles interpretation method, multimedia titles lookup method and device | |
| CN113377976A (en) | Resource searching method and device, computer equipment and storage medium | |
| CN106791916B (en) | Method, device and system for recommending audio data | |
| US20210219028A1 (en) | Method and apparatus for selecting audio track from audio and video file | |
| CN103869977B (en) | Method for displaying image, device and electronics | |
| CN110163051B (en) | Text extraction method, device and storage medium | |
| KR102340251B1 (en) | Method for managing data and an electronic device thereof | |
| CN105513098B (en) | Image processing method and device | |
| CN107819946B (en) | Method, device and mobile terminal for voice recognition | |
| CN114510417A (en) | Image rendering effect testing method and device, electronic equipment and storage medium | |
| CN106341436B (en) | Method and device for detecting acceleration effect |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140910 |