CN112770118B

CN112770118B - Video frame image motion estimation method and related equipment

Info

Publication number: CN112770118B
Application number: CN202011639504.XA
Authority: CN
Inventors: 索士尧; 罗小伟; 郭春磊; 李�荣
Original assignee: Spreadtrum Communications Tianjin Co Ltd
Current assignee: Spreadtrum Communications Tianjin Co Ltd
Priority date: 2020-12-31
Filing date: 2020-12-31
Publication date: 2022-09-13
Anticipated expiration: 2040-12-31
Also published as: CN115633178A; CN112770118A

Abstract

Embodiments of the present invention disclose a motion estimation method and related equipment for video frame images, wherein the method includes: acquiring a first frame image in a target video, dividing the first frame image into multiple image blocks; The first center image block of the Determining multiple second candidate image blocks in the second candidate search set; using the first center image block to perform precise search for multiple second candidate image blocks, and determining the motion estimation result of the first center image block according to the precise search result; The motion estimation result of the first frame image is determined according to the motion estimation results of the plurality of image blocks of the first frame image. The embodiments of the present invention improve the overall efficiency and accuracy of motion estimation.

Description

Video frame image motion estimation method and related equipment

技术领域technical field

本发明涉及视频插帧技术领域，尤其涉及一种视频帧图像运动估计方法及相关设备。The present invention relates to the technical field of video frame interpolation, and in particular, to a video frame image motion estimation method and related equipment.

背景技术Background technique

视频插帧的运动估计过程中，需先将一帧视频图像分成不重叠的图像块，对于当前帧中的每一个图像块到参考帧中的一定搜索范围，找到与当前帧中的图像块最相似的匹配块，进而计算出运动向量。In the motion estimation process of video frame insertion, a frame of video image needs to be divided into non-overlapping image blocks. Similar matching blocks, and then calculate the motion vector.

现有多种运动估计算法。光流法是对每个像素点都独立计算运动向量获得光流场来进行运动估计。像素递归法通过对每个像素采用递归的方式更新预测值从而获得运动向量。基于块匹配的运动估计算法，通过像素间的物理距离来搜索图像中相邻帧的图像块，并根据匹配法则找到最好的结果。最简单的块匹配算法是全搜索法，在给定一个搜索范围内，例如整幅图像，依次搜索匹配块。三维递归搜索(3-Dimension Recursive Search，3DRS)法是一种块匹配算法，它的当前块继承了临近块的运动向量，根据匹配准则，计算当前块和每个候选向量对应的块的代价值，通过比较代价值找最相似的块。Various motion estimation algorithms exist. The optical flow method calculates the motion vector independently for each pixel to obtain the optical flow field for motion estimation. The pixel recursion method obtains the motion vector by recursively updating the predicted value for each pixel. The motion estimation algorithm based on block matching searches the image blocks of adjacent frames in the image through the physical distance between pixels, and finds the best result according to the matching rule. The simplest block matching algorithm is the full search method, which searches for matching blocks in sequence within a given search range, such as the entire image. The 3-Dimension Recursive Search (3DRS) method is a block matching algorithm. Its current block inherits the motion vectors of adjacent blocks. According to the matching criteria, the cost value of the current block and the block corresponding to each candidate vector is calculated. , find the most similar block by comparing the cost value.

但是采用3DRS算法进行视频帧图像的运动估计，存在搜索范围较小，没有考虑视频的整体运动情况，没有进行精确搜索等问题，而3DRS算法与其他现有算法结合进行运动估计，又存在计算复杂，运动估计耗费时间长等问题。However, using the 3DRS algorithm for motion estimation of video frame images has problems such as a small search range, does not consider the overall motion of the video, and does not perform accurate search, and the 3DRS algorithm is combined with other existing algorithms for motion estimation, which is computationally complex. , motion estimation takes a long time and so on.

发明内容SUMMARY OF THE INVENTION

本发明实施例提供了一种视频帧图像的运动估计方法及相关设备，可以增加运动估计的搜索范围，同时不需要大量的计算，提升运动估计的整体效率和准确率。The embodiments of the present invention provide a motion estimation method and related equipment for video frame images, which can increase the search range of motion estimation without requiring a large amount of calculation and improve the overall efficiency and accuracy of motion estimation.

第一方面，本发明实施例提供了一种视频帧图像的运动估计方法，该方法包括：获取目标视频中的第一帧图像，将第一帧图像划分为多个图像块；获得多个图像块中的第一中心图像块；采用第一中心图像块进行针对第一候选搜索集合中的多个第一候选图像块的粗略搜索，根据粗略搜索结果确定第二中心图像块，其中，第一候选图像块根据第一帧图像的相邻帧图像中的图像块确定；根据第二中心图像块确定第二候选搜索集合中的多个第二候选图像块，第二候选图像块根据第二中心图像块与其他图像块之间的步长距离确定，其他图像块与第二中心图像块位于同一帧图像；采用第一中心图像块进行针对多个第二候选图像块的精确搜索，根据精确搜索结果确定第一中心图像块的运动估计结果；根据第一帧图像的多个图像块的运动估计结果确定第一帧图像的运动估计结果。In a first aspect, an embodiment of the present invention provides a motion estimation method for a video frame image, the method includes: obtaining a first frame image in a target video, dividing the first frame image into multiple image blocks; obtaining multiple images The first center image block in the block; the first center image block is used to perform a rough search for a plurality of first candidate image blocks in the first candidate search set, and the second center image block is determined according to the rough search result, wherein the first The candidate image blocks are determined according to the image blocks in the adjacent frame images of the first frame image; the multiple second candidate image blocks in the second candidate search set are determined according to the second center image block, and the second candidate image blocks are determined according to the second center image block. The step distance between the image block and other image blocks is determined, and the other image blocks and the second center image block are located in the same frame of image; the first center image block is used to perform an accurate search for multiple second candidate image blocks, according to the precise search As a result, the motion estimation result of the first central image block is determined; the motion estimation result of the first frame image is determined according to the motion estimation results of multiple image blocks of the first frame image.

可见，在本申请实施例中，对视频帧图像中的图像块进行运动估计时，采用全搜索法进行帧图像上每个图像块的运动估计，针对每个图像块的搜索过程，进行两次搜索，分别为粗略搜索和精确搜索，粗略搜索的第一候选搜索集合根据当前进行运动估计的帧图像的相邻帧图像的图像块确定，精确搜索的第二候选搜索集合基于粗略搜索的结果获取预设步长内的图像块，该过程通过两次搜索扩大了搜索范围，提升了搜索结果。另外获取搜索集合的过程计算简单，提升了搜索过程的效率。It can be seen that in this embodiment of the present application, when performing motion estimation on image blocks in a video frame image, the full search method is used to estimate the motion of each image block on the frame image, and the search process for each image block is performed twice The search is a rough search and a precise search, respectively. The first candidate search set of the rough search is determined according to the image blocks of the adjacent frame images of the frame image currently undergoing motion estimation, and the second candidate search set of the precise search is obtained based on the result of the rough search. Image blocks within a preset step size, the process expands the search range and improves the search results through two searches. In addition, the process of obtaining the search set is simple in calculation, which improves the efficiency of the search process.

第二方面，本发明实施例提供了一种运动估计装置，该装置包括：获取模块，用于获取目标视频中的第一帧图像；处理模块，用于将第一帧图像划分为多个图像块，获得多个图像块中的第一中心图像块；该处理模块，还用于采用第一中心图像块进行针对第一候选搜索集合中的多个第一候选图像块的粗略搜索，根据粗略搜索结果确定第二中心图像块，其中，第一候选图像块根据第一帧图像的相邻帧图像中的图像块确定；该处理模块，还用于根据第二中心图像块确定第二候选搜索集合中的多个第二候选图像块，第二候选图像块根据第二中心图像块与其他图像块之间的步长距离确定，其他图像块与第二中心图像块位于同一帧图像；该处理模块，还用于采用第一中心图像块进行针对多个第二候选图像块的精确搜索，根据精确搜索结果确定第一中心图像块的运动估计结果；根据第一帧图像的多个图像块的运动估计结果确定第一帧图像的运动估计结果。In a second aspect, an embodiment of the present invention provides a motion estimation apparatus, the apparatus includes: an acquisition module for acquiring a first frame of image in a target video; a processing module for dividing the first frame of image into multiple images block, to obtain the first center image block in the multiple image blocks; the processing module is further configured to use the first center image block to perform a rough search for the multiple first candidate image blocks in the first candidate search set, according to the rough The search result determines the second center image block, wherein the first candidate image block is determined according to the image blocks in the adjacent frame images of the first frame image; the processing module is further configured to determine the second candidate search block according to the second center image block Multiple second candidate image blocks in the set, the second candidate image block is determined according to the step distance between the second center image block and other image blocks, and the other image blocks and the second center image block are located in the same frame of image; this process The module is also used to use the first central image block to perform an accurate search for a plurality of second candidate image blocks, and determine the motion estimation result of the first central image block according to the accurate search result; The motion estimation result determines the motion estimation result of the first frame image.

第三方面，本发明实施例提供了一种运动估计设备，包括：处理器和存储器；In a third aspect, an embodiment of the present invention provides a motion estimation device, including: a processor and a memory;

所述处理器和存储器相连，其中，所述存储器用于存储程序代码，所述处理器用于调用所述程序代码，以执行如第一方面所述的视频帧图像的运动估计方法。The processor is connected to a memory, wherein the memory is used for storing program codes, and the processor is used for calling the program codes to execute the motion estimation method for a video frame image according to the first aspect.

第四方面，本申请实施例提供一种芯片系统，包括：处理器，处理器与存储器耦合，存储器用于存储程序或指令，当程序或指令被处理器执行时，使得该芯片系统实现上述第一方面或第一方面的任一种可能的实现方式中的方法。In a fourth aspect, an embodiment of the present application provides a chip system, including: a processor, where the processor is coupled to a memory, and the memory is used to store programs or instructions, and when the programs or instructions are executed by the processor, the chip system enables the above-mentioned first A method in an aspect or any possible implementation of the first aspect.

第五方面，本发明实施例提供了一种计算机存储介质，所述计算机存储介质存储有计算机程序，所述计算机程序包括程序指令，所述程序指令当被处理器执行时，执行如第一方面所述的视频帧图像的运动估计方法。In a fifth aspect, an embodiment of the present invention provides a computer storage medium, where the computer storage medium stores a computer program, and the computer program includes program instructions, and when executed by a processor, the program instructions are executed as in the first aspect The motion estimation method of the video frame image.

第六方面，本申请实施例提供一种计算机程序产品，当计算机读取并执行计算机程序产品时，使得计算机执行上述第一方面或第一方面的任一种可能的实现方式中的方法。In a sixth aspect, an embodiment of the present application provides a computer program product that, when the computer reads and executes the computer program product, causes the computer to execute the method in the first aspect or any possible implementation manner of the first aspect.

附图说明Description of drawings

为了更清楚地说明本发明实施例技术方案，下面将对实施例描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present invention, which are of great significance to the art For those of ordinary skill, other drawings can also be obtained from these drawings without any creative effort.

图1为本申请实施例提供的一种全搜索匹配方法示意图；1 is a schematic diagram of a full search matching method provided by an embodiment of the present application;

图2A为本申请实施例提供一种视频帧图像运动估计方法流程图；FIG. 2A provides a flowchart of a method for estimating motion of a video frame image according to an embodiment of the present application;

图2B为本申请实施例提供的一种第一帧图像划分图像块的示意图；FIG. 2B is a schematic diagram of dividing an image block of a first frame of an image according to an embodiment of the present application;

图2C为本申请实施例提供的一种获取多个第一候选图像块的过程示意图；2C is a schematic diagram of a process for acquiring multiple first candidate image blocks according to an embodiment of the present application;

图2D为本申请实施例中提供的一种运动向量聚类流程图；2D is a flowchart of a motion vector clustering provided in an embodiment of the present application;

图2E为本申请实施例提供的一种确定多个第二候选图像块的过程示意图；2E is a schematic diagram of a process for determining multiple second candidate image blocks according to an embodiment of the present application;

图3为本发明实施例提供的一种运动估计装置的结构示意图；FIG. 3 is a schematic structural diagram of a motion estimation apparatus according to an embodiment of the present invention;

图4为本发明实施例提供的一种运动估计设备的结构示意图。FIG. 4 is a schematic structural diagram of a motion estimation device according to an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.

应当理解，本申请的说明书和权利要求书及附图中的术语“第一”、“第二”等是用于区别不同对象，而不是用于描述特定顺序。此外，术语“包括”和“具有”以及它们任何变形，意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元，而是可选地还包括没有列出的步骤或单元，或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。It should be understood that the terms "first", "second" and the like in the description and claims of the present application and the drawings are used to distinguish different objects, rather than to describe a specific order. Furthermore, the terms "comprising" and "having" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or units is not limited to the listed steps or units, but optionally also includes unlisted steps or units, or optionally also includes For other steps or units inherent to these processes, methods, products or devices.

在本发明中提及“实施例”意味着，结合实施例描述的特定特征、结构或特性可以包含在本发明的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例，也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是，本发明所描述的实施例可以与其它实施例相结合。Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present invention. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor a separate or alternative embodiment that is mutually exclusive of other embodiments. It is explicitly and implicitly understood by those skilled in the art that the described embodiments of the present invention may be combined with other embodiments.

首先对本申请实施例中可能涉及的专业术语进行介绍。First, the technical terms that may be involved in the embodiments of this application are introduced.

运动估计(Motion Estimation，ME):将一帧图像分成许多互不重叠的图像块，并认为图像块内所有像素的位移量都相同，对每个图像块到参考帧某一给定搜索范围内根据一定的匹配准则找出与其最相似的图像块，即匹配块，并得出两个图像块空间位置的相对偏移量的过程。Motion Estimation (ME): A frame of image is divided into many non-overlapping image blocks, and the displacement of all pixels in the image block is considered to be the same, and each image block is within a given search range of the reference frame. The process of finding the most similar image block, that is, the matching block, according to certain matching criteria, and obtaining the relative offset of the spatial position of the two image blocks.

运动向量(Motion Vector，MV):匹配块与当前块(被用于进行匹配搜索的图像块)的相对偏移量即为运动向量。Motion Vector (MV): The relative offset between the matching block and the current block (image block used for matching search) is the motion vector.

运动向量场:一帧图像中所有图像块的运动向量构成运动向量场。Motion vector field: The motion vectors of all image blocks in a frame of image constitute a motion vector field.

运动补偿：通过原始帧和运动信息，重建出原本不存在的中间帧的过程。Motion compensation: The process of reconstructing intermediate frames that did not originally exist through the original frame and motion information.

视频插帧:在视频序列中，通过运动估计与运动补偿等操作，产生新的帧，提升视频的时间分辨率的过程。Video frame interpolation: In a video sequence, through motion estimation and motion compensation, new frames are generated to improve the temporal resolution of the video.

内插帧：一个视频序列的两帧之间，通过视频插帧方法产生的新的帧。Interpolated frame: A new frame generated by the video interpolation method between two frames of a video sequence.

基于运动估计和运动补偿的视频序列的插帧过程需要利用前后相邻帧的信息，估计出内插帧相对于它们的运动。内插帧的质量好坏，取决于运动估计的准确性。为了得到质量好的内插帧，需要选用能得到真实运动向量场的运动估计算法。现有的运动估计算法，有的计算复杂度高，不易于实现，如光流场；全搜索法为最简单的块匹配算法，在给定一个搜索范围内，例如整幅图像，依次搜索匹配块，具体如图1所示，对当前帧中的图像块，在参考帧中的搜索范围内搜索匹配块然后获得运动向量，该搜索算法精度高但是计算量过于庞大；在限定的搜索范围中搜索匹配块，虽然降低了运算量，但是给定的图像块不能很好的表示物体的真实运动。The frame interpolation process of video sequences based on motion estimation and motion compensation needs to use the information of the adjacent frames before and after to estimate the motion of the interpolated frames relative to them. The quality of the interpolated frame depends on the accuracy of the motion estimation. In order to obtain high-quality interpolated frames, it is necessary to select a motion estimation algorithm that can obtain the true motion vector field. Some of the existing motion estimation algorithms have high computational complexity and are not easy to implement, such as optical flow field; the full search method is the simplest block matching algorithm, within a given search range, such as the entire image, search for matching As shown in Figure 1, for the image block in the current frame, the matching block is searched in the search range in the reference frame and then the motion vector is obtained. The search algorithm has high precision but the calculation amount is too large; in the limited search range Searching for matching blocks reduces the computational complexity, but a given image block cannot well represent the true motion of the object.

基于上述描述，请参阅图2A，图2A为本申请实施例提供一种视频帧图像运动估计方法流程图，如图2A所示，该方法包括如下步骤：Based on the above description, please refer to FIG. 2A. FIG. 2A provides a flowchart of a video frame image motion estimation method according to an embodiment of the present application. As shown in FIG. 2A, the method includes the following steps:

101、获取目标视频中的第一帧图像，将第一帧图像划分为多个图像块；101. Obtain a first frame of image in the target video, and divide the first frame of image into multiple image blocks;

102、获得多个图像块中的第一中心图像块；102. Obtain a first center image block in multiple image blocks;

103、采用第一中心图像块进行针对第一候选搜索集合中的多个第一候选图像块的粗略搜索，根据粗略搜索结果确定第二中心图像块，其中，第一候选图像块根据第一帧图像的相邻帧图像中的图像块确定；103. Use the first center image block to perform a rough search for multiple first candidate image blocks in the first candidate search set, and determine a second center image block according to the rough search result, wherein the first candidate image block is based on the first frame Image blocks in adjacent frame images of the image are determined;

104、根据第二中心图像块确定第二候选搜索集合中的多个第二候选图像块，第二候选图像块根据第二中心图像块与其他图像块之间的步长距离确定，其他图像块与第二中心图像块位于同一帧图像；104. Determine a plurality of second candidate image blocks in the second candidate search set according to the second center image block, the second candidate image blocks are determined according to the step distance between the second center image block and other image blocks, and the other image blocks is located in the same frame image as the second central image block;

105、采用第一中心图像块进行针对多个第二候选图像块的精确搜索，根据精确搜索结果确定第一中心图像块的运动估计结果；105. Use the first center image block to perform an accurate search for a plurality of second candidate image blocks, and determine the motion estimation result of the first center image block according to the precise search result;

106、根据第一帧图像中的多个图像块的运动估计结果确定第一帧图像的运动估计结果。106. Determine a motion estimation result of the first frame of image according to the motion estimation results of the multiple image blocks in the first frame of image.

在本申请实施例中，目标视频为需要进行插帧的视频，获取目标视频图像中的第一帧图像，第一帧图像为未知运动方向的帧图像，且企图在第一帧图像后进行视频插帧，即使说，假设第二帧图像为第一帧图像相邻的下一帧图像，为了确定如何在第一帧图像和第二帧图像之间进行视频插帧，需要对第一帧图像进行运动估计，具体为获取第一帧图像运动到第二帧图像所对应的运行向量。In the embodiment of the present application, the target video is a video that needs to be framed, and the first frame image in the target video image is obtained, and the first frame image is a frame image with an unknown moving direction, and it is attempted to perform a video after the first frame image. Frame interpolation, even if it is said that the second frame image is the next frame image adjacent to the first frame image, in order to determine how to perform video frame interpolation between the first frame image and the second frame image, the first frame image needs to be interpolated. Performing motion estimation, specifically, acquiring a running vector corresponding to the motion of the first frame of image to the second frame of image.

将第一帧图像划分为多个图像块，请参阅图2B，图2B为本申请实施例提供的一种第一帧图像划分图像块的示意图，如图2B中的(a)所示，可以是将第一帧图像划分为多个大小形状相同的图像块，或者，如图2B中的(b)也可以是将第一帧图像按照像素值划分大小不同的图像块，例如，针对帧图像中像素值较大(颜色较深)的区域，划分更小的图像块，即进行更细致的划分，针对帧图像中像素值较小(颜色较浅)的区域，划分更大的图像块，即进行更粗略的划分。这种划分方式是基于通常情况下认为颜色较深的区域有更多的细节，因此需要进行更细致的匹配。The first frame of image is divided into a plurality of image blocks, please refer to FIG. 2B . FIG. 2B is a schematic diagram of dividing the first frame of image into image blocks according to an embodiment of the present application. As shown in (a) of FIG. 2B , you can is to divide the first frame image into a plurality of image blocks of the same size and shape, or, as shown in (b) in FIG. 2B , may also be to divide the first frame image into image blocks of different sizes according to pixel values, for example, for the frame image In the area with larger pixel value (darker color), divide smaller image blocks, that is, perform more detailed division, and divide larger image blocks for the area with smaller pixel value (lighter color) in the frame image, That is, a rougher division is made. This division is based on the assumption that darker areas have more detail and therefore require more detailed matching.

在本申请实施例中，以将第一帧图像划分为多个大小形状相同的图像块的方法为例进行说明，将第一帧图像按照预设的尺寸划分为若干个大小相同的矩形，每一个矩形对应为一个图像块。可以选择其中任意一个图像块作为第一中心图像块，用于进行运动估计。如图2B中的(a)所述，图像块C即为本申请实施例所选择的第一中心图像块。In the embodiment of the present application, the method of dividing the first frame of image into a plurality of image blocks of the same size and shape is taken as an example for description, and the first frame of image is divided into several rectangles of the same size according to the preset size, and A rectangle corresponds to an image block. Any one of the image blocks can be selected as the first central image block for motion estimation. As shown in (a) of FIG. 2B , the image block C is the first central image block selected in this embodiment of the present application.

然后采用第一中心图像块进行与参考帧中图像块的搜索匹配，确定与第一中心图像块匹配度最高的匹配块，计算获得第一中心块与匹配块之间的运动向量，并根据该运动向量完成对第一中心图像块的运动估计。其中参考帧可以是前向参考帧或后向参考帧，前向参考帧表示当前帧(当前需要进行运动估计的图像帧)在当前时刻的前一时刻所对应的图像帧，后向参考帧表示当前帧在下一时刻可能运动到的图像帧，当前帧与前向参考帧的搜索匹配表示针对当前时刻的前一时刻的运动估计，当前帧与后向参考帧的搜索匹配表示针对当前时刻的下一时刻的运动估计。本申请实施例中以与后向参考帧的搜索匹配进行运动估计为例进行说明。Then use the first center image block to search and match the image blocks in the reference frame, determine the matching block with the highest degree of matching with the first center image block, calculate and obtain the motion vector between the first center block and the matching block, and according to the The motion vector completes the motion estimation for the first central image block. The reference frame can be a forward reference frame or a backward reference frame. The forward reference frame represents the image frame corresponding to the current frame (the image frame currently requiring motion estimation) at the previous moment of the current moment, and the backward reference frame represents the image frame corresponding to the current moment. The image frame to which the current frame may move at the next moment, the search match between the current frame and the forward reference frame indicates the motion estimation at the previous moment at the current moment, and the search match between the current frame and the backward reference frame indicates the next moment for the current moment. Motion estimation at one moment. In the embodiments of the present application, motion estimation is described by taking the search and matching with the backward reference frame as an example for description.

在本申请实施例中，采用第一中心图像块进行与参考帧(与第一帧图像相邻的下一帧图像)中图像块的搜索匹配，具体包括：采用第一中心图像块进行针对第一候选搜索集合中的多个第一候选图像块的粗略搜索，根据粗略搜索结果确定第二中心图像块，其中，第一候选图像块根据第一帧图像的相邻帧图像中的图像块确定；采用第二中心图像块进行针对第二候选搜索集合中的多个第二候选图像块的精确搜索，根据精确搜索结果确定第一帧图像的运动估计结果，其中，第二候选图像块根据第二中心图像块与其他图像块之间的步长距离确定，其他图像块与第二中心图像块位于同一帧图像。In the embodiment of the present application, using the first central image block to search and match the image blocks in the reference frame (the next frame image adjacent to the first frame image) specifically includes: using the first central image block A rough search of a plurality of first candidate image blocks in a candidate search set, the second central image block is determined according to the rough search result, wherein the first candidate image block is determined according to the image blocks in the adjacent frame images of the first frame image ; Use the second center image block to carry out an accurate search for a plurality of second candidate image blocks in the second candidate search set, and determine the motion estimation result of the first frame image according to the precise search result, wherein the second candidate image block is based on the first frame image. The step distance between the second center image block and other image blocks is determined, and the other image blocks and the second center image block are located in the same frame of image.

根据上述描述可知，在第一中心图像块进行与参考帧中图像块的搜索匹配的过程中，需要进行两次搜索，第一次为粗略搜索，用于确定第二中心图像块，第二次为精确搜索，用于最终确定运动估计结果。粗略搜索实际上是将第一中心图像块与第一候选搜索集合中的多个第一候选图像块进行匹配运算，然后根据匹配结果确定一个第二中心图像块。第二中心图像块可以是第一候选图像块中与第一中心图像块匹配度最高的匹配块，也可以是在匹配块基础上经过调整后获得的图像块。其中第一候选图像块是根据第一帧图像的相邻帧图像中的图像块确定的，例如第一候选图像块可以是第一中心图像块的相邻图像块在相邻帧图像中对应的图像块，相邻帧图像可能是第一帧图像的上一帧图像或者下一帧图像，对应的图像块表示同一个像素图像在不同时刻时，在不同帧图像上对应的图像块。或者第一候选图像块可以是根据上一帧图像块的运动向量预测到的第一中心图像块可能在下一帧图像上对应的图像块等。According to the above description, in the process of searching and matching the first central image block with the image blocks in the reference frame, two searches are required. The first is a rough search to determine the second central image block, and the second is a rough search. For precise search, it is used to finalize the motion estimation result. The rough search is actually to perform a matching operation between the first central image block and a plurality of first candidate image blocks in the first candidate search set, and then determine a second central image block according to the matching result. The second central image block may be a matching block with the highest matching degree with the first central image block among the first candidate image blocks, or may be an image block obtained after adjustment on the basis of the matching block. The first candidate image block is determined according to the image blocks in the adjacent frame images of the first frame image, for example, the first candidate image block may be the adjacent image blocks of the first central image block corresponding to the adjacent frame images Image block, the adjacent frame image may be the previous frame image or the next frame image of the first frame image, and the corresponding image block represents the image block corresponding to the same pixel image on different frame images at different times. Alternatively, the first candidate image block may be an image block or the like that may correspond to the first central image block predicted according to the motion vector of the image block of the previous frame on the image of the next frame.

根据粗略搜索结果确定第二中心图像块后，再根据第二中心图像块确定第二候选搜索集合中的多个第二候选图像块，第二中心图像块为与第一帧图像相邻的下一帧图像(可以被命名为第二帧图像)上的图像块，即是说，经过粗略搜索后，确定第一中心图像块在第一帧图像变换到下一时刻的第二帧图像时，可能位于第二中心图像块的位置。进一步地，第二中心图像块可能并不是第二帧图像上与第一中心图像块匹配度最高的图像块，因此，还可以进一步获取第二帧图像上的多个第二候选图像块，然后从第二候选图像块中获取与第一中心图像块匹配度最高的匹配块，确定为第一中心图像块在第一帧图像变换到下一时刻的第二帧图像时，最终可能所在的位置。After the second center image block is determined according to the rough search result, multiple second candidate image blocks in the second candidate search set are determined according to the second center image block, and the second center image block is the lower image block adjacent to the first frame image. An image block on one frame of image (which can be named as the second frame of image), that is to say, after a rough search, it is determined that when the first central image block is transformed from the first frame of image to the second frame of image at the next moment, Possibly at the location of the second center image patch. Further, the second center image block may not be the image block with the highest degree of matching with the first center image block on the second frame image. Therefore, multiple second candidate image blocks on the second frame image may be further acquired, and then Obtain the matching block with the highest matching degree with the first central image block from the second candidate image block, and determine the final possible position of the first central image block when the first frame image is transformed to the second frame image at the next moment .

第一帧图像中的每个图像块都可以被选择成为第一中心图像块，然后采用上述方法进行运动估计，最后根据每个图像块的运动向量确定第一帧图像的运动估计结果，进而确定进行视频插帧的位置。Each image block in the first frame of image can be selected as the first central image block, and then the above method is used for motion estimation, and finally the motion estimation result of the first frame of image is determined according to the motion vector of each image block, and then the motion estimation result is determined. The position for video interpolation.

可见，在本申请实施例中，通过获取目标视频图像中的第一帧图像，再获取第一帧图像中的第一中心图像块，针对第一中心图像块进行粗略搜索和精确搜索，其中粗略搜索时用于进行匹配搜索的图像块根据第一帧图像的相邻帧图像中的图像块确定，精确搜索时用于进行匹配搜索的图像块根据粗略搜索确定的第二中心图像块，以及第二中心图像块与预设步长范围内的图像块的匹配结果确定，最后获取与第一中心图像块匹配度最高的图像块作为最终匹配块，再确定第一中心图像块的运动估计结果。该过程通过两次搜索扩大了搜索范围，提升了搜索结果。另外获取候选图像块的过程计算简单，提升了搜索过程的效率。It can be seen that, in the embodiment of the present application, by acquiring the first frame image in the target video image, and then acquiring the first center image block in the first frame image, a rough search and a precise search are performed for the first center image block, wherein the rough search is performed. The image block used for the matching search during the search is determined according to the image blocks in the adjacent frame images of the first frame image, the image block used for the matching search during the precise search is determined according to the second center image block determined by the rough search, and the second center image block determined by the rough search. The matching result between the second center image block and the image block within the preset step size is determined, and finally the image block with the highest matching degree with the first center image block is obtained as the final matching block, and then the motion estimation result of the first center image block is determined. This process widens the search scope and improves the search results with two searches. In addition, the process of obtaining candidate image blocks is simple in calculation, which improves the efficiency of the search process.

可选地，该方法还包括确定第一候选搜索集合中的多个第一候选图像块。具体请参阅图2C，图2C为本申请实施例提供的一种获取多个第一候选图像块的过程示意图，如图2C中的(b)所示，f_n为第一帧图像，也即为当前帧，其中的图像块C为获取的第一中心图像块，首先，在当前帧f_n上，图像块C的左边和上方的第一图像块S1和第二图像块S2已经完成运动估计搜索，且空间上较其他块与图像块C的更接近，所以选择S1和S2的预测向量作为空间预测向量，即是说，如图2C中的(c)所示，根据S1和S2各自对应的预测运动向量计算获取其各自在第二帧图像f_n+1上的第三图像块S1’和第四图像块S2’，作为两个第一候选图像块。第二帧图像f_n+1为与f_n相邻的下一帧图像。Optionally, the method further includes determining a plurality of first candidate image blocks in the first candidate search set. Please refer to FIG. 2C for details. FIG. 2C is a schematic diagram of a process of acquiring multiple first candidate image blocks according to an embodiment of the present application. As shown in (b) of FIG. 2C , f _n is the first frame of image, that is, is the current frame, in which the image block C is the first central image block obtained. First, on the current frame f _n , the first image block S1 and the second image block S2 on the left and above the image block C have completed motion estimation search, and the space is closer to the image block C than other blocks, so the prediction vectors of S1 and S2 are selected as the spatial prediction vectors, that is to say, as shown in (c) in FIG. 2C, according to the corresponding corresponding The predicted motion vector of , obtains its respective third image block S1 ′ and fourth image block S2 ′ on the second frame image f _n+1 as two first candidate image blocks. The second frame image f _n+1 is the next frame image adjacent to f _n .

另外如图2C中的(a)所示，f_n-1是当前帧f_n的前一帧图像，被称为第三帧图像，选取f_n-1中对应图像块C’的右边相邻的第五图像块T1和下边相邻的第六图像块T2作为时间预测向量，其中图像块C’为f_n中图像块C在上一帧图像上对应的图像块。按照图像中物体运动的连续性，假定图像块C跟图像块T1及T2有相同的运动向量，那么T1和T2在f_n中对应的图像块为T1’和T2’，也为图像块C右边相邻的图像块和下边相邻的图像块，T1’和T2’保持上一帧图像上T1和T2的运动向量，可以获得其在f_n+1上对应的第七图像块T1”和第八图像块T2”，也作为另外两个第一候选图像块。In addition, as shown in (a) of FIG. 2C, f _n-1 is the previous frame image of the current frame f _n , which is called the third frame image, and the right adjacent image block C' in f _n-1 is selected The fifth image block T1 and the lower adjacent sixth image block T2 are used as temporal prediction vectors, where the image block C′ is the image block corresponding to the image block C in f _n on the previous frame of image. According to the continuity of object motion in the image, assuming that image block C has the same motion vector as image blocks T1 and T2, the corresponding image blocks of T1 and T2 in f _n are T1' and T2', which are also the right side of image block C. For the adjacent image blocks and the adjacent image blocks below, T1' and T2' keep the motion vectors of T1 and T2 on the previous frame of image, and the seventh image block T1" and the corresponding seventh image block T1" on f _n+1 can be obtained. Eight image blocks T2" are also used as the other two first candidate image blocks.

另外，获取图像块C在f_n+1中的相同位置对应的图像块Zero，作为零点图像块。相同位置是指图像块Zero在f_n+1中的坐标位置与图像块C在f_n中的坐标位置相同。零点图像块也为一个第一候选图像块。In addition, the image block Zero corresponding to the same position of the image block C in f _n+1 is obtained as a zero-point image block. The same position means that the coordinate position of the image block Zero in f _n+1 is the same as the coordinate position of the image block C in f _n . The zero point image block is also a first candidate image block.

对于运动复杂的区域，还可以增加一些时空邻域的图像块提高插帧质量，因此可以添加一些时间域的全局运动向量作为第一候选搜索集合的补充。For areas with complex motion, some image blocks in the temporal and spatial neighborhoods can also be added to improve the frame insertion quality, so some global motion vectors in the temporal domain can be added as a supplement to the first candidate search set.

全局运动向量主要是考虑到对于运动复杂的区域，采用根据第一中心图像块的相邻图像块确定的第一候选图像块进行搜索，可能无法捕获需要的变量，所以通过增加全局运动向量的方法来弥补，将上一帧图像对应的全局运动向量用于第一中心图像块的粗略搜索过程。The global motion vector mainly considers that for areas with complex motion, the first candidate image block determined according to the adjacent image blocks of the first central image block is used for searching, and the required variables may not be captured. Therefore, by increasing the global motion vector method To compensate, the global motion vector corresponding to the previous frame of image is used for the rough search process of the first central image block.

全局运动向量表示的是整帧图像中每个图像块的运动向量通过一定方法分类后，数量占比较多的类别的运动向量。为了找到大部分图像块的运动向量，可以对上一帧图像中的运动向量进行分类。The global motion vector represents the motion vector of the category with a larger number after the motion vector of each image block in the whole frame image is classified by a certain method. In order to find the motion vectors for most of the image blocks, the motion vectors in the previous frame of image can be classified.

具体请参阅图2D，图2D为本申请实施例中提供的一种运动向量聚类流程图，如图2D所示，第三帧图像(第一帧图像的上一帧图像)的全局运动向量获取过程包括如下步骤：Please refer to FIG. 2D for details. FIG. 2D is a flow chart of a motion vector clustering provided in an embodiment of the present application. As shown in FIG. 2D , the global motion vector of the third frame image (the previous frame image of the first frame image) The acquisition process includes the following steps:

201、将类的中心初始化为零向量，类别个数k＝1，选取的类别x＝0，设定一个距离D；201. Initialize the center of the class as a zero vector, the number of classes k=1, the selected class x=0, and a distance D is set;

202、判断选取的类别个数x是否小于4；202. Determine whether the number of selected categories x is less than 4;

203、如果是，则计算一行图像块中的运动向量和已有类的中心向量的距离d，并将距离d与距离同D进行比较；203. If so, calculate the distance d between the motion vector in a row of image blocks and the center vector of the existing class, and compare the distance d with the distance D;

204、如果距离d小于或等于距离D，则将该图像块的运动向量与已有类划分为一类，添加新的运动向量后，类中的运动向量个数+1，表示为count++；204. If the distance d is less than or equal to the distance D, the motion vector of the image block and the existing class are divided into a class, and after adding a new motion vector, the number of motion vectors in the class +1, expressed as count++;

205、如果距离d大于距离D，则将该图像的运动向量归为一个新的类，类别个数k++；确定类的总个数k是否小于K，如果是，则执行步骤206；如果否，则停止聚类；205. If the distance d is greater than the distance D, classify the motion vector of the image into a new class, the number of classes is k++; determine whether the total number of classes k is less than K, if so, execute step 206; if not, then stop clustering;

207、一个类中添加新的运动向量，重新计算当前类中运动向量的平均值，并将该平均值更新为新的类中心向量mv_c，执行步骤202；207, add a new motion vector in a class, recalculate the average value of the motion vector in the current class, and update the average value to a new class center vector mv_c, and execute step 202;

208、聚类完成后，获取类中运动向量个数count超过帧图像每行图像块个数的1/8，且运动向量个数排行前四的目标类；208. After the clustering is completed, obtain the target class whose number of motion vectors in the class exceeds 1/8 of the number of image blocks in each row of the frame image, and the number of motion vectors ranks in the top four;

209、获取目标类的目标中心向量mv_c，根据目标中心向量和第一中心图像块确定f_n+1上的全局图像块。209. Obtain the target center vector mv_c of the target class, and determine the global image block on f _n+1 according to the target center vector and the first center image block.

根据上述描述可知，对第三帧图像的每一个图像块的运动向量进行聚类，获得若干个分类类别，该若干个分类类别总个数不能大于K，本申请实施例中K可以最大设置为16，在一些情况下，如果第三帧图像为一个面积较小的图像，则K可以为小于16的值。选取其中运动向量个数超过第三帧图像每行图像块个数的1/8，且运动向量个数排行前四的目标类，每行图像块个数即为对第三帧图像横向划分的图像块个数。获得排行前四的目标类，则可以获取该4个目标类的中心向量，根据该4个中心向量，可以计算获得第一中心图像块在第二图像帧上对应的4个全局图像块，也为第一候选搜索集合中的图像块。According to the above description, the motion vector of each image block of the third frame image is clustered to obtain several classification categories, and the total number of the several classification categories cannot be greater than K. In this embodiment of the present application, K can be set to a maximum of 16. In some cases, if the third frame image is an image with a smaller area, K may be a value smaller than 16. Select the target class in which the number of motion vectors exceeds 1/8 of the number of image blocks in each line of the third frame image, and the number of motion vectors ranks in the top four, and the number of image blocks in each line is the horizontal division of the third frame image. The number of image blocks. To obtain the top four target classes, the center vectors of the four target classes can be obtained, and according to the four center vectors, the four global image blocks corresponding to the first center image block on the second image frame can be obtained, and also Image patches in the set are searched for the first candidate.

根据上述描述可知，第一候选搜索集合中包括总共9个第一候选图像块。获得第一候选搜索集合中的全部第一候选图像块后，将第一中心图像块与多个第一候选图像块分别进行匹配运算(也即粗略搜索)，获得与第一中心图像块匹配度最高的图像块，作为第二中心图像块。图像之间的匹配算法包括基于灰度的匹配算法和基于特征的匹配算法，基于特征的匹配方法中又包括平均绝对差(median absolute deviation，MAD)算法，绝对误差和(sum of absolute differences，SAD)算法，误差平方和(sum of squared difference，SSD)算法等。本申请实施例中采用SAD算法，通过求取第一中心图像块与第一候选图像块每一个对应的像素块内像素值之差的绝对值之和来确定第一中心图像块与第一候选图像块的匹配度，SAD值越大，表示两个图像块的匹配度越低。该过程具有计算复杂度低的优势，能够保证较高的粗略搜索效率。最后，获取第一候选图像块中与第一中心图像块匹配度最高的图像块，作为第二中心图像块。According to the above description, the first candidate search set includes a total of 9 first candidate image blocks. After obtaining all the first candidate image blocks in the first candidate search set, the first center image block and multiple first candidate image blocks are respectively subjected to a matching operation (that is, a rough search) to obtain a degree of matching with the first center image block. The highest image block, as the second center image block. The matching algorithm between images includes gray-based matching algorithm and feature-based matching algorithm, and feature-based matching method includes mean absolute deviation (MAD) algorithm, sum of absolute differences (SAD) algorithm. ) algorithm, sum of squared difference (SSD) algorithm, etc. The SAD algorithm is used in the embodiments of the present application, and the first central image block and the first candidate image block are determined by calculating the sum of the absolute values of the differences between the pixel values in the pixel blocks corresponding to each of the first central image block and the first candidate image block. The matching degree of the image block, the larger the SAD value, the lower the matching degree of the two image blocks. This process has the advantage of low computational complexity and can ensure high rough search efficiency. Finally, the image block with the highest matching degree with the first central image block among the first candidate image blocks is obtained as the second central image block.

在可选情况下，为了对不断获得的实际数据和原预测数据给以加权平均使预测结果更接近于实际情况，也可以在计算第一候选图像块与第一中心图像块的匹配度时，在根据SAD算法获得两者之间的像素值之差的绝对值之和(第一绝对误差和)之和，再添加平滑项smoothness1，即对第一绝对误差和与平滑项求和，获得第一估计值，平滑项具体为：In an optional case, in order to give a weighted average to the continuously obtained actual data and the original predicted data to make the prediction result closer to the actual situation, when calculating the matching degree between the first candidate image block and the first central image block, According to the SAD algorithm, the sum of the absolute values of the difference between the two pixel values (the first absolute error sum) is obtained, and then the smoothing term smoothness1 is added, that is, the first absolute error sum and the smoothing term are summed to obtain the first absolute error sum and the smoothing term. An estimated value, the smoothing term is specifically:

smoothness1＝∑|mv_c1-mv_neighbor| (1)smoothness1=∑|mv _c1 -mv _neighbor | (1)

其中mv_c1表示第一候选图像块的运动向量，mv_neighbor表示第一中心图像块在第一帧图像上的8个相邻图像块的运动向量，其中S1和S2的运动向量为空间预测向量，T1和T2的运动向量为时间预测向量，Zero的运动向量为零向量。where mv _c1 represents the motion vector of the first candidate image block, mv _neighbor represents the motion vector of 8 adjacent image blocks of the first central image block on the first frame image, and the motion vectors of S1 and S2 are spatial prediction vectors, The motion vectors of T1 and T2 are temporal prediction vectors, and the motion vector of Zero is a zero vector.

最后根据第一估计值确定第一中心图像块与第一候选图像块之间的匹配度，第一估计值越大，表示匹配度越低。Finally, the matching degree between the first central image block and the first candidate image block is determined according to the first estimated value, and the larger the first estimated value is, the lower the matching degree is.

可见，在本申请实施例中，在选择第一中心图像块进行粗略搜索对应的第一候选搜索集合中的多个第一候选图像块时，考虑了时间预测向量结合第一中心图像块相邻图像块获得的图像块，以及空间预测向量结合第一中心图像块相邻图像块获得的图像块，以及第一中心图像块在下一帧图像上相同位置对应的图像块，还有根据上一帧图像的全局运动向量确定的全局图像块。该过程充分考虑了第一中心图像块的相邻图像块在各种情况下可能对应的候选图像块，同时考虑了上一帧图像的全局运动向量，提升了获得的第一候选搜索集合的代表性和全面性，进而提升了粗略搜索结果的可靠性。It can be seen that in this embodiment of the present application, when selecting the first central image block to roughly search for multiple first candidate image blocks in the corresponding first candidate search set, the temporal prediction vector combined with the adjacent first central image block is considered The image block obtained from the image block, and the image block obtained by combining the spatial prediction vector with the adjacent image blocks of the first center image block, and the image block corresponding to the same position of the first center image block on the next frame of image, and the image block according to the previous frame The global image patch determined by the global motion vector of the image. This process fully considers the candidate image blocks that may correspond to the adjacent image blocks of the first central image block in various situations, and also considers the global motion vector of the previous frame image, which improves the obtained representative of the first candidate search set. This improves the reliability of rough search results.

在完成粗略搜索后，进一步进行精确搜索，根据前面过程可知，精确搜索时所采用的第二候选搜索集合中的多个第二候选图像块由第二中心图像块与其他图像块之间的步长距离确定。第二中心图像块和其他图像块都位于第二帧图像上，步长距离表示图像块之间的直线距离，相邻图像块之间的步长距离为1。After the rough search is completed, the precise search is further performed. According to the previous process, the multiple second candidate image blocks in the second candidate search set used in the precise search are determined by the steps between the second central image block and other image blocks. Long distance ok. The second central image block and other image blocks are all located on the second frame image, the step distance represents the straight-line distance between image blocks, and the step distance between adjacent image blocks is 1.

具体地，请参阅图2E，图2E为本申请实施例提供的一种确定多个第二候选图像块的过程示意图，第二中心图像块为图像块C0，如图2E中的(a)所示，获取与第二中心图像块步长距离为第一距离的图像块作为第一步长图像块，第一距离例如可以为1，即获取与第二中心图像块步长距离为1的图像块作为第一步长图像块，具体为图中标记为1的图像块，包括与图像块C0相邻的8个图像块。然后将这8个图像块分别与图像块C0进行匹配运算，同样可以使用前述描述的图像匹配算法，获得与图像块C0匹配度最高的第一步长图像块，为第三中心图像块1-C0，然后获取与图像块1-C0步长距离为1的图像块作为第二步长图像块，即为图中标记为1’的图像块，实际上第二步长图像块与第一步长图像块有重叠，重叠部分作为第一步长图像块，不再重复记录为第二步长图像块。Specifically, please refer to FIG. 2E. FIG. 2E is a schematic diagram of a process for determining multiple second candidate image blocks according to an embodiment of the present application. The second central image block is an image block C0, as shown in (a) of FIG. 2E. As shown, the image block whose step distance from the second center image block is the first distance is obtained as the first step image block, and the first distance can be, for example, 1, that is, the image whose step distance from the second center image block is 1 The block is used as the first long image block, specifically the image block marked as 1 in the figure, including 8 image blocks adjacent to the image block C0. Then, the 8 image blocks are respectively matched with the image block C0, and the image matching algorithm described above can also be used to obtain the first-length image block with the highest matching degree with the image block C0, which is the third center image block 1- C0, and then obtain the image block with a step distance of 1 from the image block 1-C0 as the second step size image block, which is the image block marked 1' in the figure. In fact, the second step size image block is the same as the first step size. The long image blocks overlap, and the overlapping part is used as the first step length image block, and is not repeatedly recorded as the second step length image block.

然后，如图2E中的(b)所示，获取与第二中心图像块步长距离为第二距离的图像块作为第三步长图像块，第二距离可以为3，即获取与第二中心图像块步长距离为3的图像块作为第三步长图像块，具体为图中标记为3的8个图像块。将这8个图像块分别与图像块C0进行匹配运算，获得与图像块C0匹配度最高的第三步长图像块，作为第四中心图像块3-C0，然后获取与图像块3-C0步长距离为3的图像块作为第四步长图像块，即为图中标记为3’的图像块。同样的，第四步长图像块中包含与第三步长图像块重叠的图像块，重叠部分作为第三步长图像块，不再重复记录为第四步长图像块。Then, as shown in (b) of FIG. 2E , an image block whose step distance from the second central image block is a second distance is obtained as a third step size image block, and the second distance can be 3, that is, the image block with the second distance from the second center image block is acquired The image block with the step size distance of 3 in the center image block is used as the third step size image block, which is specifically the 8 image blocks marked as 3 in the figure. The 8 image blocks are respectively matched with the image block C0, and the third step size image block with the highest matching degree with the image block C0 is obtained as the fourth center image block 3-C0, and then the step with the image block 3-C0 is obtained. The image block whose long distance is 3 is used as the fourth step size image block, which is the image block marked 3' in the figure. Similarly, the fourth step size image block includes image blocks that overlap with the third step size image block, and the overlapping portion is used as the third step size image block, and is not repeatedly recorded as the fourth step size image block.

根据上述方法获取的第二中心图像块，第一步长图像块，第二步长图像块，第三步长图像块和第四步长图像块组成第二候选搜索集合中的多个第二候选图像块。然后采用第一中心图像块进行针对多个第二候选图像块的精确搜索，即是将第二中心图像块分别与多个第二候选图像块中的每个图像块进行图像匹配，采用的匹配方法例如可以为前述描述的基于灰度的匹配算法或者基于特征的匹配算法，特别地，可以采用SAD算法，能够提升匹配效率。最后获取多个第二候选图像块中与第一中心图像块匹配度最高的图像块，被称为最终匹配块。The second center image block obtained according to the above method, the first step image block, the second step image block, the third step image block and the fourth step image block form a plurality of second image blocks in the second candidate search set candidate image blocks. Then use the first central image block to perform an accurate search for multiple second candidate image blocks, that is, perform image matching between the second central image block and each image block in the multiple second candidate image blocks, and use the matching The method can be, for example, the gray-scale-based matching algorithm or the feature-based matching algorithm described above, in particular, the SAD algorithm can be used, which can improve the matching efficiency. Finally, the image block with the highest matching degree with the first central image block among the plurality of second candidate image blocks is obtained, which is called the final matching block.

可选情况下，在计算第二候选图像块与第一中心图像块的匹配度时，在根据SAD算法获得两者之间的像素值之差的绝对值之和(第二绝对误差和)之和，也可以再添加平滑项smoothness2和距离差distance，即对第二绝对误差和与平滑项以及距离差求和，获得第二估计值，对应公式为：Optionally, when calculating the degree of matching between the second candidate image block and the first central image block, the sum of the absolute values of the pixel value differences between the two (the second absolute error sum) is obtained according to the SAD algorithm. and, the smoothing term smoothness2 and the distance difference distance can also be added, that is, the second absolute error sum, the smoothing term and the distance difference are summed to obtain the second estimated value, and the corresponding formula is:

smoothness2＝∑|mv_c2-mv_neighbor| (2)smoothness2=∑|mv _c2 -mv _neighbor | (2)

distance＝max(|x|,|y|) (3)distance=max(|x|,|y|) (3)

其中mv_c2表示第二候选图像块的运动向量，mv_neighbor表示第一中心图像块在第一帧图像上的8个相邻图像块的运动向量，第二候选图像块的运动向量也可以为时间预测向量或空间预测向量。距离差(第一距离差)为第一中心图像块与第二候选图像块之间的运动向量的x或者y方向偏移的绝对值的最大值。Where mv _c2 represents the motion vector of the second candidate image block, mv _neighbor represents the motion vector of 8 adjacent image blocks of the first central image block on the first frame image, and the motion vector of the second candidate image block can also be time Prediction vector or spatial prediction vector. The distance difference (first distance difference) is the maximum value of the absolute value of the offset in the x or y direction of the motion vector between the first central image block and the second candidate image block.

最后根据第二估计值确定第一中心图像块与第二候选图像块之间的匹配度，第二估计值越大，表示匹配度越低。Finally, the matching degree between the first central image block and the second candidate image block is determined according to the second estimated value, and the larger the second estimated value is, the lower the matching degree is.

可见，在本申请实施例中，在选择第一中心图像块进行精确搜索对应的第二候选搜索集合中的多个第二候选图像块时，根据步长距离分别为第一距离和第二距离获得了第二步长图像块和第四步长图像块，然后根据第一中心图像块与第二步长图像块和第四步长图像块的匹配结果获得了精确搜索结果，这个过程能够进一步增加运动估计过程中的搜索范围，同时增加搜索图像块的数量，并且获取候选图像块的过程简单，进而提升了运动估计的效率和准确率。It can be seen that, in the embodiment of the present application, when selecting the first central image block to precisely search for multiple second candidate image blocks in the corresponding second candidate search set, the step distance is the first distance and the second distance respectively. The second step size image block and the fourth step size image block are obtained, and then the exact search results are obtained according to the matching results of the first center image block and the second step size image block and the fourth step size image block. This process can be further The search range in the motion estimation process is increased, the number of search image blocks is increased, and the process of acquiring candidate image blocks is simple, thereby improving the efficiency and accuracy of motion estimation.

基于上述视频帧图像的运动估计方法实施例的描述，本发明实施例还公开了一种运动估计装置，参考图3，图3是本发明实施例提供的一种运动估计装置的结构示意图，该运动估计装置300包括：Based on the description of the above embodiments of the motion estimation method for video frame images, an embodiment of the present invention further discloses a motion estimation apparatus. Referring to FIG. 3 , FIG. 3 is a schematic structural diagram of a motion estimation apparatus provided by an embodiment of the present invention. The motion estimation apparatus 300 includes:

获取模块301，用于获取目标视频中的第一帧图像；An acquisition module 301 is used to acquire the first frame of image in the target video;

处理模块302，用于将第一帧图像划分为多个图像块，获得多个图像块中的第一中心图像块；a processing module 302, configured to divide the first frame image into multiple image blocks, and obtain the first central image block in the multiple image blocks;

处理模块302，还用于采用第一中心图像块进行针对第一候选搜索集合中的多个第一候选图像块的粗略搜索，根据粗略搜索结果确定第二中心图像块，其中，第一候选图像块根据第一帧图像的相邻帧图像中的图像块确定；The processing module 302 is further configured to use the first central image block to perform a rough search for a plurality of first candidate image blocks in the first candidate search set, and determine the second central image block according to the rough search result, wherein the first candidate image The block is determined according to the image blocks in the adjacent frame images of the first frame image;

处理模块302，还用于根据第二中心图像块确定第二候选搜索集合中的多个第二候选图像块，第二候选图像块根据第二中心图像块与其他图像块之间的步长距离确定，其他图像块与第二中心图像块位于同一帧图像；The processing module 302 is further configured to determine a plurality of second candidate image blocks in the second candidate search set according to the second center image block, and the second candidate image blocks are based on the step distance between the second center image block and other image blocks. It is determined that the other image blocks and the second center image block are located in the same frame of image;

处理模块302，还用于采用第一中心图像块进行针对多个第二候选图像块的精确搜索，根据精确搜索结果确定第一中心图像块的运动估计结果；根据第一帧图像的多个图像块的运动估计结果确定第一帧图像的运动估计结果。The processing module 302 is further configured to use the first center image block to perform an accurate search for multiple second candidate image blocks, and determine the motion estimation result of the first center image block according to the precise search result; The motion estimation result of the block determines the motion estimation result of the first frame image.

可选地，处理模块302还用于确定第一候选搜索集合中的多个第一候选图像块，具体用于：Optionally, the processing module 302 is further configured to determine multiple first candidate image blocks in the first candidate search set, and is specifically configured to:

获取第一图像块的第一运动向量和第二图像块的第二运动向量，并根据第一图像块和第一运动向量确定第一帧图像运动到第二帧图像时第一图像块对应的第三图像块，根据第二图像块和第二运动向量确定第一帧图像运动到第二帧图像时第二图像块对应的第四图像块，其中，第一图像块为第一中心图像块的左侧相邻图像块，第二图像块为第一中心图像块的上方相邻图像块，第二帧图像为与第一帧图像相邻的下一帧图像；Obtain the first motion vector of the first image block and the second motion vector of the second image block, and determine, according to the first image block and the first motion vector, the corresponding motion of the first image block when the first frame of image moves to the second frame of image The third image block is to determine, according to the second image block and the second motion vector, the fourth image block corresponding to the second image block when the first frame of image moves to the second frame of image, wherein the first image block is the first central image block The left adjacent image block of , the second image block is the adjacent image block above the first central image block, and the second frame image is the next frame image adjacent to the first frame image;

获取第五图像块由第三帧图像上的位置运动到当前位置的第三运动向量，和第六图像块由第三帧图像上的位置运动到当前位置的第四运动向量，并根据第五图像块和第三运动向量确定第一帧图像运动到第二帧图像时第五图像块对应的第七图像块，根据第六图像块和第四运动向量确定第一帧图像运动到第二帧图像时第六图像块对应的第八图像块，其中，第五图像块为第一中心图像块的右侧相邻图像块，第六图像块为第一中心图像块的下方相邻图像块，第三帧图像为与第一帧图像相邻的上一帧图像；Obtain the third motion vector of the fifth image block moving from the position on the third frame image to the current position, and the fourth motion vector of the sixth image block moving from the position on the third frame image to the current position, and according to the fifth The image block and the third motion vector determine the seventh image block corresponding to the fifth image block when the image of the first frame moves to the image of the second frame, and the image of the first frame is determined to move to the second frame according to the sixth image block and the fourth motion vector The eighth image block corresponding to the sixth image block in the image, wherein the fifth image block is the adjacent image block on the right side of the first central image block, and the sixth image block is the adjacent image block below the first central image block, The third frame image is the previous frame image adjacent to the first frame image;

获取第一中心图像块在第二帧图像中对应的零点图像块，零点图像块在第二帧图像中的坐标位置与第一中心图像块在第一帧图像中的坐标位置相同；Obtain the zero point image block corresponding to the first center image block in the second frame image, and the coordinate position of the zero point image block in the second frame image is the same as the coordinate position of the first center image block in the first frame image;

获取第三帧图像的全局运动向量，全局运动向量根据对第三帧图像划分的多个图像块对应的运动向量聚类获得；Obtaining the global motion vector of the third frame of image, the global motion vector is obtained according to the motion vector clustering corresponding to the plurality of image blocks divided into the third frame of image;

根据全局运动向量获取第一中心图像块在第二帧图像上的对应的图像块作为全局图像块；According to the global motion vector, the corresponding image block of the first central image block on the second frame image is obtained as the global image block;

第三图像块，第四图像块，第七图像块，第八图像块，零点图像块和全局图像块组成第一候选搜索集合中的多个第一候选图像块。可选地，粗略搜索结果为第一中心图像块与多个第一候选图像块之间的匹配结果，根据粗略搜索结果确定第二中心图像块，包括：The third image block, the fourth image block, the seventh image block, the eighth image block, the zero point image block and the global image block form a plurality of first candidate image blocks in the first candidate search set. Optionally, the rough search result is a matching result between the first center image block and a plurality of first candidate image blocks, and determining the second center image block according to the rough search result, including:

确定第一候选图像块中与第一中心图像块匹配度最高的图像块作为第二中心图像块。An image block with the highest matching degree with the first central image block among the first candidate image blocks is determined as the second central image block.

可选地，处理模块还用于确定所述第一中心图像块与所述多个第一候选图像块之间的匹配度，具体用于：Optionally, the processing module is further configured to determine the degree of matching between the first central image block and the plurality of first candidate image blocks, specifically for:

计算获得所述第一中心图像块与所述第一候选图像块的第一绝对误差和，所述第一绝对误差和为所述第一中心图像块中的多个像素点与所述第一候选图像块中待匹配的多个像素点之间，像素值之差的绝对值之和；Calculate and obtain the first absolute error sum of the first central image block and the first candidate image block, where the first absolute error sum is a plurality of pixels in the first central image block and the first The sum of the absolute values of the difference between the pixel values between the multiple pixels to be matched in the candidate image block;

将所述第一绝对误差和与第一平滑项求和，获得第一估计值，所述第一平滑项根据所述第一候选图像块的运动向量与所述第一中心图像块的相邻图像块的运动向量的差值和确定；Summing the first absolute error sum and a first smoothing term to obtain a first estimated value, and the first smoothing term is based on the motion vector of the first candidate image block and the neighbor of the first central image block The difference value sum determination of the motion vector of the image block;

根据所述第一估计值确定所述第一中心图像块与所述第一候选图像块之间的匹配度。The degree of matching between the first central image block and the first candidate image block is determined according to the first estimated value.

可选地，处理模块302还用于确定第二候选搜索集合中的多个第二候选图像块，具体用于：Optionally, the processing module 302 is further configured to determine multiple second candidate image blocks in the second candidate search set, and is specifically configured to:

获取与第二中心图像块步长距离为第一距离的图像块作为第一步长图像块；acquiring an image block whose step distance from the second center image block is the first distance as the first step length image block;

根据第二中心图像块与第一步长图像块的匹配结果确定第三中心图像块；Determine the third center image block according to the matching result between the second center image block and the first-length image block;

获取与第三中心图像块步长距离为第一距离的图像块作为第二步长图像块；acquiring an image block whose step distance from the third central image block is the first distance as a second step image block;

获取与第二中心图像块步长距离为第二距离的图像块作为第三步长图像块，第二距离大于第一距离；acquiring an image block whose step distance from the second center image block is a second distance as a third step image block, and the second distance is greater than the first distance;

根据第二中心图像块与第三步长图像块的匹配结果确定第四中心图像块；Determine the fourth center image block according to the matching result of the second center image block and the third step size image block;

获取与第四中心图像块步长距离为第二距离的图像块作为第四步长图像块；acquiring an image block whose step distance from the fourth central image block is the second distance as the fourth step image block;

第二中心图像块，第一步长图像块，第二步长图像块，第三步长图像块和第四步长图像块组成第二候选搜索集合中的多个第二候选图像块。The second central image block, the first-sized image block, the second-sized image block, the third-sized image block and the fourth-sized image block form a plurality of second candidate image blocks in the second candidate search set.

可选地，精确搜索结果为第一中心图像块与多个第二候选图像块的匹配结果，根据精确搜索结果确定第一中心图像块的运动估计结果，包括：Optionally, the precise search result is a matching result between the first central image block and a plurality of second candidate image blocks, and the motion estimation result of the first central image block is determined according to the precise search result, including:

根据精确搜索结果确定最终匹配块，最终匹配块为多个第二候选图像块中，与第一中心图像块匹配度最高的图像块；The final matching block is determined according to the precise search result, and the final matching block is the image block with the highest matching degree with the first central image block among the plurality of second candidate image blocks;

计算获得第一中心图像块与最终匹配块之间的最终运动向量，作为第一中心图像块的运动估计结果。The final motion vector between the first central image block and the final matching block is obtained by calculation as the motion estimation result of the first central image block.

可选地，处理模块302还用于确定第一中心图像块与所述多个第二候选图像块的匹配度，具体用于：Optionally, the processing module 302 is further configured to determine the degree of matching between the first central image block and the plurality of second candidate image blocks, and is specifically configured to:

计算获得所述第一中心图像块与所述第二候选图像块的第二绝对误差和，所述第二绝对误差和为所述第一中心图像块中的多个像素点与所述第二候选图像块中待匹配的多个像素点之间，像素值之差的绝对值之和；Calculate and obtain a second absolute error sum of the first central image block and the second candidate image block, where the second absolute error sum is a plurality of pixels in the first central image block and the second absolute error sum The sum of the absolute values of the difference between the pixel values between the multiple pixels to be matched in the candidate image block;

将所述第二绝对误差和与第二平滑项，以及第一距离差求和，获得第二估计值，所述第二平滑项根据所述第二候选图像块的运动向量与所述第一中心图像块的相邻图像块的运动向量的差值和确定，所述第一距离差根据所述第一中心图像块与所述第二候选图像块之间的运动向量的x或者y方向偏移的绝对值的最大值确定；Summing the second absolute error sum, the second smoothing term, and the first distance difference to obtain a second estimated value, the second smoothing term is based on the motion vector of the second candidate image block and the first The difference sum of the motion vectors of the adjacent image blocks of the central image block is determined, and the first distance difference is determined according to the x or y direction offset of the motion vector between the first central image block and the second candidate image block. The maximum value of the absolute value of the shift is determined;

值得指出的是，其中，运动估计装置的具体功能实现方式可以参见上述运动估计方法的描述，这里不再进行赘述。所述的运动估计装置中的各个单元或模块可以分别或全部合并为一个或若干个另外的单元或模块来构成，或者其中的某个(些)单元或模块还可以再拆分为功能上更小的多个单元或模块来构成，这可以实现同样的操作，而不影响本发明的实施例的技术效果的实现。上述单元或模块是基于逻辑功能划分的，在实际应用中，一个单元(或模块)的功能也可以由多个单元(或模块)来实现，或者多个单元(或模块)的功能由一个单元(或模块)实现。It is worth noting that, for the specific function implementation manner of the motion estimation apparatus, reference may be made to the description of the above motion estimation method, which will not be repeated here. Each unit or module in the motion estimation device may be combined into one or several other units or modules, respectively or all, or some of the units or modules may be further split into functionally more functional units. It is composed of multiple small units or modules, which can realize the same operation without affecting the realization of the technical effects of the embodiments of the present invention. The above units or modules are divided based on logical functions. In practical applications, the function of one unit (or module) can also be implemented by multiple units (or modules), or the functions of multiple units (or modules) are implemented by one unit. (or module) implementation.

关于上述实施例中描的各个装置、产品包含模块/单元，其可以是软件模块/单元，也可以是硬件模块/单元，或者也可以部分是软件模块/单元，部分是硬件模块/单元。例如，对于应用或集成芯片的各个装置、产品其包含的各个模块/单元可以都采用电路等硬件的方式实现，或者至少部分模块/单元可以采用软件程序的方式实现，该运行于芯片内部集成处理器，剩余的部分模块/单元可以采用电路等硬件方式实现；对于应于或集成芯片模组的各个装置、产品，其包含的各个模块/单元可以都采用电路等硬件的方式实现，不同模块/单元可以位于芯片模组的同一件(例如片、电路模块等)或者不同组件中，至少部分/单元可以采用软件程序的方式实现，该软件程运行于芯片模组内部集成处理器剩余部分模块/单元可以采用电路等硬件方式实现；对于应或集成终端的各个装置、产品，其包含的模块/单元可以都采用电路等硬件的方式实现，不同的模块/单元可以位于终端内同一组件(例如，芯片、电路模块等)或者不同组件中，或者至少部分模块/单元可以采用软件程序的方式实现，该序运行于终端内部集成的处理器，剩余分模块/单元可以采用电路等硬件方式实现。Each device and product described in the above embodiments includes modules/units, which may be software modules/units or hardware modules/units, or may be partly software modules/units and partly hardware modules/units. For example, for each device or product of an application or integrated chip, each module/unit included in the product may be implemented by hardware such as circuits, or at least some modules/units may be implemented by software programs, which run on the integrated processing inside the chip. The remaining part of the modules/units can be implemented by hardware such as circuits; for each device and product corresponding to or integrated with chip modules, each module/unit included can be implemented by hardware such as circuits. The unit may be located in the same piece of the chip module (such as a chip, a circuit module, etc.) or in different components, and at least part/unit may be implemented by a software program that runs on the remaining part of the integrated processor module inside the chip module/ The unit can be implemented in hardware such as circuits; for each device or product that corresponds to or integrates with the terminal, the modules/units it contains can be implemented in hardware such as circuits, and different modules/units can be located in the same component in the terminal (for example, Chips, circuit modules, etc.) or different components, or at least some of the modules/units can be implemented in the form of software programs, which run on the processor integrated inside the terminal, and the remaining sub-modules/units can be implemented in hardware such as circuits.

基于上述方法实施例以及装置实施例的描述，本发明实施例还提供一种运动估计设备。请参见图4，是本发明实施例提供的一种运动估计设备的结构示意图。如图4所示，上述的运动估计装置300可以应用于所述运动估计设备400，所述运动估计设备400可以包括：处理器401，网络接口404和存储器405，此外，所述运动估计设备400还可以包括：用户接口403，和至少一个通信总线402。其中，通信总线402用于实现这些组件之间的连接通信。其中，用户接口403可以包括显示屏(Display)、键盘(Keyboard)，可选用户接口403还可以包括标准的有线接口、无线接口。网络接口404可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器405可以是高速RAM存储器，也可以是非不稳定的存储器(non-volatilememory)，例如至少一个磁盘存储器。存储器405可选的还可以是至少一个位于远离前述处理器401的存储装置。如图4所示，作为一种计算机存储介质的存储器405中可以包括操作系统、网络通信模块、用户接口模块以及设备控制应用程序。Based on the descriptions of the foregoing method embodiments and apparatus embodiments, an embodiment of the present invention further provides a motion estimation device. Please refer to FIG. 4 , which is a schematic structural diagram of a motion estimation device provided by an embodiment of the present invention. As shown in FIG. 4 , the above-mentioned motion estimation apparatus 300 can be applied to the motion estimation apparatus 400 , and the motion estimation apparatus 400 may include: a processor 401 , a network interface 404 and a memory 405 , in addition, the motion estimation apparatus 400 Also included: a user interface 403, and at least one communication bus 402. Among them, the communication bus 402 is used to realize the connection and communication between these components. The user interface 403 may include a display screen (Display) and a keyboard (Keyboard), and the optional user interface 403 may also include a standard wired interface and a wireless interface. Optionally, the network interface 404 may include a standard wired interface and a wireless interface (eg, a WI-FI interface). The memory 405 may be high-speed RAM memory or non-volatile memory, such as at least one disk memory. The memory 405 can optionally also be at least one storage device located away from the aforementioned processor 401 . As shown in FIG. 4 , the memory 405 as a computer storage medium may include an operating system, a network communication module, a user interface module, and a device control application program.

在图4所示的运动估计设备400中，网络接口404可提供网络通讯功能；而用户接口403主要用于为用户提供输入的接口；而处理器401可以用于调用存储器405中存储的设备控制应用程序，以实现上述视频帧图像的运动估计方法的步骤。In the motion estimation device 400 shown in FIG. 4 , the network interface 404 can provide a network communication function; the user interface 403 is mainly used to provide an input interface for the user; and the processor 401 can be used to call the device control stored in the memory 405 An application program to implement the steps of the above-mentioned motion estimation method for video frame images.

应当理解，本发明实施例中所描述的运动估计设备400可执行前文所述视频帧图像的运动估计方法，也可执行前文所述运动估计装置的描述，在此不再赘述。另外，对采用相同方法的有益效果描述，也不再进行赘述。It should be understood that the motion estimation apparatus 400 described in the embodiments of the present invention can perform the motion estimation method for the video frame image described above, and can also perform the description of the motion estimation apparatus described above, which is not repeated here. In addition, the description of the beneficial effects of using the same method will not be repeated.

此外，这里需要指出的是：本发明实施例还提供了一种计算机存储介质，且所述计算机存储介质中存储有前文提及的视频处理装置所执行的计算机程序，且所述计算机程序包括程序指令，当所述处理器执行所述程序指令时，能够执行前文所述视频处理方法的描述，因此，这里将不再进行赘述。另外，对采用相同方法的有益效果描述，也不再进行赘述。对于本发明所涉及的计算机存储介质实施例中未披露的技术细节，请参照本发明方法实施例的描述。In addition, it should be pointed out here that an embodiment of the present invention further provides a computer storage medium, and the computer storage medium stores a computer program executed by the aforementioned video processing apparatus, and the computer program includes a program instruction, when the processor executes the program instruction, it can execute the description of the video processing method described above, and therefore, it will not be repeated here. In addition, the description of the beneficial effects of using the same method will not be repeated. For technical details not disclosed in the computer storage medium embodiments involved in the present invention, please refer to the description of the method embodiments of the present invention.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程，是可以通过计算机程序来指令相关的硬件来完成，所述的程序可存储于一计算机可读取存储介质中，该程序在执行时，可包括如上述各方法的实施例的流程。其中，所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory，ROM)或随机存储记忆体(Random AccessMemory，RAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through a computer program, and the program can be stored in a computer-readable storage medium. During execution, the processes of the embodiments of the above-mentioned methods may be included. The storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM) or the like.

以上所揭露的仅为本发明较佳实施例而已，当然不能以此来限定本发明之权利范围，因此依本发明权利要求所作的等同变化，仍属本发明所涵盖的范围。The above disclosures are only preferred embodiments of the present invention, and of course, the scope of the rights of the present invention cannot be limited by this. Therefore, equivalent changes made according to the claims of the present invention are still within the scope of the present invention.

Claims

1. A method for motion estimation of a video frame image, the method comprising:

acquiring a first frame image in a target video, and dividing the first frame image into a plurality of image blocks;

obtaining a first central image block of the plurality of image blocks;

determining a plurality of first candidate image blocks in the first candidate search set, specifically including:

acquiring a first motion vector of a first image block and a second motion vector of a second image block, determining a third image block corresponding to the first image block when the first frame image moves to the second frame image according to the first image block and the first motion vector, determining a fourth image block corresponding to the second image block when the first frame image moves to the second frame image according to the second image block and the second motion vector, wherein the first image block is a left adjacent image block of a first central image block, the second image block is an upper adjacent image block of the first central image block, and the second frame image is a next frame image adjacent to the first frame image;

acquiring a third motion vector of a fifth image block moving from a position on a third frame image to a current position and a fourth motion vector of a sixth image block moving from the position on the third frame image to the current position, determining a seventh image block corresponding to the fifth image block when the first frame image moves to a second frame image according to the fifth image block and the third motion vector, determining an eighth image block corresponding to the sixth image block when the first frame image moves to the second frame image according to the sixth image block and the fourth motion vector, wherein the fifth image block is a right adjacent image block of the first central image block, the sixth image block is a lower adjacent image block of the first central image block, and the third frame image is a previous frame image adjacent to the first frame image;

acquiring a zero-point image block corresponding to the first center image block in the second frame image, wherein the coordinate position of the zero-point image block in the second frame image is the same as the coordinate position of the first center image block in the first frame image;

acquiring a global motion vector of the third frame of image, wherein the global motion vector is obtained according to motion vector clusters corresponding to a plurality of image blocks divided by the third frame of image;

acquiring a corresponding image block of the first central image block on the second frame image according to the global motion vector to be used as a global image block;

the third image block, the fourth image block, the seventh image block, the eighth image block, the zero image block and the global image block form a plurality of first candidate image blocks in the first candidate search set;

performing a rough search for the plurality of first candidate image blocks in a first candidate search set by using the first center image block, and determining a second center image block according to a rough search result, wherein the first candidate image block is determined according to image blocks in adjacent frame images of the first frame image;

determining a plurality of second candidate image blocks in a second candidate search set according to the second central image block, wherein the second candidate image blocks are determined according to step length distances between the second central image block and other image blocks, and the other image blocks and the second central image block are located in the same frame of image;

adopting the first central image block to perform accurate search aiming at the plurality of second candidate image blocks, and determining a motion estimation result of the first central image block according to an accurate search result;

and determining the motion estimation result of the first frame image according to the motion estimation results of the plurality of image blocks of the first frame image.

2. The method of claim 1, wherein the coarse search result is a match between the first central image block and the first candidate image blocks, and wherein determining a second central image block according to the coarse search result comprises:

and determining the image block with the highest matching degree with the first central image block in the first candidate image blocks as the second central image block.

3. The method according to claim 2, further comprising determining a degree of matching between the first center image block and the plurality of first candidate image blocks, specifically comprising:

calculating to obtain a first absolute error sum of the first central image block and the first candidate image block, wherein the first absolute error sum is a sum of absolute values of differences between a plurality of pixel points in the first central image block and a plurality of pixel points to be matched in the first candidate image block;

summing the first absolute error sum and a first smooth term to obtain a first estimated value, wherein the first smooth term is determined according to the difference sum of the motion vector of the first candidate image block and the motion vector of the adjacent image block of the first central image block;

and determining the matching degree between the first central image block and the first candidate image block according to the first estimated value.

4. The method according to any one of claims 1 to 3, wherein the determining a plurality of second candidate image blocks in a second candidate search set according to the second center image block specifically includes:

acquiring an image block with a first distance from the step length of the second central image block as a first step length image block;

determining a third central image block according to the matching result of the second central image block and the first step length image block;

acquiring an image block with the step length distance from the third central image block being the first distance as a second step length image block;

acquiring an image block with a step length distance from the second central image block as a second distance which is greater than the first distance as a third step length image block;

determining a fourth central image block according to the matching result of the second central image block and the third step length image block;

acquiring an image block with the step length distance from the fourth central image block being the second distance as a fourth step length image block;

the second center image block, the first step length image block, the second step length image block, the third step length image block and the fourth step length image block form a plurality of second candidate image blocks in the second candidate search set.

5. The method of claim 4, wherein the exact search result is a matching result of the first central image block and the plurality of second candidate image blocks, and wherein determining the motion estimation result of the first central image block according to the exact search result comprises:

determining a final matching block according to the accurate searching result, wherein the final matching block is the image block with the highest matching degree with the first central image block in the plurality of second candidate image blocks;

and calculating to obtain a final motion vector between the first central image block and the final matching block as a motion estimation result of the first central image block.

6. The method according to claim 5, further comprising determining a degree of matching of the first center image block with the plurality of second candidate image blocks, specifically comprising:

calculating to obtain a second absolute error sum of the first central image block and the second candidate image block, wherein the second absolute error sum is the sum of absolute values of differences of pixel values between a plurality of pixel points in the first central image block and a plurality of pixel points to be matched in the second candidate image block;

summing the second sum of absolute errors with a second smooth term determined according to a sum of differences between the motion vector of the second candidate image block and the motion vector of an adjacent image block to the first center image block, and a first distance difference determined according to a maximum value of an absolute value of x or y direction offset of the motion vector between the first center image block and the second candidate image block, to obtain a second estimated value;

and determining the matching degree between the first central image block and the second candidate image block according to the second estimated value.

7. A motion estimation apparatus, characterized in that the apparatus comprises:

the acquisition module is used for acquiring a first frame image in a target video;

the processing module is used for dividing the first frame image into a plurality of image blocks to obtain a first central image block in the plurality of image blocks;

the processing module is further configured to determine a plurality of first candidate image blocks in the first candidate search set, and is specifically configured to:

the processing module is further configured to perform a coarse search on a plurality of first candidate image blocks in the first candidate search set by using the first center image block, and determine a second center image block according to a coarse search result, where the first candidate image block is determined according to image blocks in adjacent frame images of the first frame image;

the processing module is further configured to determine, according to the second central image block, a plurality of second candidate image blocks in a second candidate search set, where the second candidate image block is determined according to step length distances between the second central image block and other image blocks, and the other image blocks and the second central image block are located in the same frame of image;

the processing module is further configured to perform an accurate search for the plurality of second candidate image blocks by using the first central image block, and determine a motion estimation result of the first central image block according to an accurate search result; and determining the motion estimation result of the first frame image according to the motion estimation results of the plurality of image blocks of the first frame image.

8. The apparatus of claim 7, wherein the coarse search result is a matching result between the first center image block and the first candidate image blocks, and wherein determining a second center image block according to the coarse search result comprises:

9. The apparatus according to claim 8, wherein the processing module is further configured to determine a degree of matching between the first center image block and the plurality of first candidate image blocks, and is specifically configured to:

calculating to obtain a first absolute difference sum of the first central image block and the first candidate image block, wherein the first absolute difference sum is a sum of absolute values of differences of pixel values between a plurality of pixel points in the first central image block and a plurality of pixel points to be matched in the first candidate image block;

and determining the matching degree between the first central image block and the first candidate image block according to the first estimation value.

10. The apparatus according to any of claims 7 to 9, wherein the processing module is further configured to determine a plurality of second candidate image blocks in the second candidate search set, and specifically configured to:

acquiring an image block with a second distance from the step length of the second central image block as a third step length image block, wherein the second distance is greater than the first distance;

11. The apparatus of claim 10, wherein the exact search result is a matching result of the first central image block and the plurality of second candidate image blocks, and wherein the determining the motion estimation result of the first central image block according to the exact search result comprises:

12. The apparatus according to claim 11, wherein the processing module is further configured to determine matching degrees of the first center image block and the plurality of second candidate image blocks, and is specifically configured to:

summing the second sum of absolute differences with a second smooth term determined according to the sum of differences between the motion vector of the second candidate image block and the motion vector of the adjacent image block to the first central image block, and a first distance difference determined according to the maximum value of the absolute values of x-or y-direction offsets of the motion vectors between the first central image block and the second candidate image block, to obtain a second estimated value;

13. A motion estimation device, characterized by comprising: a processor and a memory;

the processor is coupled to a memory, wherein the memory is configured to store program code and the processor is configured to invoke the program code to perform the video frame image motion estimation method of any of claims 1-6.

14. A computer storage medium, characterized in that it stores a computer program comprising program instructions which, when executed by a processor, perform a video frame image motion estimation method according to any one of claims 1 to 6.