CN111316286A

CN111316286A - Trajectory prediction method and device, storage medium, driving system and vehicle

Info

Publication number: CN111316286A
Application number: CN201980005403.6A
Authority: CN
Inventors: 崔健; 陈晓智
Original assignee: SZ DJI Technology Co Ltd
Current assignee: Shenzhen Zhuoyu Technology Co Ltd
Priority date: 2019-03-27
Filing date: 2019-03-27
Publication date: 2020-06-19
Anticipated expiration: 2039-03-27
Also published as: CN111316286B; WO2020191642A1

Abstract

Provided are a trajectory prediction method and device, a storage medium, a driving system and a vehicle. The method comprises the steps of obtaining global semantic data and global track data of an area where an object to be predicted is located (S202), then fusing the global semantic data and the global track data to obtain global fusion data (S204), extracting features in the global fusion data to obtain global features (S206), and further processing the global features by using a trained track prediction model to obtain a target track of the object to be predicted (S208). The method can realize the track prediction of the moving object by combining the global data, has higher prediction accuracy and reduces the occurrence probability of accidents to a certain extent.

Description

Trajectory prediction method and device, storage medium, driving system and vehicle

技术领域technical field

本发明涉及智能交通技术领域，尤其涉及一种轨迹预测方法及装置、存储介质、驾驶系统与车辆。The invention relates to the technical field of intelligent transportation, and in particular, to a trajectory prediction method and device, a storage medium, a driving system and a vehicle.

背景技术Background technique

随着智能交通领域的发展，对运动对象的运动轨迹的预测算法在路径规划领域具备重大意义。通过对运动对象的运动轨迹进行预测，能够在已知运动对象未来可能的运动轨迹的情况下进行路径规划，有利于防止碰撞等意外情况的发生。With the development of the field of intelligent transportation, the prediction algorithm for the trajectory of moving objects is of great significance in the field of path planning. By predicting the motion trajectory of the moving object, the path planning can be carried out under the condition that the possible motion trajectory of the moving object in the future is known, which is beneficial to prevent the occurrence of unexpected situations such as collision.

目前的轨迹预测算法一般以运动对象自身的运动数据为基础，根据运动对象所属类别确定运动对象适用的运动模型，并利用该运动模型处理运动对象自身的运动数据，之后，再通过后处理的方式整合区域语义信息，即可预测出待预测对象的运动轨迹。The current trajectory prediction algorithm is generally based on the motion data of the moving object itself, determines the motion model suitable for the moving object according to the category of the moving object, and uses the motion model to process the motion data of the moving object itself, and then uses the post-processing method. By integrating regional semantic information, the motion trajectory of the object to be predicted can be predicted.

现有的轨迹预测算法以运动对象自身的运动数据为基础，无法从全局出发进行轨迹预测，这容易导致不同运动对象的预测轨迹出现交叉，进而导致以此为依据的路径规划或调度发生碰撞等意外事故，存在较大的安全隐患。The existing trajectory prediction algorithm is based on the motion data of the moving object itself, and cannot predict the trajectory from a global perspective, which easily leads to the intersection of the predicted trajectories of different moving objects, which in turn leads to collisions in the path planning or scheduling based on this. Accidents present a greater safety hazard.

发明内容SUMMARY OF THE INVENTION

本发明实施例提供一种轨迹预测方法及装置、存储介质、驾驶系统与车辆，能够结合全局数据实现对运动对象的轨迹预测，具备较高的预测准确率，并在一定程度上降低意外事故的发生概率。Embodiments of the present invention provide a trajectory prediction method and device, a storage medium, a driving system, and a vehicle, which can realize trajectory prediction of moving objects in combination with global data, have a high prediction accuracy, and reduce accidents to a certain extent. probability of occurrence.

第一方面，本发明实施例提供了一种轨迹预测方法，包括：In a first aspect, an embodiment of the present invention provides a trajectory prediction method, including:

获取待预测对象所在区域的全局语义数据与全局轨迹数据；Obtain the global semantic data and global trajectory data of the region where the object to be predicted is located;

融合所述全局语义数据与所述全局轨迹数据，得到全局融合数据；fusing the global semantic data and the global trajectory data to obtain global fusion data;

提取所述全局融合数据中的特征，得到全局特征；Extracting features in the global fusion data to obtain global features;

利用训练好的轨迹预测模型处理所述全局特征，得到所述待预测对象的目标轨迹。The global feature is processed by using the trained trajectory prediction model to obtain the target trajectory of the object to be predicted.

第二方面，本发明实施例提供了一种轨迹预测装置，包括：In a second aspect, an embodiment of the present invention provides a trajectory prediction device, including:

获取模块，用于获取待预测对象所在区域的全局语义数据与全局轨迹数据；The acquisition module is used to acquire the global semantic data and global trajectory data of the region where the object to be predicted is located;

融合模块，用于融合所述全局语义数据与所述全局轨迹数据，得到全局融合数据；a fusion module, configured to fuse the global semantic data and the global trajectory data to obtain global fusion data;

特征提取模块，用于提取所述全局融合数据中的特征，得到全局特征；a feature extraction module for extracting features in the global fusion data to obtain global features;

预测模块，用于利用训练好的轨迹预测模型处理所述全局特征，得到所述待预测对象的目标轨迹。The prediction module is used to process the global feature by using the trained trajectory prediction model to obtain the target trajectory of the object to be predicted.

第三方面，本发明实施例提供了一种轨迹预测装置，包括：In a third aspect, an embodiment of the present invention provides a trajectory prediction device, including:

存储器；memory;

处理器；以及processor; and

计算机程序；Computer program;

其中，所述计算机程序存储在所述存储器中，并被配置为由所述处理器执行以实现如第一方面所述的方法。Wherein, the computer program is stored in the memory and configured to be executed by the processor to implement the method of the first aspect.

第四方面，本发明实施例提供了一种计算机可读存储介质，其上存储有计算机程序，In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium on which a computer program is stored,

所述计算机程序被处理器执行以实现如第一方面所述的方法。The computer program is executed by a processor to implement the method as described in the first aspect.

第五方面，本发明实施例提供了一种驾驶系统，包括：In a fifth aspect, an embodiment of the present invention provides a driving system, including:

轨迹预测装置，用于执行如第一方面所述的方法；a trajectory prediction device for performing the method according to the first aspect;

运动控制器，用于根据所述目标轨迹控制被控制对象运动。The motion controller is used to control the movement of the controlled object according to the target trajectory.

一种可能的设计中，所述被控制对象与所述待预测对象为不同对象。In a possible design, the controlled object and the to-be-predicted object are different objects.

第六方面，本发明实施例提供了一种车辆，包括：In a sixth aspect, an embodiment of the present invention provides a vehicle, including:

如第二方面或第三方面所述的轨迹预测装置，用于执行如第一方面所述的方法。The trajectory prediction apparatus according to the second aspect or the third aspect is used to perform the method according to the first aspect.

第七方面，本发明实施例提供了一种车辆，包括：In a seventh aspect, an embodiment of the present invention provides a vehicle, including:

如第五方面所述的驾驶系统。The driving system of the fifth aspect.

第八方面，本发明实施例提供了一种无人飞行器的控制装置，包括：In an eighth aspect, an embodiment of the present invention provides a control device for an unmanned aerial vehicle, including:

如第五方面所述的驾驶系统。The driving system of the fifth aspect.

本发明实施例所提供的技术方案，通过对待预测对象所在区域的全局语义数据与全局轨迹数据进行获取与处理，能够得到该区域的全局特征，进而，利用训练好的轨迹预测模型对全局特征进行处理，即可得到待预测对象的目标轨迹，换言之，本发明实施例所提供的技术方案从全局语义和全局轨迹出发，在预测一个待预测对象的轨迹时，考虑该区域内的全部运动对象，并结合该区域的全局语义数据，以实现对该区域内任一运动对象的轨迹预测，相较于仅考虑单一的待预测对象的预测方法，本方案具备较高的预测准确率，并且，当以此为依据执行后续的路径规划或调度，也能够在一定程度上降低意外事故的发生概率，具备更高的安全性。According to the technical solution provided by the embodiment of the present invention, by acquiring and processing the global semantic data and the global trajectory data of the region where the object to be predicted is located, the global features of the region can be obtained, and then the trained trajectory prediction model is used to perform the global feature analysis. After processing, the target trajectory of the object to be predicted can be obtained. In other words, the technical solution provided by the embodiment of the present invention starts from global semantics and global trajectory. When predicting the trajectory of an object to be predicted, all moving objects in the area are considered. Combined with the global semantic data of the area, to realize the trajectory prediction of any moving object in the area, compared with the prediction method that only considers a single object to be predicted, this scheme has a higher prediction accuracy, and when Performing subsequent path planning or scheduling based on this can also reduce the probability of accidents to a certain extent and provide higher safety.

附图说明Description of drawings

图1为本发明实施例提供的一种轨迹预测场景的俯视示意图；FIG. 1 is a schematic top view of a trajectory prediction scene according to an embodiment of the present invention;

图2为本发明实施例提供的一种轨迹预测方法的流程示意图；2 is a schematic flowchart of a trajectory prediction method provided by an embodiment of the present invention;

图3为本发明实施例提供的另一种轨迹预测方法的流程示意图；3 is a schematic flowchart of another trajectory prediction method provided by an embodiment of the present invention;

图4为本发明实施例提供的另一种轨迹预测方法的流程示意图；4 is a schematic flowchart of another trajectory prediction method provided by an embodiment of the present invention;

图5为本发明实施例提供的循环神经网络模型的循环单元结构示意图；5 is a schematic diagram of a cyclic unit structure of a cyclic neural network model provided by an embodiment of the present invention;

图6为本发明实施例提供的循环神经网络模型的模型架构示意图；6 is a schematic diagram of a model architecture of a recurrent neural network model provided by an embodiment of the present invention;

图7为本发明实施例提供的长短期记忆网络模型的模型架构示意图；7 is a schematic diagram of a model architecture of a long short-term memory network model according to an embodiment of the present invention;

图8为本发明实施例提供的另一种轨迹预测方法的流程示意图；8 is a schematic flowchart of another trajectory prediction method provided by an embodiment of the present invention;

图9为本发明实施例提供的一种轨迹预测装置的功能方块图；9 is a functional block diagram of a trajectory prediction apparatus provided by an embodiment of the present invention;

图10为本发明实施例提供的一种轨迹预测装置的实体结构示意图；10 is a schematic diagram of a physical structure of a trajectory prediction apparatus provided by an embodiment of the present invention;

图11为本方面实施例提供的一种驾驶系统的架构示意图；FIG. 11 is a schematic structural diagram of a driving system according to an embodiment of the present invention;

图12为本发明实施例提供的一种车辆的架构示意图；12 is a schematic structural diagram of a vehicle according to an embodiment of the present invention;

图13为本发明实施例提供的另一种车辆的架构示意图。FIG. 13 is a schematic structural diagram of another vehicle according to an embodiment of the present invention.

通过上述附图，已示出本公开明确的实施例，后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本公开构思的范围，而是通过参考特定实施例为本领域技术人员说明本公开的概念。The above-mentioned drawings have shown clear embodiments of the present disclosure, and will be described in more detail hereinafter. These drawings and written descriptions are not intended to limit the scope of the disclosed concepts in any way, but rather to illustrate the disclosed concepts to those skilled in the art by referring to specific embodiments.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明，其示例表示在附图中。下面的描述涉及附图时，除非另有表示，不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反，它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。Exemplary embodiments will be described in detail herein, examples of which are illustrated in the accompanying drawings. Where the following description refers to the drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure as recited in the appended claims.

首先对本发明所涉及的名词进行解释：First, the terms involved in the present invention are explained:

运动对象：是指能够实现轨迹移动的生物或物体。本发明实施例所涉及到的运动对象可以包括但不限于：车辆、动物、人、机器人与无人飞行器中的至少一种。其中，车辆可以为无人驾驶车辆，如无人地面车辆(Unmanned Ground Vehicle，UGV)，或者，处于自动驾驶模式的私家车辆或公交车辆等。Moving object: refers to a creature or object that can achieve trajectory movement. The moving objects involved in the embodiments of the present invention may include, but are not limited to, at least one of vehicles, animals, people, robots, and unmanned aerial vehicles. Wherein, the vehicle may be an unmanned vehicle, such as an unmanned ground vehicle (Unmanned Ground Vehicle, UGV), or a private vehicle or a public transportation vehicle in an automatic driving mode.

待预测对象：待预测目标轨迹的一个或多个运动对象。其中，多个是指两个或两个以上，后续涉及该概念，不再赘述。Object to be predicted: One or more moving objects of the target trajectory to be predicted. Among them, a plurality of refers to two or more than two, and this concept will be involved in the following, and will not be repeated here.

语义对象：是指区域中各个语义概念上的对象。请参考图1所示场景，该场景中的语义对象包括：车辆、车道线、车道。可知，图1仅用以示例，在实际的轨迹预测场景中，语义对象的语义类别还包括多种，如：树木、障碍物、栏杆、指示牌、人、动物等，本发明实施例对于各语义对象的语义类别无特殊限定。Semantic objects: refer to the objects on each semantic concept in the area. Please refer to the scene shown in Figure 1. The semantic objects in this scene include: vehicles, lane lines, and lanes. It can be seen that FIG. 1 is only used as an example. In an actual trajectory prediction scene, the semantic categories of semantic objects also include many kinds, such as: trees, obstacles, railings, signs, people, animals, etc. The semantic categories of semantic objects are not particularly limited.

长短期记忆网络(Long Short Term Memory，LSTM)模型：是循环神经网络(Recurrent Neural Network，RNN)的一种变种模型，相较于RNN模型，LSTM具备更长的时间依赖建模能力。The Long Short Term Memory (LSTM) model is a variant of the Recurrent Neural Network (RNN) model. Compared with the RNN model, LSTM has longer time-dependent modeling capabilities.

本发明实施例所提供的技术方案具体的应用场景为：针对运动对象的轨迹预测场景。A specific application scenario of the technical solution provided by the embodiment of the present invention is: a trajectory prediction scenario for a moving object.

进一步的，本发明实施例所提供的技术方案还可以具体应用于路径规划场景，此时，可根据预测出的轨迹，来实现对某一个或多个运动对象的路径规划。Further, the technical solutions provided by the embodiments of the present invention can also be specifically applied to a path planning scenario. In this case, the path planning for one or more moving objects can be implemented according to the predicted trajectory.

此外，本发明实施例所提供的技术方案还可以具体应用于车辆调度场景。例如，通过对其他不可调度车辆或对象的轨迹预测，来实现对可调度车辆的调度。In addition, the technical solutions provided by the embodiments of the present invention can also be specifically applied to vehicle scheduling scenarios. For example, dispatchable vehicles can be dispatched by predicting the trajectory of other non-dispatchable vehicles or objects.

如背景技术所述，现有的轨迹预测方法仅是针对单独的一个运动对象，当确定该运动对象的类别后，则通过该类别对应的运动模型，对该运动对象自身的运动数据进行处理，从而，预测出该运动对象的运动轨迹。一方面，这种预测方式受制于对象类别的限制，需要准确判断出运动对象的类别，才能够以该类别对应的运动模型得到较为准确的预测结果；另一方面，这种预测方式仅依赖于运动对象本身的运动数据，并未从全局角度结合当前运动环境、其他运动对象的运动情况进行综合分析，这种未考虑运动对象所在区域内的其他运动或非运动的对象，极有可能预测出的两个同一类别的运动对象的轨迹相交，从而，若以此为依据进行路径规划或对象调度，极有可能发生碰撞等意外事故，存在较大的安全性风险。As described in the background art, the existing trajectory prediction method is only for a single moving object. After the category of the moving object is determined, the motion data of the moving object itself is processed through the motion model corresponding to the category. Thus, the motion trajectory of the moving object is predicted. On the one hand, this prediction method is limited by the object category, and it is necessary to accurately determine the category of the moving object in order to obtain more accurate prediction results with the motion model corresponding to the category; on the other hand, this prediction method only depends on The motion data of the moving object itself is not comprehensively analyzed from a global perspective combined with the current moving environment and the motion of other moving objects. This kind of objects that do not consider other moving or non-moving objects in the area where the moving object is located is very likely to be predicted. The trajectories of two moving objects of the same category intersect, so if path planning or object scheduling is performed based on this, accidents such as collisions are very likely to occur, and there is a greater safety risk.

基于此，本发明实施例提供的技术方案，旨在解决现有技术的如上技术问题，并提出如下解决思路：综合考虑待预测对象所在区域的全局数据，包括全局语义数据与全局轨迹数据，据此得到全局特征，并以全局特征作为轨迹预测模型的输入，来获取待预测对象的目标轨迹。Based on this, the technical solutions provided by the embodiments of the present invention aim to solve the above technical problems in the prior art, and propose the following solution idea: comprehensively consider the global data of the region where the object to be predicted is located, including global semantic data and global trajectory data, according to This obtains the global feature, and uses the global feature as the input of the trajectory prediction model to obtain the target trajectory of the object to be predicted.

基于这种设计，本发明实施例所提供的轨迹预测方法可以具体执行于某一运动对象的内置处理器或所持的终端设备中，或者，也可以具体执行于云端或后台服务器。Based on this design, the trajectory prediction method provided by the embodiments of the present invention may be specifically executed in a built-in processor of a moving object or a terminal device held by it, or may be specifically executed in a cloud or a background server.

举例说明。在一种可能的场景中，自动驾驶车辆的第一处理器可自行规划行驶路线，而该自动驾驶车辆的第二处理器用于执行本方案所提供的轨迹预测方法，并用于将预测到的轨迹输入第一内置理器，以便于第一处理器可根据预测出的轨迹进行后续的路径规划。其中，第一处理器和第二处理器可以为同一处理器，或者，也可以为不同处理器，例如，可以为高级驾驶辅助系统(Advanced Driving Assistant System，ADAS)中的一个或两个处理器；以及，第一处理器和第二处理器可以为车辆的总控制器中的一部分，或者，也可以为控制该无人驾驶车辆行驶的后台总服务器或云端服务器。for example. In a possible scenario, the first processor of the self-driving vehicle can plan the driving route by itself, and the second processor of the self-driving vehicle is used to execute the trajectory prediction method provided by this solution, and is used to predict the trajectory of the predicted trajectory. Input to the first internal processor, so that the first processor can perform subsequent path planning according to the predicted trajectory. The first processor and the second processor may be the same processor, or may be different processors, for example, may be one or two processors in an Advanced Driving Assistant System (ADAS) and, the first processor and the second processor may be part of the overall controller of the vehicle, or may also be a background overall server or a cloud server that controls the driving of the unmanned vehicle.

下面以具体地实施例对本发明的技术方案以及本申请的技术方案如何解决上述技术问题进行详细说明。下面这几个具体的实施例可以相互结合，对于相同或相似的概念或过程可能在某些实施例中不再赘述。下面将结合附图，对本发明的实施例进行描述。The technical solutions of the present invention and how the technical solutions of the present application solve the above-mentioned technical problems will be described in detail below with specific examples. The following specific embodiments may be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments. Embodiments of the present invention will be described below with reference to the accompanying drawings.

实施例一Example 1

本发明实施例提供了一种轨迹预测方法。请参考图2-图4，其中，图2示出了本发明实施例提供的一种轨迹预测方法的流程示意图，图3示出了本发明式实施例提供的轨迹预测方法在一种具体的应用场景下的实现流程示意图，图4为图3所示流程的一种具体实现方式。The embodiment of the present invention provides a trajectory prediction method. Please refer to FIG. 2 to FIG. 4 , wherein FIG. 2 shows a schematic flowchart of a trajectory prediction method provided by an embodiment of the present invention, and FIG. 3 shows a specific embodiment of the trajectory prediction method provided by the present invention. A schematic diagram of an implementation flow in an application scenario, FIG. 4 is a specific implementation manner of the flow shown in FIG. 3 .

如图2所示，该方法包括如下步骤：As shown in Figure 2, the method includes the following steps:

S202，获取待预测对象所在区域的全局语义数据与全局轨迹数据。S202: Obtain global semantic data and global trajectory data of the region where the object to be predicted is located.

如前所述，待预测对象为一种运动对象，其可以包括但不限于如下至少一种：车辆、动物、人、机器人与无人飞行器。此外，本发明实施例中对待预测对象的数目无特别限定，可以为一个或多个。As mentioned above, the object to be predicted is a moving object, which may include, but is not limited to, at least one of the following: a vehicle, an animal, a human, a robot, and an unmanned aerial vehicle. In addition, in this embodiment of the present invention, the number of objects to be predicted is not particularly limited, and may be one or more.

其中，全局语义数据用于描述待预测对象所在区域中各对象的语义类别，此时，各运动对象或非运动对象均作为语义对象，具备各自的语义类别。全局轨迹数据用于描述该区域中各运动对象的历史运动坐标。可知，在具体实现时，全局轨迹数据可以包括至少一帧的历史运动坐标。The global semantic data is used to describe the semantic category of each object in the region where the object to be predicted is located. In this case, each moving object or non-moving object is regarded as a semantic object and has its own semantic category. The global trajectory data is used to describe the historical motion coordinates of each moving object in this area. It can be known that, in specific implementation, the global trajectory data may include historical motion coordinates of at least one frame.

图3示出了以图1所示场景为例时，本方案的实现流程示意图。如图3所示，该行驶场景中包含3个语义，分别是：车辆、车道与车道线时，此时，需要在该场景中获取的全局语义数据，也就是各语义对象的语义数据；以及，还需要获取该场景的全局轨迹数据，也就是各运动对象：车辆1(作为待预测对象)、车辆2和车辆3的轨迹数据。FIG. 3 shows a schematic diagram of the implementation flow of this solution when the scenario shown in FIG. 1 is taken as an example. As shown in Figure 3, the driving scene contains three semantics, namely: vehicle, lane and lane line, at this time, the global semantic data that needs to be acquired in the scene, that is, the semantic data of each semantic object; and , it is also necessary to obtain the global trajectory data of the scene, that is, the trajectory data of each moving object: vehicle 1 (as the object to be predicted), vehicle 2 and vehicle 3 .

需要说明的是，图3仅为示意性的，在具体的实现场景中，全局轨迹数据与全局语义数据的表现形式不局限于单个获取的方式，可作为整体数据被获取到。It should be noted that FIG. 3 is only schematic, and in a specific implementation scenario, the representations of the global trajectory data and the global semantic data are not limited to a single acquisition method, but can be acquired as overall data.

在如图4所示的一种实现场景中，各运动对象的轨迹数据可以通过LSTM模型来实现，后续具体说明。In an implementation scenario as shown in FIG. 4 , the trajectory data of each moving object can be realized by the LSTM model, which will be described in detail later.

S204，融合所述全局语义数据与所述全局轨迹数据，得到全局融合数据。S204, fuse the global semantic data and the global trajectory data to obtain global fusion data.

具体的，如图3所示，融合模块用于将每一帧对应的全局语义数据与全局轨迹数据进行融合，得到各帧的全局融合数据。Specifically, as shown in FIG. 3 , the fusion module is used to fuse the global semantic data corresponding to each frame with the global trajectory data to obtain the global fusion data of each frame.

S206，提取所述全局融合数据中的特征，得到全局特征。S206, extracting features in the global fusion data to obtain global features.

如图3所示，特征提取模块用于在全局融合数据中提取出全局特征。在如图4所示的一种实现场景中，该步骤可以通过卷积神经网络(Convolutional Neural Networks，CNN)模型来实现，后续具体说明。As shown in Figure 3, the feature extraction module is used to extract global features from the global fusion data. In an implementation scenario as shown in FIG. 4 , this step may be implemented by a Convolutional Neural Networks (CNN) model, which will be described in detail later.

S208，利用训练好的轨迹预测模型处理所述全局特征，得到所述待预测对象的目标轨迹。S208, using the trained trajectory prediction model to process the global feature to obtain the target trajectory of the object to be predicted.

本发明采用训练好的轨迹预测模型来预测运动对象的运动轨迹，此时，该轨迹预测模型的输入为全局特征，输出为运动对象的运动轨迹。The present invention uses the trained trajectory prediction model to predict the motion trajectory of the moving object. At this time, the input of the trajectory prediction model is the global feature, and the output is the motion trajectory of the moving object.

具体而言，本发明实施例所提供的轨迹预测模型可以包括但不限于：如下至少一种：LSTM模型、多层感知器(Multi-Layer Perception，MLP)；其中，所述多层感知器包括：RNN模型、门控循环单元(gated recurrent neural network，GRU)模型。例如，图4所示的实现场景中，即通过LSTM模型来实现轨迹预测，此时，Y＝LSTM(feature)，其中，Y表示目标轨迹，feature表示全局特征。Specifically, the trajectory prediction model provided in this embodiment of the present invention may include, but is not limited to, at least one of the following: an LSTM model, a multi-layer perceptron (Multi-Layer Perception, MLP); wherein the multi-layer perceptron includes : RNN model, gated recurrent neural network (GRU) model. For example, in the implementation scenario shown in Fig. 4, the trajectory prediction is realized by the LSTM model, at this time, Y=LSTM(feature), where Y represents the target trajectory, and feature represents the global feature.

请参考图5-7，其中，图5示出了RNN模型中一个循环单元的设计逻辑，图6示出了RNN模型的模型架构示意图，图7示出了LSTM模型的模型架构示意图。Please refer to Figures 5-7, wherein Figure 5 shows the design logic of a recurrent unit in the RNN model, Figure 6 shows a schematic diagram of the model architecture of the RNN model, and Figure 7 shows a schematic diagram of the model architecture of the LSTM model.

RNN模型作为一种时序建模的有效手段，相比于普通的神经网络，如图5所示，其区别主要在于将上一帧的输出或者中间状态作为当前帧的输入，以实现对历史消息及时序关系的融合。将如图5所示的循环单元在时间展开后，即可得到如图6所示的RNN模型。如图6所示，RNN模型可以实现时序建模。因此，可以通过RNN模型来实现本方案中的轨迹预测步骤。As an effective means of time series modeling, RNN model, compared with ordinary neural network, as shown in Figure 5, the main difference is that the output of the previous frame or the intermediate state is used as the input of the current frame to realize the historical message and integration of temporal relationships. The RNN model shown in Figure 6 can be obtained after the cyclic unit shown in Figure 5 is expanded in time. As shown in Figure 6, the RNN model can realize time series modeling. Therefore, the trajectory prediction step in this scheme can be implemented through the RNN model.

如图7所示，在LSTM模型中的每一个重复模块中均包含4个交互层，这4个交互层以特殊的方式进行交互，使得上一帧的输出或者中间状态作为当前帧的输入，从而，相比于图6所示的RNN模型，LSTM模型具备更优异的时间依赖建模能力。此外，考虑到针对轨迹的预测与运动对象的历史数据具备较强的时间关联关系，因此，采用LSTM模型来实现轨迹预测可得到更为接近实际发展的轨迹预测结果。As shown in Figure 7, each repeating module in the LSTM model contains 4 interaction layers, and these 4 interaction layers interact in a special way, so that the output of the previous frame or the intermediate state is used as the input of the current frame, Therefore, compared to the RNN model shown in Figure 6, the LSTM model has better time-dependent modeling capabilities. In addition, considering that the prediction of the trajectory has a strong temporal correlation with the historical data of the moving object, using the LSTM model to realize the trajectory prediction can obtain the trajectory prediction result that is closer to the actual development.

通过前述设计，本发明实施例所提供的技术方案能够从全局数据出发，实现对任一待预测对象的轨迹预测，相较于仅以待预测对象自身的运动数据出发的轨迹预测方式，本方案具备较高的预测准确率，并且，当以此为依据执行后续的路径规划或调度，也能够在一定程度上降低意外事故的发生概率，具备更高的安全性。Through the foregoing design, the technical solution provided by the embodiments of the present invention can realize trajectory prediction for any object to be predicted based on global data. Compared with the trajectory prediction method based only on the motion data of the object to be predicted, this solution It has a high prediction accuracy, and when the subsequent path planning or scheduling is performed based on this, it can also reduce the probability of accidents to a certain extent and have higher safety.

以下，对图2所示方法的实现方式进行具体说明。Hereinafter, the implementation of the method shown in FIG. 2 will be described in detail.

S202包含两方面全局数据的获取：全局语义数据与全局轨迹数据。这两种全局数据的获取方式可以参考图8所示流程。S202 includes two aspects of global data acquisition: global semantic data and global trajectory data. For the acquisition methods of these two global data, refer to the flow shown in FIG. 8 .

一方面，如图8所示，获取全局语义数据的方式可以包括如下步骤：On the one hand, as shown in FIG. 8 , the way of acquiring global semantic data may include the following steps:

S202-12，获取所述待预测对象所在区域的全局区域图像。S202-12: Acquire a global area image of the area where the object to be predicted is located.

该全局区域图像可以为实时获取到的图像，实时获取到的图像具备更高的及时性，以此得到的全局语义数据也更为准确。具体而言，实时获取方式可以包括但不限于：通过图像采集设备实时采集图像。其中，图像采集设备可以为该轨迹预测装置(轨迹预测方法的执行主体)的一部分；或者，与该轨迹预测装置具备实时地数据交互。举例说明，若该轨迹预测装置可以为车辆的主控制器中的处理器A，则图像采集设备可以为车辆黑匣子中的摄像头，其可以将采集到的图像直接输入处理器A；或者，图像采集设备可以为区域内设置的摄像头，如道路上或路边设置的摄像头，此时，处理器A可以向该区域内摄像头或该区域内摄像头的后台服务器实时请求并接收全局区域图像。The global region image can be an image obtained in real time, and the image obtained in real time has higher timeliness, and the obtained global semantic data is also more accurate. Specifically, the real-time acquisition method may include, but is not limited to: real-time acquisition of images by an image acquisition device. Wherein, the image acquisition device may be a part of the trajectory prediction apparatus (execution subject of the trajectory prediction method); or, it may have real-time data interaction with the trajectory prediction apparatus. For example, if the trajectory prediction device can be the processor A in the main controller of the vehicle, the image acquisition device can be a camera in the black box of the vehicle, which can directly input the acquired image to the processor A; The device can be a camera set in an area, such as a camera set on a road or a roadside. At this time, the processor A can request and receive the global area image in real time from the camera in the area or the background server of the camera in the area.

除实时获取的方式之外，还可以通过调用已采集数据的方式来获取全局语义数据。具体的，一种实现场景中，可以获取高精度地图中关于该区域的全局区域图像；另一种实现场景中，可以获取其他处理器或存储器中已采集到的该区域的全局区域图像。并且，这种实现方式只能获取到该区域的环境信息，如道路、指示牌、车道线等非运动对象的非实时数据，而无法获取到实时场景中的运动对象的运动情况，因此，在以此方式实现本方案时，仅适用于针对单一待预测对象的轨迹预测，而无法结合其他运动对象的轨迹实现综合预测，预测准确率较弱。In addition to real-time acquisition, global semantic data can also be acquired by invoking the collected data. Specifically, in one implementation scenario, a global area image of the area in the high-precision map can be acquired; in another implementation scenario, a global area image of the area that has been collected in other processors or memory can be acquired. Moreover, this implementation method can only obtain the environmental information of the area, such as non-real-time data of non-moving objects such as roads, signs, lane lines, etc., but cannot obtain the movement of moving objects in the real-time scene. Therefore, in the When the solution is implemented in this way, it is only suitable for trajectory prediction of a single object to be predicted, and cannot be combined with the trajectories of other moving objects to achieve comprehensive prediction, and the prediction accuracy is weak.

需要说明的是，本发明实施例中后续进行处理的图像为俯视图像，因此，若以前述实现方式获取到的并非俯视图像时，还需要对采集到的图像进行俯视投影，以得到满足后续处理需要的俯视图像。It should be noted that, in this embodiment of the present invention, the image to be subsequently processed is an overhead image. Therefore, if the acquired image is not an overhead image in the foregoing implementation manner, an overhead projection of the collected image is also required to satisfy the subsequent processing requirements. The desired overhead image.

一种可能的设计中，本发明实施例所涉及到的俯视图像可以具体表现为：数字正射影像(Digital Orthophoto Map，DOM)图像。In a possible design, the overhead image involved in the embodiment of the present invention may be embodied as: a digital orthophoto map (Digital Orthophoto Map, DOM) image.

此外，为了便于处理，还可以进一步个性化设置“待预测对象所在区域”的区域形状或尺寸，本发明实施例对此无特别限定。具体的，可以以待预测对象为中心，获取俯视图一定长宽尺寸的矩形区域为其所在区域，例如，可以获取如图1(或图3)所示的长为W、宽为H的矩形区域的图像。又例如，还可以将待预测对象所在的整条道路作为待预测对象所在区域。In addition, in order to facilitate processing, the shape or size of the region of the "region where the object to be predicted is located" may be further personalized, which is not particularly limited in this embodiment of the present invention. Specifically, taking the object to be predicted as the center, a rectangular area with a certain length and width in the top view can be obtained as the area where it is located. For example, a rectangular area with a length W and a width H as shown in FIG. 1 (or FIG. 3 ) can be obtained Image. For another example, the entire road where the object to be predicted is located may also be used as the region where the object to be predicted is located.

S202-14，对所述全局区域图像中的各像素分别进行语义识别，得到各像素的语义类别。S202-14: Perform semantic recognition on each pixel in the global area image to obtain a semantic category of each pixel.

一种可能的设计中，可以通过深度学习来实现对各像素的语义识别。若通过该方式实现，需要在执行该步骤之前，利用预设的像素样本数据，对像素语义识别模型进行深度学习，以得到满足应用需求(可通过对损失函数的定义实现)的像素语义识别模型。如此，在执行该步骤时，仅需将全局区域图像输入该像素语义识别模型，该像素语义识别模型的输出即为各像素的语义类别。In a possible design, the semantic recognition of each pixel can be achieved through deep learning. If it is implemented in this way, it is necessary to use the preset pixel sample data to perform deep learning on the pixel semantic recognition model before performing this step, so as to obtain a pixel semantic recognition model that meets the application requirements (which can be achieved by defining the loss function). . In this way, when performing this step, it is only necessary to input the global region image into the pixel semantic recognition model, and the output of the pixel semantic recognition model is the semantic category of each pixel.

另一种可能的设计中，还可以各像素的像素值为依据，将各像素的像素值分别与各语义类别对应的像素区间进行比较，从而，针对任一像素值，将该像素值落入的像素区间对应的语义类别，作为该像素对应的语义类别。其中，各语义类别与像素区间之间的对应关系，可以通过自定义方式预设。In another possible design, the pixel value of each pixel can also be compared with the pixel interval corresponding to each semantic category based on the pixel value of each pixel, so that for any pixel value, the pixel value falls within The semantic category corresponding to the pixel interval of , as the semantic category corresponding to the pixel. The correspondence between each semantic category and the pixel interval can be preset in a custom way.

S202-16，根据各像素的语义类别，对所述全局区域图像进行语义标注，得到所述全局语义信息。S202-16: Perform semantic annotation on the global area image according to the semantic category of each pixel to obtain the global semantic information.

由于各像素的语义类别已经确定，在执行该步骤时，可根据各像素的语义类别，对所述全局区域图像进行语义分割，得到多个语义对象；从而，对对各语义对象分别进行语义标注，得到所述全局语义信息。Since the semantic category of each pixel has been determined, when performing this step, the global area image can be semantically segmented according to the semantic category of each pixel to obtain a plurality of semantic objects; thus, semantic annotation is performed on each semantic object. to obtain the global semantic information.

其中，语义标注仅用于区分各对象的语义类别，可以任意可区分方式来进行标注。例如，可以通过不同的颜色来标识各语义对象。或者，如图1(或图3)所示，可以通过不同的底纹来标识各语义对象。可知，经过标识后，具备同样标识的语义对象为同一类语义对象。Among them, semantic annotation is only used to distinguish the semantic category of each object, and can be labeled in any distinguishable manner. For example, each semantic object may be identified by a different color. Alternatively, as shown in FIG. 1 (or FIG. 3 ), each semantic object may be identified by different shading. It can be known that, after being identified, the semantic objects with the same identification are of the same class of semantic objects.

此时，需要说明的是，在不同的实现场景中，与该待预测对象所属类别相同的其他运动对象的标注方式，可以与待预测对象的标注方式相同，也可以不同。如图1所示，当待预测对象为车辆1时，该待预测对象所在区域还包括同一类别的运动对象：车辆2与车辆3。此时，若通过第一轨迹预测模型(第一轨迹预测模型用于预测所述待预测对象的目标轨迹，后续详述实现方式)来实现S208步骤时，如图1所示，需要将车辆1与车辆2、车辆3进行区分标识，车辆1具备一种标识，车辆2与车辆3为另一种标识。或者，若通过第二轨迹预测模型(第二轨迹预测模型用于预测所述区域中全部运动对象的运动轨迹，后续详述实现方式)来实现S208步骤时，则无需对同一类别的运动对象进行区分标识，车辆1、车辆2与车辆3可使用同一种标识方式进行标识(同种标识的方式图1未示出)。At this time, it should be noted that, in different implementation scenarios, the labeling method of other moving objects of the same category as the object to be predicted may be the same as or different from the labeling method of the object to be predicted. As shown in FIG. 1 , when the object to be predicted is vehicle 1 , the area where the object to be predicted is located also includes moving objects of the same category: vehicle 2 and vehicle 3 . At this time, if step S208 is implemented through the first trajectory prediction model (the first trajectory prediction model is used to predict the target trajectory of the object to be predicted, the implementation will be described in detail later), as shown in FIG. Distinguish identification with vehicle 2 and vehicle 3, vehicle 1 has one kind of identification, and vehicle 2 and vehicle 3 are another kind of identification. Or, if step S208 is implemented through a second trajectory prediction model (the second trajectory prediction model is used to predict the motion trajectories of all moving objects in the area, and the implementation will be described in detail later), then there is no need to perform To distinguish identification, vehicle 1 , vehicle 2 and vehicle 3 can be identified using the same identification method (the same identification method is not shown in FIG. 1 ).

以及，在执行语义标注时，还可以进一步自定义划分网格，每个网格可以包括一个或多个像素点，其划分方式可以通过预设的分辨率来实现。例如，可以将图1所示的全局区域图像划分为长度为20cm的正方形网格，如此，在执行后续的语义标注时，仅需要对网格进行标注即可。当网格包含多个像素点时，划分网格的实现方式有利于降低标记量，提高了处理效率。And, when performing semantic annotation, the grid can be further customized, and each grid can include one or more pixels, and the dividing method can be realized by a preset resolution. For example, the global area image shown in FIG. 1 can be divided into square grids with a length of 20 cm. In this way, when performing subsequent semantic annotations, only the grids need to be labeled. When the grid contains multiple pixels, the implementation of the grid is beneficial to reduce the amount of marking and improve the processing efficiency.

除前述实现方式之外，图8所述的S202-14与S202-16步骤还可以通过一个神经网络模型来实现。也就是，在执行S202-14之前，训练语义识别模型，如此，将S202-12步骤获取到的全局区域图像输入该语义识别模型，该语义识别模型的输出即为标注了语义类别的全局区域图像，也就得到了全局语义数据。In addition to the foregoing implementation manner, steps S202-14 and S202-16 described in FIG. 8 can also be implemented by a neural network model. That is, before executing S202-14, the semantic recognition model is trained, so that the global region image obtained in step S202-12 is input into the semantic recognition model, and the output of the semantic recognition model is the global region image marked with the semantic category. , and the global semantic data is obtained.

针对前述涉及到的语义识别模型、像素语义识别模型的类别无特别限定，采用CNN模型或其他神经网络模型均可实现，而二者的样本数据则需要根据模型的输入和输出做不同的标注何设计，不再赘述。There are no special restrictions on the categories of the aforementioned semantic recognition models and pixel semantic recognition models, which can be realized by using the CNN model or other neural network models, and the sample data of the two need to be marked differently according to the input and output of the model. Design, no more details.

另一方面，如图8所示，获取全局轨迹数据的方式可以包括如下步骤：On the other hand, as shown in FIG. 8 , the method of acquiring global trajectory data may include the following steps:

S202-22，获取所述待预测对象所在区域内各运动对象的轨迹点集，所述轨迹点集由所述运动对象的坐标点按照时序顺序集合而成。S202-22: Acquire a track point set of each moving object in the region where the object to be predicted is located, where the track point set is formed by collecting coordinate points of the moving objects in a time sequence order.

该步骤用于获取当前区域中的各运动对象的轨迹点集，其中，每个轨迹点集由该运动对象的多个坐标点构成，为了便于处理，可以将各运动对象的坐标点转换为同一个坐标系下的坐标点。例如，可以将各轨迹点集中的坐标点转换为以图1所示的矩形区域的两条直角边构成的直角坐标系中的坐标点，每个坐标点的表现形式为(X，Y)，而每个运动对象的轨迹点集可表示为{(X_i，Y_i)}，其中i用于表示个坐标点的时序顺序。This step is used to obtain the track point set of each moving object in the current area, wherein each track point set is composed of a plurality of coordinate points of the moving object. In order to facilitate processing, the coordinate points of each moving object can be converted into the same A coordinate point in a coordinate system. For example, the coordinate points in each track point set can be converted into coordinate points in the rectangular coordinate system formed by the two right-angled sides of the rectangular area shown in FIG. 1, and the expression form of each coordinate point is (X, Y), The trajectory point set of each moving object can be represented as {(X _i , Y _i )}, where i is used to represent the time sequence order of the coordinate points.

具体而言，在具体实现时，可通过获取每个运动对象在以当前时刻为终点的时间区间内的坐标点构成前述坐标点集。其中，时间区间的长度可根据需要预设，例如，可以获取当前时刻前3s内各运动对象的轨迹点集。Specifically, during specific implementation, the aforementioned coordinate point set can be formed by acquiring the coordinate points of each moving object within a time interval whose end point is the current moment. The length of the time interval can be preset as required, for example, the track point set of each moving object within 3s before the current moment can be acquired.

需要说明的是，该步骤中，轨迹点集可以是该执行主体主动监测到的，也可以是通过向其他处理器或采集装置请求数据得到的。例如，若执行主体为车辆1的主控制器中的处理器A，则本车辆1的坐标点可以通过自身的定位器，如GPS，采集得到，由本车辆1的定位器将采集到的坐标数据发送给处理器A，由处理器A进行坐标转换，得到本车辆1的轨迹点集；而其他车辆的轨迹点集则可以通过向其他处理器请求的方式得到，例如，若与其他车辆存在通信，可分别向其他车辆获取其轨迹点集；又例如，可以向该区域的路面监视器获取其他车辆的轨迹点集；此外，也可以通过自身采集其他车辆的图像并计算与自身的间距的方式，计算获取到其他车辆的轨迹点集。实现方式可以有多种，不再赘述。It should be noted that, in this step, the trajectory point set may be actively monitored by the executive body, or may be obtained by requesting data from other processors or collection devices. For example, if the execution subject is the processor A in the main controller of the vehicle 1, the coordinate points of the vehicle 1 can be collected by its own locator, such as GPS, and the coordinate data collected by the locator of the vehicle 1 can be obtained. Send it to the processor A, and the processor A will perform coordinate transformation to obtain the trajectory point set of the vehicle 1; and the trajectory point set of other vehicles can be obtained by requesting other processors, for example, if there is communication with other vehicles , the track point set of other vehicles can be obtained from other vehicles; for example, the track point set of other vehicles can be obtained from the road monitor in the area; in addition, the image of other vehicles can also be collected by itself and the distance from itself can be calculated. , calculate and obtain the set of trajectory points of other vehicles. There may be various implementation manners, which will not be repeated here.

此外，该轨迹点集的数据源(或直接采集源)与前述S202-12中全局区域图像的数据源(或直接采集源)可以不同。In addition, the data source (or direct acquisition source) of the track point set may be different from the data source (or direct acquisition source) of the global area image in the foregoing S202-12.

S202-24，对各运动对象的所述轨迹点集进行编码处理，得到各运动对象的轨迹特征。S202-24: Perform coding processing on the trajectory point set of each moving object to obtain the trajectory feature of each moving object.

该步骤也可以通过神经网络模型来实现，将前述S202-22得到的各运动对象的轨迹点集(例如，各运动对象在3s内的轨迹点集)输入编码模型(一种训练好的神经网络模型，例如，可采用如图4所示的LSTM模型)，该编码模型的输出即为各运动对象的轨迹特征(encoder)。This step can also be implemented by a neural network model, inputting the trajectory point set of each moving object obtained in the aforementioned S202-22 (for example, the trajectory point set of each moving object within 3s) into the coding model (a trained neural network The model, for example, the LSTM model shown in FIG. 4 can be used), and the output of the encoding model is the trajectory feature (encoder) of each moving object.

具体的，针对任一运动对象而言，其轨迹特征可以表现为：encoder＝LSTM{(X_i，Y_i)}。其中，轨迹特征(encoder)的长度可以假设为C，C的取值一般为预设经验值。Specifically, for any moving object, its trajectory feature can be expressed as: encoder=LSTM{(X _i , Y _i )}. Among them, the length of the trajectory feature (encoder) can be assumed to be C, and the value of C is generally a preset empirical value.

S202-26，根据各运动对象的所述轨迹特征，构建轨迹张量，以作为所述全局轨迹数据。S202-26, construct a trajectory tensor according to the trajectory feature of each moving object, as the global trajectory data.

基于S202-24步骤中获取到的各运动对象的轨迹特征，该步骤可以构建一个轨迹张量(tensor)，其尺寸为C*H*W，将各运动对象的轨迹特征(encoder)对应存放至该tensor中即可。具体的，针对任一运动对象，将该运动对象的encoder对应存储在该运动对象的中心位置。一种可能的设计中，也可按照图1所示方式，在tensor中划分网格，如此，该运动对象的中心位置位于该Tensor中的哪一个网格，就将该运动对象的encoder对应存储在该网格中即可。Based on the trajectory features of each moving object obtained in step S202-24, this step can construct a trajectory tensor (tensor) whose size is C*H*W, and store the trajectory features (encoder) of each moving object in the corresponding in the tensor. Specifically, for any moving object, the encoder of the moving object is correspondingly stored in the center position of the moving object. In a possible design, the grid can also be divided in the tensor according to the method shown in Figure 1. In this way, which grid in the Tensor the center position of the moving object is located in, the encoder of the moving object is stored correspondingly. in this grid.

通过如图8所示的实现方式，可以实现对全局语义数据与全局轨迹数据的获取。如前所述的实现方式中，全局语义数据可以表现为一个W*H的图像，而全局轨迹数据则表现为一个尺寸为C*H*W的tensor，因此，在执行S204所述的融合步骤时，可将二者融合为一个尺寸为(C+1)*H*W的融合tensor。该融合tensor可具体表示为：tensor((C+1)*H*W)。Through the implementation as shown in FIG. 8 , the acquisition of global semantic data and global trajectory data can be achieved. In the aforementioned implementation manner, the global semantic data can be represented as a W*H image, and the global trajectory data can be represented as a tensor with a size of C*H*W. Therefore, the fusion step described in S204 is performed. , the two can be fused into a fused tensor of size (C+1)*H*W. The fusion tensor can be specifically expressed as: tensor((C+1)*H*W).

基于前述步骤得到的全局融合数据，只需要再对该全局融合数据进行特征提取，即可得到包含待预测对象在内的全局特征。具体实现时，也可以通过神经网络模型实现。也就是，利用训练好的特征提取模型处理所述融合信息，得到所述全局特征。其中，本发明实施例所涉及到的特征提取模型至少包括：如图4所示的卷积神经网络(ConvolutionalNeural Networks，CNN)模型。与前述利用神经网络模型实现数据处理的方式类似，需要在执行该步骤之前，利用特征提取样本对该CNN模型进行训练。模型训练过程不再赘述。Based on the global fusion data obtained in the preceding steps, it is only necessary to perform feature extraction on the global fusion data to obtain global features including the object to be predicted. In specific implementation, it can also be implemented through a neural network model. That is, the fusion information is processed by using the trained feature extraction model to obtain the global feature. The feature extraction model involved in the embodiment of the present invention at least includes: a convolutional neural network (Convolutional Neural Networks, CNN) model as shown in FIG. 4 . Similar to the aforementioned method of implementing data processing using a neural network model, it is necessary to use feature extraction samples to train the CNN model before performing this step. The model training process is not repeated here.

同样的，在执行S208之前，也需要完成针对轨迹预测模型的训练学习。在具体的实现场景中，一般在执行本方案之前即完成轨迹预测模型(以及前述各实现方式中所涉及到的神经网络模型)的训练，以便于实时高效地实现轨迹预测，这种实现方式具备较高的实时性，有利于在实时实现轨迹预测，进而近似实时地实现路径规划或调度。Similarly, before executing S208, the training and learning for the trajectory prediction model also needs to be completed. In a specific implementation scenario, the training of the trajectory prediction model (and the neural network model involved in the foregoing implementations) is generally completed before the implementation of this solution, so as to realize the trajectory prediction efficiently in real time. This implementation has High real-time performance is beneficial to realize trajectory prediction in real time, and then realize path planning or scheduling in approximately real time.

在具体实现轨迹预测模型的训练时，可以训练出根据全局特征预测某一单一运动对象(待预测对象)的第一轨迹预测模型。针对待预测对象而言，这种单一预测方式具备更快的处理效率，有利于实时场景下的路径规划和调度。When specifically implementing the training of the trajectory prediction model, a first trajectory prediction model for predicting a single moving object (object to be predicted) according to the global feature can be trained. For the object to be predicted, this single prediction method has faster processing efficiency, which is beneficial to path planning and scheduling in real-time scenarios.

或者，也可以训练出根据全局特征预测区域内包含的全部运动对象的第二轨迹预测模型。其中，当利用第二轨迹预测模型处理全局特征时，将所述全局特征输入所述第二轨迹预测模型，获取所述第二轨迹预测模型输出的所述区域中全部运动对象的运动轨迹，并将所述全部运动对象中的所述待预测对象的运动轨迹，作为所述目标轨迹即可。这种全局预测方式能够一次性输出区域内全部运动对象的运动轨迹，有利于全局调度的实现，也有利于降低调度或路径规划过程中意外事故的发生概率，提高安全性。Alternatively, it is also possible to train a second trajectory prediction model that predicts all moving objects contained in the region according to the global feature. Wherein, when using the second trajectory prediction model to process the global feature, the global feature is input into the second trajectory prediction model, and the motion trajectories of all moving objects in the region output by the second trajectory prediction model are obtained, and The motion trajectory of the object to be predicted among all the moving objects may be used as the target trajectory. This global prediction method can output the motion trajectories of all moving objects in the area at one time, which is beneficial to the realization of global scheduling, as well as reducing the probability of accidents during the scheduling or path planning process, and improving safety.

此外，还可以训练出根据全局特征预测区域内包含的多个(非全部)运动对象的第三轨迹预测模型，模型训练方式及S208的实现方式同上，不再赘述。In addition, a third trajectory prediction model that predicts multiple (but not all) moving objects included in the region according to the global feature can also be trained. The model training method and the implementation method of S208 are the same as above, and will not be repeated here.

综上，基于所训练出的轨迹预测模型的设计不同，本发明实施例所提供的技术方案不仅能实现针对单一运动对象的轨迹预测，还能够实现多个运动对象的轨迹预测。且如上所述的轨迹预测模型对对象类别无依赖，通过上述训练好的轨迹预测模型，可实现对任意类别的运动对象的轨迹预测，具备更高的灵活性，还可适用于具备多类对象的场景中的轨迹预测。To sum up, based on the different designs of the trained trajectory prediction models, the technical solutions provided by the embodiments of the present invention can not only realize trajectory prediction for a single moving object, but also realize trajectory prediction for multiple moving objects. And the above trajectory prediction model has no dependence on the object category. Through the above trained trajectory prediction model, the trajectory prediction of any category of moving objects can be realized, which has higher flexibility and can also be applied to objects with multiple categories. trajectory prediction in the scene.

此外，在一些特殊的实现场景中，也可以如现有实现方式，分别为各类运动对象分别训练各自对应的轨迹预测模型。也就是，本发明实施例所提供的技术方案也可针对各类别的运动对象实现个性化预测。其中，为各类运动对象分别训练各自的轨迹预测模型时，样本数据为该类别对象的相关数据。In addition, in some special implementation scenarios, it is also possible to train respective trajectory prediction models for various types of moving objects, respectively, as in the existing implementation manner. That is, the technical solutions provided by the embodiments of the present invention can also implement personalized prediction for various types of moving objects. Wherein, when training respective trajectory prediction models for various types of moving objects, the sample data is the relevant data of the objects of this type.

如前所述，通过前述各实现方式得到的待预测对象的目标轨迹之后，即可利用该目标轨迹作进一步处理。As mentioned above, after the target trajectory of the object to be predicted is obtained through the foregoing implementation manners, the target trajectory can be used for further processing.

一种可能的设计中，可以根据所述目标轨迹，为所述待预测对象进行运动规划。也就是，根据该待预测对象的预测轨迹来实现进一步的路径规划。In a possible design, motion planning may be performed for the object to be predicted according to the target trajectory. That is, further path planning is implemented according to the predicted trajectory of the object to be predicted.

另一种可能的设计中，可以根据所述目标轨迹，为其他运动对象进行运动规划。也就是，在为其他某一个或多个运动对象进行路径规划的过程中，可以根据该待预测对象的预测轨迹来规划路线，以避免和该待预测对象发生碰撞或其他安全事故。In another possible design, motion planning may be performed for other moving objects according to the target trajectory. That is, in the process of path planning for one or more other moving objects, a route can be planned according to the predicted trajectory of the object to be predicted, so as to avoid collision or other safety accidents with the object to be predicted.

进而，根据规划好的运动路径，实现运动对象的调度。Furthermore, according to the planned motion path, the scheduling of the moving objects is realized.

可以理解的是，上述实施例中的部分或全部步骤或操作仅是示例，本申请实施例还可以执行其它操作或者各种操作的变形。此外，各个步骤可以按照上述实施例呈现的不同的顺序来执行，并且有可能并非要执行上述实施例中的全部操作。It can be understood that, some or all of the steps or operations in the foregoing embodiments are only examples, and other operations or variations of various operations may also be performed in the embodiments of the present application. Furthermore, the various steps may be performed in a different order presented in the above-described embodiments, and may not perform all operations in the above-described embodiments.

实施例二Embodiment 2

基于上述实施例一所提供的轨迹预测方法，本发明实施例进一步给出实现上述方法实施例中各步骤及方法的装置实施例。Based on the trajectory prediction method provided in the above-mentioned first embodiment, the embodiment of the present invention further provides an embodiment of an apparatus for implementing the steps and methods in the above-mentioned method embodiment.

本发明实施例提供了一种轨迹预测装置，请参考图9，该轨迹预测装置600，包括：An embodiment of the present invention provides a trajectory prediction apparatus, please refer to FIG. 9 , the trajectory prediction apparatus 600 includes:

获取模块61，用于获取待预测对象所在区域的全局语义数据与全局轨迹数据；an acquisition module 61, configured to acquire global semantic data and global trajectory data of the region where the object to be predicted is located;

融合模块62，用于融合所述全局语义数据与所述全局轨迹数据，得到全局融合数据；A fusion module 62, configured to fuse the global semantic data and the global trajectory data to obtain global fusion data;

特征提取模块63，用于提取所述全局融合数据中的特征，得到全局特征；Feature extraction module 63, for extracting features in the global fusion data to obtain global features;

预测模块64，用于利用训练好的轨迹预测模型处理所述全局特征，得到所述待预测对象的目标轨迹。The prediction module 64 is configured to process the global feature by using the trained trajectory prediction model to obtain the target trajectory of the object to be predicted.

一种可能的设计中，所述获取模块61，具体用于：In a possible design, the acquisition module 61 is specifically used for:

获取所述待预测对象所在区域的全局区域图像；obtaining a global area image of the area where the object to be predicted is located;

对所述全局区域图像中的各像素分别进行语义识别，得到各像素的语义类别；Semantic recognition is performed on each pixel in the global area image, respectively, to obtain the semantic category of each pixel;

根据各像素的语义类别，对所述全局区域图像进行语义标注，得到所述全局语义信息。According to the semantic category of each pixel, semantic annotation is performed on the global area image to obtain the global semantic information.

其中，所述获取模块61，还进一步具体用于：Wherein, the acquisition module 61 is further specifically used for:

根据各像素的语义类别，对所述全局区域图像进行语义分割，得到多个语义对象；According to the semantic category of each pixel, semantic segmentation is performed on the global area image to obtain a plurality of semantic objects;

对各语义对象分别进行语义标注，得到所述全局语义信息。Semantic annotation is performed on each semantic object to obtain the global semantic information.

本发明实施例所涉及到的所述全局区域图像为数字正射影像DOM图像。The global area image involved in the embodiment of the present invention is a digital orthophoto DOM image.

另一种可能的设计中，所述获取模块61，具体用于：In another possible design, the obtaining module 61 is specifically used for:

获取所述待预测对象所在区域内各运动对象的轨迹点集，所述轨迹点集由所述运动对象的坐标点按照时序顺序集合而成；acquiring the trajectory point set of each moving object in the area where the object to be predicted is located, and the trajectory point set is formed by the coordinate points of the moving object assembled in time sequence;

对各运动对象的所述轨迹点集进行编码处理，得到各运动对象的轨迹特征；Encoding the set of trajectory points of each moving object to obtain the trajectory features of each moving object;

根据各运动对象的所述轨迹特征，构建轨迹张量，以作为所述全局轨迹数据。According to the trajectory features of each moving object, a trajectory tensor is constructed to serve as the global trajectory data.

一种可能的设计中，所述融合模块63，具体用于：In a possible design, the fusion module 63 is specifically used for:

利用训练好的特征提取模型处理所述融合信息，得到所述全局特征。The fusion information is processed by using the trained feature extraction model to obtain the global feature.

其中，本发明实施例所涉及到的所述特征提取模型至少包括：卷积神经网络CNN模型。Wherein, the feature extraction model involved in the embodiment of the present invention includes at least: a convolutional neural network CNN model.

本发明实施例所涉及到的所述轨迹预测模型包括如下至少一种：长短期记忆网络LSTM模型、多层感知器MLP；The trajectory prediction model involved in the embodiment of the present invention includes at least one of the following: a long short-term memory network LSTM model, and a multi-layer perceptron MLP;

其中，所述多层感知器包括：循环神经网络RNN模型、门控循环单元GRU模型。Wherein, the multilayer perceptron includes: a recurrent neural network RNN model and a gated recurrent unit GRU model.

一种可能的设计中，所述轨迹预测模型为第一轨迹预测模型，所述第一轨迹预测模型用于预测所述待预测对象的目标轨迹。In a possible design, the trajectory prediction model is a first trajectory prediction model, and the first trajectory prediction model is used to predict the target trajectory of the object to be predicted.

另一种可能的设计中，所述轨迹预测模型为第二轨迹预测模型，所述第二轨迹预测模型用于预测所述区域中全部运动对象的运动轨迹；此时，预测模块64，具体用于：In another possible design, the trajectory prediction model is a second trajectory prediction model, and the second trajectory prediction model is used to predict the motion trajectories of all moving objects in the area; in this case, the prediction module 64 specifically uses At:

将所述全局特征输入所述第二轨迹预测模型，获取所述第二轨迹预测模型输出的所述区域中全部运动对象的运动轨迹，并将所述全部运动对象中的所述待预测对象的运动轨迹，作为所述目标轨迹。Input the global feature into the second trajectory prediction model, obtain the motion trajectories of all moving objects in the region output by the second trajectory prediction model, and use the The motion trajectory is used as the target trajectory.

本发明实施例中，所述待预测对象包括如下至少一种：车辆、动物、人、机器人与无人飞行器。In the embodiment of the present invention, the object to be predicted includes at least one of the following: a vehicle, an animal, a person, a robot, and an unmanned aerial vehicle.

此外，一种或可能的设计中，轨迹预测装置600还可以包括：In addition, in one or a possible design, the trajectory prediction apparatus 600 may further include:

规划模块(图9未示出)，用于根据所述目标轨迹，为所述待预测对象进行运动规划。A planning module (not shown in FIG. 9 ), configured to perform motion planning for the object to be predicted according to the target trajectory.

图9所示实施例的轨迹预测装置600可用于执行上述方法实施例的技术方案，其实现原理和技术效果可以进一步参考方法实施例中的相关描述，可选的，该轨迹预测装置600可以是终端设备或后台服务器等。The trajectory prediction apparatus 600 in the embodiment shown in FIG. 9 can be used to implement the technical solutions of the above method embodiments. For the implementation principle and technical effect, reference may be made to the relevant descriptions in the method embodiments. Optionally, the trajectory prediction apparatus 600 may be Terminal devices or background servers, etc.

应理解以上图9所示轨迹预测装置600的各个模块的划分仅仅是一种逻辑功能的划分，实际实现时可以全部或部分集成到一个物理实体上，也可以物理上分开。且这些模块可以全部以软件通过处理元件调用的形式实现；也可以全部以硬件的形式实现；还可以部分模块以软件通过处理元件调用的形式实现，部分模块通过硬件的形式实现。例如，预测模块64可以为单独设立的处理元件，也可以集成在轨迹预测装置600中，例如终端的某一个芯片中实现，此外，也可以以程序的形式存储于轨迹预测装置600的存储器中，由轨迹预测装置600的某一个处理元件调用并执行以上各个模块的功能。其它模块的实现与之类似。此外这些模块全部或部分可以集成在一起，也可以独立实现。这里所述的处理元件可以是一种集成电路，具有信号的处理能力。在实现过程中，上述方法的各步骤或以上各个模块可以通过处理器元件中的硬件的集成逻辑电路或者软件形式的指令完成。It should be understood that the division of each module of the trajectory prediction apparatus 600 shown in FIG. 9 above is only a division of logical functions, and may be fully or partially integrated into a physical entity in actual implementation, or may be physically separated. And these modules can all be implemented in the form of software calling through processing elements; they can also all be implemented in hardware; some modules can also be implemented in the form of software calling through processing elements, and some modules can be implemented in hardware. For example, the prediction module 64 can be a separately established processing element, or can be integrated in the trajectory prediction device 600, such as a certain chip of the terminal, and can also be stored in the memory of the trajectory prediction device 600 in the form of a program, The functions of the above modules are called and executed by a certain processing element of the trajectory prediction apparatus 600 . The implementation of other modules is similar. In addition, all or part of these modules can be integrated together, and can also be implemented independently. The processing element described here may be an integrated circuit with signal processing capability. In the implementation process, each step of the above-mentioned method or each of the above-mentioned modules can be completed by an integrated logic circuit of hardware in the processor element or an instruction in the form of software.

例如，以上这些模块可以是被配置成实施以上方法的一个或多个集成电路，例如：一个或多个特定集成电路(Application Specific Integrated Circuit，ASIC)，或，一个或多个微处理器(digital singnal processor，DSP)，或，一个或者多个现场可编程门阵列(Field Programmable Gate Array，FPGA)等。再如，当以上某个模块通过处理元件调度程序的形式实现时，该处理元件可以是通用处理器，例如中央处理器(Central ProcessingUnit，CPU)或其它可以调用程序的处理器。再如，这些模块可以集成在一起，以片上系统(system-on-a-chip，SOC)的形式实现。For example, the above modules may be one or more integrated circuits configured to implement the above methods, such as: one or more specific integrated circuits (Application Specific Integrated Circuit, ASIC), or one or more microprocessors (digital) singnal processor, DSP), or, one or more Field Programmable Gate Arrays (Field Programmable Gate Array, FPGA), etc. For another example, when one of the above modules is implemented in the form of a processing element scheduler, the processing element may be a general-purpose processor, such as a central processing unit (Central Processing Unit, CPU) or other processors that can invoke programs. For another example, these modules can be integrated together and implemented in the form of a system-on-a-chip (SOC).

并且，本发明实施例提供了一种轨迹预测装置，请参考图10，该轨迹预测装置600，包括：Moreover, an embodiment of the present invention provides a trajectory prediction apparatus, please refer to FIG. 10 , the trajectory prediction apparatus 600 includes:

存储器610；memory 610;

处理器620；以及processor 620; and

计算机程序；Computer program;

其中，计算机程序存储在存储器610中，并被配置为由处理器620执行以实现如上述实施例所述的方法。The computer program is stored in the memory 610 and is configured to be executed by the processor 620 to implement the methods described in the above embodiments.

其中，轨迹预测装置600中处理器620的数目可以为一个或多个，处理器620也可以称为处理单元，可以实现一定的控制功能。所述处理器620可以是通用处理器或者专用处理器等。在一种可选地设计中，处理器620也可以存有指令，所述指令可以被所述处理器620运行，使得所述轨迹预测装置600执行上述方法实施例中描述的轨迹预测方法。The number of processors 620 in the trajectory prediction apparatus 600 may be one or more, and the processors 620 may also be referred to as processing units, which may implement certain control functions. The processor 620 may be a general-purpose processor or a special-purpose processor, or the like. In an optional design, the processor 620 may also store instructions, and the instructions may be executed by the processor 620, so that the trajectory prediction apparatus 600 executes the trajectory prediction method described in the above method embodiments.

在又一种可能的设计中，轨迹预测装置600可以包括电路，所述电路可以实现前述方法实施例中发送或接收或者通信的功能。In yet another possible design, the trajectory prediction apparatus 600 may include a circuit, and the circuit may implement the function of sending or receiving or communicating in the foregoing method embodiments.

可选地，所述轨迹预测装置600中存储器610的数目可以为一个或多个，存储器610上存有指令或者中间数据，所述指令可在所述处理器620上被运行，使得所述轨迹预测装置600执行上述方法实施例中描述的方法。可选地，所述存储器610中还可以存储有其他相关数据。可选地处理器620中也可以存储指令和/或数据。所述处理器620和存储器610可以单独设置，也可以集成在一起。Optionally, the number of memories 610 in the trajectory prediction apparatus 600 may be one or more, and instructions or intermediate data are stored in the memory 610, and the instructions can be executed on the processor 620 to make the trajectory The prediction apparatus 600 executes the methods described in the above method embodiments. Optionally, other related data may also be stored in the memory 610 . Optionally, processor 620 may also store instructions and/or data. The processor 620 and the memory 610 may be provided separately or integrated together.

此外，如图10所示，在该轨迹预测装置600中还设置有收发器630，其中，所述收发器630可以称为收发单元、收发机、收发电路、或者收发器等，用于与测试设备或其他终端设备进行数据传输或通信，在此不再赘述。In addition, as shown in FIG. 10 , a transceiver 630 is also provided in the trajectory prediction apparatus 600, wherein the transceiver 630 may be called a transceiver unit, a transceiver, a transceiver circuit, or a transceiver, etc., and is used for testing and testing. The device or other terminal device performs data transmission or communication, which is not repeated here.

如图10所示，存储器610、处理器620与收发器630通过总线连接并通信。As shown in FIG. 10 , the memory 610 , the processor 620 and the transceiver 630 are connected and communicated through a bus.

若该轨迹预测装置600用于实现对应于图2中的方法时，例如，可以由收发器630获取全局语义数据与全局轨迹数据。而处理器620用于完成相应的确定或者控制操作，可选的，还可以在存储器610中存储相应的指令。各个部件的具体的处理方式可以参考前述实施例的相关描述。If the trajectory prediction apparatus 600 is used to implement the method corresponding to FIG. 2 , for example, the transceiver 630 can acquire global semantic data and global trajectory data. The processor 620 is used to complete corresponding determination or control operations, and optionally, corresponding instructions may also be stored in the memory 610 . For the specific processing manner of each component, reference may be made to the relevant descriptions of the foregoing embodiments.

此外，本发明实施例提供了一种可读存储介质，其上存储有计算机程序，该计算机程序被处理器执行以实现如实施例一所述的方法。In addition, an embodiment of the present invention provides a readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement the method described in the first embodiment.

此外，本发明实施例提供了一种驾驶系统，请参考图11，该驾驶系统800包括：In addition, an embodiment of the present invention provides a driving system, please refer to FIG. 11 , the driving system 800 includes:

轨迹预测装置600，用于执行如实施例一中任一实现方式所述的方法；A trajectory prediction apparatus 600, configured to execute the method described in any implementation manner of Embodiment 1;

运动控制器810，用于根据所述轨迹预测装置获取到的目标轨迹控制被控制对象运动。The motion controller 810 is configured to control the movement of the controlled object according to the target trajectory obtained by the trajectory prediction device.

其中，被控制对象与待预测对象为同一对象。例如，一种可能的设计中，车辆对自身的轨迹预测及路线规划场景中，运动控制器可根据轨迹预测装置600预测出的自身的目标轨迹，来实现对自身行驶路线的规划；以及，进一步地，可实现对被控制对象的自动运动，也就是，实现自动驾驶。The controlled object and the to-be-predicted object are the same object. For example, in a possible design, in the scenario of the vehicle's own trajectory prediction and route planning, the motion controller can realize the planning of its own driving route according to its own target trajectory predicted by the trajectory prediction device 600; and, further Therefore, the automatic movement of the controlled object can be realized, that is, automatic driving can be realized.

此外，被控制对象与待预测对象可以为不同对象。以前述场景为例，车辆可以对路面上与自身较为接近的其他车辆进行轨迹预测，以便于在执行自身的运动控制时，能够尽量避免与其他车辆或运动障碍物(车辆、人、动物等)的碰撞，以降低安全事故的发生概率，有利于提高安全性。In addition, the controlled object and the to-be-predicted object may be different objects. Taking the aforementioned scenario as an example, the vehicle can predict the trajectory of other vehicles on the road that are closer to itself, so that when executing its own motion control, it can try to avoid contact with other vehicles or moving obstacles (vehicles, people, animals, etc.) In order to reduce the probability of safety accidents, it is beneficial to improve safety.

在另一具体的实现场景中，运动控制器810亦可将轨迹预测装置600获取到的目标轨迹进行输出，以便于用户在驾驶或控制被控制对象运动时，可以将该目标轨迹作为参考。尤其是在多个对象的场景中，通过对其他多个待预测对象进行目标轨迹的预测，更有利于提高多对象场景中的控制安全。In another specific implementation scenario, the motion controller 810 can also output the target trajectory obtained by the trajectory prediction device 600, so that the user can use the target trajectory as a reference when driving or controlling the movement of the controlled object. Especially in the scene of multiple objects, it is more beneficial to improve the control security in the multi-object scene by predicting the target trajectory of other objects to be predicted.

具体而言，被控制对象可以包括但不限于如下至少一种：车辆、动物、人、机器人与无人飞行器。此外，本发明实施例中对被控制对象的数目无特别限定，可以为一个或多个。例如，运动控制器可以为车辆的行驶控制器，也可以为无人飞行器的飞行控制器等，不再赘述。Specifically, the controlled objects may include, but are not limited to, at least one of the following: vehicles, animals, people, robots, and unmanned aerial vehicles. In addition, in this embodiment of the present invention, the number of controlled objects is not particularly limited, and may be one or more. For example, the motion controller may be a driving controller of a vehicle, or a flight controller of an unmanned aerial vehicle, etc., which will not be described again.

此外，本发明实施例提供了一种车辆。In addition, embodiments of the present invention provide a vehicle.

请参考图12，该车辆900包括：Referring to Figure 12, the vehicle 900 includes:

轨迹预测装置600，用于执行如实施例一中任一实现方式所述的方法。The trajectory prediction apparatus 600 is configured to execute the method described in any one of the implementation manners of the first embodiment.

或者，如图13所示，该车辆900包括：Alternatively, as shown in FIG. 13, the vehicle 900 includes:

如图11所示的驾驶系统800。Driving system 800 as shown in FIG. 11 .

此外，本发明实施例还提供了一种无人飞行器的控制装置。In addition, an embodiment of the present invention also provides a control device for an unmanned aerial vehicle.

一种可能的设计中，该无人飞行器的控制装置，包括：In a possible design, the control device of the unmanned aerial vehicle includes:

另一设计中，该无人飞行器的控制装置，包括：In another design, the control device of the unmanned aerial vehicle includes:

驾驶系统800。Driving system 800 .

具体的，无人飞行器与无人飞行器的控制装置可以独立设计，也可以组合设计(该控制装置设置于无人飞行器内部)，本发明实施例对此无特别限定。Specifically, the unmanned aerial vehicle and the control device of the unmanned aerial vehicle can be designed independently, or can be designed in combination (the control device is arranged inside the unmanned aerial vehicle), which is not particularly limited in the embodiment of the present invention.

可知，车辆与无人飞行器的控制装置为可承载前述轨迹预测装置的被控制对象，如前说书，除此之外，还进一步包括机器人或机器玩具等，不再赘述。It can be seen that the control device of the vehicle and the unmanned aerial vehicle is the controlled object that can carry the aforementioned trajectory prediction device. As mentioned in the previous story, in addition to this, it further includes robots or machine toys, etc., which will not be repeated.

本领域普通技术人员可以理解：实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成，前述的程序可以存储于一计算机可读取存储介质中，该程序在执行时，执行包括上述方法实施例的步骤；而前述的存储介质包括：ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps of implementing the above method embodiments can be completed by program instructions related to hardware, the aforementioned program can be stored in a computer-readable storage medium, and when the program is executed, execute It includes the steps of the above method embodiments; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other media that can store program codes.

由于本实施例中的各模块能够执行实施例一所示的方法，本实施例未详细描述的部分，可参考对实施例一的相关说明。Since each module in this embodiment can execute the method shown in Embodiment 1, for parts that are not described in detail in this embodiment, reference may be made to the relevant description of Embodiment 1.

最后应说明的是：以上各实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述各实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分或者全部技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The technical solutions described in the foregoing embodiments can still be modified, or some or all of the technical features thereof can be equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the embodiments of the present invention. scope.

Claims

1. A trajectory prediction method, comprising:

acquiring global semantic data and global track data of an area where an object to be predicted is located;

fusing the global semantic data and the global track data to obtain global fusion data;

extracting features in the global fusion data to obtain global features;

and processing the global characteristics by using a trained track prediction model to obtain the target track of the object to be predicted.

2. The method according to claim 1, wherein the obtaining global semantic data of an area where the object to be predicted is located comprises:

acquiring a global area image of an area where the object to be predicted is located;

performing semantic recognition on each pixel in the global area image to obtain the semantic category of each pixel;

and according to the semantic category of each pixel, performing semantic annotation on the global area image to obtain the global semantic information.

3. The method according to claim 2, wherein the semantically labeling the global region image according to the semantic category of each pixel to obtain the global semantic information comprises:

according to the semantic category of each pixel, performing semantic segmentation on the global area image to obtain a plurality of semantic objects;

and performing semantic annotation on each semantic object to obtain the global semantic information.

4. The method according to claim 2 or 3, wherein the global area image is a Digital Orthophoto (DOM) image.

5. The method according to claim 1, wherein the obtaining global trajectory data of an area where the object to be predicted is located comprises:

acquiring a track point set of each moving object in the area of the object to be predicted, wherein the track point set is formed by collecting coordinate points of the moving objects according to a time sequence;

coding the track point set of each moving object to obtain the track characteristics of each moving object;

and constructing a track tensor according to the track characteristics of each moving object to serve as the global track data.

6. The method according to any one of claims 1-3 and 5, wherein the extracting features in the global fusion data to obtain global features comprises:

and processing the fusion information by using a trained feature extraction model to obtain the global features.

7. The method of claim 6, wherein the feature extraction model comprises at least: convolutional neural network CNN model.

8. The method of claim 1, wherein the trajectory prediction model comprises at least one of: a long and short term memory network LSTM model and a multilayer perceptron MLP;

wherein the multilayer perceptron comprises: a Recurrent Neural Network (RNN) model and a Gated Recurrent Unit (GRU) model.

9. The method according to claim 1 or 8, wherein the trajectory prediction model is a first trajectory prediction model for predicting a target trajectory of the object to be predicted.

10. The method according to claim 1 or 8, wherein the trajectory prediction model is a second trajectory prediction model for predicting the motion trajectories of all moving objects in the area;

the processing the global feature by using the trained track prediction model to obtain the target track of the object to be predicted comprises the following steps:

inputting the global features into the second track prediction model, obtaining the motion tracks of all the moving objects in the area output by the second track prediction model, and taking the motion tracks of the objects to be predicted in all the moving objects as the target track.

11. The method of claim 1, wherein the object to be predicted comprises at least one of: vehicles, animals, humans, robots and unmanned aerial vehicles.

12. The method of claim 1, further comprising:

and performing motion planning on the object to be predicted according to the target track.

13. A trajectory prediction device, comprising:

the acquisition module is used for acquiring global semantic data and global track data of an area where an object to be predicted is located;

the fusion module is used for fusing the global semantic data and the global track data to obtain global fusion data;

the feature extraction module is used for extracting features in the global fusion data to obtain global features;

and the prediction module is used for processing the global characteristics by utilizing a trained track prediction model to obtain the target track of the object to be predicted.

14. A trajectory prediction device, comprising:

a memory;

a processor; and

a computer program;

wherein the computer program is stored in the memory and configured to be executed by the processor to implement the method of any one of claims 1 to 12.

15. A computer-readable storage medium, having stored thereon a computer program,

the computer program is executed by a processor to implement the method of any one of claims 1 to 12.

16. A driving system, comprising:

trajectory prediction means for performing the method of any one of claims 1 to 12;

and the motion controller is used for controlling the controlled object to move according to the target track acquired by the track prediction device.

17. A vehicle, characterized by comprising:

the driving system of claim 16.