[go: up one dir, main page]

CN114943305A - Data processing method, data processing device, storage medium and server - Google Patents

Data processing method, data processing device, storage medium and server Download PDF

Info

Publication number
CN114943305A
CN114943305A CN202210725134.4A CN202210725134A CN114943305A CN 114943305 A CN114943305 A CN 114943305A CN 202210725134 A CN202210725134 A CN 202210725134A CN 114943305 A CN114943305 A CN 114943305A
Authority
CN
China
Prior art keywords
point
data
track
order
driver
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210725134.4A
Other languages
Chinese (zh)
Inventor
陈冲冲
石立臣
强成仓
廖泽平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yishi Huolala Technology Co Ltd
Original Assignee
Shenzhen Yishi Huolala Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Yishi Huolala Technology Co Ltd filed Critical Shenzhen Yishi Huolala Technology Co Ltd
Priority to CN202210725134.4A priority Critical patent/CN114943305A/en
Publication of CN114943305A publication Critical patent/CN114943305A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Economics (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本申请实施例公开了一种数据处理方法、装置、存储介质及服务器。该方法包括:基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;对多个候选轨迹点进行聚类,得到多个聚类簇;基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标起点或目标终点。本方案使用聚类算法和策略识别实际装卸货点,可以提升货运场景下真实位置点挖掘的准确性。

Figure 202210725134

The embodiments of the present application disclose a data processing method, device, storage medium and server. The method includes: based on order data and driver operation data, constraining the time and position of track points in the driving track data, and determining multiple candidate track points from the driving track data; clustering the multiple candidate track points to obtain multiple track points. Based on the order data and driver operation data, the positions of the cluster center points of the multiple clusters are constrained, and the candidate clusters are determined from the multiple clusters; The duration of stay and the position of each trajectory point in the candidate cluster are constrained, and the target start point or target end point of the order is determined from the trajectory points in the candidate cluster. This solution uses clustering algorithms and strategies to identify the actual loading and unloading points, which can improve the accuracy of mining real location points in freight scenarios.

Figure 202210725134

Description

数据处理方法、装置、存储介质及服务器Data processing method, device, storage medium and server

技术领域technical field

本申请涉电子计算机技术领域,尤其涉及一种数据处理方法、装置、存储介质及服务器。The present application relates to the technical field of electronic computers, and in particular, to a data processing method, device, storage medium and server.

背景技术Background technique

现有的轨迹挖掘方案,大多是关于停留点的识别算法研究、客运行业的出租车上下客地点研究以及长途货车装卸货点研究。而关于同城货运场景货车装卸货点的识别算法研究,目前还处于空缺状态,然而同城货运行业对装卸货点识别的需求又越来越紧迫。Most of the existing trajectory mining schemes are about the identification algorithm research of stop points, the research of taxi pick-up and drop-off points in the passenger transportation industry, and the research of long-distance truck loading and unloading points. The research on the identification algorithm of truck loading and unloading points in the same-city freight scene is still vacant. However, the need for the identification of loading and unloading points in the same-city freight industry is becoming more and more urgent.

对于三方平台而言,同城货运具有代表性的运输轨迹是司机在平台确认接单,与客户电话确认订单后按约定时间前往订单起点,先在订单起点附近停车一段时间在平台确认到达并与客户电话确认实际装货点,然后前往实际装货点并停留一段时间装货,接着前往订单终点,同样在到达目的地后会停留一段时间在平台确认到达并与客户电话确认实际卸货点,然后前往实际卸货点并停留一段时间卸货。卸货完成后则会前往下一个订单的起点准备开始下一轮运输行程。可知,提高挖掘点为真实装卸点的占比,对提高碰面效率是有重要意义的。For the tripartite platform, the representative transportation trajectory of intra-city freight is that the driver confirms the order on the platform, confirms the order with the customer by phone, and then goes to the starting point of the order according to the agreed time. Confirm the actual loading point by phone, then go to the actual loading point and stop for a period of time to load, then go to the end of the order, and also stay for a period of time after reaching the destination, confirm the arrival on the platform and confirm the actual unloading point with the customer by phone, then go to Actual unloading point and stop for a period of time to unload. After the unloading is completed, it will go to the starting point of the next order to prepare for the next round of transportation. It can be seen that increasing the proportion of excavation points as real loading and unloading points is of great significance to improve the efficiency of encounters.

目前三方平台用户在平台下单,部分订单在司机接单后根据用户下单时填写起点规划路线,并在到达起点前需要电话沟通用户进行人工导航,司机与用户货物碰面沟通成本大。在司机轨迹数据分析中发现,一是司机轨迹来自司机定位,大约5%的轨迹缺失率;二是司机确认装货点为真实装货点的比例为51%,司机确认卸货点为真实卸货点的比例为49%;三是用户下单起点为真实装货点的比例为13%,用户下单终点为真实装货点的比例为7%。可知,现有方案订单信息中装卸货位置信息的准确性较差。At present, users of the tripartite platform place orders on the platform. For some orders, after the driver accepts the order, the route is planned according to the starting point when the user places the order, and before reaching the starting point, it is necessary to communicate with the user by telephone for manual navigation. The communication cost between the driver and the user's goods is high. In the analysis of driver trajectory data, it is found that firstly, the driver's trajectory comes from the driver's positioning, and about 5% of the trajectory is missing; secondly, the proportion of drivers who confirm that the loading point is the real loading point is 51%, and the driver confirms that the unloading point is the real unloading point. The third is that the proportion of users placing an order at the real loading point is 13%, and the end point of the user's order is the real loading point is 7%. It can be seen that the accuracy of the loading and unloading location information in the order information of the existing solution is poor.

发明内容SUMMARY OF THE INVENTION

本申请实施例提供一种数据处理方法、装置、存储介质及服务器,可提高真实位置点挖掘的准确性。Embodiments of the present application provide a data processing method, device, storage medium, and server, which can improve the accuracy of real location point mining.

第一方面,本申请实施例提供一种数据处理方法,应用于货运场景下,包括:In a first aspect, an embodiment of the present application provides a data processing method, which is applied in a freight shipping scenario, including:

获取订单数据和对应的行驶轨迹数据;Obtain order data and corresponding driving trajectory data;

基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;Based on the order data and the driver's operation data, constrain the time and position of the trajectory points in the driving trajectory data, and determine a plurality of candidate trajectory points from the driving trajectory data;

对多个候选轨迹点进行聚类,得到多个聚类簇;Clustering multiple candidate trajectory points to obtain multiple clusters;

基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;Based on the order data and the driver operation data, constrain the positions of the cluster center points of the plurality of clusters, and determine candidate clusters from the plurality of clusters;

对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标起点。Constraining the duration of the driver's stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determining the target starting point of the order from the trajectory points in the candidate cluster.

第二方面,本申请实施例提供了另一种数据处理方法,应用于货运场景下,包括:In the second aspect, the embodiment of the present application provides another data processing method, which is applied in a freight transport scenario, including:

获取订单数据和对应的行驶轨迹数据;Obtain order data and corresponding driving trajectory data;

基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;Based on the order data and the driver's operation data, constrain the time and position of the trajectory points in the driving trajectory data, and determine a plurality of candidate trajectory points from the driving trajectory data;

对多个候选轨迹点进行聚类,得到多个聚类簇;Clustering multiple candidate trajectory points to obtain multiple clusters;

基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;Based on the order data and the driver operation data, constrain the positions of the cluster center points of the plurality of clusters, and determine candidate clusters from the plurality of clusters;

对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标终点。Constraining the duration of the driver's stay on each track point in the candidate cluster and the position of each track point in the candidate cluster, and determining the target end point of the order from the track points in the candidate cluster.

第三方面,本申请实施例提供了另一种数据处理装置,应用于货运场景下,包括:In a third aspect, the embodiments of the present application provide another data processing device, which is applied in a freight transport scenario, including:

第一获取单元,用于获取订单数据和对应的行驶轨迹数据;a first acquiring unit, used for acquiring order data and corresponding driving track data;

第一确定单元,用于基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;a first determining unit, configured to constrain the time and position of the trajectory points in the driving trajectory data based on the order data and driver operation data, and determine a plurality of candidate trajectory points from the driving trajectory data;

第一聚类单元,用于对多个候选轨迹点进行聚类,得到多个聚类簇;The first clustering unit is used for clustering multiple candidate trajectory points to obtain multiple cluster clusters;

第二确定单元,用于基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;a second determining unit, configured to constrain the positions of the cluster center points of the multiple clusters based on the order data and the driver operation data, and determine candidate clusters from the multiple clusters;

第三确定单元,用于对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标起点。The third determining unit is configured to constrain the length of stay of the driver on each track point in the candidate cluster and the position of each track point in the candidate cluster, and determine the order from the track points in the candidate cluster target starting point.

第四方面,本申请实施例提供了另一种数据处理装置,应用于货运场景下,包括:In a fourth aspect, the embodiments of the present application provide another data processing device, which is applied in a freight transport scenario, including:

第二获取单元,用于获取订单数据和对应的行驶轨迹数据;a second acquiring unit, configured to acquire order data and corresponding driving track data;

第四确定单元,用于基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;a fourth determining unit, configured to constrain the time and position of the trajectory points in the driving trajectory data based on the order data and the driver operation data, and determine a plurality of candidate trajectory points from the driving trajectory data;

第二聚类单元,用于对多个候选轨迹点进行聚类,得到多个聚类簇;The second clustering unit is used for clustering multiple candidate trajectory points to obtain multiple cluster clusters;

第五确定单元,用于基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;a fifth determining unit, configured to constrain the positions of the cluster center points of the multiple clusters based on the order data and the driver operation data, and determine candidate clusters from the multiple clusters;

第六确定单元,用于对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标终点。The sixth determination unit is used to constrain the length of stay of the driver on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determine the order from the trajectory points in the candidate cluster target end point.

第五方面,本申请实施例还提供了一种计算机可读存储介质,所述存储介质中存储有多条指令,所述指令适于由处理器加载以执行上述的数据处理方法。In a fifth aspect, an embodiment of the present application further provides a computer-readable storage medium, where a plurality of instructions are stored in the storage medium, and the instructions are suitable for being loaded by a processor to execute the foregoing data processing method.

第六方面,本申请实施例还提供了一种服务器,包括处理器及存储器,所述处理器与所述存储器电性连接,所述存储器用于存储指令和数据,处理器用于执行上述的数据处理方法。In a sixth aspect, an embodiment of the present application further provides a server, including a processor and a memory, the processor is electrically connected to the memory, the memory is used to store instructions and data, and the processor is used to execute the above data Approach.

本申请实施例,基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;对多个候选轨迹点进行聚类,得到多个聚类簇;基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标起点或目标终点。本方案使用聚类算法和策略识别实际装卸货点,可以提升货运场景下真实位置点挖掘的准确性。In this embodiment of the present application, based on the order data and driver operation data, the time and position of the trajectory points in the driving trajectory data are constrained, and multiple candidate trajectory points are determined from the driving trajectory data; the multiple candidate trajectory points are clustered to obtain Multiple clusters; based on order data and driver operation data, the positions of the cluster center points of multiple clusters are constrained, and candidate clusters are determined from multiple clusters; The duration of stay on the candidate cluster and the position of each trajectory point in the candidate cluster are constrained, and the target start point or target end point of the order is determined from the trajectory points in the candidate cluster. This solution uses clustering algorithms and strategies to identify the actual loading and unloading points, which can improve the accuracy of mining real location points in freight scenarios.

附图说明Description of drawings

为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those skilled in the art, other drawings can also be obtained from these drawings without creative effort.

图1是本申请实施例提供的数据处理方法的流程示意图。FIG. 1 is a schematic flowchart of a data processing method provided by an embodiment of the present application.

图2是本申请实施例提供的装卸货点挖掘系统的架构示意图。FIG. 2 is a schematic structural diagram of a loading and unloading point excavation system provided by an embodiment of the present application.

图3是本申请实施例提供的数据处理方法的另一流程示意图。FIG. 3 is another schematic flowchart of a data processing method provided by an embodiment of the present application.

图4是本申请实施例提供的数据处理装置的一结构示意图。FIG. 4 is a schematic structural diagram of a data processing apparatus provided by an embodiment of the present application.

图5是本申请实施例提供的数据处理装置的一结构示意图。FIG. 5 is a schematic structural diagram of a data processing apparatus provided by an embodiment of the present application.

图6是本申请实施例提供的电子设备的一结构示意图。FIG. 6 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.

图7是本申请实施例提供的电子设备的另一结构示意图。FIG. 7 is another schematic structural diagram of an electronic device provided by an embodiment of the present application.

具体实施方式Detailed ways

下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those skilled in the art without creative work fall within the protection scope of the present application.

目前业界货运场景下装卸货点识别基本都是基于货车监控数据分析车速、载重、车向变化,从而识别装卸货点,进而识别司机特定行为,为驾驶员推送配货信息,实现智能化配货。然而在三方货运平台发单进行同城货运服务场景下,一般可以获取的数据有司机定位信息、司机操作信息、其他订单信息。其中,司机操作确认的装货点为真实装货点的比例为51%,而这一比例在卸货点只有49%,用户下单起点为真实装货点的比例为13%,用户下单终点为真实装货点的比例为7%。因此,基于可获取数据挖掘真实装卸货点需要新方法。同时,挖掘点是真实装卸货点的占比越高,对用户推荐准确装卸货点的比例就越高,这对给发单用户提供装卸货点推荐服务、提高车货碰面效率、改善司乘体验有重要意义。At present, the identification of loading and unloading points in the freight scene of the industry is basically based on the monitoring data of trucks to analyze changes in vehicle speed, load, and vehicle direction, so as to identify the loading and unloading points, and then identify the specific behavior of drivers, push delivery information for drivers, and realize intelligent delivery. . However, in the scenario where the tripartite freight platform issues orders for intra-city freight services, the data that can generally be obtained include driver positioning information, driver operation information, and other order information. Among them, the proportion of the loading point confirmed by the driver's operation is 51%, while this proportion is only 49% at the unloading point, the proportion of the user's order starting point is the real loading point is 13%, and the user's order ending point is 13%. The ratio for the real loading point is 7%. Therefore, new methods are needed to mine real loading and unloading points based on available data. At the same time, the higher the proportion of digging points that are real loading and unloading points, the higher the proportion of recommending accurate loading and unloading points to users. Experience matters.

为了提高司机货物碰面效率,降低碰面成本,让更多的司机与用户选择并留存于平台,需要一种挖掘实际装卸货点的方法,进而生成真实装卸货点数据库,对用户提供装卸货点推荐服务。基于此,本申请实施例提供一种数据处理方法、装置、存储介质及服务器,结合用户订单数据、司机操作数据及行驶轨迹数据对真实装卸货点进行识别挖掘,可提高货运接单过程中的定点率。In order to improve the efficiency of driver-cargo encounters, reduce the cost of encounters, and allow more drivers and users to choose and store them on the platform, a method of mining actual loading and unloading points is needed, and then a database of actual loading and unloading points is generated to provide users with loading and unloading point recommendations. Serve. Based on this, the embodiments of the present application provide a data processing method, device, storage medium, and server, which can identify and mine real loading and unloading points in combination with user order data, driver operation data, and driving trajectory data, which can improve the process of receiving freight orders. Fixed point rate.

在一实施例中,提供一种数据处理方法,应用服务器中。参考图1,该数据处理方法的具体流程可以如下:In one embodiment, a data processing method is provided in an application server. Referring to Figure 1, the specific flow of the data processing method may be as follows:

101、获取订单数据和对应的行驶轨迹数据。101. Acquire order data and corresponding driving track data.

本方案应用于货运场景下。具体的,订单数据指用户通过电子设备中安装的货运APP、小程序或网页所发起货运需求的相关数据,例如用户基本信息(如姓名、联系方式)、货运基本信息(如货运起点、货运终点、运货时间、货物明细等等)。This solution is used in freight scenarios. Specifically, order data refers to the data related to the freight demand initiated by the user through the freight APP, applet or web page installed in the electronic device, such as basic user information (such as name, contact information), basic freight information (such as freight origin, freight destination) , delivery time, cargo details, etc.).

102、基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点。行驶轨迹数据指本次货运过程中的实际行驶路线数据和行驶时间数据。102. Based on the order data and the driver operation data, constrain the time and position of the trajectory points in the driving trajectory data, and determine a plurality of candidate trajectory points from the driving trajectory data. The driving trajectory data refers to the actual driving route data and driving time data during the freight process.

在一实施方式中,在基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点时,具体可以包括以下流程:In one embodiment, based on the order data and the driver's operation data, the time and position of the trajectory points in the driving trajectory data are constrained, and when multiple candidate trajectory points are determined from the driving trajectory data, the following procedures may be specifically included:

根据司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段;Determine the starting point section of the order track according to the driver's operation data, the time of each track point in the driving track data, and the first time constraint;

根据订单数据、所述司机操作数据、所述起点路段中各轨迹点的位置、起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从起点路段中确定多个候选轨迹点。According to the order data, the driver's operation data, the position of each trajectory point in the starting point road segment, the time of each trajectory point in the starting point road segment, the first distance constraint condition and the second time constraint condition, a plurality of candidates are determined from the starting point road segment track point.

在一实施方式中,司机操作数据包括:司机确认接单时间和司机确认到达起点时间。在根据司机操作数据、轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段时,具体可以基于轨迹数据中各轨迹点的时间,选取司机确认接单时间对应的轨迹点、及司机确定到达起点前指定时间段内的轨迹点,作为订单轨迹的起点路段。In one embodiment, the driver operation data includes: the time when the driver confirms the receipt of the order and the time when the driver confirms the arrival at the starting point. When determining the starting point section of the order track based on the driver's operation data, the time of each track point in the track data, and the first time constraint, you can specifically select the time corresponding to the time of the driver's confirmation of receiving the order based on the time of each track point in the track data. The track point and the track point within the specified time period determined by the driver before reaching the starting point are used as the starting point section of the order track.

在一实施方式中,订单数据包括:订单起点,司机操作数据包括:司机确认接单时间、司机确认到达起点时间、司机确认装货点。在根据订单数据、所述司机操作数据、起点路段中各轨迹点的位置、起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从起点路段中确定多个候选轨迹点时,具体可以包括以下流程:In one embodiment, the order data includes: the starting point of the order, and the driver operation data includes: the time when the driver confirms the receipt of the order, the time when the driver confirms the arrival at the starting point, and the driver confirms the loading point. Determine a plurality of candidate trajectories from the starting point road segment according to the order data, the driver operation data, the position of each trajectory point in the starting point road segment, the time of each trajectory point in the starting point road segment, the first distance constraint condition and the second time constraint condition When clicking, it can specifically include the following processes:

基于行驶轨迹数据中各轨迹点与订单起点的距离、或者行驶轨迹数据中各轨迹点与司机确认装货点的距离,从起点路段中筛选出满足第一距离约束条件的轨迹点;Based on the distance between each trajectory point in the driving trajectory data and the starting point of the order, or the distance between each trajectory point in the driving trajectory data and the driver's confirmed loading point, screen out the trajectory points that satisfy the first distance constraint from the starting point road segment;

基于所筛选出的轨迹点的时间与司机确认装货时间的时间差、及所筛选出的轨迹点的时间与司机确认到达起点时间的时间差,从所筛选出的轨迹点中确定出时间差满足第二时间约束条件的轨迹点,作为候选轨迹点。Based on the time difference between the time of the selected trajectory points and the time when the driver confirms the loading, and the time difference between the time of the selected trajectory points and the time when the driver confirms the arrival at the starting point, it is determined from the selected trajectory points that the time difference satisfies the second The trajectory points of the time constraints are used as candidate trajectory points.

103、对多个候选轨迹点进行聚类,得到多个聚类簇。103. Cluster the multiple candidate trajectory points to obtain multiple clusters.

具体的,可以用基于密度聚类方法DBSCAN分别对装卸货候选轨迹点进行聚类,最小样本量n为2,ε-邻域半径eps为0.00009。Specifically, the density-based clustering method DBSCAN can be used to cluster the loading and unloading candidate trajectory points respectively, the minimum sample size n is 2, and the ε-neighborhood radius eps is 0.00009.

104、基于订单数据和所述司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇。104. Based on the order data and the driver operation data, constrain the positions of the cluster center points of the multiple clusters, and determine a candidate cluster from the multiple clusters.

在一实施方式中,订单数据可包括:订单起点,所述司机操作数据包括:司机确认装货完成点。在基于订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇时,具体可包括以下流程:In one embodiment, the order data may include: the origin of the order, and the driver operation data may include: the driver confirms the loading completion point. When constraining the positions of the cluster center points of the multiple clusters based on the order data and the driver operation data, and determining candidate clusters from the multiple clusters, the following processes may be specifically included:

获取簇中心点与订单起点的第一距离、及簇中心点与司机确认装货完成点的第二距离;Obtain the first distance between the cluster center point and the starting point of the order, and the second distance between the cluster center point and the point where the driver confirms the completion of loading;

分别根据第一距离和第二距离对多个聚类簇进行排序,得到第一排序结果和第二排序结果;Sorting the plurality of clusters according to the first distance and the second distance, respectively, to obtain the first sorting result and the second sorting result;

分别根据第一排序结果和第二排序结果,从多个聚类簇中筛选出排序满足第一排序约束条件的聚类簇;According to the first sorting result and the second sorting result, respectively, filter out the clusters that satisfy the first sorting constraint from the plurality of clusters;

将筛选出的聚类簇、及司机确认装货完成点所在的聚类簇,作为候选簇。The filtered cluster and the cluster where the driver confirms the loading completion point are used as candidate clusters.

105、对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标起点。105. Constrain the length of stay of the driver on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determine the target starting point of the order from the trajectory points in the candidate cluster.

在一实施方式中,从候选簇内的轨迹点中确定所述订单的目标起点时,可以分别获取所述候选簇内各轨迹点与订单起点的第三距离、候选簇内各轨迹点与司机确认装货点的第四距离、候选簇内各轨迹点与司机确认装货完成点的第五距离;获取所述停留时长与第三距离的第一比值、停留时长与第四距离的第二比值、停留时长与第五距离的第三比值。最后,基于第一比值、第二比值及第三比值,从候选簇内的轨迹点中确定订单的目标起点。In one embodiment, when the target starting point of the order is determined from the trajectory points in the candidate cluster, the third distance between each trajectory point in the candidate cluster and the order starting point, the trajectory points in the candidate cluster and the driver can be obtained respectively. Confirm the fourth distance of the loading point, the fifth distance between each trajectory point in the candidate cluster and the driver's confirmed loading completion point; obtain the first ratio of the dwell time to the third distance, and the second ratio of the dwell time to the fourth distance The third ratio of the ratio, the dwell time and the fifth distance. Finally, based on the first ratio, the second ratio and the third ratio, the target origin of the order is determined from the trajectory points within the candidate cluster.

在一实施方式中,在基于第一比值、第二比值及第三比值,从候选簇内的轨迹点中确定所述订单的目标起点时,首先,可基于第一比值、第二比值、所述第三比值及预设评分策略,对所述候选簇内的轨迹点中的轨迹点进行打分。然后,根据打分结果,从候选簇内的轨迹点中筛选出分数最高的轨迹点,确为订单的目标起点。In one embodiment, when determining the target starting point of the order from the trajectory points in the candidate cluster based on the first ratio, the second ratio and the third ratio, firstly, based on the first ratio, the second ratio, the all The third ratio and the preset scoring strategy are used to score the track points in the track points in the candidate cluster. Then, according to the scoring result, the trajectory point with the highest score is selected from the trajectory points in the candidate cluster, which is indeed the target starting point of the order.

同理,在一实施例中,对于轨迹订单目标终点的挖掘和识别,可参考上述逻辑策略。也即,还提供另一种数据处理方法,具体可以如下:Similarly, in an embodiment, for the mining and identification of the target end point of the trajectory order, the above-mentioned logic strategy may be referred to. That is, another data processing method is also provided, which can be specifically as follows:

获取订单数据和对应的行驶轨迹数据;Obtain order data and corresponding driving trajectory data;

基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;Based on the order data and driver operation data, the time and position of the trajectory points in the driving trajectory data are constrained, and multiple candidate trajectory points are determined from the driving trajectory data;

对多个候选轨迹点进行聚类,得到多个聚类簇;Clustering multiple candidate trajectory points to obtain multiple clusters;

基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;Based on order data and driver operation data, the positions of the cluster center points of multiple clusters are constrained, and candidate clusters are determined from multiple clusters;

对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标终点。Constrain the duration of the driver's stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determine the target destination of the order from the trajectory points in the candidate cluster.

具体的,在挖掘目标终点时,订单数据可以包括:用户基本信息(如姓名、联系方式)、货运基本信息(如货运起点、货运终点、运货时间、货物明细等等)。用户操作数据可以包括:司机确认到达终点时间、司机确认完单时间、司机确认的卸货点信息、司机确认的卸货完成点信息。Specifically, when mining the target destination, the order data may include: basic user information (such as name, contact information), and basic freight information (such as origin, destination, time, details, etc.). The user operation data may include: the time when the driver confirms the arrival at the end point, the time when the driver confirms the order, the unloading point information confirmed by the driver, and the unloading completion point information confirmed by the driver.

由上可知,本实施例提供的数据处理方法,基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;对多个候选轨迹点进行聚类,得到多个聚类簇;基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标起点或目标终点。本方案使用聚类算法和策略识别实际装卸货点,可以提升货运场景下真实位置点挖掘的准确性。It can be seen from the above that the data processing method provided in this embodiment, based on the order data and driver operation data, constrains the time and position of the trajectory points in the driving trajectory data, and determines a plurality of candidate trajectory points from the driving trajectory data; The candidate trajectory points are clustered to obtain multiple clusters; based on the order data and driver operation data, the positions of the cluster center points of the multiple clusters are constrained, and the candidate clusters are determined from the multiple clusters; The duration of the driver's stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster are constrained, and the target start point or target end point of the order is determined from the trajectory points in the candidate cluster. This solution uses clustering algorithms and strategies to identify the actual loading and unloading points, which can improve the accuracy of mining real location points in freight scenarios.

在本申请又一实施例中,还提供一种实际装卸货点挖掘系统。参考图2,该系统架构可以包括:电子设备、通信设备、以及服务器设备。通信设备服务用于联通服务器与终端设备,提供数据交互链路;该通信设备可以通过但不限于以下设备实现:无线网络(WiFi/4G/5G)、有线网络、卫星通讯等。In yet another embodiment of the present application, an actual loading and unloading point excavation system is also provided. Referring to FIG. 2, the system architecture may include: electronic equipment, communication equipment, and server equipment. The communication device service is used to connect the server and the terminal device, and provide a data interaction link; the communication device can be realized by but not limited to the following devices: wireless network (WiFi/4G/5G), wired network, satellite communication, etc.

具体的,用户可使用电子设备通过通信设备与服务器完成数据发送与接收的交互操作。服务器和电子设备中可运行软件程序实现数据发送、数据接收、数据处理、数据展示、模型构建、模型预测等任务。Specifically, the user can use the electronic device to complete the interactive operation of data sending and receiving through the communication device and the server. Software programs can be run on servers and electronic devices to realize tasks such as data transmission, data reception, data processing, data display, model construction, and model prediction.

电子设备包括但不限于电脑、手机、平板等智能终端设备,可以通过通信设备接收来自服务器设备的数据。该电子设备中可安装货运APP,用户可通过APP下单形成货运订单,司机可通过电子设备中安装的APP接受货运订单,并根据订单信息到达货运起点执行货运作业。本实施例中,服务器泛指服务器设施,可以是单个独立服务器或服务器集群,可通过在服务器中运行相应程序实现模型构建和部署。Electronic devices include but are not limited to smart terminal devices such as computers, mobile phones, and tablets, which can receive data from server devices through communication devices. A freight APP can be installed in the electronic device. Users can place orders through the APP to form a freight order. The driver can accept the freight order through the APP installed in the electronic device, and arrive at the freight starting point to perform the freight operation according to the order information. In this embodiment, a server generally refers to a server facility, which may be a single independent server or a server cluster, and model construction and deployment may be implemented by running a corresponding program in the server.

服务器通过系统软件与应用软件提供基础服务能力,在此基础上,服务器提供了货运实际装卸货点的挖掘能力。在本方案中,参考图3,服务器可实现如下功能:The server provides basic service capabilities through system software and application software. On this basis, the server provides the mining capabilities of the actual loading and unloading points of freight. In this solution, referring to Figure 3, the server can implement the following functions:

步骤一、数据的清洗与处理Step 1. Data cleaning and processing

(11)轨迹数据分段:取司机确认接单时间和司机确认到达起点前20min时间的较晚时间作为订单轨迹的起点,取司机确认完单后过20min的时间和司机确认卸货完成后20min时间的较早时间作为订单轨迹的终点,然后通过司机ID与时间将订单数据与轨迹数据进行关联。(11) Segmentation of trajectory data: Take the later time of the driver's confirmation of receiving the order and 20 minutes before the driver's confirmation of arriving at the starting point as the starting point of the order trajectory, and take the time 20 minutes after the driver's confirmation of the order and 20 minutes after the driver's confirmation that the unloading is completed. The earlier time is used as the end point of the order track, and then the order data is associated with the track data through the driver ID and time.

具体的,司机轨迹数据按时间存储,轨迹数据di={driver_id,loactioni,ti},i=1,2,…,m,其中driver_id为司机代号,loactioni={loni,lati},loni为某时刻ti(例如“2021-12-1713:13:13”)位置的经度,lati为某时刻ti位置的纬度;订单数据order={order_id,driver_id,o_order_t,o_order_loaction,o_arrived_t,o_loading_t,o_loading_location,o_loaded_t,o_loaded_location,d_order_t,d_order_loaction,d_arrived_t,d_unloading_t,d_unloading_location,d_complete_t,d_complete_location},其中order_id为司机代号,o_order_t,o_order_loaction分别为用户下单起点的时间和位置,o_arrived_t,o_loading_t,o_loading_location,o_loaded_t,o_loaded_location分别为司机确认的到达起点时间、装货开始时间和位置、装货完成时间和位置,d_order_t,d_order_loaction分别为用户下单终点的时间和位置,d_arrived_t,d_unloading_t,d_unloading_location,d_complete_t,d_complete_location分别为司机确认的到达终点时间、卸货开始时间和位置、订单完成时间和位置。取司机确认接单时间和司机确认到达起点前一段时间(如20min)的较晚时间作为订单轨迹的起点,取司机确认完单后过一段时间(如20min)的时间和司机确认卸货完成后一段时间(如20min)的较早时间作为订单轨迹的终点,然后通过司机ID与时间将订单数据与轨迹数据进行关联。Specifically, the driver trajectory data is stored by time, the trajectory data di={driver_id, loactioni, ti}, i=1, 2, ..., m, where driver_id is the driver code, loactioni={loni, lati}, loni is a certain time ti (for example, "2021-12-1713:13:13") is the longitude of the location, and lati is the latitude of the location of ti at a certain time; order data order={order_id, driver_id, o_order_t, o_order_loaction, o_arrived_t, o_loading_t, o_loading_location, o_loaded_t, o_loaded_location , d_order_t, d_order_loaction, d_arrived_t, d_unloading_t, d_unloading_location, d_complete_t, d_complete_location}, where order_id is the driver code, o_order_t, o_order_loaction are the time and location of the user's order starting point, o_arrived_t, o_loading_t, o_loading_location, o_loaded_t, o_loaded_location are the confirmation of the driver, respectively Arrival time, loading start time and location, loading completion time and location, d_order_t, d_order_loaction are the time and location of the end point of the user's order, respectively, d_arrived_t, d_unloading_t, d_unloading_location, d_complete_t, d_complete_location are the arrival time confirmed by the driver, Unloading start time and location, order completion time and location. The starting point of the order track is the time when the driver confirms the order receipt and the later time (such as 20 minutes) before the driver confirms the arrival at the starting point. The earlier time (eg 20min) is used as the end point of the order track, and then the order data and track data are associated with the driver ID and time.

(12)由于GPS的定位精度和信号强度问题,部分司机轨迹中轨迹点漂移等情况。由于轨迹采集数据存在一定时间间隔,因此可通过计算轨迹点分别和前后轨迹点距离之和与前后轨迹点直线距离比例是否合理来进行判断。若超过设定比值,则轨迹数据出现漂移,将视为噪声数据进行剔除。(12) Due to the problem of GPS positioning accuracy and signal strength, the trajectory points in some drivers' trajectories drift. Since there is a certain time interval in the trajectory collection data, it can be judged by calculating whether the ratio of the distance between the trajectory points and the distance between the front and rear trajectory points and the straight line distance between the front and rear trajectory points is reasonable. If it exceeds the set ratio, the trajectory data will drift, and it will be regarded as noise data to be eliminated.

(13)订单数据中记录的用户和司机操作位置与时间,排除操作时间与操作顺序不符的数据。例如司机确认到达起点时间晚于确认装货开始时间,这是不符合操作顺序逻辑的。(13) The operation position and time of the user and the driver recorded in the order data, excluding the data whose operation time does not match the operation order. For example, the driver confirms that the arrival time at the starting point is later than the confirmed loading start time, which is not in line with the operation sequence logic.

(14)轨迹装卸货分类:筛选距离用户下单起点或司机确认装货点一定距离范围(如500m)内,且轨迹点时间与司机确认的装货时间、到达起点时间相差指定时长(如10min)内,标记为装货点候选轨迹点。同理,筛选距离用户下单终点或司机确认卸货点一定距离范围(如500m)内,且轨迹点时间与司机确认的卸货时间、到达终点时间相差指定时长(如10min)内,标记为卸货点候选轨迹点。(14) Track loading and unloading classification: filter within a certain distance (such as 500m) from the starting point of the user’s order or the loading point confirmed by the driver, and the time of the track point is different from the loading time and arrival time confirmed by the driver by a specified period of time (such as 10min). ), marked as candidate trajectory points for loading points. In the same way, the screening is within a certain distance (such as 500m) from the end point of the user's order or the unloading point confirmed by the driver, and the track point time is within the specified time period (such as 10min) from the unloading time confirmed by the driver and the arrival time at the end point, and it is marked as the unloading point. candidate trajectory points.

步骤二、对装卸货点进行判别Step 2: Identify the loading and unloading points

(21)用基于密度聚类方法DBSCAN分别对装卸货候选轨迹点进行聚类,最小样本量n为2,ε-邻域半径eps为0.00009,点簇中心点作为候选点位,进入后续打分阶段。(21) Use the density-based clustering method DBSCAN to cluster the candidate trajectory points of loading and unloading respectively. The minimum sample size n is 2, the ε-neighborhood radius eps is 0.00009, and the center point of the point cluster is used as the candidate point, and enters the subsequent scoring stage. .

(22)由于有部分轨迹存在司机找路,间隔很久后重复轨迹点,若用簇内点位时间差值计算停留时间,则存在错误。因此,假设轨迹采集间隔时间为n秒,则停留时间的计算方法可设为点簇的个数乘以n秒。(22) Since there are some trajectories where drivers find their way, the trajectory points are repeated after a long interval. If the time difference between points in the cluster is used to calculate the dwell time, there is an error. Therefore, assuming that the trajectory collection interval is n seconds, the calculation method of the dwell time can be set as the number of point clusters multiplied by n seconds.

(23)选取聚类簇中心点与司机确认装货完成点距离最近的前三点位、簇中心点与用户下单起点距离最近的点位以及司机确认装货完成点作为实际装货点的候选点位;同理完成实际卸货点候选点位的选取。(23) Select the first three points with the closest distance between the cluster center point and the driver's confirmed loading completion point, the closest point between the cluster center point and the user's order starting point, and the driver's confirmed loading completion point as the actual loading point. Candidate points; similarly, the selection of candidate points for the actual unloading point is completed.

(24)通过max{候选点位停留时长/(候选点位与用户下单起点距离),候选点位停留时长/(候点位与司机确认装货开始点距离),候选点位停留时长/(点位与司机确认装货完成点距离)},给每个候选点位打分,取分数最高的一个点位作为装货点的挖掘点。同理可完成卸货点的判别,即max{候选点位停留时长/(候选点位与用户下单终点距离),候选点位停留时长/(候点位与司机确认卸货开始点距离),候选点位停留时长/(点位与司机确认卸货完成点距离)},按照相同策略给每个候选点位打分,取分数最高的一个点位作为卸货点的挖掘点。(24) Through max{the length of stay at the candidate point/(the distance between the candidate point and the starting point of the user's order), the length of stay at the candidate point/(the distance between the candidate point and the starting point for the driver to confirm the loading), the length of stay at the candidate point/ (The distance between the point and the point where the driver confirms the completion of loading)}, score each candidate point, and take the point with the highest score as the excavation point of the loading point. In the same way, the judgment of the unloading point can be completed, that is, max{the length of stay at the candidate point/(the distance between the candidate point and the end point of the user's order), the length of the stay at the candidate point/(the distance between the candidate point and the driver's confirmed unloading start point), the candidate Point stay time/(distance between the point and the point where the driver confirms the unloading completion)}, score each candidate point according to the same strategy, and take the point with the highest score as the excavation point of the unloading point.

步骤三、效果评估Step three, effect evaluation

(31)计算人工标注装卸货点与上述挖掘算法输出的装卸货点的距离。(31) Calculate the distance between the manually marked loading and unloading point and the loading and unloading point output by the above mining algorithm.

(32)计算各距离定点率。其中,装货地定点率为装货地挖掘点与装货地人工标注点距离一定范围(如30m)内的点个数与全部标注点个数的比值;卸货地定点率为卸货地挖掘点与卸货地人工标注点距离一定范围(如30m)内的点个数与全部标注点个数的比值。(32) Calculate the fixed point rate of each distance. Among them, the fixed point rate of the loading place is the ratio of the number of points within a certain distance (such as 30m) between the excavation point of the loading place and the manually marked point of the loading place to the number of all marked points; the fixed point rate of the unloading place is the excavation point of the unloading place. The ratio of the number of points within a certain distance (such as 30m) from the manually marked points of the unloading place to the total number of marked points.

可知,本方案结合了用户订单数据、行驶轨迹数据和司机操作数据对真实装货点和卸货点进行识别挖掘,可提高货运接单过程中的定点率。It can be seen that this scheme combines user order data, driving trajectory data and driver operation data to identify and mine the real loading point and unloading point, which can improve the fixed point rate in the process of receiving freight orders.

在本申请又一实施例中,还提供一种数据处理装置。该数据处理装置可以软件或硬件的形式集成在服务器中。如图4所示,该数据处理装置200可以包括:第一获取单元201、第一确定单元202、第一聚类单元203、第二确定单元204和第三确定单元205,其中:In yet another embodiment of the present application, a data processing apparatus is also provided. The data processing device can be integrated in the server in the form of software or hardware. As shown in FIG. 4, the data processing apparatus 200 may include: a first obtaining unit 201, a first determining unit 202, a first clustering unit 203, a second determining unit 204, and a third determining unit 205, wherein:

第一获取单元201,用于获取订单数据和对应的行驶轨迹数据;The first obtaining unit 201 is used to obtain order data and corresponding driving track data;

第一确定单元202,用于基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;The first determining unit 202 is configured to constrain the time and position of the trajectory points in the driving trajectory data based on the order data and the driver operation data, and determine a plurality of candidate trajectory points from the driving trajectory data;

第一聚类单元203,用于对多个候选轨迹点进行聚类,得到多个聚类簇;The first clustering unit 203 is used for clustering multiple candidate trajectory points to obtain multiple cluster clusters;

第二确定单元204,用于基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;The second determining unit 204 is configured to constrain the positions of the cluster center points of the multiple clusters based on the order data and the driver operation data, and determine candidate clusters from the multiple clusters;

第三确定单元205,用于对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标起点。The third determining unit 205 is configured to constrain the length of stay of the driver on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determine the driver from the trajectory points in the candidate cluster. The target starting point for the order.

在一实施方式中,第一确定单元202用于:In one embodiment, the first determining unit 202 is used for:

根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段;According to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint, determine the starting point section of the order trajectory;

根据所述订单数据、所述司机操作数据、所述起点路段中各轨迹点的位置、所述起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述起点路段中确定多个候选轨迹点。According to the order data, the driver operation data, the position of each track point in the starting point road segment, the time of each track point in the starting point road segment, the first distance constraint condition and the second time constraint condition, from the starting point A plurality of candidate trajectory points are determined in the road segment.

在一实施方式中,所述司机操作数据包括:司机确认接单时间和司机确认到达起点时间;在根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段时,第一确定单元202具体用于:In one embodiment, the driver operation data includes: the time when the driver confirms the receipt of the order and the time when the driver confirms the arrival at the starting point; the time according to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint condition, when determining the starting point segment of the order track, the first determining unit 202 is specifically used for:

基于所述行驶轨迹数据中各轨迹点的时间,选取司机确认接单时间对应的轨迹点、及司机确定到达起点前指定时间段内的轨迹点,作为订单轨迹的起点路段。Based on the time of each track point in the driving track data, the track point corresponding to the time when the driver confirms the order receipt and the track point within the specified time period before the driver determines to arrive at the starting point are selected as the starting point section of the order track.

在一实施方式中,所述订单数据包括:订单起点,所述司机操作数据包括:司机确认接单时间、司机确认到达起点时间、司机确认装货点;在根据所述订单数据、所述司机操作数据、所述起点路段中各轨迹点的位置、所述起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述起点路段中确定多个候选轨迹点时,第一确定单元202具体用于:In one embodiment, the order data includes: the starting point of the order, and the driver operation data includes: the driver confirms the order receiving time, the driver confirms the arrival time at the starting point, and the driver confirms the loading point; Operation data, the position of each trajectory point in the starting point road segment, the time of each trajectory point in the starting point road segment, the first distance constraint condition and the second time constraint condition, when multiple candidate trajectory points are determined from the starting point road segment , the first determining unit 202 is specifically used for:

基于所述行驶轨迹数据中各轨迹点与所述订单起点的距离、或者所述行驶轨迹数据中各轨迹点与所述司机确认装货点的距离,从所述起点路段中筛选出满足第一距离约束条件的轨迹点;Based on the distance between each trajectory point in the driving trajectory data and the starting point of the order, or the distance between each trajectory point in the driving trajectory data and the loading point confirmed by the driver, filter out the road segments that satisfy the first requirement from the starting point. The trajectory point of the distance constraint;

基于所筛选出的轨迹点的时间与司机确认装货时间的时间差、及所筛选出的轨迹点的时间与司机确认到达起点时间的时间差,从所筛选出的轨迹点中确定出时间差满足第二时间约束条件的轨迹点,作为所述候选轨迹点。Based on the time difference between the time of the selected trajectory points and the time when the driver confirms the loading, and the time difference between the time of the selected trajectory points and the time when the driver confirms the arrival at the starting point, it is determined from the selected trajectory points that the time difference satisfies the second The trajectory points of the time constraints are used as the candidate trajectory points.

在一实施方式中,所述订单数据包括:订单起点,所述司机操作数据包括:司机确认装货完成点;第二确定单元204用于:In one embodiment, the order data includes: the starting point of the order, and the driver operation data includes: the driver confirms the loading completion point; the second determining unit 204 is used for:

获取簇中心点与订单起点的第一距离、及簇中心点与司机确认装货完成点的第二距离;Obtain the first distance between the cluster center point and the starting point of the order, and the second distance between the cluster center point and the point where the driver confirms the completion of loading;

分别根据第一距离和第二距离对所述多个聚类簇进行排序,得到第一排序结果和第二排序结果;Sorting the plurality of clusters according to the first distance and the second distance, respectively, to obtain a first sorting result and a second sorting result;

分别根据第一排序结果和第二排序结果,从多个聚类簇中筛选出排序满足第一排序约束条件的聚类簇;According to the first sorting result and the second sorting result, respectively, filter out the clusters that satisfy the first sorting constraint from the plurality of clusters;

将筛选出的聚类簇、及司机确认卸货完成点所在的聚类簇,作为候选簇。The selected cluster and the cluster where the driver confirms the unloading completion point are used as candidate clusters.

在一实施方式中,第三确定单元205用于:In one embodiment, the third determining unit 205 is used for:

分别获取所述候选簇内各轨迹点与订单起点的第三距离、所述候选簇内各轨迹点与司机确认装货点的第四距离、所述候选簇内各轨迹点与司机确认装货完成点的第五距离;Obtain the third distance between each trajectory point in the candidate cluster and the starting point of the order, the fourth distance between each trajectory point in the candidate cluster and the driver's confirmed loading point, and each trajectory point in the candidate cluster and the driver's confirmed loading point. The fifth distance of the completion point;

获取所述停留时长与所述第三距离的第一比值、所述停留时长与所述第四距离的第二比值、所述停留时长与所述第五距离的第三比值;obtaining a first ratio of the stay duration to the third distance, a second ratio of the stay duration to the fourth distance, and a third ratio of the stay duration to the fifth distance;

基于所述第一比值、所述第二比值及所述第三比值,从所述候选簇内的轨迹点中确定所述订单的目标起点。Based on the first ratio, the second ratio, and the third ratio, a target origin of the order is determined from trajectory points within the candidate cluster.

在一实施方式中,在基于所述第一比值、所述第二比值及所述第三比值,从所述候选簇内的轨迹点中确定所述订单的目标起点时,第三确定单元205进一步用于:In one embodiment, when determining the target starting point of the order from the trajectory points in the candidate cluster based on the first ratio, the second ratio and the third ratio, the third determining unit 205 Further use for:

基于所述第一比值、所述第二比值、所述第三比值及预设评分策略,对所述候选簇内的轨迹点中的轨迹点进行打分;based on the first ratio, the second ratio, the third ratio and a preset scoring strategy, scoring the track points in the track points in the candidate cluster;

根据打分结果,从所述候选簇内的轨迹点中筛选出分数最高的轨迹点,确为所述订单的目标起点。According to the scoring result, the trajectory point with the highest score is selected from the trajectory points in the candidate cluster, which is indeed the target starting point of the order.

由上可知,本申请实施例提供的数据处理装置,基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;对多个候选轨迹点进行聚类,得到多个聚类簇;基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标起点。本方案使用聚类算法和策略识别实际装货点,可以提升货运场景下真实位置点挖掘的准确性。As can be seen from the above, the data processing device provided by the embodiment of the present application, based on the order data and the driver operation data, constrains the time and position of the trajectory points in the driving trajectory data, and determines a plurality of candidate trajectory points from the driving trajectory data; Cluster the candidate trajectory points to obtain multiple clusters; based on the order data and driver operation data, constrain the positions of the cluster center points of the multiple clusters, and determine the candidate clusters from the multiple clusters; Constrain the duration of the driver's stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determine the target starting point of the order from the trajectory points in the candidate cluster. This scheme uses clustering algorithms and strategies to identify actual loading points, which can improve the accuracy of real location mining in freight scenarios.

在本申请又一实施例中,还提供一种数据处理装置。该数据处理装置可以软件或硬件的形式集成在服务器中。如图5所示,该数据处理装置300可以包括:第二获取单元301、第四确定单元302、第二聚类单元303、第五确定单元304和第六确定单元305,其中:In yet another embodiment of the present application, a data processing apparatus is also provided. The data processing device can be integrated in the server in the form of software or hardware. As shown in FIG. 5 , the data processing apparatus 300 may include: a second obtaining unit 301, a fourth determining unit 302, a second clustering unit 303, a fifth determining unit 304 and a sixth determining unit 305, wherein:

第一获取单元301,用于获取订单数据和对应的行驶轨迹数据;The first obtaining unit 301 is used to obtain order data and corresponding driving track data;

第四确定单元302,用于基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;The fourth determination unit 302 is configured to constrain the time and position of the trajectory points in the driving trajectory data based on the order data and the driver operation data, and determine a plurality of candidate trajectory points from the driving trajectory data;

第二聚类单元303,用于对多个候选轨迹点进行聚类,得到多个聚类簇;The second clustering unit 303 is configured to perform clustering on a plurality of candidate trajectory points to obtain a plurality of clusters;

第五确定单元304,用于基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;a fifth determining unit 304, configured to constrain the positions of the cluster center points of the multiple clusters based on the order data and the driver operation data, and determine candidate clusters from the multiple clusters;

第六确定单元305,用于对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标起点。The sixth determination unit 305 is configured to constrain the length of stay of the driver on each track point in the candidate cluster and the position of each track point in the candidate cluster, and determine the track point from the candidate cluster. The target starting point for the order.

在一实施方式中,第四确定单元302用于:In one embodiment, the fourth determining unit 302 is used for:

根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的终点路段;According to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint, determine the end road section of the order trajectory;

根据所述订单数据、所述司机操作数据、所述终点路段中各轨迹点的位置、所述终点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述终点路段中确定多个候选轨迹点。According to the order data, the driver's operation data, the position of each track point in the end road segment, the time of each track point in the end road segment, the first distance constraint condition and the second time constraint condition, from the end point A plurality of candidate trajectory points are determined in the road segment.

在一实施方式中,所述司机操作数据包括:司机确认完单时间和司机确认到达终点时间;在根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的终点路段时,第四确定单元302具体用于:In one embodiment, the driver operation data includes: the time when the driver confirms the single and the time when the driver confirms the arrival at the end point; the time according to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint condition, when determining the end road segment of the order track, the fourth determining unit 302 is specifically used for:

基于所述行驶轨迹数据中各轨迹点的时间,选取司机确认完单时间对应的轨迹点、及司机确定到达终点前指定时间段内的轨迹点,作为订单轨迹的终点路段。Based on the time of each track point in the driving track data, the track point corresponding to the driver's confirmation of the single time and the track point within the specified time period before the driver's determination to reach the end point are selected as the end road section of the order track.

在一实施方式中,所述订单数据包括:订单终点,所述司机操作数据包括:司机确认完单时间、司机确认到达终点时间、司机确认卸货点;在根据所述订单数据、所述司机操作数据、所述终点路段中各轨迹点的位置、所述终点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述终点路段中确定多个候选轨迹点时,第四确定单元302具体用于:In one embodiment, the order data includes: the order end point, and the driver operation data includes: the time when the driver confirms the order, the time when the driver confirms the arrival at the end point, and the driver confirms the unloading point; data, the position of each trajectory point in the end road segment, the time of each trajectory point in the end road segment, the first distance constraint and the second time constraint, when multiple candidate trajectory points are determined from the end road segment, The fourth determining unit 302 is specifically configured to:

基于所述行驶轨迹数据中各轨迹点与所述订单终点的距离、或者所述行驶轨迹数据中各轨迹点与所述司机确认卸货点的距离,从所述终点路段中筛选出满足第一距离约束条件的轨迹点;Based on the distance between each trajectory point in the driving trajectory data and the end point of the order, or the distance between each trajectory point in the driving trajectory data and the driver's confirmed unloading point, filter out the road segments that satisfy the first distance from the end point. Constraint trajectory points;

基于所筛选出的轨迹点的时间与司机确认卸货时间的时间差、及所筛选出的轨迹点的时间与司机确认到达终点时间的时间差,从所筛选出的轨迹点中确定出时间差满足第二时间约束条件的轨迹点,作为所述候选轨迹点。Based on the time difference between the time of the selected trajectory point and the time when the driver confirms the unloading, and the time difference between the time of the selected trajectory point and the time when the driver confirms the arrival at the end point, it is determined from the selected trajectory points that the time difference satisfies the second time The trajectory points of the constraints are used as the candidate trajectory points.

在一实施方式中,所述订单数据包括:订单终点,所述司机操作数据包括:司机确认卸货完成点;第五确定单元304用于:In one embodiment, the order data includes: the end point of the order, and the driver operation data includes: the driver confirms the unloading completion point; the fifth determining unit 304 is used for:

获取簇中心点与订单终点的第一距离、及簇中心点与司机确认卸货完成点的第二距离;Obtain the first distance between the cluster center point and the order end point, and the second distance between the cluster center point and the point where the driver confirms the unloading completion;

分别根据第一距离和第二距离对所述多个聚类簇进行排序,得到第一排序结果和第二排序结果;Sorting the plurality of clusters according to the first distance and the second distance, respectively, to obtain a first sorting result and a second sorting result;

分别根据第一排序结果和第二排序结果,从多个聚类簇中筛选出排序满足第一排序约束条件的聚类簇;According to the first sorting result and the second sorting result, respectively, filter out the clusters that satisfy the first sorting constraint from the plurality of clusters;

将筛选出的聚类簇、及司机确认卸货完成点所在的聚类簇,作为候选簇。The selected cluster and the cluster where the driver confirms the unloading completion point are used as candidate clusters.

在一实施方式中,第六确定单元305用于:In one embodiment, the sixth determining unit 305 is used for:

分别获取所述候选簇内各轨迹点与订单终点的第三距离、所述候选簇内各轨迹点与司机确认卸货点的第四距离、所述候选簇内各轨迹点与司机确认卸货完成点的第五距离;Obtain the third distance between each trajectory point in the candidate cluster and the order end point, the fourth distance between each trajectory point in the candidate cluster and the driver's confirmed unloading point, and each trajectory point in the candidate cluster and the driver's confirmed unloading completion point. the fifth distance;

获取所述停留时长与所述第三距离的第一比值、所述停留时长与所述第四距离的第二比值、所述停留时长与所述第五距离的第三比值;obtaining a first ratio of the stay duration to the third distance, a second ratio of the stay duration to the fourth distance, and a third ratio of the stay duration to the fifth distance;

基于所述第一比值、所述第二比值及所述第三比值,从所述候选簇内的轨迹点中确定所述订单的目标终点。Based on the first ratio, the second ratio, and the third ratio, a target end point of the order is determined from trajectory points within the candidate cluster.

在一实施方式中,在基于所述第一比值、所述第二比值及所述第三比值,从所述候选簇内的轨迹点中确定所述订单的目标终点时,第六确定单元305进一步用于:In one embodiment, when determining the target end point of the order from the trajectory points in the candidate cluster based on the first ratio, the second ratio and the third ratio, the sixth determining unit 305 Further use for:

基于所述第一比值、所述第二比值、所述第三比值及预设评分策略,对所述候选簇内的轨迹点中的轨迹点进行打分;based on the first ratio, the second ratio, the third ratio and a preset scoring strategy, scoring the track points in the track points in the candidate cluster;

根据打分结果,从所述候选簇内的轨迹点中筛选出分数最高的轨迹点,确为所述订单的目标终点。According to the scoring result, the trajectory point with the highest score is selected from the trajectory points in the candidate cluster, which is indeed the target end point of the order.

由上可知,本申请实施例提供的数据处理装置,基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;对多个候选轨迹点进行聚类,得到多个聚类簇;基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标终点。本方案使用聚类算法和策略识别实际卸货点,可以提升货运场景下真实位置点挖掘的准确性。As can be seen from the above, the data processing device provided by the embodiment of the present application, based on the order data and the driver operation data, constrains the time and position of the trajectory points in the driving trajectory data, and determines a plurality of candidate trajectory points from the driving trajectory data; Cluster the candidate trajectory points to obtain multiple clusters; based on the order data and driver operation data, constrain the positions of the cluster center points of the multiple clusters, and determine the candidate clusters from the multiple clusters; Constrain the duration of the driver's stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determine the target destination of the order from the trajectory points in the candidate cluster. This scheme uses clustering algorithms and strategies to identify actual unloading points, which can improve the accuracy of real location point mining in freight scenarios.

在本申请又一实施例中还提供一种服务器。如图6所示,服务器400包括处理器401和存储器402。其中,处理器401与存储器402电性连接。In yet another embodiment of the present application, a server is also provided. As shown in FIG. 6 , the server 400 includes a processor 401 and a memory 402 . The processor 401 is electrically connected to the memory 402 .

处理器401是服务器400的控制中心,利用各种接口和线路连接整个服务器的各个部分,通过运行或加载存储在存储器402内的应用,以及调用存储在存储器402内的数据,执行服务器的各种功能和处理数据,从而对服务器进行整体监控。The processor 401 is the control center of the server 400, and uses various interfaces and lines to connect various parts of the entire server, and executes various functions of the server by running or loading the applications stored in the memory 402 and calling the data stored in the memory 402. function and process data for overall monitoring of the server.

在一实施例中,服务器400中的处理器401会按照如下的步骤,将一个或一个以上的应用的进程对应的指令加载到存储器402中,并由处理器401来运行存储在存储器402中的应用,从而实现各种功能:In one embodiment, the processor 401 in the server 400 loads the instructions corresponding to the processes of one or more applications into the memory 402 according to the following steps, and the processor 401 executes the instructions stored in the memory 402. application to achieve various functions:

获取订单数据和对应的行驶轨迹数据;Obtain order data and corresponding driving trajectory data;

基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;Based on the order data and the driver's operation data, constrain the time and position of the trajectory points in the driving trajectory data, and determine a plurality of candidate trajectory points from the driving trajectory data;

对多个候选轨迹点进行聚类,得到多个聚类簇;Clustering multiple candidate trajectory points to obtain multiple clusters;

基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;Based on the order data and the driver operation data, constrain the positions of the cluster center points of the plurality of clusters, and determine candidate clusters from the plurality of clusters;

对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标起点。Constraining the duration of the driver's stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster, and determining the target starting point of the order from the trajectory points in the candidate cluster.

在一实施方式中,在基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点时,处理器401可以执行以下操作:In one embodiment, when the time and position of the trajectory points in the driving trajectory data are constrained based on the order data and driver operation data, and a plurality of candidate trajectory points are determined from the driving trajectory data, the processor 401 may: Do the following:

根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段;According to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint, determine the starting point section of the order trajectory;

根据所述订单数据、所述司机操作数据、所述起点路段中各轨迹点的位置、所述起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述起点路段中确定多个候选轨迹点。According to the order data, the driver operation data, the position of each track point in the starting point road segment, the time of each track point in the starting point road segment, the first distance constraint condition and the second time constraint condition, from the starting point A plurality of candidate trajectory points are determined in the road segment.

在一实施方式中,所述司机操作数据包括:司机确认接单时间和司机确认到达起点时间;在根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段时,处理器401可以执行以下操作:In one embodiment, the driver operation data includes: the time when the driver confirms the receipt of the order and the time when the driver confirms the arrival at the starting point; the time according to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint Conditions, when determining the starting point segment of the order track, the processor 401 may perform the following operations:

基于所述行驶轨迹数据中各轨迹点的时间,选取司机确认接单时间对应的轨迹点、及司机确定到达起点前指定时间段内的轨迹点,作为订单轨迹的起点路段。Based on the time of each track point in the driving track data, the track point corresponding to the time when the driver confirms the order receipt and the track point within the specified time period before the driver determines to arrive at the starting point are selected as the starting point section of the order track.

在一实施方式中,所述订单数据包括:订单起点,所述司机操作数据包括:司机确认接单时间、司机确认到达起点时间、司机确认卸货点;在根据所述订单数据、所述司机操作数据、所述起点路段中各轨迹点的位置、所述起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述起点路段中确定多个候选轨迹点时,处理器401可以执行以下操作:In one embodiment, the order data includes: the starting point of the order, and the driver operation data includes: the driver confirms the order receiving time, the driver confirms the arrival time at the starting point, and the driver confirms the unloading point; data, the position of each trajectory point in the starting point road segment, the time of each trajectory point in the starting point road segment, the first distance constraint condition and the second time constraint condition, when multiple candidate trajectory points are determined from the starting point road segment, The processor 401 can perform the following operations:

基于所述行驶轨迹数据中各轨迹点与所述订单起点的距离、或者所述行驶轨迹数据中各轨迹点与所述司机确认卸货点的距离,从所述起点路段中筛选出满足第一距离约束条件的轨迹点;Based on the distance between each trajectory point in the driving trajectory data and the starting point of the order, or the distance between each trajectory point in the driving trajectory data and the driver's confirmed unloading point, the first distance is selected from the starting point road segment Constraint trajectory points;

基于所筛选出的轨迹点的时间与司机确认卸货时间的时间差、及所筛选出的轨迹点的时间与司机确认到达起点时间的时间差,从所筛选出的轨迹点中确定出时间差满足第二时间约束条件的轨迹点,作为所述候选轨迹点。Based on the time difference between the time of the selected trajectory point and the time when the driver confirms the unloading, and the time difference between the time of the selected trajectory point and the time when the driver confirms the arrival at the starting point, it is determined from the selected trajectory points that the time difference satisfies the second time The trajectory points of the constraints are used as the candidate trajectory points.

在一实施方式中,所述订单数据包括:订单起点,所述司机操作数据包括:司机确认卸货完成点;在基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇时,处理器401可以执行以下操作:In one embodiment, the order data includes: an order starting point, and the driver operation data includes: the driver confirms the unloading completion point; The position of the cluster center point is constrained, and when a candidate cluster is determined from multiple clusters, the processor 401 may perform the following operations:

获取簇中心点与订单起点的第一距离、及簇中心点与司机确认装货完成点的第二距离;Obtain the first distance between the cluster center point and the starting point of the order, and the second distance between the cluster center point and the point where the driver confirms the completion of loading;

分别根据第一距离和第二距离对所述多个聚类簇进行排序,得到第一排序结果和第二排序结果;Sorting the plurality of clusters according to the first distance and the second distance, respectively, to obtain a first sorting result and a second sorting result;

分别根据第一排序结果和第二排序结果,从多个聚类簇中筛选出排序满足第一排序约束条件的聚类簇;According to the first sorting result and the second sorting result, respectively, filter out the clusters that satisfy the first sorting constraint from the plurality of clusters;

将筛选出的聚类簇、及司机确认装货完成点所在的聚类簇,作为候选簇。The filtered cluster and the cluster where the driver confirms the loading completion point are used as candidate clusters.

在一实施方式中,在基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点时,处理器401具体可以执行以下操作:In one embodiment, when the time and position of the trajectory points in the driving trajectory data are constrained based on the order data and driver operation data, and multiple candidate trajectory points are determined from the driving trajectory data, the processor 401 specifically: You can do the following:

根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段;According to the driver operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint, determine the starting point section of the order trajectory;

根据所述订单数据、所述司机操作数据、所述起点路段中各轨迹点的位置、所述起点路段中各轨迹点的时间、第一距离约束条件及第二时间约束条件,从所述起点路段中确定多个候选轨迹点。According to the order data, the driver operation data, the position of each track point in the starting point road segment, the time of each track point in the starting point road segment, the first distance constraint condition and the second time constraint condition, from the starting point A plurality of candidate trajectory points are determined in the road segment.

在一实施方式中,司机操作数据包括:司机确认接单时间和司机确认到达起点时间;在根据所述司机操作数据、所述行驶轨迹数据中各轨迹点的时间、及第一时间约束条件,确定订单轨迹的起点路段时,处理器401具体可以执行以下操作:In one embodiment, the driver's operation data includes: the time when the driver confirms the receipt of the order and the time when the driver confirms the arrival at the starting point; according to the driver's operation data, the time of each trajectory point in the driving trajectory data, and the first time constraint, When determining the starting point segment of the order track, the processor 401 may specifically perform the following operations:

基于所述行驶轨迹数据中各轨迹点的时间,选取司机确认接单时间对应的轨迹点、及司机确定到达起点前指定时间段内的轨迹点,作为订单轨迹的起点路段。Based on the time of each track point in the driving track data, the track point corresponding to the time when the driver confirms the order receipt and the track point within the specified time period before the driver determines to arrive at the starting point are selected as the starting point section of the order track.

或者,在一实施例中,服务器400中的处理器401会按照如下的步骤,将一个或一个以上的应用的进程对应的指令加载到存储器402中,并由处理器401来运行存储在存储器402中的应用,从而实现各种功能:Alternatively, in one embodiment, the processor 401 in the server 400 loads the instructions corresponding to the processes of one or more applications into the memory 402 according to the following steps, and the processor 401 executes the instructions stored in the memory 402 applications in , so as to achieve various functions:

获取订单数据和对应的行驶轨迹数据;Obtain order data and corresponding driving trajectory data;

基于订单数据和司机操作数据,对所述行驶轨迹数据中轨迹点的时间和位置进行约束,从所述行驶轨迹数据中确定多个候选轨迹点;Based on the order data and the driver's operation data, constrain the time and position of the trajectory points in the driving trajectory data, and determine a plurality of candidate trajectory points from the driving trajectory data;

对多个候选轨迹点进行聚类,得到多个聚类簇;Clustering multiple candidate trajectory points to obtain multiple clusters;

基于所述订单数据和所述司机操作数据,对所述多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;Based on the order data and the driver operation data, constrain the positions of the cluster center points of the plurality of clusters, and determine candidate clusters from the plurality of clusters;

对司机在所述候选簇内各轨迹点上的停留时长、所述候选簇内各轨迹点的位置进行约束,从所述候选簇内的轨迹点中确定所述订单的目标终点。Constraining the duration of the driver's stay on each track point in the candidate cluster and the position of each track point in the candidate cluster, and determining the target end point of the order from the track points in the candidate cluster.

存储器402可用于存储应用和数据。存储器402存储的应用中包含有可在处理器中执行的指令。应用可以组成各种功能模块。处理器401通过运行存储在存储器402的应用,从而执行各种功能应用以及数据处理。Memory 402 may be used to store applications and data. The application stored in memory 402 contains instructions executable in the processor. Applications can be composed of various functional modules. The processor 401 executes various functional applications and data processing by executing applications stored in the memory 402 .

在一些实施例中,如图7所示,服务器400还包括:显示屏403、控制电路404、射频电路405、输入单元406以及电源407。其中,处理器401分别与显示屏403、控制电路404、射频电路405、输入单元406以及电源407电性连接。In some embodiments, as shown in FIG. 7 , the server 400 further includes: a display screen 403 , a control circuit 404 , a radio frequency circuit 405 , an input unit 406 and a power supply 407 . The processor 401 is electrically connected to the display screen 403 , the control circuit 404 , the radio frequency circuit 405 , the input unit 406 and the power supply 407 respectively.

显示屏403可用于显示由用户输入的信息或提供给用户的信息以服务器的各种图形用户接口,这些图形用户接口可以由图像、文本、图标、视频和其任意组合来构成。The display screen 403 may be used to display information input by the user or information provided to the user and various graphical user interfaces of the server, which may be composed of images, text, icons, videos, and any combination thereof.

控制电路404与显示屏403电性连接,用于控制显示屏403显示信息。The control circuit 404 is electrically connected to the display screen 403 for controlling the display screen 403 to display information.

射频电路405用于收发射频信号,以通过无线通信与电子设备或其他服务器构建无线通讯,与电子设备或其他服务器之间收发信号。The radio frequency circuit 405 is used for transmitting and receiving radio frequency signals, so as to construct wireless communication with the electronic device or other servers through wireless communication, and send and receive signals with the electronic device or other servers.

输入单元406可用于接收输入的数字、字符信息或用户特征信息(例如指纹),以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。其中,输入单元406可以包括指纹识别模组。Input unit 406 may be used to receive input numbers, character information, or user characteristic information (eg, fingerprints), and generate keyboard, mouse, joystick, optical, or trackball signal input related to user settings and function control. The input unit 406 may include a fingerprint identification module.

电源407用于给服务器400的各个部件供电。在一些实施例中,电源407可以通过电源管理系统与处理器401逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。The power supply 407 is used to power various components of the server 400 . In some embodiments, the power supply 407 may be logically connected to the processor 401 through a power management system, so as to implement functions such as managing charging, discharging, and power consumption through the power management system.

尽管图7中未示出,服务器400还可以包括扬声器、蓝牙模块、摄像头等,在此不再赘述。Although not shown in FIG. 7 , the server 400 may further include a speaker, a Bluetooth module, a camera, and the like, which will not be repeated here.

由上可知,本申请实施例提供的服务器,基于订单数据和司机操作数据,对行驶轨迹数据中轨迹点的时间和位置进行约束,从行驶轨迹数据中确定多个候选轨迹点;对多个候选轨迹点进行聚类,得到多个聚类簇;基于订单数据和司机操作数据,对多个聚类簇的簇中心点的位置进行约束,从多个聚类簇中确定出候选簇;对司机在候选簇内各轨迹点上的停留时长、候选簇内各轨迹点的位置进行约束,从候选簇内的轨迹点中确定订单的目标起点或目标终点。本方案使用聚类算法和策略识别实际装卸货点,可以提升货运场景下真实位置点挖掘的准确性。As can be seen from the above, the server provided by the embodiment of the present application, based on the order data and driver operation data, constrains the time and position of the trajectory points in the driving trajectory data, and determines multiple candidate trajectory points from the driving trajectory data; The trajectory points are clustered to obtain multiple clusters; based on the order data and driver operation data, the positions of the cluster center points of the multiple clusters are constrained, and candidate clusters are determined from the multiple clusters; The duration of stay on each trajectory point in the candidate cluster and the position of each trajectory point in the candidate cluster are constrained, and the target start point or target end point of the order is determined from the trajectory points in the candidate cluster. This solution uses clustering algorithms and strategies to identify the actual loading and unloading points, which can improve the accuracy of mining real location points in freight scenarios.

在一些实施例中,还提供了一种计算机可读存储介质,该存储介质中存储有多条指令,该指令适于由处理器加载以执行上述任一数据处理方法。In some embodiments, a computer-readable storage medium is also provided, the storage medium having stored therein a plurality of instructions, the instructions being adapted to be loaded by a processor to perform any of the above data processing methods.

本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括:只读存储器(ROM,Read Only Memory)、随机存取记忆体(RAM,RandomAccess Memory)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above embodiments can be completed by instructing relevant hardware through a program, and the program can be stored in a computer-readable storage medium, and the storage medium can include: Read Only Memory (ROM, Read Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk, etc.

以上对本申请实施例所提供的数据处理方法、装置、存储介质及服务器进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The data processing method, device, storage medium, and server provided by the embodiments of the present application have been described in detail above. The principles and implementations of the present application are described with specific examples in this paper. The descriptions of the above embodiments are only used to help Understand the method of the present application and its core idea; at the same time, for those skilled in the art, according to the idea of the present application, there will be changes in the specific implementation and application scope. In summary, the content of this specification should not be It is construed as a limitation of this application.

Claims (10)

1. A data processing method is applied to a freight scene, and is characterized by comprising the following steps:
acquiring order data and corresponding driving track data;
constraining the time and the position of a track point in the driving track data based on order data and driver operation data, and determining a plurality of candidate track points from the driving track data;
clustering the candidate track points to obtain a plurality of clustering clusters;
based on the order data and the driver operation data, constraining the positions of cluster center points of the plurality of cluster clusters, and determining candidate clusters from the plurality of cluster clusters;
and restricting the stay time of the driver on each track point in the candidate cluster and the position of each track point in the candidate cluster, and determining the target starting point of the order from the track points in the candidate cluster.
2. The data processing method according to claim 1, wherein the determining a plurality of candidate trajectory points from the travel trajectory data by constraining the time and position of the trajectory points in the travel trajectory data based on the order data and the driver operation data comprises:
determining a starting point road section of the order track according to the driver operation data, the time of each track point in the driving track data and a first time constraint condition;
and determining a plurality of candidate track points from the starting road section according to the order data, the driver operation data, the positions of the track points in the starting road section, the time of the track points in the starting road section, a first distance constraint condition and a second time constraint condition.
3. The data processing method of claim 2, wherein the driver operation data comprises: the driver confirms the order receiving time and the driver confirms the time to reach the starting point;
determining a starting point road section of the order track according to the driver operation data, the time of each track point in the driving track data and a first time constraint condition, wherein the method comprises the following steps:
and selecting track points corresponding to the order taking confirmation time of the driver and track points in a specified time period before the driver confirms to reach the starting point as the starting point road section of the order track based on the time of each track point in the driving track data.
4. The data processing method of claim 2, wherein the order data comprises: an order starting point, the driver operational data comprising: the driver confirms the order receiving time, the driver confirms the arrival starting time and the driver confirms the loading point;
determining a plurality of candidate track points from the starting point road section according to the order data, the driver operation data, the positions of the track points in the starting point road section, the time of the track points in the starting point road section, a first distance constraint condition and a second time constraint condition, and including:
based on the distance between each track point in the driving track data and the order starting point or the distance between each track point in the driving track data and the loading point confirmed by the driver, selecting track points meeting a first distance constraint condition from the starting point road section;
and determining the track point with the time difference meeting the second time constraint condition from the screened track point based on the time difference between the screened track point and the confirmation loading time of the driver and the time difference between the screened track point and the confirmation arrival starting point time of the driver, and taking the track point as the candidate track point.
5. The data processing method of claim 1, wherein the order data comprises: an order starting point, the driver operational data comprising: the driver confirms the loading completion point;
the constraining the positions of cluster center points of the plurality of clusters based on the order data and the driver operation data, and determining candidate clusters from the plurality of clusters, comprising:
acquiring a first distance between a cluster center point and an order starting point and a second distance between the cluster center point and a loading completion point confirmed by a driver;
sorting the plurality of clustering clusters according to the first distance and the second distance respectively to obtain a first sorting result and a second sorting result;
screening cluster clusters with the ordering meeting the first ordering constraint condition from the plurality of cluster clusters according to the first ordering result and the second ordering result respectively;
and taking the screened cluster and the cluster where the loading completion point is confirmed by the driver as candidate clusters.
6. The data processing method according to any one of claims 1 to 5, wherein the determining the target starting point of the order from the track points in the candidate cluster by constraining the stay time of the driver on the track points in the candidate cluster and the positions of the track points in the candidate cluster comprises:
respectively obtaining a third distance between each track point in the candidate cluster and an order starting point, a fourth distance between each track point in the candidate cluster and a driver cargo confirmation point, and a fifth distance between each track point in the candidate cluster and a driver cargo confirmation completion point;
acquiring a first ratio of the stay time to the third distance, a second ratio of the stay time to the fourth distance, and a third ratio of the stay time to the fifth distance;
and determining a target starting point of the order from the track points in the candidate cluster based on the first ratio, the second ratio and the third ratio.
7. The data processing method of claim 6, wherein determining the target starting point of the order from the track points in the candidate cluster based on the first ratio, the second ratio, and the third ratio comprises:
scoring the track points in the candidate clusters based on the first ratio, the second ratio, the third ratio and a preset scoring strategy;
and according to the scoring result, screening track points with the highest score from the track points in the candidate clusters, and determining the track points as the target starting points of the order.
8. A data processing method is applied to a freight scene, and is characterized by comprising the following steps:
acquiring order data and corresponding driving track data;
constraining the time and the position of a track point in the driving track data based on order data and driver operation data, and determining a plurality of candidate track points from the driving track data;
clustering the candidate track points to obtain a plurality of clustering clusters;
based on the order data and the driver operation data, the positions of cluster center points of the plurality of clusters are restrained, and candidate clusters are determined from the plurality of clusters;
and restricting the stay time of the driver on each track point in the candidate cluster and the position of each track point in the candidate cluster, and determining the target end point of the order from the track points in the candidate cluster.
9. A computer-readable storage medium having stored thereon a plurality of instructions adapted to be loaded by a processor to perform the data processing method of any of claims 1-8.
10. A server, comprising a processor and a memory, wherein the processor is electrically connected to the memory, and the memory is used for storing instructions and data; the processor is configured to perform the data processing method of any one of claims 1-8.
CN202210725134.4A 2022-06-24 2022-06-24 Data processing method, data processing device, storage medium and server Pending CN114943305A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210725134.4A CN114943305A (en) 2022-06-24 2022-06-24 Data processing method, data processing device, storage medium and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210725134.4A CN114943305A (en) 2022-06-24 2022-06-24 Data processing method, data processing device, storage medium and server

Publications (1)

Publication Number Publication Date
CN114943305A true CN114943305A (en) 2022-08-26

Family

ID=82910303

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210725134.4A Pending CN114943305A (en) 2022-06-24 2022-06-24 Data processing method, data processing device, storage medium and server

Country Status (1)

Country Link
CN (1) CN114943305A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116187883A (en) * 2022-12-22 2023-05-30 北京万集科技股份有限公司 Identification method and system for collection and distribution area, storage medium and electronic device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018120424A1 (en) * 2016-12-29 2018-07-05 平安科技(深圳)有限公司 Location service-based method, device, equipment for crowd classification, and storage medium
CN109948701A (en) * 2019-03-19 2019-06-28 太原科技大学 A data clustering method based on spatiotemporal correlation between trajectories
WO2021056303A1 (en) * 2019-09-26 2021-04-01 Beijing Didi Infinity Technology And Development Co., Ltd. Systems and methods for determining a pick-up location
CN112989222A (en) * 2021-03-04 2021-06-18 北京嘀嘀无限科技发展有限公司 Position determination method and device and electronic equipment
CN113052397A (en) * 2021-04-19 2021-06-29 北京百度网讯科技有限公司 Method and device for determining getting-on information, electronic equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018120424A1 (en) * 2016-12-29 2018-07-05 平安科技(深圳)有限公司 Location service-based method, device, equipment for crowd classification, and storage medium
CN109948701A (en) * 2019-03-19 2019-06-28 太原科技大学 A data clustering method based on spatiotemporal correlation between trajectories
WO2021056303A1 (en) * 2019-09-26 2021-04-01 Beijing Didi Infinity Technology And Development Co., Ltd. Systems and methods for determining a pick-up location
CN112989222A (en) * 2021-03-04 2021-06-18 北京嘀嘀无限科技发展有限公司 Position determination method and device and electronic equipment
CN113052397A (en) * 2021-04-19 2021-06-29 北京百度网讯科技有限公司 Method and device for determining getting-on information, electronic equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李泽: "基于交通大数据的智能挖掘与应用研究", 中国优秀硕士学位论文, no. 2, 15 February 2022 (2022-02-15), pages 034 - 553 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116187883A (en) * 2022-12-22 2023-05-30 北京万集科技股份有限公司 Identification method and system for collection and distribution area, storage medium and electronic device

Similar Documents

Publication Publication Date Title
US10268982B2 (en) Carrier and shipper interfacing and shipment tracking framework for efficient scheduling and transportation of cargo, with security monitoring and efficient payment to carriers
US20230088950A1 (en) Method and system for intelligent load optimization for vehicles
CN113222492B (en) Method and device for discriminating type of vehicle driving line, storage medium and terminal
CN110826951B (en) Transportation route loading method, device, electronic equipment and computer readable medium
CN111553548B (en) Goods picking method and device
WO2023024776A1 (en) Order delivery method, apparatus and system, and electronic device and computer-readable medium
US20210365680A1 (en) Systems and methods for determining trailer status
CN113988615B (en) Distribution method, system, equipment and storage medium based on inter-city logistics connection
US20200401958A1 (en) Systems and Methods for Improvements to Vehicle Routing Including Back-End Operations
CN111861081A (en) Order allocation method and device, electronic equipment and storage medium
CN113222205A (en) Path planning method and device
CN104881717A (en) Route recommend method and user terminal
CN115131093A (en) Abnormal order management method and related device
CN104252777A (en) Intelligent bidirectional real-time vehicle dispatching system
CN114943305A (en) Data processing method, data processing device, storage medium and server
CN112633587A (en) Logistics information processing method and device
CN113052397B (en) Method and device for determining boarding information, electronic equipment and storage medium
CN111507753A (en) Information pushing method and device and electronic equipment
CN115759307A (en) Order dispatching method and device, electronic equipment and storage medium
CN113095770B (en) Order processing method, order processing device, electronic equipment and readable storage medium
US11285962B2 (en) Methods and systems of one-click registration of drivers in a load transport network
CN114897428A (en) An order processing method, device, electronic device and storage medium
CN114648184A (en) Unmanned vehicle scheduling management method, electronic device and storage medium
CN113762841A (en) Waybill processing method and device, electronic equipment and storage medium
CN112990801A (en) Method, device and equipment for planning transportation path and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination