CN111814531A

CN111814531A - Method, apparatus, electronic device and storage medium for determining driver behavior

Info

Publication number: CN111814531A
Application number: CN202010307904.4A
Authority: CN
Inventors: 孙墨; 苟巍; 沈海峰
Original assignee: Beijing Didi Infinity Technology and Development Co Ltd
Current assignee: Beijing Didi Infinity Technology and Development Co Ltd
Priority date: 2020-04-17
Filing date: 2020-04-17
Publication date: 2020-10-23

Abstract

The present disclosure relates to methods, apparatus, electronic devices, and storage media for determining driver behavior. In one method, a sequence of images recording the state of the vehicle is acquired, the sequence of images being acquired by an image acquisition device deployed inside the vehicle. Changes in the state of the driver of the vehicle and changes in state of predetermined parts of the vehicle are detected based on the sequence of images. Based on the state change of the driver and the state change of the predetermined portion, the behavior of the driver to provide service to the occupants of the vehicle is determined. Further, corresponding apparatuses, electronic devices and storage media are provided. Using the above implementation manner, the acquired image sequence can be automatically analyzed without manual intervention, and then it can be determined whether the driver has performed the desired service behavior.

Description

Method, apparatus, electronic device and storage medium for determining driver behavior

技术领域technical field

本公开的各实现方式涉及图像处理，更具体地，涉及用于基于处理图像序列来确定司机行为的方法、装置、电子设备和存储介质。Various implementations of the present disclosure relate to image processing, and more particularly, to a method, apparatus, electronic device, and storage medium for determining driver behavior based on a sequence of processed images.

背景技术Background technique

随着计算机技术和网络技术的发展，目前已经开发出了用于提供在线车辆服务的应用。乘客可以利用这些应用来提交车辆服务请求，以便寻找适合的车辆并且前往期望的目的地。为了向乘客提供更加舒适的服务，司机可以执行某些类型的服务行为。例如，司机可以帮助乘车装卸行李，帮助乘客开关车门等。With the development of computer technology and network technology, applications for providing online vehicle services have been developed. Passengers can utilize these applications to submit vehicle service requests in order to find a suitable vehicle and travel to a desired destination. In order to provide more comfortable service to passengers, the driver may perform certain types of service behaviors. For example, the driver can help with loading and unloading luggage, helping passengers open and close doors, etc.

不同的司机可能会做出不同的行为，目前已经可以采集图像序列来记录司机的服务行为。然而，采集到的图像序列涉及大量数据，并且需要大量人工劳动来找到涉及司机服务行为的视频序列片段以便人工审核。此时，如何以更为有效的方式来处理图像序列，从而确定司机的服务行为是否得体进而衡量司机的服务水平，成为一个研究热点。Different drivers may behave differently, and it is now possible to collect image sequences to record driver service behaviors. However, the captured image sequences involve a large amount of data, and a lot of manual labor is required to find video sequence clips involving driver service behaviors for human review. At this time, how to process the image sequence in a more effective way, so as to determine whether the driver's service behavior is appropriate and then measure the driver's service level, has become a research hotspot.

发明内容SUMMARY OF THE INVENTION

期望能够开发并实现一种以更为有效的方式来确定司机的服务行为的技术方案。期望该技术方案能够与现有的应用相兼容，从而以更为有效的方式进行服务质量监控。It is desirable to develop and implement a technical solution for determining the service behavior of drivers in a more efficient manner. It is expected that the technical solution can be compatible with existing applications, so that service quality monitoring can be performed in a more effective manner.

根据本公开的第一方面，提供了一种用于确定司机行为的方法。在该方法中，获取记录车辆的状态的图像序列，图像序列是由部署在车辆内部的图像采集设备所采集的。基于图像序列检测车辆的司机的状态变化以及车辆的预定部分的状态变化。基于司机的状态变化和预定部分的状态变化，确定司机向车辆的乘客提供服务的行为。According to a first aspect of the present disclosure, there is provided a method for determining driver behavior. In the method, a sequence of images recording the state of the vehicle is acquired, the sequence of images being captured by an image capturing device deployed inside the vehicle. Changes in the state of the driver of the vehicle and changes in state of predetermined parts of the vehicle are detected based on the sequence of images. Based on the state change of the driver and the state change of the predetermined portion, the behavior of the driver to provide service to the occupants of the vehicle is determined.

根据本公开的第二方面，提供了一种用于确定司机行为的装置。该装置包括：获取模块，配置用于获取记录车辆的状态的图像序列，图像序列是由部署在车辆内部的图像采集设备所采集的；检测模块，配置用于基于图像序列检测车辆的司机的状态变化以及车辆的预定部分的状态变化；以及确定模块，配置用于基于司机的状态变化和预定部分的状态变化，确定司机向车辆的乘客提供服务的行为。According to a second aspect of the present disclosure, there is provided an apparatus for determining driver behavior. The device includes: an acquisition module configured to acquire an image sequence recording the state of the vehicle, the image sequence being acquired by an image acquisition device deployed inside the vehicle; a detection module configured to detect the state of the driver of the vehicle based on the image sequence a change and a change in state of a predetermined portion of the vehicle; and a determination module configured to determine an act of the driver providing a service to an occupant of the vehicle based on the change in state of the driver and the change in state of the predetermined portion.

根据本公开的第三方面，提供了一种电子设备，包括：存储器和处理器；其中存储器用于存储一条或多条计算机指令，其中一条或多条计算机指令被处理器执行以实现根据本公开的第一方面的方法。According to a third aspect of the present disclosure, there is provided an electronic device, comprising: a memory and a processor; wherein the memory is used to store one or more computer instructions, wherein the one or more computer instructions are executed by the processor to implement according to the present disclosure method of the first aspect.

根据本公开的第四方面，提供了一种计算机可读存储介质，其上存储有一条或多条计算机指令，其中一条或多条计算机指令被处理器执行实现根据本公开的第一方面的方法。According to a fourth aspect of the present disclosure, there is provided a computer-readable storage medium having stored thereon one or more computer instructions, wherein the one or more computer instructions are executed by a processor to implement the method according to the first aspect of the present disclosure .

附图说明Description of drawings

结合附图并参考以下详细说明，本公开各实现方式的特征、优点及其他方面将变得更加明显，在此以示例性而非限制性的方式示出了本公开的若干实现方式。在附图中：Features, advantages, and other aspects of various implementations of the present disclosure will become more apparent in conjunction with the accompanying drawings and with reference to the following detailed description, several implementations of which are shown here by way of example and not limitation. In the attached image:

图1示意性示出了其中可以使用根据本公开的示例性实现方式的车辆环境的框图；FIG. 1 schematically illustrates a block diagram of a vehicle environment in which exemplary implementations according to the present disclosure may be used;

图2示意性示出了根据本公开的示例性实现方式的用于确定司机行为的过程的框图；2 schematically illustrates a block diagram of a process for determining driver behavior in accordance with an exemplary implementation of the present disclosure;

图3示意性示出了根据本公开的示例性实现方式的用于确定司机行为的方法的流程图；FIG. 3 schematically shows a flowchart of a method for determining driver behavior according to an exemplary implementation of the present disclosure;

图4示意性示出了根据本公开的示例性实现方式的用于获取图像序列的框图；Figure 4 schematically illustrates a block diagram for acquiring a sequence of images according to an exemplary implementation of the present disclosure;

图5示意性示出了根据本公开的示例性实现方式的用于针对原始图像序列执行采样的框图；FIG. 5 schematically illustrates a block diagram for performing sampling on a sequence of raw images according to an exemplary implementation of the present disclosure;

图6A和6B分别示意性示出了根据本公开的示例性实现方式的图像序列中的不同图像中的司机区域的框图；6A and 6B respectively schematically illustrate block diagrams of driver regions in different images in a sequence of images according to an exemplary implementation of the present disclosure;

图7A、7B和7C分别示意性示出了根据本公开的示例性实现方式的图像序列中的不同图像中的后车窗区域的框图；Figures 7A, 7B and 7C respectively schematically illustrate block diagrams of rear window regions in different images in a sequence of images according to an exemplary implementation of the present disclosure;

图8示意性示出了根据本公开的示例性实现方式的图像序列中的图像中的车门区域的框图；以及FIG. 8 schematically illustrates a block diagram of a vehicle door area in an image in a sequence of images in accordance with an exemplary implementation of the present disclosure; and

图9示意性示出了根据本公开的示例性实现的用于确定司机行为的设备的框图。9 schematically illustrates a block diagram of an apparatus for determining driver behavior according to an exemplary implementation of the present disclosure.

具体实施方式Detailed ways

下面将参照附图更详细地描述本公开的优选实现。虽然附图中显示了本公开的优选实现，然而应该理解，可以以各种形式实现本公开而不应被这里阐述的实现所限制。相反，提供这些实现是为了使本公开更加透彻和完整，并且能够将本公开的范围完整地传达给本领域的技术人员。Preferred implementations of the present disclosure will be described in more detail below with reference to the accompanying drawings. While preferred implementations of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the implementations set forth herein. Rather, these implementations are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

在本文中使用的术语“包括”及其变形表示开放性包括，即“包括但不限于”。除非特别申明，术语“或”表示“和/或”。术语“基于”表示“至少部分地基于”。术语“一个示例实现”和“一个实现”表示“至少一个示例实现”。术语“另一实现”表示“至少一个另外的实现”。术语“第一”、“第二”等等可以指代不同的或相同的对象。下文还可能包括其他明确的和隐含的定义。As used herein, the term "including" and variations thereof mean open-ended inclusion, ie, "including but not limited to". The term "or" means "and/or" unless specifically stated otherwise. The term "based on" means "based at least in part on". The terms "one example implementation" and "one implementation" mean "at least one example implementation." The term "another implementation" means "at least one additional implementation." The terms "first", "second", etc. may refer to different or the same objects. Other explicit and implicit definitions may also be included below.

首先参见图1描述本公开的示例性实现方式的应用环境。目前已经开发出了诸如在线车辆服务的应用。车辆服务公司可以指定司机应该遵守的服务标准。例如，可以指定司机应当为乘客开关车门，当乘客携带行李时应当帮助乘客向后备箱装卸行李，等等。车辆通常分布在城市道路中的各个位置，并且司机的服务行为通常发生在车外，难以基于集中式方式监控司机的服务水平。The application environment of an exemplary implementation of the present disclosure will be described first with reference to FIG. 1 . Applications such as online vehicle services have been developed. Vehicle service companies can specify service standards that drivers should adhere to. For example, it can be specified that the driver should open and close the doors for the passenger, help the passenger load and unload luggage into the trunk when the passenger is carrying luggage, and so on. Vehicles are usually distributed in various locations on urban roads, and the driver's service behavior usually occurs outside the vehicle, so it is difficult to monitor the driver's service level based on a centralized way.

出于安全性或者其他因素考虑，目前已经提出在车辆内部部署图像采集设备来记录车辆状态的技术方案。尽管采集到的图像序列可以记录司机的行为，由于采集到的图像序列通常跨越较大时间段(例如，每天8个小时)，此时如何逐一分析司机为每个乘客提供的服务行为成为一个技术难题。尤其是，当车辆服务应用的客服人员接到来自乘客的投诉(例如，抱怨司机没有帮助装卸行李和/或开关车门)时，客服人员不得不从大量图像序列中人工查看以便确认司机的服务行为。For reasons of safety or other factors, a technical solution has been proposed to deploy an image acquisition device inside the vehicle to record the state of the vehicle. Although the acquired image sequence can record the driver's behavior, since the acquired image sequence usually spans a large time period (for example, 8 hours a day), how to analyze the service behavior provided by the driver to each passenger one by one becomes a technical problem at this time. problem. In particular, when a customer service agent for a vehicle service application receives a complaint from a passenger (for example, complaining that the driver is not helping with loading and unloading luggage and/or opening and closing doors), the customer service agent has to manually look through a large sequence of images in order to confirm the driver's service behavior .

为了至少部分地解决上述问题以及其他潜在问题中的一个或者多个问题，本公开的示例性实现方式提出了一种用于确定司机行为的技术方案。图1示意性示出了其中可以使用根据本公开的示例性实现方式的车辆环境的框图100。可以在车辆内部部署图像采集设备，以便采集如图1所示的车辆状态的图像序列。图像110示意性示出了图像序列中的一个图像。根据本公开的示例性实现方式，可以分析采集到的图像序列，进而判断司机是否执行了规定的服务行为。To at least partially address one or more of the above-mentioned problems and other potential problems, exemplary implementations of the present disclosure propose a technical solution for determining driver behavior. FIG. 1 schematically illustrates a block diagram 100 of a vehicle environment in which exemplary implementations according to the present disclosure may be used. Image acquisition equipment can be deployed inside the vehicle in order to acquire a sequence of images of the state of the vehicle as shown in FIG. 1 . Image 110 schematically shows one image in a sequence of images. According to the exemplary implementation of the present disclosure, the acquired image sequence can be analyzed to determine whether the driver has performed the prescribed service behavior.

首先参见图2描述本公开的示例性实现方式的概要。图2示意性示出了根据本公开的示例性实现方式的用于确定司机行为的过程的框图200。如图2所示，可以获取图像序列210，并且基于图像序列210来确定与车辆相关的状态变化220。状态变化220可以包括两方面：车辆司机的状态变化222和车辆预定部分(例如，后备箱和/或车门)的状态变化224。继而，可以基于采集到的上述状态变化220来确定司机的服务行为230。利用本公开的示例性实现方式，可以在无需人工干预的情况下，自动分析采集到的图像序列210，进而确定司机是否执行了期望的服务行为。An overview of an exemplary implementation of the present disclosure is described first with reference to FIG. 2 . FIG. 2 schematically illustrates a block diagram 200 of a process for determining driver behavior in accordance with an exemplary implementation of the present disclosure. As shown in FIG. 2 , a sequence of images 210 may be acquired, and based on the sequence of images 210 , a state change 220 associated with the vehicle may be determined. The state change 220 may include two aspects: a state change 222 of the driver of the vehicle and a state change 224 of a predetermined portion of the vehicle (eg, the trunk and/or doors). Then, the driver's service behavior 230 may be determined based on the collected state changes 220 described above. With exemplary implementations of the present disclosure, the acquired image sequence 210 can be automatically analyzed without manual intervention to determine whether the driver has performed the desired service behavior.

在下文中，将参见图3描述本公开的示意性实现方式的更多细节。图3示意性示出了根据本公开的示例性实现方式的用于确定司机行为的方法300的流程图。在框310处，获取记录车辆的状态的图像序列210，图像序列210是由部署在车辆内部的图像采集设备所采集的。将会理解，可以在车辆内部部署视频采集设备，例如，可以在车辆后视镜或者其他位置处部署诸如摄像头等采集设备。可以向数据中心实时地传输采集到的图像序列210，和/或可以定期传输。Hereinafter, more details of an exemplary implementation of the present disclosure will be described with reference to FIG. 3 . FIG. 3 schematically illustrates a flowchart of a method 300 for determining driver behavior according to an exemplary implementation of the present disclosure. At block 310, a sequence of images 210 recording the state of the vehicle is acquired, the sequence of images 210 being captured by an image capture device deployed inside the vehicle. It will be appreciated that video capture devices may be deployed inside the vehicle, for example, capture devices such as cameras may be deployed in vehicle rear view mirrors or other locations. The acquired image sequence 210 may be transmitted to the data center in real-time, and/or may be transmitted periodically.

司机为乘客提供开关车门和装卸行李的服务总是出现在乘客上车和下车前后的时间段内。可以基于司机提供车辆服务的时间段，来查找与上述服务相关的图像序列在原始图像序列中的位置。具体地，可以获取司机接受来自乘客的车辆服务请求的开始时间、以及司机完成车辆服务请求的结束时间点。基于上述两个时间点，可以更加准确地找到有待分析的图像序列的位置，进而避免分析整个原始图像序列的情况。以此方式，可以提高处理效率并且降低时间开销。The driver's service of opening and closing doors and loading and unloading luggage for passengers always occurs during the time period before and after passengers get on and off. The location of the image sequence in the original image sequence related to the above-mentioned service can be found based on the time period during which the driver provided the vehicle service. Specifically, the start time when the driver accepts the vehicle service request from the passenger and the end time when the driver completes the vehicle service request can be obtained. Based on the above two time points, the position of the image sequence to be analyzed can be found more accurately, thereby avoiding the situation of analyzing the entire original image sequence. In this way, processing efficiency can be improved and time overhead can be reduced.

图4示意性示出了根据本公开的示例性实现方式的用于获取图像序列的框图400。如图4所示，原始图像序列410表示从图像采集设备中采集到的原始数据，并且时间轴表示司机提供车辆服务期间的时间跨度。可以从车辆服务应用获取司机接受车辆服务请求的开始时间420，并且可以选择开始时间420之后预定时间范围内的图像序列422。Figure 4 schematically illustrates a block diagram 400 for acquiring a sequence of images according to an exemplary implementation of the present disclosure. As shown in FIG. 4, the raw image sequence 410 represents raw data collected from the image capture device, and the time axis represents the time span during which the driver provides vehicle service. The start time 420 at which the driver accepts the vehicle service request may be obtained from the vehicle service application, and a sequence of images 422 within a predetermined time range after the start time 420 may be selected.

通常而言，在接受车辆服务请求之后，司机前往乘客指定的出发地将会花费一段时间。此时，可以将预定时间范围指定为开始时间420之后的例如2分钟至5分钟。根据本公开的示例性实现方式，可以基于司机接受请求的位置与出发地之间的预期行驶时间来指定预定时间范围。预期时间越长，则预定时间范围与开始时间420的差异越大。利用本公开的示例性实现方式，为了确定司机在开始服务时是否帮助乘客装载行李和/或开关车门，仅需要分析图像序列422即可，从而避免了分析整个原始图像序列410的各种开销。Typically, after accepting a vehicle service request, it will take a while for the driver to travel to the passenger's designated departure point. At this time, the predetermined time range may be designated as, for example, 2 minutes to 5 minutes after the start time 420 . According to an exemplary implementation of the present disclosure, the predetermined time range may be specified based on the expected travel time between the location where the driver accepts the request and the point of departure. The longer the expected time, the greater the difference between the predetermined time range and the start time 420. With exemplary implementations of the present disclosure, only the image sequence 422 needs to be analyzed in order to determine whether a driver assists a passenger with loading luggage and/or opening and closing doors when commencing service, thereby avoiding the various overheads of analyzing the entire raw image sequence 410 .

根据本公开的示例性实现方式，如图4所示，可以获取司机完成车辆服务请求的结束时间430，并且可以选择结束时间430之后预定时间范围内的图像序列432。例如，可以将预定时间范围指定为从结束时间430开始的数分钟。以此方式，仅需要分析图像序列432即可确定司机在结束服务时是否帮助乘客取出行李和/或开关车门，从而避免了分析整个原始图像序列410的各种开销。According to an exemplary implementation of the present disclosure, as shown in FIG. 4 , an end time 430 at which the driver completes the vehicle service request may be obtained, and an image sequence 432 within a predetermined time range after the end time 430 may be selected. For example, the predetermined time range may be specified as a few minutes from the end time 430 . In this way, only the sequence of images 432 needs to be analyzed to determine whether the driver assists the passenger with unloading luggage and/or opening and closing doors at the end of service, thereby avoiding the various overheads of analyzing the entire sequence of raw images 410 .

利用本公开的示例性实现方式，可以准确地定位可能包括服务行为的图像序列在原始图像序列中的位置。以此方式，可以降低待处理的图像序列的数据量并且提高处理效率。With exemplary implementations of the present disclosure, it is possible to accurately locate a sequence of images that may include service behaviors in the original sequence of images. In this way, the data volume of the image sequence to be processed can be reduced and the processing efficiency improved.

根据本公开的示例性实现方式，从图像采集设备采集到的原始图像序列410可能跨越较长时间。如果针对原始图像序列410中的每个图像进行处理将会占用大量时间和处理资源。为了降低处理时间和处理资源方面的开销，可以按照预定时间间隔，从原始图像序列410中执行采样以形成待处理的图像序列。According to exemplary implementations of the present disclosure, the raw image sequence 410 acquired from the image acquisition device may span a long period of time. Processing for each image in the raw image sequence 410 would take a significant amount of time and processing resources. In order to reduce overhead in terms of processing time and processing resources, sampling may be performed from the original image sequence 410 at predetermined time intervals to form the image sequence to be processed.

图5示意性示出了根据本公开的示例性实现方式的用于针对原始图像序列410执行采样的框图500。如图5所示，可以指定预定的时间间隔，例如指定从连续的N(N为整数，例如N＝4)帧图像中采样一帧图像。可以从原始图像序列510中选择第1帧、第N+1帧、第2N+1帧，等等，并且基于选择的各个图像帧来生成图像序列510。利用本公开的示例性实现方式，可以通过采样技术来降低待处理图像序列的数据量。以此方式，可以在不影响检测效果的情况下降低处理时间和处理资源的开销，进而提高处理效率。FIG. 5 schematically illustrates a block diagram 500 for performing sampling on an original image sequence 410 according to an exemplary implementation of the present disclosure. As shown in FIG. 5 , a predetermined time interval can be specified, for example, it is specified to sample one frame of images from consecutive N (N is an integer, for example, N=4) frame images. The 1 st frame, the N+1 th frame, the 2N+1 th frame, etc. may be selected from the original image sequence 510, and the image sequence 510 may be generated based on the selected respective image frames. With the exemplary implementation of the present disclosure, the amount of data of the image sequence to be processed can be reduced by sampling techniques. In this way, processing time and processing resource overhead can be reduced without affecting the detection effect, thereby improving processing efficiency.

根据本公开的示例性实现方式，可以结合如图4和图5所示的方法。具体地址，可以首先确定与服务开始和服务结束相关联的图像序列422和图像序列432。进一步，可以在图像序列422和432中进行采样，以便进一步降低待分析的图像序列的数据量。According to an exemplary implementation of the present disclosure, the methods shown in FIGS. 4 and 5 may be combined. For specific addresses, the image sequence 422 and the image sequence 432 associated with the service start and service end may be determined first. Further, sampling may be performed in the image sequences 422 and 432 in order to further reduce the data volume of the image sequences to be analyzed.

上文已经参加图4和图5描述了如何获取图像序列，在下文中，将描述如何确定状态变化220。返回图3，在框320处，基于图像序列210检测车辆司机的状态变化222以及车辆的预定部分的状态变化224。根据本公开的示例性实现方式，可以将图像序列210中的各个图像划分为不同的区域，以便检测各个区域中的状态变化。首先参见图6A和6B描述如何检测司机的状态变化222。How the sequence of images is acquired has been described above with reference to FIGS. 4 and 5 , in the following, how the state change 220 is determined will be described. Returning to FIG. 3 , at block 320 , a change in state 222 of the driver of the vehicle and a change in state 224 of a predetermined portion of the vehicle are detected based on the sequence of images 210 . According to an exemplary implementation of the present disclosure, each image in the sequence of images 210 may be divided into different regions in order to detect state changes in the various regions. Referring first to Figures 6A and 6B, it will be described how to detect the state change 222 of the driver.

图6A示意性示出了根据本公开的示例性实现方式的图像序列210中一个图像610A中的司机区域612的框图600A。如图6A所示，在图像序列210中，可以标识与车辆的驾驶座位相关联的司机区域612。可以通过监视司机区域612中的图像的变化，来检测司机的状态变化222。司机可以在出发地等候乘客，当乘客到来后司机可以下车并且帮助乘客装卸行李和/或开关车门。此时，可以检测司机区域612中是否包括司机的图像来确定司机是否下车。根据本公开的示例性实现方式，可以基于人脸识别的方式来检测司机是否下车。可以从图像序列210中检测包括司机图像的第一图像子序列。此时，在第一图像子序列的图像中司机位于司机区域中。FIG. 6A schematically illustrates a block diagram 600A of the driver area 612 in one image 610A of the sequence of images 210 in accordance with an exemplary implementation of the present disclosure. As shown in FIG. 6A, in the sequence of images 210, a driver area 612 associated with the driver's seat of the vehicle may be identified. The driver's state change 222 may be detected by monitoring changes in the image in the driver area 612 . The driver can wait for the passenger at the departure point, and when the passenger arrives, the driver can get off the bus and help the passenger load and unload luggage and/or open and close the doors. At this time, whether the driver's image is included in the driver area 612 may be detected to determine whether the driver gets off the vehicle. According to an exemplary implementation of the present disclosure, whether the driver gets off the vehicle can be detected based on a face recognition method. A first subsequence of images including driver images may be detected from the sequence of images 210 . In this case, the driver is located in the driver area in the images of the first image subsequence.

根据本公开的示例性实现方式，可以基于机器学习技术来识别司机的人脸。例如，可以基于目前已经开发的或者将在未来开发的多种人脸识别模型，来在图像序列中检测包括司机人脸的图像子序列以及不包括司机人脸的图像子序列。According to an exemplary implementation of the present disclosure, the driver's face may be recognized based on machine learning techniques. For example, image subsequences that include the driver's face and image subsequences that do not include the driver's face can be detected in the image sequence based on various face recognition models that have been developed or will be developed in the future.

可以继续分析第一图像子序列之后的图像序列，如果在第一图像子序列之后检测到不包括司机图像的第二图像子序列(此时，在第二图像子序列的图像中司机不在司机区域中)，可以确定司机的状态变化为“下车”。图6B示意性示出了根据本公开的示例性实现方式的图像序列中的另一图像中的司机区域的框图600B。如果确定后续的图像序列中不包括司机的人脸图像，则可以确定司机已经下车。Analysis of the image sequence following the first image subsequence can continue, if a second image subsequence that does not include the driver image is detected after the first image subsequence (at this point, the driver is not in the driver area in the images of the second image subsequence ), it can be determined that the status of the driver changes to "get off". 6B schematically illustrates a block diagram 600B of a driver area in another image in a sequence of images in accordance with an exemplary implementation of the present disclosure. If it is determined that the subsequent image sequence does not include the driver's face image, it can be determined that the driver has gotten off the vehicle.

根据本公开的示例性实现方式，为了避免出现由于司机扭头或者弯腰等动作导致的误判，可以指定时间长度阈值。例如，如果确定第二图像子序列中的图像所跨的时间长度高于时间长度阈值，则可以确定司机的状态变化为“下车”。具体地址，可以指定该时间长度阈值为0.5秒或者其他数值。备选地和/或附加地，还可以以图像帧的数量为单位来指定时间长度阈值。According to an exemplary implementation of the present disclosure, in order to avoid misjudgment caused by actions such as the driver turning his head or bending over, a time length threshold may be specified. For example, if it is determined that the images in the second sub-sequence of images span a time length above a time length threshold, it may be determined that the driver's state change is "getting off." For specific addresses, the time length threshold can be specified as 0.5 seconds or other values. Alternatively and/or additionally, the time length threshold may also be specified in units of image frames.

利用本公开的示例性实现方式，并不需要采集车辆外部的信息，可以仅基于车辆内部的状态来检测司机是否下车。进一步，使用图像子序列而不是单一图像帧来判断司机是否下车，可以降低出现误判的可能性，进而为确定司机的服务行为提供准确依据。With the exemplary implementation of the present disclosure, it is not necessary to collect information outside the vehicle, and it is possible to detect whether the driver gets off the vehicle based only on the state inside the vehicle. Further, using image sub-sequences instead of a single image frame to determine whether a driver gets off the bus can reduce the possibility of misjudgment, thereby providing an accurate basis for determining the driver's service behavior.

根据本公开的示例性实现方式，可以为车辆的预定部分指定预定区域。在此，预定部分可以包括车辆的后备箱和车门。后备箱的状态变化可以用于检测司机是否帮助乘客装卸行李，而车门的状态变化可以用于检测司机是否帮助乘客开关车门。具体地，在图像序列210中，可以标识与车辆的预定部分相关联的预定区域。在下文中，首先参见图7A至图7C描述如何确定后备箱的状态变化。According to an exemplary implementation of the present disclosure, a predetermined area may be designated for a predetermined portion of the vehicle. Here, the predetermined portion may include a trunk and a door of the vehicle. The state change of the trunk can be used to detect whether the driver helps the passenger load and unload luggage, and the state change of the door can be used to detect whether the driver helps the passenger to open and close the door. Specifically, in the sequence of images 210, a predetermined area associated with a predetermined portion of the vehicle may be identified. Hereinafter, how to determine the state change of the trunk is first described with reference to FIGS. 7A to 7C .

图7A示意性示出了根据本公开的示例性实现方式的图像序列中的图像710A中的后车窗区域的框图700A。将会理解，基于图像序列210，仅能从后车窗中查看后备箱是否被打开。为了确定后备箱的状态变化，在图像序列210中可以指定后车窗区域712。在司机已经下车后的图像序列中，可以基于后车窗区域712来检测后备箱的状态变化。7A schematically illustrates a block diagram 700A of a rear window area in image 710A in an image sequence in accordance with an exemplary implementation of the present disclosure. It will be appreciated that, based on the sequence of images 210, it is only possible to see from the rear window whether the trunk is opened. In order to determine the state change of the trunk, a rear window area 712 may be specified in the image sequence 210 . In the sequence of images after the driver has exited the vehicle, a change in the state of the trunk may be detected based on the rear window area 712 .

具体地，可以从图像序列中检测后备箱处于关闭状态的第三图像子序列(在第三图像子序列的图像中后车窗区域712中，后备箱处于关闭状态)。如图7A所示，此时后车窗区域712显示后备箱处于关闭状态。Specifically, a third image subsequence in which the trunk is in a closed state can be detected from the image sequence (in the rear window area 712 in the images of the third image subsequence, the trunk is in a closed state). As shown in FIG. 7A , the rear window area 712 shows that the trunk is in a closed state at this time.

根据本公开的示例性实现方式，可以基于机器学习技术来确定后备箱的状态。例如，可以预先采集包括多种车型的后备箱处于关闭、打开过程中、以及打开状态的图像的训练样本，并且可以基于训练样本来生成描述后备箱状态的判断模型。可以将图像序列中的图像输入模型，以便确定该图像中的后备箱的状态。According to an exemplary implementation of the present disclosure, the state of the trunk may be determined based on machine learning techniques. For example, training samples including images of trunks of various models in closed, in the process of opening, and open states may be collected in advance, and a judgment model describing the state of the trunk may be generated based on the training samples. An image from a sequence of images can be fed into the model in order to determine the state of the trunk in that image.

继而，可以在第三图像子序列之后的图像序列中，继续寻找是否存在后备箱被打开的图像。如果在第三图像子序列之后检测到后备箱被打开的第四图像子序列(换言之，在第四图像子序列的图像中后车窗区域712中的后备箱处于打开状态)，可以确定后备箱的状态变化为“打开”。Then, in the image sequence after the third image subsequence, it is possible to continue to search for whether there is an image in which the trunk is opened. If a fourth image subsequence with the trunk open (in other words, the trunk in the rear window area 712 is open in the images of the fourth image subsequence) is detected after the third image subsequence, it may be determined that the trunk The status changes to "Open".

根据本公开的示例性实现方式，为了进一步提高检测的准确性，还可以在图像序列中检测后备箱处于打开过程中的状态。具体地，可以在第三图像子序列和第四图像子序列之间检测包括后备箱处于打开过程中的图像子序列。图7B示意性示出了根据本公开的示例性实现方式的图像序列中的图像710B中的后车窗区域712的框图700B。如图7B所示，后车窗区域712显示后备箱处于打开过车中并且已经被部分打开。According to an exemplary implementation of the present disclosure, in order to further improve the detection accuracy, a state in which the trunk is in the process of being opened may also be detected in the image sequence. Specifically, the image subsequence including the trunk being opened may be detected between the third image subsequence and the fourth image subsequence. 7B schematically illustrates a block diagram 700B of a rear window area 712 in image 710B in an image sequence in accordance with an exemplary implementation of the present disclosure. As shown in Figure 7B, the rear window area 712 shows that the trunk is in the open vehicle and has been partially opened.

根据本公开的示例性实现方式，如果在第三图像子序列和第四图像子序列之间检测到第五图像子序列(在第五图像子序列的图像中后车窗区域712中的后备箱处于打开过程中)，则可以确定后备箱的状态变化为“打开”。According to an exemplary implementation of the present disclosure, if a fifth image subsequence (the trunk in the rear window area 712 in the image of the fifth image subsequence is detected between the third image subsequence and the fourth image subsequence is in the process of opening), it can be determined that the status of the trunk changes to "open".

图7C示意性示出了根据本公开的示例性实现方式的图像序列中的图像710C中的后车窗区域712的框图700C。如图7C所示，后车窗区域712显示后备箱已经被完全打开。此时，基于表示后备箱处于关闭状态的第三图像子序列、表示后备箱处于打开过程中的第五图像子序列以及表示后备箱处于打开状态的第四图像子序列，可以确定后备箱的“打开”状态变化。7C schematically illustrates a block diagram 700C of a rear window area 712 in image 710C in an image sequence in accordance with an exemplary implementation of the present disclosure. As shown in Figure 7C, the rear window area 712 shows that the trunk has been fully opened. At this time, based on the third image subsequence indicating that the trunk is closed, the fifth image subsequence indicating that the trunk is in the process of opening, and the fourth image subsequence indicating that the trunk is open, the " Open" state changes.

利用本公开的示例性实现方式，只需要确定图像序列中的后车窗区域712中的后备箱的状态，并且比较具有时序关系的多个状态，即可准确确定后备箱的状态变化。With the exemplary implementation of the present disclosure, it is only necessary to determine the state of the trunk in the rear window area 712 in the image sequence, and compare multiple states with a time sequence relationship to accurately determine the state change of the trunk.

根据本公开的示例性实现方式，为了提高检测的准确性，还可以进一步判断在后备箱打开过程期间，司机是否出现在在后车窗区域712中。例如，可以通过比较在车内采集的司机的衣着与在后车窗区域712中出现的人物的衣着，来进一步确定司机是否帮助乘客装卸行李。又例如，可以通过在后车窗区域712中进行司机人脸识别，来判断司机的服务行为。将会理解，因为后车窗外的景象会不断变化，例如云彩阴影遮挡，后车跟进的遮挡等特殊情况，通过状态变化的时序来检测司机动作，可以提供更高的准确性。According to an exemplary implementation of the present disclosure, in order to improve the detection accuracy, it may be further determined whether the driver appears in the rear window area 712 during the trunk opening process. For example, it may be further determined whether the driver assists the passenger in loading and unloading luggage by comparing the clothing of the driver collected in the vehicle with the clothing of the person appearing in the rear window area 712 . For another example, the driver's service behavior can be determined by performing face recognition of the driver in the rear window area 712 . It will be understood that because the scene outside the rear window will constantly change, such as the shadow of clouds, the occlusion of the following car and other special situations, detecting the driver's actions through the timing of state changes can provide higher accuracy.

上文已经参见图7A至图7C描述了如何确定后备箱的状态变化。可以以类似方式确定车辆车门的状态变化，以便确定司机是否为乘客开关车门。图8示意性示出了根据本公开的示例性实现方式的图像序列中的图像800中的车门区域的框图800。如图所示，可以为车门指定车门区域812。How to determine the state change of the trunk has been described above with reference to FIGS. 7A to 7C . A change in state of a vehicle door may be determined in a similar manner to determine whether the driver opens or closes the door for a passenger. FIG. 8 schematically shows a block diagram 800 of a vehicle door area in an image 800 in an image sequence according to an exemplary implementation of the present disclosure. As shown, a door area 812 may be designated for the door.

根据本公开的示例性实现方式，可以基于机器学习技术来确定车门区域812中的车门的状态。例如，可以预先采集包括多种车型的车门处于关闭、打开过程中、以及打开状态的图像的训练样本，并且可以基于训练样本来生成描述车门状态的判断模型。可以将图像序列中的图像输入模型，以便确定该图像中的车门的状态。利用本公开的示例性实现方式，只需要确定图像序列中的车门区域812中的车门的状态，并且比较具有时序关系的多个状态，即可准确确定车门的状态变化。According to example implementations of the present disclosure, the state of the doors in the door area 812 may be determined based on machine learning techniques. For example, training samples including images of doors of various vehicle types being closed, in the process of opening, and open may be collected in advance, and a judgment model describing the state of the doors may be generated based on the training samples. An image in the sequence of images can be fed into the model to determine the state of the door in the image. With the exemplary implementation of the present disclosure, it is only necessary to determine the state of the vehicle door in the vehicle door area 812 in the image sequence, and to compare multiple states having a time sequence relationship, the state change of the vehicle door can be accurately determined.

上文已经参见图6A、6B、7A、7B、7C以及图8描述了如何确定司机的状态变化222和预定部分的状态变化224。下文将返回图3描述如何确定司机的行为。在图3的框330处，基于司机的状态变化222和预定部分的状态变化224，确定司机向车辆的乘客提供服务的行为。将会理解，可以指定司机应该遵守的服务标准。例如，可以指定司机应当为乘客开关车门，当乘客携带行李时应当帮助乘客向后备箱装卸行李。因而，此时的行为可以包括为乘客装卸行李的行为以及为乘客开关车门的行为中的至少任一项。利用本公开的示例性实现方式，通过分析图像序列的方式，可以自动确定司机的服务行为，并且审查司机的行为是否符合多种类型的服务规范。以此方式，可以减低人工审查的人力和时间开销。How to determine the state change 222 of the driver and the state change 224 of the predetermined portion has been described above with reference to FIGS. 6A , 6B, 7A, 7B, 7C and 8 . The following will return to FIG. 3 to describe how the driver's behavior is determined. At block 330 of FIG. 3, based on the state change 222 of the driver and the state change 224 of the predetermined portion, the behavior of the driver to provide service to the occupants of the vehicle is determined. It will be understood that the service standard to which the driver should adhere can be specified. For example, it can be specified that the driver should open and close the doors for the passenger, and should help the passenger load and unload luggage into the trunk when the passenger is carrying luggage. Therefore, the behavior at this time may include at least any one of the behavior of loading and unloading luggage for the passenger and the behavior of opening and closing the vehicle door for the passenger. With exemplary implementations of the present disclosure, driver service behavior can be automatically determined and reviewed for compliance with various types of service specifications by analyzing image sequences. In this way, the labor and time overhead of manual review can be reduced.

根据本公开的示例性实现方式，如果按顺序检测到“下车”和“打开”相关的状态改变，则可以确定司机执行了规定的行为。在一个示例中，如果在图像序列210中检测到司机下车、车门被打开，则可以确定司机为乘客开关车门。在另一个示例中，如果在图像序列中检测到司机下车、后备箱被打开，则可以确定司机为乘客装卸行李。根据本公开的示例性实现方式，如果仅检测到“打开”相关的状态变化，但是没有检测到司机“下车”，可以确定司机没有执行规定的服务行为。According to an exemplary implementation of the present disclosure, if state changes related to 'getting off' and 'opening' are detected in sequence, it may be determined that the driver performed a prescribed action. In one example, if it is detected in the image sequence 210 that the driver gets out of the vehicle and the door is opened, it may be determined that the driver opens and closes the door for the passenger. In another example, if the driver is detected in the sequence of images getting out of the vehicle with the trunk open, it may be determined that the driver is loading and unloading luggage for the passenger. According to an exemplary implementation of the present disclosure, if only a state change related to "on" is detected, but "getting out" of the driver is not detected, it may be determined that the driver has not performed the prescribed service behavior.

利用本公开的示例性实现方式，不必以人工方式逐一检查与每次车辆服务相关的图像序列，而是可以基于自动图像处理的方式来评价司机的服务水平。进一步，当出现乘客投诉等争议情况时，可以调用上文描述的方法300，来初步判断司机是否执行了规定的服务行为。以此方式，可以降低客服人员的工作负载并且提高工作效率。With the exemplary implementation of the present disclosure, it is not necessary to manually check the sequence of images related to each vehicle service one by one, but the service level of the driver can be evaluated based on automatic image processing. Further, when there is a dispute such as a passenger complaint, the method 300 described above can be invoked to preliminarily determine whether the driver has performed the prescribed service behavior. In this way, the workload of the customer service personnel can be reduced and the work efficiency can be improved.

在上文中已经参见图2至图8详细描述了根据本公开的方法的示例，在下文中将描述相应的装置的实现。根据本公开的示例性实现方式，提供了一种用于确定司机行为的装置，包括：获取模块，配置用于获取记录车辆的状态的图像序列，图像序列是由部署在车辆内部的图像采集设备所采集的；检测模块，配置用于基于图像序列检测车辆的司机的状态变化以及车辆的预定部分的状态变化；以及确定模块，配置用于基于司机的状态变化和预定部分的状态变化，确定司机向车辆的乘客提供服务的行为。An example of a method according to the present disclosure has been described in detail above with reference to FIGS. 2 to 8 , and the implementation of the corresponding apparatus will be described below. According to an exemplary implementation of the present disclosure, there is provided an apparatus for determining a driver's behavior, comprising: an acquisition module configured to acquire an image sequence recording the state of the vehicle, the image sequence being obtained by an image acquisition device deployed inside the vehicle collected; a detection module configured to detect a state change of the driver of the vehicle and a state change of a predetermined portion of the vehicle based on the sequence of images; and a determination module configured to determine the driver based on the state change of the driver and the state change of the predetermined portion The act of providing service to the occupants of a vehicle.

根据本公开的示例性实现方式，检测模块包括：司机区域标识模块，配置用于在图像序列中，标识与车辆的驾驶座位相关联的司机区域；以及司机变化检测模块，配置用于基于司机区域检测司机的状态变化。According to an exemplary implementation of the present disclosure, the detection module includes: a driver area identification module configured to identify, in a sequence of images, a driver area associated with a driver seat of a vehicle; and a driver change detection module configured to identify a driver area based on the driver area Detect the status change of the driver.

根据本公开的示例性实现方式，变化检测模块包括：第一子序列检测模块，配置用于从图像序列中检测第一图像子序列，在第一图像子序列的图像中司机位于司机区域中；第二子序列检测模块，配置用于响应于在第一图像子序列之后检测到第二图像子序列，在第二图像子序列的图像中司机不在司机区域中，确定司机的状态变化为“下车”。According to an exemplary implementation of the present disclosure, the change detection module includes: a first subsequence detection module configured to detect a first subsequence of images from a sequence of images in which a driver is located in a driver area; The second subsequence detection module is configured to, in response to detecting the second image subsequence after the first image subsequence, in which the driver is not in the driver area in the images of the second image subsequence, determine that the state of the driver is changed to "down" car".

根据本公开的示例性实现方式，变化检测模块进一步包括：比较模块，配置用于响应于确定第二图像子序列中的图像所跨的时间长度满足时间长度阈值，确定司机的状态变化为“下车”。According to an exemplary implementation of the present disclosure, the change detection module further includes a comparison module configured to, in response to determining that the time length spanned by the images in the second subsequence of images satisfies the time length threshold, determine that the driver's state change is "down" car".

根据本公开的示例性实现方式，预定部分包括车辆的后备箱和车门，其中检测模块包括：预定区域标识模块，配置用于在图像序列中，标识与车辆的预定部分相关联的预定区域；以及车辆变化检测模块，配置用于基于预定区域检测预定部分的状态变化。According to an exemplary implementation of the present disclosure, the predetermined portion includes a trunk and a door of the vehicle, wherein the detection module includes: a predetermined area identification module configured to identify, in the sequence of images, a predetermined area associated with the predetermined portion of the vehicle; and A vehicle change detection module configured to detect a state change of a predetermined portion based on a predetermined area.

根据本公开的示例性实现方式，车辆变化检测模块包括：第三子序列检测模块，配置用于从图像序列中检测第三图像子序列，在第三图像子序列的图像中预定区域中的预定部分处于关闭状态；第四子序列检测模块，配置用于响应于在第三图像子序列之后检测到第四图像子序列，在第四图像子序列的图像中预定区域中的预定部分处于打开状态，确定预定部分的状态变化为“打开”。According to an exemplary implementation of the present disclosure, the vehicle change detection module includes: a third subsequence detection module configured to detect a third subsequence of images from the sequence of images, a predetermined subsequence in a predetermined region in an image of the third subsequence of images the portion is in an off state; the fourth subsequence detection module is configured to, in response to detecting the fourth image subsequence after the third image subsequence, a predetermined portion in a predetermined area in the image of the fourth image subsequence being in an open state , it is determined that the state of the predetermined part changes to "open".

根据本公开的示例性实现方式，车辆变化检测模块进一步包括：第五子序列检测模块，配置用于响应于在第三图像子序列和第四图像子序列之间检测到第五图像子序列，在第五图像子序列的图像中预定区域中的预定部分处于打开过程中状态，确定预定部分的状态变化为“打开”。According to an exemplary implementation of the present disclosure, the vehicle change detection module further includes a fifth subsequence detection module configured in response to detecting the fifth image subsequence between the third image subsequence and the fourth image subsequence, In the image of the fifth sub-sequence of images, the predetermined portion in the predetermined region is in the state of being opened, and it is determined that the state of the predetermined portion is changed to "open".

根据本公开的示例性实现方式，司机的行为包括为乘客装卸行李的行为以及为乘客开关车门的行为中的至少任一项，以及确定模块包括以下中的至少任一项：第一行为确定模块，配置用于响应于按顺序检测到“下车”以及“打开”，确定司机执行了行为；以及第二行为确定模块，配置用于响应于检测到“打开”但是没有检测到“下车”，确定司机没有执行行为。According to an exemplary implementation of the present disclosure, the behavior of the driver includes at least any one of the behavior of loading and unloading luggage for the passenger and the behavior of opening and closing the vehicle door for the passenger, and the determining module includes at least any one of the following: a first behavior determining module , configured to determine that the driver performed an action in response to detecting "getting off" and "opening" in sequence; and a second behavior determination module configured to respond to detecting "opening" but not detecting "getting off" , to determine that the driver did not perform the action.

根据本公开的示例性实现方式，获取模块包括以下中的至少任一项：第一图像序列获取模块，配置用于获取司机接受来自乘客的车辆服务请求的开始时间之后预定时间范围内的图像序列；以及第二图像序列获取模块，配置用于获取司机完成车辆服务请求的结束时间点之后预定时间范围内的图像序列。According to an exemplary implementation of the present disclosure, the acquisition module includes at least any one of the following: a first image sequence acquisition module configured to acquire an image sequence within a predetermined time range after the start time when the driver accepts the vehicle service request from the passenger and a second image sequence acquisition module configured to acquire an image sequence within a predetermined time range after the end time point when the driver completes the vehicle service request.

根据本公开的示例性实现方式，获取模块包括：原始图像序列获取模块，配置用于获取来自图像采集设备的原始图像序列；以及采样模块，配置用于按照预定时间间隔，从原始图像序列中执行采样以形成图像序列。According to an exemplary implementation of the present disclosure, the acquisition module includes: an original image sequence acquisition module configured to acquire an original image sequence from an image acquisition device; and a sampling module configured to execute from the original image sequence at predetermined time intervals Sampling to form a sequence of images.

根据本公开的示例性实现方式，提供了一种电子设备，包括：存储器和处理器；其中存储器用于存储一条或多条计算机指令，其中一条或多条计算机指令被处理器执行以实现上文描述的方法。According to an exemplary implementation of the present disclosure, there is provided an electronic device comprising: a memory and a processor; wherein the memory is used to store one or more computer instructions, wherein the one or more computer instructions are executed by the processor to achieve the above method described.

图9示出了其中可以实施本公开的一个或多个实施例的计算设备/服务器900的框图。应当理解，图9所示出的计算设备/服务器900仅仅是示例性的，而不应当构成对本文所描述的实施例的功能和范围的任何限制。9 shows a block diagram of a computing device/server 900 in which one or more embodiments of the present disclosure may be implemented. It should be understood that the computing device/server 900 shown in FIG. 9 is merely exemplary and should not constitute any limitation on the functionality and scope of the embodiments described herein.

如图9所示，计算设备/服务器900是通用计算设备的形式。计算设备/服务器900的组件可以包括但不限于一个或多个处理器或处理单元910、存储器920、存储设备930、一个或多个通信单元940、一个或多个输入设备950以及一个或多个输出设备960。处理单元910可以是实际或虚拟处理器并且能够根据存储器920中存储的程序来执行各种处理。在多处理器系统中，多个处理单元并行执行计算机可执行指令，以提高计算设备/服务器900的并行处理能力。As shown in FIG. 9, computing device/server 900 is in the form of a general purpose computing device. Components of computing device/server 900 may include, but are not limited to, one or more processors or processing units 910, memory 920, storage devices 930, one or more communication units 940, one or more input devices 950, and one or more Output device 960. The processing unit 910 may be an actual or virtual processor and can perform various processes according to programs stored in the memory 920 . In a multi-processor system, multiple processing units execute computer-executable instructions in parallel to increase the parallel processing capabilities of the computing device/server 900 .

计算设备/服务器900通常包括多个计算机存储介质。这样的介质可以是计算设备/服务器900可访问的任何可以获得的介质，包括但不限于易失性和非易失性介质、可拆卸和不可拆卸介质。存储器920可以是易失性存储器(例如寄存器、高速缓存、随机访问存储器(RAM))、非易失性存储器(例如，只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、闪存)或它们的某种组合。存储设备930可以是可拆卸或不可拆卸的介质，并且可以包括机器可读介质，诸如闪存驱动、磁盘或者任何其他介质，其可以能够用于存储信息和/或数据(例如用于训练的训练数据)并且可以在计算设备/服务器900内被访问。Computing device/server 900 typically includes multiple computer storage media. Such media can be any available media accessible by computing device/server 900, including but not limited to volatile and nonvolatile media, removable and non-removable media. Memory 920 may be volatile memory (eg, registers, cache, random access memory (RAM)), non-volatile memory (eg, read only memory (ROM), electrically erasable programmable read only memory (EEPROM) , Flash) or some combination of them. Storage device 930 may be removable or non-removable media, and may include machine-readable media, such as flash drives, magnetic disks, or any other media that may be capable of storing information and/or data (eg, training data for training). ) and can be accessed within computing device/server 900.

计算设备/服务器900可以进一步包括另外的可拆卸/不可拆卸、易失性/非易失性存储介质。尽管未在图9中示出，可以提供用于从可拆卸、非易失性磁盘(例如“软盘”)进行读取或写入的磁盘驱动和用于从可拆卸、非易失性光盘进行读取或写入的光盘驱动。在这些情况中，每个驱动可以由一个或多个数据介质接口被连接至总线(未示出)。存储器920可以包括计算机程序产品925，其具有一个或多个程序模块，这些程序模块被配置为执行本公开的各种实施例的各种方法或动作。Computing device/server 900 may further include additional removable/non-removable, volatile/non-volatile storage media. Although not shown in Figure 9, disk drives for reading or writing from removable, non-volatile magnetic disks (eg, "floppy disks") and for reading or writing from removable, non-volatile optical disks may be provided CD-ROM drive for reading or writing. In these cases, each drive may be connected to a bus (not shown) by one or more data media interfaces. Memory 920 may include a computer program product 925 having one or more program modules configured to perform various methods or actions of various embodiments of the present disclosure.

通信单元940实现通过通信介质与其他计算设备进行通信。附加地，计算设备/服务器900的组件的功能可以以单个计算集群或多个计算机器来实现，这些计算机器能够通过通信连接进行通信。因此，计算设备/服务器900可以使用与一个或多个其他服务器、网络个人计算机(PC)或者另一个网络节点的逻辑连接来在联网环境中进行操作。The communication unit 940 enables communication with other computing devices through a communication medium. Additionally, the functions of the components of computing device/server 900 may be implemented in a single computing cluster or multiple computing machines capable of communicating over a communication connection. Thus, computing device/server 900 may operate in a networked environment using logical connections to one or more other servers, network personal computers (PCs), or another network node.

输入设备950可以是一个或多个输入设备，例如鼠标、键盘、追踪球等。输出设备960可以是一个或多个输出设备，例如显示器、扬声器、打印机等。计算设备/服务器900还可以根据需要通过通信单元940与一个或多个外部设备(未示出)进行通信，外部设备诸如存储设备、显示设备等，与一个或多个使得用户与计算设备/服务器900交互的设备进行通信，或者与使得计算设备/服务器900与一个或多个其他计算设备通信的任何设备(例如，网卡、调制解调器等)进行通信。这样的通信可以经由输入/输出(I/O)接口(未示出)来执行。Input device 950 may be one or more input devices, such as a mouse, keyboard, trackball, and the like. Output device 960 may be one or more output devices, such as a display, speakers, printer, and the like. The computing device/server 900 may also communicate with one or more external devices (not shown), such as storage devices, display devices, etc., through the communication unit 940, as needed, with one or more external devices that enable the user to communicate with the computing device/server. 900 interacts with any device (eg, network card, modem, etc.) that enables computing device/server 900 to communicate with one or more other computing devices. Such communication may be performed via an input/output (I/O) interface (not shown).

根据本公开的示例性实现方式，提供了一种计算机可读存储介质，其上存储有一条或多条计算机指令，其中一条或多条计算机指令被处理器执行以实现上文描述的方法。According to an exemplary implementation of the present disclosure, there is provided a computer-readable storage medium having stored thereon one or more computer instructions, wherein the one or more computer instructions are executed by a processor to implement the method described above.

这里参照根据本公开实现的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本公开的各个方面。应当理解，流程图和/或框图的每个方框以及流程图和/或框图中各方框的组合，都可以由计算机可读程序指令实现。Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products implemented in accordance with the present disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.

这些计算机可读程序指令可以提供给通用计算机、专用计算机或其他可编程数据处理装置的处理单元，从而生产出一种机器，使得这些指令在通过计算机或其他可编程数据处理装置的处理单元执行时，产生了实现流程图和/或框图中的一个或多个方框中规定的功能/动作的装置。也可以把这些计算机可读程序指令存储在计算机可读存储介质中，这些指令使得计算机、可编程数据处理装置和/或其他设备以特定方式工作，从而，存储有指令的计算机可读介质则包括一个制造品，其包括实现流程图和/或框图中的一个或多个方框中规定的功能/动作的各个方面的指令。These computer readable program instructions may be provided to the processing unit of a general purpose computer, special purpose computer or other programmable data processing apparatus to produce a machine that causes the instructions when executed by the processing unit of the computer or other programmable data processing apparatus , resulting in means for implementing the functions/acts specified in one or more blocks of the flowchart and/or block diagrams. These computer readable program instructions can also be stored in a computer readable storage medium, these instructions cause a computer, programmable data processing apparatus and/or other equipment to operate in a specific manner, so that the computer readable medium on which the instructions are stored includes An article of manufacture comprising instructions for implementing various aspects of the functions/acts specified in one or more blocks of the flowchart and/or block diagrams.

也可以把计算机可读程序指令加载到计算机、其他可编程数据处理装置、或其他设备上，使得在计算机、其他可编程数据处理装置或其他设备上执行一系列操作步骤，以产生计算机实现的过程，从而使得在计算机、其他可编程数据处理装置、或其他设备上执行的指令实现流程图和/或框图中的一个或多个方框中规定的功能/动作。Computer-readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other equipment to cause a series of operational steps to be performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process , thereby causing instructions executing on a computer, other programmable data processing apparatus, or other device to implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.

附图中的流程图和框图显示了根据本公开的多个实现的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上，流程图或框图中的每个方框可以代表一个模块、程序段或指令的一部分，模块、程序段或指令的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。在有些作为替换的实现中，方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如，两个连续的方框实际上可以基本并行地执行，它们有时也可以按相反的顺序执行，这依所涉及的功能而定。也要注意的是，框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合，可以用执行规定的功能或动作的专用的基于硬件的系统来实现，或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various implementations of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executables for implementing the specified logical function(s) instruction. In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or actions , or can be implemented in a combination of dedicated hardware and computer instructions.

以上已经描述了本公开的各实现，上述说明是示例性的，并非穷尽性的，并且也不限于所公开的各实现。在不偏离所说明的各实现的范围和精神的情况下，对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。本文中所用术语的选择，旨在最好地解释各实现的原理、实际应用或对市场中的技术的改进，或者使本技术领域的其他普通技术人员能理解本文公开的各实现。While various implementations of the present disclosure have been described above, the foregoing description is exemplary, not exhaustive, and not limiting of the disclosed implementations. Numerous modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described implementations. The terminology used herein was chosen to best explain the principles of the implementations, the practical application or improvement over the technology in the marketplace, or to enable others of ordinary skill in the art to understand the implementations disclosed herein.

Claims

1. A method for determining driver behavior, comprising:

acquiring a sequence of images recording a state of a vehicle, the sequence of images being acquired by an image acquisition device deployed inside the vehicle;

detecting a change in state of a driver of the vehicle and a change in state of a predetermined portion of the vehicle based on the sequence of images; and

determining an action of the driver to provide a service to a passenger of the vehicle based on the state change of the driver and the state change of the predetermined portion.

2. The method of claim 1, wherein detecting the change in state of the driver comprises:

identifying, in the sequence of images, a driver zone associated with a driver seat of the vehicle; and

detecting the state change of the driver based on the driver region.

3. The method of claim 2, wherein detecting the change in state of the driver based on the driver zone comprises:

detecting a first image sub-sequence from the image sequence, in the images of which the driver is located in the driver region;

in response to detecting a second image sub-sequence after the first image sub-sequence, the driver is not in the driver zone in images of the second image sub-sequence, determining that the state change of the driver is "alighting".

4. The method of claim 3, wherein detecting the change in state of the driver based on the driver zone further comprises:

in response to determining that a length of time spanned by images in the second image subsequence satisfies a length of time threshold, determining that the state change of the driver is "alighting".

5. The method of claim 3, wherein the predetermined portion comprises a trunk and a door of the vehicle, wherein detecting the change in state of the predetermined portion based on the sequence of images comprises:

identifying, in the sequence of images, a predetermined region associated with the predetermined portion of the vehicle; and

detecting the change in state of the predetermined portion based on the predetermined area.

6. The method of claim 5, wherein detecting the change in state of the predetermined portion based on the predetermined area comprises:

detecting a third image sub-sequence from the sequence of images, the predetermined portion in the predetermined area being in an off state in the images of the third image sub-sequence;

in response to detecting a fourth image sub-sequence subsequent to the third image sub-sequence, the predetermined portion in the predetermined area being in an on state in images of the fourth image sub-sequence, determining that the state change of the predetermined portion is "on".

7. The method of claim 6, wherein detecting the change in state of the predetermined portion based on the predetermined area further comprises:

in response to detecting a fifth image sub-sequence between the third image sub-sequence and the fourth image sub-sequence, the predetermined portion in the predetermined area being in an in-process state in images of the fifth image sub-sequence, determining that the state change of the predetermined portion is "on".

8. The method of claim 6, wherein the behavior of the driver includes at least any one of loading and unloading luggage for the passenger and opening and closing a door for the passenger, and determining the behavior of the driver includes at least any one of:

determining that the driver performed the action in response to detecting "alighting" and "opening" in order; and

in response to detecting "open" but not detecting "alighting," determining that the driver is not performing the action.

9. The method of claim 1, wherein acquiring the sequence of images comprises at least any one of:

obtaining the sequence of images within a predetermined time range after a start time of acceptance of a vehicle service request by the driver from the passenger; and

acquiring the sequence of images within a predetermined time range after an end time point at which the driver completes the vehicle service request.

10. The method of claim 1, wherein acquiring the sequence of images comprises:

acquiring an original image sequence from the image acquisition device; and

at predetermined time intervals, sampling is performed from the original image sequence to form the image sequence.

11. An apparatus for determining driver behavior, comprising:

an acquisition module configured to acquire a sequence of images recording a state of a vehicle, the sequence of images being acquired by an image acquisition device deployed inside the vehicle;

a detection module configured to detect a change of state of a driver of the vehicle and a change of state of a predetermined portion of the vehicle based on the sequence of images; and

a determination module configured to determine an action of the driver to provide a service to a passenger of the vehicle based on the state change of the driver and the state change of the predetermined portion.

12. An electronic device, comprising:

a memory and a processor;

wherein the memory is to store one or more computer instructions, wherein the one or more computer instructions are to be executed by the processor to implement the method of any one of claims 1 to 10.

13. A computer readable storage medium having one or more computer instructions stored thereon, wherein the one or more computer instructions are executed by a processor to implement the method of any one of claims 1 to 10.