[go: up one dir, main page]

CN116339955B - Local optimization method, device and computer equipment for computing-for-communication framework - Google Patents

Local optimization method, device and computer equipment for computing-for-communication framework Download PDF

Info

Publication number
CN116339955B
CN116339955B CN202310595581.7A CN202310595581A CN116339955B CN 116339955 B CN116339955 B CN 116339955B CN 202310595581 A CN202310595581 A CN 202310595581A CN 116339955 B CN116339955 B CN 116339955B
Authority
CN
China
Prior art keywords
agent
task
local
agents
conflict
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310595581.7A
Other languages
Chinese (zh)
Other versions
CN116339955A (en
Inventor
李�杰
陈润丰
彭婷
陈钇廷
马兆伟
王祥科
尹栋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202310595581.7A priority Critical patent/CN116339955B/en
Publication of CN116339955A publication Critical patent/CN116339955A/en
Application granted granted Critical
Publication of CN116339955B publication Critical patent/CN116339955B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application relates to a local optimization method, a device and computer equipment for a computing communication framework. The method comprises the following steps: and constructing a scoring function of tasks related to execution positions of all the agents, calculating tasks to be executed by adopting a local optimization method according to the scoring function to obtain conflict-free scheduling, calculating the tasks to be executed by adopting the local optimization method for all the agents to obtain conflict-free task scheduling, and transmitting the conflict-free task scheduling to other agents in a communication mode if the existence of conflict is detected when global convergence confirmation is carried out, sending a schedule to the other agents, and eliminating task conflict generated in a global convergence stage by adopting a consistency elimination principle to obtain a network multi-agent scheduling scheme. The application can reduce the communication times and improve the dispatching generation speed.

Description

计算换通信框架的局部优化方法、装置和计算机设备Local optimization method, device and computer equipment for computing communication framework

技术领域Technical Field

本申请涉及智能体调度技术领域,特别是涉及一种计算换通信框架的局部优化方法、装置和计算机设备。The present application relates to the field of intelligent agent scheduling technology, and in particular to a local optimization method, device and computer equipment for a computing-for-communication framework.

背景技术Background Art

多智能体调度并不是什么新鲜事,但近年来的蜂群热潮又一次激活了它。蜂群作为多智能体理论的重要应用形式,赋予了多智能体灵活性、局域性和自主性等新的特性,使其在工程上具有很高的价值,可以表现为一组车辆、无人机、机器人或其他具有个体动态的智能体。根据个体成本和功能的不同,集群可以采取群体行为(低成本、单一功能),也可以采取分布式精确调度(高成本、完整功能)。对于前者,基于阈值响应的群算法可以实现聚集、分散等行为。而对于后者,则需要多智能体调度算法来实现装配、调度等精确协调。Multi-agent scheduling is nothing new, but the swarm craze in recent years has reactivated it. As an important application form of multi-agent theory, the swarm gives multi-agents new characteristics such as flexibility, locality and autonomy, making it very valuable in engineering. It can be expressed as a group of vehicles, drones, robots or other agents with individual dynamics. Depending on the individual cost and function, the swarm can adopt group behavior (low cost, single function) or distributed precise scheduling (high cost, complete function). For the former, the swarm algorithm based on threshold response can realize aggregation, dispersion and other behaviors. For the latter, a multi-agent scheduling algorithm is needed to achieve precise coordination such as assembly and scheduling.

虽然调度仍然是一个将“智能体-任务”对与时间序列匹配的过程。蜂群大多由低成本的智能体组成,对计算、通信和存储提出了更高的要求,在实际应用中需要考虑。在这些影响因素中,由于芯片的快速发展,小型化、低功耗、高性能的计算单元层出不穷,如英伟达、AMD或英特尔系列产品,矛盾就不那么突出了。虽然自组织网络等通信技术已经取得了很大的进展,但由于环境影响,特别是在复杂地形遮挡或电磁环境中,它仍然是一个更具挑战性的因素。智能体之间的通信不可避免地受到环境的影响,导致不同程度的错误、丢包、延迟甚至中断,从而影响调度的生成时间,甚至的成败。因此,如何减少对风险沟通的依赖,增强调度生成的鲁棒性和时效性,是需要考虑的问题。Although scheduling is still a process of matching "agent-task" pairs with time series. The swarm is mostly composed of low-cost agents, which puts higher requirements on computing, communication and storage, which need to be considered in practical applications. Among these influencing factors, due to the rapid development of chips, miniaturized, low-power, and high-performance computing units are emerging in an endless stream, such as NVIDIA, AMD or Intel series products, the contradiction is not so prominent. Although communication technologies such as self-organizing networks have made great progress, it is still a more challenging factor due to environmental influences, especially in complex terrain occlusion or electromagnetic environments. The communication between agents is inevitably affected by the environment, resulting in varying degrees of errors, packet loss, delays and even interruptions, which affects the generation time of the schedule and even the success or failure. Therefore, how to reduce the reliance on risk communication and enhance the robustness and timeliness of schedule generation is a problem that needs to be considered.

传统方式是通过计算和通讯两个阶段解决调度问题,在计算阶段通过本地代理计算获得的调度,在通讯阶段通过和相邻代理通讯其安排来化解冲突,两个过程迭代,从而得到调度方案,就两个阶段而言,前一阶段侧重提高调度效率,后一阶段侧重提高优化性能,因此,使得总的调度生成时间短、通讯时间短成为亟待解决的问题。The traditional approach is to solve the scheduling problem through two stages: calculation and communication. In the calculation stage, the schedule is obtained by local agent calculation, and in the communication stage, conflicts are resolved by communicating with adjacent agents about their arrangements. The two processes are iterated to obtain a scheduling solution. As far as the two stages are concerned, the former stage focuses on improving scheduling efficiency, while the latter stage focuses on improving optimization performance. Therefore, making the total scheduling generation time short and the communication time short become issues that need to be urgently addressed.

发明内容Summary of the invention

基于此,有必要针对上述技术问题,提供一种网络多智能体调度计算换通信系统。Based on this, it is necessary to provide a network multi-agent scheduling computing and communication system to address the above technical problems.

一种计算换通信框架的局部优化方法,所述方法包括:A local optimization method for a computation-for-communication framework, the method comprising:

构建各个智能体执行位置相关的任务的评分函数;Construct scoring functions for each agent to perform position-related tasks;

根据所述评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度;其中,所述局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应所述评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择所述局部优化方法中包含的局部智能体;According to the scoring function, a local optimization method is used to calculate the tasks to be executed to obtain a conflict-free schedule; wherein, in the local optimization method, each agent interacts with each other through communication, and the location information of the agent is determined by the score of the scoring function corresponding to the task executed by other agents stored in the agent, and the local agent included in the local optimization method is selected through a preset task-related selection strategy;

对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体;Calculate the tasks to be executed by each agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other agents through communication;

向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案。The schedule is sent to other agents, and the consistency elimination principle is used to eliminate the task conflicts generated in the global convergence stage to obtain the network multi-agent scheduling plan.

在其中一个实施例中,所述评分函数表示为:In one embodiment, the scoring function is expressed as:

;

其中,是智能体执行任务的得分,表示智能体的时间表,表示智能体按照时间表执行任务的回报,是为任务回报的折扣因子,为智能体沿时间表或智能体的位置到达任务位置的估计时间,为任务的初始任务回报。in, It is an intelligent agent Execute the task The score, Representing an Agent timetable, Representing an Agent According to the schedule Execute the task The return, It's for the mission The discount factor for the return, For intelligent agents Along the schedule Or the agent's location arrival task Estimated time of location, For the task Initial mission reward.

在其中一个实施例中,所述智能体进行任务调度的优化问题为:In one embodiment, the optimization problem for the agent to perform task scheduling is:

;

其中,n表示智能体数量,m表示任务数量,为二进制变量,当时,智能体执行任务,当时,表示智能体不执行任务是智能体执行任务的得分;所述优化问题的约束条件为:Where n represents the number of agents, m represents the number of tasks, is a binary variable, when When the intelligent agent Execute the task ,when , indicating that the agent Not performing tasks , It is an intelligent agent Execute the task The constraints of the optimization problem are:

;

;

;

;

其中,表示智能体在任务调度中执行任务的时间。in, Representing an Agent In task scheduling Execute tasks in time.

在其中一个实施例中,所述局部优化包括:任务交换和局部采样;In one of the embodiments, the local optimization includes: task exchange and local sampling;

通过所述任务交换增加局部范围得分;所述局部范围得分指的是本地范围内所有智能体执行任务的得分;Increasing the local scope score by the task exchange; the local scope score refers to the score of all agents performing tasks in the local scope;

通过所述局部采样得到局部范围内其他智能体的任务得分,从而进行局部估计得到无冲突任务调度。The task scores of other agents in the local range are obtained through the local sampling, so as to perform local estimation and obtain conflict-free task scheduling.

在其中一个实施例中,所述任务交换过程包括:In one embodiment, the task exchange process includes:

设置任务交换策略为:Set the task exchange strategy to:

;

;

;

其中,是智能体选择交换的策略,局部分数是智能体与智能体的分数之和,是将智能体的任务与智能体的任务交换,是交换后的局部分数。in, It is an intelligent agent Select Exchange strategy, local score It is an intelligent agent With Agent The sum of the scores of Is to make the intelligent agent Mission With Agent Mission exchange, is the local score after the exchange.

在其中一个实施例中,所述局部采样的过程为:In one embodiment, the local sampling process is:

根据局部范围内所有智能体的当前调度、任务信息、位置信息以及评分函数,构建局部拍卖为:According to the current schedule, task information, location information and scoring function of all agents in the local range, the local auction is constructed as:

;

其中,表示智能体i对局部范围内其他智能体的局部估计函数,当前时刻t其他智能体k的局部拍卖函数,表示任务的任务信息,表示智能体k的位置信息。in, represents the local estimation function of agent i for other agents in the local range, The local auction function of other agents k at the current time t, Indicates the task Task information, Represents the location information of agent k.

在其中一个实施例中,根据所述评分函数,计算智能体与待执行任务的距离信息;In one of the embodiments, according to the scoring function, distance information between the agent and the task to be performed is calculated;

根据至少三个所述距离信息,确定智能体的位置信息。The location information of the intelligent agent is determined based on at least three of the distance information.

在其中一个实施例中,所述显示智能体是当前智能体从局部内存求解当前迭代中其他智能体的获胜者向量,以及求解与当前智能体有任务冲突的其他智能体;所述隐式智能体是当前没有显示任务冲突下,可能发生冲突的智能体。In one embodiment, the displayed agent is the current agent that solves the winner vectors of other agents in the current iteration from the local memory, and solves other agents that have task conflicts with the current agent; the implicit agent is an agent that may conflict when there is no displayed task conflict.

一种计算换通信框架的局部优化装置,所述装置包括:A local optimization device for a computing-for-communication framework, the device comprising:

评分函数构建模块,用于构建各个智能体执行位置相关的任务的评分函数;The scoring function building module is used to build the scoring function for each agent to perform position-related tasks;

局部优化模块,用于根据所述评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度;其中,所述局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应所述评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择所述局部优化方法中包含的局部智能体;A local optimization module, used to calculate the tasks to be executed using a local optimization method according to the scoring function to obtain a conflict-free schedule; wherein, in the local optimization method, each agent interacts with each other through communication, determines the location information of the agent through the scores of the scoring function corresponding to the tasks executed by other agents stored in the agent, and selects the local agent included in the local optimization method through a preset task-related selection strategy;

调度模块,用于对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体;The scheduling module is used to calculate the tasks to be executed by each intelligent agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other intelligent agents through communication;

求解模块,用于向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案。The solving module is used to send schedules to other agents and adopt the consistency elimination principle to eliminate the task conflicts generated in the global convergence stage to obtain the network multi-agent scheduling solution.

在其中一个实施例中,一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现方法的步骤为:In one embodiment, a computer device includes a memory and a processor, wherein the memory stores a computer program, and the processor executes the computer program to implement the steps of:

构建各个智能体执行位置相关的任务的评分函数;Construct scoring functions for each agent to perform position-related tasks;

根据所述评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度;其中,所述局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应所述评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择所述局部优化方法中包含的局部智能体;According to the scoring function, a local optimization method is used to calculate the tasks to be executed to obtain a conflict-free schedule; wherein, in the local optimization method, each agent interacts with each other through communication, and the location information of the agent is determined by the score of the scoring function corresponding to the task executed by other agents stored in the agent, and the local agent included in the local optimization method is selected through a preset task-related selection strategy;

对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体;Calculate the tasks to be executed by each agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other agents through communication;

向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案。The schedule is sent to other agents, and the consistency elimination principle is used to eliminate the task conflicts generated in the global convergence stage to obtain the network multi-agent scheduling plan.

上述计算换通信框架的局部优化方法、装置和计算机设备,在提出计算换通信的思想下,通过适当增加计算量来减少通信次数,从而保持甚至减少调度生成时间。设计一种新的评分函数,该函数更加优化,可以为推断其他智能体的位置提供支持。在此之下,提出一种局部优化方法,进一步提高调度的优化性和时效性。利用任务交换和任务抽样的方法探索更好的调度方法,避免个体贪婪所导致的局部最优问题。通过局部估计提前解决智能体之间的冲突,减少迭代次数,同时采用所提出的任务相关智能体选择策略,将计算量花在关键智能体上。The local optimization method, device and computer equipment of the above computation-for-communication framework, under the idea of computation-for-communication, reduce the number of communications by appropriately increasing the amount of computation, thereby maintaining or even reducing the scheduling generation time. A new scoring function is designed, which is more optimized and can provide support for inferring the positions of other agents. Under this, a local optimization method is proposed to further improve the optimization and timeliness of scheduling. Task exchange and task sampling methods are used to explore better scheduling methods to avoid local optimal problems caused by individual greed. Conflicts between agents are resolved in advance through local estimation to reduce the number of iterations. At the same time, the proposed task-related agent selection strategy is adopted to spend the computation on key agents.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为一个实施例中通信计算框架与传统市场框架的区别对比图,其中,(a)为传统基于市场的框架示意图,(b)为本申请的通信计算框架示意图;FIG1 is a comparison diagram of the differences between a communication computing framework and a traditional market framework in one embodiment, wherein (a) is a schematic diagram of a traditional market-based framework, and (b) is a schematic diagram of the communication computing framework of the present application;

图2为一个实施例中计算换通信框架的局部优化方法的示意性流程图;FIG2 is a schematic flow chart of a local optimization method for computing a communication framework in one embodiment;

图3为一个实施例中计算换通信框架的局部优化方法的框架图;FIG3 is a framework diagram of a local optimization method for computing a communication framework in one embodiment;

图4为一个实施例中计算机设备的内部示意图。FIG. 4 is a schematic diagram of the interior of a computer device in one embodiment.

具体实施方式DETAILED DESCRIPTION

为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solution and advantages of the present application more clearly understood, the present application is further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application and are not used to limit the present application.

如图1所示,图1(a)为传统基于市场的框架示意图,图1 (b)为本申请的通信计算框架示意图。从图1(a)的传统基于市场的框架可以看出,总的调度时间包括规划时间、交流时间以及等待时间,而在规划时间、交流时间以及等待时间中包含计算时间和通讯时间。在本发明的计算换通信框架中,由于引入了计算换通讯的冲突消解策略,虽然总体的增量规划时间增长,但是大大降低了通讯时间,提高了调度的效率。As shown in Figure 1, Figure 1 (a) is a schematic diagram of a traditional market-based framework, and Figure 1 (b) is a schematic diagram of a communication computing framework of the present application. It can be seen from the traditional market-based framework of Figure 1 (a) that the total scheduling time includes planning time, communication time and waiting time, and the planning time, communication time and waiting time include computing time and communication time. In the computing-for-communication framework of the present invention, due to the introduction of the computing-for-communication conflict resolution strategy, although the overall incremental planning time increases, the communication time is greatly reduced and the scheduling efficiency is improved.

在一个实施例中,如图2所示,提供了一种计算换通信框架的局部优化方法,如下:In one embodiment, as shown in FIG2 , a local optimization method of a computation-for-communication framework is provided, as follows:

步骤202,构建各个智能体执行位置相关的任务的评分函数。Step 202, construct a scoring function for each agent to perform location-related tasks.

评分函数是智能体选择任务的依据,驱动智能体时间表的生成。The scoring function is the basis for the agent to select tasks and drives the generation of the agent's schedule.

步骤204,根据评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度。Step 204 , according to the scoring function, a local optimization method is used to calculate the tasks to be executed to obtain a conflict-free schedule.

局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择局部优化方法中包含的局部智能体。In the local optimization method, each agent interacts with each other through communication. The location information of the agent is determined by the scores of the scoring functions corresponding to the tasks stored by other agents in the agent's storage, and the local agents included in the local optimization method are selected through the pre-set task-related selection strategy.

本步骤中,通过局部优化虽然增加了一定的计算量,但是可以极大的减少通讯次数,随着计算资源的极大膨胀,通过本发明的计算换通讯框架,即可以保证调度生成时间短,又可以减少通讯次数。In this step, although the amount of calculation is increased by local optimization, the number of communications can be greatly reduced. With the great expansion of computing resources, the calculation-for-communication framework of the present invention can ensure that the scheduling generation time is short and the number of communications can be reduced.

步骤206,对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体。Step 206, calculate the tasks to be executed for each intelligent agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other intelligent agents through communication.

通过局部优化,意图消解任务之间的冲突,从而得到无冲突的任务调度,从而再检测全局收敛。Through local optimization, the intention is to eliminate conflicts between tasks, so as to obtain conflict-free task scheduling, and then detect global convergence.

步骤208,向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案。Step 208, sending a schedule to other agents, using the consistency elimination principle to eliminate task conflicts generated in the global convergence phase, and obtaining a network multi-agent scheduling solution.

上述计算换通信框架的局部优化方法中,在提出计算换通信的思想下,通过适当增加计算量来减少通信次数,从而保持甚至减少调度生成时间。设计一种新的评分函数,该函数更加优化,可以为推断其他智能体的位置提供支持。在此之下,提出一种局部优化方法,进一步提高调度的优化性和时效性。利用任务交换和任务抽样的方法探索更好的调度方法,避免个体贪婪所导致的局部最优问题。通过局部估计提前解决智能体之间的冲突,减少迭代次数,同时采用所提出的任务相关智能体选择策略,将计算量花在关键智能体上。In the local optimization method of the computation-for-communication framework mentioned above, under the idea of computation-for-communication, the number of communications is reduced by appropriately increasing the amount of computation, thereby maintaining or even reducing the scheduling generation time. A new scoring function is designed, which is more optimized and can provide support for inferring the positions of other agents. Under this, a local optimization method is proposed to further improve the optimization and timeliness of scheduling. Task exchange and task sampling methods are used to explore better scheduling methods to avoid local optimal problems caused by individual greed. Conflicts between agents are resolved in advance through local estimation to reduce the number of iterations. At the same time, the proposed task-related agent selection strategy is adopted to spend the computation on key agents.

对于网络多智能体的调度计算,一般分为两个阶段,即计算阶段和通讯阶段,在计算阶段,智能体都追求分数最大化的共同目标,但由于智能体能力有限,它们的分数都有上界。在通信阶段,各智能体都遵循出价最高者得的一致性规则,即任务属于得分最高的智能体,任务冲突可以通过得分来消解。因此,只要不出现任务分数振荡导致的死锁问题,基于市场的方法就一定可以收敛。影响收敛速度的原因主要有三个:第一是任务冲突的数量,这与智能体或任务的分布和算法的性能有关,例如个别贪婪算法容易出现任务冲突;二是任务评分的单调性,如果评分不单调,智能体偏好的任务可能会因为分数低而丢失,需要重新选择任务进行迭代,甚至会导致任务选择循环陷入僵局;第三是任务分数的更新,删除或添加任务后,一旦任务分数更新,任务的属性可能会发生变化,需要再次迭代以消解任务冲突。The scheduling calculation of multi-agents in the network is generally divided into two stages, namely the calculation stage and the communication stage. In the calculation stage, the agents all pursue the common goal of maximizing the score, but due to the limited capabilities of the agents, their scores have upper bounds. In the communication stage, each agent follows the consistency rule of the highest bidder, that is, the task belongs to the agent with the highest score, and the task conflict can be resolved by the score. Therefore, as long as there is no deadlock problem caused by task score oscillation, the market-based method will definitely converge. There are three main reasons that affect the convergence speed: the first is the number of task conflicts, which is related to the distribution of agents or tasks and the performance of the algorithm. For example, some greedy algorithms are prone to task conflicts; the second is the monotonicity of task scores. If the score is not monotonic, the task preferred by the agent may be lost due to low scores, and tasks need to be reselected for iteration, which may even cause the task selection cycle to deadlock; the third is the update of task scores. After deleting or adding tasks, once the task score is updated, the attributes of the task may change, and it is necessary to iterate again to resolve task conflicts.

综上,提升网络多智能体的调度效率的核心,在于如何加速收敛,即减少迭代次数。因此在本发明中,通过增加其他智能体的计算量来减少任务冲突,从而减少迭代,虽然计算时间延长,但迭代次数减少,从而减少了通信回合,计算量的增加主要是由于包括其他智能体在内的局部优化,预计将减少通信次数,且保持总调度生成时间不变甚至减少。In summary, the key to improving the scheduling efficiency of multi-agent networks lies in how to accelerate convergence, that is, reduce the number of iterations. Therefore, in the present invention, the task conflict is reduced by increasing the computational workload of other agents, thereby reducing iterations. Although the computational time is prolonged, the number of iterations is reduced, thereby reducing the communication rounds. The increase in computational workload is mainly due to local optimization including other agents, which is expected to reduce the number of communications and keep the total scheduling generation time unchanged or even reduce it.

在其中一个实施例中,如图3所示,本发明的计算换通信系统,与传统计算相似,保留了两个迭代阶段,在计算阶段,智能体根据冲突解决后的任务属性,剔除不属于自己的任务,然后通过局部优化计算出无冲突的任务,并将其加入智能体调度。接下来进行收敛确认,如果存在冲突,则将调度传递给其他智能体,在通信阶段解决冲突,否则将获得最终的调度。在通信阶段,智能体以任务获胜者、任务获胜标价和时间戳的形式向其他智能体发送自己的时间表,并接收其他智能体的时间表,然后根据一致性规则消解任务冲突。常用的一致性规则是价高者得,即任务出价最高的智能体比其他智能体出价更高获得任务。In one of the embodiments, as shown in FIG3 , the computing-for-communication system of the present invention, similar to traditional computing, retains two iterative stages. In the computing stage, the agent eliminates tasks that do not belong to it according to the task attributes after conflict resolution, and then calculates conflict-free tasks through local optimization and adds them to the agent schedule. Next, convergence confirmation is performed. If there is a conflict, the schedule is passed to other agents to resolve the conflict in the communication stage, otherwise the final schedule will be obtained. In the communication stage, the agent sends its own schedule to other agents in the form of task winners, task winning bids and timestamps, and receives the schedules of other agents, and then resolves task conflicts according to consistency rules. The commonly used consistency rule is that the highest bidder wins, that is, the agent with the highest task bid bids higher than other agents to obtain the task.

在其中一个实施例中,评分函数表示为:In one embodiment, the scoring function is expressed as:

;

其中,是智能体执行任务的得分,表示智能体的时间表,表示智能体按照时间表执行任务的回报,是为任务回报的折扣因子,为智能体沿时间表或智能体的位置到达任务位置的估计时间,为任务的初始任务回报。in, It is an intelligent agent Execute the task The score, Representing an Agent timetable, Representing an Agent According to the schedule Execute the task The return, It's for the mission The discount factor for the return, For intelligent agents Along the schedule Or the agent's location arrival task Estimated time of location, For the task Initial mission reward.

在其中一个实施例中,智能体进行任务调度的优化问题为:In one embodiment, the optimization problem for the agent to perform task scheduling is:

;

其中,n表示智能体数量,m表示任务数量,为二进制变量,当时,智能体执行任务,当时,表示智能体不执行任务,所述优化问题的约束条件为:Where n represents the number of agents, m represents the number of tasks, is a binary variable, when When the intelligent agent Execute the task ,when , indicating that the agent Not performing tasks , the constraints of the optimization problem are:

;

;

;

;

其中,表示智能体在任务调度中执行任务的时间。in, Representing an Agent In task scheduling Execute tasks in time.

具体的,多智能体调度是n个智能体分配m个任务,形成各自的调度。每个智能体的调度pi是分配任务的执行顺序和时间,满足任务的开始时间和截止日期等约束。一般可以表述为约束优化问题。上述优化问题中对参数进行改进,从而引入任务对智能体的日程安排的影响,驱使智能体在前一个任务的基础上探索新的任务,而不管新任务是否在其他智能体附近,这可能导致盲目的向外探索。与常见的评分函数不同,引入后一项来驱动智能体选择其位置附近的任务,使得智能体尽量避免竞争其他智能体附近的任务,这有助于减少智能体之间的冲突。每个智能体尝试选择离自己较近的任务,这样可以避免任务较近的智能体等待远处的其他智能体执行有时间限制的任务而导致任务失败。此外,引入的还隐含了智能体与任务之间的距离信息。Specifically, multi-agent scheduling is n agents assigning m tasks to form their own schedules. The scheduling of pi is to assign the execution order and time of tasks to meet the constraints such as the start time and deadline of the tasks. It can generally be expressed as a constrained optimization problem. Improvements were made to introduce tasks For Agents The schedule of drives the agent to explore new tasks based on the previous task, regardless of whether the new task is near other agents, which may lead to blind outward exploration. Different from the common scoring function, the latter term is introduced to drive the agent to select tasks near its location, so that the agent tries to avoid competing for tasks near other agents, which helps to reduce conflicts between agents. Each agent tries to choose tasks that are closer to itself, which can avoid agents with closer tasks waiting for other agents far away to perform time-limited tasks and cause task failure. In addition, the introduction of It also implies the distance information between the agent and the task.

在其中一个实施例中,局部优化包括:任务交换和局部采样,通过任务交换增加局部范围得分;局部范围得分指的是本地范围内所有智能体执行任务的得分;通过局部采样得到局部范围内其他智能体的任务得分,从而进行局部估计得到无冲突任务调度。In one embodiment, local optimization includes: task exchange and local sampling, where the local scope score is increased through task exchange; the local scope score refers to the score of all agents performing tasks within the local scope; and the task scores of other agents within the local scope are obtained through local sampling, thereby performing local estimation to obtain conflict-free task scheduling.

具体的,局部优化是指智能体考虑局部范围内其他智能体的调度,而不是像个体优化一样只考虑自己的调度。这允许智能体提前避免与局部范围内的其他智能体可能的冲突,并尝试逃避个体优化陷入的局部最优。局部优化主要包括两个步骤:第一步是任务交换,通过在智能体之间交换分配的任务来提高调度的性能;二是局部采样,对各种可行的调度进行采样,选出最优调度,进一步提高调度性能。在这两个步骤中,由于计算其他智能体的调度,任务冲突将得到消解。Specifically, local optimization means that the agent considers the scheduling of other agents in the local scope, rather than only considering its own scheduling like individual optimization. This allows the agent to avoid possible conflicts with other agents in the local scope in advance and try to escape the local optimum that the individual optimization falls into. Local optimization mainly includes two steps: the first step is task exchange, which improves the performance of scheduling by exchanging assigned tasks between agents; the second is local sampling, which samples various feasible schedules, selects the optimal schedule, and further improves the scheduling performance. In these two steps, task conflicts will be resolved due to the calculation of the schedules of other agents.

在其中一个实施例中,任务交换过程包括:In one embodiment, the task exchange process includes:

设置任务交换策略为:Set the task exchange strategy to:

;

;

;

其中,是智能体选择交换的策略,局部分数是智能体与智能体的分数之和,是将智能体的任务与智能体的任务交换,是交换后的局部分数。in, It is an intelligent agent Select Exchange strategy, local score It is an intelligent agent With Agent The sum of the scores of Is to make the intelligent agent Mission With Agent Mission exchange, is the local score after the exchange.

具体的,在任务交换步骤中,局部范围为交换任务的两个智能体,在局部采样中,局部范围为与任务相关的多个智能体。因此,局部优化的性能通过局部得分来评估,即指定的局部范围内所有智能体的得分之和。本地分数为:Specifically, in the task exchange step, the local scope is the two agents that exchange tasks, and in local sampling, the local scope is multiple agents related to the task. Therefore, the performance of local optimization is evaluated by the local score, which is the sum of the scores of all agents in the specified local scope. Local score for:

;

其中,是本地范围内的所有智能体,在任务交换和局部采样阶段不同。in, are all agents in the local scope, which are different during the task exchange and local sampling phases.

对于任务交换过程,智能体尝试与其他智能体交换任务以提高局部优化,这可以通过两个智能体组成的局部分数来体现。由于可能有许多可行的交换,因此选择最优的交换,使局部分数增加最多。For the task exchange process, the agent Try it with other agents Swap tasks to improve local optimization, which can be reflected by the local score of the two agents. Since there may be many feasible swaps, the optimal swap is chosen that increases the local score the most.

具体的,由于任务交换需要进行大量的遍历,因此可以采用启发式算法求解上述任务交换策略,以提高计算效率。Specifically, since task exchange requires a large number of traversals, a heuristic algorithm can be used to solve the above task exchange strategy to improve computational efficiency.

在其中一个实施例中,局部采样的过程为:In one embodiment, the local sampling process is:

根据局部范围内所有智能体的当前调度、任务信息、位置信息以及评分函数,构建局部拍卖为:According to the current schedule, task information, location information and scoring function of all agents in the local range, the local auction is constructed as:

;

其中,表示智能体i对局部范围内其他智能体的局部估计函数,当前时刻t其他智能体k的局部拍卖函数,表示任务的任务信息,表示智能体k的位置信息。in, represents the local estimation function of agent i for other agents in the local range, The local auction function of other agents k at the current time t, Indicates the task Task information, Represents the location information of agent k.

具体的,在每次采样中,智能体计算所有相关智能体的任务得分,预估其任务选择和结果进度,从而提前避免与其他智能体的任务冲突,减少智能体之间所需的通信。这种估计其他智能体调度的过程就是局部估计,主要是基于局部范围内所有智能体的当前调度,所有任务的信息,智能体的位置和评分函数来构造一个局部拍卖。Specifically, in each sampling, the agent Calculate all relevant agents The task scores of the agents are estimated, and their task selection and result progress are estimated, so as to avoid task conflicts with other agents in advance and reduce the communication required between agents. This process of estimating the scheduling of other agents is called local estimation. , mainly based on all agents in the local area The current schedule of all tasks Information , Agent Location and the scoring function To construct a local auction.

具体的,局部采样的具体步骤如下:Specifically, the specific steps of local sampling are as follows:

1、任务失格1. Mission failure

任务失格主要阻止智能体选择任务。最优无资格任务是最大化估计局部分数的任务。如果为0,则不存在非限定任务:Mission Failure Mainly prevents agents Select Task Optimal Unqualified Task is the task of maximizing the estimated local score. If If it is 0, there are no unrestricted tasks:

;

;

2、不合格的任务候选人2. Unqualified candidates for tasks

不合格的任务由可选的不合格任务组成,是局部范围内所有智能体的相关任务The unqualified tasks consist of optional unqualified tasks, which are all the agents in the local scope. Related tasks :

;

其中,智能体的显式任务候选人由出价高于对手的任务组成,隐式任务候选人由潜在冲突的任务组成:Among them, the intelligent Explicit task candidates Consists of tasks that outbid the opponent, implicit task candidates Consists of potentially conflicting tasks:

;

;

以下举例说明,智能体-1和智能体-2的调度显示在状态-1中。通过任务交换,智能体获得更好的调度(状态-2)。在状态-2中,智能体-3通过局部采样获得两个样本,其中采样-1的智能体提前知道智能体-2将通过估计获得任务,放弃任务-G而获得任务-H。而在采样-2中,智能体-3主动取消任务-H的资格,让位于智能体-2,选择距离较远的任务-I,局部得分较高。The following example shows that the schedule of Agent-1 and Agent-2 is shown in State-1. Through task exchange, the agents get a better schedule (State-2). In State-2, Agent-3 obtains two samples through local sampling, where the agent of Sampling-1 knows in advance that Agent-2 will obtain the task through estimation, giving up Task-G and obtaining Task-H. In Sampling-2, Agent-3 actively disqualifies Task-H and gives way to Agent-2, choosing Task-I which is farther away and has a higher local score.

在其中一个实施例中,为了实现上述局部优化过程,还需要推断其他智能体的位置。这些位置通常是通过通信获得的。但是,由于传统方法的通信协议中不包含智能体的位置,因此需要添加额外的通信协议和流量。为了保持传统方法的低流量优势,本发明尝试从通信接收到的历史数据推断其他智能体的位置,而不增加新的通信。In one embodiment, in order to implement the above-mentioned local optimization process, it is also necessary to infer the positions of other agents. These positions are usually obtained through communication. However, since the communication protocol of the traditional method does not include the position of the agent, it is necessary to add additional communication protocols and traffic. In order to maintain the low traffic advantage of the traditional method, the present invention attempts to infer the positions of other agents from the historical data received by the communication without adding new communications.

本发明中,智能体对任务的出价由评分计算,智能体到任务的估计到达时间为智能体与任务的直接或累计距离与智能体的速度之比,因此智能体到任务的距离可由计分函数的逆函数得到:In the present invention, the intelligent agent To the task The bid of the agent is calculated by the score. To the task The estimated arrival time of the agent With the task The direct or cumulative distance between , so the agent To the task The distance can be obtained by the inverse function of the scoring function:

;

其中,所有智能体都知道任务的位置、值和折扣因子,智能体的巡航速度已知。Among them, all agents know the task Location ,value and discount factor , Agent Cruising speed Known.

计算智能体到任务的距离后,智能体的位置在以任务的位置为圆心、距离为半径的圆上。从三个圆来确定一个交集,可以看出,获得智能体的位置只需要三个不同的任务Computational Agents To the task Distance Afterwards, the agent The location is based on the task The position is the center of the circle, the distance From the three circles to determine an intersection, it can be seen that the intelligent agent is obtained The location only requires three different tasks .

在另一个实施例中,由于,对太多其他智能体的计算将大大增加计算时间。因此,适当增加的计算量应该用于可能竞争任务的“竞争性”智能体。选择这些竞争智能体的核心思想是任务相关性。因此,还需要指定相应的任务相关智能体选择策略。In another embodiment, since the calculation of too many other agents will greatly increase the calculation time, an appropriately increased amount of calculation should be used for "competitive" agents that may compete for tasks. The core idea of selecting these competitive agents is task relevance. Therefore, it is also necessary to specify a corresponding task-related agent selection strategy.

1、任务相关智能体1. Task-related Agents

任务相关智能体对同一任务感兴趣,可能存在冲突,由显式智能体和隐式智能体组成:Task-Related Agents Interested in the same task, there may be conflicts, by explicit agents and implicit agents composition:

;

2、显示智能体2. Display Agent

显式智能体是当前存在显式任务冲突的智能体。智能体从局部内存中求出当前迭代中其他智能体的获胜者向量,然后求出所有与智能体的调度有任务冲突的智能体An explicit agent is an agent that currently has an explicit task conflict. From local memory Find the current iteration Other Agents The winner vector , and then find all the agents Scheduling Agents with Task Conflict :

;

3、隐式智能体3. Implicit Agents

隐式智能体是指在当前没有显式任务冲突的情况下,将来可能发生冲突的智能体。可以考虑探索隐式智能体的各种策略,主要是利用其他智能体的获胜者标价来推断时间表或位置Implicit agents are agents that may conflict in the future when there is no explicit task conflict at present. Various strategies for implicit agents can be considered. , mainly using other agents The winner's bid To infer the timetable or Location :

;

在有限时间内探索隐式智能体的策略。从智能体当前的调度开始,依次探索调度中的任务。主要探索以任务为中心,到任务的累计距离为半径的圆内最近的潜在智能体:Explore the implicit agent's strategy in a limited time. From the agent's current schedule Start by exploring the tasks in the schedule one by one. Centered on the mission The cumulative distance The closest potential agent within a circle of radius:

;

应该理解的是,虽然图2的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that, although the various steps in the flowchart of Fig. 2 are displayed in sequence according to the indication of the arrows, these steps are not necessarily executed in sequence according to the order indicated by the arrows. Unless there is a clear explanation in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least a part of the steps in Fig. 2 may include a plurality of sub-steps or a plurality of stages, and these sub-steps or stages are not necessarily executed at the same time, but can be executed at different times, and the execution order of these sub-steps or stages is not necessarily to be carried out in sequence, but can be executed in turn or alternately with other steps or at least a part of the sub-steps or stages of other steps.

在其中一个实施例中,如图3所示,提供一种计算换通信框架的局部优化装置,包括:In one embodiment, as shown in FIG3 , a local optimization device for a computation-for-communication framework is provided, comprising:

评分函数构建模块302,用于构建各个智能体执行位置相关的任务的评分函数;A scoring function construction module 302 is used to construct a scoring function for each agent to perform a position-related task;

局部优化模块304,用于根据所述评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度;其中,所述局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应所述评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择所述局部优化方法中包含的局部智能体;The local optimization module 304 is used to calculate the tasks to be executed according to the scoring function using a local optimization method to obtain a conflict-free schedule; wherein, in the local optimization method, each agent interacts with each other through communication, determines the location information of the agent through the scores of the scoring function corresponding to the tasks executed by other agents stored in the agent, and selects the local agent included in the local optimization method through a preset task-related selection strategy;

调度模块306,用于对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体;The scheduling module 306 is used to calculate the tasks to be executed by each agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other agents through a communication method.

求解模块308,用于向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案。The solution module 308 is used to send a schedule to other agents, and use the consistency elimination principle to eliminate the task conflicts generated in the global convergence stage to obtain a network multi-agent scheduling solution.

在其中一个实施例中,所述评分函数表示为:In one embodiment, the scoring function is expressed as:

;

其中,是智能体执行任务的得分,表示智能体的时间表,表示智能体按照时间表执行任务的回报,是为任务回报的折扣因子,为智能体沿时间表或智能体的位置到达任务位置的估计时间,为任务的初始任务回报。in, It is an intelligent agent Execute the task The score, Representing an Agent timetable, Representing an Agent According to the schedule Execute the task The return, It's for the mission The discount factor for the return, For intelligent agents Along the schedule Or the agent's location arrival task Estimated time of location, For the task Initial mission reward.

在其中一个实施例中,所述智能体进行任务调度的优化问题为:In one embodiment, the optimization problem for the agent to perform task scheduling is:

;

其中,n表示智能体数量,m表示任务数量,为二进制变量,当时,智能体执行任务,当时,表示智能体不执行任务是智能体执行任务的得分;所述优化问题的约束条件为:Where n represents the number of agents, m represents the number of tasks, is a binary variable, when When the intelligent agent Execute the task ,when , indicating that the agent Not performing tasks , It is an intelligent agent Execute the task The constraints of the optimization problem are:

;

;

;

;

其中,表示智能体在任务调度中执行任务的时间。in, Representing an Agent In task scheduling Execute tasks in time.

在其中一个实施例中,所述局部优化包括:任务交换和局部采样;In one of the embodiments, the local optimization includes: task exchange and local sampling;

通过所述任务交换增加局部范围得分;所述局部范围得分指的是本地范围内所有智能体执行任务的得分;Increasing the local scope score by the task exchange; the local scope score refers to the score of all agents performing tasks in the local scope;

通过所述局部采样得到局部范围内其他智能体的任务得分,从而进行局部估计得到无冲突任务调度。The task scores of other agents in the local range are obtained through the local sampling, so as to perform local estimation and obtain conflict-free task scheduling.

在其中一个实施例中,所述任务交换过程包括:In one embodiment, the task exchange process includes:

设置任务交换策略为:Set the task exchange strategy to:

;

;

;

其中,是智能体选择交换的策略,局部分数是智能体与智能体的分数之和,是将智能体的任务与智能体的任务交换,是交换后的局部分数。in, It is an intelligent agent Select Exchange strategy, local score It is an intelligent agent With Agent The sum of the scores of Is to make the intelligent agent Mission With Agent Mission exchange, is the local score after the exchange.

在其中一个实施例中,所述局部采样的过程为:In one embodiment, the local sampling process is:

根据局部范围内所有智能体的当前调度、任务信息、位置信息以及评分函数,构建局部拍卖为:According to the current schedule, task information, location information and scoring function of all agents in the local range, the local auction is constructed as:

;

其中,表示智能体i对局部范围内其他智能体的局部估计函数,当前时刻t其他智能体k的局部拍卖函数,表示任务的任务信息,表示智能体k的位置信息。in, represents the local estimation function of agent i for other agents in the local range, The local auction function of other agents k at the current time t, Indicates the task Task information, Represents the location information of agent k.

在其中一个实施例中,所述确定智能体的位置信息,包括:根据所述评分函数,计算智能体与待执行任务的距离信息;根据至少三个所述距离信息,确定智能体的位置信息。In one embodiment, the determining the location information of the agent includes: calculating the distance information between the agent and the task to be performed according to the scoring function; and determining the location information of the agent according to at least three of the distance information.

在其中一个实施例中,所述任务相关选择策略包括:显示智能体和隐式智能体;所述显示智能体是当前智能体从局部内存求解当前迭代中其他智能体的获胜者向量,以及求解与当前智能体有任务冲突的其他智能体;所述隐式智能体是当前没有显示任务冲突下,可能发生冲突的智能体。In one embodiment, the task-related selection strategy includes: an explicit agent and an implicit agent; the explicit agent is the current agent that solves the winner vector of other agents in the current iteration from the local memory, and solves other agents that have task conflicts with the current agent; the implicit agent is an agent that may conflict when there is no explicit task conflict.

在一个实施例中,提供了一种计算机设备,该计算机设备可以是终端,其内部结构图可以如图4所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口、显示屏和输入装置。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种计算换通信框架的局部优化方法。该计算机设备的显示屏可以是液晶显示屏或者电子墨水显示屏,该计算机设备的输入装置可以是显示屏上覆盖的触摸层,也可以是计算机设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be shown in FIG4. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected via a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of the operating system and the computer program in the non-volatile storage medium. The network interface of the computer device is used to communicate with an external terminal via a network connection. When the computer program is executed by the processor, a local optimization method for a computing-communication framework is implemented. The display screen of the computer device may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer device may be a touch layer covered on the display screen, or a key, trackball or touchpad provided on the housing of the computer device, or an external keyboard, touchpad or mouse, etc.

本领域技术人员可以理解,图4中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art will understand that the structure shown in FIG. 4 is merely a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may include more or fewer components than those shown in the figure, or combine certain components, or have a different arrangement of components.

在一个实施例中,提供了一种计算机设备,包括存储器和处理器,该存储器存储有计算机程序,该处理器执行计算机程序时实现上述实施例中方法的步骤。In one embodiment, a computer device is provided, including a memory and a processor, wherein the memory stores a computer program, and the processor implements the steps of the method in the above embodiment when executing the computer program.

在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现上述实施例中方法的步骤。In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored. When the computer program is executed by a processor, the steps of the method in the above embodiment are implemented.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink) DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing the relevant hardware through a computer program, and the computer program can be stored in a non-volatile computer-readable storage medium. When the computer program is executed, it can include the processes of the embodiments of the above-mentioned methods. Among them, any reference to memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM) or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments may be arbitrarily combined. To make the description concise, not all possible combinations of the technical features in the above embodiments are described. However, as long as there is no contradiction in the combination of these technical features, they should be considered to be within the scope of this specification.

以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation methods of the present application, and the descriptions thereof are relatively specific and detailed, but they cannot be understood as limiting the scope of the invention patent. It should be pointed out that, for a person of ordinary skill in the art, several variations and improvements can be made without departing from the concept of the present application, and these all belong to the protection scope of the present application. Therefore, the protection scope of the patent of the present application shall be subject to the attached claims.

Claims (8)

1.一种计算换通信框架的局部优化方法,其特征在于,所述方法包括:1. A local optimization method for a computation-for-communication framework, characterized in that the method comprises: 构建各个智能体执行位置相关的任务的评分函数;Construct scoring functions for each agent to perform position-related tasks; 根据所述评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度;其中,所述局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应所述评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择所述局部优化方法中包含的局部智能体;According to the scoring function, a local optimization method is used to calculate the tasks to be executed to obtain a conflict-free schedule; wherein, in the local optimization method, each agent interacts with each other through communication, and the location information of the agent is determined by the score of the scoring function corresponding to the task executed by other agents stored in the agent, and the local agent included in the local optimization method is selected through a preset task-related selection strategy; 对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体;Calculate the tasks to be executed by each agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other agents through communication; 向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案;Send the schedule to other agents, use the consistency elimination principle to eliminate the task conflicts generated in the global convergence phase, and obtain the network multi-agent scheduling plan; 所述评分函数表示为:The scoring function is expressed as: ; 其中,是智能体执行任务的得分,表示智能体的时间表,表示智能体按照时间表执行任务的回报,是为任务回报的折扣因子,为智能体沿时间表或智能体的位置到达任务位置的估计时间,为任务的初始任务回报;in, It is an intelligent agent Execute the task The score, Representing an Agent timetable, Representing an Agent According to the schedule Execute the task The return, It's for the mission The discount factor for the return, For intelligent agents Along the schedule Or the agent's location arrival task Estimated time of location, For the task Initial mission reward; 所述任务相关选择策略包括:显示智能体和隐式智能体;The task-related selection strategies include: explicit agents and implicit agents; 所述显示智能体是当前智能体从局部内存求解当前迭代中其他智能体的获胜者向量,以及求解与当前智能体有任务冲突的其他智能体;The display agent is the current agent that solves the winner vectors of other agents in the current iteration from the local memory, and solves other agents that have task conflicts with the current agent; 所述隐式智能体是当前没有显示任务冲突下,可能发生冲突的智能体。The implicit agent is an agent that may conflict when there is currently no explicit task conflict. 2.根据权利要求1所述的计算换通信框架的局部优化方法,其特征在于,所述智能体进行任务调度的优化问题为:2. The local optimization method of the computation-for-communication framework according to claim 1 is characterized in that the optimization problem of task scheduling performed by the agent is: ; 其中,n表示智能体数量,m表示任务数量,为二进制变量,当时,智能体执行任务,当时,表示智能体不执行任务是智能体执行任务的得分;所述优化问题的约束条件为:Where n represents the number of agents, m represents the number of tasks, is a binary variable, when When the intelligent agent Execute the task ,when , indicating that the agent Not performing tasks , It is an intelligent agent Execute the task The constraints of the optimization problem are: ; ; ; ; 其中,表示智能体在任务调度中执行任务的时间。in, Representing an Agent In task scheduling Execute tasks in time. 3.根据权利要求1所述的计算换通信框架的局部优化方法,其特征在于,所述局部优化包括:任务交换和局部采样;3. The local optimization method of the computation-for-communication framework according to claim 1, characterized in that the local optimization includes: task exchange and local sampling; 通过所述任务交换增加局部范围得分;所述局部范围得分指的是本地范围内所有智能体执行任务的得分;Increasing the local scope score by the task exchange; the local scope score refers to the score of all agents performing tasks in the local scope; 通过所述局部采样得到局部范围内其他智能体的任务得分,从而进行局部估计得到无冲突任务调度。The task scores of other agents in the local range are obtained through the local sampling, so as to perform local estimation and obtain conflict-free task scheduling. 4.根据权利要求3所述的计算换通信框架的局部优化方法,其特征在于,所述任务交换过程包括:4. The local optimization method of the computation-for-communication framework according to claim 3, characterized in that the task exchange process comprises: 设置任务交换策略为:Set the task exchange strategy to: ; ; ; 其中,是智能体选择交换的策略,局部分数是智能体与智能体的分数之和,是将智能体的任务与智能体的任务交换,是交换后的局部分数。in, It is an intelligent agent Select Exchange strategy, local score It is an intelligent agent With Agent The sum of the scores of Is to make the intelligent agent Mission With Agent Mission exchange, is the local score after the exchange. 5.根据权利要求3所述的计算换通信框架的局部优化方法,其特征在于,所述局部采样的过程为:5. The local optimization method of the computation-for-communication framework according to claim 3, characterized in that the local sampling process is: 根据局部范围内所有智能体的当前调度、任务信息、位置信息以及评分函数,构建局部拍卖为:According to the current schedule, task information, location information and scoring function of all agents in the local range, the local auction is constructed as: ; 其中,表示智能体i对局部范围内其他智能体的局部估计函数,当前时刻t其他智能体k的局部拍卖函数,表示任务的任务信息,表示智能体k的位置信息。in, represents the local estimation function of agent i for other agents in the local range, The local auction function of other agents k at the current time t, Indicates the task Task information, Represents the location information of agent k. 6.根据权利要求1所述的计算换通信框架的局部优化方法,其特征在于,所述确定智能体的位置信息,包括:6. The local optimization method of the computation-for-communication framework according to claim 1, characterized in that the determining the location information of the intelligent agent comprises: 根据所述评分函数,计算智能体与待执行任务的距离信息;According to the scoring function, calculating the distance information between the agent and the task to be performed; 根据至少三个所述距离信息,确定智能体的位置信息。The location information of the intelligent agent is determined based on at least three of the distance information. 7.一种计算换通信框架的局部优化装置,其特征在于,所述装置包括:7. A local optimization device for a computation-for-communication framework, characterized in that the device comprises: 评分函数构建模块,用于构建各个智能体执行位置相关的任务的评分函数;The scoring function building module is used to build the scoring function for each agent to perform position-related tasks; 局部优化模块,用于根据所述评分函数,采用局部优化方法对待执行任务进行计算,得到无冲突调度;其中,所述局部优化方法中,各个智能体之间通过通讯方式进行交互,通过智能体存储的其他智能体通过执行任务所对应所述评分函数的得分,确定智能体的位置信息,以及通过预先设置的任务相关选择策略选择所述局部优化方法中包含的局部智能体;A local optimization module, used to calculate the tasks to be executed using a local optimization method according to the scoring function to obtain a conflict-free schedule; wherein, in the local optimization method, each agent interacts with each other through communication, determines the location information of the agent through the scores of the scoring function corresponding to the tasks executed by other agents stored in the agent, and selects the local agent included in the local optimization method through a preset task-related selection strategy; 调度模块,用于对各个智能体通过局部优化方法进行待执行任务计算,得到无冲突任务调度,在进行全局收敛确认时,若检测到冲突存在,则将所述无冲突任务调度通过通讯方式传递给其他智能体;The scheduling module is used to calculate the tasks to be executed by each intelligent agent through a local optimization method to obtain a conflict-free task schedule. When confirming the global convergence, if a conflict is detected, the conflict-free task schedule is transmitted to other intelligent agents through a communication method. 求解模块,用于向其他智能体发送时间表,采用一致性消除原则消除全局收敛阶段产生的任务冲突,得到网络多智能体调度方案;The solution module is used to send schedules to other agents, and use the consistency elimination principle to eliminate the task conflicts generated in the global convergence stage to obtain the network multi-agent scheduling solution; 所述评分函数表示为:The scoring function is expressed as: ; 其中,是智能体执行任务的得分,表示智能体的时间表,表示智能体按照时间表执行任务的回报,是为任务回报的折扣因子,为智能体沿时间表或智能体的位置到达任务位置的估计时间,为任务的初始任务回报;in, It is an intelligent agent Execute the task The score, Representing an Agent timetable, Representing an Agent According to the schedule Execute the task The return, It's for the mission The discount factor for the return, For intelligent agents Along the schedule Or the agent's location arrival task Estimated time of location, For the task Initial mission reward; 所述任务相关选择策略包括:显示智能体和隐式智能体;The task-related selection strategies include: explicit agents and implicit agents; 所述显示智能体是当前智能体从局部内存求解当前迭代中其他智能体的获胜者向量,以及求解与当前智能体有任务冲突的其他智能体;The display agent is the current agent that solves the winner vectors of other agents in the current iteration from the local memory, and solves other agents that have task conflicts with the current agent; 所述隐式智能体是当前没有显示任务冲突下,可能发生冲突的智能体。The implicit agent is an agent that may conflict when there is currently no explicit task conflict. 8.一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至6中任一项所述方法的步骤。8. A computer device comprising a memory and a processor, wherein the memory stores a computer program, wherein the processor implements the steps of the method according to any one of claims 1 to 6 when executing the computer program.
CN202310595581.7A 2023-05-25 2023-05-25 Local optimization method, device and computer equipment for computing-for-communication framework Active CN116339955B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310595581.7A CN116339955B (en) 2023-05-25 2023-05-25 Local optimization method, device and computer equipment for computing-for-communication framework

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310595581.7A CN116339955B (en) 2023-05-25 2023-05-25 Local optimization method, device and computer equipment for computing-for-communication framework

Publications (2)

Publication Number Publication Date
CN116339955A CN116339955A (en) 2023-06-27
CN116339955B true CN116339955B (en) 2023-08-11

Family

ID=86889764

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310595581.7A Active CN116339955B (en) 2023-05-25 2023-05-25 Local optimization method, device and computer equipment for computing-for-communication framework

Country Status (1)

Country Link
CN (1) CN116339955B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118544343A (en) * 2024-05-15 2024-08-27 北京航空航天大学 Task planning method, device, medium and product of multi-robot system

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6490611B1 (en) * 1999-01-28 2002-12-03 Mitsubishi Electric Research Laboratories, Inc. User level scheduling of inter-communicating real-time tasks
CN107122857A (en) * 2017-04-26 2017-09-01 南京航空航天大学 Workshop multiple target collaboration Optimization Scheduling based on multiple agent
CN107479380A (en) * 2017-08-25 2017-12-15 东北大学 Multi-Agent coordination control method based on evolutionary game theory
CN108304937A (en) * 2018-01-30 2018-07-20 中国计量大学 A kind of intelligent body electric business agreement based on population metathesis reaction algorithm
WO2019154944A1 (en) * 2018-02-08 2019-08-15 Prowler.Io Limited Distributed machine learning system
CN111586696A (en) * 2020-04-29 2020-08-25 重庆邮电大学 A Resource Allocation and Offloading Decision Method Based on Multi-Agent Architecture Reinforcement Learning
US11269683B1 (en) * 2020-07-27 2022-03-08 United States Of America As Represented By The Secretary Of The Navy Agent conflict resolution
CN114819273A (en) * 2022-03-22 2022-07-29 上海航天壹亘智能科技有限公司 Workshop scheduling method based on combination of multi-Agent global optimization and local optimization
WO2022193534A1 (en) * 2021-03-17 2022-09-22 北京交通大学 Service orchestration system and method based on intent driving in intelligent fusion identification network
CN115794341A (en) * 2022-11-16 2023-03-14 中国平安财产保险股份有限公司 Task scheduling method, device, equipment and storage medium based on artificial intelligence

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3270536B1 (en) * 2016-07-14 2019-03-06 Huawei Technologies Co., Ltd. Sdn controller and method for task scheduling, resource provisioning and service providing
WO2019134254A1 (en) * 2018-01-02 2019-07-11 上海交通大学 Real-time economic dispatch calculation method using distributed neural network
WO2022099596A1 (en) * 2020-11-13 2022-05-19 浙江大学 Adaptive learning intelligent scheduling unified computing framework and system for industrial personalized customized production

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6490611B1 (en) * 1999-01-28 2002-12-03 Mitsubishi Electric Research Laboratories, Inc. User level scheduling of inter-communicating real-time tasks
CN107122857A (en) * 2017-04-26 2017-09-01 南京航空航天大学 Workshop multiple target collaboration Optimization Scheduling based on multiple agent
CN107479380A (en) * 2017-08-25 2017-12-15 东北大学 Multi-Agent coordination control method based on evolutionary game theory
CN108304937A (en) * 2018-01-30 2018-07-20 中国计量大学 A kind of intelligent body electric business agreement based on population metathesis reaction algorithm
WO2019154944A1 (en) * 2018-02-08 2019-08-15 Prowler.Io Limited Distributed machine learning system
CN111586696A (en) * 2020-04-29 2020-08-25 重庆邮电大学 A Resource Allocation and Offloading Decision Method Based on Multi-Agent Architecture Reinforcement Learning
US11269683B1 (en) * 2020-07-27 2022-03-08 United States Of America As Represented By The Secretary Of The Navy Agent conflict resolution
WO2022193534A1 (en) * 2021-03-17 2022-09-22 北京交通大学 Service orchestration system and method based on intent driving in intelligent fusion identification network
CN114819273A (en) * 2022-03-22 2022-07-29 上海航天壹亘智能科技有限公司 Workshop scheduling method based on combination of multi-Agent global optimization and local optimization
CN115794341A (en) * 2022-11-16 2023-03-14 中国平安财产保险股份有限公司 Task scheduling method, device, equipment and storage medium based on artificial intelligence

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于安全智能管理Agent的虚拟企业结构;黄蓓等;《机械与电子》;全文 *

Also Published As

Publication number Publication date
CN116339955A (en) 2023-06-27

Similar Documents

Publication Publication Date Title
Gao et al. Com-DDPG: Task offloading based on multiagent reinforcement learning for information-communication-enhanced mobile edge computing in the Internet of Vehicles
Liu et al. Resource allocation with edge computing in IoT networks via machine learning
Qiu et al. Online deep reinforcement learning for computation offloading in blockchain-empowered mobile edge computing
CN112925308B (en) Path planning method, path planning device and computer storage medium
Xiao et al. Multi-agent reinforcement learning-based trading decision-making in platooning-assisted vehicular networks
Hu et al. Joint optimization of microservice deployment and routing in edge via multi-objective deep reinforcement learning
Zhang et al. A new fuzzy QoS-aware manufacture service composition method using extended flower pollination algorithm
Wei et al. OCVC: An overlapping-enabled cooperative vehicular fog computing protocol
CN112672382A (en) Hybrid collaborative computing unloading method and device, electronic equipment and storage medium
CN116339955B (en) Local optimization method, device and computer equipment for computing-for-communication framework
Aslan Mathematical model and a variable neighborhood search algorithm for mixed-model robotic two-sided assembly line balancing problems with sequence-dependent setup times
Li et al. Federated multiagent actor–critic learning task offloading in intelligent logistics
CN118860608A (en) Task scheduling method, device and storage medium between multiple local mobile clouds
Chai et al. Computation offloading for integrated satellite-terrestrial Internet of Vehicles in 6G edge network: A cooperative Stackelberg game
Zhang et al. Joint DNN partitioning and task offloading based on attention mechanism-aided reinforcement learning
CN115857491B (en) Multi-robot dynamic task allocation method and equipment based on contract net algorithm
Ye et al. Deep reinforcement learning for dependent task offloading in multi-access edge computing
Qi et al. Cluster-PSO based resource orchestration for multi-task applications in vehicular cloud
Sato et al. Multi-Agent Task Allocation Based on Reciprocal Trust in Distributed Environments
CN116094942B (en) Method and device for cooperative work of distributed intelligent equipment, electronic equipment and storage medium
Peng et al. An Online Computation Offloading Approach With Dual Stability Guarantee for Heterogeneous Tasks in MEC-Enabled IIoT
Zhang et al. D2D communication assisted edge computing based resource pricing and scheduling research in blockchain
Li et al. OL-CBBA: An online task allocation algorithm under weak communication conditions
CN116634026B (en) Network multi-agent scheduling calculation exchange communication system, method and computer equipment
Lu et al. Matching with contract-based resource trading in UAV-assisted MEC system: Y. Lu et al.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant